BigData, Why Care?




Saturday 20 October 12
Speaker
              Daan Gerits
              - BigData Architect
              - DataCrunchers.eu
                     § Semantic Analysis, Data Harvesting, ...
                     § Hadoop, Azure, BigInsights, ...
                     § Storm
              BigData.be co-organizer




                                                                  Datacrunchers Consultancy Services   2


BigData
              A lot of technical fuss
              - Hadoop, Storm, Pig, ...
              Seems to be only for the big players
              - Google, Facebook, LinkedIn, Twitter, ...
              So why should ‘we’ care?
              - we = Startups, Smaller and Medium Enterprises (SSME)




What BigData Promises
              Ability to store and process large amounts of data
              - Scalable in hardware and software
              - Scalable in budget
              Which means your budget can grow with your data
              - start small with a small cluster
              - the more data you want to manage, the more systems
                    you add
              Lower cost systems
              - Several low to medium end systems
              - instead of 1 big expensive one

But what can you do with it?
              Analyze your data with higher precision
              Analyze historical facts
              Prevent Data Loss
              - Infrastructure failure
              - Human errors
              Eliminate data silos




High Precision Analysis
              Traditional Technologies
              - Problems:
                     § Unable to store all data
              - Solutions:
                     § Sharding
                     § Aggregate data
              - Problems with these solutions:
                     § Sharding has a high maintenance cost
                     § Sharding is complex for users and apps
                     § Manual sharding adds a high risk
                     § Data Aggregation causes loss in data precision
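The loss of precision caused by aggregation can be shown with a minimal sketch. The response-time values and the helper names `aggregate_daily` and `count_above` are made up for illustration; they are not taken from the talk.

```python
from statistics import mean

# Hypothetical raw measurements: response times in milliseconds for one day.
raw_response_ms = [120, 115, 130, 125, 2400, 118, 122]


def aggregate_daily(values):
    """Reduce a day of raw values to one average, as an aggregate table would."""
    return mean(values)


def count_above(values, threshold_ms):
    """Answer a question that needs the raw values: how many requests were slow?"""
    return sum(1 for v in values if v > threshold_ms)


daily_average = aggregate_daily(raw_response_ms)
slow_requests = count_above(raw_response_ms, 1000)

print(f"daily average: {daily_average:.1f} ms")
print(f"requests slower than 1000 ms: {slow_requests}")
```

Once only `daily_average` is kept, the single slow request can no longer be identified or counted; that question can only be answered from the raw values.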


High Precision Analysis
              BigData allows us to
              - Store and process large amounts of data
                     § So no need to aggregate
              - ‘Forget’ about sharding
                     § BigData technologies do this for you
                     § Makes it predictable
                     § And transparent
              But
              - You have to configure it correctly
              - You don’t have ad-hoc querying (yet)
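To illustrate what it can mean for the technology to do the sharding for you, here is a minimal sketch that assumes a simple hash-based scheme. `PartitionedStore` and `partition_for` are hypothetical names, not the API of Hadoop or any other product, and real systems use their own placement strategies.

```python
import hashlib


def partition_for(key: str, num_partitions: int) -> int:
    """Map a record key to a partition number with a stable hash."""
    digest = hashlib.md5(key.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_partitions


class PartitionedStore:
    """Toy key-value store that picks the partition itself on every call."""

    def __init__(self, num_partitions: int):
        # Each dict stands in for one machine in the cluster.
        self.partitions = [dict() for _ in range(num_partitions)]

    def put(self, key: str, value) -> None:
        index = partition_for(key, len(self.partitions))
        self.partitions[index][key] = value

    def get(self, key: str):
        index = partition_for(key, len(self.partitions))
        return self.partitions[index].get(key)


store = PartitionedStore(num_partitions=4)
store.put("customer-42", {"name": "Anna"})
store.put("customer-43", {"name": "Bob"})
store.put("order-7", {"total": 99})

print(store.get("customer-42"))
```

The caller only uses `put` and `get`. Which partition holds a key follows from the hash, so the placement is predictable and the application never has to route requests by hand.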


Analyze Historical Facts
              Data Warehouse
              - Built on top of parameters
              What if we forget to add a parameter?
              - Add the parameter
              - Start gathering information for that parameter
              Problem:
              - We will only have information from the moment we add
                    the parameter!




Analyze Historical Facts
              Let’s store everything
              Determine the parameters later
              - by humans
              - by machine learning algorithms
              Analysis will process all data
              What if we forget to add a parameter?
              - add the parameter
              - regenerate your reports
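A minimal sketch of this idea, assuming the raw events are kept as plain records. The event fields and the `report` helper are hypothetical and only serve as illustration.

```python
from collections import Counter

# Hypothetical raw events, stored in full rather than per predefined parameter.
raw_events = [
    {"day": "2012-10-01", "user": "alice", "page": "/pricing", "browser": "Firefox"},
    {"day": "2012-10-01", "user": "bob", "page": "/home", "browser": "Chrome"},
    {"day": "2012-10-02", "user": "alice", "page": "/pricing", "browser": "Firefox"},
    {"day": "2012-10-03", "user": "carol", "page": "/home", "browser": "Safari"},
]


def report(events, parameter):
    """Count events per value of the given parameter over the full history."""
    return Counter(event[parameter] for event in events)


# The report that was planned from the start.
by_page = report(raw_events, "page")

# A parameter nobody thought of at first: regenerate the report over all data.
by_browser = report(raw_events, "browser")

print(by_page)
print(by_browser)
```

Because every event was stored in full, the report on the later parameter covers the complete history and not only the period after the parameter was added.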



Analyze Historical Facts
              Conclusion
              - Traditionally: ask first, store later
              - BigData: store first, ask later




Prevent Data Loss
              Traditional technologies
              - Machine Failure
                     § I hope you have a backup from yesterday?
              - Human Error
                     § Whoops, I deleted those records
                     § I hope you have a backup from yesterday?
              - So in the worst case, you lose one day of data




Prevent Data Loss
              BigData allows us to
              - Survive machine failure without data-loss
              - Survive human error without data-loss
              But
              - You need a data-model which supports this
                     § Incremental model
              - You need to restrict operations
                     § Only append data; no updates or deletes
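The incremental, append-only model can be sketched as follows. `AppendOnlyLog` and its keys are hypothetical; the sketch only illustrates why a human error does not destroy data when nothing is ever updated or deleted in place.

```python
class AppendOnlyLog:
    """Toy incremental data model: facts are only ever appended."""

    def __init__(self):
        self._facts = []  # list of (key, value) tuples in arrival order

    def append(self, key, value):
        """Record a new fact and return the version number it created."""
        self._facts.append((key, value))
        return len(self._facts)

    def state(self, version=None):
        """Rebuild the state from the first `version` facts (all facts by default)."""
        facts = self._facts if version is None else self._facts[:version]
        current = {}
        for key, value in facts:
            current[key] = value  # later facts win over earlier ones
        return current


log = AppendOnlyLog()
v1 = log.append("customer:1:email", "anna@example.com")
v2 = log.append("customer:2:email", "bob@example.com")
# A human error is recorded as one more fact; the older fact is still there.
v3 = log.append("customer:1:email", "")

current_state = log.state()
previous_state = log.state(version=v2)

print(current_state)
print(previous_state)
```

Going back to the previous state is a matter of rebuilding from the facts up to an earlier version, rather than restoring yesterday's backup.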




Prevent Data Loss
              Conclusion
              - Traditional technologies
                     § require very advanced setups to handle machine failure
                     § allow you to go back to yesterday’s state
              - BigData
                     § requires knowledge of how the failover algorithms work
                     § expects failure as a normal event
                     § allows you to go back to the previous state




Eliminate Data Silos
              Departments having their own data sources
              - start to modify that data
              - start to treat it as their master data
              - not coupled to the master dataset
              Causes a lot of overhead
              - Silos miss master data updates
              - Business decisions based on silo data, not the more
                    accurate master data
              No obvious way out



Eliminate Data Silos
              Consolidate the silos
              - Identify the silos
              - Import the data from the silos into one store
              - Reconstruct master data based on silo rules and priorities


              [Diagram: the Sales, Marketing and Support silos feed into the Master Data]
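The consolidation steps on this slide can be sketched as below. The priority order, the silo contents and the `reconstruct_master` helper are assumptions made for illustration; the talk does not specify the actual rules.

```python
# Hypothetical rule: when silos disagree, Sales wins over Support, Support over Marketing.
SILO_PRIORITY = ["sales", "support", "marketing"]

# Hypothetical silo data, already imported into one store.
silos = {
    "sales": {"c1": {"email": "anna@example.com", "phone": None}},
    "marketing": {"c1": {"email": "anna@old.example.com", "phone": "555-0100"}},
    "support": {"c1": {"phone": "555-0199"}, "c2": {"email": "bob@example.com"}},
}


def reconstruct_master(silo_data, priority):
    """Merge silo records field by field; higher-priority silos overwrite lower ones."""
    master = {}
    # Walk from lowest to highest priority so the most trusted value is written last.
    for silo_name in reversed(priority):
        for record_id, fields in silo_data.get(silo_name, {}).items():
            record = master.setdefault(record_id, {})
            for field, value in fields.items():
                if value is not None:  # a missing value never overwrites a known one
                    record[field] = value
    return master


master_data = reconstruct_master(silos, SILO_PRIORITY)
print(master_data)
```

In this sketch the email of `c1` comes from Sales and the phone number from Support, while `c2` is only known to Support and is still carried into the master data.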


Eliminate Data Silos
              Generate read-only data-models per application
              Data changes are sent to the master data
              - using a specific API
              - using database triggers


              [Diagram: Master Data generates the read-only models M1, M2 and M3 for the ERP/CRM DB, the Public API and the DataWarehouse]
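One possible shape for this setup is sketched below. The `MasterData` class, its `apply_change` method (standing in for the specific API) and the field names are hypothetical; the read-only behaviour relies on Python's standard `types.MappingProxyType`.

```python
from types import MappingProxyType


class MasterData:
    """Toy master dataset: one entry point for changes, read-only models for readers."""

    def __init__(self):
        self._records = {}

    def apply_change(self, record_id, fields):
        """The single way to change data (stands in for the specific API)."""
        self._records.setdefault(record_id, {}).update(fields)

    def read_only_model(self, wanted_fields):
        """Generate a read-only projection containing only the fields one application needs."""
        projection = {
            record_id: MappingProxyType(
                {f: record[f] for f in wanted_fields if f in record}
            )
            for record_id, record in self._records.items()
        }
        return MappingProxyType(projection)


master = MasterData()
master.apply_change("c1", {"name": "Anna", "email": "anna@example.com", "credit_limit": 5000})

public_api_model = master.read_only_model(["name"])
crm_model = master.read_only_model(["name", "email", "credit_limit"])

print(dict(public_api_model["c1"]))
print(dict(crm_model["c1"]))
```

An application that tries to assign into its model gets a `TypeError`, so a change can only reach the data through `apply_change`, after which the models are generated again.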



Eliminate Data Silos
              Conclusion
              - You will have to consolidate
              - But you need a structural solution
              - Which can be provided by BigData
              - In a flexible and future-proof way




Conclusion
              There is a lot to think about
              But BigData can do a lot of things
              - A lot more than I explained today
              For a reasonable price
              And you are not alone
              - bigdata.be
              - datacrunchers.eu




Questions?



