SlideShare a Scribd company logo
1 of 19
Download to read offline
BigData, Why Care?




Saturday 20 October 12
Speaker
              Daan Gerits
              - BigData Architect
              - DataCrunchers.eu
                     § Semantic Analysis, Data Harvesting, ...
                     § Hadoop, Azure, BigInsights, ...
                     § Storm
              BigData.be co-organizer




                                                                  Datacrunchers Consultancy Services   2


Saturday 20 October 12
BigData
              A lot of technical fuzz
              - Hadoop, Storm, Pig, ...
              Seems to be only for the big players
              - Google, Facebook, Linkedin, Twitter, ...
              So why should ‘we’ care?
              - we = Startups, Smaller and Medium Enterprises (SSME)




                                                           Datacrunchers Consultancy Services   3


Saturday 20 October 12
What BigData Promises
              Ability to store and process large amounts of data
              - Scalable in hardware and software
              - Scalable in budget
              Which means your budget can grow with your data
              - start small with a small cluster
              - the more data you want to manage, the more systems
                    you add
              Lower cost systems
              - Several low to medium end systems
              - instead of 1 big expensive one

                                                    Datacrunchers Consultancy Services   4


Saturday 20 October 12
But what can you do with it?
              Analyze your data with higher precision
              Analyze historical facts
              Prevent Data Loss
              - Infrastructure failure
              - Human errors
              Eliminate data silo’s




                                                 Datacrunchers Consultancy Services   5


Saturday 20 October 12
High Precision Analysis
              Traditional Technologies
              - Problems:
                     § Unable to store all data
              - Solutions:
                     § Sharding
                     § Aggregate data
              - Problems:
                     § Sharding has a high maintanance cost
                     § Sharding is complex for users and apps
                     § Manual sharding adds a high risk
                     § Data Aggregation causes loss in data precision


                                                                  Datacrunchers Consultancy Services   6


Saturday 20 October 12
High Precision Analysis
              BigData allows us to
              - Store and process large amounts of data
                     § So no need to aggregate
              - ‘Forget’ about sharding
                     § BigData technologies do this for you
                     § Makes it predictable
                     § And transparant
              But
              - You have to configure it correctly
              - You don’t have ad-hoc querying (yet)


                                                               Datacrunchers Consultancy Services   7


Saturday 20 October 12
Analyze Historical Facts
              Data Warehouse
              - Built on top of parameters
              What if we forget to add a parameter?
              - Add the parameter
              - Start gathering information for that parameter
              Problem:
              - We will only have information from the moment we add
                    the parameter!




                                                       Datacrunchers Consultancy Services   8


Saturday 20 October 12
Analyze Historical Facts
              Let’s store everything
              Determine the parameters later
              - by humans
              - by machine learning algorithms
              Analysis will process all data
              What if we forget to add a parameter?
              - add the parameter
              - regenerate your reports



                                                 Datacrunchers Consultancy Services   9


Saturday 20 October 12
Analyze Historical Data
              Conclusion
              - Traditionally: Ask first, store later
              - BigData: store first, ask later




                                                        Datacrunchers Consultancy Services   10


Saturday 20 October 12
Prevent Data Loss
              Traditional technologies
              - Machine Failure
                     § I hope you have a backup from yesterday?
              - Human Error
                     § Whoops I deleted those records
                     § I hope you have a backup from yesterday?
              - So in the worst case, you lose one day of data




                                                              Datacrunchers Consultancy Services   11


Saturday 20 October 12
Prevent Data Loss
              BigData allows us to
              - Survive machine failure without data-loss
              - Survive human error without data-loss
              But
              - You need a data-model which supports this
                     § Incremental model
              - You need to restrict operations
                     § Only append data, No updates or deletes




                                                                  Datacrunchers Consultancy Services   12


Saturday 20 October 12
Prevent Data Loss
              Conclusion
              - Traditional technologies
                     § requires very advanced setups to handle machine failure
                     § allow you to go back to yesterday’s state
              - BigData
                     § requires knowledge of how the failover algorithms work
                     § expects failure most of the time
                     § allows you to go back to the previous state




                                                                 Datacrunchers Consultancy Services   13


Saturday 20 October 12
Eliminate Data Silo’s
              Departments having their own data sources
              - start to modify that data
              - start to treat it as their master data
              - not coupled to the master dataset
              Causes a lot of overhead
              - Silo’s miss master data updates
              - Business decisions based on silo data, not the more
                    accurate master data
              No obvious way out



                                                         Datacrunchers Consultancy Services   14


Saturday 20 October 12
Eliminate Data Silo’s
              Consolidate the silo’s
              - Identify the silo’s
              - Import the data from the silo’s into one store
              - Reconstruct master data based on silo rules and priorities


                           Sales     Sa
                                                     Master
                         Marketing   M
                                                     Data

                          Support    Su


                                                       Datacrunchers Consultancy Services   15


Saturday 20 October 12
Eliminate Data Silo’s
              Generate read-only data-models per application
              Data changes are sent to the master data
              - using a specific api
              - using database triggers


                                          M1    ERP/CRM DB

                         Master
                                          M2     Public API
                         Data

                                          M3   DataWarehouse



                                                 Datacrunchers Consultancy Services   16


Saturday 20 October 12
Eliminate Data Silo’s
              Conclusion
              - You will have to consolidate
              - But you need a structural solution
              - Which can be provided by BigData
              - In a flexible and future-proof way




                                                     Datacrunchers Consultancy Services   17


Saturday 20 October 12
Conclusion
              There is a lot to think about
              But BigData can do a lot of things
              - A lot more than I explained today
              For a reasonable price
              And you are not alone
              - bigdata.be
              - datacrunchers.eu




                                                    Datacrunchers Consultancy Services   18


Saturday 20 October 12
Questions?




Saturday 20 October 12

More Related Content

What's hot

BLU Acceleration on the Cloud – 101
BLU Acceleration on the Cloud – 101BLU Acceleration on the Cloud – 101
BLU Acceleration on the Cloud – 101IBM Analytics
 
Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)Denodo
 
White Paper - How Data Works
White Paper - How Data WorksWhite Paper - How Data Works
White Paper - How Data WorksDavid Walker
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
 
Introduction to Modern Data Virtualization (US)
Introduction to Modern Data Virtualization (US)Introduction to Modern Data Virtualization (US)
Introduction to Modern Data Virtualization (US)Denodo
 
Slow Data versus Quick Data
Slow Data versus Quick DataSlow Data versus Quick Data
Slow Data versus Quick DataMartin Geddes
 
Data Works Berlin 2018 - Worldpay - PCI Compliance
Data Works Berlin 2018 - Worldpay - PCI ComplianceData Works Berlin 2018 - Worldpay - PCI Compliance
Data Works Berlin 2018 - Worldpay - PCI ComplianceDavid Walker
 
Big Data and Fast Data – Big and Fast Combined, is it Possible?
Big Data and Fast Data – Big and Fast Combined, is it Possible?Big Data and Fast Data – Big and Fast Combined, is it Possible?
Big Data and Fast Data – Big and Fast Combined, is it Possible?Guido Schmutz
 
The Open Group Conference Panel Explores How the Big Data Era Now Challenges ...
The Open Group Conference Panel Explores How the Big Data Era Now Challenges ...The Open Group Conference Panel Explores How the Big Data Era Now Challenges ...
The Open Group Conference Panel Explores How the Big Data Era Now Challenges ...Dana Gardner
 
Big Data and Fast Data - big and fast combined, is it possible?
Big Data and Fast Data - big and fast combined, is it possible?Big Data and Fast Data - big and fast combined, is it possible?
Big Data and Fast Data - big and fast combined, is it possible?Guido Schmutz
 
Big Data Week 2016 - Worldpay - Deploying Secure Clusters
Big Data Week 2016  - Worldpay - Deploying Secure ClustersBig Data Week 2016  - Worldpay - Deploying Secure Clusters
Big Data Week 2016 - Worldpay - Deploying Secure ClustersDavid Walker
 
Next Gen Analytics Going Beyond Data Warehouse
Next Gen Analytics Going Beyond Data WarehouseNext Gen Analytics Going Beyond Data Warehouse
Next Gen Analytics Going Beyond Data WarehouseDenodo
 
Data Architecture: OMG It’s Made of People
Data Architecture: OMG It’s Made of PeopleData Architecture: OMG It’s Made of People
Data Architecture: OMG It’s Made of Peoplemark madsen
 
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)Denodo
 
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)mark madsen
 
Govern and Protect Your End User Information
Govern and Protect Your End User InformationGovern and Protect Your End User Information
Govern and Protect Your End User InformationDenodo
 
Enterprise Data Lake - Scalable Digital
Enterprise Data Lake - Scalable DigitalEnterprise Data Lake - Scalable Digital
Enterprise Data Lake - Scalable Digitalsambiswal
 

What's hot (20)

BLU Acceleration on the Cloud – 101
BLU Acceleration on the Cloud – 101BLU Acceleration on the Cloud – 101
BLU Acceleration on the Cloud – 101
 
Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)
 
White Paper - How Data Works
White Paper - How Data WorksWhite Paper - How Data Works
White Paper - How Data Works
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Introduction to Modern Data Virtualization (US)
Introduction to Modern Data Virtualization (US)Introduction to Modern Data Virtualization (US)
Introduction to Modern Data Virtualization (US)
 
Slow Data versus Quick Data
Slow Data versus Quick DataSlow Data versus Quick Data
Slow Data versus Quick Data
 
The ABCs of Big Data
The ABCs of Big DataThe ABCs of Big Data
The ABCs of Big Data
 
Data Works Berlin 2018 - Worldpay - PCI Compliance
Data Works Berlin 2018 - Worldpay - PCI ComplianceData Works Berlin 2018 - Worldpay - PCI Compliance
Data Works Berlin 2018 - Worldpay - PCI Compliance
 
Big Data and Fast Data – Big and Fast Combined, is it Possible?
Big Data and Fast Data – Big and Fast Combined, is it Possible?Big Data and Fast Data – Big and Fast Combined, is it Possible?
Big Data and Fast Data – Big and Fast Combined, is it Possible?
 
The Open Group Conference Panel Explores How the Big Data Era Now Challenges ...
The Open Group Conference Panel Explores How the Big Data Era Now Challenges ...The Open Group Conference Panel Explores How the Big Data Era Now Challenges ...
The Open Group Conference Panel Explores How the Big Data Era Now Challenges ...
 
Big Data and Fast Data - big and fast combined, is it possible?
Big Data and Fast Data - big and fast combined, is it possible?Big Data and Fast Data - big and fast combined, is it possible?
Big Data and Fast Data - big and fast combined, is it possible?
 
Big Data Week 2016 - Worldpay - Deploying Secure Clusters
Big Data Week 2016  - Worldpay - Deploying Secure ClustersBig Data Week 2016  - Worldpay - Deploying Secure Clusters
Big Data Week 2016 - Worldpay - Deploying Secure Clusters
 
Next Gen Analytics Going Beyond Data Warehouse
Next Gen Analytics Going Beyond Data WarehouseNext Gen Analytics Going Beyond Data Warehouse
Next Gen Analytics Going Beyond Data Warehouse
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Data Architecture: OMG It’s Made of People
Data Architecture: OMG It’s Made of PeopleData Architecture: OMG It’s Made of People
Data Architecture: OMG It’s Made of People
 
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
 
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
 
Terracotta Ditch the Disk webcast
Terracotta Ditch the Disk webcastTerracotta Ditch the Disk webcast
Terracotta Ditch the Disk webcast
 
Govern and Protect Your End User Information
Govern and Protect Your End User InformationGovern and Protect Your End User Information
Govern and Protect Your End User Information
 
Enterprise Data Lake - Scalable Digital
Enterprise Data Lake - Scalable DigitalEnterprise Data Lake - Scalable Digital
Enterprise Data Lake - Scalable Digital
 

Similar to Big data, why care

Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An IntroductionDenodo
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An IntroductionDenodo
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Denodo
 
Best Practices in the Cloud for Data Management (US)
Best Practices in the Cloud for Data Management (US)Best Practices in the Cloud for Data Management (US)
Best Practices in the Cloud for Data Management (US)Denodo
 
Big data4businessusers
Big data4businessusersBig data4businessusers
Big data4businessusersBob Hardaway
 
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...Denodo
 
Big Data LDN 2017: Unleash Data Science Upon Your Organisation
Big Data LDN 2017: Unleash Data Science Upon Your OrganisationBig Data LDN 2017: Unleash Data Science Upon Your Organisation
Big Data LDN 2017: Unleash Data Science Upon Your OrganisationMatt Stubbs
 
Keynote Address at 2013 CloudCon: Future of Big Data by Richard McDougall (In...
Keynote Address at 2013 CloudCon: Future of Big Data by Richard McDougall (In...Keynote Address at 2013 CloudCon: Future of Big Data by Richard McDougall (In...
Keynote Address at 2013 CloudCon: Future of Big Data by Richard McDougall (In...exponential-inc
 
Big data and the data quality imperative
Big data and the data quality imperativeBig data and the data quality imperative
Big data and the data quality imperativeTrillium Software
 
10 Steps to Data Center Infrastructure Management Success
10 Steps to Data Center Infrastructure Management Success10 Steps to Data Center Infrastructure Management Success
10 Steps to Data Center Infrastructure Management SuccessRaritan
 
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
KASHTECH AND DENODO: ROI and Economic Value of Data VirtualizationKASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
KASHTECH AND DENODO: ROI and Economic Value of Data VirtualizationDenodo
 
Introduction to Modern Data Virtualization 2021 (APAC)
Introduction to Modern Data Virtualization 2021 (APAC)Introduction to Modern Data Virtualization 2021 (APAC)
Introduction to Modern Data Virtualization 2021 (APAC)Denodo
 
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Denodo
 
The Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
The Practice of Big Data - The Hadoop ecosystem explained with usage scenariosThe Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
The Practice of Big Data - The Hadoop ecosystem explained with usage scenarioskcmallu
 
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...Seeling Cheung
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaCloudera, Inc.
 

Similar to Big data, why care (20)

Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
 
Datacenter 2014: Raritan - Richard May
Datacenter 2014: Raritan -  Richard MayDatacenter 2014: Raritan -  Richard May
Datacenter 2014: Raritan - Richard May
 
Best Practices in the Cloud for Data Management (US)
Best Practices in the Cloud for Data Management (US)Best Practices in the Cloud for Data Management (US)
Best Practices in the Cloud for Data Management (US)
 
Big data4businessusers
Big data4businessusersBig data4businessusers
Big data4businessusers
 
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
 
Big Data LDN 2017: Unleash Data Science Upon Your Organisation
Big Data LDN 2017: Unleash Data Science Upon Your OrganisationBig Data LDN 2017: Unleash Data Science Upon Your Organisation
Big Data LDN 2017: Unleash Data Science Upon Your Organisation
 
Keynote Address at 2013 CloudCon: Future of Big Data by Richard McDougall (In...
Keynote Address at 2013 CloudCon: Future of Big Data by Richard McDougall (In...Keynote Address at 2013 CloudCon: Future of Big Data by Richard McDougall (In...
Keynote Address at 2013 CloudCon: Future of Big Data by Richard McDougall (In...
 
Big data and the data quality imperative
Big data and the data quality imperativeBig data and the data quality imperative
Big data and the data quality imperative
 
10 Steps to Data Center Infrastructure Management Success
10 Steps to Data Center Infrastructure Management Success10 Steps to Data Center Infrastructure Management Success
10 Steps to Data Center Infrastructure Management Success
 
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
KASHTECH AND DENODO: ROI and Economic Value of Data VirtualizationKASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
 
Introduction to Modern Data Virtualization 2021 (APAC)
Introduction to Modern Data Virtualization 2021 (APAC)Introduction to Modern Data Virtualization 2021 (APAC)
Introduction to Modern Data Virtualization 2021 (APAC)
 
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
The Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
The Practice of Big Data - The Hadoop ecosystem explained with usage scenariosThe Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
The Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
 
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
 
Paving The Way To Data Driven
Paving The Way To Data DrivenPaving The Way To Data Driven
Paving The Way To Data Driven
 
Making Sense of Data
Making Sense of DataMaking Sense of Data
Making Sense of Data
 

More from Daan Gerits

Big Data BluePrint
Big Data BluePrintBig Data BluePrint
Big Data BluePrintDaan Gerits
 
BigBoards.io Strata Ignite
BigBoards.io Strata IgniteBigBoards.io Strata Ignite
BigBoards.io Strata IgniteDaan Gerits
 
Big data architectures
Big data architecturesBig data architectures
Big data architecturesDaan Gerits
 
Start small bigger biggest
Start small bigger biggestStart small bigger biggest
Start small bigger biggestDaan Gerits
 

More from Daan Gerits (6)

Apache kafka
Apache kafkaApache kafka
Apache kafka
 
Big Data BluePrint
Big Data BluePrintBig Data BluePrint
Big Data BluePrint
 
BigBoards.io Strata Ignite
BigBoards.io Strata IgniteBigBoards.io Strata Ignite
BigBoards.io Strata Ignite
 
IoT and BigData
IoT and BigDataIoT and BigData
IoT and BigData
 
Big data architectures
Big data architecturesBig data architectures
Big data architectures
 
Start small bigger biggest
Start small bigger biggestStart small bigger biggest
Start small bigger biggest
 

Recently uploaded

8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCRashishs7044
 
Annual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesAnnual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesKeppelCorporation
 
Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Riya Pathan
 
8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCR8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCRashishs7044
 
FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607dollysharma2066
 
Buy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail AccountsBuy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail AccountsBuy Verified Accounts
 
Cyber Security Training in Office Environment
Cyber Security Training in Office EnvironmentCyber Security Training in Office Environment
Cyber Security Training in Office Environmentelijahj01012
 
Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Kirill Klimov
 
Digital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdfDigital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdfJos Voskuil
 
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdfNewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdfKhaled Al Awadi
 
Youth Involvement in an Innovative Coconut Value Chain by Mwalimu Menza
Youth Involvement in an Innovative Coconut Value Chain by Mwalimu MenzaYouth Involvement in an Innovative Coconut Value Chain by Mwalimu Menza
Youth Involvement in an Innovative Coconut Value Chain by Mwalimu Menzaictsugar
 
Marketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent ChirchirMarketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent Chirchirictsugar
 
8447779800, Low rate Call girls in Tughlakabad Delhi NCR
8447779800, Low rate Call girls in Tughlakabad Delhi NCR8447779800, Low rate Call girls in Tughlakabad Delhi NCR
8447779800, Low rate Call girls in Tughlakabad Delhi NCRashishs7044
 
Innovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdfInnovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdfrichard876048
 
Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Seta Wicaksana
 
MAHA Global and IPR: Do Actions Speak Louder Than Words?
MAHA Global and IPR: Do Actions Speak Louder Than Words?MAHA Global and IPR: Do Actions Speak Louder Than Words?
MAHA Global and IPR: Do Actions Speak Louder Than Words?Olivia Kresic
 
Financial-Statement-Analysis-of-Coca-cola-Company.pptx
Financial-Statement-Analysis-of-Coca-cola-Company.pptxFinancial-Statement-Analysis-of-Coca-cola-Company.pptx
Financial-Statement-Analysis-of-Coca-cola-Company.pptxsaniyaimamuddin
 

Recently uploaded (20)

8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
 
Annual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesAnnual General Meeting Presentation Slides
Annual General Meeting Presentation Slides
 
Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737
 
8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCR8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCR
 
FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607
 
Buy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail AccountsBuy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail Accounts
 
Cyber Security Training in Office Environment
Cyber Security Training in Office EnvironmentCyber Security Training in Office Environment
Cyber Security Training in Office Environment
 
Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024
 
Call Us ➥9319373153▻Call Girls In North Goa
Call Us ➥9319373153▻Call Girls In North GoaCall Us ➥9319373153▻Call Girls In North Goa
Call Us ➥9319373153▻Call Girls In North Goa
 
Digital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdfDigital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdf
 
Enjoy ➥8448380779▻ Call Girls In Sector 18 Noida Escorts Delhi NCR
Enjoy ➥8448380779▻ Call Girls In Sector 18 Noida Escorts Delhi NCREnjoy ➥8448380779▻ Call Girls In Sector 18 Noida Escorts Delhi NCR
Enjoy ➥8448380779▻ Call Girls In Sector 18 Noida Escorts Delhi NCR
 
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdfNewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdf
 
Youth Involvement in an Innovative Coconut Value Chain by Mwalimu Menza
Youth Involvement in an Innovative Coconut Value Chain by Mwalimu MenzaYouth Involvement in an Innovative Coconut Value Chain by Mwalimu Menza
Youth Involvement in an Innovative Coconut Value Chain by Mwalimu Menza
 
Marketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent ChirchirMarketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent Chirchir
 
8447779800, Low rate Call girls in Tughlakabad Delhi NCR
8447779800, Low rate Call girls in Tughlakabad Delhi NCR8447779800, Low rate Call girls in Tughlakabad Delhi NCR
8447779800, Low rate Call girls in Tughlakabad Delhi NCR
 
Innovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdfInnovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdf
 
Japan IT Week 2024 Brochure by 47Billion (English)
Japan IT Week 2024 Brochure by 47Billion (English)Japan IT Week 2024 Brochure by 47Billion (English)
Japan IT Week 2024 Brochure by 47Billion (English)
 
Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...
 
MAHA Global and IPR: Do Actions Speak Louder Than Words?
MAHA Global and IPR: Do Actions Speak Louder Than Words?MAHA Global and IPR: Do Actions Speak Louder Than Words?
MAHA Global and IPR: Do Actions Speak Louder Than Words?
 
Financial-Statement-Analysis-of-Coca-cola-Company.pptx
Financial-Statement-Analysis-of-Coca-cola-Company.pptxFinancial-Statement-Analysis-of-Coca-cola-Company.pptx
Financial-Statement-Analysis-of-Coca-cola-Company.pptx
 

Big data, why care

  • 2. Speaker Daan Gerits - BigData Architect - DataCrunchers.eu § Semantic Analysis, Data Harvesting, ... § Hadoop, Azure, BigInsights, ... § Storm BigData.be co-organizer Datacrunchers Consultancy Services 2 Saturday 20 October 12
  • 3. BigData A lot of technical fuzz - Hadoop, Storm, Pig, ... Seems to be only for the big players - Google, Facebook, Linkedin, Twitter, ... So why should ‘we’ care? - we = Startups, Smaller and Medium Enterprises (SSME) Datacrunchers Consultancy Services 3 Saturday 20 October 12
  • 4. What BigData Promises Ability to store and process large amounts of data - Scalable in hardware and software - Scalable in budget Which means your budget can grow with your data - start small with a small cluster - the more data you want to manage, the more systems you add Lower cost systems - Several low to medium end systems - instead of 1 big expensive one Datacrunchers Consultancy Services 4 Saturday 20 October 12
  • 5. But what can you do with it? Analyze your data with higher precision Analyze historical facts Prevent Data Loss - Infrastructure failure - Human errors Eliminate data silo’s Datacrunchers Consultancy Services 5 Saturday 20 October 12
  • 6. High Precision Analysis Traditional Technologies - Problems: § Unable to store all data - Solutions: § Sharding § Aggregate data - Problems: § Sharding has a high maintanance cost § Sharding is complex for users and apps § Manual sharding adds a high risk § Data Aggregation causes loss in data precision Datacrunchers Consultancy Services 6 Saturday 20 October 12
  • 7. High Precision Analysis BigData allows us to - Store and process large amounts of data § So no need to aggregate - ‘Forget’ about sharding § BigData technologies do this for you § Makes it predictable § And transparant But - You have to configure it correctly - You don’t have ad-hoc querying (yet) Datacrunchers Consultancy Services 7 Saturday 20 October 12
  • 8. Analyze Historical Facts Data Warehouse - Built on top of parameters What if we forget to add a parameter? - Add the parameter - Start gathering information for that parameter Problem: - We will only have information from the moment we add the parameter! Datacrunchers Consultancy Services 8 Saturday 20 October 12
  • 9. Analyze Historical Facts Let’s store everything Determine the parameters later - by humans - by machine learning algorithms Analysis will process all data What if we forget to add a parameter? - add the parameter - regenerate your reports Datacrunchers Consultancy Services 9 Saturday 20 October 12
  • 10. Analyze Historical Data Conclusion - Traditionally: Ask first, store later - BigData: store first, ask later Datacrunchers Consultancy Services 10 Saturday 20 October 12
  • 11. Prevent Data Loss Traditional technologies - Machine Failure § I hope you have a backup from yesterday? - Human Error § Whoops I deleted those records § I hope you have a backup from yesterday? - So in the worst case, you lose one day of data Datacrunchers Consultancy Services 11 Saturday 20 October 12
  • 12. Prevent Data Loss BigData allows us to - Survive machine failure without data-loss - Survive human error without data-loss But - You need a data-model which supports this § Incremental model - You need to restrict operations § Only append data, No updates or deletes Datacrunchers Consultancy Services 12 Saturday 20 October 12
  • 13. Prevent Data Loss Conclusion - Traditional technologies § requires very advanced setups to handle machine failure § allow you to go back to yesterday’s state - BigData § requires knowledge of how the failover algorithms work § expects failure most of the time § allows you to go back to the previous state Datacrunchers Consultancy Services 13 Saturday 20 October 12
  • 14. Eliminate Data Silo’s Departments having their own data sources - start to modify that data - start to treat it as their master data - not coupled to the master dataset Causes a lot of overhead - Silo’s miss master data updates - Business decisions based on silo data, not the more accurate master data No obvious way out Datacrunchers Consultancy Services 14 Saturday 20 October 12
  • 15. Eliminate Data Silo’s Consolidate the silo’s - Identify the silo’s - Import the data from the silo’s into one store - Reconstruct master data based on silo rules and priorities Sales Sa Master Marketing M Data Support Su Datacrunchers Consultancy Services 15 Saturday 20 October 12
  • 16. Eliminate Data Silo’s Generate read-only data-models per application Data changes are sent to the master data - using a specific api - using database triggers M1 ERP/CRM DB Master M2 Public API Data M3 DataWarehouse Datacrunchers Consultancy Services 16 Saturday 20 October 12
  • 17. Eliminate Data Silo’s Conclusion - You will have to consolidate - But you need a structural solution - Which can be provided by BigData - In a flexible and future-proof way Datacrunchers Consultancy Services 17 Saturday 20 October 12
  • 18. Conclusion There is a lot to think about But BigData can do a lot of things - A lot more than I explained today For a reasonable price And you are not alone - bigdata.be - datacrunchers.eu Datacrunchers Consultancy Services 18 Saturday 20 October 12