SlideShare una empresa de Scribd logo
1 de 29
Big Data: Visualization




           Dr. Maurice Bouwhuis
SARA National High Performance Computing Services

               Big Data analytics– Almere 14-06-2012
About SARA: Our Mission
              Is to Support Innovation
   SARA national HPC Center has
   about ~170 fte’s in 2 locations
   (Amsterdam and Almere)
                                                                       Offices in Amsterdam
The mission of SARA is 2-fold:
4. Supporting research in the Netherlands
   [SARA BV for Science & Innovation]

5. Offering commercial high-end ICT                                   Data Center in Amsterdam
   services
   [Vancis BV for adVANCed Ict Services]

SARA werkt nauw samen met SURF

                                                                          Data Center in Almere
                              Big Data analytics– Almere 14-06-2012
Almere Big Data Hoofdstad




        Big Data analytics– Almere 14-06-2012
Big Data: The Deluge




     Big Data analytics– Almere 14-06-2012
What is Big Data?
        “I cannot define it, but I know it when I see it”
                    My Big Bear now




My Big Bear then




                        Big Data analytics– Almere 14-06-2012
Wikipedia Defining Big Data
                beyond Commonly used ICT




“Data sets whose size is beyond the ability of commonly
used tools to capture, manage, and process the data within
a tolerable elapsed time.
                         Big Data analytics– Almere 14-06-2012
Big Data as defined
                   by IDC (2011)
“Bringing together vast
amounts of data from public
and private sources,
combined with the intuition
of business and thought
leaders and the speed and
affordability of today's
computers.”
             (IDC October 2011)




                           Big Data analytics– Almere 14-06-2012
Defining Big Data
                           The 3 V’s
Volume
  Large amounts
  Massive historical archives
  Valuable for data mining                                        Velocity
Velocity
  At very high rates
   (sensors, streams, social media, …)
  Valuable in its “fresh” state
                                                   Volume                     Variety
Variety
  Structured, semi-structured and unstructured
  Variety also in Value




                           Big Data analytics– Almere 14-06-2012
2 new V’s: Viscosity & Virality




          Big Data analytics– Almere 14-06-2012
Big Data Drivers
“Internet of Things”
Commoditization of HPC
Human dynamics can be
easily stored and queried
with Apache Hadoop
  HDFS (storage)
    Hadoop Distributed File System
  MapReduce (processing)
    high performance parallel data
    processing
  Scalable & Self-healing
So, Big data is driven by
large scale data collection,
storage and (information)
processing

                                     Big Data analytics– Almere 14-06-2012
Data Deluge @SARA
                      It has always been there…
Scientific Data Deluge:
  Observations (e.g.
   LOFAR, Lifewatch)
  Large-scale Simulation
   (e.g. astrophysics,
   climate modeling)
  Experiments (e.g. Large
   Hadron Collider, DNA
                                e-Science and Technology Infrastructure for        Large Hadron Collider
   sequencers)                  Biodiversity Data and Ecosystem Research

Multi-Petabytes of data
growth at SARA each year
Single datasets of 10-100
Terabytes and larger
Multidisciplinary use of data
Science needs Insight, not
only Data                                     Low Frequency Array               Biobanking and Biomolecular
                                                                              Resources Research Infrastructure




                                     Big Data analytics– Almere 14-06-2012
Big Data Ultimate Challenge:
              How to get insight?
As volume, variety and
velocity of data
increase, use of
visualization is
imperative to help
getting the insight for
an ever increasingly
data-driven future




                          Big Data analytics– Almere 14-06-2012
Some Applications of Big Data
                        Astronomy
Astron
1 Exabyte per
day raw data
2 times WWW
traffic per day

SKA
300 - 1500
Petabytes
storage per
year!
20 times LHC




                          Big Data analytics– Almere 14-06-2012
Some Applications of Big Data
                               Healthcare
Erasmus MC
   Diagnostics:
   4%: costs
   72%: decisions

    Opportunities
    for disease
    management:
 1) New
    classification of
    patients for
    better
    diagnostics &
    combined
    therapy
 2) Assessing and
    managing risks




                                Big Data analytics– Almere 14-06-2012
Some Applications of Big Data
                 Water Management
Total length primary flood
defenses in Netherlands:
2875km spread over 90 dike
“rings”…
Decision Support System:
Integration of: Sensor data +
AI, Simulation results, Maps,
weather, ships, roadwork,
traffic, twitter, GSM, location
of emergency services, ...




                         Big Data analytics– Almere 14-06-2012
Some Applications of Big Data
      Infrawatch, Hollandse Brug




  145 x   100   x      60  x  60    x  24   x                  365 = big data
sensors   Hz        seconds minutes   hours                    days
                                         (Arno Knobbe, LIACS, 2011, http://infrawatch.liacs.nl)

                            Big Data analytics– Almere 14-06-2012
Some Applications of Big Data
                             Ecology
Citizen Science:
>20,000 users, >50M
observations.
Bird radars: streaming
data, many terabytes
GPS-tracking:
Streaming data,
Word-wide projects.
Massive amounts of
complementary, multi-
scale information that
can not be “seen” in
the field.




                         Big Data analytics– Almere 14-06-2012
Some Applications of Big Data
        eScience is also (big) data mining



                              Cognition: image analysis
                                and data exchange                         Food Specific Ontologies for
  Climate Research:                                                        Food Focused Text Mining
  Regional Sea-Level




                            Chemical Metabolomics
                                Data Analysis

    Biography Portal:                                                    Data-Intensive Modeling
                               by SURF & NWO
interconnections, trends,                                                of the Global Water Cycle
 geographical maps and
        time lines               Big Data analytics– Almere 14-06-2012
CosmoGrid Case: The Need for
            Integrated e-Infrastructure Services
A cosmological N-body simulation with
8,589,934,592 particles, formation of large
structures of dark matter
Dutch Computing Challenge Project & DEISA
Extreme Computing Initiative: DCCP 2008 –
2009 / DECI 2009
Run 1 + 2:
4.25 M core hours Computing, 110 TB data
Huygens Amsterdam + Cray XT4 Tokyo,
coupled via light path and Amsterdam + Tokyo
+ Helsinki + Edinburgh
High resolution data remote visualization on
tiled panel display
Advanced support in porting and optimization,
visualization, data storage, networking and
project management
All infrastructure elements and their integration are crucial


          Visit SURF                        7-6-2012
                                Big Data analytics– Almere 14-06-2012
Visualization @ SARA
                  more than 20 years of experience and support




 Scientific visualization                                           High resolution      Scientific visualization
                               Scientific & industrial
        support                                                   visualization support           support
                               visualization support
      Rendering                                                  Remote visualization    Remote visualization
                                  Virtual Reality
 Animations and slides                                           & streaming service      Collaboration support




                                                 Big Data analytics– Almere 14-06-2012
How are we
              Coping with Big Data?
HPC centers, universities,
and in recent years, Internet
companies like Yahoo!,
Facebook en of course
Google are pioneers (lots of
knowledge exchange, by the
way.)
We collect Big Data, store it
and we have the knowledge
to interpret it.
What tools do we have to
pull this out?

                       Big Data analytics– Almere 14-06-2012
‘Collaboratorium’: New visualization
          and collaboration facility @SARA
                       videoconference            laptop 2              laptop 3      website in browser
Visualization of big      shared data

data and trends
Also for improving
business and
Science models
and computational
debugging.
PowerPoint, Video
Conferencing,
                        videoconference           laptop 1             data from        data from
telepresence, 3D         remote camera                                workstation 1    workstation 2

(stereo) projection
Based on proven
technology from
SARA and
partners EVL and
Calit2 (San Diego)

                              Big Data analytics– Almere 14-06-2012
Visualization @ SARA –
   Remote Visualization
Remote visualization service
q Provide dedicated visualization resources in SARA
  data center: Rendercluster and visualization software
  (i.e. Paraview, VisIt, VTK, VMD, Blender, ...
q Embedded in national e-Infrastructure
q Visualization resource has direct access to storage at
  SARA
q Avoid large data transfers over network (esp.
  Internet) by running visualization applications
  remotely
q Pixel output/remote desktop transferred to user,
  instead of files
q Application support for parallel rendering



                Big Data analytics– Almere 14-06-2012
Big Data Requires Big Computing


                                  What benefits could
                                  exascale computing bring?
                                  It will enable discovery in many
                                  areas of science. "Aerospace
                                  engineering, astrophysics,
                                  biology, climate modeling and
                                  national security all have
                                  applications with extreme
                                  computing requirements,"




          Big Data analytics– Almere 14-06-2012
Compute Ecosystem @SARA
1. Low-latency, high-bandwidth
   capability computing (Huygens)
2. Capacity compute clusters
   (LISA)
3. Loosely coupled compute Grids
   (Big Grid)
4. Sector, private and public
   Clouds Including our HPC
   Cloud) and Beehub storage
5. Special-purpose (GPU)
   clusters
6. Big Data Apache Hadoop
   systems (since 2009)



                             Big Data analytics– Almere 14-06-2012
Big Data Eco-System @SARA
DevOps   Programming algorithms                      Domain knowledge




             Big Data analytics– Almere 14-06-2012
To Summarize: Big Data
      Is Changing Rapidly our Life
Big Data is changing
science, medicine,
business, and technology.
A whole new way of
science: correlation
supersedes causation,
coherent models or unified
theories…
Biggest challenge for
science & business is not
storing or processing data
but how to make sense of
it without affecting our
privacy.
                      Big Data analytics– Almere 14-06-2012
Big Data… Big Enough?
      Thank You




        Big Data analytics– Almere 14-06-2012
Big Data analytics– Almere 14-06-2012

Más contenido relacionado

La actualidad más candente

Querying Heterogeneous Datasets on the Linked Data Web
Querying Heterogeneous Datasets on the Linked Data WebQuerying Heterogeneous Datasets on the Linked Data Web
Querying Heterogeneous Datasets on the Linked Data WebEdward Curry
 
Overview of big data in cloud computing
Overview of big data in cloud computingOverview of big data in cloud computing
Overview of big data in cloud computingViet-Trung TRAN
 
Big Data Analytics in Government
Big Data Analytics in GovernmentBig Data Analytics in Government
Big Data Analytics in GovernmentDeepak Ramanathan
 
Interactive Water Services: The Waternomics Approach
Interactive Water Services: The Waternomics ApproachInteractive Water Services: The Waternomics Approach
Interactive Water Services: The Waternomics ApproachEdward Curry
 
"Industrializing Machine Learning – How to Integrate ML in Existing Businesse...
"Industrializing Machine Learning – How to Integrate ML in Existing Businesse..."Industrializing Machine Learning – How to Integrate ML in Existing Businesse...
"Industrializing Machine Learning – How to Integrate ML in Existing Businesse...Dataconomy Media
 
Data Scientists: Your Must-Have Business Investment
Data Scientists: Your Must-Have Business InvestmentData Scientists: Your Must-Have Business Investment
Data Scientists: Your Must-Have Business InvestmentKalido
 
The Future Of Big Data
The Future Of Big DataThe Future Of Big Data
The Future Of Big DataMatthew Dennis
 
Wikipedia (DBpedia): Crowdsourced Data Curation
Wikipedia (DBpedia): Crowdsourced Data CurationWikipedia (DBpedia): Crowdsourced Data Curation
Wikipedia (DBpedia): Crowdsourced Data CurationEdward Curry
 
Leveraging open source for big data stack
Leveraging open source for big data stackLeveraging open source for big data stack
Leveraging open source for big data stackFlytxt
 
Everything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data WarehouseEverything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data Warehousemark madsen
 
Data Curation at the New York Times
Data Curation at the New York TimesData Curation at the New York Times
Data Curation at the New York TimesEdward Curry
 
Developing an Sustainable IT Capability: Lessons From Intel's Journey
Developing an Sustainable IT Capability: Lessons From Intel's JourneyDeveloping an Sustainable IT Capability: Lessons From Intel's Journey
Developing an Sustainable IT Capability: Lessons From Intel's JourneyEdward Curry
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapSrinath Perera
 
Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012
Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012
Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012Preferred Networks
 
Big Data and Enterprise Data - Oracle -1663869
Big Data and Enterprise Data - Oracle -1663869Big Data and Enterprise Data - Oracle -1663869
Big Data and Enterprise Data - Oracle -1663869Edgar Alejandro Villegas
 
How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)Denodo
 
The Evolution of Big Data Frameworks
The Evolution of Big Data FrameworksThe Evolution of Big Data Frameworks
The Evolution of Big Data FrameworkseXascale Infolab
 
Introduction to Cloud computing and Big Data-Hadoop
Introduction to Cloud computing and  Big Data-HadoopIntroduction to Cloud computing and  Big Data-Hadoop
Introduction to Cloud computing and Big Data-HadoopNagarjuna D.N
 
How Can Analytics Improve Business?
How Can Analytics Improve Business?How Can Analytics Improve Business?
How Can Analytics Improve Business?Inside Analysis
 
Deutsche Telekom on Big Data
Deutsche Telekom on Big DataDeutsche Telekom on Big Data
Deutsche Telekom on Big DataDataWorks Summit
 

La actualidad más candente (20)

Querying Heterogeneous Datasets on the Linked Data Web
Querying Heterogeneous Datasets on the Linked Data WebQuerying Heterogeneous Datasets on the Linked Data Web
Querying Heterogeneous Datasets on the Linked Data Web
 
Overview of big data in cloud computing
Overview of big data in cloud computingOverview of big data in cloud computing
Overview of big data in cloud computing
 
Big Data Analytics in Government
Big Data Analytics in GovernmentBig Data Analytics in Government
Big Data Analytics in Government
 
Interactive Water Services: The Waternomics Approach
Interactive Water Services: The Waternomics ApproachInteractive Water Services: The Waternomics Approach
Interactive Water Services: The Waternomics Approach
 
"Industrializing Machine Learning – How to Integrate ML in Existing Businesse...
"Industrializing Machine Learning – How to Integrate ML in Existing Businesse..."Industrializing Machine Learning – How to Integrate ML in Existing Businesse...
"Industrializing Machine Learning – How to Integrate ML in Existing Businesse...
 
Data Scientists: Your Must-Have Business Investment
Data Scientists: Your Must-Have Business InvestmentData Scientists: Your Must-Have Business Investment
Data Scientists: Your Must-Have Business Investment
 
The Future Of Big Data
The Future Of Big DataThe Future Of Big Data
The Future Of Big Data
 
Wikipedia (DBpedia): Crowdsourced Data Curation
Wikipedia (DBpedia): Crowdsourced Data CurationWikipedia (DBpedia): Crowdsourced Data Curation
Wikipedia (DBpedia): Crowdsourced Data Curation
 
Leveraging open source for big data stack
Leveraging open source for big data stackLeveraging open source for big data stack
Leveraging open source for big data stack
 
Everything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data WarehouseEverything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data Warehouse
 
Data Curation at the New York Times
Data Curation at the New York TimesData Curation at the New York Times
Data Curation at the New York Times
 
Developing an Sustainable IT Capability: Lessons From Intel's Journey
Developing an Sustainable IT Capability: Lessons From Intel's JourneyDeveloping an Sustainable IT Capability: Lessons From Intel's Journey
Developing an Sustainable IT Capability: Lessons From Intel's Journey
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and Roadmap
 
Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012
Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012
Jubatus: Realtime deep analytics for BIgData@Rakuten Technology Conference 2012
 
Big Data and Enterprise Data - Oracle -1663869
Big Data and Enterprise Data - Oracle -1663869Big Data and Enterprise Data - Oracle -1663869
Big Data and Enterprise Data - Oracle -1663869
 
How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)
 
The Evolution of Big Data Frameworks
The Evolution of Big Data FrameworksThe Evolution of Big Data Frameworks
The Evolution of Big Data Frameworks
 
Introduction to Cloud computing and Big Data-Hadoop
Introduction to Cloud computing and  Big Data-HadoopIntroduction to Cloud computing and  Big Data-Hadoop
Introduction to Cloud computing and Big Data-Hadoop
 
How Can Analytics Improve Business?
How Can Analytics Improve Business?How Can Analytics Improve Business?
How Can Analytics Improve Business?
 
Deutsche Telekom on Big Data
Deutsche Telekom on Big DataDeutsche Telekom on Big Data
Deutsche Telekom on Big Data
 

Destacado

Big Data @ CBS for Fontys students in Eindhoven
Big Data @ CBS for Fontys students in EindhovenBig Data @ CBS for Fontys students in Eindhoven
Big Data @ CBS for Fontys students in EindhovenPiet J.H. Daas
 
Chapter11 International Finance Management
Chapter11 International Finance ManagementChapter11 International Finance Management
Chapter11 International Finance ManagementPiyush Gaur
 
勝つためのA/Bテスト方法論
勝つためのA/Bテスト方法論勝つためのA/Bテスト方法論
勝つためのA/Bテスト方法論Soichiro Akasaka
 
Using Road Sensor Data for Official Statistics: towards a Big Data Methodology
Using Road Sensor Data for Official Statistics: towards a Big Data MethodologyUsing Road Sensor Data for Official Statistics: towards a Big Data Methodology
Using Road Sensor Data for Official Statistics: towards a Big Data MethodologyPiet J.H. Daas
 
Dr. Piet Daas (CBS) - Statistiek en grote data bestanden
Dr. Piet Daas (CBS) - Statistiek en grote data bestandenDr. Piet Daas (CBS) - Statistiek en grote data bestanden
Dr. Piet Daas (CBS) - Statistiek en grote data bestandenAlmereDataCapital
 
Creating Big Data: Methodology
Creating Big Data: MethodologyCreating Big Data: Methodology
Creating Big Data: MethodologyMoshe Kaplan
 
CPU Optimizations in the CERN Cloud - February 2016
CPU Optimizations in the CERN Cloud - February 2016CPU Optimizations in the CERN Cloud - February 2016
CPU Optimizations in the CERN Cloud - February 2016Belmiro Moreira
 
Analyzing and Predicting Task Reminders
Analyzing and Predicting Task RemindersAnalyzing and Predicting Task Reminders
Analyzing and Predicting Task RemindersDavid Graus
 
Letterpress Revival (Jun 2013)
Letterpress Revival (Jun 2013)Letterpress Revival (Jun 2013)
Letterpress Revival (Jun 2013)Victoria Gaitskell
 
Managing Foreign Exchange Risk
Managing Foreign Exchange RiskManaging Foreign Exchange Risk
Managing Foreign Exchange RiskDanial822
 
Introduction to Aptitude Software
Introduction to Aptitude SoftwareIntroduction to Aptitude Software
Introduction to Aptitude SoftwareAptitude Software
 
Spiderストレージエンジンのご紹介
Spiderストレージエンジンのご紹介Spiderストレージエンジンのご紹介
Spiderストレージエンジンのご紹介Kentoku
 

Destacado (17)

Dary
DaryDary
Dary
 
Big Data @ CBS for Fontys students in Eindhoven
Big Data @ CBS for Fontys students in EindhovenBig Data @ CBS for Fontys students in Eindhoven
Big Data @ CBS for Fontys students in Eindhoven
 
Chapter11 International Finance Management
Chapter11 International Finance ManagementChapter11 International Finance Management
Chapter11 International Finance Management
 
Big data @ CBS
Big data @ CBSBig data @ CBS
Big data @ CBS
 
Hollywood Moms
Hollywood MomsHollywood Moms
Hollywood Moms
 
Mapas dcn
Mapas dcnMapas dcn
Mapas dcn
 
勝つためのA/Bテスト方法論
勝つためのA/Bテスト方法論勝つためのA/Bテスト方法論
勝つためのA/Bテスト方法論
 
Forex trading
Forex tradingForex trading
Forex trading
 
Using Road Sensor Data for Official Statistics: towards a Big Data Methodology
Using Road Sensor Data for Official Statistics: towards a Big Data MethodologyUsing Road Sensor Data for Official Statistics: towards a Big Data Methodology
Using Road Sensor Data for Official Statistics: towards a Big Data Methodology
 
Dr. Piet Daas (CBS) - Statistiek en grote data bestanden
Dr. Piet Daas (CBS) - Statistiek en grote data bestandenDr. Piet Daas (CBS) - Statistiek en grote data bestanden
Dr. Piet Daas (CBS) - Statistiek en grote data bestanden
 
Creating Big Data: Methodology
Creating Big Data: MethodologyCreating Big Data: Methodology
Creating Big Data: Methodology
 
CPU Optimizations in the CERN Cloud - February 2016
CPU Optimizations in the CERN Cloud - February 2016CPU Optimizations in the CERN Cloud - February 2016
CPU Optimizations in the CERN Cloud - February 2016
 
Analyzing and Predicting Task Reminders
Analyzing and Predicting Task RemindersAnalyzing and Predicting Task Reminders
Analyzing and Predicting Task Reminders
 
Letterpress Revival (Jun 2013)
Letterpress Revival (Jun 2013)Letterpress Revival (Jun 2013)
Letterpress Revival (Jun 2013)
 
Managing Foreign Exchange Risk
Managing Foreign Exchange RiskManaging Foreign Exchange Risk
Managing Foreign Exchange Risk
 
Introduction to Aptitude Software
Introduction to Aptitude SoftwareIntroduction to Aptitude Software
Introduction to Aptitude Software
 
Spiderストレージエンジンのご紹介
Spiderストレージエンジンのご紹介Spiderストレージエンジンのご紹介
Spiderストレージエンジンのご紹介
 

Similar a Maurice Bouwhuis (SARA/Vancis) - Hoe big data te begrijpen door ze te visualiseren

MAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAI
MAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAIMAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAI
MAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAIBig Data Week
 
Thesis blending big data and cloud -epilepsy global data research and inform...
Thesis  blending big data and cloud -epilepsy global data research and inform...Thesis  blending big data and cloud -epilepsy global data research and inform...
Thesis blending big data and cloud -epilepsy global data research and inform...Anup Singh
 
Dealing with Semantic Heterogeneity in Real-Time Information
Dealing with Semantic Heterogeneity in Real-Time InformationDealing with Semantic Heterogeneity in Real-Time Information
Dealing with Semantic Heterogeneity in Real-Time InformationEdward Curry
 
An Overview of BigData
An Overview of BigDataAn Overview of BigData
An Overview of BigDataValarmathi V
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyRohit Dubey
 
Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011Dublinked .
 
Big Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop PlatformBig Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop PlatformIRJET Journal
 
Big Data - A Real Life Revolution
Big Data - A Real Life RevolutionBig Data - A Real Life Revolution
Big Data - A Real Life RevolutionCapgemini
 
Innovation med big data – chr. hansens erfaringer
Innovation med big data – chr. hansens erfaringerInnovation med big data – chr. hansens erfaringer
Innovation med big data – chr. hansens erfaringerMicrosoft
 
SC6 Workshop 1: What can big data do for you?
SC6 Workshop 1: What can big data do for you? SC6 Workshop 1: What can big data do for you?
SC6 Workshop 1: What can big data do for you? BigData_Europe
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudOla Spjuth
 
INF2190_W1_2016_public
INF2190_W1_2016_publicINF2190_W1_2016_public
INF2190_W1_2016_publicAttila Barta
 
Internet of Things (IoT) is a King, Big data is a Queen and Cloud is a Palace
Internet of Things (IoT) is a King, Big data is a Queen and Cloud is a PalaceInternet of Things (IoT) is a King, Big data is a Queen and Cloud is a Palace
Internet of Things (IoT) is a King, Big data is a Queen and Cloud is a PalaceDr.-Ing Abdur Rahim Biswas
 
Introduction to big data
Introduction to big dataIntroduction to big data
Introduction to big dataSitaram Kotnis
 
Big Data-Survey
Big Data-SurveyBig Data-Survey
Big Data-Surveyijeei-iaes
 
Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...
Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...
Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...Tomasz Bednarz
 

Similar a Maurice Bouwhuis (SARA/Vancis) - Hoe big data te begrijpen door ze te visualiseren (20)

MAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAI
MAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAIMAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAI
MAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAI
 
Thesis blending big data and cloud -epilepsy global data research and inform...
Thesis  blending big data and cloud -epilepsy global data research and inform...Thesis  blending big data and cloud -epilepsy global data research and inform...
Thesis blending big data and cloud -epilepsy global data research and inform...
 
On Big Data
On Big DataOn Big Data
On Big Data
 
Dealing with Semantic Heterogeneity in Real-Time Information
Dealing with Semantic Heterogeneity in Real-Time InformationDealing with Semantic Heterogeneity in Real-Time Information
Dealing with Semantic Heterogeneity in Real-Time Information
 
An Overview of BigData
An Overview of BigDataAn Overview of BigData
An Overview of BigData
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit Dubey
 
Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011
 
Big Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop PlatformBig Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop Platform
 
Big Data - A Real Life Revolution
Big Data - A Real Life RevolutionBig Data - A Real Life Revolution
Big Data - A Real Life Revolution
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Innovation med big data – chr. hansens erfaringer
Innovation med big data – chr. hansens erfaringerInnovation med big data – chr. hansens erfaringer
Innovation med big data – chr. hansens erfaringer
 
SC6 Workshop 1: What can big data do for you?
SC6 Workshop 1: What can big data do for you? SC6 Workshop 1: What can big data do for you?
SC6 Workshop 1: What can big data do for you?
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and Cloud
 
INF2190_W1_2016_public
INF2190_W1_2016_publicINF2190_W1_2016_public
INF2190_W1_2016_public
 
Big data.pptx
Big data.pptxBig data.pptx
Big data.pptx
 
Internet of Things (IoT) is a King, Big data is a Queen and Cloud is a Palace
Internet of Things (IoT) is a King, Big data is a Queen and Cloud is a PalaceInternet of Things (IoT) is a King, Big data is a Queen and Cloud is a Palace
Internet of Things (IoT) is a King, Big data is a Queen and Cloud is a Palace
 
Introduction to big data
Introduction to big dataIntroduction to big data
Introduction to big data
 
Big data mining
Big data miningBig data mining
Big data mining
 
Big Data-Survey
Big Data-SurveyBig Data-Survey
Big Data-Survey
 
Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...
Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...
Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...
 

Más de AlmereDataCapital

Karel Thönissen (Garabit) @ PIDS seminar
Karel Thönissen (Garabit) @ PIDS seminarKarel Thönissen (Garabit) @ PIDS seminar
Karel Thönissen (Garabit) @ PIDS seminarAlmereDataCapital
 
Steven van der Linden (Qforce) @ PIDS seminar
Steven van der Linden (Qforce) @ PIDS seminarSteven van der Linden (Qforce) @ PIDS seminar
Steven van der Linden (Qforce) @ PIDS seminarAlmereDataCapital
 
Maarten Stultjens (Elephant Security) @ PIDS seminar
Maarten Stultjens (Elephant Security) @ PIDS seminarMaarten Stultjens (Elephant Security) @ PIDS seminar
Maarten Stultjens (Elephant Security) @ PIDS seminarAlmereDataCapital
 
Sampo Kellomäki (Synergetics) @ PIDS seminar
Sampo Kellomäki (Synergetics) @ PIDS seminarSampo Kellomäki (Synergetics) @ PIDS seminar
Sampo Kellomäki (Synergetics) @ PIDS seminarAlmereDataCapital
 
Jaap-Henk Hoepman (Privacy & Identity Lab) @ PIDS seminar
Jaap-Henk Hoepman (Privacy & Identity Lab) @ PIDS seminarJaap-Henk Hoepman (Privacy & Identity Lab) @ PIDS seminar
Jaap-Henk Hoepman (Privacy & Identity Lab) @ PIDS seminarAlmereDataCapital
 
Peter Kits (Holland Van Gijzen) @ PIDS seminar
Peter Kits (Holland Van Gijzen) @ PIDS seminarPeter Kits (Holland Van Gijzen) @ PIDS seminar
Peter Kits (Holland Van Gijzen) @ PIDS seminarAlmereDataCapital
 
Prof. mr. Sijmons (Universiteit Utrecht) @ PIDS seminar
Prof. mr. Sijmons (Universiteit Utrecht) @ PIDS seminarProf. mr. Sijmons (Universiteit Utrecht) @ PIDS seminar
Prof. mr. Sijmons (Universiteit Utrecht) @ PIDS seminarAlmereDataCapital
 
Roland Haeve (Atos): 'Using the Cloud for Big Data Analytics'
Roland Haeve (Atos): 'Using the Cloud for Big Data Analytics'Roland Haeve (Atos): 'Using the Cloud for Big Data Analytics'
Roland Haeve (Atos): 'Using the Cloud for Big Data Analytics'AlmereDataCapital
 
Gerard Jansen (CEO Alan Turing Institute) - Alan Turing Institute: brengt dat...
Gerard Jansen (CEO Alan Turing Institute) - Alan Turing Institute: brengt dat...Gerard Jansen (CEO Alan Turing Institute) - Alan Turing Institute: brengt dat...
Gerard Jansen (CEO Alan Turing Institute) - Alan Turing Institute: brengt dat...AlmereDataCapital
 
Bert Reijmerink (Genalice) - Hoe technologie bijdraagt aan een betere behande...
Bert Reijmerink (Genalice) - Hoe technologie bijdraagt aan een betere behande...Bert Reijmerink (Genalice) - Hoe technologie bijdraagt aan een betere behande...
Bert Reijmerink (Genalice) - Hoe technologie bijdraagt aan een betere behande...AlmereDataCapital
 
Carlijn Nouwen (McKinsey) - Keynote: Big Data in de Zorg
Carlijn Nouwen (McKinsey) - Keynote: Big Data in de ZorgCarlijn Nouwen (McKinsey) - Keynote: Big Data in de Zorg
Carlijn Nouwen (McKinsey) - Keynote: Big Data in de ZorgAlmereDataCapital
 
Sjaak van der Pouw (Siemens Healthcare) - Beeldexplosie: de mogelijkheden van...
Sjaak van der Pouw (Siemens Healthcare) - Beeldexplosie: de mogelijkheden van...Sjaak van der Pouw (Siemens Healthcare) - Beeldexplosie: de mogelijkheden van...
Sjaak van der Pouw (Siemens Healthcare) - Beeldexplosie: de mogelijkheden van...AlmereDataCapital
 
Nicky Hekster (IBM) - Watson for Health
Nicky Hekster (IBM) - Watson for HealthNicky Hekster (IBM) - Watson for Health
Nicky Hekster (IBM) - Watson for HealthAlmereDataCapital
 
Freek Bomhof (TNO) - Big Data en kansen in de zorg
Freek Bomhof (TNO) - Big Data en kansen in de zorgFreek Bomhof (TNO) - Big Data en kansen in de zorg
Freek Bomhof (TNO) - Big Data en kansen in de zorgAlmereDataCapital
 
Harro Stokman (Euvision) - Big Brother Watches Big Data
Harro Stokman (Euvision) - Big Brother Watches Big DataHarro Stokman (Euvision) - Big Brother Watches Big Data
Harro Stokman (Euvision) - Big Brother Watches Big DataAlmereDataCapital
 
Arjan Hassing (Ernst & Young) - Kosten besparen op big data storage
Arjan Hassing (Ernst & Young) - Kosten besparen op big data storageArjan Hassing (Ernst & Young) - Kosten besparen op big data storage
Arjan Hassing (Ernst & Young) - Kosten besparen op big data storageAlmereDataCapital
 
Lex Pater (Flevoziekenhuis) - Slim omgaan met ziekenhuisdata
Lex Pater (Flevoziekenhuis) - Slim omgaan met ziekenhuisdataLex Pater (Flevoziekenhuis) - Slim omgaan met ziekenhuisdata
Lex Pater (Flevoziekenhuis) - Slim omgaan met ziekenhuisdataAlmereDataCapital
 
Prof. Ard den Heeten (LRCB) - Brondata: kennis uit ruwe data
Prof. Ard den Heeten (LRCB) - Brondata: kennis uit ruwe dataProf. Ard den Heeten (LRCB) - Brondata: kennis uit ruwe data
Prof. Ard den Heeten (LRCB) - Brondata: kennis uit ruwe dataAlmereDataCapital
 
Peter Walgemoed (Carelliance) - Businessmodels for Big Data
Peter Walgemoed (Carelliance) - Businessmodels for Big DataPeter Walgemoed (Carelliance) - Businessmodels for Big Data
Peter Walgemoed (Carelliance) - Businessmodels for Big DataAlmereDataCapital
 

Más de AlmereDataCapital (19)

Karel Thönissen (Garabit) @ PIDS seminar
Karel Thönissen (Garabit) @ PIDS seminarKarel Thönissen (Garabit) @ PIDS seminar
Karel Thönissen (Garabit) @ PIDS seminar
 
Steven van der Linden (Qforce) @ PIDS seminar
Steven van der Linden (Qforce) @ PIDS seminarSteven van der Linden (Qforce) @ PIDS seminar
Steven van der Linden (Qforce) @ PIDS seminar
 
Maarten Stultjens (Elephant Security) @ PIDS seminar
Maarten Stultjens (Elephant Security) @ PIDS seminarMaarten Stultjens (Elephant Security) @ PIDS seminar
Maarten Stultjens (Elephant Security) @ PIDS seminar
 
Sampo Kellomäki (Synergetics) @ PIDS seminar
Sampo Kellomäki (Synergetics) @ PIDS seminarSampo Kellomäki (Synergetics) @ PIDS seminar
Sampo Kellomäki (Synergetics) @ PIDS seminar
 
Jaap-Henk Hoepman (Privacy & Identity Lab) @ PIDS seminar
Jaap-Henk Hoepman (Privacy & Identity Lab) @ PIDS seminarJaap-Henk Hoepman (Privacy & Identity Lab) @ PIDS seminar
Jaap-Henk Hoepman (Privacy & Identity Lab) @ PIDS seminar
 
Peter Kits (Holland Van Gijzen) @ PIDS seminar
Peter Kits (Holland Van Gijzen) @ PIDS seminarPeter Kits (Holland Van Gijzen) @ PIDS seminar
Peter Kits (Holland Van Gijzen) @ PIDS seminar
 
Prof. mr. Sijmons (Universiteit Utrecht) @ PIDS seminar
Prof. mr. Sijmons (Universiteit Utrecht) @ PIDS seminarProf. mr. Sijmons (Universiteit Utrecht) @ PIDS seminar
Prof. mr. Sijmons (Universiteit Utrecht) @ PIDS seminar
 
Roland Haeve (Atos): 'Using the Cloud for Big Data Analytics'
Roland Haeve (Atos): 'Using the Cloud for Big Data Analytics'Roland Haeve (Atos): 'Using the Cloud for Big Data Analytics'
Roland Haeve (Atos): 'Using the Cloud for Big Data Analytics'
 
Gerard Jansen (CEO Alan Turing Institute) - Alan Turing Institute: brengt dat...
Gerard Jansen (CEO Alan Turing Institute) - Alan Turing Institute: brengt dat...Gerard Jansen (CEO Alan Turing Institute) - Alan Turing Institute: brengt dat...
Gerard Jansen (CEO Alan Turing Institute) - Alan Turing Institute: brengt dat...
 
Bert Reijmerink (Genalice) - Hoe technologie bijdraagt aan een betere behande...
Bert Reijmerink (Genalice) - Hoe technologie bijdraagt aan een betere behande...Bert Reijmerink (Genalice) - Hoe technologie bijdraagt aan een betere behande...
Bert Reijmerink (Genalice) - Hoe technologie bijdraagt aan een betere behande...
 
Carlijn Nouwen (McKinsey) - Keynote: Big Data in de Zorg
Carlijn Nouwen (McKinsey) - Keynote: Big Data in de ZorgCarlijn Nouwen (McKinsey) - Keynote: Big Data in de Zorg
Carlijn Nouwen (McKinsey) - Keynote: Big Data in de Zorg
 
Sjaak van der Pouw (Siemens Healthcare) - Beeldexplosie: de mogelijkheden van...
Sjaak van der Pouw (Siemens Healthcare) - Beeldexplosie: de mogelijkheden van...Sjaak van der Pouw (Siemens Healthcare) - Beeldexplosie: de mogelijkheden van...
Sjaak van der Pouw (Siemens Healthcare) - Beeldexplosie: de mogelijkheden van...
 
Nicky Hekster (IBM) - Watson for Health
Nicky Hekster (IBM) - Watson for HealthNicky Hekster (IBM) - Watson for Health
Nicky Hekster (IBM) - Watson for Health
 
Freek Bomhof (TNO) - Big Data en kansen in de zorg
Freek Bomhof (TNO) - Big Data en kansen in de zorgFreek Bomhof (TNO) - Big Data en kansen in de zorg
Freek Bomhof (TNO) - Big Data en kansen in de zorg
 
Harro Stokman (Euvision) - Big Brother Watches Big Data
Harro Stokman (Euvision) - Big Brother Watches Big DataHarro Stokman (Euvision) - Big Brother Watches Big Data
Harro Stokman (Euvision) - Big Brother Watches Big Data
 
Arjan Hassing (Ernst & Young) - Kosten besparen op big data storage
Arjan Hassing (Ernst & Young) - Kosten besparen op big data storageArjan Hassing (Ernst & Young) - Kosten besparen op big data storage
Arjan Hassing (Ernst & Young) - Kosten besparen op big data storage
 
Lex Pater (Flevoziekenhuis) - Slim omgaan met ziekenhuisdata
Lex Pater (Flevoziekenhuis) - Slim omgaan met ziekenhuisdataLex Pater (Flevoziekenhuis) - Slim omgaan met ziekenhuisdata
Lex Pater (Flevoziekenhuis) - Slim omgaan met ziekenhuisdata
 
Prof. Ard den Heeten (LRCB) - Brondata: kennis uit ruwe data
Prof. Ard den Heeten (LRCB) - Brondata: kennis uit ruwe dataProf. Ard den Heeten (LRCB) - Brondata: kennis uit ruwe data
Prof. Ard den Heeten (LRCB) - Brondata: kennis uit ruwe data
 
Peter Walgemoed (Carelliance) - Businessmodels for Big Data
Peter Walgemoed (Carelliance) - Businessmodels for Big DataPeter Walgemoed (Carelliance) - Businessmodels for Big Data
Peter Walgemoed (Carelliance) - Businessmodels for Big Data
 

Último

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 

Último (20)

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 

Maurice Bouwhuis (SARA/Vancis) - Hoe big data te begrijpen door ze te visualiseren

  • 1. Big Data: Visualization Dr. Maurice Bouwhuis SARA National High Performance Computing Services Big Data analytics– Almere 14-06-2012
  • 2. About SARA: Our Mission Is to Support Innovation SARA national HPC Center has about ~170 fte’s in 2 locations (Amsterdam and Almere) Offices in Amsterdam The mission of SARA is 2-fold: 4. Supporting research in the Netherlands [SARA BV for Science & Innovation] 5. Offering commercial high-end ICT Data Center in Amsterdam services [Vancis BV for adVANCed Ict Services] SARA werkt nauw samen met SURF Data Center in Almere Big Data analytics– Almere 14-06-2012
  • 3. Almere Big Data Hoofdstad Big Data analytics– Almere 14-06-2012
  • 4. Big Data: The Deluge Big Data analytics– Almere 14-06-2012
  • 5. What is Big Data? “I cannot define it, but I know it when I see it” My Big Bear now My Big Bear then Big Data analytics– Almere 14-06-2012
  • 6. Wikipedia Defining Big Data beyond Commonly used ICT “Data sets whose size is beyond the ability of commonly used tools to capture, manage, and process the data within a tolerable elapsed time. Big Data analytics– Almere 14-06-2012
  • 7. Big Data as defined by IDC (2011) “Bringing together vast amounts of data from public and private sources, combined with the intuition of business and thought leaders and the speed and affordability of today's computers.” (IDC October 2011) Big Data analytics– Almere 14-06-2012
  • 8. Defining Big Data The 3 V’s Volume  Large amounts  Massive historical archives  Valuable for data mining Velocity Velocity  At very high rates (sensors, streams, social media, …)  Valuable in its “fresh” state Volume Variety Variety  Structured, semi-structured and unstructured  Variety also in Value Big Data analytics– Almere 14-06-2012
  • 9. 2 new V’s: Viscosity & Virality Big Data analytics– Almere 14-06-2012
  • 10. Big Data Drivers “Internet of Things” Commoditization of HPC Human dynamics can be easily stored and queried with Apache Hadoop  HDFS (storage) Hadoop Distributed File System  MapReduce (processing) high performance parallel data processing  Scalable & Self-healing So, Big data is driven by large scale data collection, storage and (information) processing Big Data analytics– Almere 14-06-2012
  • 11. Data Deluge @SARA It has always been there… Scientific Data Deluge:  Observations (e.g. LOFAR, Lifewatch)  Large-scale Simulation (e.g. astrophysics, climate modeling)  Experiments (e.g. Large Hadron Collider, DNA e-Science and Technology Infrastructure for Large Hadron Collider sequencers) Biodiversity Data and Ecosystem Research Multi-Petabytes of data growth at SARA each year Single datasets of 10-100 Terabytes and larger Multidisciplinary use of data Science needs Insight, not only Data Low Frequency Array Biobanking and Biomolecular Resources Research Infrastructure Big Data analytics– Almere 14-06-2012
  • 12. Big Data Ultimate Challenge: How to get insight? As volume, variety and velocity of data increase, use of visualization is imperative to help getting the insight for an ever increasingly data-driven future Big Data analytics– Almere 14-06-2012
  • 13. Some Applications of Big Data Astronomy Astron 1 Exabyte per day raw data 2 times WWW traffic per day SKA 300 - 1500 Petabytes storage per year! 20 times LHC Big Data analytics– Almere 14-06-2012
  • 14. Some Applications of Big Data Healthcare Erasmus MC Diagnostics: 4%: costs 72%: decisions Opportunities for disease management: 1) New classification of patients for better diagnostics & combined therapy 2) Assessing and managing risks Big Data analytics– Almere 14-06-2012
  • 15. Some Applications of Big Data Water Management Total length primary flood defenses in Netherlands: 2875km spread over 90 dike “rings”… Decision Support System: Integration of: Sensor data + AI, Simulation results, Maps, weather, ships, roadwork, traffic, twitter, GSM, location of emergency services, ... Big Data analytics– Almere 14-06-2012
  • 16. Some Applications of Big Data Infrawatch, Hollandse Brug 145 x 100 x 60 x 60 x 24 x 365 = big data sensors Hz seconds minutes hours days (Arno Knobbe, LIACS, 2011, http://infrawatch.liacs.nl) Big Data analytics– Almere 14-06-2012
  • 17. Some Applications of Big Data Ecology Citizen Science: >20,000 users, >50M observations. Bird radars: streaming data, many terabytes GPS-tracking: Streaming data, Word-wide projects. Massive amounts of complementary, multi- scale information that can not be “seen” in the field. Big Data analytics– Almere 14-06-2012
  • 18. Some Applications of Big Data eScience is also (big) data mining Cognition: image analysis and data exchange Food Specific Ontologies for Climate Research: Food Focused Text Mining Regional Sea-Level Chemical Metabolomics Data Analysis Biography Portal: Data-Intensive Modeling by SURF & NWO interconnections, trends, of the Global Water Cycle geographical maps and time lines Big Data analytics– Almere 14-06-2012
  • 19. CosmoGrid Case: The Need for Integrated e-Infrastructure Services A cosmological N-body simulation with 8,589,934,592 particles, formation of large structures of dark matter Dutch Computing Challenge Project & DEISA Extreme Computing Initiative: DCCP 2008 – 2009 / DECI 2009 Run 1 + 2: 4.25 M core hours Computing, 110 TB data Huygens Amsterdam + Cray XT4 Tokyo, coupled via light path and Amsterdam + Tokyo + Helsinki + Edinburgh High resolution data remote visualization on tiled panel display Advanced support in porting and optimization, visualization, data storage, networking and project management All infrastructure elements and their integration are crucial Visit SURF 7-6-2012 Big Data analytics– Almere 14-06-2012
  • 20. Visualization @ SARA more than 20 years of experience and support  Scientific visualization  High resolution  Scientific visualization  Scientific & industrial support visualization support support visualization support  Rendering  Remote visualization  Remote visualization  Virtual Reality  Animations and slides & streaming service  Collaboration support Big Data analytics– Almere 14-06-2012
  • 21. How are we Coping with Big Data? HPC centers, universities, and in recent years, Internet companies like Yahoo!, Facebook en of course Google are pioneers (lots of knowledge exchange, by the way.) We collect Big Data, store it and we have the knowledge to interpret it. What tools do we have to pull this out? Big Data analytics– Almere 14-06-2012
  • 22. ‘Collaboratorium’: New visualization and collaboration facility @SARA videoconference laptop 2 laptop 3 website in browser Visualization of big shared data data and trends Also for improving business and Science models and computational debugging. PowerPoint, Video Conferencing, videoconference laptop 1 data from data from telepresence, 3D remote camera workstation 1 workstation 2 (stereo) projection Based on proven technology from SARA and partners EVL and Calit2 (San Diego) Big Data analytics– Almere 14-06-2012
  • 23. Visualization @ SARA – Remote Visualization Remote visualization service q Provide dedicated visualization resources in SARA data center: Rendercluster and visualization software (i.e. Paraview, VisIt, VTK, VMD, Blender, ... q Embedded in national e-Infrastructure q Visualization resource has direct access to storage at SARA q Avoid large data transfers over network (esp. Internet) by running visualization applications remotely q Pixel output/remote desktop transferred to user, instead of files q Application support for parallel rendering Big Data analytics– Almere 14-06-2012
  • 24. Big Data Requires Big Computing What benefits could exascale computing bring? It will enable discovery in many areas of science. "Aerospace engineering, astrophysics, biology, climate modeling and national security all have applications with extreme computing requirements," Big Data analytics– Almere 14-06-2012
  • 25. Compute Ecosystem @SARA 1. Low-latency, high-bandwidth capability computing (Huygens) 2. Capacity compute clusters (LISA) 3. Loosely coupled compute Grids (Big Grid) 4. Sector, private and public Clouds Including our HPC Cloud) and Beehub storage 5. Special-purpose (GPU) clusters 6. Big Data Apache Hadoop systems (since 2009) Big Data analytics– Almere 14-06-2012
  • 26. Big Data Eco-System @SARA DevOps Programming algorithms Domain knowledge Big Data analytics– Almere 14-06-2012
  • 27. To Summarize: Big Data Is Changing Rapidly our Life Big Data is changing science, medicine, business, and technology. A whole new way of science: correlation supersedes causation, coherent models or unified theories… Biggest challenge for science & business is not storing or processing data but how to make sense of it without affecting our privacy. Big Data analytics– Almere 14-06-2012
  • 28. Big Data… Big Enough? Thank You Big Data analytics– Almere 14-06-2012
  • 29. Big Data analytics– Almere 14-06-2012

Notas del editor

  1. Dames en heren een heel Goede morgen Het is een heel grote plezier voor mij om de eerste keynote op deze Big data dag hier in Almere te mogen geven.
  2. Iets over SARA
  3. Almere begrijpt als geen andere het belang van schaal Almere wil tweemaal zo groot worden en is al jaren bezig op vele fronten om dit te bewerkstelligen. ICT heeft in Almere altijd een belangrijke rol gespeeld. SARA is al begin van deze eeuw zich in Almere gaan vestigen en de eerste pilote voor Breedband was ook in Almere. Met het programma Almere DataCapital, wil Almere zich profileren als Dutch Media en Health Hub. De positie als datahoofdstad moet de gemeente op termijn 2500 nieuwe banen opleveren
  4. De Economist heeft als eerste de wereld bewust gemaakt van de potentie en belang van Big Data met hun befaamde artikel over the data deluge. In december 2011 was Big Data het hoofdthema van de World Economic Forum in Davos. Centraal stond belang van data intelligence voor besluitvorming op politieke, sociale en economische vlakken. IDC in maart dit jaar een forecast gedaan over de groei van de Big Data markt: maar liefst 40% groei per jaar, 7 keer zoveel als de gemiddelde groei van de ICT markt
  5. Maar eerst wat verstaan we onder Big data? Eerst de befaamde uitspraak van de Amerikaanse senator, wel in een heel andere context: I Cannot define it, but I know it when I see it”… We weten ook dat het begrip grote hoeveelheden data is relatief. Wat vandaag veel data is, kan morgen normaal of zelfs als klein aangemerkt worden.
  6. Volgens Wikipedia is wordt het Big Data, wanneer de data hoeveelheid groter is dan wat gebruikelijk is om data te collecteren, managen en processen binnen een acceptabele tijdsbestek
  7. Volgens IDC gaat Big Data over het samenbrengen van grote hoeveelheden data van publieke en private bronnen gecombineerd met de intuïtie en ideeën van business leiders en de mogelijkheden van betaalbare ICT.
  8. Big Data wordt doorgaan gekaracteriseerd met de 3 V’s Volume Grote hoeveelheden data. Snelheid van dataproductie veroorzaakt ook gigantische logfiles en archieven. Allen zeer waardevol voor data minimig. Velocity – zowel batch, als near-time, en real-time, streams. Big data wordt gekenmerkt met zeer hoge snelheden, sensoren, media streams Het is heel belangrijk om snel te handelen maar ook de data heel veel waarde als die zogenaamd nog vers is. Variety – Big data is gestructureerd, half gestructureerd maar ook ongestructureerd. Verschillende bronnen oud en nieuw leveren een grote diversiteit aan data format en vandaar dat oude DBMS technieken niet meer alleen toereikend zijn.
  9. Twee nieuwe V’s zijn recentelijk hieraan toegevoegd: Viscosity:  viscositeit is een maat van de weerstand om door de data heen komen en data in informatie en dus inzicht om te vormen. Betere technieken zijn vereist zoals beter streaming, betere integratie en processing technieken. Virality:  Virality beschrijft hoe snel de data verspreidt zich tussen mensen (P2P).
  10. Samenvattend de belangrijkste drijfveren zijn dus: Internet of things HPC is een commodity geworden: data acquisitie, opslag en verwerking En de derde doorbraak is de opkomst van Hadoop, een open-source project van Apache Software Foundation. Hadoop biedt een betrouwbare en ook vooral betaalbare data storage oplossing met het Hadoop Distributed File System (HDFS) en een high-performance parallel data processing techniek zogenaamd MapReduce.
  11. Dit Big Data Deluge is niets nieuws bij SARA Projecten als Lofar en Lifewatch leverden al grote hoeveelheden data Ook grootschalige simulaties in astrofysica maar ook klimaat modelleringen En uiteraard alleen al de experimenten bij De CERN in Genève leveren ruim 2 petabytes per jaar. SARA is gewend om vele petabytes per jaar te collecteren, managen en te processen. Enkele datasets hiervan zijn tientallen tot honderden terabytes groot. Maar de uitdaging is om inzicht uit de data te halen.
  12. Maar de hamvraag bij Big Data is niet meer hoe de data opgeslagen en verwerkt moet worden maar vooral het verkrijgen van inzicht uit de berg informatie die eruit komt!
  13. Een paar Big Data Applicatie: Astronomie: astronomische data hoeveelheden door grootschalige infrastructurele projecten zoals Lofar en Square Kilometer ARRAY. Alleen al bij Astron verwacht men om 1 exabyte (dat is 1000 peta) per dag te moeten verwerken. Dat is 2 maal de hoeveelheid dataverkeer op de WWW. Binnen het SKA project zal het om tussen 300 en 1500 exabytes opslag per jaar. Dat is minimaal ruim 20 maal de hoeveelheid data gegenereerd door de LHC van CERN.
  14. Andere voorbeeld komt uit de gezondheidszorg. Diagnosekosten zijn 4% van de totale kosten terwijl besluiten op hebben 72% impact. Big data waar genexpressie, gecombineerd met proteomics, screening chemoinformatics en ook tekstuele data mining van literatuur en patenten levert betere classificaties van patiënten, beter diagnose en beter therapie, ook betere beoordeling en management van risico’s
  15. Andere applicatie is water management. De totale lengte van primaire water defensielijnen in NL is een klein 3000 km verspreid over 90 dijkringen. Het project heeft met behulp van big data een decision support systeem opgezet met een combinatie van sensoren, AI, simulatie data, kaarten, weer gegevens, scheep en autoverkeergegevens, twitter, GSM, locatie van hulpdiensten, etc. Doelstelling is bij calamiteiten de hulpdiensten te assisteren om problemen vroegtijdig te detecteren en zoveel mogelijk mensen te redden.
  16. Zo is het project Infrawatch, hiet zien we de studie op de Hollandse brug. Sinds 2008 is de brug geëquipeerd met een grote verzameling aan sensoren die voortdurend de verkeerlast op de brug en de reactie van de infra hierop meten. De data die hierdoor verzameld wordt is rond de 11 Gb per dag. day.
  17. Ook is ecologie, heeft Big Data haar intrede gedaan. Hier zijn de burgers ook bij betrokken, met name met observaties die verspreid over het land moeten geschieden. Er is dan sprake van meer dan 20,000 gebruikers en meer dan 50 miljoen observaties. Voorbeelden zijn inzet van radars voor volgen van vogels, inzet van GPS systemen, en een veelvoud aan extra informatie door sensoren en camera’s die door observaties alleen niet gegeven kunnen worden.
  18. Er zijn nog meer applicaties van Big Data waar SARA bij betrokken is zoals Klimaat onderzoek voor bijvoorbeeld zeeniveau Beeldverwerking op het gebied van Cognitie Tekstverwerking en ontologie in voedsel Tekst mining van biografieën. Chemische metabolomics en Modellering van watercycli op wereldschaal. Allen worden gekenmerkt door grote hoeveelheden data uit simulaties, modelleringen, tekst- en beeldverwerking en ook sensoren en opnemers.
  19. Een voorbeeld waarin de geintegreerde infrastructuur belicht wordt Probleem stelling, simulatie over tijd van de beweging van sterrenstelsel, en de simulaties vergelijken met de geobserveerde structuren die we nu zien. N-body simulatie waarbij de bodies galaxies zijn. Gebruikte infrastructuren: Compute: meerdere supers in de wereld tegelijkertijd Opslag: Netwerken: om de computers aan elkaar te verbinden en data terug te leveren Visualisatie: kennis opdoen door data zichtbaar te maken Support: als verbindende factor om alles efficient te laten verlopen en expertise in te schieten Deze partij is ook betrokken in een EyR-3 aanvraag (Seintra, VU).
  20. Visualisatie is hierin onontbeerlijk. SARA heeft een hele historie met visualisatie sinds ruim 20 jaar. Eerst met een van de eerste visualisatie centera in NL. Daarna met de eerste CAVE in Europa. Daarna met de Tiled Panel Displays voor met name remote visualisatie van grote hoeveelheden data. en recentelijk met de opening van het Collaboratorium
  21. Hoe gaan we om met Big Data? Voor HPC Centra, universiteiten en de laatste jaren internet bedrijven zoals Yahoo!, Google en Facebook zijn de pioniers in de aanpk van Big data. Veel kennisuitwisseling gebeurt tussen die partijen. Bij SARA hebben we de kennis en ervaring in het verzamelen en opslaan van zowel gestructureerde als ongestructureerde big data. Ook verwerking van de data en hoe deze te interpreteren. Welke tools hebben we hiervoor nodig?.
  22. Het Collaboratorium is uitermate geschikt voor de visualisatie van grote hoeveelheden data en datatrends. Kan ook gebruikt worden voor het verbeteren van business en wetenschappelijke modellen en ook het debuggen van big data sofware. Het is een hoge resolutie TPD met de mogelijkheden om PPT, video conferencing, 3D stereo projectie en telepresence door verschillende gebruikers ook remote en tegelijk in te zetten. Het TPD is ook op de nationale infra van SARA aangesloten. Het Collaboratorium maakt gebruik van technologie die internationaal ontwikkeld wordt binnen een samenwerking waar SARA aan deelneemt.
  23. Big Data vereist ook grootschalige opslag en verwerking. Vandaar ook de race naar exascale computing. Snelle data verwerking levert doorbraken in verschillende gebieden van wetenschap, en business en is steeds belangrijker voor sociale en maatschappelijke vraagstukken zoals de macro-economie en nationale veiligheid.
  24. Over de jaren heen, heeft SARA een eco-systeem opgebouwd voor o.a. opslag en verwerking van Big Data met data en taak parallelisme. Het eco-systeem dekt de meeste gangbare reken en data problemen in wetenschap en business. Van zogenaamde low-latency, high bandwidth capability computing voor het ineens oplossen van 1 groot ondeelbaar probleem met de nationale supercomputer. Tot het ineens oplossen van een heel groot, wel opdeelbaar probleem op het nationale capacity computing clutser LISA, of losser gekoppeld op een cloud of de grid. SARA beschikt over een aparte rekencluster met Hadoop dataopslag en verwerkingcapaciteit voor Big Data.
  25. Samenwerking tussen devops (dat zijn developers die ook operators zijn), programmeurs en domeinspecialisten is cruciaal in de aanpak van Big data. Devops zorgen voor de parallelle data opslag en met Hadoop, de programmeurs voor de data verwerking met MapReduce en uiteindelijk zorgen de domein experts voor de domeinapplicatie en data interpretatie. Het is wel belangrijk voor de big data applicatie ontwikkeling om de experts van alle drie domeinen in huis te hebben.
  26. Samenvattend: Big data is heel snel onze leven aan het veranderen, van wetenschap tot, medicijnen, to business en technologie. Big data leidt tot een heel nieuwe methode van wetenschap waar correlaties belangrijker zijn dan klassieke causale verbanden zeker in levenswetenschappen. De grootste uitdaging van Big Data is het verkrijgen van inzichten met respect voor onze privacy en keuzevrijheid.
  27. En veel dank voor uw aandacht