SlideShare una empresa de Scribd logo
1 de 31
1
The Yotta is not Enough!
The Need for Rethinking Information Science.
Dr Bruno Jacobfeuerborn
Telekom Deutschland GmbH
“Information Science in an Age of Change”, 2nd Conference
Institute for Information Science and Book Studies, University of Warsaw
Warsaw, April 15-16th, 2013
Yotta (Y)
=
1024 or 1 000 000 000 000 000 000 000 000
2© B. Jacobfeuerborn
Metric Prefixes (ISO)
3© B. Jacobfeuerborn
Dr. Bruno Jacobfeuerborn
- Moved to Deutsche Telekom in 1989.
- Head of Radio and Transmission Department in Hanover , 1991.
- Regional Director in Leipzig, 1991.
- Regional Director Technology and later Regional Director Business, responsible for Sales,
Marketing and Technology, Hanover, 1995.
- T-Mobile; the acquisition of the GSM license in Poland, 1996.
- Technical Director T-Mobile Netherlands and Member of the Management Board, 2002.
- Head of Service Management Europe in the T-Mobile International, 2004.
- Technical Director PTC and Member of the Management Board, 2007.
- Director and Management Board Member responsible for technology (fixed and mobile)
in Germany at Telekom Deutschland GmbH, 2009.
- Invited speaker to international conferences and coach of workshops.
- MOST Foundation General Assembly member.
4
5
Contents
─ Prologue
─ Thesis
─ Big Data
─ 4 Paradigms of Science
─ Data Science
─ Epilogue
New Scientific
Paradigm
Big Data
© B. Jacobfeuerborn
6
Prologue.
© B. Jacobfeuerborn
7
Data is the raw material of the XXI century.
Credo
© B. Jacobfeuerborn
8
Thesis.
© B. Jacobfeuerborn
9
A new scientific paradigm emerges.
Information science has to face and cope with it!
Thesis
Source: Cartoonbank.com
© B. Jacobfeuerborn
10
Big Data.
© B. Jacobfeuerborn
11© B. Jacobfeuerborn
12
“Big data refers to datasets whose size is beyond the
ability of typical database software tools to capture,
store, manage, and analyze”.
--- McKinsey, 2011
Big Data
© B. Jacobfeuerborn
How Big is Big?
Today: between Exabytes (1018) and Zettabytes (1021)
Tomorrow: over Zettabytes
13© B. Jacobfeuerborn
Big Data – the Flood
Walmart drags a million hourly retail transactions into a database that
long ago passed 2.5 petabytes; Facebook processes 2.5 billion pieces of
content and 500 terabytes of data each day; and Google, whose YouTube
division alone gains 72 hours of new video every minute, accumulates 24
petabytes of data in a single day.
− David Rowan, Editor, WIRED UK,
http://www.edge.org/response-detail/23859
“Each day, according to IBM,
we collectively generate 2.5
quintillion bytes—a tsunami of
structured and unstructured
data that's growing, in IDC's
reckoning, at 60 per cent a
year.
14© B. Jacobfeuerborn
15
Four Paradigms of Science.
© B. Jacobfeuerborn
Scientific Revolutions
T.S. Kuhn
1922 - 1996
16© B. Jacobfeuerborn
17
Science has been developing
from idea-centricity to data-centricity.
Data leverage ideas!
My Addendum to Kuhn’s Claim
Data
Idea
© B. Jacobfeuerborn
The School of Athens, Raphael, 1509
18© B. Jacobfeuerborn
1. Platonic Approach
In the Greek language science means
knowledge. According to Aristotle and
Plato science/knowledge is:
universal, necessary, certain, and
timeless. Deduction is the only
allowed way of reasoning.
Mathematics is a prototype (model) of
science and a language of nature.
19© B. Jacobfeuerborn
2. Baconian Approach
Francis Bacon’s new methodology of
science and knowledge, empiricism,
that relayed on observation,
collection of data, and experimenting,
along with accepting induction as a
legal inference method for scientific
endeavors can be characterized as
data-centric.
20© B. Jacobfeuerborn
Francis Bacon, 1561 - 1626
3. Computers at Work (Simulation, Modelling)
−J.P. Rini
“The idea is to use a computer program
to perform lengthy computations, and to
provide a proof that the result of these
computations implies the given theorem.
In 1976, the four color theorem was the
first major theorem to be verified using a
computer program.”
http://en.wikipedia.org/wiki/Computer-assisted_proof
21© B. Jacobfeuerborn
22
“It is a capital mistake to
theorize before one has
data.”
− Sherlock Holmes,
A Study in Scarlett
(Arthur Conan Doyle)
The Role of Data
© B. Jacobfeuerborn
23
“We can stop looking for models. We
can analyze the data without
hypotheses about what it might show.
We can throw the numbers into the
biggest computing clusters the world
has ever seen and let algorithms find
patterns where science cannot.”
–Chris Anderson
4. Big Data at Work
© B. Jacobfeuerborn
24
Data Science.
© B. Jacobfeuerborn
25
Data science is a set of scientific theories, methods, tools, and best
practices (including hacking!) aimed to analyse and explore big
datasets in order to discover hidden knowledge thru inference.
Data Science
source: Data Science: An Introduction,
http://en.wikibooks.org/wiki/Data_Science:_An_Introduction
© B. Jacobfeuerborn
!
26© B. Jacobfeuerborn
27
My Vision
Information
Science
Data
Science
© B. Jacobfeuerborn
28
− Definitions of data, information, and knowledge.
− Data structures and databases.
− Big data and analytics trends.
− Elements of logics and non-standard inference mechanisms for big data.
− Assorted methods of knowledge representation.
− Elements of machine learning and artificial intelligence.
− Methods of browsing and retrieval of big data, with a focus on methods to fast delivery of the retrieved
hits.
− Methods and tools to create metadata.
− Data integration.
− Deep data analysis: statistics and data mining technologies.
− Architecture of scalable big data systems.
− Cloud computing; methods of physical storage of big data; virtualization technologies for sharing
processing power and memory.
− Security and privacy within big data infrastructures.
− Big data case studies (e.g. social networking, governance, marketing, health).
Data Science Curriculum for Information Science Students
© B. Jacobfeuerborn
29
Epilogue.
© B. Jacobfeuerborn
30
“With too little data, you won’t be
able to make any conclusions
that you trust. …
Big data isn’t about bits, it’s
about talent.”
–Douglas Merrill
http://www.forbes.com/sites/douglasmerrill/
2012/05/01/r-is-not-enough-for-big-data/
To Remember
© B. Jacobfeuerborn
Thank you for listening!

Más contenido relacionado

La actualidad más candente

ADED 7330 Introduction
ADED 7330 IntroductionADED 7330 Introduction
ADED 7330 Introduction
queenofrug
 
Introduction to Big Data and Data Science
Introduction to Big Data and Data ScienceIntroduction to Big Data and Data Science
Introduction to Big Data and Data Science
Feyzi R. Bagirov
 
Aula 4 27032015 sii-v1
Aula 4   27032015 sii-v1Aula 4   27032015 sii-v1
Aula 4 27032015 sii-v1
Aneesh Zutshi
 

La actualidad más candente (19)

ADED 7330 Introduction
ADED 7330 IntroductionADED 7330 Introduction
ADED 7330 Introduction
 
Digitisation Infrastructure - June 2007
Digitisation Infrastructure - June 2007Digitisation Infrastructure - June 2007
Digitisation Infrastructure - June 2007
 
Prescottimperialbigdata
PrescottimperialbigdataPrescottimperialbigdata
Prescottimperialbigdata
 
Philipine internet
Philipine internetPhilipine internet
Philipine internet
 
InfoFest Kent 2017: Panel discussion - combating fake news
InfoFest Kent 2017: Panel discussion - combating fake newsInfoFest Kent 2017: Panel discussion - combating fake news
InfoFest Kent 2017: Panel discussion - combating fake news
 
Supercomputing and the cloud - the next big paradigm shift?
Supercomputing and the cloud - the next big paradigm shift?Supercomputing and the cloud - the next big paradigm shift?
Supercomputing and the cloud - the next big paradigm shift?
 
Internet
InternetInternet
Internet
 
Health and clinical research - data futures, NIHR accelerating digital programme
Health and clinical research - data futures, NIHR accelerating digital programmeHealth and clinical research - data futures, NIHR accelerating digital programme
Health and clinical research - data futures, NIHR accelerating digital programme
 
Introduction to Big Data and Data Science
Introduction to Big Data and Data ScienceIntroduction to Big Data and Data Science
Introduction to Big Data and Data Science
 
Assignment1.final version
Assignment1.final versionAssignment1.final version
Assignment1.final version
 
Open Data - a goldmine (JavaZone 2009)
Open Data - a goldmine (JavaZone 2009)Open Data - a goldmine (JavaZone 2009)
Open Data - a goldmine (JavaZone 2009)
 
Aula 4 27032015 sii-v1
Aula 4   27032015 sii-v1Aula 4   27032015 sii-v1
Aula 4 27032015 sii-v1
 
Science and Culture in the EU‘s Digital Agenda
Science and Culture  in the EU‘s Digital AgendaScience and Culture  in the EU‘s Digital Agenda
Science and Culture in the EU‘s Digital Agenda
 
Polinter01
Polinter01Polinter01
Polinter01
 
Polinter01
Polinter01Polinter01
Polinter01
 
Setting the Scene for Big Data in Europe, Looking Ahead to the Case Studies
Setting the Scene for Big Data in Europe, Looking Ahead to the Case StudiesSetting the Scene for Big Data in Europe, Looking Ahead to the Case Studies
Setting the Scene for Big Data in Europe, Looking Ahead to the Case Studies
 
Internet
InternetInternet
Internet
 
Moldova ICT Summit Open Data Session
Moldova ICT Summit Open Data SessionMoldova ICT Summit Open Data Session
Moldova ICT Summit Open Data Session
 
University of Texas, Austin
University of Texas, AustinUniversity of Texas, Austin
University of Texas, Austin
 

Similar a The Yotta is not Enough! / Bruno Jacobfeuerborn

Information Overload
Information OverloadInformation Overload
Information Overload
Miro Pusnik
 
The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?
Anna Fensel
 
Internet of things and nanothings workshop may 2014
Internet of things and nanothings workshop may 2014Internet of things and nanothings workshop may 2014
Internet of things and nanothings workshop may 2014
Marios Kyriazis
 
Introducing the Internet of Things: lecture @IULM University
Introducing the Internet of Things: lecture @IULM UniversityIntroducing the Internet of Things: lecture @IULM University
Introducing the Internet of Things: lecture @IULM University
Leandro Agro'
 

Similar a The Yotta is not Enough! / Bruno Jacobfeuerborn (20)

The_Information_Age.pptx
The_Information_Age.pptxThe_Information_Age.pptx
The_Information_Age.pptx
 
Steps towards a Data Value Chain
Steps towards a Data Value ChainSteps towards a Data Value Chain
Steps towards a Data Value Chain
 
Big dataorig
Big dataorigBig dataorig
Big dataorig
 
Internet and Sudan
Internet and SudanInternet and Sudan
Internet and Sudan
 
Information Overload
Information OverloadInformation Overload
Information Overload
 
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
 
Data science general
Data science generalData science general
Data science general
 
top 10 Data Mining Algorithms
top 10 Data Mining Algorithmstop 10 Data Mining Algorithms
top 10 Data Mining Algorithms
 
What is opendata
What is opendata What is opendata
What is opendata
 
The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?
 
Technology innovation 1
Technology innovation 1Technology innovation 1
Technology innovation 1
 
ContentMining and Copyright at CopyCamp2017
ContentMining and Copyright at CopyCamp2017ContentMining and Copyright at CopyCamp2017
ContentMining and Copyright at CopyCamp2017
 
Why not use ict in sudan
Why not use ict in sudanWhy not use ict in sudan
Why not use ict in sudan
 
06 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.201406 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.2014
 
Using Data for Science Journalism
Using Data for Science JournalismUsing Data for Science Journalism
Using Data for Science Journalism
 
Using Data for Science Journalism
Using Data for Science JournalismUsing Data for Science Journalism
Using Data for Science Journalism
 
The Internet of Things
The Internet of ThingsThe Internet of Things
The Internet of Things
 
Internet of things and nanothings workshop may 2014
Internet of things and nanothings workshop may 2014Internet of things and nanothings workshop may 2014
Internet of things and nanothings workshop may 2014
 
Introducing the Internet of Things: lecture @IULM University
Introducing the Internet of Things: lecture @IULM UniversityIntroducing the Internet of Things: lecture @IULM University
Introducing the Internet of Things: lecture @IULM University
 
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
 

Más de Zakład Systemów Informacyjnych, Instytut Informacji Naukowej i Studiów Bibliologicznych (UW)

Más de Zakład Systemów Informacyjnych, Instytut Informacji Naukowej i Studiów Bibliologicznych (UW) (20)

O postaci książek w bibliotekach cyfrowych / Zdzisław Dobrowolski
O postaci książek w bibliotekach cyfrowych / Zdzisław DobrowolskiO postaci książek w bibliotekach cyfrowych / Zdzisław Dobrowolski
O postaci książek w bibliotekach cyfrowych / Zdzisław Dobrowolski
 
Współautorstwo publikacji naukowych jako wyznacznik współpracy między naukowc...
Współautorstwo publikacji naukowych jako wyznacznik współpracy między naukowc...Współautorstwo publikacji naukowych jako wyznacznik współpracy między naukowc...
Współautorstwo publikacji naukowych jako wyznacznik współpracy między naukowc...
 
Zachowania informacyjne humanistów – badanie potrzeb informacyjnych pracownik...
Zachowania informacyjne humanistów – badanie potrzeb informacyjnych pracownik...Zachowania informacyjne humanistów – badanie potrzeb informacyjnych pracownik...
Zachowania informacyjne humanistów – badanie potrzeb informacyjnych pracownik...
 
Zachowania związane z indywidualnym zarządzaniem wiedzą i informacją – w świe...
Zachowania związane z indywidualnym zarządzaniem wiedzą i informacją – w świe...Zachowania związane z indywidualnym zarządzaniem wiedzą i informacją – w świe...
Zachowania związane z indywidualnym zarządzaniem wiedzą i informacją – w świe...
 
Koncepcje trwałej ochrony polskich zasobów cyfrowych /Aneta Januszko-Szakiel
Koncepcje trwałej ochrony polskich zasobów cyfrowych /Aneta Januszko-SzakielKoncepcje trwałej ochrony polskich zasobów cyfrowych /Aneta Januszko-Szakiel
Koncepcje trwałej ochrony polskich zasobów cyfrowych /Aneta Januszko-Szakiel
 
Publikowanie naukowe jako proces organizacji wiedzy. Zarys koncepcji / Marek ...
Publikowanie naukowe jako proces organizacji wiedzy. Zarys koncepcji / Marek ...Publikowanie naukowe jako proces organizacji wiedzy. Zarys koncepcji / Marek ...
Publikowanie naukowe jako proces organizacji wiedzy. Zarys koncepcji / Marek ...
 
The Library as Place at the Digital Age / Laurence Favier
The Library as Place at the Digital Age /  Laurence FavierThe Library as Place at the Digital Age /  Laurence Favier
The Library as Place at the Digital Age / Laurence Favier
 
JHP BN – rekonfiguracja modelu chwd i dekonstrukcja jiw. Zmiany organizacji t...
JHP BN – rekonfiguracja modelu chwd i dekonstrukcja jiw. Zmiany organizacji t...JHP BN – rekonfiguracja modelu chwd i dekonstrukcja jiw. Zmiany organizacji t...
JHP BN – rekonfiguracja modelu chwd i dekonstrukcja jiw. Zmiany organizacji t...
 
e-Urząd-Biblioteka-Obywatel. Biblioteki jako pośrednicy w dostępie do informa...
e-Urząd-Biblioteka-Obywatel. Biblioteki jako pośrednicy w dostępie do informa...e-Urząd-Biblioteka-Obywatel. Biblioteki jako pośrednicy w dostępie do informa...
e-Urząd-Biblioteka-Obywatel. Biblioteki jako pośrednicy w dostępie do informa...
 
Metodologia badań w nauce o informacji – brakujący element / Arkadiusz Puliko...
Metodologia badań w nauce o informacji – brakujący element / Arkadiusz Puliko...Metodologia badań w nauce o informacji – brakujący element / Arkadiusz Puliko...
Metodologia badań w nauce o informacji – brakujący element / Arkadiusz Puliko...
 
Transitioning from Technical Services to Center for Digital Scholarship and S...
Transitioning from Technical Services to Center for Digital Scholarship and S...Transitioning from Technical Services to Center for Digital Scholarship and S...
Transitioning from Technical Services to Center for Digital Scholarship and S...
 
School library as a heterotopic place / Béatrice Micheau
School library as a heterotopic place / Béatrice Micheau School library as a heterotopic place / Béatrice Micheau
School library as a heterotopic place / Béatrice Micheau
 
Social Media Strategy Design / Robin Effing
Social Media Strategy Design / Robin EffingSocial Media Strategy Design / Robin Effing
Social Media Strategy Design / Robin Effing
 
Information culture as a social cultural practice: (re)defining the concept i...
Information culture as a social cultural practice: (re)defining the concept i...Information culture as a social cultural practice: (re)defining the concept i...
Information culture as a social cultural practice: (re)defining the concept i...
 
Digitization for Access and Preservation: The Evolving Debate in the Cultural...
Digitization for Access and Preservation: The Evolving Debate in the Cultural...Digitization for Access and Preservation: The Evolving Debate in the Cultural...
Digitization for Access and Preservation: The Evolving Debate in the Cultural...
 
Collaborative platformin the agricultural sectorin Algeria. Towards a Knowled...
Collaborative platformin the agricultural sectorin Algeria. Towards a Knowled...Collaborative platformin the agricultural sectorin Algeria. Towards a Knowled...
Collaborative platformin the agricultural sectorin Algeria. Towards a Knowled...
 
The road to providing access to Iran’s heritage resources: Iranian Consortium...
The road to providing access to Iran’s heritage resources: Iranian Consortium...The road to providing access to Iran’s heritage resources: Iranian Consortium...
The road to providing access to Iran’s heritage resources: Iranian Consortium...
 
Information Overload and Information Science / Mieczysław Muraszkiewicz
Information Overload and Information Science / Mieczysław MuraszkiewiczInformation Overload and Information Science / Mieczysław Muraszkiewicz
Information Overload and Information Science / Mieczysław Muraszkiewicz
 
Zachowania informacyjne humanistów – badanie potrzeb informacyjnych pracownik...
Zachowania informacyjne humanistów – badanie potrzeb informacyjnych pracownik...Zachowania informacyjne humanistów – badanie potrzeb informacyjnych pracownik...
Zachowania informacyjne humanistów – badanie potrzeb informacyjnych pracownik...
 
Naukowe systemy informacyjno-wyszukiwawcze – ogólne lub specjalistyczne – pro...
Naukowe systemy informacyjno-wyszukiwawcze – ogólne lub specjalistyczne – pro...Naukowe systemy informacyjno-wyszukiwawcze – ogólne lub specjalistyczne – pro...
Naukowe systemy informacyjno-wyszukiwawcze – ogólne lub specjalistyczne – pro...
 

Último

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Último (20)

ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 

The Yotta is not Enough! / Bruno Jacobfeuerborn

  • 1. 1 The Yotta is not Enough! The Need for Rethinking Information Science. Dr Bruno Jacobfeuerborn Telekom Deutschland GmbH “Information Science in an Age of Change”, 2nd Conference Institute for Information Science and Book Studies, University of Warsaw Warsaw, April 15-16th, 2013
  • 2. Yotta (Y) = 1024 or 1 000 000 000 000 000 000 000 000 2© B. Jacobfeuerborn
  • 3. Metric Prefixes (ISO) 3© B. Jacobfeuerborn
  • 4. Dr. Bruno Jacobfeuerborn - Moved to Deutsche Telekom in 1989. - Head of Radio and Transmission Department in Hanover , 1991. - Regional Director in Leipzig, 1991. - Regional Director Technology and later Regional Director Business, responsible for Sales, Marketing and Technology, Hanover, 1995. - T-Mobile; the acquisition of the GSM license in Poland, 1996. - Technical Director T-Mobile Netherlands and Member of the Management Board, 2002. - Head of Service Management Europe in the T-Mobile International, 2004. - Technical Director PTC and Member of the Management Board, 2007. - Director and Management Board Member responsible for technology (fixed and mobile) in Germany at Telekom Deutschland GmbH, 2009. - Invited speaker to international conferences and coach of workshops. - MOST Foundation General Assembly member. 4
  • 5. 5 Contents ─ Prologue ─ Thesis ─ Big Data ─ 4 Paradigms of Science ─ Data Science ─ Epilogue New Scientific Paradigm Big Data © B. Jacobfeuerborn
  • 7. 7 Data is the raw material of the XXI century. Credo © B. Jacobfeuerborn
  • 9. 9 A new scientific paradigm emerges. Information science has to face and cope with it! Thesis Source: Cartoonbank.com © B. Jacobfeuerborn
  • 10. 10 Big Data. © B. Jacobfeuerborn
  • 12. 12 “Big data refers to datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze”. --- McKinsey, 2011 Big Data © B. Jacobfeuerborn
  • 13. How Big is Big? Today: between Exabytes (1018) and Zettabytes (1021) Tomorrow: over Zettabytes 13© B. Jacobfeuerborn
  • 14. Big Data – the Flood Walmart drags a million hourly retail transactions into a database that long ago passed 2.5 petabytes; Facebook processes 2.5 billion pieces of content and 500 terabytes of data each day; and Google, whose YouTube division alone gains 72 hours of new video every minute, accumulates 24 petabytes of data in a single day. − David Rowan, Editor, WIRED UK, http://www.edge.org/response-detail/23859 “Each day, according to IBM, we collectively generate 2.5 quintillion bytes—a tsunami of structured and unstructured data that's growing, in IDC's reckoning, at 60 per cent a year. 14© B. Jacobfeuerborn
  • 15. 15 Four Paradigms of Science. © B. Jacobfeuerborn
  • 16. Scientific Revolutions T.S. Kuhn 1922 - 1996 16© B. Jacobfeuerborn
  • 17. 17 Science has been developing from idea-centricity to data-centricity. Data leverage ideas! My Addendum to Kuhn’s Claim Data Idea © B. Jacobfeuerborn
  • 18. The School of Athens, Raphael, 1509 18© B. Jacobfeuerborn
  • 19. 1. Platonic Approach In the Greek language science means knowledge. According to Aristotle and Plato science/knowledge is: universal, necessary, certain, and timeless. Deduction is the only allowed way of reasoning. Mathematics is a prototype (model) of science and a language of nature. 19© B. Jacobfeuerborn
  • 20. 2. Baconian Approach Francis Bacon’s new methodology of science and knowledge, empiricism, that relayed on observation, collection of data, and experimenting, along with accepting induction as a legal inference method for scientific endeavors can be characterized as data-centric. 20© B. Jacobfeuerborn Francis Bacon, 1561 - 1626
  • 21. 3. Computers at Work (Simulation, Modelling) −J.P. Rini “The idea is to use a computer program to perform lengthy computations, and to provide a proof that the result of these computations implies the given theorem. In 1976, the four color theorem was the first major theorem to be verified using a computer program.” http://en.wikipedia.org/wiki/Computer-assisted_proof 21© B. Jacobfeuerborn
  • 22. 22 “It is a capital mistake to theorize before one has data.” − Sherlock Holmes, A Study in Scarlett (Arthur Conan Doyle) The Role of Data © B. Jacobfeuerborn
  • 23. 23 “We can stop looking for models. We can analyze the data without hypotheses about what it might show. We can throw the numbers into the biggest computing clusters the world has ever seen and let algorithms find patterns where science cannot.” –Chris Anderson 4. Big Data at Work © B. Jacobfeuerborn
  • 24. 24 Data Science. © B. Jacobfeuerborn
  • 25. 25 Data science is a set of scientific theories, methods, tools, and best practices (including hacking!) aimed to analyse and explore big datasets in order to discover hidden knowledge thru inference. Data Science source: Data Science: An Introduction, http://en.wikibooks.org/wiki/Data_Science:_An_Introduction © B. Jacobfeuerborn !
  • 28. 28 − Definitions of data, information, and knowledge. − Data structures and databases. − Big data and analytics trends. − Elements of logics and non-standard inference mechanisms for big data. − Assorted methods of knowledge representation. − Elements of machine learning and artificial intelligence. − Methods of browsing and retrieval of big data, with a focus on methods to fast delivery of the retrieved hits. − Methods and tools to create metadata. − Data integration. − Deep data analysis: statistics and data mining technologies. − Architecture of scalable big data systems. − Cloud computing; methods of physical storage of big data; virtualization technologies for sharing processing power and memory. − Security and privacy within big data infrastructures. − Big data case studies (e.g. social networking, governance, marketing, health). Data Science Curriculum for Information Science Students © B. Jacobfeuerborn
  • 30. 30 “With too little data, you won’t be able to make any conclusions that you trust. … Big data isn’t about bits, it’s about talent.” –Douglas Merrill http://www.forbes.com/sites/douglasmerrill/ 2012/05/01/r-is-not-enough-for-big-data/ To Remember © B. Jacobfeuerborn
  • 31. Thank you for listening!