Nandan Shah's presentation covered key topics related to big data including definitions, important technologies like Hadoop and MapReduce, benefits such as enabling real-time analysis and faster decision making, risks around data security and privacy, and applications across various domains such as banking, retail, and healthcare. The presentation also provided interesting facts about the size of data holdings and growth of the big data industry.
4. Entry to the big data world
Definitions of big data
Important technologies of big data
Benefits of big data
Risks of big data
Applications of big data in various domains
Interesting facts about big data
Quote for conclusion
8. Wikipedia: an all-encompassing term
for any collection of data sets so
large and complex that it becomes
difficult to process using on-hand
data management tools or traditional
data processing applications.
McKinsey: datasets whose size is
beyond the ability of typical
database software tools to capture,
store, manage, and analyze
10. Hadoop is a framework that allows
for the distributed processing of
large data sets across clusters of
computers using programming
models.
It consists of the storage part(HDFS)
& the processing part(Map Reduce)
11. MAP REDUCE
Map-Reduce is a programming
model and an associated
implementation for processing
and generating large data sets
with a parallel, distributed
algorithm on a cluster.
Master-slave architecture
where we have a job tracker &
task tracker
14. Helps in real time analysis of data.
Can store & process any type of data
using technologies like Hadoop &
Map-Reduce.
Provides faster & better decision-
making abilities.
19. Making your cities smarter.
Customizing your website on real
time.
Media domain calculates TRP of
programs using big data.
Genome mapping blended with big
data is the biggest boost to health
care.
Taking recruiters from “I think” to “I
Know”.
21. AT&T is said to be holding the largest volume of data in its database which now contains phone records
database of 312 terabytes in size & approximately 2 trillion rows.
6 million new jobs would be created in 2015
Data Science termed as "Sexiest job of the 21st century" by Harvard Business School
The number of bits stored in digital universe are more than the number of stars in the physical universe.
Currently US$10.2 billion industry. Expected to reach US$54.3 by 2017
The data volumes are exploding, more data has been created in the past two years than in the entire previous
history of the human race.
73% of organizations have already invested or plan to invest in big data by 2016
At the moment less than 0.5% of all data is ever analysed and used, just imagine the potential here.
Stacking a pile of CD-ROMs on top of one another until you’d reached the current global storage capacity for
digital information would stretch 80,000 km beyond the moon