SlideShare una empresa de Scribd logo
1 de 16
BIG DATA
Introduction
• Big Data represents technological advancement focussed on the massive data
being generated at breakneck speeds & variety .
• It came to the forefront as one of the rapidly growing IT pillars of the future such
as blockchain and was driven by Iot & pervasive use of social media
• Lead to shift in companies attitude now focussing on making optimal use of data
and becoming data driven
• Data comes in 2 forms- a)structured b)unstructured
• Growing at an exponential rate and a 50$+ billion dollar market currently
• Its roots can be traced back to as early as 1995 when it started taking shape.
2
Dimensions of Big Data
Volume
• Volume refers to the amount of data an organization or an individual collects
and/or generates
• Any data exceeding 1TB is called as big data.
Variety
• Data are mostly classified into 3 types namely - Unstructured, Semi structured and
Structured.
• Unstructured - Text, photo, video, audio, sensor data, and clickstream data
• Semi structured - Extensible Business Reporting Language (XBRL)
• Structured - Traditional databases (Relational database, NoSQL database)
...
3
Velocity
• Velocity means the rate at which the data is being generated
• With increase in technology the velocity of data has also increased
• The enhanced capability of data generation from connected devices will continue
to accelerate the velocity
Veracity (Introduced by IBM)
• Veracity refers to the uncertainty and unreliability of data sources.
• These uncertainty arise due to latency, redundancy, inaccuracy and deception of
data.
...
4
Variability and Complexity (Introduced by SAS)
• Variation in rate of data flow is called variability. can fluctuat with unpredicted
eaks and troughs.
• Complexity refers to the number of data sources. reduction in this is necessary
Value (Introduced by Oracle)
• The value of data can not be judged initially, data cannot be of high value in its
initial from but using data analytics it can be transformed into high value asset.
• Everything is upon IT professionals and managers to extract the value out of the
given data
5
Traditional
Data
Variety
Big Data
Veracity(-)
Variability(+)
Complexity(+)
Decay(+)
Value(+)
Volume Added
Integrated view of Big data
6
Evolution of Big Data
The advent of the World Wide Web (WWW) in the early 1990s led to the
explosive growth of data and the development of big data analytics and evolved
through three major stages.
Big Data 1.0
• Arrival of e-commerce in 1994
• Online firms were the main contributors of the web content and web mining
techniques were developed to analyze users’ online activities
• Mining processes helped to discover web users’ usage browsing pattern
• Connectivity through hyperlink
• Classification of web pages
• Mining techniques in image processing and computer vision application was
limited
...
7
Big Data 2.0
• Social media analytics support social media content mining, usage
mining, and structure mining activities
• Sentiment analysis
• Lexical –based methods and machine-learning methods, to overcome the
sentiment analysis flaws
• Social networking sites were the central point to socialize
Big Data 3.0
• Introduction of IoT applications
• Devices used sensors that have unique identifiers which has the ability to share
data, collaborate over the internet without human intervention
• Trending streaming analytics which was far better than social media analytics
8
An Illustrative Example : Merchant Reviews
• A very good example of application of Big Data in recent times would be
Merchant Reviews. With multiple big name sites popping up with customer
reviews as their product, these Merchant Review sites have been the target of
many researchers.
• Customer written reviews are perceived as most credible.
• Other users can rate the reviews as helpful or not, which further refines the most
useful data.
• This data is regularly researched upon and run through various models for
companies to translate it into business value.
• Most reviews with higher scores are perceived as less helpful than those with
lower scores.
• Number of words in a review shows direct relationship to helpfulness.
9
Impact of
Big Data
Create New
Business
Develop
New
Products/
Services
Improving
Business
Operations
Cost
Savings
Better
Decision
Making
Higher
Service
Quality
10
Personalised Marketing
• Personalised products/services, coupons, promotional offers
• Macy’s and Target analyse shopper’s preferences and sentiments to improve
shopping experience
• Banks – increase revenue, increase client retention, better services
• U.S. Bank used both online and offline channels to enhance Customer relation
management, thereby leading to a rise in conversion rate up to 100%
Better Pricing
• Big data helps to set prices appropriately
• Use of open source technology helps in cost optimization and customer
satisfaction, e.g., eBay’s use of open source Hadoop technology
...
11
Cost Reduction
• Faster and effective reaction in supply chain issues
• Better demand forecasts, Real-time tracking , optimised distribution network
management and reduction in operational costs, e.g., Retail industry
• GE helped Oil and Gas industry(better efficiency with higher productivity)
and Southwest Airlines(fuel saving opportunities)
Improved Customer Service
• Integration of data from multiple channels helps the firms to understand the
customer better , e.g., Hertz in U.S
• Real time transaction analysis to detect fraudulent activities and informing
the customers
• Use of speech analytics and social media analytics helped Southwest Airlines
to provide better service offerings
12
Challenges in Big Data
• Data Quality - Data quality means relevance of data respective to the key
management decisions that have to be made. Low quality, unstructured data can
lead to false analysis and insights and thus affect the management processes.
There should be internal control systems in place to assure the quality and
reliability of data collected.
• Data Security - Data branches, data leaks and weak security can cause huge
financial losses for the company as well as damage to brand reputation. Highly
efficient firewalls and detection systems should be in place to ensure security of
confidential data.
• Privacy - The level of users data collected by firms can also raise concern about
user privacy and consent. On the flip side, the collection and use of personal data
can be used to improve quality of services and reduce costs, which is beneficial
for both the firms and the customers. Hence, there is trade off involved between
customer privacy and deeper customer insights for product development.
...
13
• High investment - Data analytics is a very efficient technology but its
applicability is limited to certain aspects of business. Thus, firms should properly
conduct the cost-benefit analysis before investing huge sums of money and
resources in big data analytics. Future cash inflows and projections should
justify the investment.
• Data Management - Data analytics require highly efficient hardware and
software resource for seamless functioning. Traditional DBMS and systems may
not be compatible with big data applications. Data warehouses management is
also very important as petabytes of big data is stored there.
• Required Talent and Expertise - The key element of successful data
analytics is the human resource that will manage, filter, and organize, loads of
unstructured data. Firms will have to invest highly in talent acquisition channels
and competitive salaries to attract qualified data scientists. Internal training
programs to be conducted to train employees.
14
Future in Big Data
• There is a prediction that the data generated would reach 175 zettabytes by 2025
• Machine learning would help in forming more powerful unsupervised
algorithms, greater personalisation, and cognitive services will greatly improve
computer’s ability to learn from data
• Demand for data scientists and chief data officers would be high with the
increasing availability of data so as to suffice for the analytical purposes
• Privacy would be a hot issue as data volumes increase, safeguarding it against
invasions and cyberattacks becomes more difficult, as data protection standards
cannot keep up with the rate of data expansion
• Unlike Big data, Fast data and actionable data would come into play as it allows
for processing real time streams
15
Thankyou
16

Más contenido relacionado

La actualidad más candente

Analytics driving innovation and efficiency in Banking
Analytics driving innovation and efficiency in BankingAnalytics driving innovation and efficiency in Banking
Analytics driving innovation and efficiency in BankingGianpaolo Zampol
 
Big Data & Analytics perspectives in Banking
Big Data & Analytics perspectives in BankingBig Data & Analytics perspectives in Banking
Big Data & Analytics perspectives in BankingGianpaolo Zampol
 
MIT ICIQ 2017 Keynote: Data Governance and Data Capitalization in the Big Dat...
MIT ICIQ 2017 Keynote: Data Governance and Data Capitalization in the Big Dat...MIT ICIQ 2017 Keynote: Data Governance and Data Capitalization in the Big Dat...
MIT ICIQ 2017 Keynote: Data Governance and Data Capitalization in the Big Dat...Pieter De Leenheer
 
McKinsey Big Data Overview
McKinsey Big Data OverviewMcKinsey Big Data Overview
McKinsey Big Data Overviewoptier
 
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...ijdpsjournal
 
Future of Power: Big Data - Søren Ravn
Future of Power: Big Data - Søren RavnFuture of Power: Big Data - Søren Ravn
Future of Power: Big Data - Søren RavnIBM Danmark
 
Slides: Data Monetization — Demonstrating Quantifiable Financial Benefits fro...
Slides: Data Monetization — Demonstrating Quantifiable Financial Benefits fro...Slides: Data Monetization — Demonstrating Quantifiable Financial Benefits fro...
Slides: Data Monetization — Demonstrating Quantifiable Financial Benefits fro...DATAVERSITY
 
Big Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White PaperBig Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White PaperExperian
 
In the Absence of Fact - Stephen Harris
In the Absence of Fact - Stephen HarrisIn the Absence of Fact - Stephen Harris
In the Absence of Fact - Stephen HarrisMolly Alexander
 
Data strategy demistifying data
Data strategy demistifying dataData strategy demistifying data
Data strategy demistifying dataHans Verstraeten
 
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Simplilearn
 
TiE DC GovCon Panel on Emerging Technologies: AI/ML/Blockchain/Data Managemen...
TiE DC GovCon Panel on Emerging Technologies: AI/ML/Blockchain/Data Managemen...TiE DC GovCon Panel on Emerging Technologies: AI/ML/Blockchain/Data Managemen...
TiE DC GovCon Panel on Emerging Technologies: AI/ML/Blockchain/Data Managemen...Pieter De Leenheer
 
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...Denodo
 
A better business case for big data with Hadoop
A better business case for big data with HadoopA better business case for big data with Hadoop
A better business case for big data with HadoopAptitude Software
 

La actualidad más candente (20)

Analytics driving innovation and efficiency in Banking
Analytics driving innovation and efficiency in BankingAnalytics driving innovation and efficiency in Banking
Analytics driving innovation and efficiency in Banking
 
Big Data & Analytics perspectives in Banking
Big Data & Analytics perspectives in BankingBig Data & Analytics perspectives in Banking
Big Data & Analytics perspectives in Banking
 
Big data
Big dataBig data
Big data
 
MIT ICIQ 2017 Keynote: Data Governance and Data Capitalization in the Big Dat...
MIT ICIQ 2017 Keynote: Data Governance and Data Capitalization in the Big Dat...MIT ICIQ 2017 Keynote: Data Governance and Data Capitalization in the Big Dat...
MIT ICIQ 2017 Keynote: Data Governance and Data Capitalization in the Big Dat...
 
Big Data Forum - Phoenix
Big Data Forum - PhoenixBig Data Forum - Phoenix
Big Data Forum - Phoenix
 
McKinsey Big Data Overview
McKinsey Big Data OverviewMcKinsey Big Data Overview
McKinsey Big Data Overview
 
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
 
Future of Power: Big Data - Søren Ravn
Future of Power: Big Data - Søren RavnFuture of Power: Big Data - Søren Ravn
Future of Power: Big Data - Søren Ravn
 
Big data
Big dataBig data
Big data
 
Slides: Data Monetization — Demonstrating Quantifiable Financial Benefits fro...
Slides: Data Monetization — Demonstrating Quantifiable Financial Benefits fro...Slides: Data Monetization — Demonstrating Quantifiable Financial Benefits fro...
Slides: Data Monetization — Demonstrating Quantifiable Financial Benefits fro...
 
Big Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White PaperBig Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White Paper
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
IBM Big Data Platform Nov 2012
IBM Big Data Platform Nov 2012IBM Big Data Platform Nov 2012
IBM Big Data Platform Nov 2012
 
In the Absence of Fact - Stephen Harris
In the Absence of Fact - Stephen HarrisIn the Absence of Fact - Stephen Harris
In the Absence of Fact - Stephen Harris
 
Data strategy demistifying data
Data strategy demistifying dataData strategy demistifying data
Data strategy demistifying data
 
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
 
TiE DC GovCon Panel on Emerging Technologies: AI/ML/Blockchain/Data Managemen...
TiE DC GovCon Panel on Emerging Technologies: AI/ML/Blockchain/Data Managemen...TiE DC GovCon Panel on Emerging Technologies: AI/ML/Blockchain/Data Managemen...
TiE DC GovCon Panel on Emerging Technologies: AI/ML/Blockchain/Data Managemen...
 
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
 
A better business case for big data with Hadoop
A better business case for big data with HadoopA better business case for big data with Hadoop
A better business case for big data with Hadoop
 

Similar a Big data

Usama Fayyad talk in South Africa: From BigData to Data Science
Usama Fayyad talk in South Africa:  From BigData to Data ScienceUsama Fayyad talk in South Africa:  From BigData to Data Science
Usama Fayyad talk in South Africa: From BigData to Data ScienceUsama Fayyad
 
final oracle presentation
final oracle presentationfinal oracle presentation
final oracle presentationPriyesh Patel
 
Data Virtualization for Compliance – Creating a Controlled Data Environment
Data Virtualization for Compliance – Creating a Controlled Data EnvironmentData Virtualization for Compliance – Creating a Controlled Data Environment
Data Virtualization for Compliance – Creating a Controlled Data EnvironmentDenodo
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataUmair Shafique
 
Big Data, Analytics and Data Science
Big Data, Analytics and Data ScienceBig Data, Analytics and Data Science
Big Data, Analytics and Data Sciencedlamb3244
 
Introduction to Big Data Analytics
Introduction to Big Data AnalyticsIntroduction to Big Data Analytics
Introduction to Big Data AnalyticsUtkarsh Sharma
 
Increasing Agility Through Data Virtualization
Increasing Agility Through Data VirtualizationIncreasing Agility Through Data Virtualization
Increasing Agility Through Data VirtualizationDenodo
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise AnalyticsDATAVERSITY
 
Operationalize analytics through modern data strategy
Operationalize analytics through modern data strategyOperationalize analytics through modern data strategy
Operationalize analytics through modern data strategyNagarro
 
Value of data in digital transformation
Value of data in digital transformationValue of data in digital transformation
Value of data in digital transformationLoihde Advisory
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataSpringPeople
 
Deliveinrg explainable AI
Deliveinrg explainable AIDeliveinrg explainable AI
Deliveinrg explainable AIGary Allemann
 
BIG DATA CHAPTER 2 IN DSS.pptx
BIG DATA CHAPTER 2 IN DSS.pptxBIG DATA CHAPTER 2 IN DSS.pptx
BIG DATA CHAPTER 2 IN DSS.pptxmuflehaljarrah
 
20140826 I&T Webinar_The Proliferation of Data - Finding Meaning Amidst the N...
20140826 I&T Webinar_The Proliferation of Data - Finding Meaning Amidst the N...20140826 I&T Webinar_The Proliferation of Data - Finding Meaning Amidst the N...
20140826 I&T Webinar_The Proliferation of Data - Finding Meaning Amidst the N...Steven Callahan
 

Similar a Big data (20)

Big data
Big dataBig data
Big data
 
Trends in data analytics
Trends in data analyticsTrends in data analytics
Trends in data analytics
 
uae views on big data
  uae views on  big data  uae views on  big data
uae views on big data
 
Usama Fayyad talk in South Africa: From BigData to Data Science
Usama Fayyad talk in South Africa:  From BigData to Data ScienceUsama Fayyad talk in South Africa:  From BigData to Data Science
Usama Fayyad talk in South Africa: From BigData to Data Science
 
Big_Data.pptx
Big_Data.pptxBig_Data.pptx
Big_Data.pptx
 
final oracle presentation
final oracle presentationfinal oracle presentation
final oracle presentation
 
Customer 360
Customer 360Customer 360
Customer 360
 
Data Virtualization for Compliance – Creating a Controlled Data Environment
Data Virtualization for Compliance – Creating a Controlled Data EnvironmentData Virtualization for Compliance – Creating a Controlled Data Environment
Data Virtualization for Compliance – Creating a Controlled Data Environment
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Big Data, Analytics and Data Science
Big Data, Analytics and Data ScienceBig Data, Analytics and Data Science
Big Data, Analytics and Data Science
 
Introduction to Big Data Analytics
Introduction to Big Data AnalyticsIntroduction to Big Data Analytics
Introduction to Big Data Analytics
 
Increasing Agility Through Data Virtualization
Increasing Agility Through Data VirtualizationIncreasing Agility Through Data Virtualization
Increasing Agility Through Data Virtualization
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Operationalize analytics through modern data strategy
Operationalize analytics through modern data strategyOperationalize analytics through modern data strategy
Operationalize analytics through modern data strategy
 
Value of data in digital transformation
Value of data in digital transformationValue of data in digital transformation
Value of data in digital transformation
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Deliveinrg explainable AI
Deliveinrg explainable AIDeliveinrg explainable AI
Deliveinrg explainable AI
 
BIG DATA CHAPTER 2 IN DSS.pptx
BIG DATA CHAPTER 2 IN DSS.pptxBIG DATA CHAPTER 2 IN DSS.pptx
BIG DATA CHAPTER 2 IN DSS.pptx
 
20140826 I&T Webinar_The Proliferation of Data - Finding Meaning Amidst the N...
20140826 I&T Webinar_The Proliferation of Data - Finding Meaning Amidst the N...20140826 I&T Webinar_The Proliferation of Data - Finding Meaning Amidst the N...
20140826 I&T Webinar_The Proliferation of Data - Finding Meaning Amidst the N...
 
Pres_Big Data for Finance_vsaini
Pres_Big Data for Finance_vsainiPres_Big Data for Finance_vsaini
Pres_Big Data for Finance_vsaini
 

Último

Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...GQ Research
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Vision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxVision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxellehsormae
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 

Último (20)

Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Vision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxVision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptx
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 

Big data

  • 2. Introduction • Big Data represents technological advancement focussed on the massive data being generated at breakneck speeds & variety . • It came to the forefront as one of the rapidly growing IT pillars of the future such as blockchain and was driven by Iot & pervasive use of social media • Lead to shift in companies attitude now focussing on making optimal use of data and becoming data driven • Data comes in 2 forms- a)structured b)unstructured • Growing at an exponential rate and a 50$+ billion dollar market currently • Its roots can be traced back to as early as 1995 when it started taking shape. 2
  • 3. Dimensions of Big Data Volume • Volume refers to the amount of data an organization or an individual collects and/or generates • Any data exceeding 1TB is called as big data. Variety • Data are mostly classified into 3 types namely - Unstructured, Semi structured and Structured. • Unstructured - Text, photo, video, audio, sensor data, and clickstream data • Semi structured - Extensible Business Reporting Language (XBRL) • Structured - Traditional databases (Relational database, NoSQL database) ... 3
  • 4. Velocity • Velocity means the rate at which the data is being generated • With increase in technology the velocity of data has also increased • The enhanced capability of data generation from connected devices will continue to accelerate the velocity Veracity (Introduced by IBM) • Veracity refers to the uncertainty and unreliability of data sources. • These uncertainty arise due to latency, redundancy, inaccuracy and deception of data. ... 4
  • 5. Variability and Complexity (Introduced by SAS) • Variation in rate of data flow is called variability. can fluctuat with unpredicted eaks and troughs. • Complexity refers to the number of data sources. reduction in this is necessary Value (Introduced by Oracle) • The value of data can not be judged initially, data cannot be of high value in its initial from but using data analytics it can be transformed into high value asset. • Everything is upon IT professionals and managers to extract the value out of the given data 5
  • 7. Evolution of Big Data The advent of the World Wide Web (WWW) in the early 1990s led to the explosive growth of data and the development of big data analytics and evolved through three major stages. Big Data 1.0 • Arrival of e-commerce in 1994 • Online firms were the main contributors of the web content and web mining techniques were developed to analyze users’ online activities • Mining processes helped to discover web users’ usage browsing pattern • Connectivity through hyperlink • Classification of web pages • Mining techniques in image processing and computer vision application was limited ... 7
  • 8. Big Data 2.0 • Social media analytics support social media content mining, usage mining, and structure mining activities • Sentiment analysis • Lexical –based methods and machine-learning methods, to overcome the sentiment analysis flaws • Social networking sites were the central point to socialize Big Data 3.0 • Introduction of IoT applications • Devices used sensors that have unique identifiers which has the ability to share data, collaborate over the internet without human intervention • Trending streaming analytics which was far better than social media analytics 8
  • 9. An Illustrative Example : Merchant Reviews • A very good example of application of Big Data in recent times would be Merchant Reviews. With multiple big name sites popping up with customer reviews as their product, these Merchant Review sites have been the target of many researchers. • Customer written reviews are perceived as most credible. • Other users can rate the reviews as helpful or not, which further refines the most useful data. • This data is regularly researched upon and run through various models for companies to translate it into business value. • Most reviews with higher scores are perceived as less helpful than those with lower scores. • Number of words in a review shows direct relationship to helpfulness. 9
  • 10. Impact of Big Data Create New Business Develop New Products/ Services Improving Business Operations Cost Savings Better Decision Making Higher Service Quality 10
  • 11. Personalised Marketing • Personalised products/services, coupons, promotional offers • Macy’s and Target analyse shopper’s preferences and sentiments to improve shopping experience • Banks – increase revenue, increase client retention, better services • U.S. Bank used both online and offline channels to enhance Customer relation management, thereby leading to a rise in conversion rate up to 100% Better Pricing • Big data helps to set prices appropriately • Use of open source technology helps in cost optimization and customer satisfaction, e.g., eBay’s use of open source Hadoop technology ... 11
  • 12. Cost Reduction • Faster and effective reaction in supply chain issues • Better demand forecasts, Real-time tracking , optimised distribution network management and reduction in operational costs, e.g., Retail industry • GE helped Oil and Gas industry(better efficiency with higher productivity) and Southwest Airlines(fuel saving opportunities) Improved Customer Service • Integration of data from multiple channels helps the firms to understand the customer better , e.g., Hertz in U.S • Real time transaction analysis to detect fraudulent activities and informing the customers • Use of speech analytics and social media analytics helped Southwest Airlines to provide better service offerings 12
  • 13. Challenges in Big Data • Data Quality - Data quality means relevance of data respective to the key management decisions that have to be made. Low quality, unstructured data can lead to false analysis and insights and thus affect the management processes. There should be internal control systems in place to assure the quality and reliability of data collected. • Data Security - Data branches, data leaks and weak security can cause huge financial losses for the company as well as damage to brand reputation. Highly efficient firewalls and detection systems should be in place to ensure security of confidential data. • Privacy - The level of users data collected by firms can also raise concern about user privacy and consent. On the flip side, the collection and use of personal data can be used to improve quality of services and reduce costs, which is beneficial for both the firms and the customers. Hence, there is trade off involved between customer privacy and deeper customer insights for product development. ... 13
  • 14. • High investment - Data analytics is a very efficient technology but its applicability is limited to certain aspects of business. Thus, firms should properly conduct the cost-benefit analysis before investing huge sums of money and resources in big data analytics. Future cash inflows and projections should justify the investment. • Data Management - Data analytics require highly efficient hardware and software resource for seamless functioning. Traditional DBMS and systems may not be compatible with big data applications. Data warehouses management is also very important as petabytes of big data is stored there. • Required Talent and Expertise - The key element of successful data analytics is the human resource that will manage, filter, and organize, loads of unstructured data. Firms will have to invest highly in talent acquisition channels and competitive salaries to attract qualified data scientists. Internal training programs to be conducted to train employees. 14
  • 15. Future in Big Data • There is a prediction that the data generated would reach 175 zettabytes by 2025 • Machine learning would help in forming more powerful unsupervised algorithms, greater personalisation, and cognitive services will greatly improve computer’s ability to learn from data • Demand for data scientists and chief data officers would be high with the increasing availability of data so as to suffice for the analytical purposes • Privacy would be a hot issue as data volumes increase, safeguarding it against invasions and cyberattacks becomes more difficult, as data protection standards cannot keep up with the rate of data expansion • Unlike Big data, Fast data and actionable data would come into play as it allows for processing real time streams 15