SlideShare una empresa de Scribd logo
1 de 12
Applying Big Data
Presented by John Dougherty, Viriton
4/25/2013
john.dougherty@viriton.com
Big Data Buzzwords
• Volume, Velocity, and Variety
• Agility/Agile Development
• Modeling Data
The 3V’s originated in the early 2000’s. META (Gartner, now)
Volume…self contained. Velocity = Speed of transaction
Variety = Data profiling from multiple data sources
The Agile Manifesto, created February 2001 (Remember Scrum?)
Incorporation into Big Data software becoming mandatory
Adaptive and Predictive approaches are hotly contested
Data Modeling is paramount, given Big or Small datasets
Design must be confronted at ingress and egress
Hybrid data modeling and remodeling existing models
Veracity has been added, but has not yet been fully adopted
Big Data Buzzwords – Agile Dev.
• Informatics
• Daily Batch
• Classic Dept.
Informaticists are leveraged across multiple disciplines
There is no strict definition for a data scientist/informaticist
Greatest likelihood to adopt an agile/adaptive model
Development _should_ be incorporated into existing process
workflows. Seamlessness should be the goal.
Utilizing an agile approach to finding new uses to existing data
Least likely to need/adopt new development approaches
Relevant data must still be filtered through
Staff should not be re-learning the wheel with deployment
• Example of Hybrid Modeling
• Every project/objective must have properly
defined models to reach maximum efficacy
• Data silos are losing their complicit positioning
• Transitioning modeling to enumeration
Big Data Buzzwords – Data Models
Big Data Buzzwords – Question Inception
Connecting these lines is a great
example of the work that lies ahead in
identifying the objectives and goals of
the business environment
Big Picture
There is a lot of data
As of 2009, Google generates at least >2 EB per
year, >2TB indexed URLs, >9B page views per day
Facebook houses one billion users; utilizing >500TB
per day, housing 35% or more of the world’s photos
YouTube houses >1EB of data, >72 hours of video
per minute, >4B views per day
Twitter >125B tweets per year, >390M per
day, approximately 4500 per second
~2.3B people use the internet today, of which, 90% of
the world’s data has been generated within the last
two years
The Internet of Things
(connected devices and data)
What will you be aggregating?
In 2002, recorded media and electronic information flows
generated about 22 exabytes (1018) of information
In 2006, the amount of digital information
created, captured, and replicated was 161 EB
Use Cases
IBM’s 5 High Value Use Cases
Big Data Exploration
Find, visualize, understand all big data to improve decision making. Big data exploration addresses the challenge
that every large organization faces: information is stored in many different systems and silos and people need
access to that data to do their day-to-day work and make important decisions.
Enhanced 360º View of the Customer
Extend existing customer views by incorporating additional internal and external information sources. Gain a full
understanding of customers—what makes them tick, why they buy, how they prefer to shop, why they switch, what
they’ll buy next, and what factors lead them to recommend a company to others.
Security/Intelligence Extension
Lower risk, detect fraud and monitor cyber security in real time. Augment and enhance cyber security and
intelligence analysis platforms with big data technologies to process and analyze new types (e.g. social
media, emails, sensors, Telco) and sources of under-leveraged data to significantly improve intelligence, security
and law enforcement insight
Operations Analysis
Analyze a variety of machine and operational data for improved business results. The abundance and growth of
machine data, which can include anything from IT machines to sensors and meters and GPS devices requires
complex analysis and correlation across different types of data sets. By using big data for operations
analysis, organizations can gain real-time visibility into operations, customer experience, transactions and behavior.
Data Warehouse Augmentation
Integrate big data and data warehouse capabilities to increase operational efficiency. Optimize your data
warehouse to enable new types of analysis. Use big data technologies to set up a staging area or landing zone for
your new data before determining what data should be moved to the data warehouse. Offload infrequently accessed
or aged data from warehouse and application databases using information integration software and tools.
• Applied since data science began, 1970’s
• Many different products available, augmenting existing
solutions, and providing all-in-one
• SAS
• SAP (though called predictive analytics, still fits)
• Same problems incur with extensibility as do with
design/deployment
Use Cases
Visual Analytics
Science: Visual analytics is the
science of analytical reasoning
facilitated by interactive visual
interfaces
Sensor Analytics
Internet of Things: The first speaking of the gargantuan brontobyte
(1 Bit = Binary Digit · 8 Bits = 1 Byte · 1024 Bytes = 1 Kilobyte · 1024 Kilobytes = 1 Megabyte · 1024 Megabytes = 1 Gigabyte · 1024 Gigabytes = 1 Terabyte · 1024
Terabytes = 1 Petabyte · 1024 Petabytes = 1 Exabyte· 1024 Exabytes = 1 Zettabyte · 1024 Zettabytes = 1 Yottabyte · 1024 Yottabytes = 1 Brontobyte· 1024 Brontobytes =
1 Geopbyte)
• ROI Metrics are difficult to predict, but follow a
trend of double and triple digits
• What keeps the CEO up at night, decision
journeys
• An anecdotal report (questionarre) shows
44% of CMO’s can measure their ROI
• Design and development will continue to be
tantamount to a successful return
ROI?
ROI
Nucleus Research
• Becoming an analytic enterprise requires Big Data
• Average ROI of 241%
• Increased productivity
• A major metropolitan police department achieved an 863 percent ROI when it combined its criminal
records database with a national crime database created by a major university.
• Reduced labor costs
• A major resort earned an ROI of 1,822 percent when it integrated shift scheduling processes with data from a
national weather service, enabling managers to avoid unnecessary shift assignments and increase staff
utilization.
Four Stages of an
Analytic Enterprise
Telco reduces costs associated to CO management
and circuit deployment by 230%
QoS data expected to expand well into Petabytes for
the Telco industry
Moving Forward
How to formulate the right questions
• Communication between C-Suite and VP isn’t enough
• Considering old data, wholistic approaches work best
• Objectives and goals begin with dialogue at the highest
levels
• What are the questions we should be asking?
Should we start now? Yes.
Brought to you by:
BIBLIOGRAPHY
 http://blogs.starcio.com/2012/12/what-is-big-data-real-challenges-beyond.html - Big Data for All Businesses
 http://www.nytimes.com/2013/03/24/nyregion/mayor-bloombergs-geek-squad.html?pagewanted=all&_r=2& - NYC Mayor’s use case
 http://www.312analytics.com/what-is-machine-learning-big-data-modeling/ - Data Modeling, the big challenges
 http://goo.gl/wH3qG - Origin of VVV
 http://www.businessinsider.com/cia-presentation-on-big-data-2013-3?op=1 - CIA CTO Presentation
 http://www.rosebt.com/1/post/2013/03/data-science-and-analytics-workflow.html - Rose Business Technologies, Workflow
 http://nucleusresearch.com/research/notes-and-reports/the-big-returns-from-big-data/ - Nucleus Research

Más contenido relacionado

La actualidad más candente

Manufacturing Data Center Fast Facts: Big Data, Storage, Security & Recovery
Manufacturing Data Center Fast Facts: Big Data, Storage, Security & RecoveryManufacturing Data Center Fast Facts: Big Data, Storage, Security & Recovery
Manufacturing Data Center Fast Facts: Big Data, Storage, Security & RecoveryInsight
 
Big Data – Manufacturing
Big Data – ManufacturingBig Data – Manufacturing
Big Data – ManufacturingWilliam Pagnon
 
The current challenges and opportunities of big data and analytics in emergen...
The current challenges and opportunities of big data and analytics in emergen...The current challenges and opportunities of big data and analytics in emergen...
The current challenges and opportunities of big data and analytics in emergen...IBM Analytics
 
Big Data & Analytics in the Manufacturing Industry: The Vaasan Group
Big Data & Analytics in the Manufacturing Industry: The Vaasan GroupBig Data & Analytics in the Manufacturing Industry: The Vaasan Group
Big Data & Analytics in the Manufacturing Industry: The Vaasan GroupIBM Analytics
 
Lean Production Meets Big Data: A Next Generation Use Case
Lean Production Meets Big Data: A Next Generation Use CaseLean Production Meets Big Data: A Next Generation Use Case
Lean Production Meets Big Data: A Next Generation Use CaseDatameer
 
Big data & Its influence in the IT
Big data & Its influence in the ITBig data & Its influence in the IT
Big data & Its influence in the ITHeamalatha Pradeeba
 
Big Data - Insights & Challenges
Big Data - Insights & ChallengesBig Data - Insights & Challenges
Big Data - Insights & ChallengesRupen Momaya
 
The future of big data analytics
The future of big data analyticsThe future of big data analytics
The future of big data analyticsAhmed Banafa
 
Strategyzing big data in telco industry
Strategyzing big data in telco industryStrategyzing big data in telco industry
Strategyzing big data in telco industryParviz Iskhakov
 
IRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial DomainIRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial DomainIRJET Journal
 
IoT Meets Big Data: The Opportunities and Challenges by Syed Hoda of ParStream
IoT Meets Big Data: The Opportunities and Challenges by Syed Hoda of ParStreamIoT Meets Big Data: The Opportunities and Challenges by Syed Hoda of ParStream
IoT Meets Big Data: The Opportunities and Challenges by Syed Hoda of ParStreamgogo6
 
Big Data and Semantic Web in Manufacturing
Big Data and Semantic Web in ManufacturingBig Data and Semantic Web in Manufacturing
Big Data and Semantic Web in ManufacturingNitesh Khilwani
 

La actualidad más candente (20)

Big data in manufacturing
Big data in manufacturingBig data in manufacturing
Big data in manufacturing
 
Big data and analytics
Big data and analyticsBig data and analytics
Big data and analytics
 
Manufacturing Data Center Fast Facts: Big Data, Storage, Security & Recovery
Manufacturing Data Center Fast Facts: Big Data, Storage, Security & RecoveryManufacturing Data Center Fast Facts: Big Data, Storage, Security & Recovery
Manufacturing Data Center Fast Facts: Big Data, Storage, Security & Recovery
 
Michael Hummel - Stop Storing Data! - Parstream
Michael Hummel - Stop Storing Data! - ParstreamMichael Hummel - Stop Storing Data! - Parstream
Michael Hummel - Stop Storing Data! - Parstream
 
Big Data – Manufacturing
Big Data – ManufacturingBig Data – Manufacturing
Big Data – Manufacturing
 
The current challenges and opportunities of big data and analytics in emergen...
The current challenges and opportunities of big data and analytics in emergen...The current challenges and opportunities of big data and analytics in emergen...
The current challenges and opportunities of big data and analytics in emergen...
 
Big Data & Analytics in the Manufacturing Industry: The Vaasan Group
Big Data & Analytics in the Manufacturing Industry: The Vaasan GroupBig Data & Analytics in the Manufacturing Industry: The Vaasan Group
Big Data & Analytics in the Manufacturing Industry: The Vaasan Group
 
Lean Production Meets Big Data: A Next Generation Use Case
Lean Production Meets Big Data: A Next Generation Use CaseLean Production Meets Big Data: A Next Generation Use Case
Lean Production Meets Big Data: A Next Generation Use Case
 
Big data & Its influence in the IT
Big data & Its influence in the ITBig data & Its influence in the IT
Big data & Its influence in the IT
 
Big Data - Insights & Challenges
Big Data - Insights & ChallengesBig Data - Insights & Challenges
Big Data - Insights & Challenges
 
IOT DATA AND BIG DATA
IOT DATA AND BIG DATAIOT DATA AND BIG DATA
IOT DATA AND BIG DATA
 
Big data
Big dataBig data
Big data
 
The future of big data analytics
The future of big data analyticsThe future of big data analytics
The future of big data analytics
 
Big Data
Big DataBig Data
Big Data
 
Strategyzing big data in telco industry
Strategyzing big data in telco industryStrategyzing big data in telco industry
Strategyzing big data in telco industry
 
Data Science
Data ScienceData Science
Data Science
 
IRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial DomainIRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial Domain
 
IoT Meets Big Data: The Opportunities and Challenges by Syed Hoda of ParStream
IoT Meets Big Data: The Opportunities and Challenges by Syed Hoda of ParStreamIoT Meets Big Data: The Opportunities and Challenges by Syed Hoda of ParStream
IoT Meets Big Data: The Opportunities and Challenges by Syed Hoda of ParStream
 
Big Data and Semantic Web in Manufacturing
Big Data and Semantic Web in ManufacturingBig Data and Semantic Web in Manufacturing
Big Data and Semantic Web in Manufacturing
 
Big data
Big dataBig data
Big data
 

Destacado

페차쿠차_ 조연진
페차쿠차_ 조연진페차쿠차_ 조연진
페차쿠차_ 조연진연진 조
 
페차쿠차
페차쿠차페차쿠차
페차쿠차ekwjd4793
 
Evaluation Question 4
Evaluation Question 4Evaluation Question 4
Evaluation Question 4annaskelding
 
Catedra virtual de cultura ciudadana
Catedra virtual de cultura ciudadanaCatedra virtual de cultura ciudadana
Catedra virtual de cultura ciudadanaLuisa Paternina
 
Qlitan wid my cousins
Qlitan wid my cousinsQlitan wid my cousins
Qlitan wid my cousinsbhabyjekang
 
Evolución de los avances tecnológicos
Evolución de los avances tecnológicosEvolución de los avances tecnológicos
Evolución de los avances tecnológicosMerlys Escarpeta
 
Sitios de interes
Sitios de interesSitios de interes
Sitios de interesVane Avella
 
Top 150 global design firms
Top 150 global design firmsTop 150 global design firms
Top 150 global design firmsSamar Momin
 
Rosalia de Castro
Rosalia de CastroRosalia de Castro
Rosalia de CastroDeni-sa
 
Semantic Web (Web 3.0)
Semantic Web (Web 3.0)Semantic Web (Web 3.0)
Semantic Web (Web 3.0)John Dougherty
 
Aca advocacy
Aca advocacyAca advocacy
Aca advocacyschacctf
 
Hadoop Infrastructure (Oct. 3rd, 2012)
Hadoop Infrastructure (Oct. 3rd, 2012)Hadoop Infrastructure (Oct. 3rd, 2012)
Hadoop Infrastructure (Oct. 3rd, 2012)John Dougherty
 

Destacado (20)

Enc 3241 document_design1
Enc 3241 document_design1Enc 3241 document_design1
Enc 3241 document_design1
 
페차쿠차_ 조연진
페차쿠차_ 조연진페차쿠차_ 조연진
페차쿠차_ 조연진
 
페차쿠차
페차쿠차페차쿠차
페차쿠차
 
Evaluation Question 4
Evaluation Question 4Evaluation Question 4
Evaluation Question 4
 
Catedra virtual de cultura ciudadana
Catedra virtual de cultura ciudadanaCatedra virtual de cultura ciudadana
Catedra virtual de cultura ciudadana
 
Qlitan wid my cousins
Qlitan wid my cousinsQlitan wid my cousins
Qlitan wid my cousins
 
PRUEBA TOEFL
PRUEBA TOEFLPRUEBA TOEFL
PRUEBA TOEFL
 
Evolución de los avances tecnológicos
Evolución de los avances tecnológicosEvolución de los avances tecnológicos
Evolución de los avances tecnológicos
 
Subculture hippie
Subculture hippieSubculture hippie
Subculture hippie
 
Sitios de interes
Sitios de interesSitios de interes
Sitios de interes
 
Top 150 global design firms
Top 150 global design firmsTop 150 global design firms
Top 150 global design firms
 
Rosalia de Castro
Rosalia de CastroRosalia de Castro
Rosalia de Castro
 
Big Data ROI
Big Data ROIBig Data ROI
Big Data ROI
 
Qtllb
QtllbQtllb
Qtllb
 
Semantic Web (Web 3.0)
Semantic Web (Web 3.0)Semantic Web (Web 3.0)
Semantic Web (Web 3.0)
 
Aca advocacy
Aca advocacyAca advocacy
Aca advocacy
 
Hadoop Infrastructure (Oct. 3rd, 2012)
Hadoop Infrastructure (Oct. 3rd, 2012)Hadoop Infrastructure (Oct. 3rd, 2012)
Hadoop Infrastructure (Oct. 3rd, 2012)
 
Enc lecture day3
Enc lecture day3Enc lecture day3
Enc lecture day3
 
Diapositiva asesores
Diapositiva asesoresDiapositiva asesores
Diapositiva asesores
 
SEO Pricing & Cost
SEO Pricing & CostSEO Pricing & Cost
SEO Pricing & Cost
 

Similar a Applying Big Data

Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataSpringPeople
 
final oracle presentation
final oracle presentationfinal oracle presentation
final oracle presentationPriyesh Patel
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?Denodo
 
Building Confidence in Big Data - IBM Smarter Business 2013
Building Confidence in Big Data - IBM Smarter Business 2013 Building Confidence in Big Data - IBM Smarter Business 2013
Building Confidence in Big Data - IBM Smarter Business 2013 IBM Sverige
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Denodo
 
Accelerate Cloud Migrations and Architecture with Data Virtualization
Accelerate Cloud Migrations and Architecture with Data VirtualizationAccelerate Cloud Migrations and Architecture with Data Virtualization
Accelerate Cloud Migrations and Architecture with Data VirtualizationDenodo
 
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...Precisely
 
Aitp presentation ed holub - october 23 2010
Aitp presentation   ed holub - october 23 2010Aitp presentation   ed holub - october 23 2010
Aitp presentation ed holub - october 23 2010AITPHouston
 
Webinar: The 5 Most Critical Things to Understand About Modern Data Integration
Webinar: The 5 Most Critical Things to Understand About Modern Data IntegrationWebinar: The 5 Most Critical Things to Understand About Modern Data Integration
Webinar: The 5 Most Critical Things to Understand About Modern Data IntegrationSnapLogic
 
Content1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docxContent1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docxdickonsondorris
 
IRJET- Search Improvement using Digital Thread in Data Analytics
IRJET- Search Improvement using Digital Thread in Data AnalyticsIRJET- Search Improvement using Digital Thread in Data Analytics
IRJET- Search Improvement using Digital Thread in Data AnalyticsIRJET Journal
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementTony Bain
 
How to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT OperationsHow to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT OperationsExtraHop Networks
 
ppt final.pptx
ppt final.pptxppt final.pptx
ppt final.pptxkalai75
 

Similar a Applying Big Data (20)

Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
final oracle presentation
final oracle presentationfinal oracle presentation
final oracle presentation
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
 
Kartikey tripathi
Kartikey tripathiKartikey tripathi
Kartikey tripathi
 
Sgcp14dunlea
Sgcp14dunleaSgcp14dunlea
Sgcp14dunlea
 
Building Confidence in Big Data - IBM Smarter Business 2013
Building Confidence in Big Data - IBM Smarter Business 2013 Building Confidence in Big Data - IBM Smarter Business 2013
Building Confidence in Big Data - IBM Smarter Business 2013
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
 
Big data.pptx
Big data.pptxBig data.pptx
Big data.pptx
 
Accelerate Cloud Migrations and Architecture with Data Virtualization
Accelerate Cloud Migrations and Architecture with Data VirtualizationAccelerate Cloud Migrations and Architecture with Data Virtualization
Accelerate Cloud Migrations and Architecture with Data Virtualization
 
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big_Data_ppt[1] (1).pptx
Big_Data_ppt[1] (1).pptxBig_Data_ppt[1] (1).pptx
Big_Data_ppt[1] (1).pptx
 
Aitp presentation ed holub - october 23 2010
Aitp presentation   ed holub - october 23 2010Aitp presentation   ed holub - october 23 2010
Aitp presentation ed holub - october 23 2010
 
Webinar: The 5 Most Critical Things to Understand About Modern Data Integration
Webinar: The 5 Most Critical Things to Understand About Modern Data IntegrationWebinar: The 5 Most Critical Things to Understand About Modern Data Integration
Webinar: The 5 Most Critical Things to Understand About Modern Data Integration
 
Content1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docxContent1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docx
 
IRJET- Search Improvement using Digital Thread in Data Analytics
IRJET- Search Improvement using Digital Thread in Data AnalyticsIRJET- Search Improvement using Digital Thread in Data Analytics
IRJET- Search Improvement using Digital Thread in Data Analytics
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data Management
 
Big data
Big dataBig data
Big data
 
How to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT OperationsHow to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT Operations
 
ppt final.pptx
ppt final.pptxppt final.pptx
ppt final.pptx
 

Último

unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 

Último (20)

unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 

Applying Big Data

  • 1. Applying Big Data Presented by John Dougherty, Viriton 4/25/2013 john.dougherty@viriton.com
  • 2. Big Data Buzzwords • Volume, Velocity, and Variety • Agility/Agile Development • Modeling Data The 3V’s originated in the early 2000’s. META (Gartner, now) Volume…self contained. Velocity = Speed of transaction Variety = Data profiling from multiple data sources The Agile Manifesto, created February 2001 (Remember Scrum?) Incorporation into Big Data software becoming mandatory Adaptive and Predictive approaches are hotly contested Data Modeling is paramount, given Big or Small datasets Design must be confronted at ingress and egress Hybrid data modeling and remodeling existing models Veracity has been added, but has not yet been fully adopted
  • 3. Big Data Buzzwords – Agile Dev. • Informatics • Daily Batch • Classic Dept. Informaticists are leveraged across multiple disciplines There is no strict definition for a data scientist/informaticist Greatest likelihood to adopt an agile/adaptive model Development _should_ be incorporated into existing process workflows. Seamlessness should be the goal. Utilizing an agile approach to finding new uses to existing data Least likely to need/adopt new development approaches Relevant data must still be filtered through Staff should not be re-learning the wheel with deployment
  • 4. • Example of Hybrid Modeling • Every project/objective must have properly defined models to reach maximum efficacy • Data silos are losing their complicit positioning • Transitioning modeling to enumeration Big Data Buzzwords – Data Models
  • 5. Big Data Buzzwords – Question Inception Connecting these lines is a great example of the work that lies ahead in identifying the objectives and goals of the business environment
  • 6. Big Picture There is a lot of data As of 2009, Google generates at least >2 EB per year, >2TB indexed URLs, >9B page views per day Facebook houses one billion users; utilizing >500TB per day, housing 35% or more of the world’s photos YouTube houses >1EB of data, >72 hours of video per minute, >4B views per day Twitter >125B tweets per year, >390M per day, approximately 4500 per second ~2.3B people use the internet today, of which, 90% of the world’s data has been generated within the last two years The Internet of Things (connected devices and data) What will you be aggregating? In 2002, recorded media and electronic information flows generated about 22 exabytes (1018) of information In 2006, the amount of digital information created, captured, and replicated was 161 EB
  • 7. Use Cases IBM’s 5 High Value Use Cases Big Data Exploration Find, visualize, understand all big data to improve decision making. Big data exploration addresses the challenge that every large organization faces: information is stored in many different systems and silos and people need access to that data to do their day-to-day work and make important decisions. Enhanced 360º View of the Customer Extend existing customer views by incorporating additional internal and external information sources. Gain a full understanding of customers—what makes them tick, why they buy, how they prefer to shop, why they switch, what they’ll buy next, and what factors lead them to recommend a company to others. Security/Intelligence Extension Lower risk, detect fraud and monitor cyber security in real time. Augment and enhance cyber security and intelligence analysis platforms with big data technologies to process and analyze new types (e.g. social media, emails, sensors, Telco) and sources of under-leveraged data to significantly improve intelligence, security and law enforcement insight Operations Analysis Analyze a variety of machine and operational data for improved business results. The abundance and growth of machine data, which can include anything from IT machines to sensors and meters and GPS devices requires complex analysis and correlation across different types of data sets. By using big data for operations analysis, organizations can gain real-time visibility into operations, customer experience, transactions and behavior. Data Warehouse Augmentation Integrate big data and data warehouse capabilities to increase operational efficiency. Optimize your data warehouse to enable new types of analysis. Use big data technologies to set up a staging area or landing zone for your new data before determining what data should be moved to the data warehouse. Offload infrequently accessed or aged data from warehouse and application databases using information integration software and tools.
  • 8. • Applied since data science began, 1970’s • Many different products available, augmenting existing solutions, and providing all-in-one • SAS • SAP (though called predictive analytics, still fits) • Same problems incur with extensibility as do with design/deployment Use Cases Visual Analytics Science: Visual analytics is the science of analytical reasoning facilitated by interactive visual interfaces Sensor Analytics Internet of Things: The first speaking of the gargantuan brontobyte (1 Bit = Binary Digit · 8 Bits = 1 Byte · 1024 Bytes = 1 Kilobyte · 1024 Kilobytes = 1 Megabyte · 1024 Megabytes = 1 Gigabyte · 1024 Gigabytes = 1 Terabyte · 1024 Terabytes = 1 Petabyte · 1024 Petabytes = 1 Exabyte· 1024 Exabytes = 1 Zettabyte · 1024 Zettabytes = 1 Yottabyte · 1024 Yottabytes = 1 Brontobyte· 1024 Brontobytes = 1 Geopbyte)
  • 9. • ROI Metrics are difficult to predict, but follow a trend of double and triple digits • What keeps the CEO up at night, decision journeys • An anecdotal report (questionarre) shows 44% of CMO’s can measure their ROI • Design and development will continue to be tantamount to a successful return ROI?
  • 10. ROI Nucleus Research • Becoming an analytic enterprise requires Big Data • Average ROI of 241% • Increased productivity • A major metropolitan police department achieved an 863 percent ROI when it combined its criminal records database with a national crime database created by a major university. • Reduced labor costs • A major resort earned an ROI of 1,822 percent when it integrated shift scheduling processes with data from a national weather service, enabling managers to avoid unnecessary shift assignments and increase staff utilization. Four Stages of an Analytic Enterprise Telco reduces costs associated to CO management and circuit deployment by 230% QoS data expected to expand well into Petabytes for the Telco industry
  • 11. Moving Forward How to formulate the right questions • Communication between C-Suite and VP isn’t enough • Considering old data, wholistic approaches work best • Objectives and goals begin with dialogue at the highest levels • What are the questions we should be asking? Should we start now? Yes. Brought to you by:
  • 12. BIBLIOGRAPHY  http://blogs.starcio.com/2012/12/what-is-big-data-real-challenges-beyond.html - Big Data for All Businesses  http://www.nytimes.com/2013/03/24/nyregion/mayor-bloombergs-geek-squad.html?pagewanted=all&_r=2& - NYC Mayor’s use case  http://www.312analytics.com/what-is-machine-learning-big-data-modeling/ - Data Modeling, the big challenges  http://goo.gl/wH3qG - Origin of VVV  http://www.businessinsider.com/cia-presentation-on-big-data-2013-3?op=1 - CIA CTO Presentation  http://www.rosebt.com/1/post/2013/03/data-science-and-analytics-workflow.html - Rose Business Technologies, Workflow  http://nucleusresearch.com/research/notes-and-reports/the-big-returns-from-big-data/ - Nucleus Research

Notas del editor

  1. Thank you for coming to Big Data for BusinessNo other speakers, see if there is interest
  2. Concentrate on Veracity…not too muchDevelopment is not rigid, and agility may not be the only option.We have evidence that other approaches may yield just as good or better results.Data modeling is tantamount to a proper deployment
  3. Discuss why these are important to recognize for deployment and designThese styles all have similar issues with finding the right value
  4. Real time data flow is now the next step in finding answers.We still have to develop the right questions, or the right methods to finding the right questions
  5. Thank Michael Walker for the graphicThis illustrates a great abstraction for discourses at the business necessity perspective
  6. That’s a lot of data!End with the possibility of data aggregation sources, novel and extablished
  7. IBM has a pretty good grasp of Big Data’s implementations
  8. There are, fortunately or unfortunately, far fewer use cases than there are companies to provide solutions for those use cases
  9. Decision journeys for customers and predicting usage/purchasing patternsUtilized heavily in the Amazon space (both by Amazon and by their market source partners)
  10. Return on investment is proven, but not guaranteed for your businessFinding one massive return might justify the costs, but guaranteeing small returns will win more arguments
  11. There are a slew of resources available, and I will post this presentation along with other materials on the meetup page.Thanks again for coming, let’s have a discussion, and make sure to fill up
  12. This will be available online in a few days