Enviar búsqueda
Cargar
Hadoop 2015: what we larned -Think Big, A Teradata Company
•
8 recomendaciones
•
1,704 vistas
DataWorks Summit
Seguir
Think Big, A Teradata Company Rick Farnell, Co-Founder & SVP International
Leer menos
Leer más
Tecnología
Denunciar
Compartir
Denunciar
Compartir
1 de 24
Recomendados
Making the Case for Hadoop in a Large Enterprise-British Airways
Making the Case for Hadoop in a Large Enterprise-British Airways
DataWorks Summit
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A Comparison
Capgemini
Informatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake Ecosystem
Capgemini
Hadoop: Making it work for the Business Unit
Hadoop: Making it work for the Business Unit
DataWorks Summit
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Tristan Baker
The principles of the business data lake
The principles of the business data lake
Capgemini
Teradata Professional Services Overview
Teradata Professional Services Overview
Teradata
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lon
Jeffrey T. Pollock
Recomendados
Making the Case for Hadoop in a Large Enterprise-British Airways
Making the Case for Hadoop in a Large Enterprise-British Airways
DataWorks Summit
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A Comparison
Capgemini
Informatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake Ecosystem
Capgemini
Hadoop: Making it work for the Business Unit
Hadoop: Making it work for the Business Unit
DataWorks Summit
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Tristan Baker
The principles of the business data lake
The principles of the business data lake
Capgemini
Teradata Professional Services Overview
Teradata Professional Services Overview
Teradata
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lon
Jeffrey T. Pollock
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...
DLT Solutions
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
Sai Paravastu
Regulation and Compliance in the Data Driven Enterprise
Regulation and Compliance in the Data Driven Enterprise
Denodo
Building Your Enterprise Data Marketplace with DMX-h
Building Your Enterprise Data Marketplace with DMX-h
Precisely
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
Capgemini
Slides: Accelerating Queries on Cloud Data Lakes
Slides: Accelerating Queries on Cloud Data Lakes
DATAVERSITY
The Top 5 Factors to Consider When Choosing a Big Data Solution
The Top 5 Factors to Consider When Choosing a Big Data Solution
DATAVERSITY
The Path to Data and Analytics Modernization
The Path to Data and Analytics Modernization
Analytics8
The Importance of DataOps in a Multi-Cloud World
The Importance of DataOps in a Multi-Cloud World
DATAVERSITY
Data-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse Strategies
DATAVERSITY
Analyst Webinar: Best Practices In Enabling Data-Driven Decision Making
Analyst Webinar: Best Practices In Enabling Data-Driven Decision Making
Denodo
Three Big Data Case Studies
Three Big Data Case Studies
Atidan Technologies Pvt Ltd (India)
Cloudera Fast Forward Labs: Accelerate machine learning
Cloudera Fast Forward Labs: Accelerate machine learning
Cloudera, Inc.
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
Denodo
The Evolution of Data Architecture
The Evolution of Data Architecture
Wei-Chiu Chuang
IBM Industry Models and Data Lake
IBM Industry Models and Data Lake
Pat O'Sullivan
6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop
Dr. Wilfred Lin (Ph.D.)
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop
Precisely
Agile BI: How to Deliver More Value in Less Time
Agile BI: How to Deliver More Value in Less Time
Perficient, Inc.
Real-Time Data Integration for Modern BI
Real-Time Data Integration for Modern BI
ibi
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Precisely
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard Rails
Denodo
Más contenido relacionado
La actualidad más candente
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...
DLT Solutions
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
Sai Paravastu
Regulation and Compliance in the Data Driven Enterprise
Regulation and Compliance in the Data Driven Enterprise
Denodo
Building Your Enterprise Data Marketplace with DMX-h
Building Your Enterprise Data Marketplace with DMX-h
Precisely
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
Capgemini
Slides: Accelerating Queries on Cloud Data Lakes
Slides: Accelerating Queries on Cloud Data Lakes
DATAVERSITY
The Top 5 Factors to Consider When Choosing a Big Data Solution
The Top 5 Factors to Consider When Choosing a Big Data Solution
DATAVERSITY
The Path to Data and Analytics Modernization
The Path to Data and Analytics Modernization
Analytics8
The Importance of DataOps in a Multi-Cloud World
The Importance of DataOps in a Multi-Cloud World
DATAVERSITY
Data-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse Strategies
DATAVERSITY
Analyst Webinar: Best Practices In Enabling Data-Driven Decision Making
Analyst Webinar: Best Practices In Enabling Data-Driven Decision Making
Denodo
Three Big Data Case Studies
Three Big Data Case Studies
Atidan Technologies Pvt Ltd (India)
Cloudera Fast Forward Labs: Accelerate machine learning
Cloudera Fast Forward Labs: Accelerate machine learning
Cloudera, Inc.
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
Denodo
The Evolution of Data Architecture
The Evolution of Data Architecture
Wei-Chiu Chuang
IBM Industry Models and Data Lake
IBM Industry Models and Data Lake
Pat O'Sullivan
6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop
Dr. Wilfred Lin (Ph.D.)
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop
Precisely
Agile BI: How to Deliver More Value in Less Time
Agile BI: How to Deliver More Value in Less Time
Perficient, Inc.
Real-Time Data Integration for Modern BI
Real-Time Data Integration for Modern BI
ibi
La actualidad más candente
(20)
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
Regulation and Compliance in the Data Driven Enterprise
Regulation and Compliance in the Data Driven Enterprise
Building Your Enterprise Data Marketplace with DMX-h
Building Your Enterprise Data Marketplace with DMX-h
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
Slides: Accelerating Queries on Cloud Data Lakes
Slides: Accelerating Queries on Cloud Data Lakes
The Top 5 Factors to Consider When Choosing a Big Data Solution
The Top 5 Factors to Consider When Choosing a Big Data Solution
The Path to Data and Analytics Modernization
The Path to Data and Analytics Modernization
The Importance of DataOps in a Multi-Cloud World
The Importance of DataOps in a Multi-Cloud World
Data-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse Strategies
Analyst Webinar: Best Practices In Enabling Data-Driven Decision Making
Analyst Webinar: Best Practices In Enabling Data-Driven Decision Making
Three Big Data Case Studies
Three Big Data Case Studies
Cloudera Fast Forward Labs: Accelerate machine learning
Cloudera Fast Forward Labs: Accelerate machine learning
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
The Evolution of Data Architecture
The Evolution of Data Architecture
IBM Industry Models and Data Lake
IBM Industry Models and Data Lake
6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop
Agile BI: How to Deliver More Value in Less Time
Agile BI: How to Deliver More Value in Less Time
Real-Time Data Integration for Modern BI
Real-Time Data Integration for Modern BI
Similar a Hadoop 2015: what we larned -Think Big, A Teradata Company
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Precisely
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard Rails
Denodo
Insights into Real World Data Management Challenges
Insights into Real World Data Management Challenges
DataWorks Summit
Insights into Real-world Data Management Challenges
Insights into Real-world Data Management Challenges
DataWorks Summit
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
Hortonworks
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of Hadoop
Inside Analysis
How Businesses use Big Data to Impact the Bottom Line
How Businesses use Big Data to Impact the Bottom Line
Enterprise Management Associates
12Nov13 Webinar: Big Data Analysis with Teradata and Revolution Analytics
12Nov13 Webinar: Big Data Analysis with Teradata and Revolution Analytics
Revolution Analytics
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
email2jl
Creating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data Architecture
Perficient, Inc.
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8
Cloudera, Inc.
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Cloudera, Inc.
Create your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouse
Jeff Kelly
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
Hortonworks
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
Hortonworks
Contexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to Production
Contexti
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
MongoDB
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Cloudera, Inc.
Embedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern Staender
Dataconomy Media
How to implement Hadoop successfully
How to implement Hadoop successfully
Adir Sharabi
Similar a Hadoop 2015: what we larned -Think Big, A Teradata Company
(20)
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard Rails
Insights into Real World Data Management Challenges
Insights into Real World Data Management Challenges
Insights into Real-world Data Management Challenges
Insights into Real-world Data Management Challenges
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of Hadoop
How Businesses use Big Data to Impact the Bottom Line
How Businesses use Big Data to Impact the Bottom Line
12Nov13 Webinar: Big Data Analysis with Teradata and Revolution Analytics
12Nov13 Webinar: Big Data Analysis with Teradata and Revolution Analytics
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
Creating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data Architecture
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Create your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouse
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
Contexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to Production
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Embedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern Staender
How to implement Hadoop successfully
How to implement Hadoop successfully
Más de DataWorks Summit
Data Science Crash Course
Data Science Crash Course
DataWorks Summit
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
DataWorks Summit
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
DataWorks Summit
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
DataWorks Summit
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
DataWorks Summit
Managing the Dewey Decimal System
Managing the Dewey Decimal System
DataWorks Summit
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
DataWorks Summit
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
DataWorks Summit
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
DataWorks Summit
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
DataWorks Summit
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
DataWorks Summit
Security Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
DataWorks Summit
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
DataWorks Summit
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
DataWorks Summit
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
DataWorks Summit
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
DataWorks Summit
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
DataWorks Summit
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
DataWorks Summit
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
DataWorks Summit
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
DataWorks Summit
Más de DataWorks Summit
(20)
Data Science Crash Course
Data Science Crash Course
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Managing the Dewey Decimal System
Managing the Dewey Decimal System
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Security Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Último
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
Lorenzo Miniero
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
LoriGlavin3
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
Mattias Andersson
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
gvaughan
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
Sergiu Bodiu
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
Fwdays
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
2toLead Limited
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
BookNet Canada
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
BookNet Canada
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Precisely
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
Stephanie Beckett
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
LoriGlavin3
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
Lonnie McRorey
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
BkGupta21
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
Rizwan Syed
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
NavinnSomaal
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
LoriGlavin3
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
Lars Bell
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
Pixlogix Infotech
Último
(20)
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
Hadoop 2015: what we larned -Think Big, A Teradata Company
1.
Think Big, A
Teradata Company Rick Farnell, Co-Founder & SVP International
2.
2 Open Data Platform
Support © 2015 Think Big, a Teradata Company
3.
3 What we learned
over 5 years at Think Big Teamwork is Critical Skills Matter Celebrate Success © 2015 Think Big, a Teradata Company
4.
4 Our hunch was
right…this market will be BIG Source: www.Indeed.com April 8, 2015 © 2015 Think Big, a Teradata Company http://wikibon.org/wiki/v/Big_Data_Vendor_Revenue_and_Market_Forecast_2013-2017
5.
5 • 100% Big
Data Focus • Founded in 2010 with over 100 engagements across 70 clients • Unlock client value of big data with data science and data engineering services • Proven vendor-neutral open source integration expertise • Agile team-based development methodology • Think Big Academy for skills and organizational development • Global delivery model on-site services with near shore and off shore support Who is Think Big? © 2015 Think Big, a Teradata Company
6.
6 Think Big Services
Engagement Model STRATEGY IMPLEMENTATION SOLUTION SUPPORT Think Big offers end-to-end Big Data strategy, implementation and support services focused on helping customers quickly achieve ROI on their Big Data investments Enterprise Data Lake Software Frameworks Big Data Roadmap Establish Data Lake Analytic Solutions Managed Services Data Lake Optimization © 2015 Think Big, a Teradata Company
7.
7 5 Years of
Big Data Services with Industry Leaders Think Big Clients eCommerce 2 of Global Top 5 Retail 2 of Global Top 5 Social Networking Global #1 Banking 4 of Global Top 10 Credit Issuer 2 of Global Top 5 Financial Data Services 2 of Global Top 5 Financial Exchanges Global #2 Brokerage & Mutual Funds 2 of Global Top 5 Asset Management Global #1 Semiconductor 2 of Global Top 5 Data Storage Devices 3 of Global Top 5 Disk Drive Manufacturing Global #1 Telecommunications 2 of Global Top 5 Media & Advertising 2 of Global Top 4 Internet Transaction Security Global #1 © 2015 Think Big, a Teradata Company
8.
8 Top 5 Learnings
over 5 Years in Big Data 1. Big Data is a journey not an event. 2. Open source is great for innovation but requires engineering sophistication in design patterns & best practices to continually scale analytics in production. 3. Importance of metadata, lineage, data management, data governance, data quality is critical to successful enterprise Big Data deployments. 4. A new data platform will not resolve organizational problems. 5. The Hadoop & opensource ecosystem is moving at the speed of light, you must be agile. Test, build, learn, repeat. © 2015 Think Big, a Teradata Company
9.
Think Big Enterprise
Data Lake Case Study: Global Manufacturing Client
10.
10 Manufacturing Client Overview Metrics: •
Collecting >2M manufacturing/testing binary files daily across Americas and APAC Facilities • Collecting from ~500 tables across 6 databases tens of millions of records daily • Over 140 analytics users to date • Over 150 attendees participated in Big Data Platform training Business Goals for Investment in Enterprise Data Lake: • Provide Enterprise-wide data access for timely analytics to product quality engineers • Establish foundation for large scale proactive manufacturing and quality analytics • Reduce $Millions in costs of scrap waste • Reduce $Millions in costs of containment processes • Reduce $Millions in costs of data search parties • Increase revenue and market share by accelerating time to market of products © 2015 Think Big, a Teradata Company
11.
11 Engineering and Ingestion
Design are Critical Day 1 1. Ingestion1. Utilize a robust Ingestion Design with a Buffer Server Zone 2. Packaging 2. Robust packaging many small flies into larger files with metadata. 3. Parse on Demand 3. Parse on demand. Increase velocity with right level of parsing, parse and refine in more detail as needed. 4. Establish Zones for Control 4. Establish Zones for comprehensive pipeline control, buffer, landing, ingest, core and publish. © 2015 Think Big, a Teradata Company
12.
12 Think Globally EDW EDW 5. Replication
on Ingestion 5. Replicate at ingestion with collection of key metadata information. 6. Data Treatment Facility 6. Utilize HDFS as giant file system, all data lands but some will continue in pipe to other MPP/EDW systems etc. 7. WAN Accelerators 7. WAN accelerates the speed transfer of ASIA to/from North America 8. Published Data 8. Processing and format is customized for the audience and consumption pattern. © 2015 Think Big, a Teradata Company
13.
13 Governance for Production
Workloads 9. Design Zones for Enterprise Governance 9. Zones will have different retention periods and different access patterns and in order to have pipeline control within your governance strategy you must design for this upfront. © 2015 Think Big, a Teradata Company
14.
14 Optimize Access Patterns
of your Data in Hadoop 10. Relational access pattern 10. Optimized for known queries. Equivalent of a Ferrari moving a bag of groceries. 11. Hadoop access pattern 11. Recast your problem. Hadoop optimal for large batches of work. Optimize hierarchical queries with transitive closure. Equivalent of a freight train moving multiple warehouses of groceries. © 2015 Think Big, a Teradata Company
15.
15 Pipeline Management, Monitoring
& Control HDFS 12. Data Pipeline 12. End to End Pipeline visibility is extremely important in an Enterprise Data Lake. Metadata Design and use of best practices are key in the use of this pattern. © 2015 Think Big, a Teradata Company
16.
16 Detailed Ingestion Design
Patterns 13. Source Systems 13. Hundreds of servers & hundreds of data streams require expert enterprise engineering. 14. Utilities 14. Layer in logging, monitoring and scheduling for complete control of your Enterprise Data Lake. © 2015 Think Big, a Teradata Company
17.
17 ROI for Enterprise
Data Lake Manufacturing Client Est. at $11M year 1, $30M cumulative year 2 © 2015 Think Big, a Teradata Company Operations Engineer: a recent production issue required detailed historical testing data. Our current systems did not have the required retention for this request. The Big Data team was able to pull and analyze all the required data from the Big Data Platform in minutes, as opposed to 3+ weeks that we used to take to pull the data from multiple systems and off tape archive. Legacy Systems Enterprise Data Lake Retention 3-6 months scattered DBs & tape archive. 100% data online in enterprise data lake for 3+ years. Coverage Summaries, samples for several data sets. 100% parametric data captured in raw form. Analysis Reactive, missed quality improvement opportunities. Daily dashboard with larger data sets to support proactive improvements.
18.
18 © 2015
Think Big, a Teradata Company Enterprise Data Lake Information Sources Evaluate Source Data Ingest Collect & Manage Metadata Apply Structure Sequence Compress Automate Protect Prepare Data for Ingest Prepare Source Metadata Perimeter-Authentication-Authorization InfoSec Downstream Applications Dashboard Engine Think Big Enterprise Data Lake Industry Leading Assets, Services and Methodology
19.
19
20.
20 © 2015
Think Big, a Teradata Company • Think Big is expanding, bringing its focus on open source consulting to the international region • An office in the UK’s London Bridge Business district will serve as its international hub • Think Big is aggressively hiring a team of data engineers, data scientists, technology project managers and sales leaders • Rick Farnell, Think Big co-founder and SVP, International will lead the international practice London Bridge Business District Photo credit: Duncan Harris. Courtesy of Flickr. Creative Commons Think Big International Expansion
21.
21 © 2015
Think Big, a Teradata Company Dublin, Ireland Munich, Germany Mumbai, India Think Big International Expansion: Phase 1 Think Big International Hub London, England Photo credits: London (Duncan Harris). Dublin (Guiseppe Milo), Munich (John Morgan), Mumbai (McKay Savage). Courtesy of Flickr. Creative Commons
22.
22
23.
23 Dashboard Engine for
Hadoop Fast Access for Comprehensive Historical Data Stored in Hadoop “Right” time data Latencies under a second Scales easily for thousands of simultaneous users Reporting, Visualization and Analytics © 2015 Think Big, a Teradata Company Visit our Think Big Booth to see our Dashboard Engine Demo. Enterprise Data Lake Information Sources Downstream Applications
24.
24 © 2015
Think Big, a Teradata Company Thank you Rick Farnell Co-Founder and SVP International, Think Big Rick.Farnell@thinkbiganalytics.com
Notas del editor
Think Big Americas Office Locations Mountain View Chicago Salt Lake City Boston New York