SlideShare una empresa de Scribd logo
1 de 33
Big Data Marketing in the
  AWS Cloud: Improving
Cross-Media Effectiveness
Welcome

       Sheri Sullivan
  Senior Marketing Manager
    Global SI Ecosystem
   Amazon Web Services
Webinar Overview
•   Submit Your Questions using the Q/A tool.
•   A copy of today’s presentation will be made available on:
    • AWS SlideShare Channel@
       http://www.slideshare.net/AmazonWebServices/
    • AWS YouTube Channel@
       http://www.youtube.com/user/AmazonWebServices


         Special Note: Today’s Webinar is being recorded.
What We’ll Cover
• Intro to AWS Database and Big Data Services
• Customer Use Cases and Solutions
• Delivering Cross-Media Analytics
• MarketShare Planner Platform
John Gannon
    AWS Business
Development Manager
 jgannon@amazon.com
Big Data and Databases on AWS
  Managed services designed to reduce administration, accelerate
  deployment, and minimize the cost of analysis and experimentation

            DynamoDB
            Schema-less data store that enables fast deployment of new applications
            without the burden of database administration


             Relational Database Service (RDS)
             Manage existing database applications without the effort required to
             provision, upgrade, backup and scale highly available instances

             ElastiCache
             Accelerate data retrieval performance by caching data in memory and
             avoiding slower disk-based systems


            Elastic MapReduce (EMR)
            Hadoop-based infrastructure service enabling the parallel processing of
            massive amounts of data
Amazon Relational Database
           Service
RDS is a fully managed Relational database service that is
simple to deploy, easy to scale, reliable and cost-effective

                                         Choice of Database Engines

                                           Fully Managed Service

                                           Push Button Scalability

                                        Fault Tolerance with Multi-AZ

                                        Works with EC2 & ElastiCache
Amazon DynamoDB
DynamoDB is a fully managed NoSQL database
service that provides extremely fast and
predictable performance with seamless scalability
                                 Authors of NoSQL


                                 Zero Administration


                                 Low Latency SSD’s


                                 Unlimited Potential
                               Storage and Throughput
AMAZON ELASTIC MAPREDUCE
 Reduces complexity & cost of Hadoop Management
 Integrates with AWS Services and 3rd Party vendors
 Highly customizable
Operated 2 million+ Hadoop
    clusters last year
Amazon EMR is the #1
       Enterprise Hadoop Solution
AWS is “the most
prominent Hadoop cloud
service provider” and
“leads the pack (of
Leaders) due to its
proven, feature-rich Elastic
MapReduce service…”

-The Forrester Wave™:
Enterprise Hadoop
Solutions Q1 2012
Success Story
Business Challenge
   Needed a real-time analytics tool to determine dynamic live event pricing during the
   ticket sales life cycle
   Optimize event ticket pricing, improve yield management & generate incremental
   revenue

AWS Services




           Elastic Load                     Amazon Elastic
                          Amazon SimpleDB                     Amazon Simple
            Balancer                         MapReduce                             Amazon CloudWatch
                                                             Email Service (SES)
Business Benefits
   Ease of use, reducing developers’ infrastructure management time by 3 hours per day
   Estimated 80% cost reduction annually, compared to fixed service costs
Anupam Singh
   MarketShare
  VP, Technology
asingh@marketshare.com
Elastic Data Management
Multi-Cluster, Elastic, Failure
          Resistant
Who we are
                                            MarketShare                           MarketShare
                                             Planner™                               Price™

    The global marketer partner of choice   MarketShare                           MarketShare
    for understanding, optimizing and          360™                               Optimizer™


    driving revenue                              MarketShare Platform
                                            Cloud modeling | Saas infrastructure | Data
                                                            connectors




• Recognized industry leader
                                                       Risky           Strong


•
                                                       Bets Contenders Performers   Leaders

  Cloud-based software solutions             Strong




• Over half the Fortune 100
• Strong media and agency                   Current
                                            Offering


  partnerships
• Global presence
                                             Weak
                                                      Weak             Strategy               Strong
Terabytes per                                                              1000+ variables
   customer
                                    Data
                                  Architect
Client Data
                 ETL                                  Reportin              Modeling
                                                         g




                                                                                                  Sim-Opt

FTP
                 Scale Complex Modeling                                                Simulation Engineer




                       Modeling       Sim-Opts   Tool Stack   Production
                        Stack           Stack      Tables       Tables
                        Tables         Tables                                    Application



                                                                                                Modeler


100+ Customers                                                             100+ data sources
Brand                    Product
                   Earned media
                      ETL                     Organic search               Reporting                                 Modeling
                                                                                 Innovation
                                                                                           Quality                  Events
                                                                                              Conferences
                                                                                                                                                        Controllable
                                                                    Bing
                                        WOM                Google                                     Trade shows
                                                                                                                               Sales
                                                 Blogs
                      Social media            Twitter                        Awareness                              Training
      Owned           PR
                                        Facebook                                                                                         Service
                                                                                                                       Support
      media                        Commerce
                                                                                                                                                 Simulatio
                  Website           Content                                  Consideration                                Displays
FTP                                                                                                                                                 n
                                                                                                                          Shelf space            In store
                                   Google
               Paid Search             Bing                                                                                Discounts
                                                                               Purchase                                     Bundles
                                Banner Ads
                                                                                                                           Coupons         Promotions
                 Display         Video Ads

                                   Magazine                                                                            Offering

                     Print          Newspaper
                                                                                                                     Pricing         Competition
                                                TV
                                                                                                                                     Applicati
                                               Radio
                                                                                                                                       on
                           Broadcast                 Signs
                                                                                                               Interest
                                                                                                                               Seasonality
                                                      Digital
                                                                                                                 rates                                      Non-
                                                                                                     Stock market
                                                      signage Catalog Direct              Mobile                                                            controllable
                                                                      mail        email
              Paid media                                                                                       Economy
                                          Outdoor
                                                                                Direct
ETL   Reporting   Modeling




                                         Simulation
FTP




                           Application
ETL   Reporting   Modeling




                                         Simulation
FTP




                           Application
ETL   Reporting   Modeling




                                         Simulatio
FTP                                         n




                             Applicati
                               on
ETL   Reporting   Modeling




                                     Simulation
FTP




                           Application
Many applications in
         production


Marketing Efficiency                     Attribution




                       Dynamic Pricing
The Technology That Makes
             It Possible
Elastic Cloud™                  AWS
                                         Amazon EC2               Amazon EC2
                                      Permanent Instances     On-Demand Instances

                                        EC2        EC2                 Amazon
                                      Instance   Instance        Elastic MapReduce
                 Elastic Load
                  Balancer
                                       Web         App
                                      Server      Server




                                AWS
                                         Amazon EC2                 Amazon
                                      Permanent Instances        Managed Storage

                                        EC2        EC2      RDS Database   Amazon Simple
                                      Instance   Instance     Instance     Storage Service
                                                                                (S3)
                                       Web         App
                                       Serve      Serve
                                         r          r
Giant Hadoop cluster
ISSUE   1
               Overwhelmed for small periods
               Unused for large periods
Partition the data pipeline
SOLUTION   1
                  Identify independent data sources
                  Redesign ETL for independent stages
Cluster proliferation
ISSUE    2
                Manual bring up and tear down of clusters
                Dramatic increase in maintenance costs
Cluster proliferation
SOLUTION         2
                          Use Elastic Map Reduce
                          Dynamically change the size of cluster based on:
                             Volume of data & Historical performance




       Amazon EC2                          Amazon EC2              Amazon EC2
   On-Demand Instances                 On-Demand Instances     On-Demand Instances
            Amazon                              Amazon                  Amazon
      Elastic MapReduce                   Elastic MapReduce       Elastic MapReduce
Too many failure points
     ISSUE         3                                             Amazon EC2
                                                             On-Demand Instances
                                                                       Amazon
                                                                 Elastic MapReduce


   Jobs fail after ~90%
    completion
                                 Amazon
   Rerunning costs $$$       Managed Storage
                                        Amazon Simple
                                      Storage Service (S3)




         Amazon EC2             Amazon EC2                       Amazon EC2
                                                                  Amazon EC2
     On-Demand Instances    On-Demand Instances               On-Demand Instances
                                                             On-Demand Instances
              Amazon                 Amazon                             Amazon
                                                                       Amazon
        Elastic MapReduce      Elastic MapReduce                  Elastic MapReduce
                                                                 Elastic MapReduce
Invent technology for partial restarts
SOLUTION           3                                             Amazon EC2
                                                             On-Demand Instances
                                                                      Amazon
                                                                Elastic MapReduce


   Collect job stats
    obsessively
                                 Amazon
                              Managed Storage
   Restart based on                    Amazon Simple
    patented technology               Storage Service (S3)

    called PauseNPlayTM




         Amazon EC2             Amazon EC2                       Amazon EC2
     On-Demand Instances    On-Demand Instances              On-Demand Instances
              Amazon                 Amazon                            Amazon
        Elastic MapReduce      Elastic MapReduce                 Elastic MapReduce
Summary
Design your data pipeline for a multi-cluster environment
 • Write Configurable ETL to become independent, partitioned
   workflows
 • A cluster that stays up the entire month is not elastic 
Save your intermediate results in low cost storage
 • Think about compression
 • Do not underestimate schema complexity
Loosely coupled architecture has failure points
 • Save state obsessively
 • Build restart-ability into your architecture
Programs to help you get started
         with Big Data on AWS

        Big Data
                                         EMR
        Discovery                                            EMR Training
                                       Bootcamp
        Workshop



Identify and prioritize target   Deploy a sample use case    3 day intensive
     Big Data use cases           with real customer data   developer training
EMR Training Schedule
•   Los Angeles, CA – 10/16-10/18
•   Boston, MA – 10/30-11/1
•   Mountain View, CA – 11/13-11/15
•   Dallas, TX – 11/27-11/29
•   New York, NY – 12/11-12/13

Visit http://bit.ly/AWS_EMR_Training for class details and registration
Questions?

Contact:

William Merchan
VP, Business Development
MarketShare
wmerchan@marketshare.com

John Gannon
Business Development Manager, AWS
jgannon@amazon.com

Más contenido relacionado

La actualidad más candente

Develop Composite Business Services To Enable Reuse In A Service Orien...
Develop  Composite  Business  Services To  Enable  Reuse In A  Service  Orien...Develop  Composite  Business  Services To  Enable  Reuse In A  Service  Orien...
Develop Composite Business Services To Enable Reuse In A Service Orien...Kirill Osipov
 
[Cloud Summit 2010] Cezar Taurion - IBM
[Cloud Summit 2010] Cezar Taurion - IBM[Cloud Summit 2010] Cezar Taurion - IBM
[Cloud Summit 2010] Cezar Taurion - IBMTecla Internet
 
IBM Watson vs. Your Data Center
IBM Watson vs. Your Data CenterIBM Watson vs. Your Data Center
IBM Watson vs. Your Data CenterHerb Hernandez
 
Democratization of BI
Democratization of BIDemocratization of BI
Democratization of BIMAIA_1KEY
 
Cloud is Transforming the Enterprise
Cloud is Transforming the EnterpriseCloud is Transforming the Enterprise
Cloud is Transforming the Enterpriseshawnnewman
 
Introducing the SAP high-performance analytic appliance (SAP HANA)
Introducing the SAP high-performance analytic appliance (SAP HANA)Introducing the SAP high-performance analytic appliance (SAP HANA)
Introducing the SAP high-performance analytic appliance (SAP HANA)IBM India Smarter Computing
 
[201] salesforce for power user day 1
[201] salesforce for power user   day 1[201] salesforce for power user   day 1
[201] salesforce for power user day 1Amigo 陳兆祥
 
Technology in support of utilities challenges
Technology in support of utilities challengesTechnology in support of utilities challenges
Technology in support of utilities challengesAitor Ibañez
 
Product information in organizations practical solutions
Product information in organizations   practical solutionsProduct information in organizations   practical solutions
Product information in organizations practical solutionsContribyte
 
Datawarehouse på System z (IBM Systems z)
Datawarehouse på System z (IBM Systems z)Datawarehouse på System z (IBM Systems z)
Datawarehouse på System z (IBM Systems z)IBM Danmark
 
Enterprise resource planning
Enterprise resource planningEnterprise resource planning
Enterprise resource planningjuliangoal
 
Big Data Analytics in a Heterogeneous World - Joydeep Das of Sybase
Big Data Analytics in a Heterogeneous World - Joydeep Das of SybaseBig Data Analytics in a Heterogeneous World - Joydeep Das of Sybase
Big Data Analytics in a Heterogeneous World - Joydeep Das of SybaseBigDataCloud
 

La actualidad más candente (19)

Develop Composite Business Services To Enable Reuse In A Service Orien...
Develop  Composite  Business  Services To  Enable  Reuse In A  Service  Orien...Develop  Composite  Business  Services To  Enable  Reuse In A  Service  Orien...
Develop Composite Business Services To Enable Reuse In A Service Orien...
 
[Cloud Summit 2010] Cezar Taurion - IBM
[Cloud Summit 2010] Cezar Taurion - IBM[Cloud Summit 2010] Cezar Taurion - IBM
[Cloud Summit 2010] Cezar Taurion - IBM
 
IBM Watson vs. Your Data Center
IBM Watson vs. Your Data CenterIBM Watson vs. Your Data Center
IBM Watson vs. Your Data Center
 
Democratization of BI
Democratization of BIDemocratization of BI
Democratization of BI
 
Cloud is Transforming the Enterprise
Cloud is Transforming the EnterpriseCloud is Transforming the Enterprise
Cloud is Transforming the Enterprise
 
Masrtjack eXchange
Masrtjack eXchangeMasrtjack eXchange
Masrtjack eXchange
 
Introducing the SAP high-performance analytic appliance (SAP HANA)
Introducing the SAP high-performance analytic appliance (SAP HANA)Introducing the SAP high-performance analytic appliance (SAP HANA)
Introducing the SAP high-performance analytic appliance (SAP HANA)
 
[201] salesforce for power user day 1
[201] salesforce for power user   day 1[201] salesforce for power user   day 1
[201] salesforce for power user day 1
 
Technology in support of utilities challenges
Technology in support of utilities challengesTechnology in support of utilities challenges
Technology in support of utilities challenges
 
RunITbiz For Cio100
RunITbiz For Cio100RunITbiz For Cio100
RunITbiz For Cio100
 
2012 06 hortonworks paris hug
2012 06 hortonworks paris hug2012 06 hortonworks paris hug
2012 06 hortonworks paris hug
 
Product information in organizations practical solutions
Product information in organizations   practical solutionsProduct information in organizations   practical solutions
Product information in organizations practical solutions
 
Oracle BI Server By AORTA
Oracle BI Server By AORTAOracle BI Server By AORTA
Oracle BI Server By AORTA
 
Power Investment Tools
Power Investment ToolsPower Investment Tools
Power Investment Tools
 
Datawarehouse på System z (IBM Systems z)
Datawarehouse på System z (IBM Systems z)Datawarehouse på System z (IBM Systems z)
Datawarehouse på System z (IBM Systems z)
 
Enterprise resource planning
Enterprise resource planningEnterprise resource planning
Enterprise resource planning
 
Big Data Analytics in a Heterogeneous World - Joydeep Das of Sybase
Big Data Analytics in a Heterogeneous World - Joydeep Das of SybaseBig Data Analytics in a Heterogeneous World - Joydeep Das of Sybase
Big Data Analytics in a Heterogeneous World - Joydeep Das of Sybase
 
SQL Server: Data Mining
SQL Server: Data MiningSQL Server: Data Mining
SQL Server: Data Mining
 
Customer Relationship Management
Customer Relationship ManagementCustomer Relationship Management
Customer Relationship Management
 

Similar a Improve cross-media effectiveness with big data marketing in AWS cloud

IBM Cognos - IBM informations-integration för IBM Cognos användare
IBM Cognos - IBM informations-integration för IBM Cognos användareIBM Cognos - IBM informations-integration för IBM Cognos användare
IBM Cognos - IBM informations-integration för IBM Cognos användareIBM Sverige
 
[Webinar] Drawing insights from social media
[Webinar] Drawing insights from social media[Webinar] Drawing insights from social media
[Webinar] Drawing insights from social mediaScupSocial
 
Sage ERPX3 "A vision for growth"
Sage ERPX3 "A vision for growth"Sage ERPX3 "A vision for growth"
Sage ERPX3 "A vision for growth"Sage España
 
Healthcare cio summit dallas feb 2013
Healthcare cio summit dallas feb 2013Healthcare cio summit dallas feb 2013
Healthcare cio summit dallas feb 2013Shyam Desigan
 
Perfect Storm: HR in the Cloud
Perfect Storm: HR in the CloudPerfect Storm: HR in the Cloud
Perfect Storm: HR in the CloudStanton Jones
 
Salesforce Solution For Software Industry
Salesforce Solution For Software IndustrySalesforce Solution For Software Industry
Salesforce Solution For Software Industrykdwangxi
 
Information Management: Answering Today’s Enterprise Challenge
Information Management: Answering Today’s Enterprise ChallengeInformation Management: Answering Today’s Enterprise Challenge
Information Management: Answering Today’s Enterprise ChallengeBob Rhubart
 
Track 3 Session 2_從傳統 legacy 邁向數位化與現代化架構
Track 3 Session 2_從傳統  legacy  邁向數位化與現代化架構Track 3 Session 2_從傳統  legacy  邁向數位化與現代化架構
Track 3 Session 2_從傳統 legacy 邁向數位化與現代化架構Amazon Web Services
 
Practical Approach to Data Maintenance in for PLM in Oracle EBS
Practical Approach to Data Maintenance in for PLM in Oracle EBSPractical Approach to Data Maintenance in for PLM in Oracle EBS
Practical Approach to Data Maintenance in for PLM in Oracle EBSSamsung Electronics
 
Saleseffectivity and business intelligence
Saleseffectivity and business intelligenceSaleseffectivity and business intelligence
Saleseffectivity and business intelligencemarekdan
 
Striving for an Outstanding IT Organization
Striving for an Outstanding IT OrganizationStriving for an Outstanding IT Organization
Striving for an Outstanding IT OrganizationHuberto Garza
 
Automobile industry group5
Automobile industry group5Automobile industry group5
Automobile industry group5mbaslides
 
Dynamics Day '11 - Dynamics CRM Update and Roadmap
Dynamics Day '11 - Dynamics CRM Update and RoadmapDynamics Day '11 - Dynamics CRM Update and Roadmap
Dynamics Day '11 - Dynamics CRM Update and RoadmapIntergen
 
Website Design and Development
Website Design and DevelopmentWebsite Design and Development
Website Design and DevelopmentGaurav Kumar
 
AWS Empowering Digital Marketing - September 2013
AWS Empowering Digital Marketing - September 2013AWS Empowering Digital Marketing - September 2013
AWS Empowering Digital Marketing - September 2013Amazon Web Services
 
Intergen - Dynamics CRM Roadmap and Social Media
Intergen - Dynamics CRM Roadmap and Social MediaIntergen - Dynamics CRM Roadmap and Social Media
Intergen - Dynamics CRM Roadmap and Social MediaIntergen
 
TeleManagement Forum OSSera Case Study - AIS Thailand Service Manager Present...
TeleManagement Forum OSSera Case Study - AIS Thailand Service Manager Present...TeleManagement Forum OSSera Case Study - AIS Thailand Service Manager Present...
TeleManagement Forum OSSera Case Study - AIS Thailand Service Manager Present...Mingxia Zhang, Ph.D.
 

Similar a Improve cross-media effectiveness with big data marketing in AWS cloud (20)

IBM Cognos - IBM informations-integration för IBM Cognos användare
IBM Cognos - IBM informations-integration för IBM Cognos användareIBM Cognos - IBM informations-integration för IBM Cognos användare
IBM Cognos - IBM informations-integration för IBM Cognos användare
 
[Webinar] Drawing insights from social media
[Webinar] Drawing insights from social media[Webinar] Drawing insights from social media
[Webinar] Drawing insights from social media
 
Sage ERPX3 "A vision for growth"
Sage ERPX3 "A vision for growth"Sage ERPX3 "A vision for growth"
Sage ERPX3 "A vision for growth"
 
Healthcare cio summit dallas feb 2013
Healthcare cio summit dallas feb 2013Healthcare cio summit dallas feb 2013
Healthcare cio summit dallas feb 2013
 
Extending the reach of your Microsoft Dynamics AX Application with the next-g...
Extending the reach of your Microsoft Dynamics AX Application with the next-g...Extending the reach of your Microsoft Dynamics AX Application with the next-g...
Extending the reach of your Microsoft Dynamics AX Application with the next-g...
 
Keynote Day 1 2009
Keynote Day 1 2009Keynote Day 1 2009
Keynote Day 1 2009
 
Perfect Storm: HR in the Cloud
Perfect Storm: HR in the CloudPerfect Storm: HR in the Cloud
Perfect Storm: HR in the Cloud
 
Salesforce Solution For Software Industry
Salesforce Solution For Software IndustrySalesforce Solution For Software Industry
Salesforce Solution For Software Industry
 
Information Management: Answering Today’s Enterprise Challenge
Information Management: Answering Today’s Enterprise ChallengeInformation Management: Answering Today’s Enterprise Challenge
Information Management: Answering Today’s Enterprise Challenge
 
Track 3 Session 2_從傳統 legacy 邁向數位化與現代化架構
Track 3 Session 2_從傳統  legacy  邁向數位化與現代化架構Track 3 Session 2_從傳統  legacy  邁向數位化與現代化架構
Track 3 Session 2_從傳統 legacy 邁向數位化與現代化架構
 
Practical Approach to Data Maintenance in for PLM in Oracle EBS
Practical Approach to Data Maintenance in for PLM in Oracle EBSPractical Approach to Data Maintenance in for PLM in Oracle EBS
Practical Approach to Data Maintenance in for PLM in Oracle EBS
 
Saleseffectivity and business intelligence
Saleseffectivity and business intelligenceSaleseffectivity and business intelligence
Saleseffectivity and business intelligence
 
Striving for an Outstanding IT Organization
Striving for an Outstanding IT OrganizationStriving for an Outstanding IT Organization
Striving for an Outstanding IT Organization
 
Automobile industry group5
Automobile industry group5Automobile industry group5
Automobile industry group5
 
Enterprise Services Solutions
Enterprise Services SolutionsEnterprise Services Solutions
Enterprise Services Solutions
 
Dynamics Day '11 - Dynamics CRM Update and Roadmap
Dynamics Day '11 - Dynamics CRM Update and RoadmapDynamics Day '11 - Dynamics CRM Update and Roadmap
Dynamics Day '11 - Dynamics CRM Update and Roadmap
 
Website Design and Development
Website Design and DevelopmentWebsite Design and Development
Website Design and Development
 
AWS Empowering Digital Marketing - September 2013
AWS Empowering Digital Marketing - September 2013AWS Empowering Digital Marketing - September 2013
AWS Empowering Digital Marketing - September 2013
 
Intergen - Dynamics CRM Roadmap and Social Media
Intergen - Dynamics CRM Roadmap and Social MediaIntergen - Dynamics CRM Roadmap and Social Media
Intergen - Dynamics CRM Roadmap and Social Media
 
TeleManagement Forum OSSera Case Study - AIS Thailand Service Manager Present...
TeleManagement Forum OSSera Case Study - AIS Thailand Service Manager Present...TeleManagement Forum OSSera Case Study - AIS Thailand Service Manager Present...
TeleManagement Forum OSSera Case Study - AIS Thailand Service Manager Present...
 

Más de Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

Más de Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Improve cross-media effectiveness with big data marketing in AWS cloud

  • 1. Big Data Marketing in the AWS Cloud: Improving Cross-Media Effectiveness
  • 2. Welcome Sheri Sullivan Senior Marketing Manager Global SI Ecosystem Amazon Web Services
  • 3. Webinar Overview • Submit Your Questions using the Q/A tool. • A copy of today’s presentation will be made available on: • AWS SlideShare Channel@ http://www.slideshare.net/AmazonWebServices/ • AWS YouTube Channel@ http://www.youtube.com/user/AmazonWebServices Special Note: Today’s Webinar is being recorded.
  • 4. What We’ll Cover • Intro to AWS Database and Big Data Services • Customer Use Cases and Solutions • Delivering Cross-Media Analytics • MarketShare Planner Platform
  • 5. John Gannon AWS Business Development Manager jgannon@amazon.com
  • 6. Big Data and Databases on AWS Managed services designed to reduce administration, accelerate deployment, and minimize the cost of analysis and experimentation DynamoDB Schema-less data store that enables fast deployment of new applications without the burden of database administration Relational Database Service (RDS) Manage existing database applications without the effort required to provision, upgrade, backup and scale highly available instances ElastiCache Accelerate data retrieval performance by caching data in memory and avoiding slower disk-based systems Elastic MapReduce (EMR) Hadoop-based infrastructure service enabling the parallel processing of massive amounts of data
  • 7. Amazon Relational Database Service RDS is a fully managed Relational database service that is simple to deploy, easy to scale, reliable and cost-effective Choice of Database Engines Fully Managed Service Push Button Scalability Fault Tolerance with Multi-AZ Works with EC2 & ElastiCache
  • 8. Amazon DynamoDB DynamoDB is a fully managed NoSQL database service that provides extremely fast and predictable performance with seamless scalability Authors of NoSQL Zero Administration Low Latency SSD’s Unlimited Potential Storage and Throughput
  • 9. AMAZON ELASTIC MAPREDUCE Reduces complexity & cost of Hadoop Management Integrates with AWS Services and 3rd Party vendors Highly customizable
  • 10. Operated 2 million+ Hadoop clusters last year
  • 11. Amazon EMR is the #1 Enterprise Hadoop Solution AWS is “the most prominent Hadoop cloud service provider” and “leads the pack (of Leaders) due to its proven, feature-rich Elastic MapReduce service…” -The Forrester Wave™: Enterprise Hadoop Solutions Q1 2012
  • 12. Success Story Business Challenge Needed a real-time analytics tool to determine dynamic live event pricing during the ticket sales life cycle Optimize event ticket pricing, improve yield management & generate incremental revenue AWS Services Elastic Load Amazon Elastic Amazon SimpleDB Amazon Simple Balancer MapReduce Amazon CloudWatch Email Service (SES) Business Benefits Ease of use, reducing developers’ infrastructure management time by 3 hours per day Estimated 80% cost reduction annually, compared to fixed service costs
  • 13. Anupam Singh MarketShare VP, Technology asingh@marketshare.com
  • 14. Elastic Data Management Multi-Cluster, Elastic, Failure Resistant
  • 15. Who we are MarketShare MarketShare Planner™ Price™ The global marketer partner of choice MarketShare MarketShare for understanding, optimizing and 360™ Optimizer™ driving revenue MarketShare Platform Cloud modeling | Saas infrastructure | Data connectors • Recognized industry leader Risky Strong • Bets Contenders Performers Leaders Cloud-based software solutions Strong • Over half the Fortune 100 • Strong media and agency Current Offering partnerships • Global presence Weak Weak Strategy Strong
  • 16. Terabytes per 1000+ variables customer Data Architect Client Data ETL Reportin Modeling g Sim-Opt FTP Scale Complex Modeling Simulation Engineer Modeling Sim-Opts Tool Stack Production Stack Stack Tables Tables Tables Tables Application Modeler 100+ Customers 100+ data sources
  • 17. Brand Product Earned media ETL Organic search Reporting Modeling Innovation Quality Events Conferences Controllable Bing WOM Google Trade shows Sales Blogs Social media Twitter Awareness Training Owned PR Facebook Service Support media Commerce Simulatio Website Content Consideration Displays FTP n Shelf space In store Google Paid Search Bing Discounts Purchase Bundles Banner Ads Coupons Promotions Display Video Ads Magazine Offering Print Newspaper Pricing Competition TV Applicati Radio on Broadcast Signs Interest Seasonality Digital rates Non- Stock market signage Catalog Direct Mobile controllable mail email Paid media Economy Outdoor Direct
  • 18. ETL Reporting Modeling Simulation FTP Application
  • 19. ETL Reporting Modeling Simulation FTP Application
  • 20. ETL Reporting Modeling Simulatio FTP n Applicati on
  • 21. ETL Reporting Modeling Simulation FTP Application
  • 22. Many applications in production Marketing Efficiency Attribution Dynamic Pricing
  • 23. The Technology That Makes It Possible Elastic Cloud™ AWS Amazon EC2 Amazon EC2 Permanent Instances On-Demand Instances EC2 EC2 Amazon Instance Instance Elastic MapReduce Elastic Load Balancer Web App Server Server AWS Amazon EC2 Amazon Permanent Instances Managed Storage EC2 EC2 RDS Database Amazon Simple Instance Instance Instance Storage Service (S3) Web App Serve Serve r r
  • 24. Giant Hadoop cluster ISSUE 1  Overwhelmed for small periods  Unused for large periods
  • 25. Partition the data pipeline SOLUTION 1  Identify independent data sources  Redesign ETL for independent stages
  • 26. Cluster proliferation ISSUE 2  Manual bring up and tear down of clusters  Dramatic increase in maintenance costs
  • 27. Cluster proliferation SOLUTION 2 Use Elastic Map Reduce Dynamically change the size of cluster based on:  Volume of data & Historical performance Amazon EC2 Amazon EC2 Amazon EC2 On-Demand Instances On-Demand Instances On-Demand Instances Amazon Amazon Amazon Elastic MapReduce Elastic MapReduce Elastic MapReduce
  • 28. Too many failure points ISSUE 3 Amazon EC2 On-Demand Instances Amazon Elastic MapReduce  Jobs fail after ~90% completion Amazon  Rerunning costs $$$ Managed Storage Amazon Simple Storage Service (S3) Amazon EC2 Amazon EC2 Amazon EC2 Amazon EC2 On-Demand Instances On-Demand Instances On-Demand Instances On-Demand Instances Amazon Amazon Amazon Amazon Elastic MapReduce Elastic MapReduce Elastic MapReduce Elastic MapReduce
  • 29. Invent technology for partial restarts SOLUTION 3 Amazon EC2 On-Demand Instances Amazon Elastic MapReduce  Collect job stats obsessively Amazon Managed Storage  Restart based on Amazon Simple patented technology Storage Service (S3) called PauseNPlayTM Amazon EC2 Amazon EC2 Amazon EC2 On-Demand Instances On-Demand Instances On-Demand Instances Amazon Amazon Amazon Elastic MapReduce Elastic MapReduce Elastic MapReduce
  • 30. Summary Design your data pipeline for a multi-cluster environment • Write Configurable ETL to become independent, partitioned workflows • A cluster that stays up the entire month is not elastic  Save your intermediate results in low cost storage • Think about compression • Do not underestimate schema complexity Loosely coupled architecture has failure points • Save state obsessively • Build restart-ability into your architecture
  • 31. Programs to help you get started with Big Data on AWS Big Data EMR Discovery EMR Training Bootcamp Workshop Identify and prioritize target Deploy a sample use case 3 day intensive Big Data use cases with real customer data developer training
  • 32. EMR Training Schedule • Los Angeles, CA – 10/16-10/18 • Boston, MA – 10/30-11/1 • Mountain View, CA – 11/13-11/15 • Dallas, TX – 11/27-11/29 • New York, NY – 12/11-12/13 Visit http://bit.ly/AWS_EMR_Training for class details and registration
  • 33. Questions? Contact: William Merchan VP, Business Development MarketShare wmerchan@marketshare.com John Gannon Business Development Manager, AWS jgannon@amazon.com

Notas del editor

  1. We’ve been operating the service for over 3 years now and in the last year alone we’ve operated over 2 MILLIONHadoop clusters
  2. Forrester wave report named Amazon EMR the #1 enterprise hadoop solution because of it’s integration with various data stores, it’s ecosystem of vendors and the number of customers the service supports.
  3. Hi, my name is Anupam Singh. I am the Vice President of Technology at MarketShare.
  4. MarketShare builds solutions for marketing organizations at Fortune 100 companies. Our customers provide us data and we provide a cloud based analytic applications to improve the efficiency of our customer’s marketing.
  5. So, what are the big challenges that we face? Our entire business is based on scaling complex data modeling. Our scaling challenges are across 4 major dimensions. Each customer has 10s of terabytes of data. The data comes from hundreds of data sources. This data has thousands of variables to analyze. And we need to do this for hundreds of customers. Let us look at the various stages to build a solution that scales.
  6. The first stage is bringing the data together. Today’s marketing organization is faced with hundreds of data sources. Consider this picture where we bring together data from the customer’s website, the advertising logs from their vendors, revenue data from the ERP systems, variables like Seasonality & Economy. As you can see, we have to gather more than 40 data sources in this single picture. Just managing the storage for daily, weekly and monthly updates is a challenge.
  7. A lot of this data is machine generated. And it is not ready for analytics. Each data source has to be scrubbed and cleaned through an ETL pipeline before doing analytics. Our ETL pipelines have 20-30 main stages with 100s of sub-stages. Scheduling these and correcting data errors is one of our biggest technical challenges. We will dive deeper into this later. Once the data has been cleaned, it is ready for analytics.
  8. Many of our customers have never seen these data sources in a single dashboard. Even before running the data through our proprietary modeling platform, we can help our customers get dashboards on previous data black holes.
  9. The term data scientist has been in vogue lately. At MarketShare, we have a large team of modelers who run modeling on the cloud. As the data has been cleaned up, the modelers run thousands of different equations. Many analytic applications stop their cloud usage at reporting. At MarketShare, we believe that reporting is not enough to answer the questions. Building a predictive model is key to answering business questions on terabytes of data. We use the cloud to build custom models for each one of our customers. We use the power of distributed systems to validate these models for accuracy.
  10. Once the models have been prepared, they are deployed in an easy to use application. It should be noted that reducing big data should not mean that the user is lost in a forest of reports. At MarketShare, we believe in simplifying access to Big Data. We hide the model complexity behind easy to use applications that let our users build many different scenarios for their business.
  11. So, what does all this give our customers? We have been able to release many different applications on top of this analytics pipeline. The first one is marketing efficiency. The second application is Attribution. The third one is Dynamic Pricing.
  12. So, what makes this pipeline run? Our entire analytics workflow is built using various services from Amazon as building blocks. Our applications are deployed behind the elastic load balancer service. The data is stored in Storage services like S3, RDS and we are trying out Dynamo DB. Our analytics jobs are executed on dynamic clusters provided by elastic map reduce.
  13. So, let us quickly go under the hood. 3 years ago, we started with a hadoop cluster to store all our data. Very quickly we noticed two important things with the cluster. The first observation is that however big we made the cluster, jobs kept running into each other. Try as we might, the cluster would get hot for some time when many different stages would start executing at the same time. The second observation was how unused the cluster was for large periods of our time. So, while we are spending a lot of dollars on this large cluster, our customers are still unhappy with the response times!
  14. So, what was our solution? We rewrote our entire data pipeline to run many different clusters. So,
  15. Big Data Discovery WorkshopBrainstorm pilot use casesIdentify data sources and formatsReview business and financial driversRecommended use casesRoadmap for data migration and production rolloutReference architectureEstimated pilot costNext stepsEMR BootcampInteractive onsite workshop (is not classroom training)Work w/customer to architect, install, and config EMRRun and debug production job flowsCustomer’s dataset(s) must be on S3