Introduction to Mechanical Turk
Artificial Artificial Intelligence




AWS User Group Berlin
Thomas Metschke
25.03.2010
Peritor GmbH
Amazon Mechanical Turk
is a marketplace for work.




Mechanical Turk Marketplace



 400,000+ Workers
 In 100+ Countries
 Available 24/7
 Programmatically
  Accessible
                         http://www.flickr.com/photos/diamond_rain/2543837414/




                                                                                 3
So there are basically two roles

• Workers (image: http://www.flickr.com/photos/saad/1968774)
• Requesters (image: http://www.flickr.com/photos/chicagobart/4181533461)

Mechanical Turk as a Worker

Workers

• Make money by working on Human Intelligence Tasks (HITs)
• Workers can work from home and choose their own work hours

Image: http://www.flickr.com/photos/saad/1968774

Your Dashboard

The dashboard shows:

• The number of available tasks
• Total earnings and bonuses
• HIT status and totals

How do I get the money?



• U.S. bank account
• Amazon gift certificate
• Bank checks in rupees

Mechanical Turk as a Requester

Requesters

• Have access to a global, on-demand, 24 x 7 workforce
• Can get thousands of HITs completed in minutes
• Pay only when they are satisfied with the results

Image: http://www.flickr.com/photos/chicagobart/4181533461

Requesting HITs




1. Requesters
   • define and create your HITs
   • load HITs to Mechanical Turk

2. Workers
   • work on your HITs
   • submit results

3. Requesters
   • approve and pay for completed HITs
   • use the results

Design HITs




               Enter Properties
               Design Layout



Design HITs - faster




Bring in a developer and use
• CSV files
• SOAP / REST or
• Amazon Mechanical Turk developer tools




What would it look like?


 http://mechanicalturk.amazonaws.com/
          ?Service=AWSMechanicalTurkRequester
          &AWSAccessKeyId=[the Requester's Access Key ID]
          &Version=2008-08-02
          &Operation=CreateHIT
          &Signature=[signature for this request]
          &Timestamp=[your system's local time]
          &Title=Location%20and%20Photograph%20Identification
          &Description=Select%20the%20image%20that%20best%20represents
          &Reward.1.Amount=5
          &Reward.1.CurrencyCode=USD
          &Question=[URL-encoded question data]
          &AssignmentDurationInSeconds=30
          &LifetimeInSeconds=604800
          &Keywords=location,%20photograph,%20image,%20identification,%20opinion
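
A request like this one could be assembled with the Python standard library. This is a sketch only: it builds the URL string and sends nothing, it does not compute the signature, and the function name `build_create_hit_url` and the argument values in the usage line are placeholders invented for the example. The parameter names and the `2008-08-02` version are copied from the slide.

```python
from urllib.parse import quote, urlencode

ENDPOINT = "http://mechanicalturk.amazonaws.com/"


def build_create_hit_url(access_key, signature, timestamp, question):
    """Build the CreateHIT query URL shown on the slide (no request is sent)."""
    params = [
        ("Service", "AWSMechanicalTurkRequester"),
        ("AWSAccessKeyId", access_key),
        ("Version", "2008-08-02"),
        ("Operation", "CreateHIT"),
        ("Signature", signature),  # assumed to be computed elsewhere
        ("Timestamp", timestamp),
        ("Title", "Location and Photograph Identification"),
        ("Description", "Select the image that best represents"),
        ("Reward.1.Amount", "5"),
        ("Reward.1.CurrencyCode", "USD"),
        ("Question", question),
        ("AssignmentDurationInSeconds", "30"),
        ("LifetimeInSeconds", "604800"),
        ("Keywords", "location, photograph, image, identification, opinion"),
    ]
    # quote() encodes spaces as %20 like the slide; commas are left as-is.
    return ENDPOINT + "?" + urlencode(params, safe=",", quote_via=quote)


# Placeholder credentials and question data, for illustration only.
print(build_create_hit_url("AKID", "SIG", "2010-03-25T12:00:00Z", "<q/>"))
```

Letting `urlencode` do the escaping avoids hand-encoding mistakes in the title, description and question data.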




Publish HITs




• Pay by credit card, debit card, Amazon Payments account or U.S. bank account
• HITs have to be paid in advance
• Amazon takes 10% on top
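
As a rough worked example of what "paid in advance" plus "10% on top" means for a budget, the sketch below multiplies the reward by the number of assignments and adds the fee. It assumes the 10% applies to the total of all rewards and ignores any minimum fee or bonuses; the function name and the sample numbers are made up for illustration.

```python
from decimal import Decimal

FEE_RATE = Decimal("0.10")  # the 10% mentioned on the slide


def prepaid_cost(reward, assignments):
    """Total to prepay: all rewards plus the fee on top (assumed flat 10%)."""
    rewards = Decimal(reward) * assignments
    fee = rewards * FEE_RATE
    return (rewards + fee).quantize(Decimal("0.01"))


# Hypothetical batch: 1000 assignments at 0.05 USD each.
print(prepaid_cost("0.05", 1000))  # prints 55.00
```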




Use Mechanical Turk for


• Work that requires human judgment
• Work that algorithms cannot completely solve
• Work that has unpredictable or spiky volume

Improving Data Quality

Are these two businesses the same?

  Peritor GmbH          Peritor Consulting
  Blücherstraße 22      Blücherstraße 22
  10961 Berlin          Hof III Aufgang 6
  http://peritor.com    10961 Berlin

  YES / NO

Background
• Data is the company’s business
• Accuracy and breadth are key to differentiation

Process
• 1 million data points to ingest each day
• 200 data sources

Problem
• Data needs to be normalized, enhanced and de-duplicated
• Algorithms could get data about 70% clean
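
One way to picture the split between algorithm and Worker is a similarity score with two cut-offs: confident pairs are decided automatically, and everything in between becomes a HIT. The sketch below is an illustration of that idea, not the process described in the talk. It uses `difflib.SequenceMatcher` from the standard library, and the thresholds `same_at` and `different_below` are arbitrary assumptions.

```python
from difflib import SequenceMatcher


def normalize(record):
    """Lowercase, drop commas and collapse whitespace."""
    return " ".join(record.lower().replace(",", " ").split())


def similarity(a, b):
    """Similarity ratio between 0.0 and 1.0 for two normalized records."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def route(a, b, same_at=0.95, different_below=0.40):
    """Decide automatically when confident, otherwise hand the pair to a Worker."""
    score = similarity(a, b)
    if score >= same_at:
        return "same"
    if score < different_below:
        return "different"
    return "ask a Worker"


first = "Peritor GmbH, Blücherstraße 22, 10961 Berlin"
second = "Peritor Consulting, Blücherstraße 22, Hof III Aufgang 6, 10961 Berlin"
print(route(first, second))
```

Tightening or loosening the two cut-offs trades the number of HITs to pay for against the risk of a wrong automatic decision.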

Moderating User Generated Content

Is this image explicit?

Image: http://www.flickr.com/photos/cmak/1521356521/

YES / NO

Background
• User generated content is a key part of a web 2.0 experience

Process
• Millions of photos uploaded every day

Problem
• Need to ensure user generated content meets site guidelines

Categorization
What kind of dress is this?

Image: http://www.flickr.com/photos/34801476@N00/296743627/

• Cocktail
• Bridal dress

Background
• Consumers need to be able to quickly find a product when shopping online

The Business Process
• Millions of new products are introduced every day
• Products are sourced from hundreds of merchants and manufacturers, each with their own taxonomy

Problem
• Need to properly categorize new products quickly in order to monetize

Optimizing your HITs for


                  Price




      Accuracy             Speed
Check it out!




          http://mturk.com
          http://turkers.proboards.com




Thank you for your attention
Peritor GmbH
Blücherstr. 22, Hof III Aufgang 6
10961 Berlin
Tel.: +49 (0)30 69 20 09 84 0
Fax: +49 (0)30 69 20 09 84 9
Internet: www.peritor.com
E-Mail: info@peritor.com



© Peritor GmbH - All rights reserved

🐬 The future of MySQL is Postgres 🐘
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 

Introduction to Mechanical Turk for AI Tasks

  • 1. Introduction to Mechanical Turk: Artificial Artificial Intelligence. AWS User Group Berlin, Thomas Metschke, 25.03.2010, Peritor GmbH
  • 2. Amazon Mechanical Turk is a marketplace for work.
  • 3. Mechanical Turk Marketplace: 400,000+ workers; in 100+ countries; available 24/7; programmatically accessible. (Image: http://www.flickr.com/photos/diamond_rain/2543837414/)
  • 4. So there are basically Workers and Requesters. (Images: http://www.flickr.com/photos/saad/1968774, http://www.flickr.com/photos/chicagobart/4181533461)
  • 5. Mechanical Turk as a Worker: Workers make money by working on Human Intelligence Tasks. Workers can work from home and choose their own work hours. (Image: http://www.flickr.com/photos/saad/1968774)
  • 7. Your Dashboard: The number of available tasks.
  • 8. Your Dashboard: Total Earnings and Bonuses.
  • 9. Your Dashboard: HIT Status and Totals.
  • 10. How do I get the money? U.S. bank account; Amazon gift certificate; bank checks in rupees.
  • 11. Mechanical Turk as a Requester: Requesters have access to a global, on-demand, 24 x 7 workforce; can get thousands of HITs completed in minutes; pay only when they are satisfied with the results. (Image: http://www.flickr.com/photos/chicagobart/4181533461)
  • 12. Requesting HITs: (1) Requesters define and create their HITs and load them to Mechanical Turk. (2) Workers work on the HITs and submit results. (3) Requesters approve and pay for completed HITs and use the results.
  • 13. Design HITs: Enter properties; design layout.
  • 14. Design HITs faster: use CSV files, or take a developer and use SOAP / REST or the Amazon Mechanical Turk developer tools.
  • 15. What would it look like? http://mechanicalturk.amazonaws.com/ ?Service=AWSMechanicalTurkRequester &AWSAccessKeyId=[the Requester's Access Key ID] &Version=2008-08-02 &Operation=CreateHIT &Signature=[signature for this request] &Timestamp=[your system's local time] &Title=Location%20and%20Photograph%20Identification &Description=Select%20the%20image%20that%20best%20represents &Reward.1.Amount=5 &Reward.1.CurrencyCode=USD &Question=[URL-encoded question data] &AssignmentDurationInSeconds=30 &LifetimeInSeconds=604800 &Keywords=location,%20photograph,%20image,%20identification,%20opinion
  • 16. Publish HITs: HITs have to be paid in advance; Amazon takes 10% on top. (Diagram: credit card, debit card, or U.S. bank account funds the Amazon Payments account.)
  • 17. Use Mechanical Turk for: work that requires human judgment; work that algorithms cannot completely solve; work that has unpredictable or spiky volume.
  • 18. Improving Data Quality. Background: Data is the company's business; accuracy and breadth are key to differentiation. Process: 1 MM data points to ingest each day; 200 data sources. Problem: Data needs to be normalized, enhanced and de-duped; algorithms could get data about 70% clean. Example HIT: "Are these two businesses the same?" Peritor GmbH, Blücherstraße 22, 10961 Berlin, http://peritor.com versus Peritor Consulting, Blücherstraße 22, Hof III Aufgang 6, 10961 Berlin. Answers: YES / NO.
  • 19. Moderating User Generated Content. Background: User generated content is a key part of a web 2.0 experience. Process: Millions of photos uploaded every day. Problem: Need to ensure user generated content meets site guidelines. Example HIT: "Is this image explicit?" (Image: http://www.flickr.com/photos/cmak/1521356521/) Answers: YES / NO.
  • 20. Categorization. Background: Consumers need to be able to quickly find a product when shopping online. The Business Process: Millions of new products are introduced every day; products are sourced from hundreds of merchants and manufacturers, each with their own taxonomy. Problem: Need to properly categorize new products quickly in order to monetize. Example HIT: "What kind of dress is this?" (Image: http://www.flickr.com/photos/34801476@N00/296743627/) Answers: Cocktail / Bridal dress.
  • 21. Optimizing your HITs for: price, accuracy, speed.
  • 22. Check it out! http://mturk.com http://turkers.proboards.com
  • 23. Thank you for your attention. Peritor GmbH, Blücherstr. 22, Hof III Aufgang 6, 10961 Berlin. Tel.: +49 (0)30 69 20 09 84 0, Fax: +49 (0)30 69 20 09 84 9, Internet: www.peritor.com, E-Mail: info@peritor.com. © Peritor GmbH - All rights reserved
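
The CreateHIT request on slide 15 and the prepayment rule on slide 16 can be sketched in Python roughly as follows. This is a minimal illustration, not the deck's own code: the helper names `build_create_hit_url` and `prepaid_cost` are hypothetical, the credentials and question payload are placeholders, and the signature is passed in ready-made because the slides do not show how it is computed. The endpoint and `Version=2008-08-02` are copied from the 2010 slide and may no longer be accepted by the service, so the sketch only builds the URL and never sends it. The fee calculation assumes that "10% on top" means 10% of the total rewards.

```python
from decimal import Decimal
from urllib.parse import quote, urlencode

# Endpoint and API version exactly as printed on slide 15 (2010-era values).
ENDPOINT = "http://mechanicalturk.amazonaws.com/"
API_VERSION = "2008-08-02"


def build_create_hit_url(access_key_id, signature, timestamp, question):
    """Assemble the CreateHIT query string from slide 15 (hypothetical helper).

    The signature is supplied by the caller; computing it is out of scope here.
    """
    params = {
        "Service": "AWSMechanicalTurkRequester",
        "AWSAccessKeyId": access_key_id,
        "Version": API_VERSION,
        "Operation": "CreateHIT",
        "Signature": signature,
        "Timestamp": timestamp,
        "Title": "Location and Photograph Identification",
        "Description": "Select the image that best represents",
        "Reward.1.Amount": "5",
        "Reward.1.CurrencyCode": "USD",
        "Question": question,
        "AssignmentDurationInSeconds": "30",
        "LifetimeInSeconds": "604800",
        "Keywords": "location, photograph, image, identification, opinion",
    }
    # quote_via=quote encodes spaces as %20 (as on the slide) instead of "+";
    # safe="," leaves the commas in the keyword list unescaped, also as on the slide.
    return ENDPOINT + "?" + urlencode(params, safe=",", quote_via=quote)


def prepaid_cost(reward_usd, assignments, fee_rate=Decimal("0.10")):
    """Amount to prepay: total rewards plus the assumed 10% fee on top."""
    rewards = Decimal(reward_usd) * assignments
    return rewards + rewards * fee_rate


# Placeholder values for illustration only.
url = build_create_hit_url(
    access_key_id="EXAMPLE-KEY-ID",
    signature="EXAMPLE-SIGNATURE",
    timestamp="2010-03-25T19:00:00Z",
    question="<QuestionForm>...</QuestionForm>",
)
print(url)
print(prepaid_cost("0.05", 1000))
```

Under these assumptions, 1,000 assignments at a hypothetical reward of USD 0.05 each would require a prepayment of USD 55: USD 50 in rewards plus USD 5 in fees. `Decimal` is used instead of floats so that small per-task amounts add up exactly.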