SlideShare una empresa de Scribd logo
1 de 39
IBM’s Watson: Is the World’s Trivia Champion the Future of Search? Josh Dreller VP Media Technology & Analytics Fuor Digital  @fuordigital
Two Types of Innovation Incremental – improving current projects and advancing them in a linear fashion Grand Challenges – pushing the limits of science Must be Difficult (has to be a challenge) Must be Inspiring Irresistible Vision – “just has to be done”
Previous IBM Grand Challenges
Open Question Answering The way normal humans communicate Natural Language – very ambiguous, but at the heart of human intelligence Last night I shot an    elephant in my pajamas.         How it got into my    pajamas I’ll never know.
The Ultimate Advancement:Computers that can communicate with humans in Natural Language
What Can Search Learn From Watson? The only way to push forward is to take huge leaps and look for self-imposed challenges even if we can’t prove out the business case right now What kind of Grand Challenges could Search create? A non-spammable Search Engine? No need for Search Engine Optimization? ?? 8
The Jeopardy! Challenge: A compelling and notable way to drive and measure the technology of automatic Question Answering along 5 Key Dimensions Broad/Open Domain $200 If you're standing, it's the direction you should look to check out the wainscoting. Complex Language $1000 Of the 4 countries in the world that the U.S. does not have diplomatic relations with, the one that’s farthest north High Precision $800 In cell division, mitosis splits the nucleus & cytokinesis splits this liquid cushioning the nucleus Accurate  Confidence High Speed
Jeopardy Reaction Not too inspired at first Weren’t interested in a circus sideshow 2009, IBM setup a moch-studio at their New York research facility Sparring matches with ex-Jeopardy winners Eventually saw the potential and thought it was something special
A Very Simple Question for a Computer In (( 12,546,798 * P ) ^ 2) / 34,567.46 = ? Greater than or less than 1? 50/50 Shot = .00885
Real Language is Real Hard Chess A finite, mathematically well-defined search space Limited number of moves and states Grounded in explicit, unambiguous mathematical rules Human Language Ambiguous, contextual and implicit Grounded only in human cognition Seemingly infinitenumber of ways to       express the same meaning
The Opposite of Current Computer Language Questions not designed for a computer to answer  Slang Crafty questions Shorthand Rhyme Regionalism Anagrams Complex Language!
Structured vs. Unstructured Data Where was Einstein born? Structured Unstructured One day, from among his city views of Ulm, Otto chose a watercolor to send to Albert Einstein as a remembrance of Einstein´s birthplace.
Common Sense Knowledge Base An ontology of classes and individuals Parts and materials of objects Properties of objects (such as color and size) Functions and uses of objects Locations of objects and layouts of locations Locations of actions and events Durations of actions and events Preconditions of actions and events Effects (post conditions) of actions and events Subjects and objects of actions Behaviors of devices Stereotypical situations or scripts Human goals and needs Emotions Plans and strategies Story themes Contexts 16 Can a can CanCan?
What Can Search Learn From Watson? We need to focus on what computers aren’t good at, not what they are good at Keywords and Links are not savvy enough.   Natural language is key to a next generation search engine Most of human knowledge is kept in unstructured data sources or based on common sense context 17
The DeepQA Project Dr. David Ferrucci 25-30 full time      researchers from      many disciplines.  2007-2011 Millions of dollars Post Jeopardy implications
Speed Results Deployed Watson on 2,880 IBM POWER 750 computer cores Went from 2 hours per question on a single CPU to an average ofjust 3 seconds – fast enough to compete with the best.
Example Question Related Content (Structured & Unstructured) Keywords: 1698, comet,    paramour, pink, … AnswerType(comet discoverer) Date(1698) Took(discoverer, ship) Called(ship, Paramour Pink) … Primary  Search Question  Analysis Candidate Answer Generation … Temporal Taxonomic Lexical Spatial [0.58 0 -1.3 … 0.97] Isaac Newton [0.71 1 13.4 … 0.72] Wilhelm Tempel [0.12 0  2.0 … 0.40] HMS Paramour Edmond Halley (0.85) Christiaan Huygens (0.20) Peter Sellers (0.05) [0.84 1 10.6 … 0.21] Christiaan Huygens [0.33 0  6.3 … 0.83] Halley’s Comet [0.21 1 11.1 … 0.92] Edmond Halley Merging & Ranking [0.91 0 -8.2 … 0.61] Pink Panther [0.91 0 -1.7 … 0.60] Peter Sellers Evidence Retrieval … Evidence Scoring IN 1698, THIS COMET DISCOVERER TOOK A SHIP CALLED THE PARAMOUR PINK ON THE FIRST PURELY SCIENTIFIC SEA VOYAGE
Confidence is Key Watson only rings in if it can reach a statistically significant confidence in time Some questions take longer than others Some questions will be able to answer less confident than others Watson manages risk in betting based on confidence
Embarrassingly Parallel Computing Def:  “Little or no effort is required to separate the problem into a number of parallel tasks” Works on many algorithms at once and comes back together with confidence scores. different from distributed computing problems (such as Google’s MapReduce) that require communication between tasks, especially communication of intermediate results.
What Can Search Learn From Watson? We can’t be bound by the constraints of current technology Two fold process of first coming up with answers then vetting them with more evidence Ideas like Parallel Processing will allow us to jump ahead 24
Ken Jennings & Brad Rutter
Top human players are remarkably good. Computers? The Best Human Performance: Our Analysis Reveals the Winner’s Cloud Each dot represents an actual historical human Jeopardy! game Winning Human Performance Grand Champion Human Performance 2007 QA Computer System More Confident Less Confident 27
DeepQA: Incremental Progress in Precision and Confidence 6/2007-11/2010  Now Playing in the Winners Cloud 11/2010 4/2010 10/2009 5/2009 12/2008 Precision 8/2008 5/2008 12/2007 Baseline
Confidence Bar
What Can Search Learn From Watson? Confidence bar would be a great addition to SERPs  We must benchmark what is “good” and then aim higher These things take time (and money) 30
TJ Watson Research CenterYorktown, NYTwo Games: Aired February 14-16, 2011
The End – Humans Win
Financial Industry Generates large amounts of data and growing 70% per year Not just numbers, but all info that would influence the biz landscape (news, articles, blogs, etc) Recent financial crisis shows failures of lack of understanding in interdependencies
DeepQA in Continuous Evidence-Based Diagnostic Analysis Considers and synthesizes a broad range of evidence improving quality, reducing cost Symptoms Diagnosis Models Meds Symp Fam Hist Find Confidence Renal failure Family History Patient History UTI Medications Tests/Findings Diabetes Influenza Notes/Hypotheses hypokalemia esophogitis Most Confident Diagnosis: Diabetes Most Confident Diagnosis: UTI Most Confident Diagnosis: Influenza Most Confident Diagnosis: Diabetes and Esophogitis Huge Volumes of Texts, Journals, References, DBs etc.
What Can Search Learn From Watson? Even the most daunting task can be overcome It’s not company versus company, it’s stretching human knowledge How can search engines help other industries 37
Sources “What is Watson?” presentation by Adam Lally, IBM Research ,[object Object]
NYTimes article: “What Is I.B.M.’s Watson?http://www.nytimes.com/2010/06/20/magazine/20Computer-t.html?_r=2&ref=opinionWired magazine:“IBM’s Watson Supercomputer Wins Practice Jeopardy Round”http://www.wired.com/epicenter/2011/01/ibm-watson-jeopardy/# More technical: AI magazine “Building Watson: An Overview of the DeepQA Project”http://www.stanford.edu/class/cs124/AIMagzine-DeepQA.pdf

Más contenido relacionado

Destacado

1500 revision matt del re
1500 revision matt del re1500 revision matt del re
1500 revision matt del re
MediaPost
 
1630 omma video dan beer hung nguyen
1630 omma video dan beer hung nguyen1630 omma video dan beer hung nguyen
1630 omma video dan beer hung nguyen
MediaPost
 
Sis fri 0845 jeremiah andrick
Sis fri 0845 jeremiah andrickSis fri 0845 jeremiah andrick
Sis fri 0845 jeremiah andrick
MediaPost
 
Omma metrics alex_yoder
Omma metrics alex_yoderOmma metrics alex_yoder
Omma metrics alex_yoder
MediaPost
 
1315 revision sponsor acxiom
1315 revision sponsor acxiom1315 revision sponsor acxiom
1315 revision sponsor acxiom
MediaPost
 

Destacado (7)

1500 revision matt del re
1500 revision matt del re1500 revision matt del re
1500 revision matt del re
 
1630 omma video dan beer hung nguyen
1630 omma video dan beer hung nguyen1630 omma video dan beer hung nguyen
1630 omma video dan beer hung nguyen
 
Sis fri 0845 jeremiah andrick
Sis fri 0845 jeremiah andrickSis fri 0845 jeremiah andrick
Sis fri 0845 jeremiah andrick
 
Keynote: Social's Influence on Search
Keynote: Social's Influence on SearchKeynote: Social's Influence on Search
Keynote: Social's Influence on Search
 
Omma metrics alex_yoder
Omma metrics alex_yoderOmma metrics alex_yoder
Omma metrics alex_yoder
 
1315 revision sponsor acxiom
1315 revision sponsor acxiom1315 revision sponsor acxiom
1315 revision sponsor acxiom
 
Presentation: What Are The Top 5 Emerging Search Categories and Why?
Presentation: What Are The Top 5 Emerging Search Categories and Why?Presentation: What Are The Top 5 Emerging Search Categories and Why?
Presentation: What Are The Top 5 Emerging Search Categories and Why?
 

Similar a Sis sat 1000 josh dreller

Big, Open, Data and Semantics for Real-World Application Near You
Big, Open, Data and Semantics for Real-World Application Near YouBig, Open, Data and Semantics for Real-World Application Near You
Big, Open, Data and Semantics for Real-World Application Near You
Biplav Srivastava
 
Sp14 cs188 lecture 1 - introduction
Sp14 cs188 lecture 1  - introductionSp14 cs188 lecture 1  - introduction
Sp14 cs188 lecture 1 - introduction
Amer Noureddin
 
SP14 CS188 Lecture 1 -- Introduction.pptx
SP14 CS188 Lecture 1 -- Introduction.pptxSP14 CS188 Lecture 1 -- Introduction.pptx
SP14 CS188 Lecture 1 -- Introduction.pptx
ssuser851498
 
COM 578 Empirical Methods in Machine Learning and Data Mining
COM 578 Empirical Methods in Machine Learning and Data MiningCOM 578 Empirical Methods in Machine Learning and Data Mining
COM 578 Empirical Methods in Machine Learning and Data Mining
butest
 
From Research Objects to Reproducible Science Tales
From Research Objects to Reproducible Science TalesFrom Research Objects to Reproducible Science Tales
From Research Objects to Reproducible Science Tales
Bertram Ludäscher
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273A
butest
 

Similar a Sis sat 1000 josh dreller (20)

Big, Open, Data and Semantics for Real-World Application Near You
Big, Open, Data and Semantics for Real-World Application Near YouBig, Open, Data and Semantics for Real-World Application Near You
Big, Open, Data and Semantics for Real-World Application Near You
 
Sp14 cs188 lecture 1 - introduction
Sp14 cs188 lecture 1  - introductionSp14 cs188 lecture 1  - introduction
Sp14 cs188 lecture 1 - introduction
 
Jdb code biology and ai final
Jdb code biology and ai finalJdb code biology and ai final
Jdb code biology and ai final
 
Mass declassification sept 23 2010v2.1
Mass declassification sept 23 2010v2.1Mass declassification sept 23 2010v2.1
Mass declassification sept 23 2010v2.1
 
SP14 CS188 Lecture 1 -- Introduction.pptx
SP14 CS188 Lecture 1 -- Introduction.pptxSP14 CS188 Lecture 1 -- Introduction.pptx
SP14 CS188 Lecture 1 -- Introduction.pptx
 
The Future of AI: Going Beyond Deep Learning, Watson, and the Semantic Web
The Future of AI: Going BeyondDeep Learning, Watson, and the Semantic WebThe Future of AI: Going BeyondDeep Learning, Watson, and the Semantic Web
The Future of AI: Going Beyond Deep Learning, Watson, and the Semantic Web
 
COM 578 Empirical Methods in Machine Learning and Data Mining
COM 578 Empirical Methods in Machine Learning and Data MiningCOM 578 Empirical Methods in Machine Learning and Data Mining
COM 578 Empirical Methods in Machine Learning and Data Mining
 
Lecture 1
Lecture 1Lecture 1
Lecture 1
 
Ntegra 20231003 v3.pptx
Ntegra 20231003 v3.pptxNtegra 20231003 v3.pptx
Ntegra 20231003 v3.pptx
 
Chatbots in 2017 -- Ithaca Talk Dec 6
Chatbots in 2017 -- Ithaca Talk Dec 6Chatbots in 2017 -- Ithaca Talk Dec 6
Chatbots in 2017 -- Ithaca Talk Dec 6
 
Artificial intelligence
Artificial intelligenceArtificial intelligence
Artificial intelligence
 
Big data new physics giga om structure conference ny - march 2011
Big data new physics   giga om structure conference ny - march 2011Big data new physics   giga om structure conference ny - march 2011
Big data new physics giga om structure conference ny - march 2011
 
Semantic Integration of Citizen Sensor Data and Multilevel Sensing: A compreh...
Semantic Integration of Citizen Sensor Data and Multilevel Sensing: A compreh...Semantic Integration of Citizen Sensor Data and Multilevel Sensing: A compreh...
Semantic Integration of Citizen Sensor Data and Multilevel Sensing: A compreh...
 
Crowdsourcing for Online Data Collection
Crowdsourcing for Online Data CollectionCrowdsourcing for Online Data Collection
Crowdsourcing for Online Data Collection
 
Human computer cooperation
Human computer cooperationHuman computer cooperation
Human computer cooperation
 
A primer on Artificial Intelligence (AI) and Machine Learning (ML)
A primer on Artificial Intelligence (AI) and Machine Learning (ML)A primer on Artificial Intelligence (AI) and Machine Learning (ML)
A primer on Artificial Intelligence (AI) and Machine Learning (ML)
 
From Research Objects to Reproducible Science Tales
From Research Objects to Reproducible Science TalesFrom Research Objects to Reproducible Science Tales
From Research Objects to Reproducible Science Tales
 
Big Data and the Art of Data Science
Big Data and the Art of Data ScienceBig Data and the Art of Data Science
Big Data and the Art of Data Science
 
Machine Learning, Artificial General Intelligence, and Robots with Human Minds
Machine Learning, Artificial General Intelligence, and Robots with Human MindsMachine Learning, Artificial General Intelligence, and Robots with Human Minds
Machine Learning, Artificial General Intelligence, and Robots with Human Minds
 
Machine Learning ICS 273A
Machine Learning ICS 273AMachine Learning ICS 273A
Machine Learning ICS 273A
 

Más de MediaPost

Más de MediaPost (20)

Visible Wireless: Grass Roots Branding and Media Planning
Visible Wireless: Grass Roots Branding and Media PlanningVisible Wireless: Grass Roots Branding and Media Planning
Visible Wireless: Grass Roots Branding and Media Planning
 
MediaPost Data & Programmatic Insider Summit - Survey Results
MediaPost Data & Programmatic Insider Summit - Survey ResultsMediaPost Data & Programmatic Insider Summit - Survey Results
MediaPost Data & Programmatic Insider Summit - Survey Results
 
Can the Past Predict the Future of CTV?
Can the Past Predict the Future of CTV?Can the Past Predict the Future of CTV?
Can the Past Predict the Future of CTV?
 
First-Party Data Takes The Cake In A Post-Cookie World
First-Party Data Takes The Cake In A Post-Cookie WorldFirst-Party Data Takes The Cake In A Post-Cookie World
First-Party Data Takes The Cake In A Post-Cookie World
 
Real-time buying for real-time events: Leveraging Programmatic TV for Live Ev...
Real-time buying for real-time events: Leveraging Programmatic TV for Live Ev...Real-time buying for real-time events: Leveraging Programmatic TV for Live Ev...
Real-time buying for real-time events: Leveraging Programmatic TV for Live Ev...
 
The Right Audience for the Job: Cadillac’s First Party Data Engine
The Right Audience for the Job: Cadillac’s First Party Data Engine The Right Audience for the Job: Cadillac’s First Party Data Engine
The Right Audience for the Job: Cadillac’s First Party Data Engine
 
Sustained Innovation Through Creativity, Technology & Data
Sustained Innovation Through Creativity, Technology & DataSustained Innovation Through Creativity, Technology & Data
Sustained Innovation Through Creativity, Technology & Data
 
Search and Performance Insider Summit - Survey Results
Search and Performance Insider Summit - Survey ResultsSearch and Performance Insider Summit - Survey Results
Search and Performance Insider Summit - Survey Results
 
Reaching Buyers Without Cookies
Reaching Buyers Without CookiesReaching Buyers Without Cookies
Reaching Buyers Without Cookies
 
Cookie Apocalypse!!!
Cookie Apocalypse!!!Cookie Apocalypse!!!
Cookie Apocalypse!!!
 
Leveraging Performance Video on Amazon
Leveraging Performance Video on AmazonLeveraging Performance Video on Amazon
Leveraging Performance Video on Amazon
 
MediaPost Publishing Insider Summit Survey
MediaPost Publishing Insider Summit SurveyMediaPost Publishing Insider Summit Survey
MediaPost Publishing Insider Summit Survey
 
When Less is More: Building a Successful Advertising Business from a Subscrip...
When Less is More: Building a Successful Advertising Business from a Subscrip...When Less is More: Building a Successful Advertising Business from a Subscrip...
When Less is More: Building a Successful Advertising Business from a Subscrip...
 
What Do First Party Data and Golf Have In Common?
What Do First Party Data and Golf Have In Common? What Do First Party Data and Golf Have In Common?
What Do First Party Data and Golf Have In Common?
 
Turning Customers Into Fans: Church’s New Social Media Playbook
Turning Customers Into Fans: Church’s New Social Media PlaybookTurning Customers Into Fans: Church’s New Social Media Playbook
Turning Customers Into Fans: Church’s New Social Media Playbook
 
Restaurant Customer Engagement: The Path to Personalization
Restaurant Customer Engagement: The Path to PersonalizationRestaurant Customer Engagement: The Path to Personalization
Restaurant Customer Engagement: The Path to Personalization
 
Delivery & Streaming, the Ultimate Experience with Roku
Delivery & Streaming, the Ultimate Experience with RokuDelivery & Streaming, the Ultimate Experience with Roku
Delivery & Streaming, the Ultimate Experience with Roku
 
Focus Brands’ Licensing Calculus
Focus Brands’ Licensing CalculusFocus Brands’ Licensing Calculus
Focus Brands’ Licensing Calculus
 
Three Tips to Maximize Creative Asset Efficiency
Three Tips to Maximize Creative Asset EfficiencyThree Tips to Maximize Creative Asset Efficiency
Three Tips to Maximize Creative Asset Efficiency
 
The QSR Media Dispersion: Pre, Mid & Post Pandemic – By the Numbers
The QSR Media Dispersion: Pre, Mid & Post Pandemic – By the NumbersThe QSR Media Dispersion: Pre, Mid & Post Pandemic – By the Numbers
The QSR Media Dispersion: Pre, Mid & Post Pandemic – By the Numbers
 

Último

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Último (20)

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

Sis sat 1000 josh dreller

  • 1. IBM’s Watson: Is the World’s Trivia Champion the Future of Search? Josh Dreller VP Media Technology & Analytics Fuor Digital @fuordigital
  • 2.
  • 3.
  • 4. Two Types of Innovation Incremental – improving current projects and advancing them in a linear fashion Grand Challenges – pushing the limits of science Must be Difficult (has to be a challenge) Must be Inspiring Irresistible Vision – “just has to be done”
  • 5. Previous IBM Grand Challenges
  • 6. Open Question Answering The way normal humans communicate Natural Language – very ambiguous, but at the heart of human intelligence Last night I shot an elephant in my pajamas. How it got into my pajamas I’ll never know.
  • 7. The Ultimate Advancement:Computers that can communicate with humans in Natural Language
  • 8. What Can Search Learn From Watson? The only way to push forward is to take huge leaps and look for self-imposed challenges even if we can’t prove out the business case right now What kind of Grand Challenges could Search create? A non-spammable Search Engine? No need for Search Engine Optimization? ?? 8
  • 9.
  • 10. The Jeopardy! Challenge: A compelling and notable way to drive and measure the technology of automatic Question Answering along 5 Key Dimensions Broad/Open Domain $200 If you're standing, it's the direction you should look to check out the wainscoting. Complex Language $1000 Of the 4 countries in the world that the U.S. does not have diplomatic relations with, the one that’s farthest north High Precision $800 In cell division, mitosis splits the nucleus & cytokinesis splits this liquid cushioning the nucleus Accurate Confidence High Speed
  • 11. Jeopardy Reaction Not too inspired at first Weren’t interested in a circus sideshow 2009, IBM setup a moch-studio at their New York research facility Sparring matches with ex-Jeopardy winners Eventually saw the potential and thought it was something special
  • 12. A Very Simple Question for a Computer In (( 12,546,798 * P ) ^ 2) / 34,567.46 = ? Greater than or less than 1? 50/50 Shot = .00885
  • 13. Real Language is Real Hard Chess A finite, mathematically well-defined search space Limited number of moves and states Grounded in explicit, unambiguous mathematical rules Human Language Ambiguous, contextual and implicit Grounded only in human cognition Seemingly infinitenumber of ways to express the same meaning
  • 14. The Opposite of Current Computer Language Questions not designed for a computer to answer Slang Crafty questions Shorthand Rhyme Regionalism Anagrams Complex Language!
  • 15. Structured vs. Unstructured Data Where was Einstein born? Structured Unstructured One day, from among his city views of Ulm, Otto chose a watercolor to send to Albert Einstein as a remembrance of Einstein´s birthplace.
  • 16. Common Sense Knowledge Base An ontology of classes and individuals Parts and materials of objects Properties of objects (such as color and size) Functions and uses of objects Locations of objects and layouts of locations Locations of actions and events Durations of actions and events Preconditions of actions and events Effects (post conditions) of actions and events Subjects and objects of actions Behaviors of devices Stereotypical situations or scripts Human goals and needs Emotions Plans and strategies Story themes Contexts 16 Can a can CanCan?
  • 17. What Can Search Learn From Watson? We need to focus on what computers aren’t good at, not what they are good at Keywords and Links are not savvy enough. Natural language is key to a next generation search engine Most of human knowledge is kept in unstructured data sources or based on common sense context 17
  • 18.
  • 19. The DeepQA Project Dr. David Ferrucci 25-30 full time researchers from many disciplines. 2007-2011 Millions of dollars Post Jeopardy implications
  • 20. Speed Results Deployed Watson on 2,880 IBM POWER 750 computer cores Went from 2 hours per question on a single CPU to an average ofjust 3 seconds – fast enough to compete with the best.
  • 21. Example Question Related Content (Structured & Unstructured) Keywords: 1698, comet, paramour, pink, … AnswerType(comet discoverer) Date(1698) Took(discoverer, ship) Called(ship, Paramour Pink) … Primary Search Question Analysis Candidate Answer Generation … Temporal Taxonomic Lexical Spatial [0.58 0 -1.3 … 0.97] Isaac Newton [0.71 1 13.4 … 0.72] Wilhelm Tempel [0.12 0 2.0 … 0.40] HMS Paramour Edmond Halley (0.85) Christiaan Huygens (0.20) Peter Sellers (0.05) [0.84 1 10.6 … 0.21] Christiaan Huygens [0.33 0 6.3 … 0.83] Halley’s Comet [0.21 1 11.1 … 0.92] Edmond Halley Merging & Ranking [0.91 0 -8.2 … 0.61] Pink Panther [0.91 0 -1.7 … 0.60] Peter Sellers Evidence Retrieval … Evidence Scoring IN 1698, THIS COMET DISCOVERER TOOK A SHIP CALLED THE PARAMOUR PINK ON THE FIRST PURELY SCIENTIFIC SEA VOYAGE
  • 22. Confidence is Key Watson only rings in if it can reach a statistically significant confidence in time Some questions take longer than others Some questions will be able to answer less confident than others Watson manages risk in betting based on confidence
  • 23. Embarrassingly Parallel Computing Def: “Little or no effort is required to separate the problem into a number of parallel tasks” Works on many algorithms at once and comes back together with confidence scores. different from distributed computing problems (such as Google’s MapReduce) that require communication between tasks, especially communication of intermediate results.
  • 24. What Can Search Learn From Watson? We can’t be bound by the constraints of current technology Two fold process of first coming up with answers then vetting them with more evidence Ideas like Parallel Processing will allow us to jump ahead 24
  • 25.
  • 26. Ken Jennings & Brad Rutter
  • 27. Top human players are remarkably good. Computers? The Best Human Performance: Our Analysis Reveals the Winner’s Cloud Each dot represents an actual historical human Jeopardy! game Winning Human Performance Grand Champion Human Performance 2007 QA Computer System More Confident Less Confident 27
  • 28. DeepQA: Incremental Progress in Precision and Confidence 6/2007-11/2010 Now Playing in the Winners Cloud 11/2010 4/2010 10/2009 5/2009 12/2008 Precision 8/2008 5/2008 12/2007 Baseline
  • 30. What Can Search Learn From Watson? Confidence bar would be a great addition to SERPs We must benchmark what is “good” and then aim higher These things take time (and money) 30
  • 31.
  • 32. TJ Watson Research CenterYorktown, NYTwo Games: Aired February 14-16, 2011
  • 33. The End – Humans Win
  • 34.
  • 35. Financial Industry Generates large amounts of data and growing 70% per year Not just numbers, but all info that would influence the biz landscape (news, articles, blogs, etc) Recent financial crisis shows failures of lack of understanding in interdependencies
  • 36. DeepQA in Continuous Evidence-Based Diagnostic Analysis Considers and synthesizes a broad range of evidence improving quality, reducing cost Symptoms Diagnosis Models Meds Symp Fam Hist Find Confidence Renal failure Family History Patient History UTI Medications Tests/Findings Diabetes Influenza Notes/Hypotheses hypokalemia esophogitis Most Confident Diagnosis: Diabetes Most Confident Diagnosis: UTI Most Confident Diagnosis: Influenza Most Confident Diagnosis: Diabetes and Esophogitis Huge Volumes of Texts, Journals, References, DBs etc.
  • 37. What Can Search Learn From Watson? Even the most daunting task can be overcome It’s not company versus company, it’s stretching human knowledge How can search engines help other industries 37
  • 38.
  • 39. NYTimes article: “What Is I.B.M.’s Watson?http://www.nytimes.com/2010/06/20/magazine/20Computer-t.html?_r=2&ref=opinionWired magazine:“IBM’s Watson Supercomputer Wins Practice Jeopardy Round”http://www.wired.com/epicenter/2011/01/ibm-watson-jeopardy/# More technical: AI magazine “Building Watson: An Overview of the DeepQA Project”http://www.stanford.edu/class/cs124/AIMagzine-DeepQA.pdf

Notas del editor

  1. In order to know we are making progress on scientific problems like open-domain QA well-defined challenges help demonstrate we can solve concrete & difficult tasks. As you might know Jeopardy! Is a long-standing, well-regarded and highly challenging Television quiz show in the US that demands human contestants to quickly understand and answer richly expressed natural language questions over a staggering array of topics.The Jeopardy! Challenge uniquely provides a palpable, compelling and notable way to drive the technology of Question Answering along key dimensionsIf you are familiar with the quiz show it asks an I incredibly broad range of questions over a huge variety of topics.In a single round there is a grid of 6 Categories and for each category 5 rows with increasing $ values. Once a cell is chosen by 1 of three players, A question, or what is often called a Clue is revealed.Here you see some example questions.<read some of the questions> Jeopardy uses complex and often subtle language to describe what is being asked. To win you have to be extraordinarily precise. You must deliver the exact answer – no more and no less – it is not good enough for it be somewhere in the top 2, 10 or 20 documents – you must know it exactly and get it in first place – otherwise no credit – in fact you loose points. You must demonstrate Accurate Confidences -- That is -- you must know what you know – if you “buzz –in” and then get it wrong you lose the $$ value of the question. And you have to do this all very quickly – deeply analyze huge volumes of content, consider many possible answers, compute your confidence and buzz in – all in just seconds.As we shall see compete with human champions at this game represents a Grand Challenge in Automatic Open-Domain Question Answering.<STOP><NEXT SLIDE>
  2. Computer programs are natively explicit and exacting in their calculations over numbers and symbols.But Natural Language - -the words and phrases we humans use to communicate with one another -- is implicit -- the exact meaning is not completely and exactly indicated -- but instead is highly dependent on the context -- what has been said before, the topic, how it is being discussed -- factually, figuratively, fictionally etc.Moreover, natural language is often imprecise – it does not have to treat a subject with numerical precision…humans naturally interact and operate all the time with different degrees of uncertainty and fuzzy associations between words and concepts. We use huge amounts of background knowledge to reconcile and interpret what we read.Consider these examples….it is one thing to build a database table to exactly answer the question “Where is someone born?”. The computer looks up the name in one column and is programmed to know that the other column contains the birth place. STRUCUTRED information, like this database table, is designed for computers to make simple comparisons and to be exactly as accurate as the data entered into the database. Natural language is created and used by humans for humans. A reason we call natural language “Unstructured” is because it lacks the exact structure and meaning that computer programs typically use to answer questions. Understanding what is being represented is a whole other challenge for computer programs .Consider this sentence <read> It implies that Albert Einstein was born in Ulm – but there is a whole lot the computer has to do to figure that out any degree of certainty - it has to understand sentence structure, parts of speech, the possible meaning of words and phrases and how they related to the words and phrases in the question. What does a remembrance, a water color and an Otto have to do with where someone was born.Consider another question in the Jeopardy Style … X ran this? And this potentially answer-bearing sentence. Read the Sentence…Does this sentence answer the question for Jack Welch - -what does “ran” have to do with leadership or painting. How would a computer confidently infer from this sentence that Jack Welch ran GE – might be easer to deduce that he was at least a painter there.
  3. Human performance is one of the things that makes the Jeopardy! Challenge so compelling. The best humans are very, very good at this task. In this chart, each dot corresponds to actual historical Jeopardy! games and represents the performance of the winner of those games. We refer to this cluster of dots as the “Winners Cloud”. For each dot, the X-axis, along the bottom of the graph, represents the percentage of questions in a game that the winning player got a chance to ANSWER. These were the questions he or she was confident enough and fast enough to ring-in or buzz in for FIRST. The Y-axis, going up along the left of the graph, represents the winning player’s PRECISION – that is, the percentage of those questions answered the player got RIGHT. CORemember, if a player gets a question wrong then they lose the $ value of the clue and their competitors still get a chance to answer or rebound. But what we humans, tend to do really, really well is – confidently know what we know – computing an accurate confidence turns out to be key ability for winning at Jeopardy! Looking at the center of the green cloud, what you see is that, on average, WINNERS are confident enough and fast enough to answer nearly 50% of the questions in a game and do somewhere between 85% and 95% precision on those questions. That is, they get 85-95% of the ones they answer RIGHT. The red dots represents Ken Jennings's performance. Ken won 74 consecutive games against qualified players. He was confident and fast enough to acquire 60% and even up to 80% of a game’s questions from his competitors and still do 85% and 95% precision on average.Good Jeopardy! players are remarkable in their breadth, precision, confidence and speed.