SlideShare una empresa de Scribd logo
1 de 21
Text Analytics World
San Francisco – March 31, 2015
4:15-4:45pm
Speaker: Bryan Bell, Executive Vice President, Expert System USA
 What is in Your Business Requirement: Searching or Finding? Enterprise Search
 Product Demonstration: The Google Search Appliance (GSA) integrated with a
semantic technology platform.
1. Internal and external information comes at us faster than we can keep up with.
2. Business expectations for deploying solutions, using enterprise search and content navigation systems
to capture the hidden value of strategic information.
3. CONTEXT: Exploiting deep linguistic analysis, combined with semantics offers the ability to create
contextually correct metadata.
4. Dynamically enrich content with contextually relevant metadata and deploy as the heart of a
knowledge management applications and the Google Search Appliance.
1. Internal and external information comes at us faster than we can keep up with.
80 – 90% is unstructured text.
Zettabyte
1,000,000,000,000,000,000, 000 bytes
4
 The Google crawler visits 20 billion web sites a day.
 The search engine has located more than 30 trillion unique URLs.
 Processes 100 billion searches every month.
• 3.3 billion searches per day.
• Over 38,000 thousand searches per second.
• A single Google query uses 1,000 computers to retrieve an answer.
• This volume combined with the PageRank algorithm…
PR(A) = (1-d) + d (PR(T1)/C(T1) + PR(Tn)/C(Tn)) …. is why Google is so good on the internet.
• 16% to 20% of queries that get asked every day have never been asked before.
Amit Singhal,
Senior Vice President of development, Google Search
August 2012
The Internet
2. Deploying internal enterprise search
engine / content navigation system to
capture and share the hidden value of the
information that is available to the company.
The intranet / corporate portal
2. Deploying internal enterprise search
engine / content navigation system to
capture and share the hidden value of the
information that is available to the company.
The intranet / corporate portal
“Our search stinks!
I want it to work
like Google.”
9
Zettabyte
1,000,000,000,000,000,000,000 bytes
Good news:
PR(A) = (1-d) + d (PR(T1)/C(T1) + ... + PR(Tn)/C(Tn))
Don’t have 3.3 billion searches per day.
Don’t have 38,000 thousand searches per second.
Don’t have 1,000 computers to retrieve an answer.
10
Zettabyte
1,000,000,000,000,
000,000,000 bytes
Key words
No metadata
Poor metadata
Inconsistent
11
Zettabyte
1,000,000,000,000,
000,000,000 bytes
Key words
No metadata
Poor metadata
Inconsistent
=
POOR
CONTENT
FINDABILITY
12
stock
People are able to disambiguate “on the fly”, but machines cannot.
Key words vs. Context
Language ambiguity
13
People are able to disambiguate “on the fly”, but machines cannot.
stock
apple
Key words vs. Context
Language ambiguity
14
stock
apple
Apple
People are able to disambiguate “on the fly”, but machines cannot.
Key words vs. Context
Language ambiguity
15
stock
apple
Apple
“I bought 10,000 shares of stock in Apple.”
“I have 10,000 apples in stock.”
People are able to disambiguate “on the fly”, but machines cannot.
Context is King
3. Exploiting deep linguistic analysis,
combined with semantics.
4. Dynamically enrich content with contextually
relevant metadata.
How is word context established?
Morphological
analysis word forms dog, dog-catcher, doggy bag
Grammatical analysis parts of speech "There are 40 rows in the table." (noun)
"She rows 5 times a week." (verb)
Logical analysis
word
relationships
"The car I bought, to replace my Chrysler,
stinks."
Semantic analysis word context "I bought 10,000 shares of stock in Apple."
"I have 10,000 apples in stock."
"I used chicken broth for my soup stock."
Deep linguistic analysis of words to achieve word disambiguation.
How is word context established
and deployed with the GSA?
www.intelligenceapi.com
20
Linguistic and semantic analysis engine
27
Case Study: GSA – Google Search Appliance
What is in Your Business Requirement? Searching or Finding.
Contacts
Thank you
Bryan Bell
bbell@expertsystem.com
@bellbryan
+1.847.508.7938
www.expertsystem.com

Más contenido relacionado

La actualidad más candente

متن‌بازسازی کلان‌داده
متن‌بازسازی کلان‌دادهمتن‌بازسازی کلان‌داده
متن‌بازسازی کلان‌داده
جشنوارهٔ روز آزادی نرم‌افزار تهران
 
Tools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsTools for Unstructured Data Analytics
Tools for Unstructured Data Analytics
Ravi Teja
 
Glue Conference
Glue ConferenceGlue Conference
Glue Conference
Assist
 
Big data landscape v 3.0 - Matt Turck (FirstMark)
Big data landscape v 3.0 - Matt Turck (FirstMark) Big data landscape v 3.0 - Matt Turck (FirstMark)
Big data landscape v 3.0 - Matt Turck (FirstMark)
Matt Turck
 

La actualidad más candente (20)

Data scienceppt
Data sciencepptData scienceppt
Data scienceppt
 
Key Failure Factors of Building a Data Scientist Team
Key Failure Factors of Building a Data Scientist TeamKey Failure Factors of Building a Data Scientist Team
Key Failure Factors of Building a Data Scientist Team
 
متن‌بازسازی کلان‌داده
متن‌بازسازی کلان‌دادهمتن‌بازسازی کلان‌داده
متن‌بازسازی کلان‌داده
 
Big data using Public Cloud
Big data using Public CloudBig data using Public Cloud
Big data using Public Cloud
 
Georgia Tech Data Science Hackathon September 2016
Georgia Tech Data Science Hackathon September 2016Georgia Tech Data Science Hackathon September 2016
Georgia Tech Data Science Hackathon September 2016
 
Microsoft Data Science Technologies 201608
Microsoft Data Science Technologies 201608Microsoft Data Science Technologies 201608
Microsoft Data Science Technologies 201608
 
How To Become a Data Scientist in Iran Marketplace
How To Become a Data Scientist in Iran Marketplace How To Become a Data Scientist in Iran Marketplace
How To Become a Data Scientist in Iran Marketplace
 
Tools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsTools for Unstructured Data Analytics
Tools for Unstructured Data Analytics
 
Big Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBig Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should Know
 
Big data analytics presented at meetup big data for decision makers
Big data analytics presented at meetup big data for decision makersBig data analytics presented at meetup big data for decision makers
Big data analytics presented at meetup big data for decision makers
 
How Big Companies plan to use Our Big Data 201610
How Big Companies plan to use Our Big Data 201610How Big Companies plan to use Our Big Data 201610
How Big Companies plan to use Our Big Data 201610
 
The big data value chain r1-31 oct13
The big data value chain r1-31 oct13The big data value chain r1-31 oct13
The big data value chain r1-31 oct13
 
Glue Conference
Glue ConferenceGlue Conference
Glue Conference
 
Big data landscape v 3.0 - Matt Turck (FirstMark)
Big data landscape v 3.0 - Matt Turck (FirstMark) Big data landscape v 3.0 - Matt Turck (FirstMark)
Big data landscape v 3.0 - Matt Turck (FirstMark)
 
5 important trends in big data cloud & big data services
5 important trends in big data cloud & big data services5 important trends in big data cloud & big data services
5 important trends in big data cloud & big data services
 
presentation
presentationpresentation
presentation
 
Foresight conversation
Foresight conversationForesight conversation
Foresight conversation
 
Big Data Cloud June 3rd Meetup - Presentation by Mark Davis
Big Data Cloud June 3rd Meetup - Presentation by Mark Davis Big Data Cloud June 3rd Meetup - Presentation by Mark Davis
Big Data Cloud June 3rd Meetup - Presentation by Mark Davis
 
Big data big rewards
Big data big rewards Big data big rewards
Big data big rewards
 
Linked Open Government Data and the Semantic Web
Linked Open Government Data and the Semantic WebLinked Open Government Data and the Semantic Web
Linked Open Government Data and the Semantic Web
 

Similar a Text Analytics World - Expert System USA

Business Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search EngineBusiness Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search Engine
ankur881120
 
ΟΚΤΩΒΡΙΟΣ 2010
ΟΚΤΩΒΡΙΟΣ 2010ΟΚΤΩΒΡΙΟΣ 2010
ΟΚΤΩΒΡΙΟΣ 2010
steverz
 

Similar a Text Analytics World - Expert System USA (20)

Business Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search EngineBusiness Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search Engine
 
Business intelligence 3.0 and the data lake
Business intelligence 3.0 and the data lakeBusiness intelligence 3.0 and the data lake
Business intelligence 3.0 and the data lake
 
Data Curation @ SpazioDati - NEXA Lunch Seminar
Data Curation @ SpazioDati - NEXA Lunch SeminarData Curation @ SpazioDati - NEXA Lunch Seminar
Data Curation @ SpazioDati - NEXA Lunch Seminar
 
Bearish SEO: Defining the User Experience for Google’s Panda Search Landscape
Bearish SEO: Defining the User Experience for Google’s Panda Search LandscapeBearish SEO: Defining the User Experience for Google’s Panda Search Landscape
Bearish SEO: Defining the User Experience for Google’s Panda Search Landscape
 
Test
TestTest
Test
 
Google
GoogleGoogle
Google
 
Intelligent search | Semantic Search
Intelligent search | Semantic SearchIntelligent search | Semantic Search
Intelligent search | Semantic Search
 
Google Research Paper
Google Research PaperGoogle Research Paper
Google Research Paper
 
Broad Data
Broad DataBroad Data
Broad Data
 
Interactive and Conversational Search with Google Cloud and Elasticsearch
Interactive and Conversational Search with Google Cloud and ElasticsearchInteractive and Conversational Search with Google Cloud and Elasticsearch
Interactive and Conversational Search with Google Cloud and Elasticsearch
 
Orbyfy Overview - Solutions_vF_x.pdf
Orbyfy Overview - Solutions_vF_x.pdfOrbyfy Overview - Solutions_vF_x.pdf
Orbyfy Overview - Solutions_vF_x.pdf
 
The Future State of Document Management, Taxonomies and Metadata in the Cloud
The Future State of Document Management, Taxonomies and Metadata in the CloudThe Future State of Document Management, Taxonomies and Metadata in the Cloud
The Future State of Document Management, Taxonomies and Metadata in the Cloud
 
The anatomy of google
The anatomy of googleThe anatomy of google
The anatomy of google
 
Search V Next Final
Search V Next FinalSearch V Next Final
Search V Next Final
 
Semantic Web Science
Semantic Web ScienceSemantic Web Science
Semantic Web Science
 
Career_Jobs_in_Data_Science.pptx
Career_Jobs_in_Data_Science.pptxCareer_Jobs_in_Data_Science.pptx
Career_Jobs_in_Data_Science.pptx
 
ΟΚΤΩΒΡΙΟΣ 2010
ΟΚΤΩΒΡΙΟΣ 2010ΟΚΤΩΒΡΙΟΣ 2010
ΟΚΤΩΒΡΙΟΣ 2010
 
3 Understanding Search
3 Understanding Search3 Understanding Search
3 Understanding Search
 
BrightTALK - Semantic AI
BrightTALK - Semantic AI BrightTALK - Semantic AI
BrightTALK - Semantic AI
 
The Searchmaster's Toolbox - David Hawking, Funnelback Search
The Searchmaster's Toolbox - David Hawking, Funnelback SearchThe Searchmaster's Toolbox - David Hawking, Funnelback Search
The Searchmaster's Toolbox - David Hawking, Funnelback Search
 

Último

+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
Health
 
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Medical / Health Care (+971588192166) Mifepristone and Misoprostol tablets 200mg
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
VictoriaMetrics
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
shinachiaurasa2
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
masabamasaba
 

Último (20)

+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
WSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go Platformless
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
 
%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
 
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
 
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
 
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdf
 
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park %in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
 
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
 

Text Analytics World - Expert System USA

  • 1. Text Analytics World San Francisco – March 31, 2015 4:15-4:45pm Speaker: Bryan Bell, Executive Vice President, Expert System USA  What is in Your Business Requirement: Searching or Finding? Enterprise Search  Product Demonstration: The Google Search Appliance (GSA) integrated with a semantic technology platform. 1. Internal and external information comes at us faster than we can keep up with. 2. Business expectations for deploying solutions, using enterprise search and content navigation systems to capture the hidden value of strategic information. 3. CONTEXT: Exploiting deep linguistic analysis, combined with semantics offers the ability to create contextually correct metadata. 4. Dynamically enrich content with contextually relevant metadata and deploy as the heart of a knowledge management applications and the Google Search Appliance.
  • 2. 1. Internal and external information comes at us faster than we can keep up with. 80 – 90% is unstructured text.
  • 4. 4  The Google crawler visits 20 billion web sites a day.  The search engine has located more than 30 trillion unique URLs.  Processes 100 billion searches every month. • 3.3 billion searches per day. • Over 38,000 thousand searches per second. • A single Google query uses 1,000 computers to retrieve an answer. • This volume combined with the PageRank algorithm… PR(A) = (1-d) + d (PR(T1)/C(T1) + PR(Tn)/C(Tn)) …. is why Google is so good on the internet. • 16% to 20% of queries that get asked every day have never been asked before. Amit Singhal, Senior Vice President of development, Google Search August 2012 The Internet
  • 5. 2. Deploying internal enterprise search engine / content navigation system to capture and share the hidden value of the information that is available to the company. The intranet / corporate portal
  • 6. 2. Deploying internal enterprise search engine / content navigation system to capture and share the hidden value of the information that is available to the company. The intranet / corporate portal
  • 7. “Our search stinks! I want it to work like Google.”
  • 8.
  • 9. 9 Zettabyte 1,000,000,000,000,000,000,000 bytes Good news: PR(A) = (1-d) + d (PR(T1)/C(T1) + ... + PR(Tn)/C(Tn)) Don’t have 3.3 billion searches per day. Don’t have 38,000 thousand searches per second. Don’t have 1,000 computers to retrieve an answer.
  • 11. 11 Zettabyte 1,000,000,000,000, 000,000,000 bytes Key words No metadata Poor metadata Inconsistent = POOR CONTENT FINDABILITY
  • 12. 12 stock People are able to disambiguate “on the fly”, but machines cannot. Key words vs. Context Language ambiguity
  • 13. 13 People are able to disambiguate “on the fly”, but machines cannot. stock apple Key words vs. Context Language ambiguity
  • 14. 14 stock apple Apple People are able to disambiguate “on the fly”, but machines cannot. Key words vs. Context Language ambiguity
  • 15. 15 stock apple Apple “I bought 10,000 shares of stock in Apple.” “I have 10,000 apples in stock.” People are able to disambiguate “on the fly”, but machines cannot. Context is King
  • 16. 3. Exploiting deep linguistic analysis, combined with semantics. 4. Dynamically enrich content with contextually relevant metadata. How is word context established?
  • 17. Morphological analysis word forms dog, dog-catcher, doggy bag Grammatical analysis parts of speech "There are 40 rows in the table." (noun) "She rows 5 times a week." (verb) Logical analysis word relationships "The car I bought, to replace my Chrysler, stinks." Semantic analysis word context "I bought 10,000 shares of stock in Apple." "I have 10,000 apples in stock." "I used chicken broth for my soup stock." Deep linguistic analysis of words to achieve word disambiguation. How is word context established and deployed with the GSA?
  • 19. 20 Linguistic and semantic analysis engine
  • 20. 27 Case Study: GSA – Google Search Appliance What is in Your Business Requirement? Searching or Finding.