LIGHTNING TALKS
Powered by Lucene:
IBM Content Analytics with Enterprise Search




Wolfgang Jung



Barcelona, 19th October 2011               © 2011 IBM Corporation



Our agenda in the next 10 minutes
    IBM is committed to Open Source
     – A decade of contributions to the community.

    Adoption of Apache Lucene in IBM Content Analytics
    – The why, the what, and examples.

    Demonstration of IBM Content Analytics
    – See the development results live.
               Be enlightened!




IBM is committed to Open Source

    Decade of lineage and contributions to the open source community
      – Apache Hadoop.
          IBM’s use of BigIndex for Search is mentioned in Chuck Lam’s “Hadoop in Action”
      – Apache Derby
      – Apache Geronimo and Jetty
      – Eclipse: Founded by IBM, PMC Board of Directors
      – Apache UIMA: Unstructured Information Management Architecture.
          Developed by IBM, Contributed to Apache
      – Apache Jakarta: Lucene. PMC members
          Significant contributions via IBM Lucene Extension Library (ILEL)
      – Linux ... and more!





Adoption of Apache Lucene
to IBM Content Analytics with Enterprise Search
    UIMA has been in use since the first release of IBM OmniFind in 2005, through the
    later IBM Content Analytics, and continues in today’s IBM Content Analytics with
    Enterprise Search
         http://www-01.ibm.com/software/data/content-management/analytics/uima.html


    Why IBM decided to use Lucene
      – The index is a common technology and easier to improve
      – Lower cost of maintenance
      – Advantage in incremental indexing
      – Extensibility






Adoption of Apache Lucene
to IBM Content Analytics with Enterprise Search
    IBM is a very active contributor. Look for PMC members:
      –Michael McCandless; Shai Erera; Doron Cohen
         http://lucene.apache.org/who.html

    IBM extended Lucene based on our needs. Two examples already
    contributed to the community:
      –Query Parser
      –Facets







Adoption of Apache Lucene
to IBM Content Analytics with Enterprise Search
    On 13th December 2006, IBM and Yahoo! announced IBM OmniFind Yahoo! Edition, a
    “no-cost, entry level enterprise search product developed to help eliminate financial and
    technology barriers to intranet and Web search.”
         http://www-03.ibm.com/press/us/en/pressrelease/20767.wss

    This product used Lucene as its index technology and was fully supported by IBM
      – 45,000+ downloads from the website http://omnifind.ibm.yahoo.net
      – IBM support contracts for clients with “IBM Elite Support for OmniFind Yahoo Edition”
      – Fewer than 15 incidents concerning the index technology


    The technology is seen as a success for IBM






Content Analytics generates new insights and aggregates key
findings gathered from large data volumes in a visualized form

    [Figure: content analytics pipeline]
      – Sources of information: internal (ECM, files, DBMS, etc.) and external (social, news, etc.)
      – Analysed documents with identified concepts
          Example: “Claus sprained his ankle on the step”
          Claus = Noun, Person; sprained = Verb, Injury; his ankle = Noun Phrase, Body Part; on the step = Prep Phrase, Location
          Extracted concept: Claimant: Soft Tissue Injury
      – Automatic visualizing: results of concept evaluation are displayed to the users
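The example sentence from this slide can be written out as data to show how syntactic and semantic labels combine into an extracted concept. This is a minimal sketch, not the product's implementation: the `Annotation` class and the `extract_concept` rule are hypothetical, and the annotations are copied by hand from the slide rather than produced by UIMA annotators.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Annotation:
    """One labelled span of the sentence (hypothetical structure)."""
    text: str     # covered text
    syntax: str   # syntactic label shown on the slide
    concept: str  # semantic label shown on the slide


SENTENCE = "Claus sprained his ankle on the step"

# Hand-written annotations taken from the slide; a real pipeline
# would compute these with its annotators.
ANNOTATIONS = [
    Annotation("Claus", "Noun", "Person"),
    Annotation("sprained", "Verb", "Injury"),
    Annotation("his ankle", "Noun Phrase", "Body Part"),
    Annotation("on the step", "Prep Phrase", "Location"),
]


def extract_concept(annotations: List[Annotation]) -> Optional[str]:
    """Assumed rule: Person + Injury + Body Part yields the claimant concept."""
    kinds = {a.concept for a in annotations}
    if {"Person", "Injury", "Body Part"} <= kinds:
        return "Claimant: Soft Tissue Injury"
    return None


if __name__ == "__main__":
    for a in ANNOTATIONS:
        print(f"{a.text!r}: {a.syntax} / {a.concept}")
    print(extract_concept(ANNOTATIONS))
```

The rule is deliberately trivial; the point is only that the extracted concept is derived from the combination of labels, not from any single word.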








Rapid Insights from Automotive Complaints

    We will be using publicly available data from the National Highway Traffic Safety Administration (NHTSA)
    to demonstrate how IBM Content Analytics can be used to identify problems with automobiles.
    NHTSA receives various reports about malfunctions, accidents, and other issues with automobiles
    from dealerships, repair facilities, and from the general public. NHTSA publishes the data at
    http://www.nhtsa.gov. For this demo we have created a collection from the NHTSA “complaints”
    data spanning several years ending in early 2010. We will show how this and similar data can be
    analyzed to arrive at rapid insights not possible by manually reading through the complaint records.
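Facets, named earlier in the talk as one of IBM's Lucene contributions, are one way such insights can surface: counting field values across all records that match a filter. The sketch below illustrates the idea in plain Python and does not use the Lucene facet API. The records, field names (`make`, `component`, `year`), and the `facet_counts` helper are made up for illustration and are not actual NHTSA data.

```python
from collections import Counter

# Made-up records for illustration only; not actual NHTSA data.
COMPLAINTS = [
    {"make": "MakeA", "component": "BRAKES", "year": 2008},
    {"make": "MakeA", "component": "BRAKES", "year": 2009},
    {"make": "MakeB", "component": "AIR BAGS", "year": 2009},
    {"make": "MakeA", "component": "ENGINE", "year": 2009},
]


def facet_counts(records, field, **filters):
    """Count the values of `field` among records that match every filter."""
    matching = (
        r for r in records
        if all(r.get(key) == value for key, value in filters.items())
    )
    return Counter(r[field] for r in matching)


if __name__ == "__main__":
    # Overall distribution of complaints by component.
    print(facet_counts(COMPLAINTS, "component"))
    # Drill down: components for one make in one year.
    print(facet_counts(COMPLAINTS, "component", make="MakeA", year=2009))
```

Drilling down by adding filters and recounting is the interaction pattern a faceted analytics UI builds on; a search engine computes the same counts from its index instead of scanning records.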







See Content Analytics live!








                                               Be enlightened!




Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 

Último (20)

[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 

Lightning talk: IBM Content Analytics with Enterprise Search - Wolfgang Jung

  • 1. LIGHTNING TALKS Powered by Lucene: IBM Content Analytics with Enterprise Search Wolfgang Jung Barcelona, 19th October 2011 © 2011 IBM Corporation
  • 2. Our agenda for the next 10 minutes. IBM is committed to open source: a decade of contributions to the community. Adoption of Apache Lucene in IBM Content Analytics: the why, the what, and examples. Demonstration of IBM Content Analytics: see the development results live. Be enlightened!
  • 3. IBM is committed to open source. A decade of lineage and contributions to the open source community: Apache Hadoop (IBM's use of BigIndex for search is mentioned in Chuck Lam's "Hadoop in Action"); Apache Derby; Apache Geronimo and Jetty; Eclipse (founded by IBM; PMC, Board of Directors); Apache UIMA, the Unstructured Information Management Architecture (developed by IBM, contributed to Apache); Apache Jakarta: Lucene (PMC members; significant contributions via the IBM Lucene Extension Library, ILEL); Linux ... and more!
  • 4. Adoption of Apache Lucene in IBM Content Analytics with Enterprise Search. UIMA has been in use since the first release of IBM OmniFind in 2005, later in IBM Content Analytics, and now in today's IBM Content Analytics with Enterprise Search: http://www-01.ibm.com/software/data/content-management/analytics/uima.html IBM's reasons for choosing Lucene: the index is a common technology and easier to improve; lower cost of maintenance; advantages in incremental indexing; extensibility.
  • 5. Adoption of Apache Lucene in IBM Content Analytics with Enterprise Search. IBM is a very active contributor. Look for the PMC members Michael McCandless, Shai Erera, and Doron Cohen: http://lucene.apache.org/who.html IBM extended Lucene based on its own needs. Two examples already contributed to the community: Query Parser; Facets.
  • 6. Adoption of Apache Lucene in IBM Content Analytics with Enterprise Search. On 13th December 2006, IBM and Yahoo! announced IBM OmniFind Yahoo! Edition, a "no-cost, entry level enterprise search product developed to help eliminate financial and technology barriers to intranet and Web search." http://www-03.ibm.com/press/us/en/pressrelease/20767.wss This product used Lucene as its index technology and was fully supported by IBM: 45,000+ downloads from the website http://omnifind.ibm.yahoo.net; IBM support contracts for clients with "IBM Elite Support for OmniFind Yahoo Edition"; fewer than 15 incidents regarding the index technology. The technology is seen as a success for IBM.
  • 7. Content Analytics generates new insights and aggregates key findings gathered from large data volumes in a visualized form. [Diagram] Sources of information: internal (ECM, files, DBMS, etc.) and external (social, news, etc.). Analysed documents with identified concepts: the example sentence "Claus sprained his ankle on the step", labelled with Noun, Verb, Noun Phrase, Prep Phrase and with Person, Injury, Body Part, Location. Extracted concept: "Claimant: Soft Tissue Injury". Automatic visualizing: results of concept evaluation are displayed to the users.
  • 8. Rapid Insights from Automotive Complaints. We will be using publicly available data from the National Highway Traffic Safety Administration (NHTSA) to demonstrate how IBM Content Analytics can be used to identify problems with automobiles. NHTSA receives various reports about malfunctions, accidents, and other issues with automobiles from dealerships, repair facilities, and the general public. NHTSA publishes the data at http://www.nhtsa.gov. For this demo we have created a collection from the NHTSA "complaints" data spanning several years ending in early 2010. We will show how this and similar data can be analyzed to arrive at rapid insights not possible by manually reading through the complaint records.
  • 9. See Content Analytics live!
  • 10. See Content Analytics live!
  • 11. Be enlightened!
  • 12. LIGHTNING TALKS Powered by Lucene: IBM Content Analytics with Enterprise Search Wolfgang Jung Barcelona, 19th October 2011