Apache UIMA and Semantic Search

My introduction to Apache UIMA and semantic search, given at the 2nd IKS Workshop in Rome on 13 November 2009.

Published in: Technology, Education
  • This is because the original slide deck was made with Apple Keynote.
  • I believe this slide deck cannot be downloaded: the link points to a 'key' file instead of a PDF or other typical presentation file.
  • The code snippet should have appeared with tags to highlight the annotations:

    ||book||title||Pride and Prejudice||title|| was written by||author||
    Jane Austen||author||||book||, while ||book||||author||Joseph
    Conrad||author|| wrote ||title||Heart of Darkness||title||||book||.
  • It means that if you use Lucas to 'write down' your documents' annotations in fields, you will have enriched documents, and that is good, but people often understand 'semantic search' differently. In fact, even if you have done named entity recognition with UIMA and then written the named-entity annotations to Lucene fields, you won't be able to run queries that take relations and context into account.
    Take the following example, borrowed from a previous conversation I had with Thilo Goetz:

    Pride and Prejudice was written by
    Jane Austen, while Joseph
    Conrad wrote Heart of Darkness.

    'If you take the Lucas approach, you will not be able to
    find out what books Jane Austen has written, because authors
    and titles are indexed in different fields, independently.'

    I hope this clarifies things a little :-)
  • What does 'Lucas example' mean on slide number 4?
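The limitation described in the comments above (annotations written to independent fields lose the relation between an author and a title) can be sketched with a toy fielded index. This is only an illustration, not Lucas or Lucene code: `build_field_index`, `titles_by_author_flat`, and `titles_by_author_relational` are hypothetical names, and the author, title, and book annotations are hard-coded stand-ins for what an annotator might produce.

```python
from collections import defaultdict

# Hypothetical annotations for the example sentence, hard-coded for illustration.
AUTHORS = ["Jane Austen", "Joseph Conrad"]
TITLES = ["Pride and Prejudice", "Heart of Darkness"]
# The enclosing "book" annotations are what tie a title to its author.
BOOKS = [
    {"title": "Pride and Prejudice", "author": "Jane Austen"},
    {"title": "Heart of Darkness", "author": "Joseph Conrad"},
]


def build_field_index(docs):
    """Toy fielded inverted index: (field, value) -> set of document ids."""
    index = defaultdict(set)
    for doc_id, fields in docs.items():
        for field_name, values in fields.items():
            for value in values:
                index[(field_name, value)].add(doc_id)
    return index


# Flat approach: one document, authors and titles in independent fields.
flat_docs = {"doc1": {"author": AUTHORS, "title": TITLES}}
flat_index = build_field_index(flat_docs)


def titles_by_author_flat(author):
    """The index only knows which documents mention the author."""
    hits = flat_index.get(("author", author), set())
    # Every title in a matching document is returned, related or not.
    return sorted(t for doc_id in hits for t in flat_docs[doc_id]["title"])


def titles_by_author_relational(author):
    """Keeping the enclosing book annotation preserves the relation."""
    return sorted(b["title"] for b in BOOKS if b["author"] == author)


print(titles_by_author_flat("Jane Austen"))
print(titles_by_author_relational("Jane Austen"))
```

The flat lookup returns both titles for Jane Austen, because the document matches on the author field and nothing records which title belongs to which author. The relational lookup returns only Pride and Prejudice.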

Apache UIMA and Semantic Search

  1. Apache UIMA & Semantic Search: Tommaso Teofili, tommaso@apache.org
  2. Apache UIMA - what is it? Unstructured Information Management Architecture; architectural framework to manage (potentially large) volumes of unstructured data; former IBM alphaWorks project donated to the ASF; currently an Incubator podling ( http://incubator.apache.org/uima ); Apache UIMA is an OASIS standard ( http://www.oasis-open.org )
  3. Apache UIMA - how? Many pluggable, reusable components (described via XML); Analysis Engines (primitive or aggregate); asynchronous scaleout (JMS, Apache ActiveMQ); flow controllers; type systems
  4. Apache UIMA - what is it NOT? It's not inherently a semantic search tool; the “Lucas example”; the semantic search package for UIMA is not open source! ( http://www.alphaworks.ibm.com/tech/uima/download )
  5. UIMA & Semantic Search: metadata generation engine for CM systems; data enrichment; linked data; Jeopardy (see http://www.research.ibm.com/deepqa/faq.shtml#24 ); let's see...
  6. RE Market Analysis & UIMA: Macpi, a real estate market analysis tool developed at DIA; Webpipe (crawling and wrapping data); Apache UIMA; Spring framework; knowledge extraction; extract metadata with Apache UIMA to build our search
  7. Apache UIMA & AlchemyAPI: AlchemyAPI from Orchestr8, services wrapped as UIMA AEs; named-entity recognition, word disambiguation; “Barack Obama” is http://dbpedia.org/resource/Barack_Obama ; exploiting linked data; enriching free text with DBpedia, GeoNames, Freebase URIs; plugging in with other UIMA AEs; providing you with a reusable component to deal with Linked Data
  8. UIMA & Semantic Search: it's demo time!
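
The ideas on slides 3 and 7 (primitive analysis engines combined into an aggregate, typed annotations, and entities enriched with linked-data URIs) can be sketched roughly as follows. This is plain Python, not the UIMA API: `Annotation`, `Document`, `entity_annotator`, `linked_data_annotator`, and `aggregate` are hypothetical names, and the lookup table is an assumption standing in for a real service such as AlchemyAPI. The only URI used is the one shown on slide 7.

```python
from dataclasses import dataclass, field


@dataclass
class Annotation:
    """Standoff annotation: a typed span over the text plus optional features."""
    type: str
    begin: int
    end: int
    features: dict = field(default_factory=dict)


@dataclass
class Document:
    """Text plus the annotations added by each engine."""
    text: str
    annotations: list = field(default_factory=list)

    def covered_text(self, ann):
        return self.text[ann.begin:ann.end]


# Hypothetical lookup table standing in for a remote entity-linking service.
KNOWN_ENTITIES = {"Barack Obama": "http://dbpedia.org/resource/Barack_Obama"}


def entity_annotator(doc):
    """Primitive engine: mark every occurrence of a known entity name."""
    for name in KNOWN_ENTITIES:
        start = doc.text.find(name)
        while start != -1:
            doc.annotations.append(Annotation("Entity", start, start + len(name)))
            start = doc.text.find(name, start + len(name))


def linked_data_annotator(doc):
    """Primitive engine: attach a linked-data URI to existing Entity annotations."""
    for ann in doc.annotations:
        if ann.type == "Entity":
            uri = KNOWN_ENTITIES.get(doc.covered_text(ann))
            if uri is not None:
                ann.features["uri"] = uri


def aggregate(*engines):
    """Aggregate engine: run the given engines in a fixed order."""
    def run(doc):
        for engine in engines:
            engine(doc)
        return doc
    return run


pipeline = aggregate(entity_annotator, linked_data_annotator)
doc = pipeline(Document("Barack Obama spoke in Rome."))
for ann in doc.annotations:
    print(ann.type, doc.covered_text(ann), ann.features)
```

Because the second engine only reads annotations left by the first, either one could be swapped for another component that produces or consumes the same annotation type, which is the reuse the slides point to.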
