SlideShare una empresa de Scribd logo
1 de 23
Descargar para leer sin conexión
From Google Search to
        Semantic Exploration

                 Jon Atle Gulla
                 Professor
                 Norwegian University of Science and Technology
                 jag@idi.ntnu.no




                                Semantic Days 2007
Jon Atle Gulla
Agenda

      Traditional search applications
      Adding shallow linguistics to traditional search
      The concept of semantic search
      Ontologies in search applications
      Ontologies for semantic annotation & exploration
      Ontology-driven query interpretation


         quot;Hakia thinks that indexing has plateaued and that semantic technologies will take over
          quot;Hakia thinks that indexing has plateaued and that semantic technologies will take over
                                    for the next generation of searchquot;.
                                     for the next generation of searchquot;.
                   MacManus, R. “Hakia Takes On Google With Semantic Technologies”.
                    MacManus, R. “Hakia Takes On Google With Semantic Technologies”.
          http://www.readwriteweb.com/archives/hakia_takes_on_google_semantic_search.php
           http://www.readwriteweb.com/archives/hakia_takes_on_google_semantic_search.php



                                             Semantic Days 2007
Jon Atle Gulla
The Language Problem in Search

      People use the language differently


         Authors

                                                           ?
                                                    cument
                                                  do        query?
                                         t of the        e
                                       n
                                e conte ent answers th
                             th
                  What is his docum
                               t
                    k n o w if
             How to
     Information
            user




                                  Semantic Days 2007
Jon Atle Gulla
The Google Search Experience


                                       Query



                                       Similarity
                                        Similarity
                                      Page rank      Index
                                       Page rank      Index
                                      Linguistics
                                       Linguistics


                                       Results




                 Semantic Days 2007
Jon Atle Gulla
Traditional Search Principles
      Bag-of-words principle                                     You see:
            Machine understands document as a
            set of word frequencies                                A drilling rig or oil rig is a structure housi
                                                                 equipment used to drill for and extract oil or
      Word matching principle                                     natural gas from underground reservoirs.
                                                                   Drilling rigs can also be used to drill for
            Syntactic search:
                                                                      water or for exploration purposes.
            Relevant documents are documents
            that contain exactly those words that
            appear in the query
                                                                 Machine sees:
            Morpho-syntactic search:
                                                                 drill(4)           purpos(1)
            Relevant documents are documents                     equip(1)           reservoir(1)
            that contain inflectional variants of                explor(1)          rig(3)
                                                                 extract(1)         structur(1)
            exactly those words that appear in
                                                                 hous(1)            underground(1)
            the query                                            natur(1)           water(1)
                                                                 oil(2)
      One shot principle
            Query and result set ignored when
            new query is posted



                                            Semantic Days 2007
Jon Atle Gulla
Traditional Search Principles
      Bag-of-words principle                                    User need:

            Machine understands document                        Christmas tree
            as a set of word frequencies
      Word matching principle
            Syntactic search:
            Relevant documents are
                                                                                                               Index
            documents that contain exactly
            those words that appear in the                A Christmas tree is one of the most popular traditions
            query                                           A Christmas tree is one of the most popular traditions
                                                          associated with the celebration
                                                            associated with the celebration
                                                          of Christmas.
            Morpho-syntactic search:                        of Christmas.
                                                          It is normally an evergreen coniferous tree that
            Relevant documents are                          It is normally an evergreen coniferous tree that
                                                          is brought into a home or used in the open, and is
                                                            is brought into a home or used in the open, and is
            documents that contain                        decorated with Christmas lights and colourful
                                                            decorated with Christmas lights and colourful
                                                          ornaments during the days around Christmas.
            inflectional variants of exactly                ornaments during the days around Christmas.
            those words that appear in the                        A Christmas tree is a set of valves, pipes, and fittings
                                                                   A Christmas tree is a set of valves, pipes, and fittings
                                                                  used to control the flow of oil and gas as it leaves a well
            query                                                  used to control the flow of oil and gas as it leaves a well
                                                                  and enters a pipeline.
                                                                   and enters a pipeline.
      One shot principle
            Query and result set ignored
            when new query is posted                             Relevance given by document similarity




                                           Semantic Days 2007
Jon Atle Gulla
Traditional Search Principles
      Bag-of-words principle                                     Search query
                         Implementation:
            Machine understands document                          Christmas trees

        as a set of word frequencies
      Word matchingDocument relevant to query if cosine similarity
                        principle
                       above a certain threshold:
            Syntactic search:                                  Result set
            Relevant documents are            n

                                            ∑
            documents that contain exactly (q *d )
            those words that appear in the1 i i                 q       d
                                                             =( ) •( )
                            sim(q, d) = i=
            query
                                            n         n
                                                                q       d
                                           ∑          ∑
            Morpho-syntactic search:             2         2
                                               qi *      di
            Relevant documents are
                                          i =1      i =1
            documents that contain
            inflectional variants of exactly
            those words thatvector representation of document
                           d: appear in the
            query          q: vector representation of vector
      One shot principle
            Query and result set ignored
            when new query is posted



                                            Semantic Days 2007
Jon Atle Gulla
Adding Shallow Linguistics to Search

                           Clustering or log analysis for grouping
                           search results for ‘oil’




                                                  Text categorizsation
                                                         Entity search
                                                   Teaser generation
                                                        Spell checking
                                                          Collocations


                 Semantic Days 2007
Jon Atle Gulla
But
  “A drilling rig or oil rig is a structure housing equipment used to drill for
  and extract oil or natural gas from underground reservoirs. Drilling rigs
  can also be used to drill for water or for exploration purposes.” (Ref:
  Wikipedia)
    Semantic Search Principle:
    Semantic Search Principle:
                                                  Text is still just a set of strings
                                rig
                            subclassOf

                              sameAs
                   drill(4) oil rig       purpos(1)
             drilling rig
                   equip(1)        partOf reservoir(1)
                     usedFor
                                             drill
                   explor(1)              rig(3)
                   extract(1)gas          structur(1)
        water             natural
                 oil
                   hous(1)                underground(1)
                   natur(1)               water(1)
    Use ontologies oil(2)
    Use ontologiesto represent domain vocabulary,
                      to represent domain vocabulary,
    documents’ content and/or user’s information needs
     documents’ content and/or user’s information needs


                                         Semantic Days 2007
Jon Atle Gulla
Semantic Approaches to Search
      Search principles
                                Syntactic search            Semantic search
       Document view            Bag-of-words                Terms and concepts
       Search approach          Word matching               Concept matching
       Search process           One shot                    Exploratory session


      Applications of ontologies in semantic search:
            Help user formulate semantic queries                              Scientific reports
            Reformulate/reinterpret queries                                   IIP project
            Browse domain
            Formulate related queries
            Interoperability between search applications
            Semantic indexing of documents




                                           Semantic Days 2007
Jon Atle Gulla
1. Ontologies in Semantic Exploration
      Use graphical ontologies for query formulation
            Semantic annotations of documents
            Construct queries graphically
            Use ontological structures to expand query
            Use ontology to visualize search results




                                      Semantic Days 2007
Jon Atle Gulla
Query Formulation
      Queries expanded from ontological structures




                             Semantic Days 2007
Jon Atle Gulla
Query Refinement
      Use ontological structures to explore the domain




                              Semantic Days 2007
Jon Atle Gulla
2. Ontology-Driven Query Interpretation
                 User terminology


                                      User query
                                       User query




                 Semantic layer
                                                                                        ---     -----

                                                                                        ---     -----




                      Semantic Query interpretation
                                                                                        ---     -----
                                                           ---   -----
                                                                                        ---     -----




                                                                                                                        Ontology trained
                                                           ---   -----
                                                                                        ---     -----




                       Semantic Query interpretation
                                                           ---   -----

                                                           ---   -----

                                                           ---   -----



                                                                         ---    -----

                                                                         ---    -----




                                                                                                                        on person and
                                                                         ---    -----

                                                                         ---    -----




                          User          Query                            ---    -----




                           User          Query                                                                          domain collection
                      interpretation   mapping
                                                                                          ---     -----

                                                                                          ---     -----

                                                                                          ---     -----




                       interpretation   mapping
                                                                                          ---     -----
                                                                          ---   -----                     ---   -----
                                                                                          ---     -----
                                                                          ---   -----                     ---   -----
                                                                          ---   -----                     ---   -----
                                                                          ---   -----                     ---   -----

                                                                          ---   -----                     ---   -----




                 Domain collection

                                                                                                                         Domain document
                                       Standard
                                        Standard
                                                                                                                         collection
                                     search engine
                                      search engine




                                               Semantic Days 2007
Jon Atle Gulla
Training Ontology for Search
                                       Characteristic terms in
                                     these documents express
                                       user’s interpretation of
   christmas tree



                                     CHRISTMAS TREE for this
                                        document collection




                                           Concept     Prominent document terms
                                          CHRISTMAS TREE
                                                       0.95   christmas tree
                                                       0.80   christmas trees
                                                       0.35   x-tree
                    Documents                          0.05   valves
                    viewed by
                                                       0.02   wellhead
                    user (and
                    considered
                    relevant)




                     Semantic Days 2007
Jon Atle Gulla
The Personalized Ontology
      Each concept described Ontology of weightedIndex terms
                              in terms             words
      Words correspond to user’s assessment of which information is relevant to
      a concept for this document base
      Concept – term associations created automatically based on user’s
      behavior
                            Football ontology
                                                                Concept
                                                                                         Index terms
                                                            CHRISTMAS TREE
                                                                           0.95         christmas tree
                                                          WELL
                         Concept-term matrix a dynamic structure                 0.80   christmas trees
                         that reflects user’s preferences and                    0.35   x-tree
                                                            0.35
                                                PIPE
                         behavior                                                0.05
                                                                                        valves
                                                                          0.50   0.02
                                                                                        wellhead
                                                                                 0.95
                                                                                        well
                                                                                 0.98
                                                                                        wells
                                                                                        ...
                                                                                 0.95   pipe
                                                                                 0.10   pipes




                                     Semantic Days 2007
Jon Atle Gulla
Semantic Search Query
                                                                       Retrieved from ontology
                                                                         An artefact that is an assembly of pipes and piping parts, with
    User query                                                            valves and associated control equipment that is connected
                                                                        to the top of a wellhead and is intended for control of fluid from
      CHRISTMAS TREE
                                                                                                      a well

                                                                       Matches in document base
                                       Query mapping                    Christmas trees are used on both subsea and surface wellheads
                                                                        and both are available in a wide range of sizes and configurations, ...
           christmas tree:0.95, christmas trees:0.8,
                                                                        A Christmas tree is one of the most popular traditions associated
           x-tree:0.35, valves:0.05, wellhead:0.04                      with the celebration of Christmas. ...

                                                                        The function of a christmas tree is to both prevent the release of oil or
                                                                        gas from an oil well into the environment and also to direct and control
                                                                        the flow of formation fluids from the well. ...

                                                                        Private Christmas trees are not usually put up until at least the middle
                                                                        of December and are usually taken down by the 6th of January , ...

                                                                        It is normally an evergreen tree that is brought into a home or used in
                                                                        the open, and is decorated with Christmas lights and colourful
       Concept           Prominent document terms                       ornaments during the days around Christmas.

                                                                        Good understanding of topside equipment used, including x-trees
    CHRISTMAS TREE                                                      and wellhead systems
                         0.95     christmas tree
                                                                        Wellhead valves are used to isolate the flow of oil or gas at the
                         0.80     christmas trees                       takeoff from an oil or gas well. .
                         0.35     x-tree                                VENTILTRE er en ventilenhet montert på toppen av stigerør eller
                         0.05                                           brønnhode, ofte kalt juletre
                                  valves
                         0.04                                           A wellhead consists of the spools, valves, and other components
                                  wellhead
                                                                        which contain the pressure within the well.




                                                       Semantic Days 2007
Jon Atle Gulla
Semantic Search Results
                                                                     Retrieved from ontology
                                                                       An artefact that is an assembly of pipes and piping parts, with
                                                                        valves and associated control equipment that is connected
                                                                      to the top of a wellhead and is intended for control of fluid from
         Query/document                                                                             a well
         similarity                                                  Matches in document base
                                                                      Christmas trees are used on both subsea and surface wellheads
                                                plural form
           strong                                                     and both are available in a wide range of sizes and configurations, ...

                                                                      A Christmas tree is one of the most popular traditions associated
                     singular form, but other words different
           weak                                                       with the celebration of Christmas. ...

                                                                      The function of a christmas tree is to both prevent the release of oil or
                                               singular form          gas from an oil well into the environment and also to direct and control
           strong
                                                                      the flow of formation fluids from the well. ...

                       plural form, but other words different 4/4
                                                Precision: 4/4        Private Christmas trees are not usually put up until at least the middle
           weak                                  Precision:           of December and are usually taken down by the 6th of January , ...
                                                Recall:       5/6
                                                 Recall:       5/6    It is normally an evergreen tree that is brought into a home or used in
                           different words, christmas related
             no                                                       the open, and is decorated with Christmas lights and colourful
                                                                      ornaments during the days around Christmas.

                                                                      Good understanding of topside equipment used, including x-trees
                                                  synonyms            and wellhead systems
           strong
                                                                      Wellhead valves are used to isolate the flow of oil or gas at the
                                              related words
        acceptable                                                    takeoff from an oil or gas well. .

                      related words, ontology not trained in          VENTILTRE er en ventilenhet montert på toppen av stigerør eller
             no                                                       brønnhode, ofte kalt juletre
                                             this language
                                                                      A wellhead consists of the spools, valves, and other components
        acceptable                            related words           which contain the pressure within the well.




                                                    Semantic Days 2007
Jon Atle Gulla
Keyword Search Query
    User query                                                               Retrieved from ontology
                                                                               An artefact that is an assembly of pipes and piping parts, with
      x-tree
                                                                                valves and associated control equipment that is connected
                                                                              to the top of a wellhead and is intended for control of fluid from
                                                                                                            a well
                                             User interpretation
                                                                             Matches in document base
                                                                              Christmas trees are used on both subsea and surface wellheads
           CHRISTMAS TREE:0.35
                                                                              and both are available in a wide range of sizes and configurations, ...


                                             Query mapping
                                                                              The function of a christmas tree is to both prevent the release of oil or
                 christmas tree:0.95, christmas trees:0.8,                    gas from an oil well into the environment and also to direct and control
                                                                              the flow of formation fluids from the well. ...
                  x-tree:0.35, valves:0.05, wellhead:0.04




       Concept              Prominent document terms
                                                                              Good understanding of topside equipment used, including x-trees
    CHRISTMAS TREE                                                            and wellhead systems
                            0.95    christmas tree
                                                                              Wellhead valves are used to isolate the flow of oil or gas at the
                            0.80    christmas trees                           takeoff from an oil or gas well. .
                            0.35    x-tree
                            0.05    valves
                            0.04                                              A wellhead consists of the spools, valves, and other components
                                    wellhead
                                                                              which contain the pressure within the well.




                                                             Semantic Days 2007
Jon Atle Gulla
Semantic Search - Learning
 No fixed set of relevant documents – depends on user preferences

                                                         User query
                       User query                          CHRISTMAS TREE
Personalized
concept-
term
matrix
                        Result page




                 Documents viewed
                 by user (and
                 considered relevant)




                                          Semantic Days 2007
Jon Atle Gulla
2. IIP Ontology on Web Documents
User terminology                                     Experiment with real document collection

                       Horizontal
                        Horizontal
                          tree
                           tree
                                                          Mapping to query based on document
                                                          Interpretation of ‘horizontal tree’ tree content
                                                     horizontal
                                                            HORIZONTAL VESSEL 0.162tree 1.0
                                                                  horizontal christmas
                                                                  HORIZONTAL CHRISTMAS TREE Score: WELLHEAD HOUSING 0.109
                                                                                                       0.01488
                                                            HORIZONTAL BOREHOLE 0.138 1.0
                                                                  horizontal christmas trees          CONDUCTOR HOUSING 0.109
                                                                  horizontal x-tree 1.0
                                                                  CONDUCTOR HOUSING, HORIZONTAL VESSEL Score: 0.00586
                                                            HORIZONTAL CHRISTMAS TREE 0.088           WEAR BUSHING 0.101
Semantic layer
                                                                  horixontal x-trees 1.0 HORIZONTAL VESSEL JOINT GASKET 0.096
                                                                  WELLHEAD HOUSING,                   RING Score: 0.00586
                                                            HORIZONTAL TUBING HANGER 0.072
                                                            PLANEWEAR BUSHING, HORIZONTAL VESSELSUBSEA PRODUCTION MANIFOLD 0.096
                                                                                                        Score: 0.00411
                                                                   0.057
                                                                  CHRISTMAS
                                                                  sentre 0.465TREE, HORIZONTAL CHRISTMAS TREE Score: 0.00369
                                                            INTERSECTION 0.055                        TESTING TOOL 0.088
                                                                     ---     -----




                                                                              Ontology adapted
                                                                     ---     -----




      Semantic Query interpretation
                                                                     ---     -----
                                       ---   -----
                                                                     ---     -----
                                       ---   -----
                                                                     ---     -----




       Semantic Query interpretation                              HORIZONTAL CHRISTMAS TREE, HORIZONTAL VESSEL Score: 0.00344
                                                            PIPING END 0.051 0.216                    BORE PROTECTOR 0.088
                                       ---   -----




                                                                  deepwater
                                       ---   -----

                                       ---   -----




                                                            BENDING STRESS 0.043 web documents HORIZONTAL CHRISTMAS TREE 0.085
                                                                              using
                                                     ---    -----




                                                                  TUBING0.092 HORIZONTAL VESSEL Score: 0.00323
                                                                            SPOOL,
                                                     ---    -----




                                                                  atlantic
                                                     ---    -----

                                                     ---    -----




          User          Query                        ---    -----




                                                            SHIFTING TOOL 0.040 the oil business
                                                                  horizontal from
                                                                  CONDUCTOR HOUSING, HORIZONTAL CHRISTMAS TREE Score: 0.00317
                                                                                                      RUNNING TOOL 0.076
           User          Query                                                0.088
                                                                           IIP ontology trained
      interpretation   mapping
                                                                       ---     -----




                                                                  WELLHEAD HOUSING, HORIZONTAL CHRISTMAS TREE Score: 0.00317
                                                            AXIS 0.037                                TREE 0.068
                                                                       ---     -----

                                                                       ---     -----




       interpretation   mapping
                                                                       ---     -----




                                                                  develop HANGER, HORIZONTAL TUBING HANGER Score: 0.00295
                                                                            0.085
                                                      ---   -----                      ---   -----
                                                                       ---     -----




                                                            FIXED TUBINGon web oil
                                                      ---   -----                      ---   -----
                                                      ---   -----                      ---   -----




                                                                  STRUCTURE 0.037                     TUBING SPOOL 0.060
                                                      ---   -----                      ---   -----

                                                      ---   -----                      ---   -----




                                                            FLUID investordocuments
                                                                            0.085
                                                                  BORE PROTECTOR, HORIZONTAL VESSEL Score: CHRISTMAS TREE 0.048
                                                                                                      SURFACE 0.00284
                                                                  SEPARATOR 0.036
                                                            ELECTRICAL0.085
                                                                  water PENETRATOR 0.034
                                                                  TESTING TOOL, HORIZONTAL BOREHOLE Score: 0.00243 0.046
                                                                                                      CONTROL MODULE
                                                                  gulf 0.083 0.034
                                                            TEST SEPARATORHOUSING, HORIZONTAL CHRISTMAS TREE Score: 0.00238
                                                                  WELLHEAD                            DELIVERY PRICE 0.045
Domain collection                                                 transocean 0.078
                                                            VOLUME FLOW RATE 0.033BOREHOLE Score: VALVE NORMALLY OPEN 0.045
                                                                  TREE, HORIZONTAL                    0.00235
                                                                  CHRISTMAS
                                                                  field 0.072 TREE, HORIZONTAL VESSEL Score: CHRISTMAS TREE 0.043
                                                                                                      SUBSEA 0.00227
                                                            HYDROGEN FLUORIDE 0.029
                                                                  WEAR BUSHING, HORIZONTAL CHRISTMAS TREE Score: 0.00222
                                                            BASE STEEL 0.028                          CHRISTMAS TREE 0.042
                                                                  bluewater Web collection
                                                                              0.070
                                                                  TREE,0.066
                                                                          HORIZONTAL VESSEL Score: 0.00220
                                                            ...                                       ...
                                                                  deep
                     Reformulated                                                                    from different
                      Reformulated
                        query                                                                        domains
                         query




                                                                    Semantic Days 2007
Jon Atle Gulla
Conclusions
      Traditional search based on keyword matching and shallow
      linguistics
      Ontologies provide vocabulary for semantic search
      Graphical ontology for query formulation
            Semantic exploration of domain
            Visual queries
      Trained ontology for query interpretation
            Ontology maps between concepts and domain terms
            Semantic interpretation hidden to users
      Challenges
            Linking concepts to terms
            Scalability




                                        Semantic Days 2007
Jon Atle Gulla
Thank you!




                   Semantic Days 2007
Jon Atle Gulla

Más contenido relacionado

Destacado

Harnessing search engines for KM
Harnessing search engines for KMHarnessing search engines for KM
Harnessing search engines for KMInvotra
 
In Search of a Semantic Book Search Engine: Are We There Yet?
In Search of a Semantic Book Search Engine: Are We There Yet?In Search of a Semantic Book Search Engine: Are We There Yet?
In Search of a Semantic Book Search Engine: Are We There Yet?Irfan Ullah
 
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information RetrievalKeystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information RetrievalMauro Dragoni
 
Semantic data mining: an ontology based approach
Semantic data mining: an ontology based approachSemantic data mining: an ontology based approach
Semantic data mining: an ontology based approachAgnieszka Ławrynowicz
 
Week IV: The Elements of Theatre
Week IV: The Elements of TheatreWeek IV: The Elements of Theatre
Week IV: The Elements of TheatreThomas C.
 
Text Analysis and Semantic Search with GATE
Text Analysis and Semantic Search with GATEText Analysis and Semantic Search with GATE
Text Analysis and Semantic Search with GATEDiana Maynard
 
Semantic security framework and context-aware role-based access control ontol...
Semantic security framework and context-aware role-based access control ontol...Semantic security framework and context-aware role-based access control ontol...
Semantic security framework and context-aware role-based access control ontol...Natalia Díaz Rodríguez
 
Semantic Search at Yahoo
Semantic Search at YahooSemantic Search at Yahoo
Semantic Search at YahooPeter Mika
 
Use of ontologies in natural language processing
Use of ontologies in natural language processingUse of ontologies in natural language processing
Use of ontologies in natural language processingATHMAN HAJ-HAMOU
 
Semantic Relation Classification: Task Formalisation and Refinement
Semantic Relation Classification: Task Formalisation and RefinementSemantic Relation Classification: Task Formalisation and Refinement
Semantic Relation Classification: Task Formalisation and RefinementAndre Freitas
 
GATE: a text analysis tool for social media
GATE: a text analysis tool for social mediaGATE: a text analysis tool for social media
GATE: a text analysis tool for social mediaDiana Maynard
 
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the CloudFirst Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the CloudOntotext
 
Text analysis and Semantic Search with GATE
Text analysis and Semantic Search with GATEText analysis and Semantic Search with GATE
Text analysis and Semantic Search with GATEDiana Maynard
 
Unit 6 - Predicates, Referring Expressions, and Universe of Discourse
Unit 6 -  Predicates, Referring Expressions, and Universe of DiscourseUnit 6 -  Predicates, Referring Expressions, and Universe of Discourse
Unit 6 - Predicates, Referring Expressions, and Universe of DiscourseAshwag Al Hamid
 
Unit 2: Sentences, Utterances, and Propositions
Unit 2: Sentences, Utterances, and PropositionsUnit 2: Sentences, Utterances, and Propositions
Unit 2: Sentences, Utterances, and PropositionsAshwag Al Hamid
 

Destacado (20)

Harnessing search engines for KM
Harnessing search engines for KMHarnessing search engines for KM
Harnessing search engines for KM
 
In Search of a Semantic Book Search Engine: Are We There Yet?
In Search of a Semantic Book Search Engine: Are We There Yet?In Search of a Semantic Book Search Engine: Are We There Yet?
In Search of a Semantic Book Search Engine: Are We There Yet?
 
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information RetrievalKeystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
 
Semantic data mining: an ontology based approach
Semantic data mining: an ontology based approachSemantic data mining: an ontology based approach
Semantic data mining: an ontology based approach
 
Week IV: The Elements of Theatre
Week IV: The Elements of TheatreWeek IV: The Elements of Theatre
Week IV: The Elements of Theatre
 
Elements of drama
Elements of dramaElements of drama
Elements of drama
 
Text Analysis and Semantic Search with GATE
Text Analysis and Semantic Search with GATEText Analysis and Semantic Search with GATE
Text Analysis and Semantic Search with GATE
 
Semantic security framework and context-aware role-based access control ontol...
Semantic security framework and context-aware role-based access control ontol...Semantic security framework and context-aware role-based access control ontol...
Semantic security framework and context-aware role-based access control ontol...
 
Technical Theatre
Technical TheatreTechnical Theatre
Technical Theatre
 
Semantic Search at Yahoo
Semantic Search at YahooSemantic Search at Yahoo
Semantic Search at Yahoo
 
Use of ontologies in natural language processing
Use of ontologies in natural language processingUse of ontologies in natural language processing
Use of ontologies in natural language processing
 
Semantic Relation Classification: Task Formalisation and Refinement
Semantic Relation Classification: Task Formalisation and RefinementSemantic Relation Classification: Task Formalisation and Refinement
Semantic Relation Classification: Task Formalisation and Refinement
 
GATE: a text analysis tool for social media
GATE: a text analysis tool for social mediaGATE: a text analysis tool for social media
GATE: a text analysis tool for social media
 
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the CloudFirst Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
 
Text analysis and Semantic Search with GATE
Text analysis and Semantic Search with GATEText analysis and Semantic Search with GATE
Text analysis and Semantic Search with GATE
 
Elements of drama
Elements of dramaElements of drama
Elements of drama
 
Semantics ppt
Semantics  pptSemantics  ppt
Semantics ppt
 
Chapter One Ppt
Chapter One PptChapter One Ppt
Chapter One Ppt
 
Unit 6 - Predicates, Referring Expressions, and Universe of Discourse
Unit 6 -  Predicates, Referring Expressions, and Universe of DiscourseUnit 6 -  Predicates, Referring Expressions, and Universe of Discourse
Unit 6 - Predicates, Referring Expressions, and Universe of Discourse
 
Unit 2: Sentences, Utterances, and Propositions
Unit 2: Sentences, Utterances, and PropositionsUnit 2: Sentences, Utterances, and Propositions
Unit 2: Sentences, Utterances, and Propositions
 

Más de Vestforsk.no

Presentasjon fremtidens penger hvl 04.11.2019 civita
Presentasjon fremtidens penger   hvl 04.11.2019 civitaPresentasjon fremtidens penger   hvl 04.11.2019 civita
Presentasjon fremtidens penger hvl 04.11.2019 civitaVestforsk.no
 
Hvem skal lave fremtidens penge
Hvem skal lave fremtidens pengeHvem skal lave fremtidens penge
Hvem skal lave fremtidens pengeVestforsk.no
 
Seminar hvl04119 nb
Seminar hvl04119 nbSeminar hvl04119 nb
Seminar hvl04119 nbVestforsk.no
 
Seminar hvl 04112019 sogn sparebank
Seminar hvl 04112019 sogn sparebankSeminar hvl 04112019 sogn sparebank
Seminar hvl 04112019 sogn sparebankVestforsk.no
 
Money on the blockchain
Money on the blockchainMoney on the blockchain
Money on the blockchainVestforsk.no
 
Vassdragsvernraadet
VassdragsvernraadetVassdragsvernraadet
VassdragsvernraadetVestforsk.no
 
Norstella konferanse om blokkjede 17.10.2018
Norstella konferanse om blokkjede 17.10.2018Norstella konferanse om blokkjede 17.10.2018
Norstella konferanse om blokkjede 17.10.2018Vestforsk.no
 
TeknaStudentBergen
TeknaStudentBergenTeknaStudentBergen
TeknaStudentBergenVestforsk.no
 
Partnerforum-22.01.2018
Partnerforum-22.01.2018Partnerforum-22.01.2018
Partnerforum-22.01.2018Vestforsk.no
 
Naeringsutvikling2017
Naeringsutvikling2017Naeringsutvikling2017
Naeringsutvikling2017Vestforsk.no
 
Internettforum2017
Internettforum2017Internettforum2017
Internettforum2017Vestforsk.no
 
Likviditetsforum2017
Likviditetsforum2017Likviditetsforum2017
Likviditetsforum2017Vestforsk.no
 
Likviditetsforum2017
Likviditetsforum2017Likviditetsforum2017
Likviditetsforum2017Vestforsk.no
 

Más de Vestforsk.no (20)

Presentasjon fremtidens penger hvl 04.11.2019 civita
Presentasjon fremtidens penger   hvl 04.11.2019 civitaPresentasjon fremtidens penger   hvl 04.11.2019 civita
Presentasjon fremtidens penger hvl 04.11.2019 civita
 
Hvem skal lave fremtidens penge
Hvem skal lave fremtidens pengeHvem skal lave fremtidens penge
Hvem skal lave fremtidens penge
 
Seminar hvl04119 nb
Seminar hvl04119 nbSeminar hvl04119 nb
Seminar hvl04119 nb
 
Seminar hvl 04112019 sogn sparebank
Seminar hvl 04112019 sogn sparebankSeminar hvl 04112019 sogn sparebank
Seminar hvl 04112019 sogn sparebank
 
Money on the blockchain
Money on the blockchainMoney on the blockchain
Money on the blockchain
 
Vassdragsvernraadet
VassdragsvernraadetVassdragsvernraadet
Vassdragsvernraadet
 
Nhh18022019
Nhh18022019Nhh18022019
Nhh18022019
 
Norstella konferanse om blokkjede 17.10.2018
Norstella konferanse om blokkjede 17.10.2018Norstella konferanse om blokkjede 17.10.2018
Norstella konferanse om blokkjede 17.10.2018
 
Stortinget2018
Stortinget2018Stortinget2018
Stortinget2018
 
TeknaStudentBergen
TeknaStudentBergenTeknaStudentBergen
TeknaStudentBergen
 
ITS2018
ITS2018ITS2018
ITS2018
 
Partnerforum-22.01.2018
Partnerforum-22.01.2018Partnerforum-22.01.2018
Partnerforum-22.01.2018
 
Naeringsutvikling2017
Naeringsutvikling2017Naeringsutvikling2017
Naeringsutvikling2017
 
ITforum2017
ITforum2017ITforum2017
ITforum2017
 
Internettforum2017
Internettforum2017Internettforum2017
Internettforum2017
 
Likviditetsforum2017
Likviditetsforum2017Likviditetsforum2017
Likviditetsforum2017
 
Likviditetsforum2017
Likviditetsforum2017Likviditetsforum2017
Likviditetsforum2017
 
TeknaBergen
TeknaBergenTeknaBergen
TeknaBergen
 
oks-hvl
oks-hvloks-hvl
oks-hvl
 
NOKIOS2017
NOKIOS2017NOKIOS2017
NOKIOS2017
 

Último

[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 

Último (20)

[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 

From Google Search to Semantic Exploration - Semantic Technologies for Next Generation Search

  • 1. From Google Search to Semantic Exploration Jon Atle Gulla Professor Norwegian University of Science and Technology jag@idi.ntnu.no Semantic Days 2007 Jon Atle Gulla
  • 2. Agenda Traditional search applications Adding shallow linguistics to traditional search The concept of semantic search Ontologies in search applications Ontologies for semantic annotation & exploration Ontology-driven query interpretation quot;Hakia thinks that indexing has plateaued and that semantic technologies will take over quot;Hakia thinks that indexing has plateaued and that semantic technologies will take over for the next generation of searchquot;. for the next generation of searchquot;. MacManus, R. “Hakia Takes On Google With Semantic Technologies”. MacManus, R. “Hakia Takes On Google With Semantic Technologies”. http://www.readwriteweb.com/archives/hakia_takes_on_google_semantic_search.php http://www.readwriteweb.com/archives/hakia_takes_on_google_semantic_search.php Semantic Days 2007 Jon Atle Gulla
  • 3. The Language Problem in Search People use the language differently Authors ? cument do query? t of the e n e conte ent answers th th What is his docum t k n o w if How to Information user Semantic Days 2007 Jon Atle Gulla
  • 4. The Google Search Experience Query Similarity Similarity Page rank Index Page rank Index Linguistics Linguistics Results Semantic Days 2007 Jon Atle Gulla
  • 5. Traditional Search Principles Bag-of-words principle You see: Machine understands document as a set of word frequencies A drilling rig or oil rig is a structure housi equipment used to drill for and extract oil or Word matching principle natural gas from underground reservoirs. Drilling rigs can also be used to drill for Syntactic search: water or for exploration purposes. Relevant documents are documents that contain exactly those words that appear in the query Machine sees: Morpho-syntactic search: drill(4) purpos(1) Relevant documents are documents equip(1) reservoir(1) that contain inflectional variants of explor(1) rig(3) extract(1) structur(1) exactly those words that appear in hous(1) underground(1) the query natur(1) water(1) oil(2) One shot principle Query and result set ignored when new query is posted Semantic Days 2007 Jon Atle Gulla
  • 6. Traditional Search Principles Bag-of-words principle User need: Machine understands document Christmas tree as a set of word frequencies Word matching principle Syntactic search: Relevant documents are Index documents that contain exactly those words that appear in the A Christmas tree is one of the most popular traditions query A Christmas tree is one of the most popular traditions associated with the celebration associated with the celebration of Christmas. Morpho-syntactic search: of Christmas. It is normally an evergreen coniferous tree that Relevant documents are It is normally an evergreen coniferous tree that is brought into a home or used in the open, and is is brought into a home or used in the open, and is documents that contain decorated with Christmas lights and colourful decorated with Christmas lights and colourful ornaments during the days around Christmas. inflectional variants of exactly ornaments during the days around Christmas. those words that appear in the A Christmas tree is a set of valves, pipes, and fittings A Christmas tree is a set of valves, pipes, and fittings used to control the flow of oil and gas as it leaves a well query used to control the flow of oil and gas as it leaves a well and enters a pipeline. and enters a pipeline. One shot principle Query and result set ignored when new query is posted Relevance given by document similarity Semantic Days 2007 Jon Atle Gulla
  • 7. Traditional Search Principles Bag-of-words principle Search query Implementation: Machine understands document Christmas trees as a set of word frequencies Word matchingDocument relevant to query if cosine similarity principle above a certain threshold: Syntactic search: Result set Relevant documents are n ∑ documents that contain exactly (q *d ) those words that appear in the1 i i q d =( ) •( ) sim(q, d) = i= query n n q d ∑ ∑ Morpho-syntactic search: 2 2 qi * di Relevant documents are i =1 i =1 documents that contain inflectional variants of exactly those words thatvector representation of document d: appear in the query q: vector representation of vector One shot principle Query and result set ignored when new query is posted Semantic Days 2007 Jon Atle Gulla
  • 8. Adding Shallow Linguistics to Search Clustering or log analysis for grouping search results for ‘oil’ Text categorizsation Entity search Teaser generation Spell checking Collocations Semantic Days 2007 Jon Atle Gulla
  • 9. But “A drilling rig or oil rig is a structure housing equipment used to drill for and extract oil or natural gas from underground reservoirs. Drilling rigs can also be used to drill for water or for exploration purposes.” (Ref: Wikipedia) Semantic Search Principle: Semantic Search Principle: Text is still just a set of strings rig subclassOf sameAs drill(4) oil rig purpos(1) drilling rig equip(1) partOf reservoir(1) usedFor drill explor(1) rig(3) extract(1)gas structur(1) water natural oil hous(1) underground(1) natur(1) water(1) Use ontologies oil(2) Use ontologiesto represent domain vocabulary, to represent domain vocabulary, documents’ content and/or user’s information needs documents’ content and/or user’s information needs Semantic Days 2007 Jon Atle Gulla
  • 10. Semantic Approaches to Search Search principles Syntactic search Semantic search Document view Bag-of-words Terms and concepts Search approach Word matching Concept matching Search process One shot Exploratory session Applications of ontologies in semantic search: Help user formulate semantic queries Scientific reports Reformulate/reinterpret queries IIP project Browse domain Formulate related queries Interoperability between search applications Semantic indexing of documents Semantic Days 2007 Jon Atle Gulla
  • 11. 1. Ontologies in Semantic Exploration Use graphical ontologies for query formulation Semantic annotations of documents Construct queries graphically Use ontological structures to expand query Use ontology to visualize search results Semantic Days 2007 Jon Atle Gulla
  • 12. Query Formulation Queries expanded from ontological structures Semantic Days 2007 Jon Atle Gulla
  • 13. Query Refinement Use ontological structures to explore the domain Semantic Days 2007 Jon Atle Gulla
  • 14. 2. Ontology-Driven Query Interpretation User terminology User query User query Semantic layer --- ----- --- ----- Semantic Query interpretation --- ----- --- ----- --- ----- Ontology trained --- ----- --- ----- Semantic Query interpretation --- ----- --- ----- --- ----- --- ----- --- ----- on person and --- ----- --- ----- User Query --- ----- User Query domain collection interpretation mapping --- ----- --- ----- --- ----- interpretation mapping --- ----- --- ----- --- ----- --- ----- --- ----- --- ----- --- ----- --- ----- --- ----- --- ----- --- ----- --- ----- Domain collection Domain document Standard Standard collection search engine search engine Semantic Days 2007 Jon Atle Gulla
  • 15. Training Ontology for Search Characteristic terms in these documents express user’s interpretation of christmas tree CHRISTMAS TREE for this document collection Concept Prominent document terms CHRISTMAS TREE 0.95 christmas tree 0.80 christmas trees 0.35 x-tree Documents 0.05 valves viewed by 0.02 wellhead user (and considered relevant) Semantic Days 2007 Jon Atle Gulla
  • 16. The Personalized Ontology Each concept described Ontology of weightedIndex terms in terms words Words correspond to user’s assessment of which information is relevant to a concept for this document base Concept – term associations created automatically based on user’s behavior Football ontology Concept Index terms CHRISTMAS TREE 0.95 christmas tree WELL Concept-term matrix a dynamic structure 0.80 christmas trees that reflects user’s preferences and 0.35 x-tree 0.35 PIPE behavior 0.05 valves 0.50 0.02 wellhead 0.95 well 0.98 wells ... 0.95 pipe 0.10 pipes Semantic Days 2007 Jon Atle Gulla
  • 17. Semantic Search Query Retrieved from ontology An artefact that is an assembly of pipes and piping parts, with User query valves and associated control equipment that is connected to the top of a wellhead and is intended for control of fluid from CHRISTMAS TREE a well Matches in document base Query mapping Christmas trees are used on both subsea and surface wellheads and both are available in a wide range of sizes and configurations, ... christmas tree:0.95, christmas trees:0.8, A Christmas tree is one of the most popular traditions associated x-tree:0.35, valves:0.05, wellhead:0.04 with the celebration of Christmas. ... The function of a christmas tree is to both prevent the release of oil or gas from an oil well into the environment and also to direct and control the flow of formation fluids from the well. ... Private Christmas trees are not usually put up until at least the middle of December and are usually taken down by the 6th of January , ... It is normally an evergreen tree that is brought into a home or used in the open, and is decorated with Christmas lights and colourful Concept Prominent document terms ornaments during the days around Christmas. Good understanding of topside equipment used, including x-trees CHRISTMAS TREE and wellhead systems 0.95 christmas tree Wellhead valves are used to isolate the flow of oil or gas at the 0.80 christmas trees takeoff from an oil or gas well. . 0.35 x-tree VENTILTRE er en ventilenhet montert på toppen av stigerør eller 0.05 brønnhode, ofte kalt juletre valves 0.04 A wellhead consists of the spools, valves, and other components wellhead which contain the pressure within the well. Semantic Days 2007 Jon Atle Gulla
  • 18. Semantic Search Results Retrieved from ontology An artefact that is an assembly of pipes and piping parts, with valves and associated control equipment that is connected to the top of a wellhead and is intended for control of fluid from Query/document a well similarity Matches in document base Christmas trees are used on both subsea and surface wellheads plural form strong and both are available in a wide range of sizes and configurations, ... A Christmas tree is one of the most popular traditions associated singular form, but other words different weak with the celebration of Christmas. ... The function of a christmas tree is to both prevent the release of oil or singular form gas from an oil well into the environment and also to direct and control strong the flow of formation fluids from the well. ... plural form, but other words different 4/4 Precision: 4/4 Private Christmas trees are not usually put up until at least the middle weak Precision: of December and are usually taken down by the 6th of January , ... Recall: 5/6 Recall: 5/6 It is normally an evergreen tree that is brought into a home or used in different words, christmas related no the open, and is decorated with Christmas lights and colourful ornaments during the days around Christmas. Good understanding of topside equipment used, including x-trees synonyms and wellhead systems strong Wellhead valves are used to isolate the flow of oil or gas at the related words acceptable takeoff from an oil or gas well. . related words, ontology not trained in VENTILTRE er en ventilenhet montert på toppen av stigerør eller no brønnhode, ofte kalt juletre this language A wellhead consists of the spools, valves, and other components acceptable related words which contain the pressure within the well. Semantic Days 2007 Jon Atle Gulla
  • 19. Keyword Search Query User query Retrieved from ontology An artefact that is an assembly of pipes and piping parts, with x-tree valves and associated control equipment that is connected to the top of a wellhead and is intended for control of fluid from a well User interpretation Matches in document base Christmas trees are used on both subsea and surface wellheads CHRISTMAS TREE:0.35 and both are available in a wide range of sizes and configurations, ... Query mapping The function of a christmas tree is to both prevent the release of oil or christmas tree:0.95, christmas trees:0.8, gas from an oil well into the environment and also to direct and control the flow of formation fluids from the well. ... x-tree:0.35, valves:0.05, wellhead:0.04 Concept Prominent document terms Good understanding of topside equipment used, including x-trees CHRISTMAS TREE and wellhead systems 0.95 christmas tree Wellhead valves are used to isolate the flow of oil or gas at the 0.80 christmas trees takeoff from an oil or gas well. . 0.35 x-tree 0.05 valves 0.04 A wellhead consists of the spools, valves, and other components wellhead which contain the pressure within the well. Semantic Days 2007 Jon Atle Gulla
  • 20. Semantic Search - Learning No fixed set of relevant documents – depends on user preferences User query User query CHRISTMAS TREE Personalized concept- term matrix Result page Documents viewed by user (and considered relevant) Semantic Days 2007 Jon Atle Gulla
  • 21. 2. IIP Ontology on Web Documents User terminology Experiment with real document collection Horizontal Horizontal tree tree Mapping to query based on document Interpretation of ‘horizontal tree’ tree content horizontal HORIZONTAL VESSEL 0.162tree 1.0 horizontal christmas HORIZONTAL CHRISTMAS TREE Score: WELLHEAD HOUSING 0.109 0.01488 HORIZONTAL BOREHOLE 0.138 1.0 horizontal christmas trees CONDUCTOR HOUSING 0.109 horizontal x-tree 1.0 CONDUCTOR HOUSING, HORIZONTAL VESSEL Score: 0.00586 HORIZONTAL CHRISTMAS TREE 0.088 WEAR BUSHING 0.101 Semantic layer horixontal x-trees 1.0 HORIZONTAL VESSEL JOINT GASKET 0.096 WELLHEAD HOUSING, RING Score: 0.00586 HORIZONTAL TUBING HANGER 0.072 PLANEWEAR BUSHING, HORIZONTAL VESSELSUBSEA PRODUCTION MANIFOLD 0.096 Score: 0.00411 0.057 CHRISTMAS sentre 0.465TREE, HORIZONTAL CHRISTMAS TREE Score: 0.00369 INTERSECTION 0.055 TESTING TOOL 0.088 --- ----- Ontology adapted --- ----- Semantic Query interpretation --- ----- --- ----- --- ----- --- ----- --- ----- Semantic Query interpretation HORIZONTAL CHRISTMAS TREE, HORIZONTAL VESSEL Score: 0.00344 PIPING END 0.051 0.216 BORE PROTECTOR 0.088 --- ----- deepwater --- ----- --- ----- BENDING STRESS 0.043 web documents HORIZONTAL CHRISTMAS TREE 0.085 using --- ----- TUBING0.092 HORIZONTAL VESSEL Score: 0.00323 SPOOL, --- ----- atlantic --- ----- --- ----- User Query --- ----- SHIFTING TOOL 0.040 the oil business horizontal from CONDUCTOR HOUSING, HORIZONTAL CHRISTMAS TREE Score: 0.00317 RUNNING TOOL 0.076 User Query 0.088 IIP ontology trained interpretation mapping --- ----- WELLHEAD HOUSING, HORIZONTAL CHRISTMAS TREE Score: 0.00317 AXIS 0.037 TREE 0.068 --- ----- --- ----- interpretation mapping --- ----- develop HANGER, HORIZONTAL TUBING HANGER Score: 0.00295 0.085 --- ----- --- ----- --- ----- FIXED TUBINGon web oil --- ----- --- ----- --- ----- --- ----- STRUCTURE 0.037 TUBING SPOOL 0.060 --- ----- --- ----- --- ----- --- ----- FLUID investordocuments 0.085 BORE PROTECTOR, HORIZONTAL VESSEL Score: CHRISTMAS TREE 0.048 SURFACE 0.00284 SEPARATOR 0.036 ELECTRICAL0.085 water PENETRATOR 0.034 TESTING TOOL, HORIZONTAL BOREHOLE Score: 0.00243 0.046 CONTROL MODULE gulf 0.083 0.034 TEST SEPARATORHOUSING, HORIZONTAL CHRISTMAS TREE Score: 0.00238 WELLHEAD DELIVERY PRICE 0.045 Domain collection transocean 0.078 VOLUME FLOW RATE 0.033BOREHOLE Score: VALVE NORMALLY OPEN 0.045 TREE, HORIZONTAL 0.00235 CHRISTMAS field 0.072 TREE, HORIZONTAL VESSEL Score: CHRISTMAS TREE 0.043 SUBSEA 0.00227 HYDROGEN FLUORIDE 0.029 WEAR BUSHING, HORIZONTAL CHRISTMAS TREE Score: 0.00222 BASE STEEL 0.028 CHRISTMAS TREE 0.042 bluewater Web collection 0.070 TREE,0.066 HORIZONTAL VESSEL Score: 0.00220 ... ... deep Reformulated from different Reformulated query domains query Semantic Days 2007 Jon Atle Gulla
  • 22. Conclusions Traditional search based on keyword matching and shallow linguistics Ontologies provide vocabulary for semantic search Graphical ontology for query formulation Semantic exploration of domain Visual queries Trained ontology for query interpretation Ontology maps between concepts and domain terms Semantic interpretation hidden to users Challenges Linking concepts to terms Scalability Semantic Days 2007 Jon Atle Gulla
  • 23. Thank you! Semantic Days 2007 Jon Atle Gulla