SlideShare una empresa de Scribd logo
1 de 2
Descargar para leer sin conexión
Integration of Visualization with Search Engines
                     Omar Alonso                                                      Ricardo Baeza-Yates
                   Oracle Corp.                                                   Dept. of Computer Science
                500 Oracle Parkway                                                    Universidad de Chile
             Redwood Shores, CA 94065                                         Av. Blanco Encalada 2120, Santiago
                       USA                                                                   Chile
             oalonso@us.oracle.com                                                    rbaeza@dcc.uchile.cl



ABSTRACT                                                            The user has the choice of selecting a searcher and a visualizer
                                                                    as part of his profile. Then he issues a query to the system and
In this poster we analyze the dependency problem of the user
                                                                    reviews the search results using his selected visualization
interface with the information retrieval software. We propose
                                                                    interface. Under the covers a series of transformations occurs
an architecture that allows the separation of the user interface
                                                                    to make this integration possible.
from the retrieval component. This is useful when the user
wants to select an interface or visualization metaphor that         When the user issues a query (1), a query parser module
could not always be available for different information             translates the query to the selected search engine syntax (2).
retrieval systems or search engines. We present a model for         The results from the query can be in IVL format or not
visualizing large collections of documents in World Wide Web        depending if the searcher supports IVL or not. If the searcher
retrieval, independently of the retrieval system.                   does not support IVL, a transformer will map the internal
                                                                    format of the searcher to IVL (3). Finally if the interface does
Keywords                                                            not support IVL, a transformer will perform another
Software architectures for WWW search, information                  transformation to generate the visualization’s format (4).
visualization, integration with database systems, previews,         Figure 1 shows a simplified version of the architecture.
overviews.
                                                                                           1                           2
1. INTRODUCTION                                                         Information
                                                                                                                              Search
     The process of information resource discovery on the Web           Visualization            Query translator
is a primary task that requires essential tools. Lately there has                                                             Engine
been a growth in the number of tools and applications in the              Interface
information retrieval area that suppose to make the process
easier for users. However, searching the Web is still difficult
and there are still many challenges and open problems [2].
                                                                                  4               Transformation             3
Search engines have a user interface for expressing the query
                                                                                                      Engine
and for displaying the answers. The interface is tightly coupled
with the retrieval software and it varies from system to system.
Given this scenario the user is not allowed to choose a user                      Figure 1. Software architecture
interface and a search engine independently. In other words,        A key feature in our architecture is the transformation engine.
searching and visualizing information are not independent           It is possible to automate, in some cases, wrappers or adaptors
components.                                                         generation. As an example, given a set of answers in IVL, it is
                                                                    very easy to generate a plug-in for Apple's Sherlock.
It is important that the user has an option to select a user
interface or visualization metaphor that he prefers over other      The following is an example in IVL of the results of a query
ones, even the default retrieval user interface for his             about “Java and XML”.
information seeking process. On the other hand, if a user has a     <?xml version="1.0"?>
preference on a retrieval system he should be able to use its
                                                                    <ivl><query>Java and XML</query>
services without hardware or user interface limitations. We
believe this is important for producing previews and overviews      <rank>1</rank>
of information in different systems (search engine, retrieval       <url>http://java.sun.com/xml</url>
system, digital library, etc.) [3]                                  <title>Java and XML</title>
                                                                    <abstract>Sun's Java XML resource site …
2. SOFTWARE ARCHITECTURE                                            </abstract>
     The software architecture consists of a set of searchers, a    <lm>Last modified 18-Oct-99 </lm>
set of visualizers, and an intermediate representation [1]. We      <size> page size 4K </size>
defined a language called IVL (Information Visualization
                                                                    <rank>2.</rank>
Language) that can be used as the intermediate representation
and therefore allows searchers and visualizers to interchange       <url>http://www.oreilly.com/catalog/javaxml
                                                                    </url>
data. We implemented IVL in XML.
                                                                    <title>Java and XML </title>
<abstract>Shows you how to put Java and XML
together </abstract>
<lm>Last modified 27-Nov-00</lm>
<size> page size 4K </size>
<rank> 3</rank>
<url>http://technet.oracle.com/tech/xml/</url>
<title>XML page</title>
<abstract>Oracle XML resource page.</abstract>
<lm>Last modified 11-Jan-01 </lm>
<size> page size 12K </size>
</ivl>



3. ONGOING WORK
     We implemented a prototype of the software architecture
that includes Oracle interMedia Text as the search retrieval
component [4] and several prototypes of user interfaces. We
wrote the user interface prototypes in different languages: Java,
JSP (Java Server Pages), and PSP (PL/SQL Server Pages) to
explore different scenarios.
The prototype works as follows. The user issues a query from                   Figure 3. AltaVista-like user interface
the browser. A Java servlet is responsible for opening a JDBC
connection to the Oracle8i database, issuing the text query, and
retrieving the rows. The search server returns the results in       4. CONCLUSIONS
IVL and with the XML supported features from the product, a              With so many systems and tools for web retrieval, our
component transform the results into the visualization’s format.    model defines a way to share the output of those so that the
In case the user interface has a proprietary format, another        visualization is more independent and therefore an option of
transformation is applied.                                          several to the user.
                                                                         Future work includes experiments with different
Figure 2 shows a screenshot of a two-view search interface.         hardware devices (e.g. cell phones, personal assistants, etc.)
The left side shows structure and the right side shows services     and integration with different visualization products that are
(e.g. highlighting, summary, etc.). The same user interface can     available in the market.
be used with a different search engine. Figure 3 shows an
AltaVista-like user interface using interMedia Text as the          5. REFERENCES
search engine. We are also exploring the automatic generation       [1] O. Alonso and R. Baeza-Yates. “A Model and Software
of simple user interfaces and previews/overviews with our               Architecture for Search Results Visualization on WWW”.
architecture.                                                           VII Symposium on String Processing and Information
                                                                        Retrieval (SPIRE), A Coruna, Spain. IEEE Computer
                                                                        Society Press (September 2000).
                                                                    [2] R. Baeza-Yates and B. Ribeiro-Neto. Modern Information
                                                                        Retrieval. Addison-Wesley Longman, Edinburgh Gate,
                                                                        London (1999).
                                                                    [3] S. Greene et al. “Previews and Overviews in Digital
                                                                        Libraries: Designing Surrogates to Support Visual
                                                                        Information Seeking”. UMIACS-TR-97-73 ISR-TR (1999).
                                                                    [4] Oracle interMedia Text Reference (8.1.6). Oracle Corp.
                                                                        Redwood Shores, CA (December 1999).




       Figure 2. A Two-View search results interface

Más contenido relacionado

Similar a Integration visualization

Mariana Alupului Inventions
Mariana Alupului InventionsMariana Alupului Inventions
Mariana Alupului Inventionsmalupului
 
IRJET- Plug-In based System for Data Visualization
IRJET- Plug-In based System for Data VisualizationIRJET- Plug-In based System for Data Visualization
IRJET- Plug-In based System for Data VisualizationIRJET Journal
 
Sup (Semantic User Profiling)
Sup (Semantic User Profiling)Sup (Semantic User Profiling)
Sup (Semantic User Profiling)Emanuela Boroș
 
IRJET- Android Application for WIFI based Library Book Locator
IRJET-  	  Android Application for WIFI based Library Book LocatorIRJET-  	  Android Application for WIFI based Library Book Locator
IRJET- Android Application for WIFI based Library Book LocatorIRJET Journal
 
A Web Services Based Framework For Uniform Integration Of Command-Line Bioinf...
A Web Services Based Framework For Uniform Integration Of Command-Line Bioinf...A Web Services Based Framework For Uniform Integration Of Command-Line Bioinf...
A Web Services Based Framework For Uniform Integration Of Command-Line Bioinf...Kim Daniels
 
Pattern based approach for Natural Language Interface to Database
Pattern based approach for Natural Language Interface to DatabasePattern based approach for Natural Language Interface to Database
Pattern based approach for Natural Language Interface to DatabaseIJERA Editor
 
Web crawler with email extractor and image extractor
Web crawler with email extractor and image extractorWeb crawler with email extractor and image extractor
Web crawler with email extractor and image extractorAbhinav Gupta
 
miniprojectreport
miniprojectreportminiprojectreport
miniprojectreportsilpa mohan
 
IRJET- Hosting NLP based Chatbot on AWS Cloud using Docker
IRJET-  	  Hosting NLP based Chatbot on AWS Cloud using DockerIRJET-  	  Hosting NLP based Chatbot on AWS Cloud using Docker
IRJET- Hosting NLP based Chatbot on AWS Cloud using DockerIRJET Journal
 
IRJET- Conversational Commerce (ESTILO)
IRJET- Conversational Commerce (ESTILO)IRJET- Conversational Commerce (ESTILO)
IRJET- Conversational Commerce (ESTILO)IRJET Journal
 
IRJET- Design of Closed Loop PI Controller Based Hybrid Z-Source DC-DC Conver...
IRJET- Design of Closed Loop PI Controller Based Hybrid Z-Source DC-DC Conver...IRJET- Design of Closed Loop PI Controller Based Hybrid Z-Source DC-DC Conver...
IRJET- Design of Closed Loop PI Controller Based Hybrid Z-Source DC-DC Conver...IRJET Journal
 
Internship Report
Internship ReportInternship Report
Internship ReportJiali Chen
 

Similar a Integration visualization (20)

Mariana Alupului Inventions
Mariana Alupului InventionsMariana Alupului Inventions
Mariana Alupului Inventions
 
IRJET- Plug-In based System for Data Visualization
IRJET- Plug-In based System for Data VisualizationIRJET- Plug-In based System for Data Visualization
IRJET- Plug-In based System for Data Visualization
 
Sup (Semantic User Profiling)
Sup (Semantic User Profiling)Sup (Semantic User Profiling)
Sup (Semantic User Profiling)
 
IRJET- Android Application for WIFI based Library Book Locator
IRJET-  	  Android Application for WIFI based Library Book LocatorIRJET-  	  Android Application for WIFI based Library Book Locator
IRJET- Android Application for WIFI based Library Book Locator
 
Sup documentation
Sup documentationSup documentation
Sup documentation
 
ThesisProposal
ThesisProposalThesisProposal
ThesisProposal
 
A Web Services Based Framework For Uniform Integration Of Command-Line Bioinf...
A Web Services Based Framework For Uniform Integration Of Command-Line Bioinf...A Web Services Based Framework For Uniform Integration Of Command-Line Bioinf...
A Web Services Based Framework For Uniform Integration Of Command-Line Bioinf...
 
final ppt.pptx
final ppt.pptxfinal ppt.pptx
final ppt.pptx
 
final ppt.pptx
final ppt.pptxfinal ppt.pptx
final ppt.pptx
 
Pattern based approach for Natural Language Interface to Database
Pattern based approach for Natural Language Interface to DatabasePattern based approach for Natural Language Interface to Database
Pattern based approach for Natural Language Interface to Database
 
Web crawler with email extractor and image extractor
Web crawler with email extractor and image extractorWeb crawler with email extractor and image extractor
Web crawler with email extractor and image extractor
 
Persentation
PersentationPersentation
Persentation
 
miniprojectreport
miniprojectreportminiprojectreport
miniprojectreport
 
IRJET- Hosting NLP based Chatbot on AWS Cloud using Docker
IRJET-  	  Hosting NLP based Chatbot on AWS Cloud using DockerIRJET-  	  Hosting NLP based Chatbot on AWS Cloud using Docker
IRJET- Hosting NLP based Chatbot on AWS Cloud using Docker
 
IT6701-Information management question bank
IT6701-Information management question bankIT6701-Information management question bank
IT6701-Information management question bank
 
IRJET- Conversational Commerce (ESTILO)
IRJET- Conversational Commerce (ESTILO)IRJET- Conversational Commerce (ESTILO)
IRJET- Conversational Commerce (ESTILO)
 
IRJET- Design of Closed Loop PI Controller Based Hybrid Z-Source DC-DC Conver...
IRJET- Design of Closed Loop PI Controller Based Hybrid Z-Source DC-DC Conver...IRJET- Design of Closed Loop PI Controller Based Hybrid Z-Source DC-DC Conver...
IRJET- Design of Closed Loop PI Controller Based Hybrid Z-Source DC-DC Conver...
 
UCIAD overview
UCIAD overviewUCIAD overview
UCIAD overview
 
Internship Report
Internship ReportInternship Report
Internship Report
 
Archonnex at ICPSR
Archonnex at ICPSRArchonnex at ICPSR
Archonnex at ICPSR
 

Más de Stefanos Anastasiadis (14)

Webmaster guide-en
Webmaster guide-enWebmaster guide-en
Webmaster guide-en
 
Web search engines and search technology
Web search engines and search technologyWeb search engines and search technology
Web search engines and search technology
 
Ultra search
Ultra searchUltra search
Ultra search
 
Tips and technics for search engine market
Tips and technics for search engine marketTips and technics for search engine market
Tips and technics for search engine market
 
The little-joomla-seo-book-v1
The little-joomla-seo-book-v1The little-joomla-seo-book-v1
The little-joomla-seo-book-v1
 
The google best_practices_guide
The google best_practices_guideThe google best_practices_guide
The google best_practices_guide
 
Web search algorithms and user interfaces
Web search algorithms and user interfacesWeb search algorithms and user interfaces
Web search algorithms and user interfaces
 
Searching the web general
Searching the web generalSearching the web general
Searching the web general
 
Seminar algorithms of web
Seminar algorithms of webSeminar algorithms of web
Seminar algorithms of web
 
Search engines
Search enginesSearch engines
Search engines
 
Information organization
Information organization Information organization
Information organization
 
Get your-web-site-to-be-found
Get your-web-site-to-be-foundGet your-web-site-to-be-found
Get your-web-site-to-be-found
 
Search engine strategies 8 04
Search engine strategies 8 04Search engine strategies 8 04
Search engine strategies 8 04
 
Ecommerce webinar-oct-2010
Ecommerce webinar-oct-2010Ecommerce webinar-oct-2010
Ecommerce webinar-oct-2010
 

Integration visualization

  • 1. Integration of Visualization with Search Engines Omar Alonso Ricardo Baeza-Yates Oracle Corp. Dept. of Computer Science 500 Oracle Parkway Universidad de Chile Redwood Shores, CA 94065 Av. Blanco Encalada 2120, Santiago USA Chile oalonso@us.oracle.com rbaeza@dcc.uchile.cl ABSTRACT The user has the choice of selecting a searcher and a visualizer as part of his profile. Then he issues a query to the system and In this poster we analyze the dependency problem of the user reviews the search results using his selected visualization interface with the information retrieval software. We propose interface. Under the covers a series of transformations occurs an architecture that allows the separation of the user interface to make this integration possible. from the retrieval component. This is useful when the user wants to select an interface or visualization metaphor that When the user issues a query (1), a query parser module could not always be available for different information translates the query to the selected search engine syntax (2). retrieval systems or search engines. We present a model for The results from the query can be in IVL format or not visualizing large collections of documents in World Wide Web depending if the searcher supports IVL or not. If the searcher retrieval, independently of the retrieval system. does not support IVL, a transformer will map the internal format of the searcher to IVL (3). Finally if the interface does Keywords not support IVL, a transformer will perform another Software architectures for WWW search, information transformation to generate the visualization’s format (4). visualization, integration with database systems, previews, Figure 1 shows a simplified version of the architecture. overviews. 1 2 1. INTRODUCTION Information Search The process of information resource discovery on the Web Visualization Query translator is a primary task that requires essential tools. Lately there has Engine been a growth in the number of tools and applications in the Interface information retrieval area that suppose to make the process easier for users. However, searching the Web is still difficult and there are still many challenges and open problems [2]. 4 Transformation 3 Search engines have a user interface for expressing the query Engine and for displaying the answers. The interface is tightly coupled with the retrieval software and it varies from system to system. Given this scenario the user is not allowed to choose a user Figure 1. Software architecture interface and a search engine independently. In other words, A key feature in our architecture is the transformation engine. searching and visualizing information are not independent It is possible to automate, in some cases, wrappers or adaptors components. generation. As an example, given a set of answers in IVL, it is very easy to generate a plug-in for Apple's Sherlock. It is important that the user has an option to select a user interface or visualization metaphor that he prefers over other The following is an example in IVL of the results of a query ones, even the default retrieval user interface for his about “Java and XML”. information seeking process. On the other hand, if a user has a <?xml version="1.0"?> preference on a retrieval system he should be able to use its <ivl><query>Java and XML</query> services without hardware or user interface limitations. We believe this is important for producing previews and overviews <rank>1</rank> of information in different systems (search engine, retrieval <url>http://java.sun.com/xml</url> system, digital library, etc.) [3] <title>Java and XML</title> <abstract>Sun's Java XML resource site … 2. SOFTWARE ARCHITECTURE </abstract> The software architecture consists of a set of searchers, a <lm>Last modified 18-Oct-99 </lm> set of visualizers, and an intermediate representation [1]. We <size> page size 4K </size> defined a language called IVL (Information Visualization <rank>2.</rank> Language) that can be used as the intermediate representation and therefore allows searchers and visualizers to interchange <url>http://www.oreilly.com/catalog/javaxml </url> data. We implemented IVL in XML. <title>Java and XML </title>
  • 2. <abstract>Shows you how to put Java and XML together </abstract> <lm>Last modified 27-Nov-00</lm> <size> page size 4K </size> <rank> 3</rank> <url>http://technet.oracle.com/tech/xml/</url> <title>XML page</title> <abstract>Oracle XML resource page.</abstract> <lm>Last modified 11-Jan-01 </lm> <size> page size 12K </size> </ivl> 3. ONGOING WORK We implemented a prototype of the software architecture that includes Oracle interMedia Text as the search retrieval component [4] and several prototypes of user interfaces. We wrote the user interface prototypes in different languages: Java, JSP (Java Server Pages), and PSP (PL/SQL Server Pages) to explore different scenarios. The prototype works as follows. The user issues a query from Figure 3. AltaVista-like user interface the browser. A Java servlet is responsible for opening a JDBC connection to the Oracle8i database, issuing the text query, and retrieving the rows. The search server returns the results in 4. CONCLUSIONS IVL and with the XML supported features from the product, a With so many systems and tools for web retrieval, our component transform the results into the visualization’s format. model defines a way to share the output of those so that the In case the user interface has a proprietary format, another visualization is more independent and therefore an option of transformation is applied. several to the user. Future work includes experiments with different Figure 2 shows a screenshot of a two-view search interface. hardware devices (e.g. cell phones, personal assistants, etc.) The left side shows structure and the right side shows services and integration with different visualization products that are (e.g. highlighting, summary, etc.). The same user interface can available in the market. be used with a different search engine. Figure 3 shows an AltaVista-like user interface using interMedia Text as the 5. REFERENCES search engine. We are also exploring the automatic generation [1] O. Alonso and R. Baeza-Yates. “A Model and Software of simple user interfaces and previews/overviews with our Architecture for Search Results Visualization on WWW”. architecture. VII Symposium on String Processing and Information Retrieval (SPIRE), A Coruna, Spain. IEEE Computer Society Press (September 2000). [2] R. Baeza-Yates and B. Ribeiro-Neto. Modern Information Retrieval. Addison-Wesley Longman, Edinburgh Gate, London (1999). [3] S. Greene et al. “Previews and Overviews in Digital Libraries: Designing Surrogates to Support Visual Information Seeking”. UMIACS-TR-97-73 ISR-TR (1999). [4] Oracle interMedia Text Reference (8.1.6). Oracle Corp. Redwood Shores, CA (December 1999). Figure 2. A Two-View search results interface