An Intelligent Meta Search Engine for Efficient Web Document Retrieval

iosrjce

IOSR Journal of Computer Engineering (IOSR-JCE) is a double blind peer reviewed International Journal that provides rapid publication (within a month) of articles in all areas of computer engineering and its applications. The journal welcomes publications of high quality papers on theoretical developments and practical applications in computer technology. Original research papers, state-of-the-art reviews, and high quality technical notes are invited for publications.

IOSR Journal of Computer Engineering (IOSR-JCE)
e-ISSN: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 2, Ver. V (Mar – Apr. 2015), PP 45-54
www.iosrjournals.org
DOI: 10.9790/0661-17254554 www.iosrjournals.org 45 | Page
An Intelligent Meta Search Engine for Efficient Web Document
Retrieval
Keerthana.I.P 1
, Aby Abahai.T 2
1(
Dept. of CSE, Mar Athanasius College of Engineering, Kothamangalam , Kerala, India)
2(
Dept. of CSE, Assistant Professor, Mar Athanasius College of Engineering, Kothamangalam, Kerala, India)
Abstract: In daily use of internet, when searching information we face lots of difficulty due to the rapid growth
of Information Resources. This is because of the fact that a single search engine cannot index the entire web of
resources. A Meta Search Engine is a solution for this, which submits the query to many other search engines
and returns summary of the results. Therefore, the search results receive are an aggregate result of multiple
searches. This strategy gives our search a boarder scope than searching a single search engine, but the results
are not always better. Because the Meta Search Engine must use its own algorithm to choose the best results
from multiple search engines. In this paper we proposed a new Meta Search Engine to overcome these
drawbacks. It uses a new page rank algorithm called modified ranking for ranking and optimizing the search
results in an efficient way. It is a two phase ranking algorithm used for ordering the web pages based on their
relevance and popularity. This Meta Search Engine is developed in such a way that it will produce more
efficient results than traditional search engines.
Keywords: Metacrawler, Meta Search Engine, Ranking, Search Engine, Web Crawler
I. Introduction
The World Wide Web (WWW) is a collection of information linked together from all over the world. It
is based on hypertext, which means that when the user is navigating on the ocean of information he can pick up
an interesting word or expression within a text and request for more information about it. This can not apply to
all words in a text, but only to those who have been properly designated as such by the producer of the
information and which are displayed on screen example as underlined. The web is highly dynamic in nature i.e.,
new web pages are being added, some are removed and some others are modified. So it is very difficult to find
relevant information from it. Search engines are used to extract valuable Information from the internet. A search
engine [1] is a program that searches for and identifies items in a database that correspond to keywords or
characters specified by the user, especially for finding particular sites on the WWW.
On the Internet, a search engine is a coordinated set of programs that includes: A spider (also called a
"crawler" or a "bot") that goes to every page or representative pages on every web site that wants to be
searchable and reads it, uses hypertext links on each page to discover and read a site's other pages. Search
engine includes program that creates a huge index (sometimes called a "catalog") from the pages that have been
read. It also contains a program that receives our search request, compares them to the entries in the index and
then returns results.
A Meta Search Engine [2] is a search engine that queries other search engines and then combines the
results that are received from all. Here, the user is not using just one search engine but a combination of many
search engines at once to optimize web searching. Meta Search Engines don't create their own database of
information. They search the other search engine‟s databases. The major advantage of a Meta Search Engine is
that it allows the user to search several search engines simultaneously. So there is no need to search in each
search engine separately.
In this paper a new Meta Search Engine is proposed, which is capable of producing relevant results for
particular query string. It will display the results from different search engines as soon as those found by a
crawler. Finally it will rank each and every web page based on its popularity and relevance. For that, this paper
proposes a new ranking algorithm. This paper is organized as follows: in section II, the works related to search
engines is discussed. Section III describes the Meta Search Engine implementation. In section IV the
performance analysis of the proposed Meta Search Engine is discussed. Section V describes the advantages of
the proposed system. Finally the section VI gives the conclusion for the work.
II. Related Works
2.1 Search Engines
A search engine is designed to search WWW for information about the given search query and returns
links to various documents in which the search query‟s keywords are found. There are mainly three types of
search engines-Web Search Engines, Meta Search Engines, Vertical Search Engines [3]. A web search engine
An Intelligent Meta Search Engine for Efficient Web Document Retrieval
DOI: 10.9790/0661-17254554 www.iosrjournals.org 46 | Page
searches for information in WWW. A Vertical Search provides the user with results for queries on a particular
domain. Meta Search Engines send the user‟s search queries to various search engines and combine the search
results. It is important for search engine to maintain a high quality websites. A database is to be made in which
following attributes should be there like length of title, keywords in title, number of back-links, in-links, length
of the URL, ranking. WEKA and Tanagra a data mining tools can be used to extract meaningful knowledge
from this large set of data [4].
The search engines used in this paper for the Meta Search Engine construction are Google, Yahoo, Ask
and Bing [5]. Google is a web search engine owned by Google Inc. it is the most widely used search engine on
the Web. The main purpose of Google Search is to search for text in WebPages, as opposed to other data, such
as with Image Search. Google was developed by Larry Page and Sergey Brin in 1997. Yahoo is a web search
engine, owned by Yahoo! Inc. the 2nd largest search engine on the web by query volume. It was founded in
January 1994 by Jerry Yang and David Filo. Ask is a Search Engine, which is also known as Ask Jeeves. It is
basically designed to answer the user‟s Queries in the mode of Q&A and is proved to be a focused search
engine. Ask was developed in 1996 by Garrett Gruener and David Warthen in Berkeley, California. Bing is a
Search Engine, which was formerly known as Live Search and MSN Search. It is a web search engine that was
owned by Microsoft. It went fully online on 2009 June 3, with a preview version released on June 1, 2009.
2.2 Web Crawlers
All search engines internally use web crawlers to keep the copies of data a fresh. A search engine is
divided into different modules. Among those modules crawler module is the module on which search engine
relies the most because it helps to provide the best possible results to the search engine. Crawlers or spiders are
small programs that browse the web on the search engine‟s behalf, similarly how a human user would follow
links to reach different pages. Google crawlers run on a distributed network of thousands of low-cost computers
and can therefore carry out fast parallel processing. This is the way Google returns results within fraction of
seconds. It has three main components [6]: a frontier which stores the list of URL‟s to visit, a Page Downloader
which download pages from WWW and Web Repository receives web pages from a crawler and stores it in the
database. The repository stores only standard HTML pages. The working of a web crawler is as follows [6]:
1. Initializing the seed URL or URLs
2. Adding it to the frontier
3. Selecting the URL from the frontier
4. Fetching the web-page corresponding to that URLs
5. Parsing the retrieved page to extract the URLs
6. Adding all the unvisited links to the frontier
7. Again start with step 2 and repeat till the frontier is empty.
Based on different strategies employed in web crawling there are four types of web crawlers: focused
crawler, distributed crawler, incremental crawler, and parallel crawler [6]. A focused crawler is the Web crawler
that tries to download pages which are related to each other. It collects documents which are specific and
relevant to the given topic. An incremental crawler is a traditional crawler which refreshes its collection
periodically and replaces the old documents with the newly downloaded documents. It also exchanges less
important pages by new and more important pages. A distributed Crawler uses a distributed computing
technique. Many crawlers are working to distribute in the process of web crawling to get most coverage of the
web. A central server manages the communication and synchronization of the nodes, as it is geographically
distributed. In the case of parallel crawler multiple crawlers are often run in parallel, which are termed as
Parallel crawlers. A parallel crawler consists of multiple crawling Processes called as C-procs which can run on
network of workstations. The Parallel crawlers depend on Page freshness and Page Selection. A Parallel crawler
can be on local network or be distributed at geographically distant locations.
Popular examples of web crawlers are Yahoo Slurp was the name of the Yahoo search crawler,
Googlebot is the name of the Google search crawler, Bingbot is the name of Microsoft's Bing web crawler, etc.
2.3 Web Page Elements
The main web page elements that will be extracted by a crawler are given below [7].
Title Tag: Each web page has a unique title tag, which helps search engines to know how the page is distinct
from the others on the site. Titles can be both short and informative. If the title is too long, then search engines
will show only a portion of it in the search result. Page titles are an important aspect of search engine
optimization.
Description Meta Tag: A page's description meta tag gives search engines a summary of what the page is
about. A page's title may be a few words or a phrase, but a page's description meta tag might be a sentence or
two or a short paragraph. Fig.1 shows an example for title and description tags.
An Intelligent Meta Search Engine for Efficient Web Document Retrieval
DOI: 10.9790/0661-17254554 www.iosrjournals.org 47 | Page
Fig.1: Title and Description Tags
URL (Uniform Resource Locator): The URL to a document is displayed as part of a search result in search
engines, below the document's title and snippet. Search engines are good for crawling all types of URL
structures, even if they are quite complex, but spending the time to make the URLs as simple as possible for
both users and search engines can help. Fig.2 shows a sample URL.
Fig.2: URL
Anchor Text: Anchor text (See Fig.3) is the clickable text that users will see as a result of a link, and is placed
within the anchor tag <a href="..."></a>.This text tells search engines something about the page you're linking
to. It may be internal (pointing to other pages on the same site) or external (leading to content on other sites).
Fig.3: Anchor Text
2.4 Searching Techniques & Strategies
Several searching techniques and strategies are essential for narrowing search results and will assist
you with locating valuable resources via the Internet. These techniques and strategies include wildcard, plus and
minus signs, quotation marks and brackets, pipe symbol, Boolean operators, and nesting [8].
Wildcard: A wildcard is a special character that can be added to a phrase while searching and the search engine
or subject directory looks for all possible endings. The results will provide all possible documents in their
database that have those letters.
Plus and Minus Signs: The plus sign used before a keyword or phrase should retrieve results that include that
specific keyword or phrase. The minus sign used before a keyword or phrase should retrieve results that exclude
that specific keyword or phrase.
Quotation Marks and Brackets: Quotation marks and brackets assist with narrowing the search results from
the search tools. When they are used, the search engine will only retrieve documents that have those key terms
appearing together.
An Intelligent Meta Search Engine for Efficient Web Document Retrieval
DOI: 10.9790/0661-17254554 www.iosrjournals.org 48 | Page
Pipe Symbol (|): The pipe (|) symbol is used for narrowing down results within a broad category.
Boolean Operators: Working of AND operator is similar to the plus sign and the NOT operator is similar to the
minus sign. The OR operator tells the search engine to retrieve one term or the other.
Near: The NEAR phrase indicates to the search tools that those terms must be located within a certain number
of words. The results may vary depending on the search tool. To illustrate, some search tools may try to locate
the terms within 2, 10 or 25 words of each other. The command to use is NEAR/#.
Nesting: Nesting allows the user to perform complex search. Here the parentheses are used to group the key
words and Boolean operators together.
2.5 Challenges Faced By Search Engines
Search engines are the major tools for retrieving information from the web. But the list of retrieved
documents contains a high percentage of redundant and near document result. So there is the need to improve
the performance of search results. Most of the current search engine use data filtering algorithm which will
eliminate near duplicate and duplicate documents to save the users time and effort [9]. The identification of
near-duplicate pairs or similar documents in a large collection is a major problem with many applications.
The problem with most of the search engines are [5], most people don‟t take advantage of the offered
tools, but instead, they just type a few keywords for a query. Thus, search engines have to find a way how to
improve basic queries so that they can provide users successful research at the same time. Using a search
engine, an index is searched rather than the entire Web. An index is created and maintained by automated web
searching by spiders. Plain search engines prove to be very effective for certain types of search tasks, such as
retrieving of a particular URL and transactional queries. However, search engines can‟t address informational
queries, where the user has information that needs to be satisfied. A Meta Search Engine overcomes the above
by virtue of sending the user‟s query to a set of search engines, collects the data from them displays the relevant
records by using a clustering algorithm [10].
III. Implementation
The new Meta Search Engine has the following components: Meta Search Engine GUI, Query
Formulator, Metacrawler, Redundant URL Eliminator, Modified Ranking and Result Generation. The new
architecture is built by using four search engines Google, Yahoo, Bing and Ask. The proposed architecture is
shown in Fig.4. It shows the interaction between various components of the Meta Search Engine.
Fig.4: Meta Search Engine Architecture
An Intelligent Meta Search Engine for Efficient Web Document Retrieval
DOI: 10.9790/0661-17254554 www.iosrjournals.org 49 | Page
3.1 Meta Search Engine GUI
In the proposed architecture, user gives the keyword to be searched through the Meta Search Engine
Graphical User Interface (GUI). Here user has the choice to select the search engines he wants to search
(See Fig.5). The query is the given to all selected search engines. The results corresponding to the query are then
stored into the web database. The new Meta Search Engine retrieves important results from four different
traditional search engines available. The retrieved results are then extracted by using the crawler. We have
developed this application using Adobe Dreamweaver, using PHP for front end designing and My SQL Server
as back end. We have used XAMPP Apache distribution to link front end and back end in our application.
Fig.5: Meta Search Engine GUI
3.2 Query Formulator
As different search engines follow different styles for the representation of the query search string,
different search query strings are generated for a given user input. The query strings are then sent to various
search engines to extract desired results from the search engines. The query string formats for each of the search
engines are given below.
For Google: http://www.google.co.in/search?q=sword
For Yahoo: http://in.search.yahoo.com/search;_ylt=?p=sword
For Bing: http://www.bing.com/search?q=sword
For Ask: http://www.ask.com/web?q=sword
Here sword is the user input. For example if we want to search “data mining” then the query string
corresponding to this for Google is:
http://www.google.co.in/search?q=data+mining.
In the new Meta Search Engine we are focusing in crawling the first result page suggested by every search
engine. Because important and highly relevant results are located in the first page of the results and majority of
the users are focusing on those pages.
3.3 Metacrawler
The web crawler used in the new Meta Search Engine is termed as metacrawler. Because it helps to
crawl web pages through multiple search engines. We developed this using PHP curl. Curl is a library that helps
us to make HTTP requests in PHP. Using curl we can set the various parameters for HTTP requests like user
agent, follow location details, HTTP version, header details, certificate verification details etc. It will help to
handle authentication. The metacrawler will crawl through each and every search engine to find the web pages
relevant to a query string. When crawling a web page it will remove scripts, style sheets and other information
from the web page source code. It will only extract html code. Then it will convert the HTML code into DOM
(Document Object Model) structure.
Compared with traditional plain text, a Web page has more structure. Web pages are also regarded as
semi-structured data, which are represented as a DOM structure. The DOM structure of a Web page is a tree
structure, in which every HTML tag in the page represents a node in the DOM tree. The Web page can be
segmented by some predefined structural tags. Useful tags include <P> (paragraph), <TABLE> (table), <UL>
(list), <H1>-<H6> (heading), etc. Thus the DOM structure can be used to facilitate information extraction. From
the DOM structure it will extract the document or web page URL, title and description. The extracted details are
the stored to the web database.
An Intelligent Meta Search Engine for Efficient Web Document Retrieval
DOI: 10.9790/0661-17254554 www.iosrjournals.org 50 | Page
3.4 Redundant URL Eliminator
Since the metacrawler combines the results from different search engines, there is a chance for the link
suggested by one search engine will also suggested by another search engines. Here the redundant URL
eliminator component will check whether an extracted link is a duplicate one by looking if it is already in the
web database URL list. If the URL is already present in the web database it is duplicate, so it will not add into
the result page.
As soon as results are available from one search engine, they are displayed on the screen to view. The
results are therefore that much faster, and while you are down through the list of hits, other results are being
added to the page even as you view. The new results found will be added to result page only after checking if it
is duplicate or not. To check this for each new link suggested, the system will find its domain name. If this
domain name matches with the domain name of another link in the same result set then the previous one will
kept and the later one will avoid. So here duplicate link elimination is not only on the basis of URL similarities
but also on the basis of domain a name matches. It will help to optimize the results.
3.5 Modified Ranking
Ranking is used for ranking or ordering the web pages based on their relevance. This assigns a score to
each link in the result set. The system is based on two phase ranking. First phase is based on the domain name of
each web page. Second phase is based on outgoing and incoming link counts of each web page.
Phase 1: First the system will find the domain name of each link in the result set. Then it will calculate the
count of each domain name in the result set. Then arrange the links in decreasing order of their domain name
counts in the result set.
Phase 2: In this phase a web page is assigned a rank based on its domain name count and its popularity. A web
page that has more incoming and outgoing links is considered as more popular and relevant. So that phase 1
follows a modified ranking which calculate a rank for each web page based on this.
Algorithm: Modified Ranking
Input: Result Set R which contains links from phase 1
Output: Ordered list of Result Set R
1. Start
2. Initialise the score of all links to their domain name count in the result set R
3. for each link Li in result set R
4. Crawl web page corresponding to link Li
5. Find all its outgoing links and store them into list OL
6. for each link Lj in OL
7. if Lj is in R then
8. Increment score of Li by 1
9. Increment score of Lj by 1
10. end if
11. end for
12. end for
13. Order links in R in the decreasing order of their scores
14. return R to result page
15. End
The above algorithm calculates the score of the web pages based on their outgoing links. The incoming
links of a web page are included in the scoring when outgoing links of another web page is calculated. For
example if link to the web page A is included in web page B, then A is the outgoing link of B and B is the
incoming link of A.
3.6 Results Generation
Alternatively the ordering process is performed and the ranked list is displayed as a single list with
most relevant links on the top of the result window. The result will contain the extracted title of the web page,
URL, and description. The relevancy of the pages is calculated based on input query and the web pages are
ranked using the modified ranking algorithm. The pages after ranking are displayed to the user. The top links are
the links with highest rank. When moving from top to bottom the rank of web pages is decreasing. Fig.6 shows
the ranked results for the search query „Sachin‟. It gives the details of web pages suggested by each of the four
search engines Google, Yahoo, Bing and Ask. The results shows the search results merged from the first result
page given by each of the search engines. The Meta Search Engine also supports searches other than web
search. Fig.7 shows the image search results, Fig.8 shows the video search results and Fig.9 shows the latest
news search results.
An Intelligent Meta Search Engine for Efficient Web Document Retrieval
DOI: 10.9790/0661-17254554 www.iosrjournals.org 51 | Page
Fig.6: Web Search Results
Fig.7: Image Search Results
An Intelligent Meta Search Engine for Efficient Web Document Retrieval
DOI: 10.9790/0661-17254554 www.iosrjournals.org 52 | Page
Fig.8: Video Search Results
Fig.9: Latest News Search Results
An Intelligent Meta Search Engine for Efficient Web Document Retrieval
DOI: 10.9790/0661-17254554 www.iosrjournals.org 53 | Page
IV. Results
The two basic measures for checking the effectiveness of the search engines in document retrieval are
precision and recall.
Precision: This is the percentage of retrieved documents that are in fact relevant to the query. It is defined as:
Precision =
Relevant ∩ Retrieved
| Retrieved |
Recall: This is the percentage of documents that are relevant to the given query and were, in fact, retrieved. It is
defined as:
Recall=
Relevant ∩ Retrieved
| Relevant |
The total number of relevant documents retrieved by executing ten different queries on the traditional
search engines and the new Meta Search Engine, with the average precision value is given in Table 1. It
represents the performance of the proposed Meta Search Engine in terms of precision.
Table 1: Relevant Documents Retrieved
Query
Number
Search Engines
Google Yahoo Ask Bing Meta Search Engine
1 7 5 5 6 13
2 9 7 6 7 15
3 8 7 5 6 16
4 5 5 5 4 12
5 7 6 5 5 15
6 8 6 6 6 11
7 7 6 5 6 14
8 7 7 6 6 16
9 6 6 4 5 14
10 6 5 5 6 12
Average
Precision
0.93 0.88 0.83 0.87 0.96
The Fig.10 shows the comparison between the precision of different search engines. From the results
we can see that the new Meta Search Engine has a higher precision value when compared to the search engines
Google, Yahoo, Ask and Bing. So we conclude that the Meta Search Engine is more efficient in relevant
document retrieval and its performance is better than the performance of individual search engines.
Fig.10: Precision
An Intelligent Meta Search Engine for Efficient Web Document Retrieval
DOI: 10.9790/0661-17254554 www.iosrjournals.org 54 | Page
V. Advantages
The proposed Meta Search Engine has many advantages compared to other search engines. It extends
the search coverage of the topic and allows more web pages to be found. It can utilize some of the specific
functionality of the search engines that they employ. It searches all the major search engines at once so it
reduces the work of users from having to separately type in different search engines to look for resources. It is
easy to use and similar format to other search engines. It supports and utilizes the searching syntax of each
search engine, including booleans, phrase searching, pluses and minuses, wildcards, field searching, etc. It can
consolidate and remove duplicates from the results. It is customizable for user's preferences. It use the indexes
created by other search engines, combining and post processing results in unique way. Using this more results
can be retrieved with the same effort. Overall it provides improved recall and precision.
VI. Conclusion
WWW search engines have a number of deficiencies including: periods of downtime, low coverage of
the WWW, inconsistent and inefficient user interfaces, out of date databases, poor relevancy ranking and
precision, and difficulties with spamming techniques. In this paper a new Meta Search Engine has been
introduced which address some of these and other difficulties in searching the WWW. It is understood that Meta
Search Engine retrieves efficient results than any Search Engine and its performance also improves factors like
recall and precision. It will combine and rank the results in an efficient way using the modified page rank
algorithm. It is powerful because one search can highlight strengths of a number of top search engines such as
Google, Yahoo, Bing and Ask.
References
[1]. What Is A Search Engine? Digitallearn.Org a PLA Initiative, http://digitallearn.org/sites/default/files/class/Basic%20Search/supple
mental-materials/basicsearchhandout.pdf
[2]. Subarna Kumar Das, “Role Of Meta Search Engines In Web- Based Information System: Fundamentals And Challenges”, 4th
Convention Planner -2006, Mizoram Univ, Aizawl, 09-10 November, 2006
[3]. D.Minnie, S.Srinivasan, “Meta Search Engine With An Intelligent Interface For Information Retrieval On Multiple Domains”,
International Journal of Computer Science, Engineering and Information Technology (IJCSEIT), Vol.1, No.4, October 2011
[4]. Minky Jindal , Nisha Kharb, “K-Means Clustering Technique On Search Engine Dataset Using Data Mining Tool”, International
Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 6 (2013), pp. 505-510
[5]. Sonali Kushwah, Avdhesh Singh, Indu Gupta, “Analysis on Meta Search Engine”
[6]. Trupti V. Udapure, Ravindra D. Kale, Rajesh C. Dharmik, “Study Of Web Crawler And Its Different Types”, IOSR Journal of
Computer Engineering (IOSR-JCE) e-Volume 16, Issue 1, Ver. VI (Feb. 2014), PP 01-05
[7]. Google Search Engine Optimization Starter Guide, http://static.googleusercontent.com/media/www.google.com/en//webmasters/doc
s/search-engine-optimization-starter-guide.pdf
[8]. Kimber ly McCoy , “Search Engines, Subject Directories, And Meta-Search Engines”, Winter 2000 The OLRC News Volume 5,
No: 1
[9]. Bassma S.Alsulami, Maysoon F. Abulkhair, Fathy E. Eassa,“Near Duplicate Document Detection Survey”, International Journal of
Computer Science & Communication Networks,Vol 2(2), 147-151
[10]. K R Remesh Babu, AP Arya, “Design of a Metacrawler for Web Document Retrieval”, 12th International Conference on
Intelligent Systems Design and Applications (ISDA), 27-29 Nov. 2012, pp.478-48

Recomendados

TEXT ANALYZER por
TEXT ANALYZER TEXT ANALYZER
TEXT ANALYZER ijcseit
10 vistas11 diapositivas
Effective Searching Policies for Web Crawler por
Effective Searching Policies for Web CrawlerEffective Searching Policies for Web Crawler
Effective Searching Policies for Web CrawlerIJMER
388 vistas3 diapositivas
BLOSEN: BLOG SEARCH ENGINE BASED ON POST CONCEPT CLUSTERING por
BLOSEN: BLOG SEARCH ENGINE BASED ON POST CONCEPT CLUSTERINGBLOSEN: BLOG SEARCH ENGINE BASED ON POST CONCEPT CLUSTERING
BLOSEN: BLOG SEARCH ENGINE BASED ON POST CONCEPT CLUSTERINGijasa
1.1K vistas12 diapositivas
Search engine and web crawler por
Search engine and web crawlerSearch engine and web crawler
Search engine and web crawlervinay arora
2.5K vistas33 diapositivas
Meta search engine por
Meta search engineMeta search engine
Meta search engineOECLIB Odisha Electronics Control Library
2.7K vistas14 diapositivas
Internet Tutorial 03 por
Internet  Tutorial 03Internet  Tutorial 03
Internet Tutorial 03dpd
1.2K vistas54 diapositivas

Más contenido relacionado

La actualidad más candente

Design Issues for Search Engines and Web Crawlers: A Review por
Design Issues for Search Engines and Web Crawlers: A ReviewDesign Issues for Search Engines and Web Crawlers: A Review
Design Issues for Search Engines and Web Crawlers: A ReviewIOSR Journals
406 vistas4 diapositivas
IJCER (www.ijceronline.com) International Journal of computational Engineerin... por
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
233 vistas8 diapositivas
Focused web crawling using named entity recognition for narrow domains por
Focused web crawling using named entity recognition for narrow domainsFocused web crawling using named entity recognition for narrow domains
Focused web crawling using named entity recognition for narrow domainseSAT Publishing House
464 vistas9 diapositivas
Quest Trail: An Effective Approach for Construction of Personalized Search En... por
Quest Trail: An Effective Approach for Construction of Personalized Search En...Quest Trail: An Effective Approach for Construction of Personalized Search En...
Quest Trail: An Effective Approach for Construction of Personalized Search En...Editor IJCATR
302 vistas6 diapositivas
Search engine and web crawler por
Search engine and web crawlerSearch engine and web crawler
Search engine and web crawlerishmecse13
3.5K vistas21 diapositivas
A Two Stage Crawler on Web Search using Site Ranker for Adaptive Learning por
A Two Stage Crawler on Web Search using Site Ranker for Adaptive LearningA Two Stage Crawler on Web Search using Site Ranker for Adaptive Learning
A Two Stage Crawler on Web Search using Site Ranker for Adaptive LearningIJMTST Journal
29 vistas4 diapositivas

La actualidad más candente(18)

Design Issues for Search Engines and Web Crawlers: A Review por IOSR Journals
Design Issues for Search Engines and Web Crawlers: A ReviewDesign Issues for Search Engines and Web Crawlers: A Review
Design Issues for Search Engines and Web Crawlers: A Review
IOSR Journals406 vistas
IJCER (www.ijceronline.com) International Journal of computational Engineerin... por ijceronline
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
ijceronline233 vistas
Focused web crawling using named entity recognition for narrow domains por eSAT Publishing House
Focused web crawling using named entity recognition for narrow domainsFocused web crawling using named entity recognition for narrow domains
Focused web crawling using named entity recognition for narrow domains
Quest Trail: An Effective Approach for Construction of Personalized Search En... por Editor IJCATR
Quest Trail: An Effective Approach for Construction of Personalized Search En...Quest Trail: An Effective Approach for Construction of Personalized Search En...
Quest Trail: An Effective Approach for Construction of Personalized Search En...
Editor IJCATR302 vistas
Search engine and web crawler por ishmecse13
Search engine and web crawlerSearch engine and web crawler
Search engine and web crawler
ishmecse133.5K vistas
A Two Stage Crawler on Web Search using Site Ranker for Adaptive Learning por IJMTST Journal
A Two Stage Crawler on Web Search using Site Ranker for Adaptive LearningA Two Stage Crawler on Web Search using Site Ranker for Adaptive Learning
A Two Stage Crawler on Web Search using Site Ranker for Adaptive Learning
IJMTST Journal29 vistas
Search Engines and its working por Mukesh Kumar
Search Engines and its workingSearch Engines and its working
Search Engines and its working
Mukesh Kumar1.4K vistas
Search engines powerpoint por vbaker2210
Search engines powerpointSearch engines powerpoint
Search engines powerpoint
vbaker221012.5K vistas
Data mining in web search engine optimization por BookStoreLib
Data mining in web search engine optimizationData mining in web search engine optimization
Data mining in web search engine optimization
BookStoreLib215 vistas
Classification of search_engine por BookStoreLib
Classification of search_engineClassification of search_engine
Classification of search_engine
BookStoreLib219 vistas
IRJET-Deep Web Crawling Efficiently using Dynamic Focused Web Crawler por IRJET Journal
IRJET-Deep Web Crawling Efficiently using Dynamic Focused Web CrawlerIRJET-Deep Web Crawling Efficiently using Dynamic Focused Web Crawler
IRJET-Deep Web Crawling Efficiently using Dynamic Focused Web Crawler
IRJET Journal28 vistas
HIGWGET-A Model for Crawling Secure Hidden WebPages por ijdkp
HIGWGET-A Model for Crawling Secure Hidden WebPagesHIGWGET-A Model for Crawling Secure Hidden WebPages
HIGWGET-A Model for Crawling Secure Hidden WebPages
ijdkp801 vistas
Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine. por iosrjce
Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.
Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.
iosrjce490 vistas

Destacado

Supervised WSD Using Master- Slave Voting Technique por
Supervised WSD Using Master- Slave Voting TechniqueSupervised WSD Using Master- Slave Voting Technique
Supervised WSD Using Master- Slave Voting Techniqueiosrjce
183 vistas9 diapositivas
Chemical Reaction on Heat and Mass TransferFlow through an Infinite Inclined ... por
Chemical Reaction on Heat and Mass TransferFlow through an Infinite Inclined ...Chemical Reaction on Heat and Mass TransferFlow through an Infinite Inclined ...
Chemical Reaction on Heat and Mass TransferFlow through an Infinite Inclined ...iosrjce
194 vistas5 diapositivas
Riemann Sequence based SLM with nonlinear effects of HPA por
Riemann Sequence based SLM with nonlinear effects of HPARiemann Sequence based SLM with nonlinear effects of HPA
Riemann Sequence based SLM with nonlinear effects of HPAiosrjce
213 vistas7 diapositivas
Crude Oil Fractions in the Environment: A Comparative Study of Agbada Communi... por
Crude Oil Fractions in the Environment: A Comparative Study of Agbada Communi...Crude Oil Fractions in the Environment: A Comparative Study of Agbada Communi...
Crude Oil Fractions in the Environment: A Comparative Study of Agbada Communi...iosrjce
223 vistas11 diapositivas
Methods Migration from On-premise to Cloud por
Methods Migration from On-premise to CloudMethods Migration from On-premise to Cloud
Methods Migration from On-premise to Cloudiosrjce
267 vistas8 diapositivas
Adaptive Steganography Based Enhanced Cipher Hiding Technique for Secure Data... por
Adaptive Steganography Based Enhanced Cipher Hiding Technique for Secure Data...Adaptive Steganography Based Enhanced Cipher Hiding Technique for Secure Data...
Adaptive Steganography Based Enhanced Cipher Hiding Technique for Secure Data...iosrjce
157 vistas6 diapositivas

Destacado(20)

Supervised WSD Using Master- Slave Voting Technique por iosrjce
Supervised WSD Using Master- Slave Voting TechniqueSupervised WSD Using Master- Slave Voting Technique
Supervised WSD Using Master- Slave Voting Technique
iosrjce183 vistas
Chemical Reaction on Heat and Mass TransferFlow through an Infinite Inclined ... por iosrjce
Chemical Reaction on Heat and Mass TransferFlow through an Infinite Inclined ...Chemical Reaction on Heat and Mass TransferFlow through an Infinite Inclined ...
Chemical Reaction on Heat and Mass TransferFlow through an Infinite Inclined ...
iosrjce194 vistas
Riemann Sequence based SLM with nonlinear effects of HPA por iosrjce
Riemann Sequence based SLM with nonlinear effects of HPARiemann Sequence based SLM with nonlinear effects of HPA
Riemann Sequence based SLM with nonlinear effects of HPA
iosrjce213 vistas
Crude Oil Fractions in the Environment: A Comparative Study of Agbada Communi... por iosrjce
Crude Oil Fractions in the Environment: A Comparative Study of Agbada Communi...Crude Oil Fractions in the Environment: A Comparative Study of Agbada Communi...
Crude Oil Fractions in the Environment: A Comparative Study of Agbada Communi...
iosrjce223 vistas
Methods Migration from On-premise to Cloud por iosrjce
Methods Migration from On-premise to CloudMethods Migration from On-premise to Cloud
Methods Migration from On-premise to Cloud
iosrjce267 vistas
Adaptive Steganography Based Enhanced Cipher Hiding Technique for Secure Data... por iosrjce
Adaptive Steganography Based Enhanced Cipher Hiding Technique for Secure Data...Adaptive Steganography Based Enhanced Cipher Hiding Technique for Secure Data...
Adaptive Steganography Based Enhanced Cipher Hiding Technique for Secure Data...
iosrjce157 vistas
A Review on Evolution and Versioning of Ontology Based Information Systems por iosrjce
A Review on Evolution and Versioning of Ontology Based Information SystemsA Review on Evolution and Versioning of Ontology Based Information Systems
A Review on Evolution and Versioning of Ontology Based Information Systems
iosrjce152 vistas
Inherent flame retardant polyester fibre using organophosporous compounds as ... por iosrjce
Inherent flame retardant polyester fibre using organophosporous compounds as ...Inherent flame retardant polyester fibre using organophosporous compounds as ...
Inherent flame retardant polyester fibre using organophosporous compounds as ...
iosrjce253 vistas
Audio Steganography Using Discrete Wavelet Transformation (DWT) & Discrete Co... por iosrjce
Audio Steganography Using Discrete Wavelet Transformation (DWT) & Discrete Co...Audio Steganography Using Discrete Wavelet Transformation (DWT) & Discrete Co...
Audio Steganography Using Discrete Wavelet Transformation (DWT) & Discrete Co...
iosrjce491 vistas
Soil lixiviation and slow release pattern of starch-nano sliver particles-enc... por iosrjce
Soil lixiviation and slow release pattern of starch-nano sliver particles-enc...Soil lixiviation and slow release pattern of starch-nano sliver particles-enc...
Soil lixiviation and slow release pattern of starch-nano sliver particles-enc...
iosrjce233 vistas
Prevalence of Intestinal Helminths and Protozoa Parasites of Ruminants in Min... por iosrjce
Prevalence of Intestinal Helminths and Protozoa Parasites of Ruminants in Min...Prevalence of Intestinal Helminths and Protozoa Parasites of Ruminants in Min...
Prevalence of Intestinal Helminths and Protozoa Parasites of Ruminants in Min...
iosrjce324 vistas
Strength of Bamboo Leaf Ash and Pulverized Burnt Clay Waste Blended Cement Co... por iosrjce
Strength of Bamboo Leaf Ash and Pulverized Burnt Clay Waste Blended Cement Co...Strength of Bamboo Leaf Ash and Pulverized Burnt Clay Waste Blended Cement Co...
Strength of Bamboo Leaf Ash and Pulverized Burnt Clay Waste Blended Cement Co...
iosrjce771 vistas
Monitoring of Selected Heavy Metals Uptake by Plant around Fagbohun Dumpsite,... por iosrjce
Monitoring of Selected Heavy Metals Uptake by Plant around Fagbohun Dumpsite,...Monitoring of Selected Heavy Metals Uptake by Plant around Fagbohun Dumpsite,...
Monitoring of Selected Heavy Metals Uptake by Plant around Fagbohun Dumpsite,...
iosrjce279 vistas
Lossless Image Compression Using Data Folding Followed By Arithmetic Coding por iosrjce
Lossless Image Compression Using Data Folding Followed By Arithmetic CodingLossless Image Compression Using Data Folding Followed By Arithmetic Coding
Lossless Image Compression Using Data Folding Followed By Arithmetic Coding
iosrjce187 vistas
Effcet of Vernonia Amygdalina.Del Ethanolic Extract Fraction on Serum Prolact... por iosrjce
Effcet of Vernonia Amygdalina.Del Ethanolic Extract Fraction on Serum Prolact...Effcet of Vernonia Amygdalina.Del Ethanolic Extract Fraction on Serum Prolact...
Effcet of Vernonia Amygdalina.Del Ethanolic Extract Fraction on Serum Prolact...
iosrjce262 vistas
The Interaction Forces between Two non-Identical Cylinders Spinning around th... por iosrjce
The Interaction Forces between Two non-Identical Cylinders Spinning around th...The Interaction Forces between Two non-Identical Cylinders Spinning around th...
The Interaction Forces between Two non-Identical Cylinders Spinning around th...
iosrjce98 vistas
Auto Pilot Controlled Flying Wing (UAV) For QRF (Quick Reaction Armed Forces) por iosrjce
Auto Pilot Controlled Flying Wing (UAV) For QRF (Quick Reaction Armed Forces)Auto Pilot Controlled Flying Wing (UAV) For QRF (Quick Reaction Armed Forces)
Auto Pilot Controlled Flying Wing (UAV) For QRF (Quick Reaction Armed Forces)
iosrjce854 vistas
TemboStatus - how to recruit the employees you already have por TemboStatus
TemboStatus -  how to recruit the employees you already haveTemboStatus -  how to recruit the employees you already have
TemboStatus - how to recruit the employees you already have
TemboStatus443 vistas
Continuous Testing of Service-Oriented Applications Using Service Virtualization por iosrjce
Continuous Testing of Service-Oriented Applications Using Service VirtualizationContinuous Testing of Service-Oriented Applications Using Service Virtualization
Continuous Testing of Service-Oriented Applications Using Service Virtualization
iosrjce176 vistas
Polyesters From Chemically Modified Oil of Renewable Source por iosrjce
Polyesters From Chemically Modified Oil of Renewable SourcePolyesters From Chemically Modified Oil of Renewable Source
Polyesters From Chemically Modified Oil of Renewable Source
iosrjce315 vistas

Similar a An Intelligent Meta Search Engine for Efficient Web Document Retrieval

Search engine por
Search engineSearch engine
Search engineWasif Khan
116 vistas15 diapositivas
Notes for por
Notes forNotes for
Notes for9pallen
499 vistas4 diapositivas
E3602042044 por
E3602042044E3602042044
E3602042044ijceronline
223 vistas3 diapositivas
A Comparative Study Of The Results Produced By Search Engines por
A Comparative Study Of The Results Produced By Search EnginesA Comparative Study Of The Results Produced By Search Engines
A Comparative Study Of The Results Produced By Search EnginesJoaquin Hamad
3 vistas4 diapositivas
International conference On Computer Science And technology por
International conference On Computer Science And technologyInternational conference On Computer Science And technology
International conference On Computer Science And technologyanchalsinghdm
22 vistas8 diapositivas
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ... por
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...inventionjournals
36 vistas5 diapositivas

Similar a An Intelligent Meta Search Engine for Efficient Web Document Retrieval(20)

Search engine por Wasif Khan
Search engineSearch engine
Search engine
Wasif Khan116 vistas
Notes for por 9pallen
Notes forNotes for
Notes for
9pallen499 vistas
A Comparative Study Of The Results Produced By Search Engines por Joaquin Hamad
A Comparative Study Of The Results Produced By Search EnginesA Comparative Study Of The Results Produced By Search Engines
A Comparative Study Of The Results Produced By Search Engines
Joaquin Hamad3 vistas
International conference On Computer Science And technology por anchalsinghdm
International conference On Computer Science And technologyInternational conference On Computer Science And technology
International conference On Computer Science And technology
anchalsinghdm22 vistas
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ... por inventionjournals
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
inventionjournals36 vistas
Search Engine working, Crawlers working, Search Engine mechanism por Umang MIshra
Search Engine working, Crawlers working, Search Engine mechanismSearch Engine working, Crawlers working, Search Engine mechanism
Search Engine working, Crawlers working, Search Engine mechanism
Umang MIshra141 vistas
IRJET - Building Your Own Search Engine por IRJET Journal
IRJET -  	  Building Your Own Search EngineIRJET -  	  Building Your Own Search Engine
IRJET - Building Your Own Search Engine
IRJET Journal16 vistas
Introduction to internet. por Anish Thomas
Introduction to internet.Introduction to internet.
Introduction to internet.
Anish Thomas140 vistas
Search Engine Optimization - Aykut Aslantaş por Aykut Aslantaş
Search Engine Optimization - Aykut AslantaşSearch Engine Optimization - Aykut Aslantaş
Search Engine Optimization - Aykut Aslantaş
Aykut Aslantaş416 vistas
Search Engines Other than Google por Dr Trivedi
Search Engines Other than GoogleSearch Engines Other than Google
Search Engines Other than Google
Dr Trivedi172 vistas

Más de iosrjce

An Examination of Effectuation Dimension as Financing Practice of Small and M... por
An Examination of Effectuation Dimension as Financing Practice of Small and M...An Examination of Effectuation Dimension as Financing Practice of Small and M...
An Examination of Effectuation Dimension as Financing Practice of Small and M...iosrjce
1.4K vistas12 diapositivas
Does Goods and Services Tax (GST) Leads to Indian Economic Development? por
Does Goods and Services Tax (GST) Leads to Indian Economic Development?Does Goods and Services Tax (GST) Leads to Indian Economic Development?
Does Goods and Services Tax (GST) Leads to Indian Economic Development?iosrjce
1.7K vistas5 diapositivas
Childhood Factors that influence success in later life por
Childhood Factors that influence success in later lifeChildhood Factors that influence success in later life
Childhood Factors that influence success in later lifeiosrjce
858 vistas9 diapositivas
Emotional Intelligence and Work Performance Relationship: A Study on Sales Pe... por
Emotional Intelligence and Work Performance Relationship: A Study on Sales Pe...Emotional Intelligence and Work Performance Relationship: A Study on Sales Pe...
Emotional Intelligence and Work Performance Relationship: A Study on Sales Pe...iosrjce
1.1K vistas8 diapositivas
Customer’s Acceptance of Internet Banking in Dubai por
Customer’s Acceptance of Internet Banking in DubaiCustomer’s Acceptance of Internet Banking in Dubai
Customer’s Acceptance of Internet Banking in Dubaiiosrjce
1.6K vistas20 diapositivas
A Study of Employee Satisfaction relating to Job Security & Working Hours amo... por
A Study of Employee Satisfaction relating to Job Security & Working Hours amo...A Study of Employee Satisfaction relating to Job Security & Working Hours amo...
A Study of Employee Satisfaction relating to Job Security & Working Hours amo...iosrjce
684 vistas4 diapositivas

Más de iosrjce(20)

An Examination of Effectuation Dimension as Financing Practice of Small and M... por iosrjce
An Examination of Effectuation Dimension as Financing Practice of Small and M...An Examination of Effectuation Dimension as Financing Practice of Small and M...
An Examination of Effectuation Dimension as Financing Practice of Small and M...
iosrjce1.4K vistas
Does Goods and Services Tax (GST) Leads to Indian Economic Development? por iosrjce
Does Goods and Services Tax (GST) Leads to Indian Economic Development?Does Goods and Services Tax (GST) Leads to Indian Economic Development?
Does Goods and Services Tax (GST) Leads to Indian Economic Development?
iosrjce1.7K vistas
Childhood Factors that influence success in later life por iosrjce
Childhood Factors that influence success in later lifeChildhood Factors that influence success in later life
Childhood Factors that influence success in later life
iosrjce858 vistas
Emotional Intelligence and Work Performance Relationship: A Study on Sales Pe... por iosrjce
Emotional Intelligence and Work Performance Relationship: A Study on Sales Pe...Emotional Intelligence and Work Performance Relationship: A Study on Sales Pe...
Emotional Intelligence and Work Performance Relationship: A Study on Sales Pe...
iosrjce1.1K vistas
Customer’s Acceptance of Internet Banking in Dubai por iosrjce
Customer’s Acceptance of Internet Banking in DubaiCustomer’s Acceptance of Internet Banking in Dubai
Customer’s Acceptance of Internet Banking in Dubai
iosrjce1.6K vistas
A Study of Employee Satisfaction relating to Job Security & Working Hours amo... por iosrjce
A Study of Employee Satisfaction relating to Job Security & Working Hours amo...A Study of Employee Satisfaction relating to Job Security & Working Hours amo...
A Study of Employee Satisfaction relating to Job Security & Working Hours amo...
iosrjce684 vistas
Consumer Perspectives on Brand Preference: A Choice Based Model Approach por iosrjce
Consumer Perspectives on Brand Preference: A Choice Based Model ApproachConsumer Perspectives on Brand Preference: A Choice Based Model Approach
Consumer Perspectives on Brand Preference: A Choice Based Model Approach
iosrjce485 vistas
Student`S Approach towards Social Network Sites por iosrjce
Student`S Approach towards Social Network SitesStudent`S Approach towards Social Network Sites
Student`S Approach towards Social Network Sites
iosrjce358 vistas
Broadcast Management in Nigeria: The systems approach as an imperative por iosrjce
Broadcast Management in Nigeria: The systems approach as an imperativeBroadcast Management in Nigeria: The systems approach as an imperative
Broadcast Management in Nigeria: The systems approach as an imperative
iosrjce710 vistas
A Study on Retailer’s Perception on Soya Products with Special Reference to T... por iosrjce
A Study on Retailer’s Perception on Soya Products with Special Reference to T...A Study on Retailer’s Perception on Soya Products with Special Reference to T...
A Study on Retailer’s Perception on Soya Products with Special Reference to T...
iosrjce338 vistas
A Study Factors Influence on Organisation Citizenship Behaviour in Corporate ... por iosrjce
A Study Factors Influence on Organisation Citizenship Behaviour in Corporate ...A Study Factors Influence on Organisation Citizenship Behaviour in Corporate ...
A Study Factors Influence on Organisation Citizenship Behaviour in Corporate ...
iosrjce328 vistas
Consumers’ Behaviour on Sony Xperia: A Case Study on Bangladesh por iosrjce
Consumers’ Behaviour on Sony Xperia: A Case Study on BangladeshConsumers’ Behaviour on Sony Xperia: A Case Study on Bangladesh
Consumers’ Behaviour on Sony Xperia: A Case Study on Bangladesh
iosrjce547 vistas
Design of a Balanced Scorecard on Nonprofit Organizations (Study on Yayasan P... por iosrjce
Design of a Balanced Scorecard on Nonprofit Organizations (Study on Yayasan P...Design of a Balanced Scorecard on Nonprofit Organizations (Study on Yayasan P...
Design of a Balanced Scorecard on Nonprofit Organizations (Study on Yayasan P...
iosrjce412 vistas
Public Sector Reforms and Outsourcing Services in Nigeria: An Empirical Evalu... por iosrjce
Public Sector Reforms and Outsourcing Services in Nigeria: An Empirical Evalu...Public Sector Reforms and Outsourcing Services in Nigeria: An Empirical Evalu...
Public Sector Reforms and Outsourcing Services in Nigeria: An Empirical Evalu...
iosrjce481 vistas
Media Innovations and its Impact on Brand awareness & Consideration por iosrjce
Media Innovations and its Impact on Brand awareness & ConsiderationMedia Innovations and its Impact on Brand awareness & Consideration
Media Innovations and its Impact on Brand awareness & Consideration
iosrjce280 vistas
Customer experience in supermarkets and hypermarkets – A comparative study por iosrjce
Customer experience in supermarkets and hypermarkets – A comparative studyCustomer experience in supermarkets and hypermarkets – A comparative study
Customer experience in supermarkets and hypermarkets – A comparative study
iosrjce546 vistas
Social Media and Small Businesses: A Combinational Strategic Approach under t... por iosrjce
Social Media and Small Businesses: A Combinational Strategic Approach under t...Social Media and Small Businesses: A Combinational Strategic Approach under t...
Social Media and Small Businesses: A Combinational Strategic Approach under t...
iosrjce245 vistas
Secretarial Performance and the Gender Question (A Study of Selected Tertiary... por iosrjce
Secretarial Performance and the Gender Question (A Study of Selected Tertiary...Secretarial Performance and the Gender Question (A Study of Selected Tertiary...
Secretarial Performance and the Gender Question (A Study of Selected Tertiary...
iosrjce241 vistas
Implementation of Quality Management principles at Zimbabwe Open University (... por iosrjce
Implementation of Quality Management principles at Zimbabwe Open University (...Implementation of Quality Management principles at Zimbabwe Open University (...
Implementation of Quality Management principles at Zimbabwe Open University (...
iosrjce345 vistas
Organizational Conflicts Management In Selected Organizaions In Lagos State, ... por iosrjce
Organizational Conflicts Management In Selected Organizaions In Lagos State, ...Organizational Conflicts Management In Selected Organizaions In Lagos State, ...
Organizational Conflicts Management In Selected Organizaions In Lagos State, ...
iosrjce401 vistas

Último

K8S Roadmap.pdf por
K8S Roadmap.pdfK8S Roadmap.pdf
K8S Roadmap.pdfMaryamTavakkoli2
6 vistas1 diapositiva
Searching in Data Structure por
Searching in Data StructureSearching in Data Structure
Searching in Data Structureraghavbirla63
7 vistas8 diapositivas
SPICE PARK DEC2023 (6,625 SPICE Models) por
SPICE PARK DEC2023 (6,625 SPICE Models) SPICE PARK DEC2023 (6,625 SPICE Models)
SPICE PARK DEC2023 (6,625 SPICE Models) Tsuyoshi Horigome
24 vistas218 diapositivas
Final Year Presentation por
Final Year PresentationFinal Year Presentation
Final Year PresentationComsat Universal Islamabad Wah Campus
7 vistas29 diapositivas
Design of machine elements-UNIT 3.pptx por
Design of machine elements-UNIT 3.pptxDesign of machine elements-UNIT 3.pptx
Design of machine elements-UNIT 3.pptxgopinathcreddy
32 vistas31 diapositivas
Saikat Chakraborty Java Oracle Certificate.pdf por
Saikat Chakraborty Java Oracle Certificate.pdfSaikat Chakraborty Java Oracle Certificate.pdf
Saikat Chakraborty Java Oracle Certificate.pdfSaikatChakraborty787148
15 vistas1 diapositiva

Último(20)

SPICE PARK DEC2023 (6,625 SPICE Models) por Tsuyoshi Horigome
SPICE PARK DEC2023 (6,625 SPICE Models) SPICE PARK DEC2023 (6,625 SPICE Models)
SPICE PARK DEC2023 (6,625 SPICE Models)
Tsuyoshi Horigome24 vistas
Design of machine elements-UNIT 3.pptx por gopinathcreddy
Design of machine elements-UNIT 3.pptxDesign of machine elements-UNIT 3.pptx
Design of machine elements-UNIT 3.pptx
gopinathcreddy32 vistas
What is Whirling Hygrometer.pdf por IIT KHARAGPUR
What is Whirling Hygrometer.pdfWhat is Whirling Hygrometer.pdf
What is Whirling Hygrometer.pdf
IIT KHARAGPUR 12 vistas
Investigation of Physicochemical Changes of Soft Clay around Deep Geopolymer ... por AltinKaradagli
Investigation of Physicochemical Changes of Soft Clay around Deep Geopolymer ...Investigation of Physicochemical Changes of Soft Clay around Deep Geopolymer ...
Investigation of Physicochemical Changes of Soft Clay around Deep Geopolymer ...
AltinKaradagli9 vistas
2023Dec ASU Wang NETR Group Research Focus and Facility Overview.pptx por lwang78
2023Dec ASU Wang NETR Group Research Focus and Facility Overview.pptx2023Dec ASU Wang NETR Group Research Focus and Facility Overview.pptx
2023Dec ASU Wang NETR Group Research Focus and Facility Overview.pptx
lwang7853 vistas
MSA Website Slideshow (16).pdf por msaucla
MSA Website Slideshow (16).pdfMSA Website Slideshow (16).pdf
MSA Website Slideshow (16).pdf
msaucla68 vistas
DevOps to DevSecOps: Enhancing Software Security Throughout The Development L... por Anowar Hossain
DevOps to DevSecOps: Enhancing Software Security Throughout The Development L...DevOps to DevSecOps: Enhancing Software Security Throughout The Development L...
DevOps to DevSecOps: Enhancing Software Security Throughout The Development L...
Anowar Hossain13 vistas
Advances in micro milling: From tool fabrication to process outcomes por Shivendra Nandan
Advances in micro milling: From tool fabrication to process outcomesAdvances in micro milling: From tool fabrication to process outcomes
Advances in micro milling: From tool fabrication to process outcomes
Shivendra Nandan7 vistas
SUMIT SQL PROJECT SUPERSTORE 1.pptx por Sumit Jadhav
SUMIT SQL PROJECT SUPERSTORE 1.pptxSUMIT SQL PROJECT SUPERSTORE 1.pptx
SUMIT SQL PROJECT SUPERSTORE 1.pptx
Sumit Jadhav 13 vistas
NEW SUPPLIERS SUPPLIES (copie).pdf por georgesradjou
NEW SUPPLIERS SUPPLIES (copie).pdfNEW SUPPLIERS SUPPLIES (copie).pdf
NEW SUPPLIERS SUPPLIES (copie).pdf
georgesradjou15 vistas
Control Systems Feedback.pdf por LGGaming5
Control Systems Feedback.pdfControl Systems Feedback.pdf
Control Systems Feedback.pdf
LGGaming56 vistas
Instrumentation & Control Lab Manual.pdf por NTU Faisalabad
Instrumentation & Control Lab Manual.pdfInstrumentation & Control Lab Manual.pdf
Instrumentation & Control Lab Manual.pdf
NTU Faisalabad 5 vistas
fakenews_DBDA_Mar23.pptx por deepmitra8
fakenews_DBDA_Mar23.pptxfakenews_DBDA_Mar23.pptx
fakenews_DBDA_Mar23.pptx
deepmitra814 vistas

An Intelligent Meta Search Engine for Efficient Web Document Retrieval

  • 1. IOSR Journal of Computer Engineering (IOSR-JCE) e-ISSN: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 2, Ver. V (Mar – Apr. 2015), PP 45-54 www.iosrjournals.org DOI: 10.9790/0661-17254554 www.iosrjournals.org 45 | Page An Intelligent Meta Search Engine for Efficient Web Document Retrieval Keerthana.I.P 1 , Aby Abahai.T 2 1( Dept. of CSE, Mar Athanasius College of Engineering, Kothamangalam , Kerala, India) 2( Dept. of CSE, Assistant Professor, Mar Athanasius College of Engineering, Kothamangalam, Kerala, India) Abstract: In daily use of internet, when searching information we face lots of difficulty due to the rapid growth of Information Resources. This is because of the fact that a single search engine cannot index the entire web of resources. A Meta Search Engine is a solution for this, which submits the query to many other search engines and returns summary of the results. Therefore, the search results receive are an aggregate result of multiple searches. This strategy gives our search a boarder scope than searching a single search engine, but the results are not always better. Because the Meta Search Engine must use its own algorithm to choose the best results from multiple search engines. In this paper we proposed a new Meta Search Engine to overcome these drawbacks. It uses a new page rank algorithm called modified ranking for ranking and optimizing the search results in an efficient way. It is a two phase ranking algorithm used for ordering the web pages based on their relevance and popularity. This Meta Search Engine is developed in such a way that it will produce more efficient results than traditional search engines. Keywords: Metacrawler, Meta Search Engine, Ranking, Search Engine, Web Crawler I. Introduction The World Wide Web (WWW) is a collection of information linked together from all over the world. It is based on hypertext, which means that when the user is navigating on the ocean of information he can pick up an interesting word or expression within a text and request for more information about it. This can not apply to all words in a text, but only to those who have been properly designated as such by the producer of the information and which are displayed on screen example as underlined. The web is highly dynamic in nature i.e., new web pages are being added, some are removed and some others are modified. So it is very difficult to find relevant information from it. Search engines are used to extract valuable Information from the internet. A search engine [1] is a program that searches for and identifies items in a database that correspond to keywords or characters specified by the user, especially for finding particular sites on the WWW. On the Internet, a search engine is a coordinated set of programs that includes: A spider (also called a "crawler" or a "bot") that goes to every page or representative pages on every web site that wants to be searchable and reads it, uses hypertext links on each page to discover and read a site's other pages. Search engine includes program that creates a huge index (sometimes called a "catalog") from the pages that have been read. It also contains a program that receives our search request, compares them to the entries in the index and then returns results. A Meta Search Engine [2] is a search engine that queries other search engines and then combines the results that are received from all. Here, the user is not using just one search engine but a combination of many search engines at once to optimize web searching. Meta Search Engines don't create their own database of information. They search the other search engine‟s databases. The major advantage of a Meta Search Engine is that it allows the user to search several search engines simultaneously. So there is no need to search in each search engine separately. In this paper a new Meta Search Engine is proposed, which is capable of producing relevant results for particular query string. It will display the results from different search engines as soon as those found by a crawler. Finally it will rank each and every web page based on its popularity and relevance. For that, this paper proposes a new ranking algorithm. This paper is organized as follows: in section II, the works related to search engines is discussed. Section III describes the Meta Search Engine implementation. In section IV the performance analysis of the proposed Meta Search Engine is discussed. Section V describes the advantages of the proposed system. Finally the section VI gives the conclusion for the work. II. Related Works 2.1 Search Engines A search engine is designed to search WWW for information about the given search query and returns links to various documents in which the search query‟s keywords are found. There are mainly three types of search engines-Web Search Engines, Meta Search Engines, Vertical Search Engines [3]. A web search engine
  • 2. An Intelligent Meta Search Engine for Efficient Web Document Retrieval DOI: 10.9790/0661-17254554 www.iosrjournals.org 46 | Page searches for information in WWW. A Vertical Search provides the user with results for queries on a particular domain. Meta Search Engines send the user‟s search queries to various search engines and combine the search results. It is important for search engine to maintain a high quality websites. A database is to be made in which following attributes should be there like length of title, keywords in title, number of back-links, in-links, length of the URL, ranking. WEKA and Tanagra a data mining tools can be used to extract meaningful knowledge from this large set of data [4]. The search engines used in this paper for the Meta Search Engine construction are Google, Yahoo, Ask and Bing [5]. Google is a web search engine owned by Google Inc. it is the most widely used search engine on the Web. The main purpose of Google Search is to search for text in WebPages, as opposed to other data, such as with Image Search. Google was developed by Larry Page and Sergey Brin in 1997. Yahoo is a web search engine, owned by Yahoo! Inc. the 2nd largest search engine on the web by query volume. It was founded in January 1994 by Jerry Yang and David Filo. Ask is a Search Engine, which is also known as Ask Jeeves. It is basically designed to answer the user‟s Queries in the mode of Q&A and is proved to be a focused search engine. Ask was developed in 1996 by Garrett Gruener and David Warthen in Berkeley, California. Bing is a Search Engine, which was formerly known as Live Search and MSN Search. It is a web search engine that was owned by Microsoft. It went fully online on 2009 June 3, with a preview version released on June 1, 2009. 2.2 Web Crawlers All search engines internally use web crawlers to keep the copies of data a fresh. A search engine is divided into different modules. Among those modules crawler module is the module on which search engine relies the most because it helps to provide the best possible results to the search engine. Crawlers or spiders are small programs that browse the web on the search engine‟s behalf, similarly how a human user would follow links to reach different pages. Google crawlers run on a distributed network of thousands of low-cost computers and can therefore carry out fast parallel processing. This is the way Google returns results within fraction of seconds. It has three main components [6]: a frontier which stores the list of URL‟s to visit, a Page Downloader which download pages from WWW and Web Repository receives web pages from a crawler and stores it in the database. The repository stores only standard HTML pages. The working of a web crawler is as follows [6]: 1. Initializing the seed URL or URLs 2. Adding it to the frontier 3. Selecting the URL from the frontier 4. Fetching the web-page corresponding to that URLs 5. Parsing the retrieved page to extract the URLs 6. Adding all the unvisited links to the frontier 7. Again start with step 2 and repeat till the frontier is empty. Based on different strategies employed in web crawling there are four types of web crawlers: focused crawler, distributed crawler, incremental crawler, and parallel crawler [6]. A focused crawler is the Web crawler that tries to download pages which are related to each other. It collects documents which are specific and relevant to the given topic. An incremental crawler is a traditional crawler which refreshes its collection periodically and replaces the old documents with the newly downloaded documents. It also exchanges less important pages by new and more important pages. A distributed Crawler uses a distributed computing technique. Many crawlers are working to distribute in the process of web crawling to get most coverage of the web. A central server manages the communication and synchronization of the nodes, as it is geographically distributed. In the case of parallel crawler multiple crawlers are often run in parallel, which are termed as Parallel crawlers. A parallel crawler consists of multiple crawling Processes called as C-procs which can run on network of workstations. The Parallel crawlers depend on Page freshness and Page Selection. A Parallel crawler can be on local network or be distributed at geographically distant locations. Popular examples of web crawlers are Yahoo Slurp was the name of the Yahoo search crawler, Googlebot is the name of the Google search crawler, Bingbot is the name of Microsoft's Bing web crawler, etc. 2.3 Web Page Elements The main web page elements that will be extracted by a crawler are given below [7]. Title Tag: Each web page has a unique title tag, which helps search engines to know how the page is distinct from the others on the site. Titles can be both short and informative. If the title is too long, then search engines will show only a portion of it in the search result. Page titles are an important aspect of search engine optimization. Description Meta Tag: A page's description meta tag gives search engines a summary of what the page is about. A page's title may be a few words or a phrase, but a page's description meta tag might be a sentence or two or a short paragraph. Fig.1 shows an example for title and description tags.
  • 3. An Intelligent Meta Search Engine for Efficient Web Document Retrieval DOI: 10.9790/0661-17254554 www.iosrjournals.org 47 | Page Fig.1: Title and Description Tags URL (Uniform Resource Locator): The URL to a document is displayed as part of a search result in search engines, below the document's title and snippet. Search engines are good for crawling all types of URL structures, even if they are quite complex, but spending the time to make the URLs as simple as possible for both users and search engines can help. Fig.2 shows a sample URL. Fig.2: URL Anchor Text: Anchor text (See Fig.3) is the clickable text that users will see as a result of a link, and is placed within the anchor tag <a href="..."></a>.This text tells search engines something about the page you're linking to. It may be internal (pointing to other pages on the same site) or external (leading to content on other sites). Fig.3: Anchor Text 2.4 Searching Techniques & Strategies Several searching techniques and strategies are essential for narrowing search results and will assist you with locating valuable resources via the Internet. These techniques and strategies include wildcard, plus and minus signs, quotation marks and brackets, pipe symbol, Boolean operators, and nesting [8]. Wildcard: A wildcard is a special character that can be added to a phrase while searching and the search engine or subject directory looks for all possible endings. The results will provide all possible documents in their database that have those letters. Plus and Minus Signs: The plus sign used before a keyword or phrase should retrieve results that include that specific keyword or phrase. The minus sign used before a keyword or phrase should retrieve results that exclude that specific keyword or phrase. Quotation Marks and Brackets: Quotation marks and brackets assist with narrowing the search results from the search tools. When they are used, the search engine will only retrieve documents that have those key terms appearing together.
  • 4. An Intelligent Meta Search Engine for Efficient Web Document Retrieval DOI: 10.9790/0661-17254554 www.iosrjournals.org 48 | Page Pipe Symbol (|): The pipe (|) symbol is used for narrowing down results within a broad category. Boolean Operators: Working of AND operator is similar to the plus sign and the NOT operator is similar to the minus sign. The OR operator tells the search engine to retrieve one term or the other. Near: The NEAR phrase indicates to the search tools that those terms must be located within a certain number of words. The results may vary depending on the search tool. To illustrate, some search tools may try to locate the terms within 2, 10 or 25 words of each other. The command to use is NEAR/#. Nesting: Nesting allows the user to perform complex search. Here the parentheses are used to group the key words and Boolean operators together. 2.5 Challenges Faced By Search Engines Search engines are the major tools for retrieving information from the web. But the list of retrieved documents contains a high percentage of redundant and near document result. So there is the need to improve the performance of search results. Most of the current search engine use data filtering algorithm which will eliminate near duplicate and duplicate documents to save the users time and effort [9]. The identification of near-duplicate pairs or similar documents in a large collection is a major problem with many applications. The problem with most of the search engines are [5], most people don‟t take advantage of the offered tools, but instead, they just type a few keywords for a query. Thus, search engines have to find a way how to improve basic queries so that they can provide users successful research at the same time. Using a search engine, an index is searched rather than the entire Web. An index is created and maintained by automated web searching by spiders. Plain search engines prove to be very effective for certain types of search tasks, such as retrieving of a particular URL and transactional queries. However, search engines can‟t address informational queries, where the user has information that needs to be satisfied. A Meta Search Engine overcomes the above by virtue of sending the user‟s query to a set of search engines, collects the data from them displays the relevant records by using a clustering algorithm [10]. III. Implementation The new Meta Search Engine has the following components: Meta Search Engine GUI, Query Formulator, Metacrawler, Redundant URL Eliminator, Modified Ranking and Result Generation. The new architecture is built by using four search engines Google, Yahoo, Bing and Ask. The proposed architecture is shown in Fig.4. It shows the interaction between various components of the Meta Search Engine. Fig.4: Meta Search Engine Architecture
  • 5. An Intelligent Meta Search Engine for Efficient Web Document Retrieval DOI: 10.9790/0661-17254554 www.iosrjournals.org 49 | Page 3.1 Meta Search Engine GUI In the proposed architecture, user gives the keyword to be searched through the Meta Search Engine Graphical User Interface (GUI). Here user has the choice to select the search engines he wants to search (See Fig.5). The query is the given to all selected search engines. The results corresponding to the query are then stored into the web database. The new Meta Search Engine retrieves important results from four different traditional search engines available. The retrieved results are then extracted by using the crawler. We have developed this application using Adobe Dreamweaver, using PHP for front end designing and My SQL Server as back end. We have used XAMPP Apache distribution to link front end and back end in our application. Fig.5: Meta Search Engine GUI 3.2 Query Formulator As different search engines follow different styles for the representation of the query search string, different search query strings are generated for a given user input. The query strings are then sent to various search engines to extract desired results from the search engines. The query string formats for each of the search engines are given below. For Google: http://www.google.co.in/search?q=sword For Yahoo: http://in.search.yahoo.com/search;_ylt=?p=sword For Bing: http://www.bing.com/search?q=sword For Ask: http://www.ask.com/web?q=sword Here sword is the user input. For example if we want to search “data mining” then the query string corresponding to this for Google is: http://www.google.co.in/search?q=data+mining. In the new Meta Search Engine we are focusing in crawling the first result page suggested by every search engine. Because important and highly relevant results are located in the first page of the results and majority of the users are focusing on those pages. 3.3 Metacrawler The web crawler used in the new Meta Search Engine is termed as metacrawler. Because it helps to crawl web pages through multiple search engines. We developed this using PHP curl. Curl is a library that helps us to make HTTP requests in PHP. Using curl we can set the various parameters for HTTP requests like user agent, follow location details, HTTP version, header details, certificate verification details etc. It will help to handle authentication. The metacrawler will crawl through each and every search engine to find the web pages relevant to a query string. When crawling a web page it will remove scripts, style sheets and other information from the web page source code. It will only extract html code. Then it will convert the HTML code into DOM (Document Object Model) structure. Compared with traditional plain text, a Web page has more structure. Web pages are also regarded as semi-structured data, which are represented as a DOM structure. The DOM structure of a Web page is a tree structure, in which every HTML tag in the page represents a node in the DOM tree. The Web page can be segmented by some predefined structural tags. Useful tags include <P> (paragraph), <TABLE> (table), <UL> (list), <H1>-<H6> (heading), etc. Thus the DOM structure can be used to facilitate information extraction. From the DOM structure it will extract the document or web page URL, title and description. The extracted details are the stored to the web database.
  • 6. An Intelligent Meta Search Engine for Efficient Web Document Retrieval DOI: 10.9790/0661-17254554 www.iosrjournals.org 50 | Page 3.4 Redundant URL Eliminator Since the metacrawler combines the results from different search engines, there is a chance for the link suggested by one search engine will also suggested by another search engines. Here the redundant URL eliminator component will check whether an extracted link is a duplicate one by looking if it is already in the web database URL list. If the URL is already present in the web database it is duplicate, so it will not add into the result page. As soon as results are available from one search engine, they are displayed on the screen to view. The results are therefore that much faster, and while you are down through the list of hits, other results are being added to the page even as you view. The new results found will be added to result page only after checking if it is duplicate or not. To check this for each new link suggested, the system will find its domain name. If this domain name matches with the domain name of another link in the same result set then the previous one will kept and the later one will avoid. So here duplicate link elimination is not only on the basis of URL similarities but also on the basis of domain a name matches. It will help to optimize the results. 3.5 Modified Ranking Ranking is used for ranking or ordering the web pages based on their relevance. This assigns a score to each link in the result set. The system is based on two phase ranking. First phase is based on the domain name of each web page. Second phase is based on outgoing and incoming link counts of each web page. Phase 1: First the system will find the domain name of each link in the result set. Then it will calculate the count of each domain name in the result set. Then arrange the links in decreasing order of their domain name counts in the result set. Phase 2: In this phase a web page is assigned a rank based on its domain name count and its popularity. A web page that has more incoming and outgoing links is considered as more popular and relevant. So that phase 1 follows a modified ranking which calculate a rank for each web page based on this. Algorithm: Modified Ranking Input: Result Set R which contains links from phase 1 Output: Ordered list of Result Set R 1. Start 2. Initialise the score of all links to their domain name count in the result set R 3. for each link Li in result set R 4. Crawl web page corresponding to link Li 5. Find all its outgoing links and store them into list OL 6. for each link Lj in OL 7. if Lj is in R then 8. Increment score of Li by 1 9. Increment score of Lj by 1 10. end if 11. end for 12. end for 13. Order links in R in the decreasing order of their scores 14. return R to result page 15. End The above algorithm calculates the score of the web pages based on their outgoing links. The incoming links of a web page are included in the scoring when outgoing links of another web page is calculated. For example if link to the web page A is included in web page B, then A is the outgoing link of B and B is the incoming link of A. 3.6 Results Generation Alternatively the ordering process is performed and the ranked list is displayed as a single list with most relevant links on the top of the result window. The result will contain the extracted title of the web page, URL, and description. The relevancy of the pages is calculated based on input query and the web pages are ranked using the modified ranking algorithm. The pages after ranking are displayed to the user. The top links are the links with highest rank. When moving from top to bottom the rank of web pages is decreasing. Fig.6 shows the ranked results for the search query „Sachin‟. It gives the details of web pages suggested by each of the four search engines Google, Yahoo, Bing and Ask. The results shows the search results merged from the first result page given by each of the search engines. The Meta Search Engine also supports searches other than web search. Fig.7 shows the image search results, Fig.8 shows the video search results and Fig.9 shows the latest news search results.
  • 7. An Intelligent Meta Search Engine for Efficient Web Document Retrieval DOI: 10.9790/0661-17254554 www.iosrjournals.org 51 | Page Fig.6: Web Search Results Fig.7: Image Search Results
  • 8. An Intelligent Meta Search Engine for Efficient Web Document Retrieval DOI: 10.9790/0661-17254554 www.iosrjournals.org 52 | Page Fig.8: Video Search Results Fig.9: Latest News Search Results
  • 9. An Intelligent Meta Search Engine for Efficient Web Document Retrieval DOI: 10.9790/0661-17254554 www.iosrjournals.org 53 | Page IV. Results The two basic measures for checking the effectiveness of the search engines in document retrieval are precision and recall. Precision: This is the percentage of retrieved documents that are in fact relevant to the query. It is defined as: Precision = Relevant ∩ Retrieved | Retrieved | Recall: This is the percentage of documents that are relevant to the given query and were, in fact, retrieved. It is defined as: Recall= Relevant ∩ Retrieved | Relevant | The total number of relevant documents retrieved by executing ten different queries on the traditional search engines and the new Meta Search Engine, with the average precision value is given in Table 1. It represents the performance of the proposed Meta Search Engine in terms of precision. Table 1: Relevant Documents Retrieved Query Number Search Engines Google Yahoo Ask Bing Meta Search Engine 1 7 5 5 6 13 2 9 7 6 7 15 3 8 7 5 6 16 4 5 5 5 4 12 5 7 6 5 5 15 6 8 6 6 6 11 7 7 6 5 6 14 8 7 7 6 6 16 9 6 6 4 5 14 10 6 5 5 6 12 Average Precision 0.93 0.88 0.83 0.87 0.96 The Fig.10 shows the comparison between the precision of different search engines. From the results we can see that the new Meta Search Engine has a higher precision value when compared to the search engines Google, Yahoo, Ask and Bing. So we conclude that the Meta Search Engine is more efficient in relevant document retrieval and its performance is better than the performance of individual search engines. Fig.10: Precision
  • 10. An Intelligent Meta Search Engine for Efficient Web Document Retrieval DOI: 10.9790/0661-17254554 www.iosrjournals.org 54 | Page V. Advantages The proposed Meta Search Engine has many advantages compared to other search engines. It extends the search coverage of the topic and allows more web pages to be found. It can utilize some of the specific functionality of the search engines that they employ. It searches all the major search engines at once so it reduces the work of users from having to separately type in different search engines to look for resources. It is easy to use and similar format to other search engines. It supports and utilizes the searching syntax of each search engine, including booleans, phrase searching, pluses and minuses, wildcards, field searching, etc. It can consolidate and remove duplicates from the results. It is customizable for user's preferences. It use the indexes created by other search engines, combining and post processing results in unique way. Using this more results can be retrieved with the same effort. Overall it provides improved recall and precision. VI. Conclusion WWW search engines have a number of deficiencies including: periods of downtime, low coverage of the WWW, inconsistent and inefficient user interfaces, out of date databases, poor relevancy ranking and precision, and difficulties with spamming techniques. In this paper a new Meta Search Engine has been introduced which address some of these and other difficulties in searching the WWW. It is understood that Meta Search Engine retrieves efficient results than any Search Engine and its performance also improves factors like recall and precision. It will combine and rank the results in an efficient way using the modified page rank algorithm. It is powerful because one search can highlight strengths of a number of top search engines such as Google, Yahoo, Bing and Ask. References [1]. What Is A Search Engine? Digitallearn.Org a PLA Initiative, http://digitallearn.org/sites/default/files/class/Basic%20Search/supple mental-materials/basicsearchhandout.pdf [2]. Subarna Kumar Das, “Role Of Meta Search Engines In Web- Based Information System: Fundamentals And Challenges”, 4th Convention Planner -2006, Mizoram Univ, Aizawl, 09-10 November, 2006 [3]. D.Minnie, S.Srinivasan, “Meta Search Engine With An Intelligent Interface For Information Retrieval On Multiple Domains”, International Journal of Computer Science, Engineering and Information Technology (IJCSEIT), Vol.1, No.4, October 2011 [4]. Minky Jindal , Nisha Kharb, “K-Means Clustering Technique On Search Engine Dataset Using Data Mining Tool”, International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 6 (2013), pp. 505-510 [5]. Sonali Kushwah, Avdhesh Singh, Indu Gupta, “Analysis on Meta Search Engine” [6]. Trupti V. Udapure, Ravindra D. Kale, Rajesh C. Dharmik, “Study Of Web Crawler And Its Different Types”, IOSR Journal of Computer Engineering (IOSR-JCE) e-Volume 16, Issue 1, Ver. VI (Feb. 2014), PP 01-05 [7]. Google Search Engine Optimization Starter Guide, http://static.googleusercontent.com/media/www.google.com/en//webmasters/doc s/search-engine-optimization-starter-guide.pdf [8]. Kimber ly McCoy , “Search Engines, Subject Directories, And Meta-Search Engines”, Winter 2000 The OLRC News Volume 5, No: 1 [9]. Bassma S.Alsulami, Maysoon F. Abulkhair, Fathy E. Eassa,“Near Duplicate Document Detection Survey”, International Journal of Computer Science & Communication Networks,Vol 2(2), 147-151 [10]. K R Remesh Babu, AP Arya, “Design of a Metacrawler for Web Document Retrieval”, 12th International Conference on Intelligent Systems Design and Applications (ISDA), 27-29 Nov. 2012, pp.478-48