A Survey On Search Engines

Essay Writing Service http://StudyHub.vip/A-Survey-On-Search-Engines 👈

Journal for Research| Volume 02 | Issue 11 | January 2017
ISSN: 2395-7549
All rights reserved by www.journalforresearch.org 5
A Survey on Search Engines
Shiwani Gupta
Assistant Professor
CMPN, TCET, Mumbai, India
Abstract
Web surfing for various purposes has become a habit of humans. Searching for information from the Internet today has been
made easier by the widely available search engines. However, there are many search engines and their number is increasing. It is
of considerable importance for the designer to develop quality search engines and for the users to select the most appropriate
ones for their use. The Information quality linked through these searches is quite irregular. There are fair chances that the
retrieved results are irreverent and belong to an unreliable source. In fact, most search engines are developed mainly for better
technical performance and there could be a lack of quality attributes from the customers’ perspective. In this paper, we first
provide a brief review of the most commonly used search engines, with the focus on existing comparative studies of the search
engines. The paper also includes a survey conducted of 137 respondents where the identified user expectations will be of great
help not only to the designers for improving the search engines, but also to the users for selecting suitable ones. The objective
behind this study was also to find the reason behind poor precision and recall of so many available search engines. The study
finally aims to enhance user search experience.
Keywords: Search Engine, Information Retrieval, World Wide Web
_______________________________________________________________________________________________________
I. INTRODUCTION TO SEARCH ENGINES
Search Engine is a computer program that searches documents on World Wide Web for specified keywords.
Crawlers are computer robots which actually build search engine databases by browsing through the internet/web to find pages
which are potentially capable of containing results as asked for, and are present within these search engine databases. The
drawback of these is that if any page is not linked to any other page via a link, then it’s not possible for the spiders to find it. The
solution to this is, to put that brand new page as a link to already present pages or to add its URL manually for inclusion. This
feature is already incorporated into every major search engine available online.
As soon as these web pages come into contact of any of these crawlers, another computer program, spider is on to its work for
"indexing." The Indexing program is responsible for identifying links, texts, images and other content available in the web page
and storing this page into the search engine database's files. Indexing these pages saves us from searching the whole web for the
exact search keyword thus limit rework and saving time.
Such web pages which are not accessible by search engine spiders are excluded from the searchable databases uploaded on the
web and hence such contents are termed as Invisible Web or Deep Web or Deep Net. Some search engines for Deep Web are
Infomine, Intute, Infoplease, IncyWincy, etc.
Searching on the WWW for articles, theses, books, and abstracts from academic publishers, professional societies, online
repositories, universities and other websites can be a lot easier with a focused search via Academic search engines like
googlescholar, citeseerX, getCITED etc.
The same query on multiple search engines would return different results. Thus there are search engines called as Metasearch
Engines which don’t maintain their own database but forward search terms to a number of different search engines and collate.
Examples of such metasearch engines are DogPile, Metacrawler etc.
The searchers today are quite keen to get quick results without the need to key in the keywords. Hence few Multimedia Search
engines which can search Image, Audio and Video are Radio-locator, Blinkx, Pixsy, Retrievr, Picsearch etc.
We all agree that we live in a Social Connect Era. Wherein, a business owner might want to tap into social search engines like
Wink, iSearch, Pip, SocialSearcher etc. to get key information to customers.
“A progression from ubiquitous keywords to increasingly important entities”, words have become concepts and search engines
have evolved to genuine Learning machines. Thus Semantic Search Engines seek to improve search accuracy by understanding
searcher intent and contextual meaning of terms. To name few Kngine, Swoogle, SenseBot, Cluuz, Evri etc. retrieve results
looking for sense on the web.
Moreover in recent years, search engines have increasingly pushed to monetize their organic search results, automate search
query suggestion, evolve the search interface, place more focus on search personalization and localization and have built more
advanced anti spam penalties and filters.
A Survey on Search Engines
(J4R/ Volume 02 / Issue 11 / 002)
All rights reserved by www.journalforresearch.org 6
II. LITERATURE SURVEY
Web search engines can differ from one another in three ways – crawling reach, frequency of updates, and relevancy analysis.
Therefore, the performance capabilities and limitations of web search engines, and the differences between single and meta-
search engines, is an important and significant research area. There is a critical need for a greater understanding of the
differences in web search engines’ web site indexing and the overlap among results for the same queries. Overall, most web
searchers view only the first or second page of results (Spink and Jansen, 2004).
In [5], the author concludes that the percent of total results retrieved by only one of the three major web search engines
Google, Bing, Yahoo was 85 percent, retrieved by two web search engines was 12 percent, and retrieved by all three web search
engines was 3 percent. This small level of overlap reflects major differences in web search engines retrieval and ranking results.
The findings point to the value of meta-search engines such as Dogpile.com in web retrieval to overcome the biases of single
search engines. Further findings state that first page results returned by the three major web search engines included in this study
are different from one another. Therefore, examining overlap levels for queries on first page results is an important research
issue.
In [2] author concludes that international Search Engines like google.com, altavista.com and yahoo.com received positive
comments in comparison to Greek search engines in terms of working environment, relevancy of data retrieved, satisfaction with
precision, classification of results and quality of results, specially with Google.com.
In [3] author states that China, which has the largest Internet population and potentially the largest Internet market in the
world, Google is far behind the local search market leader Baidu which holds around 70% of the search market while Google has
only around 21%. Similarly in Korea, the local market leader Naver has 62% search market share and Google only has 4%
(Bonfils, 2010; Market The- Globe, 2010; Visual Economics, 2010) because Google’s bare bone design of home page with a
single search box is very popular in Western countries, but in Korea, people like web pages with rich contents so users can find a
lot of information on one page easily.
In [1], the author has studied Google, GoogleChina and Baidu. He asserts that Google outperformes Google China in terms of
retrieval effectiveness in five search features (title search, basic search, exact phrase search, PDF search, and URL search).
Meanwhile Google China surpassed Baidu in terms of retrieval effectiveness in four search features (title search, basic search,
PDF search, and URL search). The regression analysis results show that Google also exceeded Google China and Baidu in terms
of search results ranking quality in all five search features. Google China outperformed Baidu and held the second place in this
category. Google search engines outperformed the Baidu search engine due to factors such as quality of indexing in the
databases, coverage of the databases, quality of webpage contents, quality of results rankings, and so on.
In [6], the author states that none of the search engines received more than 42% recall. Metasearch engines don’t give better
results than individual. Searching several search engines is necessary to achieve complete search but metasearch engines are not
a substitute.
In [4] the author summarizes that engines with multiple language support features were EZ2Find in the metadata search engine
category, Google in the regular search engine category, and Onlinelink in the visualization search engine category. Google
supported 37 different languages. Google’s translation tool was the strongest one of all the search engines. Secondly, the
maximum number of languages a search engine could support in this investigation was 49 and the average number of languages
supported per search engine was about 21.57. In the investigated search engines, the maximum number of language translation
pairs was 18. Only three of the investigated search engines (Google, EZ2Find, and AltaVista) provided users with a translation
feature.
III. EVALUATING SEARCH
The research question is whether search engines retrieve relevant documents or are they ranked high enough to be seen by user.
Following is the bases for statistical comparisons of retrieval effectiveness:
 If engine A retrieves ‘a’ relevant documents in top 20 and there are ‘c’ possible relevant documents, then
 Precision=a/20 and Recall=a/c
 If engine B retrieves ‘b’ relevant documents in top 20 and there are ‘c’ possible relevant documents, then
 Precision=b/20 and Recall=b/c
Therefore
 Precision/Recall=a/b.
Fig. 1: Recall and Precision
A Survey on Search Engines
(J4R/ Volume 02 / Issue 11 / 002)
All rights reserved by www.journalforresearch.org 7
Thus we define Recall as how well a system retrieves all relevant documents whereas Precision as how well the system
retrieves only the relevant documents.
Looking into the second research question, the author has identified the ranking algorithm used into some of the traditional
search engines.
Table - 1
Comparison of Search Engines
Search engine Ranking algorithm
Google inbound links, authority, relevancy etc.
Baidu no. of backlinks pointing to you site
Altavista search term in title, in meta tag, beginning in body, exact phrase, etc.
Naver updates search algorithm every year eg. Libra, sonar.
yahoo keyword in title, click popularity
bing click through rate
yandex links
IV. COMPARISON OF SEARCH ENGINES
The author has used web analytics from [12] to find global ranking of few search engines, the visitor countries and major
keywords searched.
Table - 2
Website Traffic Statistics
Search Engine Traffic rank globally Visitor Countries Search keywords sending traffic
google.com 1 U.S., India, China, Japan, Iran Gmail. google, translate, maps
bing.com 30 U.S., China, Germany, India, U.K. Weather, google, youtube, walmart
Yahoo.com 5 U.S., India, Taiwan, Indonesia, Brazil Yahoo, youtube, google
Baidu.com 4 China, Japan, U.S. Instagram, liknkedin, java, watsapp, google
The author has also compared 10 search engines and found that though Bing is the latest in category but has been able to bag a
position in the market. It was also found that Google and Bing are good at personalization of results but some searchers may
want privacy which is provided by DuckduckGo. The table below enlists few search engines with their year of establishment, the
country where found and the current status.
Table – 3
Comparison of Search Engines
Search Engine Year Of Establishment Country Current Status
AllTheWeb 1999 Norway Aquired by Overture in 2003
Altavista 1995 California Lost to Google and purchased by Yahoo in 2003
Archie 1990 Montreal First Internet SE, project was ceased in late 1990s
AskJeeves 1996 California 6th
largest in U.S., Now known as Ask.com
Baidu 2000 China Largest in China
Bing 2009 Washington 2nd
largest in U.S. with query vol of 20.9%
DuckduckGo 2008 Pennsylvania Emphasizes protecting searcher’s privacy
Google 1998 California largest in U.S. with query vol of 63.9%
Infoseek 1995 California Originally hoped to charge for searching
Inktomi 1996 California Aquired by Yahoo
Lycos 1994 Massachusetts NoheWebw outsources for its search results from AllT
Yahoo 1995 California 3rd
largest in U.S. with query vol of 12.5%, largest in Japan
Yandex 1997 Moscow 4th
largest in Russia
The author knows that the ranking algorithms used also play a major role in retrieving relevant results and hence has analyzed
the ranking algorithm behind major Search Engines, though much has not been talked about in literature. Google uses Inbound
Links, Authority and Relevancy measures mainly alongwith with many others. Bing uses Click Through Rate whereas
V. SURVEY
Users need to understand web search engine capabilities, coverage and limitations. Single web search engines have obvious
strengths and weaknesses. In some circumstances, the uniqueness of a web search engine’s coverage may be useful for engine
users. Our study also has implications for web search engines users. However, this information is not easy for web users to find.
The objective behind survey on search engines was to gather information that:
 Which search engine is most favored one
 Which search engine empowers which nation
 What are liked features of a search engine
 Which feature the respondents admire in some other search engine
 Lastly which feature of the search engine they don’t use
A Survey on Search Engines
(J4R/ Volume 02 / Issue 11 / 002)
All rights reserved by www.journalforresearch.org 8
VI. RESULTS
The survey of 137 respondents of which 135 were Indians shows that the respondents were mainly using Google search engine.
Fig. 2: Survey Result
The respondents have further come up with following good feature listings of Google:
Features of Google liked by users
 Clean UI
 Simplicity
 ImageSearch Pro
 Quick Response Time
 Relevant/efficient results
 Auto complete/correct
 Smart prediction/suggestion
 Search results pertain to search habits of the person querying
 Is able to synchronize information on all devices
 Use of operators for relevant search
 Vernacular support
 Google Earth
 Google Maps
 Google Scholar
They have also come up with some feature listings which they don’t use:
Features of Google not used by users
 Voice search
 Video search
 “I’m feeling lucky”
 Google+
 Results after page 2/3
 Google Wallet
 Special Operators
 Multi Language Support
Lastly the survey stated some features in some search engines which were present in other engines and not in the one they
used:
Table – 4
Comparison of Search Engines’ Features
Search Engine Feature not present/inappropriate Present in which search engine
Google
News Yahoo
Semantically connected results Semantic search engines
Video play on hover Bing
Deep Search Deep web search engines
Privacy Duck duck go
Too many Advertisements Duck duck go
VII.CONCLUSION
The respondents of our survey were mainly Indians who claim that Google is the most sought after search engine platform.
Moreover I can comment that every search engine has its PROs and CONs. There are few search engines, few features of which
the users are not aware. Designers of search engines focusing in a specific country should keep in mind the needs of local natives
A Survey on Search Engines
(J4R/ Volume 02 / Issue 11 / 002)
All rights reserved by www.journalforresearch.org 9
for better acceptability. The survey can be of benefit both to designers of search engines to help them optimize their engine and
to the users to decide which to go for which need. The paper summarizes that there is a lot of work done in the Search Engine
Optimization field in past but there is still lot scope for research to happen.
REFERENCES
[1] J. Zhang, W. Fei, T. Le, “A comparative analysis of the search feature effectiveness of the major English and Chinese search engines”, emraldinsight.com,
Vol. 37 No. 2, 2013 pp. 217-230 DOI 10.1108/OIR-07-2011-0099.
[2] E. Garoufallou, “Evaluating Search Engines- A comarative study between international and greek SE by greek librarians”, emraldinsight.com, 2012, DOI
10.1108/00330331211221837.
[3] W. Zhao, E. Tse, “Competition in search engine market”, Journal of Business Strategies, Vol 28, No. 2, 2011.
[4] J. Zhang, S. Lin, “Multiple Language supports in search engines”, emraldinsight.com, Jan 2007, DOI 10.1108/14684520710780458.
[5] A. Spink, B.J. Jansen, V. Kathuria, S. Koshman ,“Overlap among major web search engines”, Internet Research emraldinsight.com, Vol. 16 No. 4, 2006,
pp. 419-426, DOI 10.1108/10662240610690034.
[6] A.G. Smith, “Think local, search global? Comparing search engines for searching geographically specific information”, Online Information Review; 2003;
27, 2; ABI/INFORM Global pg. 102.
[7] Stephen Butcher, Charles L. A. Clarke, Gordan V. Cormack, “Information Retrieval, Implementing and Evaluating Search Engines”, MIT Press, 2010.
[8] www.googleinsidesearch.com
[9] www.searchengineland.com
[10] www.searchenginewatch.com
[11] www.searchenginejournal.com
[12] www.cse.google.com
[13] www.alexa.com
[14] www.webgranth.com

Recomendados

The beginners guide to SEO por
The beginners guide to SEOThe beginners guide to SEO
The beginners guide to SEOThanh Nguyen
4.8K vistas51 diapositivas
Refining Serp - Search engine result page for Enhanced Information Retrieval por
Refining Serp - Search engine result page for Enhanced Information RetrievalRefining Serp - Search engine result page for Enhanced Information Retrieval
Refining Serp - Search engine result page for Enhanced Information Retrievalijbuiiir1
5 vistas7 diapositivas
Essay On Search Engine por
Essay On Search EngineEssay On Search Engine
Essay On Search EnginePaper Writing Service Superiorpapers
6 vistas22 diapositivas
SEOMoz The Beginners Guide To SEO por
SEOMoz The Beginners Guide To SEOSEOMoz The Beginners Guide To SEO
SEOMoz The Beginners Guide To SEOFlutterbyBarb
8K vistas51 diapositivas
A Comparative Study Of The Results Produced By Search Engines por
A Comparative Study Of The Results Produced By Search EnginesA Comparative Study Of The Results Produced By Search Engines
A Comparative Study Of The Results Produced By Search EnginesJoaquin Hamad
3 vistas4 diapositivas
Search Engines Other than Google por
Search Engines Other than GoogleSearch Engines Other than Google
Search Engines Other than GoogleDr Trivedi
172 vistas72 diapositivas

Más contenido relacionado

Similar a A Survey On Search Engines

Search engine and web crawler por
Search engine and web crawlerSearch engine and web crawler
Search engine and web crawlerishmecse13
3.5K vistas21 diapositivas
Search engine optimization (seo) overview por
Search engine optimization (seo) overviewSearch engine optimization (seo) overview
Search engine optimization (seo) overviewArpan Jain
1.4K vistas28 diapositivas
Essay On Search Engines por
Essay On Search EnginesEssay On Search Engines
Essay On Search EnginesWho Can Write My Paper For Me UK
25 vistas24 diapositivas
Search Engine Optimization Essay por
Search Engine Optimization EssaySearch Engine Optimization Essay
Search Engine Optimization EssayBuy School Papers Online UK
9 vistas23 diapositivas
Quest Trail: An Effective Approach for Construction of Personalized Search En... por
Quest Trail: An Effective Approach for Construction of Personalized Search En...Quest Trail: An Effective Approach for Construction of Personalized Search En...
Quest Trail: An Effective Approach for Construction of Personalized Search En...Editor IJCATR
302 vistas6 diapositivas
132-ArticleText-800-1-10-20210331 (1).pdf por
132-ArticleText-800-1-10-20210331 (1).pdf132-ArticleText-800-1-10-20210331 (1).pdf
132-ArticleText-800-1-10-20210331 (1).pdfvarshasatpute6
10 vistas12 diapositivas

Similar a A Survey On Search Engines(20)

Search engine and web crawler por ishmecse13
Search engine and web crawlerSearch engine and web crawler
Search engine and web crawler
ishmecse133.5K vistas
Search engine optimization (seo) overview por Arpan Jain
Search engine optimization (seo) overviewSearch engine optimization (seo) overview
Search engine optimization (seo) overview
Arpan Jain1.4K vistas
Quest Trail: An Effective Approach for Construction of Personalized Search En... por Editor IJCATR
Quest Trail: An Effective Approach for Construction of Personalized Search En...Quest Trail: An Effective Approach for Construction of Personalized Search En...
Quest Trail: An Effective Approach for Construction of Personalized Search En...
Editor IJCATR302 vistas
132-ArticleText-800-1-10-20210331 (1).pdf por varshasatpute6
132-ArticleText-800-1-10-20210331 (1).pdf132-ArticleText-800-1-10-20210331 (1).pdf
132-ArticleText-800-1-10-20210331 (1).pdf
varshasatpute610 vistas
How search engine works por Ashraf Ali
How search engine worksHow search engine works
How search engine works
Ashraf Ali181 vistas
Semantic Search Engine using Ontologies por IJRES Journal
Semantic Search Engine using OntologiesSemantic Search Engine using Ontologies
Semantic Search Engine using Ontologies
IJRES Journal229 vistas
How google works and functions: A complete Approach por Prakhar Gethe
How google works and functions: A complete ApproachHow google works and functions: A complete Approach
How google works and functions: A complete Approach
Prakhar Gethe1.6K vistas
An Intelligent Meta Search Engine for Efficient Web Document Retrieval por iosrjce
An Intelligent Meta Search Engine for Efficient Web Document RetrievalAn Intelligent Meta Search Engine for Efficient Web Document Retrieval
An Intelligent Meta Search Engine for Efficient Web Document Retrieval
iosrjce194 vistas
PageRank algorithm and its variations: A Survey report por IOSR Journals
PageRank algorithm and its variations: A Survey reportPageRank algorithm and its variations: A Survey report
PageRank algorithm and its variations: A Survey report
IOSR Journals675 vistas
A Study on SEO Monitoring System Based on Corporate Website Development por ijcseit
A Study on SEO Monitoring System Based on Corporate Website DevelopmentA Study on SEO Monitoring System Based on Corporate Website Development
A Study on SEO Monitoring System Based on Corporate Website Development
ijcseit24 vistas
Search engine por Wasif Khan
Search engineSearch engine
Search engine
Wasif Khan116 vistas
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ... por inventionjournals
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
inventionjournals36 vistas
Search Marketing por Shankar Soma
Search MarketingSearch Marketing
Search Marketing
Shankar Soma2.1K vistas
How Google is Reading and Indexing Content in 2016 por Greenlane
How Google is Reading and Indexing Content in 2016How Google is Reading and Indexing Content in 2016
How Google is Reading and Indexing Content in 2016
Greenlane213 vistas

Más de Andrew Parish

A Novel Latent Factor Model For Recommender System por
A Novel Latent Factor Model For Recommender SystemA Novel Latent Factor Model For Recommender System
A Novel Latent Factor Model For Recommender SystemAndrew Parish
3 vistas18 diapositivas
Assessment Of Local Government Budget Allocation In Public Sectors In The Cas... por
Assessment Of Local Government Budget Allocation In Public Sectors In The Cas...Assessment Of Local Government Budget Allocation In Public Sectors In The Cas...
Assessment Of Local Government Budget Allocation In Public Sectors In The Cas...Andrew Parish
4 vistas81 diapositivas
A Comparison Of Generalizability Theory And Many-Facet Rasch Measurement In A... por
A Comparison Of Generalizability Theory And Many-Facet Rasch Measurement In A...A Comparison Of Generalizability Theory And Many-Facet Rasch Measurement In A...
A Comparison Of Generalizability Theory And Many-Facet Rasch Measurement In A...Andrew Parish
2 vistas23 diapositivas
Aphrodite The Goddess Of Appearances por
Aphrodite The Goddess Of AppearancesAphrodite The Goddess Of Appearances
Aphrodite The Goddess Of AppearancesAndrew Parish
2 vistas61 diapositivas
Accessible Tourism And Sustainability A Discussion And Business Case Study por
Accessible Tourism And Sustainability  A Discussion And Business Case StudyAccessible Tourism And Sustainability  A Discussion And Business Case Study
Accessible Tourism And Sustainability A Discussion And Business Case StudyAndrew Parish
6 vistas39 diapositivas
AN EMPIRICAL ANALYSIS OF THE INVENTORY MANAGEMENT IN AUTO PARTS INDUSTRIES C... por
AN EMPIRICAL ANALYSIS OF THE INVENTORY MANAGEMENT IN AUTO PARTS INDUSTRIES  C...AN EMPIRICAL ANALYSIS OF THE INVENTORY MANAGEMENT IN AUTO PARTS INDUSTRIES  C...
AN EMPIRICAL ANALYSIS OF THE INVENTORY MANAGEMENT IN AUTO PARTS INDUSTRIES C...Andrew Parish
2 vistas10 diapositivas

Más de Andrew Parish(20)

A Novel Latent Factor Model For Recommender System por Andrew Parish
A Novel Latent Factor Model For Recommender SystemA Novel Latent Factor Model For Recommender System
A Novel Latent Factor Model For Recommender System
Andrew Parish3 vistas
Assessment Of Local Government Budget Allocation In Public Sectors In The Cas... por Andrew Parish
Assessment Of Local Government Budget Allocation In Public Sectors In The Cas...Assessment Of Local Government Budget Allocation In Public Sectors In The Cas...
Assessment Of Local Government Budget Allocation In Public Sectors In The Cas...
Andrew Parish4 vistas
A Comparison Of Generalizability Theory And Many-Facet Rasch Measurement In A... por Andrew Parish
A Comparison Of Generalizability Theory And Many-Facet Rasch Measurement In A...A Comparison Of Generalizability Theory And Many-Facet Rasch Measurement In A...
A Comparison Of Generalizability Theory And Many-Facet Rasch Measurement In A...
Andrew Parish2 vistas
Aphrodite The Goddess Of Appearances por Andrew Parish
Aphrodite The Goddess Of AppearancesAphrodite The Goddess Of Appearances
Aphrodite The Goddess Of Appearances
Andrew Parish2 vistas
Accessible Tourism And Sustainability A Discussion And Business Case Study por Andrew Parish
Accessible Tourism And Sustainability  A Discussion And Business Case StudyAccessible Tourism And Sustainability  A Discussion And Business Case Study
Accessible Tourism And Sustainability A Discussion And Business Case Study
Andrew Parish6 vistas
AN EMPIRICAL ANALYSIS OF THE INVENTORY MANAGEMENT IN AUTO PARTS INDUSTRIES C... por Andrew Parish
AN EMPIRICAL ANALYSIS OF THE INVENTORY MANAGEMENT IN AUTO PARTS INDUSTRIES  C...AN EMPIRICAL ANALYSIS OF THE INVENTORY MANAGEMENT IN AUTO PARTS INDUSTRIES  C...
AN EMPIRICAL ANALYSIS OF THE INVENTORY MANAGEMENT IN AUTO PARTS INDUSTRIES C...
Andrew Parish2 vistas
A Framework For Unifying Problem-Solving Knowledge And Workflow Modelling por Andrew Parish
A Framework For Unifying Problem-Solving Knowledge And Workflow ModellingA Framework For Unifying Problem-Solving Knowledge And Workflow Modelling
A Framework For Unifying Problem-Solving Knowledge And Workflow Modelling
Andrew Parish2 vistas
4. (Re)Introducing Formalities In Copyright As A Strategy For The Public Domain por Andrew Parish
4. (Re)Introducing Formalities In Copyright As A Strategy For The Public Domain4. (Re)Introducing Formalities In Copyright As A Strategy For The Public Domain
4. (Re)Introducing Formalities In Copyright As A Strategy For The Public Domain
Andrew Parish3 vistas
Assignments Three And Four Object-Oriented Software Design And Implementatio... por Andrew Parish
Assignments Three And Four  Object-Oriented Software Design And Implementatio...Assignments Three And Four  Object-Oriented Software Design And Implementatio...
Assignments Three And Four Object-Oriented Software Design And Implementatio...
Andrew Parish2 vistas
40 The Long Term Effects Of A 3-Day First Aid Programme For 7 14 Years Old Pr... por Andrew Parish
40 The Long Term Effects Of A 3-Day First Aid Programme For 7 14 Years Old Pr...40 The Long Term Effects Of A 3-Day First Aid Programme For 7 14 Years Old Pr...
40 The Long Term Effects Of A 3-Day First Aid Programme For 7 14 Years Old Pr...
Andrew Parish2 vistas
A Bibliographic Essay On Reference Sources On International And Federal Law R... por Andrew Parish
A Bibliographic Essay On Reference Sources On International And Federal Law R...A Bibliographic Essay On Reference Sources On International And Federal Law R...
A Bibliographic Essay On Reference Sources On International And Federal Law R...
Andrew Parish2 vistas
Analysis Of Students Epistemological Obstacles On The Subject Of Pythagorean... por Andrew Parish
Analysis Of Students  Epistemological Obstacles On The Subject Of Pythagorean...Analysis Of Students  Epistemological Obstacles On The Subject Of Pythagorean...
Analysis Of Students Epistemological Obstacles On The Subject Of Pythagorean...
Andrew Parish7 vistas
Analyzing Sentiment Of Movie Reviews In Bangla By Applying Machine Learning T... por Andrew Parish
Analyzing Sentiment Of Movie Reviews In Bangla By Applying Machine Learning T...Analyzing Sentiment Of Movie Reviews In Bangla By Applying Machine Learning T...
Analyzing Sentiment Of Movie Reviews In Bangla By Applying Machine Learning T...
Andrew Parish6 vistas
Academic Stress And Coping Mechanism Of Students In Flexible Learning por Andrew Parish
Academic Stress And Coping Mechanism Of Students In Flexible LearningAcademic Stress And Coping Mechanism Of Students In Flexible Learning
Academic Stress And Coping Mechanism Of Students In Flexible Learning
Andrew Parish48 vistas
An E-Business Integration And Collaboration Platform For B2B E-Commerce por Andrew Parish
An E-Business Integration And Collaboration Platform For B2B E-CommerceAn E-Business Integration And Collaboration Platform For B2B E-Commerce
An E-Business Integration And Collaboration Platform For B2B E-Commerce
Andrew Parish3 vistas
Assessing Real Estate Volatility por Andrew Parish
Assessing Real Estate VolatilityAssessing Real Estate Volatility
Assessing Real Estate Volatility
Andrew Parish6 vistas
Andre Bazin - What Is Cinema Vol. por Andrew Parish
Andre Bazin - What Is Cinema Vol.Andre Bazin - What Is Cinema Vol.
Andre Bazin - What Is Cinema Vol.
Andrew Parish15 vistas
Alchemy Orackle Essay Lendvai.Pdf por Andrew Parish
Alchemy Orackle Essay Lendvai.PdfAlchemy Orackle Essay Lendvai.Pdf
Alchemy Orackle Essay Lendvai.Pdf
Andrew Parish4 vistas
Analysis Of The Grocery Industry Coles Supermarkets por Andrew Parish
Analysis Of The Grocery Industry Coles SupermarketsAnalysis Of The Grocery Industry Coles Supermarkets
Analysis Of The Grocery Industry Coles Supermarkets
Andrew Parish47 vistas
2. Intermediate Research Skills Workshop 2 Argumentative Writing (2016) por Andrew Parish
2. Intermediate Research Skills Workshop 2  Argumentative Writing (2016)2. Intermediate Research Skills Workshop 2  Argumentative Writing (2016)
2. Intermediate Research Skills Workshop 2 Argumentative Writing (2016)
Andrew Parish2 vistas

Último

Solar System and Galaxies.pptx por
Solar System and Galaxies.pptxSolar System and Galaxies.pptx
Solar System and Galaxies.pptxDrHafizKosar
94 vistas26 diapositivas
Use of Probiotics in Aquaculture.pptx por
Use of Probiotics in Aquaculture.pptxUse of Probiotics in Aquaculture.pptx
Use of Probiotics in Aquaculture.pptxAKSHAY MANDAL
104 vistas15 diapositivas
The Accursed House by Émile Gaboriau por
The Accursed House  by Émile GaboriauThe Accursed House  by Émile Gaboriau
The Accursed House by Émile GaboriauDivyaSheta
212 vistas15 diapositivas
Thanksgiving!.pdf por
Thanksgiving!.pdfThanksgiving!.pdf
Thanksgiving!.pdfEnglishCEIPdeSigeiro
137 vistas17 diapositivas
Class 9 lesson plans por
Class 9 lesson plansClass 9 lesson plans
Class 9 lesson plansTARIQ KHAN
47 vistas34 diapositivas
Structure and Functions of Cell.pdf por
Structure and Functions of Cell.pdfStructure and Functions of Cell.pdf
Structure and Functions of Cell.pdfNithya Murugan
701 vistas10 diapositivas

Último(20)

Solar System and Galaxies.pptx por DrHafizKosar
Solar System and Galaxies.pptxSolar System and Galaxies.pptx
Solar System and Galaxies.pptx
DrHafizKosar94 vistas
Use of Probiotics in Aquaculture.pptx por AKSHAY MANDAL
Use of Probiotics in Aquaculture.pptxUse of Probiotics in Aquaculture.pptx
Use of Probiotics in Aquaculture.pptx
AKSHAY MANDAL104 vistas
The Accursed House by Émile Gaboriau por DivyaSheta
The Accursed House  by Émile GaboriauThe Accursed House  by Émile Gaboriau
The Accursed House by Émile Gaboriau
DivyaSheta212 vistas
Class 9 lesson plans por TARIQ KHAN
Class 9 lesson plansClass 9 lesson plans
Class 9 lesson plans
TARIQ KHAN47 vistas
Structure and Functions of Cell.pdf por Nithya Murugan
Structure and Functions of Cell.pdfStructure and Functions of Cell.pdf
Structure and Functions of Cell.pdf
Nithya Murugan701 vistas
REPRESENTATION - GAUNTLET.pptx por iammrhaywood
REPRESENTATION - GAUNTLET.pptxREPRESENTATION - GAUNTLET.pptx
REPRESENTATION - GAUNTLET.pptx
iammrhaywood107 vistas
Narration lesson plan por TARIQ KHAN
Narration lesson planNarration lesson plan
Narration lesson plan
TARIQ KHAN59 vistas
Pharmaceutical Inorganic Chemistry Unit IVMiscellaneous compounds Expectorant... por Ms. Pooja Bhandare
Pharmaceutical Inorganic Chemistry Unit IVMiscellaneous compounds Expectorant...Pharmaceutical Inorganic Chemistry Unit IVMiscellaneous compounds Expectorant...
Pharmaceutical Inorganic Chemistry Unit IVMiscellaneous compounds Expectorant...
Ms. Pooja Bhandare109 vistas
Drama KS5 Breakdown por WestHatch
Drama KS5 BreakdownDrama KS5 Breakdown
Drama KS5 Breakdown
WestHatch87 vistas
11.28.23 Social Capital and Social Exclusion.pptx por mary850239
11.28.23 Social Capital and Social Exclusion.pptx11.28.23 Social Capital and Social Exclusion.pptx
11.28.23 Social Capital and Social Exclusion.pptx
mary850239304 vistas
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB... por Nguyen Thanh Tu Collection
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...
CUNY IT Picciano.pptx por apicciano
CUNY IT Picciano.pptxCUNY IT Picciano.pptx
CUNY IT Picciano.pptx
apicciano54 vistas

A Survey On Search Engines

  • 1. Journal for Research| Volume 02 | Issue 11 | January 2017 ISSN: 2395-7549 All rights reserved by www.journalforresearch.org 5 A Survey on Search Engines Shiwani Gupta Assistant Professor CMPN, TCET, Mumbai, India Abstract Web surfing for various purposes has become a habit of humans. Searching for information from the Internet today has been made easier by the widely available search engines. However, there are many search engines and their number is increasing. It is of considerable importance for the designer to develop quality search engines and for the users to select the most appropriate ones for their use. The Information quality linked through these searches is quite irregular. There are fair chances that the retrieved results are irreverent and belong to an unreliable source. In fact, most search engines are developed mainly for better technical performance and there could be a lack of quality attributes from the customers’ perspective. In this paper, we first provide a brief review of the most commonly used search engines, with the focus on existing comparative studies of the search engines. The paper also includes a survey conducted of 137 respondents where the identified user expectations will be of great help not only to the designers for improving the search engines, but also to the users for selecting suitable ones. The objective behind this study was also to find the reason behind poor precision and recall of so many available search engines. The study finally aims to enhance user search experience. Keywords: Search Engine, Information Retrieval, World Wide Web _______________________________________________________________________________________________________ I. INTRODUCTION TO SEARCH ENGINES Search Engine is a computer program that searches documents on World Wide Web for specified keywords. Crawlers are computer robots which actually build search engine databases by browsing through the internet/web to find pages which are potentially capable of containing results as asked for, and are present within these search engine databases. The drawback of these is that if any page is not linked to any other page via a link, then it’s not possible for the spiders to find it. The solution to this is, to put that brand new page as a link to already present pages or to add its URL manually for inclusion. This feature is already incorporated into every major search engine available online. As soon as these web pages come into contact of any of these crawlers, another computer program, spider is on to its work for "indexing." The Indexing program is responsible for identifying links, texts, images and other content available in the web page and storing this page into the search engine database's files. Indexing these pages saves us from searching the whole web for the exact search keyword thus limit rework and saving time. Such web pages which are not accessible by search engine spiders are excluded from the searchable databases uploaded on the web and hence such contents are termed as Invisible Web or Deep Web or Deep Net. Some search engines for Deep Web are Infomine, Intute, Infoplease, IncyWincy, etc. Searching on the WWW for articles, theses, books, and abstracts from academic publishers, professional societies, online repositories, universities and other websites can be a lot easier with a focused search via Academic search engines like googlescholar, citeseerX, getCITED etc. The same query on multiple search engines would return different results. Thus there are search engines called as Metasearch Engines which don’t maintain their own database but forward search terms to a number of different search engines and collate. Examples of such metasearch engines are DogPile, Metacrawler etc. The searchers today are quite keen to get quick results without the need to key in the keywords. Hence few Multimedia Search engines which can search Image, Audio and Video are Radio-locator, Blinkx, Pixsy, Retrievr, Picsearch etc. We all agree that we live in a Social Connect Era. Wherein, a business owner might want to tap into social search engines like Wink, iSearch, Pip, SocialSearcher etc. to get key information to customers. “A progression from ubiquitous keywords to increasingly important entities”, words have become concepts and search engines have evolved to genuine Learning machines. Thus Semantic Search Engines seek to improve search accuracy by understanding searcher intent and contextual meaning of terms. To name few Kngine, Swoogle, SenseBot, Cluuz, Evri etc. retrieve results looking for sense on the web. Moreover in recent years, search engines have increasingly pushed to monetize their organic search results, automate search query suggestion, evolve the search interface, place more focus on search personalization and localization and have built more advanced anti spam penalties and filters.
  • 2. A Survey on Search Engines (J4R/ Volume 02 / Issue 11 / 002) All rights reserved by www.journalforresearch.org 6 II. LITERATURE SURVEY Web search engines can differ from one another in three ways – crawling reach, frequency of updates, and relevancy analysis. Therefore, the performance capabilities and limitations of web search engines, and the differences between single and meta- search engines, is an important and significant research area. There is a critical need for a greater understanding of the differences in web search engines’ web site indexing and the overlap among results for the same queries. Overall, most web searchers view only the first or second page of results (Spink and Jansen, 2004). In [5], the author concludes that the percent of total results retrieved by only one of the three major web search engines Google, Bing, Yahoo was 85 percent, retrieved by two web search engines was 12 percent, and retrieved by all three web search engines was 3 percent. This small level of overlap reflects major differences in web search engines retrieval and ranking results. The findings point to the value of meta-search engines such as Dogpile.com in web retrieval to overcome the biases of single search engines. Further findings state that first page results returned by the three major web search engines included in this study are different from one another. Therefore, examining overlap levels for queries on first page results is an important research issue. In [2] author concludes that international Search Engines like google.com, altavista.com and yahoo.com received positive comments in comparison to Greek search engines in terms of working environment, relevancy of data retrieved, satisfaction with precision, classification of results and quality of results, specially with Google.com. In [3] author states that China, which has the largest Internet population and potentially the largest Internet market in the world, Google is far behind the local search market leader Baidu which holds around 70% of the search market while Google has only around 21%. Similarly in Korea, the local market leader Naver has 62% search market share and Google only has 4% (Bonfils, 2010; Market The- Globe, 2010; Visual Economics, 2010) because Google’s bare bone design of home page with a single search box is very popular in Western countries, but in Korea, people like web pages with rich contents so users can find a lot of information on one page easily. In [1], the author has studied Google, GoogleChina and Baidu. He asserts that Google outperformes Google China in terms of retrieval effectiveness in five search features (title search, basic search, exact phrase search, PDF search, and URL search). Meanwhile Google China surpassed Baidu in terms of retrieval effectiveness in four search features (title search, basic search, PDF search, and URL search). The regression analysis results show that Google also exceeded Google China and Baidu in terms of search results ranking quality in all five search features. Google China outperformed Baidu and held the second place in this category. Google search engines outperformed the Baidu search engine due to factors such as quality of indexing in the databases, coverage of the databases, quality of webpage contents, quality of results rankings, and so on. In [6], the author states that none of the search engines received more than 42% recall. Metasearch engines don’t give better results than individual. Searching several search engines is necessary to achieve complete search but metasearch engines are not a substitute. In [4] the author summarizes that engines with multiple language support features were EZ2Find in the metadata search engine category, Google in the regular search engine category, and Onlinelink in the visualization search engine category. Google supported 37 different languages. Google’s translation tool was the strongest one of all the search engines. Secondly, the maximum number of languages a search engine could support in this investigation was 49 and the average number of languages supported per search engine was about 21.57. In the investigated search engines, the maximum number of language translation pairs was 18. Only three of the investigated search engines (Google, EZ2Find, and AltaVista) provided users with a translation feature. III. EVALUATING SEARCH The research question is whether search engines retrieve relevant documents or are they ranked high enough to be seen by user. Following is the bases for statistical comparisons of retrieval effectiveness:  If engine A retrieves ‘a’ relevant documents in top 20 and there are ‘c’ possible relevant documents, then  Precision=a/20 and Recall=a/c  If engine B retrieves ‘b’ relevant documents in top 20 and there are ‘c’ possible relevant documents, then  Precision=b/20 and Recall=b/c Therefore  Precision/Recall=a/b. Fig. 1: Recall and Precision
  • 3. A Survey on Search Engines (J4R/ Volume 02 / Issue 11 / 002) All rights reserved by www.journalforresearch.org 7 Thus we define Recall as how well a system retrieves all relevant documents whereas Precision as how well the system retrieves only the relevant documents. Looking into the second research question, the author has identified the ranking algorithm used into some of the traditional search engines. Table - 1 Comparison of Search Engines Search engine Ranking algorithm Google inbound links, authority, relevancy etc. Baidu no. of backlinks pointing to you site Altavista search term in title, in meta tag, beginning in body, exact phrase, etc. Naver updates search algorithm every year eg. Libra, sonar. yahoo keyword in title, click popularity bing click through rate yandex links IV. COMPARISON OF SEARCH ENGINES The author has used web analytics from [12] to find global ranking of few search engines, the visitor countries and major keywords searched. Table - 2 Website Traffic Statistics Search Engine Traffic rank globally Visitor Countries Search keywords sending traffic google.com 1 U.S., India, China, Japan, Iran Gmail. google, translate, maps bing.com 30 U.S., China, Germany, India, U.K. Weather, google, youtube, walmart Yahoo.com 5 U.S., India, Taiwan, Indonesia, Brazil Yahoo, youtube, google Baidu.com 4 China, Japan, U.S. Instagram, liknkedin, java, watsapp, google The author has also compared 10 search engines and found that though Bing is the latest in category but has been able to bag a position in the market. It was also found that Google and Bing are good at personalization of results but some searchers may want privacy which is provided by DuckduckGo. The table below enlists few search engines with their year of establishment, the country where found and the current status. Table – 3 Comparison of Search Engines Search Engine Year Of Establishment Country Current Status AllTheWeb 1999 Norway Aquired by Overture in 2003 Altavista 1995 California Lost to Google and purchased by Yahoo in 2003 Archie 1990 Montreal First Internet SE, project was ceased in late 1990s AskJeeves 1996 California 6th largest in U.S., Now known as Ask.com Baidu 2000 China Largest in China Bing 2009 Washington 2nd largest in U.S. with query vol of 20.9% DuckduckGo 2008 Pennsylvania Emphasizes protecting searcher’s privacy Google 1998 California largest in U.S. with query vol of 63.9% Infoseek 1995 California Originally hoped to charge for searching Inktomi 1996 California Aquired by Yahoo Lycos 1994 Massachusetts NoheWebw outsources for its search results from AllT Yahoo 1995 California 3rd largest in U.S. with query vol of 12.5%, largest in Japan Yandex 1997 Moscow 4th largest in Russia The author knows that the ranking algorithms used also play a major role in retrieving relevant results and hence has analyzed the ranking algorithm behind major Search Engines, though much has not been talked about in literature. Google uses Inbound Links, Authority and Relevancy measures mainly alongwith with many others. Bing uses Click Through Rate whereas V. SURVEY Users need to understand web search engine capabilities, coverage and limitations. Single web search engines have obvious strengths and weaknesses. In some circumstances, the uniqueness of a web search engine’s coverage may be useful for engine users. Our study also has implications for web search engines users. However, this information is not easy for web users to find. The objective behind survey on search engines was to gather information that:  Which search engine is most favored one  Which search engine empowers which nation  What are liked features of a search engine  Which feature the respondents admire in some other search engine  Lastly which feature of the search engine they don’t use
  • 4. A Survey on Search Engines (J4R/ Volume 02 / Issue 11 / 002) All rights reserved by www.journalforresearch.org 8 VI. RESULTS The survey of 137 respondents of which 135 were Indians shows that the respondents were mainly using Google search engine. Fig. 2: Survey Result The respondents have further come up with following good feature listings of Google: Features of Google liked by users  Clean UI  Simplicity  ImageSearch Pro  Quick Response Time  Relevant/efficient results  Auto complete/correct  Smart prediction/suggestion  Search results pertain to search habits of the person querying  Is able to synchronize information on all devices  Use of operators for relevant search  Vernacular support  Google Earth  Google Maps  Google Scholar They have also come up with some feature listings which they don’t use: Features of Google not used by users  Voice search  Video search  “I’m feeling lucky”  Google+  Results after page 2/3  Google Wallet  Special Operators  Multi Language Support Lastly the survey stated some features in some search engines which were present in other engines and not in the one they used: Table – 4 Comparison of Search Engines’ Features Search Engine Feature not present/inappropriate Present in which search engine Google News Yahoo Semantically connected results Semantic search engines Video play on hover Bing Deep Search Deep web search engines Privacy Duck duck go Too many Advertisements Duck duck go VII.CONCLUSION The respondents of our survey were mainly Indians who claim that Google is the most sought after search engine platform. Moreover I can comment that every search engine has its PROs and CONs. There are few search engines, few features of which the users are not aware. Designers of search engines focusing in a specific country should keep in mind the needs of local natives
  • 5. A Survey on Search Engines (J4R/ Volume 02 / Issue 11 / 002) All rights reserved by www.journalforresearch.org 9 for better acceptability. The survey can be of benefit both to designers of search engines to help them optimize their engine and to the users to decide which to go for which need. The paper summarizes that there is a lot of work done in the Search Engine Optimization field in past but there is still lot scope for research to happen. REFERENCES [1] J. Zhang, W. Fei, T. Le, “A comparative analysis of the search feature effectiveness of the major English and Chinese search engines”, emraldinsight.com, Vol. 37 No. 2, 2013 pp. 217-230 DOI 10.1108/OIR-07-2011-0099. [2] E. Garoufallou, “Evaluating Search Engines- A comarative study between international and greek SE by greek librarians”, emraldinsight.com, 2012, DOI 10.1108/00330331211221837. [3] W. Zhao, E. Tse, “Competition in search engine market”, Journal of Business Strategies, Vol 28, No. 2, 2011. [4] J. Zhang, S. Lin, “Multiple Language supports in search engines”, emraldinsight.com, Jan 2007, DOI 10.1108/14684520710780458. [5] A. Spink, B.J. Jansen, V. Kathuria, S. Koshman ,“Overlap among major web search engines”, Internet Research emraldinsight.com, Vol. 16 No. 4, 2006, pp. 419-426, DOI 10.1108/10662240610690034. [6] A.G. Smith, “Think local, search global? Comparing search engines for searching geographically specific information”, Online Information Review; 2003; 27, 2; ABI/INFORM Global pg. 102. [7] Stephen Butcher, Charles L. A. Clarke, Gordan V. Cormack, “Information Retrieval, Implementing and Evaluating Search Engines”, MIT Press, 2010. [8] www.googleinsidesearch.com [9] www.searchengineland.com [10] www.searchenginewatch.com [11] www.searchenginejournal.com [12] www.cse.google.com [13] www.alexa.com [14] www.webgranth.com