The overlap and coverage of 4 local search engines: Parsijoo, Yooz, Parseek and Rismoun

Authors
Ferdowsi University of Mashhad
Abstract
Background and Aim: This study aimed to measure the overlap among four local Persian search engines (Parsijoo, Yooz, Parseek, and Rismoun) and to compare how well each of them covers the indexable web.

Methods: This was an applied, evaluative study. Data were collected with a keyword-based method: the selected keywords were entered into the search engines, and a sample was drawn from the retrieved records. The necessary data were then gathered by checking whether each sampled record was present or absent in each of the search engines. Inferential statistical methods were used to analyze the data.
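
The presence/absence procedure described above can be illustrated with a minimal sketch. The abstract does not state the formulas used, so the definitions here are assumptions: the relative overlap of engine A with respect to engine B is taken as the share of A's sampled records that B also holds, and coverage is taken as the share of the pooled sample that an engine holds. The record identifiers are made-up toy data, not the study's records, and simple set membership stands in for checking each record against each engine.

```python
from itertools import permutations


def relative_overlap(a: set, b: set) -> float:
    """Assumed definition: share of A's sampled records that B also holds."""
    return len(a & b) / len(a) if a else 0.0


def coverage(engine_records: set, pool: set) -> float:
    """Assumed definition: share of the pooled sample held by one engine."""
    return len(engine_records & pool) / len(pool) if pool else 0.0


# Hypothetical toy data: record identifiers found in each engine.
results = {
    "Parseek": {"u1", "u2", "u3", "u4"},
    "Parsijoo": {"u2", "u3", "u5"},
    "Yooz": {"u3", "u6"},
    "Rismoun": {"u7"},
}

# The pooled sample is every distinct record found in any engine.
pool = set().union(*results.values())

# Relative overlap is directional, so compute it for every ordered pair.
overlaps = {
    (a, b): relative_overlap(results[a], results[b])
    for a, b in permutations(results, 2)
}

# Coverage of the pooled sample, per engine.
coverages = {name: coverage(recs, pool) for name, recs in results.items()}

# Records common to three engines at once.
common_three = results["Parseek"] & results["Parsijoo"] & results["Yooz"]

if __name__ == "__main__":
    for pair, value in sorted(overlaps.items()):
        print(pair, round(value, 2))
    for name, value in coverages.items():
        print(name, round(value, 2))
    print("common to three engines:", sorted(common_three))
```

Because relative overlap under this definition is directional, the value for Parseek with respect to Parsijoo generally differs from the value for Parsijoo with respect to Parseek, which is why each ordered pair is computed separately.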

Results: The relative overlap of Parseek with Parsijoo, and of Parsijoo with Yooz, was 26 percent on average, and Parseek had the highest recall. Rismoun had no records in common with the other search engines investigated. Three search engines (Parseek, Parsijoo, and Yooz) retrieved 27 common records out of 225 retrieved records, and there was a significant difference between the relative overlap of the four search engines. On average, Parseek, Parsijoo, Yooz, and Rismoun covered 38, 31, 26, and 6 percent of the indexable web, respectively. There was a significant difference between the coverage of the four search engines.

Conclusion: Each search engine appears to follow a different indexing policy, so users need to search more than one search engine to obtain comprehensive information on a topic. It can be predicted that searching two search engines, Parseek and Parsijoo, gives access to about 70 percent of the indexable web.
