Please use this identifier to cite or link to this item: http://hdl.handle.net/11189/1455
Title: Empirical Study on Crawler Visibility of PDF Documents in Digital Libraries
Authors: Weideman, Melius 
Keywords: Digital libraries;Search engine
Issue Date: Jul-2010
Publisher: Institute of Electrical and Electronics Engineers, Inc.
Abstract: Digital library users might not enter a digital library through homepage menus. As a result, digital library owners should consider the visibility to search engines of stored PDF documents. The aim of this research project was to determine to what extent the visibility of these PDF documents can be improved. In a series of empirical experiments, 100 PDF documents stored on digital libraries were identified an inspected. Searches were done for them and rankings on search engine result pages recorded. The current visibility of these documents was then calculated. After submission to Google, a waiting period was allowed for crawler visitation and the searches repeated. The results of these experiments proved that the visibility of these documents could be improved only marginally. It is therefore concluded that the designers of university digital libraries should consider other alternatives, such as providing text extracts of PDF documents, to enhance the overall visibility of content. Keywords-digital
Description: Proceedings 2010 3rd IEEE International Conference on Computer Science and Information Technology July 710, 2010 Chengdu, China
URI: http://hdl.handle.net/11189/1455
ISSN: 978-1-4244-5538-6
Appears in Collections:WARC Publications
Prof. Melius Weideman

Files in This Item:
File Description SizeFormat 
fulltext.pdfConference paper871.49 kBAdobe PDFView/Open
Show full item record

Page view(s)

15
Last Week
0
Last month
1
checked on Jun 23, 2018

Download(s)

9
checked on Jun 23, 2018

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.