Skip to main content

VDM Verlag

Designing New Crawling and Indexing Techniques for Web Search Engines

No reviews yet
Product Code: 9783639204001
ISBN13: 9783639204001
Condition: New
$63.72
$63.18
Sale 1%

Designing New Crawling and Indexing Techniques for Web Search Engines

$63.72
$63.18
Sale 1%
 
This thesis studies in a Web search engine how a crawler with limited computing resource can effectively crawl from the dynamically changing Web and acquire the most updated Web documents, and how a Web search engine can provide information-object--oriented indexing methods which enable users to retrieve desired information with high accuracy and high efficiency. To address the first problem, we design a set of sampling policies with various downloading granularity for the sampling method, taking into account the link structure, the directory structure, and the content-based features which include the clustering technique. We further extend the clustering-based sampling approach by testing more dynamic features and strategically selecting samples from each cluster. For the second problem, we propose building indexes on extracted metadata of various information objects, instead of the whole document. We set up a digital library named ArchSeer for the domain of archeology. ArchSeer allows users to retrieve archeology literature via domain-specific search engines.


Author: Qingzhao Tan
Publisher: VDM Verlag
Publication Date: Oct 02, 2009
Number of Pages: 156 pages
Binding: Paperback or Softback
ISBN-10: 363920400X
ISBN-13: 9783639204001
 

Customer Reviews

This product hasn't received any reviews yet. Be the first to review this product!

Faster Shipping

Delivery in 3-8 days

Easy Returns

14 days returns

Discount upto 30%

Monthly discount on books

Outstanding Customer Service

Support 24 hours a day