Information Retrieval System: a Domain Specific Parallel Crawler - Nidhi Tyagi - Books - VDM Verlag Dr. Müller - 9783639377798 - August 24, 2011

Information Retrieval System: a Domain Specific Parallel Crawler



The World Wide Web is an interlinked collection of billions of documents formatted using HTML. Because the web is growing and dynamic, traversing and handling all the URLs found in web documents has become a challenge, which makes it imperative to parallelize the crawling process. The crawling process is parallelized in the form of an ecology of crawler workers that download information from the web in parallel. This paper proposes a novel architecture for a parallel crawler based on domain-specific crawling. The architecture makes the crawling task more effective and scalable, and shares the load among the different crawlers, which download in parallel the web pages related to different domain-specific URLs.
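As a rough illustration of the idea in the description, the sketch below partitions seed URLs by domain and gives each domain its own crawler worker. It is not the book's architecture: it assumes "domain" means the last label of the host name (such as edu or org), and the function names, the example.edu and example.org URLs, and the fake_fetch stand-in for a real download are all hypothetical.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor
from urllib.parse import urlparse


def domain_of(url):
    """Return the last label of the host name, e.g. 'edu' (assumed meaning of 'domain')."""
    host = urlparse(url).hostname or ""
    return host.rsplit(".", 1)[-1]


def partition_by_domain(urls):
    """Group seed URLs so that each domain gets its own crawl queue."""
    queues = defaultdict(list)
    for url in urls:
        queues[domain_of(url)].append(url)
    return dict(queues)


def fake_fetch(url):
    """Hypothetical stand-in for a real download; no network access is made."""
    return f"<html>content of {url}</html>"


def crawl_domain(domain, urls, fetch=fake_fetch):
    """One crawler worker: fetch every URL assigned to its domain."""
    return domain, {url: fetch(url) for url in urls}


def parallel_crawl(urls, fetch=fake_fetch):
    """Run one worker per domain in parallel and collect pages keyed by domain."""
    queues = partition_by_domain(urls)
    if not queues:
        return {}
    with ThreadPoolExecutor(max_workers=len(queues)) as pool:
        futures = [
            pool.submit(crawl_domain, domain, queue, fetch)
            for domain, queue in queues.items()
        ]
        return dict(future.result() for future in futures)
```

Because each worker owns one domain's queue, no URL is fetched by two workers, and the load is split along domain boundaries. A real crawler would also need link extraction, politeness delays, and a way to route newly discovered URLs to the worker responsible for their domain.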

Media Books     Paperback Book   (softcover book with glued spine)
Published August 24, 2011
ISBN13 9783639377798
Publisher VDM Verlag Dr. Müller
Pages 92
Dimensions 150 × 6 × 226 mm   ·   145 g
Language English
