Comparative Analysis of the Information Retrieval Strategies in Web Crawling

Saini, Chandni

Comparative Analysis of the Information Retrieval Strategies in Web Crawling

Files

4065.pdf (2.71 MB)

Date

2016-08-11

Authors

Saini, Chandni

Supervisors

Arora, Vinay

Abstract

In today’s scenario, World Wide Web (WWW) is flooded with huge amount of information. Due to growing popularity of the internet, finding the meaningful information among billions of information resources on the WWW is a challenging task. The Information Retrieval (IR) provides relevant information to the end users which satisfy their requirement. Search engine is used to extract valuable information from the internet. Web crawler is the principal part of search engine; it is an automatic script or program which can browse the web in automatic manner. This process is known as web crawling. In this literature, review on the strategies of information retrieval in web crawling has been presented that are classified into four main categories viz: focused, distributed, incremental and hidden web crawler. Finally, on the basis of user customized parameters the comparative analysis of various IR strategies has been performed.

Keywords

Information reterival, Web crawling, Search Engine

URI

http://hdl.handle.net/10266/4065

Collections

Masters Theses@CSED

Full item page

Comparative Analysis of the Information Retrieval Strategies in Web Crawling

Files

Date

Authors

Supervisors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By