A novel user trend‐based priority assigner and URL scheduler for dynamic incremental crawling. (8th August 2021)
- Record Type:
- Journal Article
- Title:
- A novel user trend‐based priority assigner and URL scheduler for dynamic incremental crawling. (8th August 2021)
- Main Title:
- A novel user trend‐based priority assigner and URL scheduler for dynamic incremental crawling
- Authors:
- Gupta, Ashlesha
Dixit, Ashutosh - Abstract:
- Summary: An efficient search engine needs to be designed in such a way that is able to provide relevant and accurate information in accordance with user needs and interests. The quality of downloaded records can be guaranteed only when website pages of high pertinence are downloaded by the crawlers in accordance with the current topics or user trends. Earlier Focused Crawlers were used to download topic specific pages but these crawlers were not able to adapt to the changing interest of the users. Therefore, there is a need to design crawlers that are able to naturally track the present pattern points and download site pages that meet client's present need. In this paper, a priority assigner and scheduler method for organizing Uniform Resource Locators (URLs) is being proposed that helps the crawler in tracking user's interest and prioritize downloading documents that are relevant to the user's choice in addition to current trends. The experimental results conforms that the proposed priority assigner and URL scheduler‐based crawling outshines conventional crawling strategies based on Change‐history or Site‐Map‐based methods in terms of quality of downloaded web pages and reducing network traffic over the Internet.
- Is Part Of:
- Concurrency and computation. Volume 34:Number 3(2022)
- Journal:
- Concurrency and computation
- Issue:
- Volume 34:Number 3(2022)
- Issue Display:
- Volume 34, Issue 3 (2022)
- Year:
- 2022
- Volume:
- 34
- Issue:
- 3
- Issue Sort Value:
- 2022-0034-0003-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2021-08-08
- Subjects:
- page rank -- priority assigner and scheduler -- search engine -- web crawler -- World Wide Web
Parallel processing (Electronic computers) -- Periodicals
Parallel computers -- Periodicals
004.35 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/cpe.6555 ↗
- Languages:
- English
- ISSNs:
- 1532-0626
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3405.622000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 20323.xml