Design of topic Web crawler based on improved PageRank algorithm. Issue 1 (February 2021)
- Record Type:
- Journal Article
- Title:
- Design of topic Web crawler based on improved PageRank algorithm. Issue 1 (February 2021)
- Main Title:
- Design of topic Web crawler based on improved PageRank algorithm
- Authors:
- Yu, Linxuan
Li, Yeli
Zeng, Qingtao - Abstract:
- Abstract: With the continuous development of network information technology, the network is filled with a large number of all kinds of unstructured data called big data. However, this data is not easily stored in a local database. People realize that it is essential to get useful information from the Internet efficiently. The effort to gather information by human hands has led to the emergence of web crawler technology. However, the existing search engines still have shortcomings in topic similarity judgment and web page sorting algorithm. Therefore, this paper applies PageRank algorithm to topic crawler, constructs a vertical search engine, and introduces topic relevance factor to suppress "topic drift" according to the shortcomings of PageRank algorithm.
- Is Part Of:
- Journal of physics. Volume 1754:Issue 1(2021)
- Journal:
- Journal of physics
- Issue:
- Volume 1754:Issue 1(2021)
- Issue Display:
- Volume 1754, Issue 1 (2021)
- Year:
- 2021
- Volume:
- 1754
- Issue:
- 1
- Issue Sort Value:
- 2021-1754-0001-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-02
- Subjects:
- Physics -- Congresses
530.5 - Journal URLs:
- http://www.iop.org/EJ/journal/1742-6596 ↗
http://ioppublishing.org/ ↗ - DOI:
- 10.1088/1742-6596/1754/1/012210 ↗
- Languages:
- English
- ISSNs:
- 1742-6588
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5036.223000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 25338.xml