An efficient page ranking approach based on vector norms using sNorm(p) algorithm. Issue 3 (May 2019)
- Record Type:
- Journal Article
- Title:
- An efficient page ranking approach based on vector norms using sNorm(p) algorithm. Issue 3 (May 2019)
- Main Title:
- An efficient page ranking approach based on vector norms using sNorm(p) algorithm
- Authors:
- Goel, Shubham
Kumar, Ravinder
Kumar, Munish
Chopra, Vikram - Abstract:
- Highlights: s Norm( p ) is a novel system model, designed for efficient ranking of web resources. s Norm( p ) uses p-Norm from Vector Norm family for computations in hub and authority vectors. Two datasets i.e. huge web graph and ODP, for our experiment to show the effectiveness of s Norm( p ). Extensive comparison with PageRank, HITS and SALSA w.r.t ranking given to different web resources. s Norm( p ) outperforms state of the art methods by attaining 0.999719 as MRR value. Abstract: In the whole world, the internet is exercised by millions of people every day for information retrieval. Even for a small to smaller task like fixing a fan, to cook food or even to iron clothes persons opt to search the web. To fulfill the information needs of people, there are billions of web pages, each having a different degree of relevance to the topic of interest (TOI), scattered throughout the web but this huge size makes manual information retrieval impossible. The page ranking algorithm is an integral part of search engines as it arranges web pages associated with a queried TOI in order of their relevance level. It, therefore, plays an important role in regulating the search quality and user experience for information retrieval. PageRank, HITS, and SALSA are well-known page ranking algorithm based on link structure analysis of a seed set, but ranking given by them has not yet been efficient. In this paper, we propose a variant of SALSA to give sNorm(p) for the efficient ranking of webHighlights: s Norm( p ) is a novel system model, designed for efficient ranking of web resources. s Norm( p ) uses p-Norm from Vector Norm family for computations in hub and authority vectors. Two datasets i.e. huge web graph and ODP, for our experiment to show the effectiveness of s Norm( p ). Extensive comparison with PageRank, HITS and SALSA w.r.t ranking given to different web resources. s Norm( p ) outperforms state of the art methods by attaining 0.999719 as MRR value. Abstract: In the whole world, the internet is exercised by millions of people every day for information retrieval. Even for a small to smaller task like fixing a fan, to cook food or even to iron clothes persons opt to search the web. To fulfill the information needs of people, there are billions of web pages, each having a different degree of relevance to the topic of interest (TOI), scattered throughout the web but this huge size makes manual information retrieval impossible. The page ranking algorithm is an integral part of search engines as it arranges web pages associated with a queried TOI in order of their relevance level. It, therefore, plays an important role in regulating the search quality and user experience for information retrieval. PageRank, HITS, and SALSA are well-known page ranking algorithm based on link structure analysis of a seed set, but ranking given by them has not yet been efficient. In this paper, we propose a variant of SALSA to give sNorm(p) for the efficient ranking of web pages. Our approach relies on a p-Norm from Vector Norm family in a novel way for the ranking of web pages as Vector Norms can reduce the impact of low authority weight in hub weight calculation in an efficient way. Our study, then compares the rankings given by PageRank, HITS, SALSA, and sNorm(p) to the same pages in the same query. The effectiveness of the proposed approach over state of the art methods has been shown using performance measurement technique, Mean Reciprocal Rank (MRR), Precision, Mean Average Precision (MAP), Discounted Cumulative Gain (DCG) and Normalized DCG (NDCG). The experimentation is performed on a dataset acquired after pre-processing of the results collected from initial few pages retrieved for a query by the Google search engine. Based on the type and amount of in-hand domain expertise 30 queries are designed. The extensive evaluation and result analysis are performed using MRR, Precision@k, MAP, DCG, and NDCG as the performance measuring statistical metrics. Furthermore, results are statistically verified using a significance test. Findings show that our approach outperforms state of the art methods by attaining 0.8666 as MRR value, 0.7957 as MAP value. Thus contributing to the improvement in the ranking of web pages more efficiently as compared to its counterparts. … (more)
- Is Part Of:
- Information processing & management. Volume 56:Issue 3(2019:May)
- Journal:
- Information processing & management
- Issue:
- Volume 56:Issue 3(2019:May)
- Issue Display:
- Volume 56, Issue 3 (2019)
- Year:
- 2019
- Volume:
- 56
- Issue:
- 3
- Issue Sort Value:
- 2019-0056-0003-0000
- Page Start:
- 1053
- Page End:
- 1066
- Publication Date:
- 2019-05
- Subjects:
- Page ranking -- HITS -- SALSA -- Vector Norm -- Mean reciprocal rank -- p-Norm
Information storage and retrieval systems -- Periodicals
Information science -- Periodicals
Systèmes d'information -- Périodiques
Sciences de l'information -- Périodiques
Information science
Information storage and retrieval systems
Periodicals
658.4038 - Journal URLs:
- http://www.sciencedirect.com/science/journal/03064573 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.ipm.2019.02.004 ↗
- Languages:
- English
- ISSNs:
- 0306-4573
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4493.893000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12860.xml