Using score distributions to compare statistical significance tests for information retrieval evaluation. (5th April 2019)
- Record Type:
- Journal Article
- Title:
- Using score distributions to compare statistical significance tests for information retrieval evaluation. (5th April 2019)
- Main Title:
- Using score distributions to compare statistical significance tests for information retrieval evaluation
- Authors:
- Parapar, Javier
Losada, David E.
Presedo‐Quindimil, Manuel A.
Barreiro, Alvaro - Abstract:
- Abstract : Statistical significance tests can provide evidence that the observed difference in performance between 2 methods is not due to chance. In information retrieval (IR), some studies have examined the validity and suitability of such tests for comparing search systems. We argue here that current methods for assessing the reliability of statistical tests suffer from some methodological weaknesses, and we propose a novel way to study significance tests for retrieval evaluation. Using Score Distributions, we model the output of multiple search systems, produce simulated search results from such models, and compare them using various significance tests. A key strength of this approach is that we assess statistical tests under perfect knowledge about the truth or falseness of the null hypothesis. This new method for studying the power of significance tests in IR evaluation is formal and innovative. Following this type of analysis, we found that both the sign test and Wilcoxon signed test have more power than the permutation test and the t‐ test. The sign test and Wilcoxon signed test also have good behavior in terms of type I errors. The bootstrap test shows few type I errors, but it has less power than the other methods tested.
- Is Part Of:
- Journal of the Association for Information Science and Technology. Volume 71:Number 1(2020:Jan.)
- Journal:
- Journal of the Association for Information Science and Technology
- Issue:
- Volume 71:Number 1(2020:Jan.)
- Issue Display:
- Volume 71, Issue 1 (2020)
- Year:
- 2020
- Volume:
- 71
- Issue:
- 1
- Issue Sort Value:
- 2020-0071-0001-0000
- Page Start:
- 98
- Page End:
- 113
- Publication Date:
- 2019-04-05
- Subjects:
- Information science -- Periodicals
Information technology -- Periodicals
020.5 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/%28ISSN%292330-1643 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/asi.24203 ↗
- Languages:
- English
- ISSNs:
- 2330-1635
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4704.325000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12433.xml