Implementing Monte Carlo tests with p‐value buckets. (17th December 2019)
- Record Type:
- Journal Article
- Title:
- Implementing Monte Carlo tests with p‐value buckets. (17th December 2019)
- Main Title:
- Implementing Monte Carlo tests with p‐value buckets
- Authors:
- Gandy, Axel
Hahn, Georg
Ding, Dong - Abstract:
- Abstract: Software packages usually report the results of statistical tests using p‐values. Users often interpret these values by comparing them with standard thresholds, for example, 0.1, 1, and 5%, which is sometimes reinforced by a star rating (***, **, and *, respectively). We consider an arbitrary statistical test whose p‐value p is not available explicitly, but can be approximated by Monte Carlo samples, for example, by bootstrap or permutation tests. The standard implementation of such tests usually draws a fixed number of samples to approximate p . However, the probability that the exact and the approximated p‐value lie on different sides of a threshold (the resampling risk) can be high, particularly for p‐values close to a threshold. We present a method to overcome this. We consider a finite set of user‐specified intervals that cover [0, 1] and that can be overlapping. We call these p‐value buckets. We present algorithms that, with arbitrarily high probability, return a p‐value bucket containing p . We prove that for both a bounded resampling risk and a finite runtime, overlapping buckets need to be employed, and that our methods both bound the resampling risk and guarantee a finite runtime for such overlapping buckets. To interpret decisions with overlapping buckets, we propose an extension of the star rating system. We demonstrate that our methods are suitable for use in standard software, including for low p‐value thresholds occurring in multiple testingAbstract: Software packages usually report the results of statistical tests using p‐values. Users often interpret these values by comparing them with standard thresholds, for example, 0.1, 1, and 5%, which is sometimes reinforced by a star rating (***, **, and *, respectively). We consider an arbitrary statistical test whose p‐value p is not available explicitly, but can be approximated by Monte Carlo samples, for example, by bootstrap or permutation tests. The standard implementation of such tests usually draws a fixed number of samples to approximate p . However, the probability that the exact and the approximated p‐value lie on different sides of a threshold (the resampling risk) can be high, particularly for p‐values close to a threshold. We present a method to overcome this. We consider a finite set of user‐specified intervals that cover [0, 1] and that can be overlapping. We call these p‐value buckets. We present algorithms that, with arbitrarily high probability, return a p‐value bucket containing p . We prove that for both a bounded resampling risk and a finite runtime, overlapping buckets need to be employed, and that our methods both bound the resampling risk and guarantee a finite runtime for such overlapping buckets. To interpret decisions with overlapping buckets, we propose an extension of the star rating system. We demonstrate that our methods are suitable for use in standard software, including for low p‐value thresholds occurring in multiple testing settings, and that they can be computationally more efficient than standard implementations. … (more)
- Is Part Of:
- Scandinavian journal of statistics. Volume 47:Number 3(2020:Sep.)
- Journal:
- Scandinavian journal of statistics
- Issue:
- Volume 47:Number 3(2020:Sep.)
- Issue Display:
- Volume 47, Issue 3 (2020)
- Year:
- 2020
- Volume:
- 47
- Issue:
- 3
- Issue Sort Value:
- 2020-0047-0003-0000
- Page Start:
- 950
- Page End:
- 967
- Publication Date:
- 2019-12-17
- Subjects:
- algorithm -- bootstrap -- hypothesis testing -- p-value -- resampling -- sampling
Statistics -- Periodicals
310 - Journal URLs:
- http://www.blackwellpublishers.co.uk/asp/journal.asp?ref=0303-6898 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1111/sjos.12434 ↗
- Languages:
- English
- ISSNs:
- 0303-6898
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8087.549000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23890.xml