Comparing Traditional and IRT Scoring of Forced-Choice Tests. (November 2015)
- Record Type:
- Journal Article
- Title:
- Comparing Traditional and IRT Scoring of Forced-Choice Tests. (November 2015)
- Main Title:
- Comparing Traditional and IRT Scoring of Forced-Choice Tests
- Authors:
- Hontangas, Pedro M.
de la Torre, Jimmy
Ponsoda, Vicente
Leenen, Iwin
Morillo, Daniel
Abad, Francisco J.
- Abstract:
- This article explores how traditional scores obtained from different forced-choice (FC) formats relate to their true scores and item response theory (IRT) estimates. Three FC formats are considered from a block of items, and respondents are asked to (a) pick the item that describes them most (PICK), (b) choose the two items that describe them the most and the least (MOLE), or (c) rank all the items in the order of their descriptiveness of the respondents (RANK). The multi-unidimensional pairwise-preference (MUPP) model, which is extended to more than two items per block and different FC formats, is applied to obtain the responses to each item block. Traditional and IRT (i.e., expected a posteriori) scores are computed from each data set and compared. The aim is to clarify the conditions under which simpler traditional scoring procedures for FC formats may be used in place of the more appropriate IRT estimates for the purpose of inter-individual comparisons. Six independent variables are considered: response format, number of items per block, correlation between the dimensions, item discrimination level, and sign-heterogeneity and variability of item difficulty parameters. Results show that the RANK response format outperforms the other formats for both the IRT estimates and traditional scores, although it is only slightly better than the MOLE format. The highest correlations between true and traditional scores are found when the test has a large number of blocks, dimensions assessed are independent, items have high discrimination and highly dispersed location parameters, and the test contains blocks formed by positive and negative items. …
- Is Part Of:
- Applied psychological measurement. Volume 39:Number 8(2015)
- Journal:
- Applied psychological measurement
- Issue:
- Volume 39:Number 8(2015)
- Issue Display:
- Volume 39, Issue 8 (2015)
- Year:
- 2015
- Volume:
- 39
- Issue:
- 8
- Issue Sort Value:
- 2015-0039-0008-0000
- Page Start:
- 598
- Page End:
- 612
- Publication Date:
- 2015-11
- Subjects:
- forced choice -- ipsative data -- multi-unidimensional pairwise-preference -- MUPP -- unfolding model -- GGUM -- EAP -- traditional scoring -- personality assessment -- faking
Psychometrics -- Periodicals
Psychological tests -- Periodicals
Psychology, Applied -- Periodicals
150.1519505
- Journal URLs:
- http://apm.sagepub.com
http://www-us.ebsco.com/online/direct.asp?JournalID=103714
http://www.ingentaselect.com/rpsv/ij/sage/01466216/contp1.htm
http://www.sagepublications.com/
http://firstsearch.oclc.org
- DOI:
- 10.1177/0146621615585851
- Languages:
- English
- ISSNs:
- 0146-6216
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 6508.xml