
Abstract

Item response theory (IRT) is a modeling approach that links responses to test items with underlying latent constructs through formalized statistical models. This article focuses on how IRT can be used to advance science and practice in organizations. We describe established applications of IRT as a scale development tool and new applications of IRT as a research and theory testing tool that enables organizational researchers to improve their understanding of workers and organizations. We focus on IRT models and their application in four key research and practice areas: testing, questionnaire responding, construct validation, and measurement equivalence of scores. In so doing, we highlight how novel developments in IRT such as explanatory IRT, multidimensional IRT, random item models, and more complex models of response processes such as ideal point models and tree models can potentially advance existing science and practice in these areas. As a starting point for readers interested in learning IRT and applying recent developments in IRT in their research, we provide concrete examples with data and R code.
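To illustrate the kind of analysis the article's R examples cover, the following is a minimal sketch (not the article's own supplementary code) of fitting a unidimensional two-parameter logistic (2PL) model to simulated dichotomous item responses with the R package mirt. The item counts, parameter values, and seed are arbitrary choices for demonstration.

```r
## Minimal illustrative sketch (not the article's supplementary code):
## fit a unidimensional 2PL IRT model to simulated dichotomous responses.
library(mirt)

set.seed(1)
n_items <- 10
a <- matrix(rlnorm(n_items, meanlog = 0.2, sdlog = 0.3))   # item discriminations
d <- matrix(rnorm(n_items))                                # item intercepts (easiness)
resp <- simdata(a = a, d = d, N = 500, itemtype = "dich")  # 500 simulated respondents

fit <- mirt(resp, model = 1, itemtype = "2PL")   # model = 1: one latent dimension
coef(fit, IRTpars = TRUE, simplify = TRUE)       # discrimination/difficulty (a, b) estimates
theta <- fscores(fit)                            # EAP estimates of the latent trait
head(theta)
```

In the same spirit, explanatory IRT models of the kind mentioned in the abstract can be estimated as generalized linear mixed models with crossed person and item effects (e.g., via lme4's glmer), which is one route the R-based literature on explanatory item response modeling takes.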
