OOIR: Observatory of International Research

Papers

(The median citation count of Journal of Educational Measurement is 1. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-08-01 to 2025-08-01.)

Article	Citations
NCME Presidential Address 2022: Turning the Page to the Next Chapter of Educational Measurement	21
The Automated Test Assembly and Routing Rule for Multistage Adaptive Testing with Multidimensional Item Response Theory	16
A Statistical Test for the Detection of Item Compromise Combining Responses and Response Times	15
	14
How Many Plausible Values?	14
Assessing Differential Bundle Functioning Using Meta‐Analysis	13
Measuring the Uncertainty of Imputed Scores	12
A Note on the Use of Categorical Subscores	10
Optimal Calibration of Items for Multidimensional Achievement Tests	10
Editorial for JEM issue 58‐3	9
Linking Error on Achievement Levels Accounting for Dependencies and Complex Sampling	8
Issue Information	8
Two IRT Characteristic Curve Linking Methods Weighted by Information	7
Using Linkage Sets to Improve Connectedness in Rater Response Model Estimation	7
A Deterministic Gated Lognormal Response Time Model to Identify Examinees with Item Preknowledge	7
Historical Perspectives on Score Comparability Issues Raised by Innovations in Testing	6
Briggs, Derek C.Historical and Conceptual Foundations of Measurement in the Human Sciences: Credos and Controversies	6
Using Item Parameter Predictions for Reducing Calibration Sample Requirements—A Case Study Based on a High‐Stakes Admission Test	6
	5
An Exponentially Weighted Moving Average Procedure for Detecting Back Random Responding Behavior	5
Validity Arguments for AI‐Based Automated Scores: Essay Scoring as an Illustration	5
Model Selection Posterior Predictive Model Checking via Limited‐Information Indices for Bayesian Diagnostic Classification Modeling	5
Likelihood‐Based Estimation of Model‐Derived Oral Reading Fluency	4
Parametric Bootstrap Mantel‐Haenszel Statistic for Aggregated Testlet Effects	4
Differential and Functional Response Time Item Analysis: An Application to Understanding Paper versus Digital Reading Processes	4

Information Functions of Rank‐2PL Models for Forced‐Choice Questionnaires	4
On the Positive Correlation between DIF and Difficulty: A New Theory on the Correlation as Methodological Artifact	4
Gender Bias in Test Item Formats: Evidence from PISA 2009, 2012, and 2015 Math and Reading Tests	4
Using Response Time in Multidimensional Computerized Adaptive Testing	3
Using Eye‐Tracking Data as Part of the Validity Argument for Multiple‐Choice Questions: A Demonstration	3
Score Comparability between Online Proctored and In‐Person Credentialing Exams	3
Issue Information	3
Controlling the Speededness of Assembled Test Forms: A Generalization to the Three‐Parameter Lognormal Response Time Model	3
	3
Detecting Group Collaboration Using Multiple Correspondence Analysis	3
Sensemaking of Process Data from Evaluation Studies of Educational Games: An Application of Cross‐Classified Item Response Theory Modeling	3
An Item Response Tree Model for Items with Multiple‐Choice and Constructed‐Response Parts	3
DIF Detection for Multiple Groups: Comparing Three‐Level GLMMs and Multiple‐Group IRT Models	3
A Generalized Objective Function for Computer Adaptive Item Selection	3
Addressing Bias in Spoken Language Systems Used in the Development and Implementation of Automated Child Language‐Based Assessment	3
Utilizing Response Time for Item Selection in On‐the‐Fly Multistage Adaptive Testing for PISA Assessment	3
Exploring the Impact of Random Guessing in Distractor Analysis	3
Issue Information	2
MSAEM Estimation for Confirmatory Multidimensional Four‐Parameter Normal Ogive Models	2
	2
Measuring the Impact of Peer Interaction in Group Oral Assessments with an Extended Many‐Facet Rasch Model	2
Cognitive Diagnostic Multistage Testing by Partitioning Hierarchically Structured Attributes	2
	2
A Highly Adaptive Testing Design for PISA	2
Issue Information	2
BettyLanteigne, ChristineCoombe, & James DeanBrown. 2021. Challenges in Language Testing around the World: Insights for language test users. Singapore: Springer, 2021, 129.99 € (hardcover),	2
Subscores: A Practical Guide to Their Production and Consumption. ShelbyHaberman, SandipSinharay, RichardFeinberg, and HowardWainer. Cambridge, Cambridge University Press2024, 176 pp. (paperback)	2
On the Choice of Parameters for the Lognormal Model for Response Times: Commentary on Becker et al. (2013)	2
	2
Constructing a Robust Score Scale from IRT Scores with Informed Boundaries	2
Detecting Multidimensional DIF in Polytomous Items with IRT Methods and Estimation Approaches	2
Comparing and Combining IRTree Models and Anchoring Vignettes in Addressing Response Styles	2
Explanatory Cognitive Diagnostic Modeling Incorporating Response Times	2
Online Monitoring of Test‐Taking Behavior Based on Item Responses and Response Times	2
The Vulnerability of AI‐Based Scoring Systems to Gaming Strategies: A Case Study	2
Issue Information	2
Curvilinearity in the Reference Composite and Practical Implications for Measurement	1
Modeling Hierarchical Attribute Structures in Diagnostic Classification Models with Multiple Attempts	1
	1
Robustness of Item Response Theory Models under the PISA Multistage Adaptive Testing Designs	1
Using Keystroke Dynamics to Detect Nonoriginal Text	1
The Impact of Cheating on Score Comparability via Pool‐Based IRT Pre‐equating	1
Using Item Scores and Distractors in Person‐Fit Assessment	1
Issue Information	1
Reckase, M.The Psychometrics of Standard Setting: Connecting Policy and Test Scores: First edition published 2023 by CRC Press, 6000 Broken Sound Parkway NW, Suite 300, Boca Raton, FL 33487‐274	1
Validity Arguments Meet Artificial Intelligence in Innovative Educational Assessment: A Discussion and Look Forward	1
Validity Arguments Meet Artificial Intelligence in Innovative Educational Assessment	1
Modeling the Intraindividual Relation of Ability and Speed within a Test	1
IRT Observed‐Score Equating for Rater‐Mediated Assessments Using a Hierarchical Rater Model	1
Influence of Intersectional Routing Modules between Dimensions on Measurement Precision in Multidimensional Multistage Testing	1

Argument‐Based Approach to Validity: Developing a Living Document and Incorporating Preregistration	1
Evaluating the Consistency and Reliability of Attribution Methods in Automated Short Answer Grading (ASAG) Systems: Toward an Explainable Scoring System	1
	1
Detecting Differential Item Functioning Using Posterior Predictive Model Checking: A Comparison of Discrepancy Statistics	1
A Dual‐Purpose Model for Binary Data: Estimating Ability and Misconceptions	1
A Factor Mixture Model for Item Responses and Certainty of Response Indices to Identify Student Knowledge Profiles	1
On Joining a Signal Detection Choice Model with Response Time Models	1
Fully Gibbs Sampling Algorithms for Bayesian Variable Selection in Latent Regression Models	1
Issue Information	1
	1
Using Multilabel Neural Network to Score High‐Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment	1
	1