Journal of Educational Measurement

Papers
(The TQCC of Journal of Educational Measurement is 2. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2022-06-01 to 2026-06-01.)
Article | Citations
Longitudinal Cross‐Classified Item Response Theory Model: Application to Longitudinal Rater‐Mediated Assessment | 32
(no title listed) | 32
Measuring the Uncertainty of Imputed Scores | 20
NCME Presidential Address 2022: Turning the Page to the Next Chapter of Educational Measurement | 19
Optimal Calibration of Items for Multidimensional Achievement Tests | 13
How Many Plausible Values? | 11
Leveraging Process Data and Variable Selection for Achievement Estimation in Large‐Scale Assessments | 10
Parameter Estimation in Comparative Judgment Under Random and Adaptive Scheduling Schemes | 9
A Statistical Test for the Detection of Item Compromise Combining Responses and Response Times | 8
A Note on the Use of Categorical Subscores | 8
Issue Information | 6
Linking Error on Achievement Levels Accounting for Dependencies and Complex Sampling | 6
Comparing Data‐Driven Methods for Removing Options in Assessment Items | 6
The Precision and Bias of Cut Score Estimates from the Beuk Standard Setting Method | 6
Using Linkage Sets to Improve Connectedness in Rater Response Model Estimation | 5
Using Item Parameter Predictions for Reducing Calibration Sample Requirements—A Case Study Based on a High‐Stakes Admission Test | 5
Automated Coding of Communications in Collaborative Problem‐Solving Tasks Using ChatGPT | 5
Issue Information | 5
A Quantitative Method for Evaluating the Predictive Utility of Linked Scores | 4
Model Selection Posterior Predictive Model Checking via Limited‐Information Indices for Bayesian Diagnostic Classification Modeling | 4
Likelihood‐Based Estimation of Model‐Derived Oral Reading Fluency | 4
A Deterministic Gated Lognormal Response Time Model to Identify Examinees with Item Preknowledge | 4
Validity Arguments for AI‐Based Automated Scores: Essay Scoring as an Illustration | 4
Parametric Bootstrap Mantel‐Haenszel Statistic for Aggregated Testlet Effects | 4
An Exponentially Weighted Moving Average Procedure for Detecting Back Random Responding Behavior | 4
Briggs, Derek C. Historical and Conceptual Foundations of Measurement in the Human Sciences: Credos and Controversies | 4
(no title listed) | 4
Information Functions of Rank‐2PL Models for Forced‐Choice Questionnaires | 4
Using Response Time in Multidimensional Computerized Adaptive Testing | 3
Differential and Functional Response Time Item Analysis: An Application to Understanding Paper versus Digital Reading Processes | 3
Special Issue: Adaptive Testing in Large‐Scale Assessments | 3
Issue Information | 3
Controlling the Speededness of Assembled Test Forms: A Generalization to the Three‐Parameter Lognormal Response Time Model | 3
Issue Information | 3
Gender Bias in Test Item Formats: Evidence from PISA 2009, 2012, and 2015 Math and Reading Tests | 3
Addressing Bias in Spoken Language Systems Used in the Development and Implementation of Automated Child Language‐Based Assessment | 3
(no title listed) | 3
An Item Response Tree Model for Items with Multiple‐Choice and Constructed‐Response Parts | 3
DIF Detection for Multiple Groups: Comparing Three‐Level GLMMs and Multiple‐Group IRT Models | 3
A Generalized Objective Function for Computer Adaptive Item Selection | 3
Simultaneous Detection of Compromised Items and Examinees with Item Preknowledge in Online Assessments Using Response Time Data | 3
Sensemaking of Process Data from Evaluation Studies of Educational Games: An Application of Cross‐Classified Item Response Theory Modeling | 3
Utilizing Response Time for Item Selection in On‐the‐Fly Multistage Adaptive Testing for PISA Assessment | 3
Detecting Group Collaboration Using Multiple Correspondence Analysis | 3
Issue Information | 2
Detecting Multidimensional DIF in Polytomous Items with IRT Methods and Estimation Approaches | 2
MSAEM Estimation for Confirmatory Multidimensional Four‐Parameter Normal Ogive Models | 2
Measuring the Impact of Peer Interaction in Group Oral Assessments with an Extended Many‐Facet Rasch Model | 2
(no title listed) | 2
Issue Information | 2
Issue Information | 2
The Vulnerability of AI‐Based Scoring Systems to Gaming Strategies: A Case Study | 2
(no title listed) | 2
Mapping out the Hexagon Measurement Framework as a Blueprint Underlying Measurement in the Human Sciences | 2
Comparing and Combining IRTree Models and Anchoring Vignettes in Addressing Response Styles | 2
Using GPT‐4 to Augment Imbalanced Data for Automatic Scoring | 2
Online Monitoring of Test‐Taking Behavior Based on Item Responses and Response Times | 2
On the Choice of Parameters for the Lognormal Model for Response Times: Commentary on Becker et al. (2013) | 2
Backward Construct Mapping in Practice: Using Rasch Measurement to Reveal the Internal Structure of Inflectional Morphological Processing in Students with Dyslexia | 2
(no title listed) | 2
Subscores: A Practical Guide to Their Production and Consumption. Shelby Haberman, Sandip Sinharay, Richard Feinberg, and Howard Wainer. Cambridge, Cambridge University Press, 2024, 176 pp. (paperback) | 2
Generalizability Theory for Randomly Parallel Testing | 2
Betty Lanteigne, Christine Coombe, & James Dean Brown. 2021. Challenges in Language Testing around the World: Insights for language test users. Singapore: Springer, 2021, 129.99 € (hardcover) | 2
A Highly Adaptive Testing Design for PISA | 2
Using Multilabel Neural Network to Score High‐Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment | 2
Issue Information | 2
Cognitive Diagnostic Multistage Testing by Partitioning Hierarchically Structured Attributes | 2