OOIR: Observatory of International Research

Papers

(The median citation count of Journal of Educational and Behavioral Statistics is 1. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-08-01 to 2025-08-01.)

Article	Citations
Acknowledgments	29
Bayesian Change-Point Analysis Approach to Detecting Aberrant Test-Taking Behavior Using Response Times	16
Comparison of Within- and Between-Series Effect Estimates in the Meta-Analysis of Multiple Baseline Studies	16
Handling Missing Data in Cross-Classified Multilevel Analyses: An Evaluation of Different Multiple Imputation Approaches	16
Statistical Power for Estimating Treatment Effects Using Difference-in-Differences and Comparative Interrupted Time Series Estimators With Variation in Treatment Timing	15
Measurement and Uncertainty Preserving Parametric Modeling for Continuous Latent Variables With Discrete Indicators and External Variables	15
A Causal Latent Transition Model With Multivariate Outcomes and Unobserved Heterogeneity: Application to Human Capital Development	12
Analyzing Polytomous Test Data: A Comparison Between an Information-Based IRT Model and the Generalized Partial Credit Model	10
Chance-Constrained Automated Test Assembly	10
Using Response Times for Joint Modeling of Careless Responding and Attentive Response Styles	9
Speed–Accuracy Trade-Off? Not So Fast: Marginal Changes in Speed Have Inconsistent Relationships With Accuracy in Real-World Settings	9
Using MLP-F in Three Different Aberrant Behaviors in Education	9
A General Mixture Model for Cognitive Diagnosis	9
Using Extant Data to Improve Estimation of the Standardized Mean Difference	8
Commentary on “Obtaining Interpretable Parameters From Reparameterized Longitudinal Models: Transformation Matrices Between Growth Factors in Two Parameter Spaces”	8
A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement	8
Latent Transition Cognitive Diagnosis Model With Covariates: A Three-Step Approach	7
Power Analyses for Estimation of Complier Average Causal Effects Under Random Encouragement Designs in Education Research: Theory and Guidance	7
Introduction to the JEBS Special Section on Artificial Intelligence in Educational Statistics	7
Identifying Informative Predictor Variables With Random Forests	7
Sample Size Calculation and Optimal Design for Multivariate Regression-Based Norming	7
Utilizing Real-Time Test Data to Solve Attenuation Paradox in Computerized Adaptive Testing to Enhance Optimal Design	6
A Two-Stage Regression Approach to Detecting Section Score Inconsistency	6
Using Item Scores and Distractors to Detect Item Compromise and Preknowledge	6
Nonparametric Classification Method for Multiple-Choice Items in Cognitive Diagnosis	5

IRT Models for Learning With Item-Specific Learning Parameters	5
Assessing Inter-rater Reliability With Heterogeneous Variance Components Models: Flexible Approach Accounting for Contextual Variables	5
AI and Psychometrics: Epistemology, Process, and Politics	5
A Simple Technique Assessing Ordinal and Disordinal Interaction Effects	4
Improving Accuracy and Stability of Aggregate Student Growth Measures Using Empirical Best Linear Prediction	4
Jenss–Bayley Latent Change Score Model With Individual Ratio of the Growth Acceleration in the Framework of Individual Measurement Occasions	4
Combining Human and Automated Scoring Methods in Experimental Assessments of Writing: A Case Study Tutorial	4
A Two-Level Adaptive Test Battery	4
A Randomization P-Value Test for Detecting Copying on Multiple-Choice Exams	4
Obtaining Interpretable Parameters From Reparameterized Longitudinal Models: Transformation Matrices Between Growth Factors in Two Parameter Spaces	4
Harnessing AI for Educational Measurement: Standards and Emerging Frontiers	3
Inferring Individual Attributes Using Testlet-Based Visual Analogue Scaling and Beta Copula Diagnostic Classification Models	3
A Within-Group Approach to Ensemble Machine Learning Methods for Causal Inference in Multilevel Studies	3
New Iterative Algorithms for Estimation of Item Functioning	3
Bayesian Adaptive Lasso for the Detection of Differential Item Functioning in Graded Response Models	3
Finding the Right Grain-Size for Measurement in the Classroom	3
Using Regularized Methods to Validate Q-Matrix in Cognitive Diagnostic Assessment	3
Using Permutation Tests to Identify Statistically Sound and Nonredundant Sequential Patterns in Educational Event Sequences	3
Reporting Proficiency Levels for Examinees With Incomplete Data	3
Evaluating Intersectional Fairness in Algorithmic Decision Making Using Intersectional Differential Algorithmic Functioning	2
Analyzing Longitudinal Social Relations Model Data Using the Social Relations Structural Equation Model	2
A Hybrid EM Algorithm for Linear Two-Way Interactions With Missing Data	2
How Do We Demonstrate AI Responsibility: The Devil Is in the Details	2
Predictive Performance of Bayesian Stacking in Multilevel Education Data	2
Development of a High-Accuracy and Effective Online Calibration Method in CD-CAT Based on Gini Index	2
Item Pool Quality Control in Educational Testing: Change Point Model, Compound Risk, and Sequential Detection	2
Detecting Item Preknowledge Using Revisits With Speed and Accuracy	2
Using Ordering Theory to Learn Attribute Hierarchies From Examinees’ Attribute Profiles	2
Three-Part Random Effect Models for Longitudinal Skewed Survey Data With “Not Applicable” Responses	2
Smoothing of Bivariate Test Score Distributions: Model Selection Targeting Test Score Equating	2
Deep Reinforcement Learning for Adaptive Learning Systems	2
Two Statistical Tests for the Detection of Item Compromise	2
Fuzzy Regression Discontinuity Designs With Multiple Control Groups Under One-Sided Noncompliance: Evaluating Extended Time Accommodations	2
Cognitive Diagnosis Modeling Incorporating Response Times and Fixation Counts: Providing Comprehensive Feedback and Accurate Diagnosis	2
Modeling Item-Level Heterogeneous Treatment Effects With the Explanatory Item Response Model: Leveraging Large-Scale Online Assessments to Pinpoint the Impact of Educational Interventions	2
Regression Discontinuity Designs With an Ordinal Running Variable: Evaluating the Effects of Extended Time Accommodations for English-Language Learners	2
Using the Bayesian Network’s Structural Learning Algorithm to Estimate the Q-Matrix in Cognitive Diagnosis Models	2
An Improved Satterthwaite (1941, 1946) Effective df Approximation	2
Computational Strategies and Estimation Performance With Bayesian Semiparametric Item Response Theory Models	2
Reviewer Acknowledgments	2
Bayesian Q Matrix Estimation of Saturated Diagnostic Classification Models Using NIMBLE	2
Expertise on Offer: Why Isn’t Anyone Buying?	1
Bayesian Analysis Methods for Two-Level Diagnosis Classification Models	1
Testing Differential Item Functioning Without Predefined Anchor Items Using Robust Regression	1
Alternatives to Weighted Item Fit Statistics for Establishing Measurement Invariance in Many Groups	1
Erratum to Identifying Informative Predictor Variables With Random Forests	1
Profiles in Research: Lawrence J. Hubert	1
Optimizing Diagnostic Classification Models Application Considering Real-Life Constraints	1
What Is Actually Equated in “Test Equating”? A Didactic Note	1
Evaluating Psychometric Differences Between Fast Versus Slow Responses on Rating Scale Items	1

Disentangling Person-Dependent and Item-Dependent Causal Effects: Applications of Item Response Theory to the Estimation of Treatment Effect Heterogeneity	1
Automatic Text Classification With Large Language Models: A Review of `openai` for Zero- and Few-Shot Classification	1
Forced-Choice Ranking Models for Raters’ Ranking Data	1
Exploiting Network Information to Disentangle Spillover Effects in a Field Experiment on Teens’ Museum Attendance	1
Editorial	1
Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores	1
Extending the Cluster Approach to Differential Item Functioning in Polytomous Items	1
A Diagnostic Tree Model for Adaptive Assessment of Complex Cognitive Processes Using Multidimensional Response Options	1
Generalizing Beyond the Test: Permutation-Based Profile Analysis for Explaining DIF Using Item Features	1
Acknowledgments	1
Assessing Item Fit Using Expected Score Curve Under Restricted Recalibration	1
A New Multiprocess IRT Model With Ideal Points for Likert-Type Items	1
Diagnosing Primary Students’ Reading Progression: Is Cognitive Diagnostic Computerized Adaptive Testing the Way Forward?	1
A Critical View on the NEAT Equating Design: Statistical Modeling and Identifiability Problems	1
Using Response Times in Answer Similarity Analysis	1
Deep Learning Imputation for Asymmetric and Incomplete Likert-Type Items	1
Inspection-Guided Randomization: A Flexible and Transparent Restricted Randomization Framework for Better Experimental Design	1