Journal of Educational and Behavioral Statistics

Papers
(The median citation count of Journal of Educational and Behavioral Statistics is 1. The table below lists the papers above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers publications from the past four years, i.e., from 2021-05-01 to 2025-05-01.)
Article | Citations
Handling Missing Data in Cross-Classified Multilevel Analyses: An Evaluation of Different Multiple Imputation Approaches | 31
Bayesian Change-Point Analysis Approach to Detecting Aberrant Test-Taking Behavior Using Response Times | 27
Acknowledgments | 22
Comparison of Within- and Between-Series Effect Estimates in the Meta-Analysis of Multiple Baseline Studies | 16
Using MLP-F in Three Different Aberrant Behaviors in Education | 14
Chance-Constrained Automated Test Assembly | 14
A Causal Latent Transition Model With Multivariate Outcomes and Unobserved Heterogeneity: Application to Human Capital Development | 12
Measurement and Uncertainty Preserving Parametric Modeling for Continuous Latent Variables With Discrete Indicators and External Variables | 12
Statistical Power for Estimating Treatment Effects Using Difference-in-Differences and Comparative Interrupted Time Series Estimators With Variation in Treatment Timing | 12
Analyzing Polytomous Test Data: A Comparison Between an Information-Based IRT Model and the Generalized Partial Credit Model | 12
Analyzing Cross-Sectionally Clustered Data Using Generalized Estimating Equations | 10
Using Response Times for Joint Modeling of Careless Responding and Attentive Response Styles | 9
Introduction to the JEBS Special Section on Artificial Intelligence in Educational Statistics | 8
Speed–Accuracy Trade-Off? Not So Fast: Marginal Changes in Speed Have Inconsistent Relationships With Accuracy in Real-World Settings | 8
Identifying Informative Predictor Variables With Random Forests | 8
A General Mixture Model for Cognitive Diagnosis | 8
Using Extant Data to Improve Estimation of the Standardized Mean Difference | 7
Latent Transition Cognitive Diagnosis Model With Covariates: A Three-Step Approach | 7
Power Analyses for Estimation of Complier Average Causal Effects Under Random Encouragement Designs in Education Research: Theory and Guidance | 7
Commentary on “Obtaining Interpretable Parameters From Reparameterized Longitudinal Models: Transformation Matrices Between Growth Factors in Two Parameter Spaces” | 7
A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement | 7
Utilizing Real-Time Test Data to Solve Attenuation Paradox in Computerized Adaptive Testing to Enhance Optimal Design | 6
Sample Size Calculation and Optimal Design for Multivariate Regression-Based Norming | 6
Using Item Scores and Distractors to Detect Item Compromise and Preknowledge | 6
AI and Psychometrics: Epistemology, Process, and Politics | 6
A Two-Stage Regression Approach to Detecting Section Score Inconsistency | 6
A Two-Level Adaptive Test Battery | 5
Nonparametric Classification Method for Multiple-Choice Items in Cognitive Diagnosis | 5
Assessing Inter-rater Reliability With Heterogeneous Variance Components Models: Flexible Approach Accounting for Contextual Variables | 5
IRT Models for Learning With Item-Specific Learning Parameters | 5
New Iterative Algorithms for Estimation of Item Functioning | 4
A Randomization P-Value Test for Detecting Copying on Multiple-Choice Exams | 4
A Simple Technique Assessing Ordinal and Disordinal Interaction Effects | 4
Improving Accuracy and Stability of Aggregate Student Growth Measures Using Empirical Best Linear Prediction | 3
Using Regularized Methods to Validate Q-Matrix in Cognitive Diagnostic Assessment | 3
Harnessing AI for Educational Measurement: Standards and Emerging Frontiers | 3
Obtaining Interpretable Parameters From Reparameterized Longitudinal Models: Transformation Matrices Between Growth Factors in Two Parameter Spaces | 3
Using Sequence Mining Techniques for Understanding Incorrect Behavioral Patterns on Interactive Tasks | 3
Combining Human and Automated Scoring Methods in Experimental Assessments of Writing: A Case Study Tutorial | 3
Jenss–Bayley Latent Change Score Model With Individual Ratio of the Growth Acceleration in the Framework of Individual Measurement Occasions | 3
Using Permutation Tests to Identify Statistically Sound and Nonredundant Sequential Patterns in Educational Event Sequences | 3
A Within-Group Approach to Ensemble Machine Learning Methods for Causal Inference in Multilevel Studies | 2
Bayesian Adaptive Lasso for the Detection of Differential Item Functioning in Graded Response Models | 2
Deep Reinforcement Learning for Adaptive Learning Systems | 2
A Hybrid EM Algorithm for Linear Two-Way Interactions With Missing Data | 2
Development of a High-Accuracy and Effective Online Calibration Method in CD-CAT Based on Gini Index | 2
Predictive Performance of Bayesian Stacking in Multilevel Education Data | 2
How Do We Demonstrate AI Responsibility: The Devil Is in the Details | 2
Smoothing of Bivariate Test Score Distributions: Model Selection Targeting Test Score Equating | 2
Reporting Proficiency Levels for Examinees With Incomplete Data | 2
Fuzzy Regression Discontinuity Designs With Multiple Control Groups Under One-Sided Noncompliance: Evaluating Extended Time Accommodations | 2
Two Statistical Tests for the Detection of Item Compromise | 2
Reviewer Acknowledgments | 2
Inferring Individual Attributes Using Testlet-Based Visual Analogue Scaling and Beta Copula Diagnostic Classification Models | 2
Finding the Right Grain-Size for Measurement in the Classroom | 2
An Improved Satterthwaite (1941, 1946) Effective df Approximation | 2
Three-Part Random Effect Models for Longitudinal Skewed Survey Data With “Not Applicable” Responses | 2
Computational Strategies and Estimation Performance With Bayesian Semiparametric Item Response Theory Models | 2
Cognitive Diagnosis Modeling Incorporating Response Times and Fixation Counts: Providing Comprehensive Feedback and Accurate Diagnosis | 2
Extending the Cluster Approach to Differential Item Functioning in Polytomous Items | 1
A Diagnostic Tree Model for Adaptive Assessment of Complex Cognitive Processes Using Multidimensional Response Options | 1
Evaluating Intersectional Fairness in Algorithmic Decision Making Using Intersectional Differential Algorithmic Functioning | 1
Using Ordering Theory to Learn Attribute Hierarchies From Examinees’ Attribute Profiles | 1
Modeling Item-Level Heterogeneous Treatment Effects With the Explanatory Item Response Model: Leveraging Large-Scale Online Assessments to Pinpoint the Impact of Educational Interventions | 1
Bayesian Analysis Methods for Two-Level Diagnosis Classification Models | 1
Testing Differential Item Functioning Without Predefined Anchor Items Using Robust Regression | 1
Exploiting Network Information to Disentangle Spillover Effects in a Field Experiment on Teens’ Museum Attendance | 1
Generalizing Beyond the Test: Permutation-Based Profile Analysis for Explaining DIF Using Item Features | 1
What Is Actually Equated in “Test Equating”? A Didactic Note | 1
Evaluating Psychometric Differences Between Fast Versus Slow Responses on Rating Scale Items | 1
Detecting Item Preknowledge Using Revisits With Speed and Accuracy | 1
Mean Comparisons of Many Groups in the Presence of DIF: An Evaluation of Linking and Concurrent Scaling Approaches | 1
Expertise on Offer: Why Isn’t Anyone Buying? | 1
Using Response Times in Answer Similarity Analysis | 1
Acknowledgments | 1
Disentangling Person-Dependent and Item-Dependent Causal Effects: Applications of Item Response Theory to the Estimation of Treatment Effect Heterogeneity | 1
Automatic Text Classification With Large Language Models: A Review of openai for Zero- and Few-Shot Classification | 1
Analyzing Longitudinal Social Relations Model Data Using the Social Relations Structural Equation Model | 1
Regression Discontinuity Designs With an Ordinal Running Variable: Evaluating the Effects of Extended Time Accommodations for English-Language Learners | 1
Item Pool Quality Control in Educational Testing: Change Point Model, Compound Risk, and Sequential Detection | 1
Deep Learning Imputation for Asymmetric and Incomplete Likert-Type Items | 1
Alternatives to Weighted Item Fit Statistics for Establishing Measurement Invariance in Many Groups | 1
Editorial | 1
Optimizing Diagnostic Classification Models Application Considering Real-Life Constraints | 1
Assessing Item Fit Using Expected Score Curve Under Restricted Recalibration | 1
Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores | 1