Language Testing

Papers
(The median citation count of Language Testing is 1. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-04-01 to 2024-04-01.)
ArticleCitations
A comprehensive review of Rasch measurement in language assessment: Recommendations and guidelines for research59
Test Review: Current options in at-home language proficiency tests for making high-stakes decisions34
A meta-analysis of self-assessment and language performance in language testing and assessment31
“Am I qualified to be a language tester?”: Understanding the development of language assessment literacy across three stakeholder groups27
Automated scoring of junior and senior high essays using Coh-Metrix features: Implications for large-scale language testing23
The effect of response order on candidate viewing behaviour and item difficulty in a multiple-choice listening test22
Developing individualized feedback for listening assessment: Combining standard setting and cognitive diagnostic assessment approaches21
Hanyu Shuiping Kaoshi (HSK): A multi-level, multi-purpose proficiency test19
An eye-tracking study of attention to visual cues in L2 listening tests18
What can gaze behaviors, neuroimaging data, and test scores tell us about test method effects and cognitive load in listening assessments?17
Young learners’ voices: Towards a learner-centered approach to understanding language assessment literacy16
Interpreting testing and assessment: A state-of-the-art review15
More efficient processes for creating automated essay scoring frameworks: A demonstration of two algorithms15
Using confidence intervals to determine adequate item sample sizes for vocabulary tests: An essential but overlooked practice15
What scores from monologic speaking tests can(not) tell us about interactional competence14
Examining the L2 reading comprehension ability of adult ELLs: Developing a diagnostic test within the cognitive diagnostic assessment framework13
The typology of second language listening constructs: A systematic review11
Revisiting rating scale development for rater-mediated language performance assessments: Modelling construct and contextual choices made by scale developers10
Drawing on repeat test takers to study test preparation practices and their links to score gains10
Examining the effects of different English speech varieties on an L2 academic listening comprehension test at the item level10
Hong Kong secondary students’ perspectives on selecting test difficulty level and learner washback: Effects of a graded approach to assessment8
Critical language assessment literacy of EFL teachers: Scale construction and validation8
Dimensionality of speech fluency: Examining the relationships among complexity, accuracy, and fluency (CAF) features of speaking performances on the Aptis test7
Exploring which test-taker characteristics predict young L2 learners’ performance on listening and reading comprehension tests7
Assessing Rasch measurement estimation methods across R packages with yes/no vocabulary test data7
The longitudinal stability of rating characteristics in an EFL examination: Methodological and substantive considerations7
Predicting communicative effectiveness in the international workplace: Support for TOEIC® Speaking test scores from linguistic laypersons6
Investigating and optimizing score dependability of a local ITA speaking test across language groups: A generalizability theory approach6
Adaptation of the British Sign Language Receptive Skills Test into Polish Sign Language6
Bridging local needs and national standards: Use of standards-based individualized feedback of an in-house EFL listening test in China6
The vexing problem of validity and the future of second language assessment6
An investigation of the validity of a speaking assessment for adolescent English language learners6
Local tests, local contexts6
Application of an Automated Essay Scoring engine to English writing assessment using Many-Facet Rasch Measurement5
A comparative judgment approach to assessing Chinese Sign Language interpreting5
Testing young foreign language learners’ reading comprehension: Exploring the effects of working memory, grade level, and reading task4
Register variation in spoken and written language use across technology-mediated and non-technology-mediated learning environments4
Review of the Japanese-Language Proficiency Test4
But who trains the language teacher educator who trains the language teacher? An empirical investigation of Chilean EFL teacher educators’ language assessment literacy4
Test-taker insights for language assessment policies and practices4
Developing a level-specific checklist for assessing EFL writing4
Linking scores from two written receptive English academic vocabulary tests—The VLT-Ac and the AVT3
Psychometric approaches to analyzing C-tests3
The relationship among accent familiarity, shared L1, and comprehensibility: A path analysis perspective3
A look into the practices and challenges of assessing young EFL learners’ writing in Croatia3
What the analytic versus holistic scoring of international teaching assistants can reveal: Lexical grammar matters3
Developing a local academic English listening test using authentic unscripted audio-visual texts3
The domain expert perspective: A qualitative study into the views expressed in a standard-setting exercise on a language for specific purposes (LSP) test for health professionals3
How do raters learn to rate? Many-facet Rasch modeling of rater performance over the course of a rater certification program3
Reflecting on assessing young foreign language learners3
Innovation and expansion in Language Testing for changing times3
Investigating the impact of self-pacing on the L2 listening performance of young learner candidates with differing L1 literacy skills3
A meta-analysis on the predictive validity of English language proficiency assessments for college admissions3
Local placement test retrofit and building language assessment literacy with teacher stakeholders: A case study from Colombia2
Modeling local item dependence in C-tests with the loglinear Rasch model2
Speaking performances, stakeholder perceptions, and test scores: Extrapolating from the Duolingo English test to the university2
A Bayesian approach to improving measurement precision over multiple test occasions2
Repeated test-taking and longitudinal test score analysis2
Change in home language environment and English literacy achievement over time: A multi-group latent growth curve modeling investigation2
Establishing meaning recall and meaning recognition vocabulary knowledge as distinct psychometric constructs in relation to reading proficiency2
Test design and validity evidence of interactive speaking assessment in the era of emerging technologies2
Measuring bilingual language dominance: An examination of the reliability of the Bilingual Language Profile2
An analysis of TOEFL® Primary™ repeaters: How much score change occurs?2
National assessment of foreign languages in Sweden: A multifaceted and collaborative venture2
Construct validity and fairness of an operational listening test with World Englishes2
Comparing holistic and analytic marking methods in assessing speech act production in L2 Chinese2
Test Review: The International English Language Testing System (IELTS)2
Understanding writing quality change: A longitudinal study of repeaters of a high-stakes standardized English proficiency test2
Responsibilities and opportunities in language testing with respect to historicized forms of socio-political discrimination: A matter of academic citizenship2
Assessing children’s incremental word knowledge in the upper primary grades2
Roles of working memory, syllogistic inferencing ability, and linguistic knowledge on second language listening comprehension for passages of different lengths2
Using instructor judgment, learner corpora, and DIF to develop a placement test for Spanish L2 and heritage learners2
Examining the predictive validity of the Duolingo English Test: Evidence from a major UK university2
Towards a new sophistication in vocabulary assessment2
Strategy use in a spoken dialog system–delivered paired discussion task: A stimulated recall study1
Future challenges and opportunities in language testing and assessment: Basic questions and principles at the forefront1
L2 and L1 semantic context indices as automated measures of lexical sophistication1
Book review: Assessing English for Professional Purposes1
An introduction to Language Testing’s first Virtual Special Issue: Investigating consequences of language test use1
Toward a systematic accessibility review process for English language proficiency tests for young learners1
Practical considerations when building concordances between English tests1
Examining the factor structure and its replicability across multiple listening test forms: Validity evidence for the Michigan English Test1
Operationalizing the reading-into-writing construct in analytic rating scales: Effects of different approaches on rating1
Reading is a multidimensional construct at child-L2-English-literacy onset, but comprises fewer dimensions over time: Evidence from multidimensional IRT analysis1
Accommodations in language testing and assessment: Safeguarding equity, access, and inclusion1
Advancing equity in language assessment for learners with disabilities1
Critical discursive approaches to evaluating policy-driven testing: Social impact as a target for validation1
Book review: Handbook of Diagnostic Classification Models: Models and Model Extensions, Applications, Software Packages1
Local English testing in China’s tertiary education: Contexts, policies, and practices1
Assessing speaking through multimodal oral presentations: The case of construct underrepresentation in EAP contexts1
English learners who are blind or visually impaired: A participatory design approach to enhancing fairness and validity for language testing accommodations1
Gauging the impact of literacy and educational background on receptive vocabulary test scores1
Aptis test review1
Development and initial validation of productive vocabulary tests for isiZulu, Siswati and English in South Africa1
Implementation of an accommodations policy for candidates with diverse needs in a large-scale testing system1
IRT-based classification analysis of an English language reading proficiency subtest1
Revisiting raters’ accent familiarity in speaking tests: Evidence that presentation mode interacts with accent familiarity to variably affect comprehensibility ratings1
Book Review: The sociology of assessment: Comparative and policy perspectives: The selected works of Patricia Broadfoot1
Who succeeds and who fails? Exploring the role of background variables in explaining the outcomes of L2 language tests1
Revisiting English language proficiency and its impact on the academic performance of domestic university students in Singapore1
Universal tools activation in English language proficiency assessments: A comparison of Grades 1–12 English learners with and without disabilities1
Fairness of using different English accents: The effect of shared L1s in listening tasks of the Duolingo English test1
Time to achieving a designated criterion score level: A survival analysis study of test taker performance on the TOEFL iBT® test1
Korean Syntactic Complexity Analyzer (KOSCA): An NLP application for the analysis of syntactic complexity in second language production1
0.017497062683105