IEEE Transactions on Software Engineering

Papers
(The TQCC of IEEE Transactions on Software Engineering is 15. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-12-01 to 2025-12-01.)
ArticleCitations
50 Years of Transactions on Software Engineering495
Confirmation Bias and Time Pressure: A Family of Experiments in Software Testing468
To Do or Not to Do: Semantics and Patterns for Do Activities in UML PSSM State Machines235
Combining Genetic Programming and Model Checking to Generate Environment Assumptions176
Towards Scalable Model Checking of Reflective Systems via Labeled Transition Systems155
Efficiently Testing Distributed Systems via Abstract State Space Prioritization143
Enhancing Mobile App Bug Reporting via Real-Time Understanding of Reproduction Steps120
Multi-Granularity Detector for Vulnerability Fixes115
Can We Trust the Phone Vendors? Comprehensive Security Measurements on the Android Firmware Ecosystem112
Just-in-Time Prediction of Software Architectural Changes Through Commit-Level Analyses108
The Why, When, What, and How About Predictive Continuous Integration: A Simulation-Based Investigation105
A Retrospective on Whole Test Suite Generation: On the Role of SBST in the Age of LLMs101
Recommending API Function Calls and Code Snippets to Support Software Development101
Enhancing Project-Specific Code Completion by Inferring Internal API Information100
Automatic Fairness Testing of Neural Classifiers Through Adversarial Sampling97
Computation Tree Logic Guided Program Repair96
Review Dynamics and Their Impact on Software Quality90
Enhancing Protocol Fuzzing via Diverse Seed Corpus Generation87
Question Selection for Multimodal Code Search Synthesis Using Probabilistic Version Spaces84
Do as You Say: Consistency Detection of Data Practice in Program Code and Privacy Policy in Mini-App82
What Leads to a Confirmatory or Disconfirmatory Behavior of Software Testers?81
Influence of the 1990 IEEE TSE Paper “Automated Software Test Data Generation” on Software Engineering77
Are Your Dependencies Code Reviewed?: Measuring Code Review Coverage in Dependency Updates76
Mission Specification Patterns for Mobile Robots: Providing Support for Quantitative Properties69
Answering Uncertain, Under-Specified API Queries Assisted by Knowledge-Aware Human-AI Dialogue69
Socio-Technical Grounded Theory for Software Engineering67
DSSDPP: Data Selection and Sampling Based Domain Programming Predictor for Cross-Project Defect Prediction66
Advanced Smart Contract Vulnerability Detection via LLM-Powered Multi-Agent Systems64
A Declarative Metamorphic Testing Framework for Autonomous Driving64
Theoretical and Empirical Analyses of the Effectiveness of Metamorphic Relation Composition63
2023 Reviewers List63
Prevent: An Unsupervised Approach to Predict Software Failures in Production63
Esale: Enhancing Code-Summary Alignment Learning for Source Code Summarization63
Multimodal Fusion for Android Malware Detection Based on Large Pre-Trained Models62
Mask–Mediator–Wrapper Architecture as a Data Mesh Driver61
T-Evos: A Large-Scale Longitudinal Study on CI Test Execution and Failure61
Mole: Efficient Crash Reproduction in Android Applications With Enforcing Necessary UI Events61
A Theory of Pending Schemas in Combinatorial Testing60
Detecting Malicious Packages in PyPI and npm by Clustering Installation Scripts59
Measuring the Fidelity of a Physical and a Digital Twin Using Trace Alignments58
Neural Library Recommendation by Embedding Project-Library Knowledge Graph57
The Impact of Surface Features on Choice of (in)Secure Answers by Stackoverflow Readers56
Towards a Cognitive Model of Dynamic Debugging: Does Identifier Construction Matter?56
A Wizard of Oz Study Simulating API Usage Dialogues With a Virtual Assistant56
Automated Code Editing With Search-Generate-Modify54
Trace Diagnostics for Signal-Based Temporal Properties54
Systematic Evaluation and Usability Analysis of Formal Methods Tools for Railway Signaling System Design53
PerfJIT: Test-Level Just-in-Time Prediction for Performance Regression Introducing Commits53
GenMorph: Automatically Generating Metamorphic Relations via Genetic Programming52
Efficient State Identification for Finite State Machine-Based Testing51
Mutation Testing in Practice: Insights From Open-Source Software Developers51
An Empirical Study of Software Refactorings in Real-World Open-Source Java Projects51
Detecting Software Security Vulnerabilities Via Requirements Dependency Analysis49
An Empirical Study of Refactoring Rhythms and Tactics in the Software Development Process49
Mitigating False Positive Static Analysis Warnings: Progress, Challenges, and Opportunities48
Multi-Objective Software Defect Prediction via Multi-Source Uncertain Information Fusion and Multi-Task Multi-View Learning48
MASTER: Multi-Source Transfer Weighted Ensemble Learning for Multiple Sources Cross-Project Defect Prediction48
A Systematic Review of IoT Systems Testing: Objectives, Approaches, Tools, and Challenges48
Evolutionary generation of test suites for multi-path coverage of MPI programs with non-determinism47
Decision Support for Selecting Blockchain-Based Application Design Patterns With Layered Taxonomy and Quality Attributes47
Robust Test Selection for Deep Neural Networks47
Leveraging Large Language Model for Automatic Patch Correctness Assessment46
Annotative Software Product Line Analysis Using Variability-Aware Datalog45
MBL-CPDP: A Multi-Objective Bilevel Method for Cross-Project Defect Prediction45
Discovering Reusable Functional Features in Legacy Object-Oriented Systems44
Legion: Massively Composing Rankers for Improved Bug Localization at Adobe44
A Faceted Taxonomy of Requirements Changes in Agile Contexts43
Program Synthesis for Cyber-Resilience43
Evaluating and Improving GPT-Based Expansion of Abbreviations42
How Should Software Engineering Secondary Studies Include Grey Material?42
Context-Aware Personalized Crowdtesting Task Recommendation41
LLMorpheus: Mutation Testing Using Large Language Models41
Generalized Coverage Criteria for Combinatorial Sequence Testing40
TransformCode: A Contrastive Learning Framework for Code Embedding via Subtree Transformation39
Human-in-the-Loop Automatic Program Repair39
An Empirical Study of Parameter-Efficient Fine-Tuning in Code Change Learning and Beyond38
An Empirical Study of C++ Vulnerabilities in Crowd-Sourced Code Examples38
An Experience Report on Producing Verifiable Builds for Large-Scale Commercial Systems38
CODIT: Code Editing With Tree-Based Neural Models38
How Developers Choose Names37
Pull Request Decisions Explained: An Empirical Overview37
Can Clean New Code Reduce Technical Debt Density?37
Triple Peak Day: Work Rhythms of Software Developers in Hybrid Work37
API2Vec++: Boosting API Sequence Representation for Malware Detection and Classification37
On the Understandability of MLOps System Architectures37
Pathidea: Improving Information Retrieval-Based Bug Localization by Re-Constructing Execution Paths Using Logs37
Formal Equivalence Checking for Mobile Malware Detection and Family Classification36
Evaluating and Improving Unified Debugging36
EpiTESTER: Testing Autonomous Vehicles With Epigenetic Algorithm and Attention Mechanism36
Microservice Extraction Based on a Comprehensive Evaluation of Logical Independence and Performance35
A Study of Call Graph Construction for JVM-Hosted Languages35
Experimental Evaluation of Test-Driven Development With Interns Working on a Real Industrial Project35
Specializing Neural Networks for Cryptographic Code Completion Applications35
Retrieval-Augmented Fine-Tuning for Improving Retrieve-and-Edit Based Assertion Generation34
CloudRaid: Detecting Distributed Concurrency Bugs via Log Mining and Enhancement34
Automated Refactoring of Non-Idiomatic Python Code With Pythonic Idioms34
Studying Ad Library Integration Strategies of Top Free-to-Download Apps34
Just-In-Time Obsolete Comment Detection and Update33
What Drives and Sustains Self-Assignment in Agile Teams33
Practitioners’ Expectations on Log Anomaly Detection33
Automated Commit Message Generation With Large Language Models: An Empirical Study and Beyond33
DiffGAN: A Test Generation Approach for Differential Testing of Deep Neural Networks for Image Analysis33
Increasing the Confidence of Deep Neural Networks by Coverage Analysis32
Revisiting Test Impact Analysis in Continuous Testing From the Perspective of Code Dependencies32
How Do Developers Structure Unit Test Cases? An Empirical Analysis of the AAA Pattern in Open Source Projects32
SigRec: Automatic Recovery of Function Signatures in Smart Contracts32
Test Flakiness Across Programming Languages32
From Tea Leaves to System Maps: A Survey and Framework on Context-aware Machine Learning Monitoring32
A Study About the Knowledge and Use of Requirements Engineering Standards in Industry32
Quantitative Verification for Monitoring Event-Streaming Systems32
“Estimating Software Project Effort Using Analogies”: Reflections After 28 Years31
Empirical Validation of Automated Vulnerability Curation and Characterization31
Evaluation of Static Vulnerability Detection Tools With Java Cryptographic API Benchmarks31
Data Quality Matters: A Case Study on Data Label Correctness for Security Bug Report Prediction31
Detecting Continuous Integration Skip Commits Using Multi-Objective Evolutionary Search31
Exploring and Analyzing Software Architecture Refactoring in Practice30
Watch Out for Extrinsic Bugs! A Case Study of Their Impact in Just-In-Time Bug Prediction Models on the OpenStack Project30
Mind the Gap! A Study on the Transferability of Virtual Versus Physical-World Testing of Autonomous Driving Systems30
DAppSCAN: Building Large-Scale Datasets for Smart Contract Weaknesses in DApp Projects30
From Executable Specifications to Hard-to-Specify Requirements: Challenges in Describing Reactive System Behavior29
Nighthawk: Fully Automated Localizing UI Display Issues via Visual Understanding29
The Analysis of Safety Critical Software Systems29
Continuously Managing NFRs: Opportunities and Challenges in Practice29
Effect of Requirements Analyst Experience on Elicitation Effectiveness: A Family of Quasi-Experiments29
Why My App Crashes? Understanding and Benchmarking Framework-Specific Exceptions of Android Apps28
A Qualitative Study of the Benefits and Costs of Logging From Developers’ Perspectives28
CRPWarner: Warning the Risk of Contract-Related Rug Pull in DeFi Smart Contracts28
NumScout: Unveiling Numerical Defects in Smart Contracts Using LLM-Pruning Symbolic Execution28
Line-Level Defect Prediction by Capturing Code Contexts With Graph Convolutional Networks28
Understanding the Robustness of Transformer-Based Code Intelligence via Code Transformation: Challenges and Opportunities28
The Impact of Prompt Programming on Function-Level Code Generation28
Scrutinizing Implementations of Smart Home Integrations27
A Systematic Study on Real-World Android App Bundles27
ASTRAEA: Grammar-based Fairness Testing27
Assessing Evaluation Metrics for Neural Test Oracle Generation27
STRE: An Automated Approach to Suggesting App Developers When to Stop Reading Reviews27
On the Validity of Pre-Trained Transformers for Natural Language Processing in the Software Engineering Domain27
Provably Valid and Diverse Mutations of Real-World Media Data for DNN Testing27
Cross-Language Taint Analysis: Generating Caller-Sensitive Native Code Specification for Java27
Efficient Summary Reuse for Software Regression Verification27
Are You Still Working on This? An Empirical Study on Pull Request Abandonment26
Improving Cross-Language Code Clone Detection via Code Representation Learning and Graph Neural Networks26
FlakyFix: Using Large Language Models for Predicting Flaky Test Fix Categories and Test Code Repair26
Optimization of Software Release Planning Considering Architectural Dependencies, Cost, and Value26
Towards Exploring Developers’ Struggles in Developing Upgradeable Smart Contracts26
Deconstructing the Nature of Collaboration in Organizations Open Source Software Development: The Impact of Developer and Task Characteristics26
Predicting Defective Lines Using a Model-Agnostic Technique26
A Grounded Theory of Cross-Community SECOs: Feedback Diversity Versus Synchronization25
The Effectiveness of Supervised Machine Learning Algorithms in Predicting Software Refactoring25
Diversity-Oriented Testing for Competitive Game Agent via Constraint-Guided Adversarial Agent Training25
Predictive Comment Updating With Heuristics and AST-Path-Based Neural Learning: A Two-Phase Approach25
iTCRL: Causal-Intervention-Based Trace Contrastive Representation Learning for Microservice Systems25
Automated Infrastructure as Code Program Testing25
An Empirical Study of Model-Agnostic Techniques for Defect Prediction Models25
ArchHypo: Managing Software Architecture Uncertainty Using Hypotheses Engineering25
Sentinel: A Hyper-Heuristic for the Generation of Mutant Reduction Strategies24
Let’s Talk With Developers, Not About Developers: A Review of Automatic Program Repair Research24
Parameterized Verification of Leader/Follower Systems via Arithmetic Constraints24
Hashing Fuzzing: Introducing Input Diversity to Improve Crash Detection24
A Variability Fault Localization Approach for Software Product Lines24
Beyond the Sum of Parts: Leveraging Entanglement for Bug Inducing Commit Localization23
A Survey on the Use of Computer Vision to Improve Software Engineering Tasks23
How Templated Requirements Specifications Inhibit Creativity in Software Engineering23
Mithra: Anomaly Detection as an Oracle for Cyberphysical Systems23
Misactivation-Aware Stealthy Backdoor Attacks on Neural Code Understanding Models23
Stakeholder Preference Extraction From Scenarios23
Explaining Static Analysis With Rule Graphs23
Automated Use-After-Free Detection and Exploit Mitigation: How Far Have We Gone?23
Causes and Canonicalization of Unreproducible Builds in Java22
The Power of Small LLMs: A Multi-Agent for Code Generation via Dynamic Precaution Tuning22
Domain-Driven Design for Microservices: An Evidence-Based Investigation22
FCGHUNTER: Towards Evaluating Robustness of Graph-Based Android Malware Detection22
Beyond Literal Meaning: Uncover and Explain Implicit Knowledge in Code Through Wikipedia-Based Concept Linking22
Unearthing Gas-Wasting Code Smells in Smart Contracts With Large Language Models22
Do Pretrained Language Models Indeed Understand Software Engineering Tasks?21
SmartOracle: Generating Smart Contract Oracle via Fine-Grained Invariant Detection21
Forecasting the Principal of Code Technical Debt in JavaScript Applications21
Retrospective on: Constraint-Based Automatic Test Data Generation21
A Comparison of Natural Language Understanding Platforms for Chatbots in Software Engineering21
Range Specification Bug Detection in Flight Control System Through Fuzzing21
PopArt: Ranked Testing Efficiency21
Boosting Generalizable Fairness With Mahalanobis Distances Guided Boltzmann Exploratory Testing21
Syntactic Versus Semantic Similarity of Artificial and Real Faults in Mutation Testing Studies21
Practical Mutation Testing at Scale: A view from Google21
RefactoringMiner 2.021
Towards More Precise Coincidental Correctness Detection With Deep Semantic Learning21
Verification of Fuzzy Decision Trees20
Translating to a Low-Resource Language with Compiler Feedback: A Case Study on Cangjie20
Learning to Predict User-Defined Types20
PATEN: Identifying Unpatched Third-Party APIs via Fine-Grained Patch-Enhanced AST-Level Signature20
Restore: Retrospective Fault Localization Enhancing Automated Program Repair20
Does Treatment Adherence Impact Experiment Results in TDD?20
Emerging App Issue Identification via Online Joint Sentiment-Topic Tracing19
SCAnoGenerator: Automatic Anomaly Injection for Ethereum Smart Contracts19
Reuse of Similarly Behaving Software Through Polymorphism-Inspired Variability Mechanisms19
Stealthy Backdoor Attack for Code Models19
Runtime Evolution of Bitcoin's Consensus Rules19
TkT: Automatic Inference of Timed and Extended Pushdown Automata19
Do Chase Your Tail! Missing Key Aspects Augmentation in Textual Vulnerability Descriptions of Long-Tail Software Through Feature Inference19
Engineering Within Boundaries When Software Has None19
A Little Help Goes a Long Way: Tutoring LLMs in Solving Competitive Programming through Hints19
Generating Structurally Realistic Models With Deep Autoregressive Networks19
Studying the Influence and Distribution of the Human Effort in a Hybrid Fitness Function for Search-Based Model-Driven Engineering19
Dealing With Data Challenges When Delivering Data-Intensive Software Solutions19
Concretization of Abstract Traffic Scene Specifications Using Metaheuristic Search19
The Human Side of Software Engineering Teams: An Investigation of Contemporary Challenges18
Clopper-Pearson Algorithms for Efficient Statistical Model Checking Estimation18
The “Question Neighbourhood” Approach for Systematic Evaluation of Code-Generating LLMs18
Active Code Learning: Benchmarking Sample-Efficient Training of Code Models18
Multitask-Based Evaluation of Open-Source LLM on Software Vulnerability18
Evaluating SZZ Implementations: An Empirical Study on the Linux Kernel17
An Assessment of Rules of Thumb for Software Phase Management, and the Relationship Between Phase Effort and Schedule Success17
Utilizing Automatic Query Reformulations as Genetic Operations to Improve Feature Location in Software Models17
Examiner-Pro: Testing Arm Emulators Across Different Privileges17
Static Profiling of Alloy Models17
DaNuoYi: Evolutionary Multitask Injection Testing on Web Application Firewalls17
Accelerating Finite State Machine-Based Testing Using Reinforcement Learning17
Software Testing With Large Language Models: Survey, Landscape, and Vision17
Fast and Precise Static Null Exception Analysis With Synergistic Preprocessing17
On the Workflows and Smells of Leaderboard Operations (LBOps): An Exploratory Study of Foundation Model Leaderboards17
Finding Trends in Software Research17
Malo in the Code Jungle: Explainable Fault Localization for Decentralized Applications17
Defining Smart Contract Defects on Ethereum17
A Retrospective of Proving the Correctness of Multiprocess Programs17
Comparing Block-Based Programming Models for Two-Armed Robots17
Enforcing Correctness of Collaborative Business Processes Using Plans17
Isolating Compiler Faults Through Differentiated Compilation Configurations17
Darcy: Automatic Architectural Inconsistency Resolution in Java17
A Search-Based Testing Approach for Deep Reinforcement Learning Agents16
How Toxic Can You Get? Search-Based Toxicity Testing for Large Language Models16
MultiPL-E: A Scalable and Polyglot Approach to Benchmarking Neural Code Generation16
Active Learning of Discriminative Subgraph Patterns for API Misuse Detection16
AddressWatcher: Sanitizer-Based Localization of Memory Leak Fixes16
A Framework for Emotion-Oriented Requirements Change Handling in Agile Software Engineering16
Factors Affecting On-Time Delivery in Large-Scale Agile Software Development16
Heuristic and Neural Network Based Prediction of Project-Specific API Member Access16
A Procedure to Continuously Evaluate Predictive Performance of Just-In-Time Software Defect Prediction Models During Software Development16
How Software Developers Mitigate Their Errors When Developing Code16
A Theory of Value for Value-Based Feature Selection in Software Engineering16
VERCATION: Precise Vulnerable Open-source Software Version Identification based on Static Analysis and LLM15
Safety and Performance, Why Not Both? Bi-Objective Optimized Model Compression Against Heterogeneous Attacks Toward AI Software Deployment15
Pride: Prioritizing Documentation Effort Based on a PageRank-Like Algorithm and Simple Filtering Rules15
FairMask: Better Fairness via Model-Based Rebalancing of Protected Attributes15
Diversified Third-Party Library Prediction for Mobile App Development15
A Systematical Study on Application Performance Management Libraries for Apps15
IoTCom: Dissecting Interaction Threats in IoT Systems15
RNN-Test: Towards Adversarial Testing for Recurrent Neural Network Systems15
DT4LM: Differential Testing for Reliable Language Model Updates in Classification Tasks15
Inferring Bug Signatures to Detect Real Bugs15
Software Architecture Description Revisited15
0.099485158920288