ACM Transactions on Software Engineering and Methodology

Papers
(The TQCC of ACM Transactions on Software Engineering and Methodology is 9. The table below lists the papers that are above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-12-01 to 2025-12-01.)
Article | Citations
Finding Information Leaks with Information Flow Fuzzing—RCR Report | 391
Mutant Reduction Evaluation: What is There and What is Missing? | 202
Automatic Identification of Game Stuttering via Gameplay Videos Analysis | 143
Test Generation Strategies for Building Failure Models and Explaining Spurious Failures | 131
Better Supporting Human Aspects in Mobile eHealth Apps: Development and Validation of Enhanced Guidelines | 99
Deceiving Humans and Machines Alike: Search-based Test Input Generation for DNNs Using Variational Autoencoders | 94
Understanding the OSS Communities of Deep Learning Frameworks: A Comparative Case Study of PyTorch and Tensor | 93
I Depended on You and You Broke Me: An Empirical Study of Manifesting Breaking Changes in Client Packages | 92
Bounded Verification of Atomicity Violations for Interrupt-Driven Programs via Lazy Sequentialization | 88
M2CVD: Enhancing Vulnerability Understanding through Multi-Model Collaboration for Code Vulnerability Detection | 84
Reusing d-DNNFs for Efficient Feature-Model Counting | 84
SPENCER: Self-Adaptive Model Distillation for Efficient Code Retrieval | 80
History-Driven Fuzzing for Deep Learning Libraries | 80
Horus: Accelerating Kernel Fuzzing through Efficient Host-VM Memory Access Procedures | 72
TestLoop: A Process Model Describing Human-in-the-Loop Software Test Suite Generation | 72
A Survey on Failure Analysis and Fault Injection in AI Systems | 68
FairGenerate: Enhancing Fairness Through Synthetic Data Generation and Two-Fold Biased Labels Removal | 66
KAPE: kNN-based Performance Testing for Deep Code Search | 66
Enhancing Android Malware Detection: The Influence of ChatGPT on Decision-centric Task | 60
Preference-wise Testing of Android Apps via Test Amplification | 58
An empirical study on vulnerability disclosure management of open source software systems | 57
Assessing the Robustness of Test Selection Methods for Deep Neural Networks | 56
Neuron Semantic-Guided Test Generation for Deep Neural Networks Fuzzing | 55
An Empirical Analysis of Machine Learning Model and Dataset Documentation, Supply Chain, and Licensing Challenges on Hugging Face | 55
Securing the Ethereum from Smart Ponzi Schemes: Identification Using Static Features | 55
Communicating Study Design Trade-offs in Software Engineering | 53
FoC: Figure out the Cryptographic Functions in Stripped Binaries with LLMs | 52
A Comprehensive View on TD Prevention Practices and Reasons for Not Preventing It | 51
An Empirical Study of the Non-Determinism of ChatGPT in Code Generation | 51
An Empirical Study on Governance in Bitcoin’s Consensus Evolution | 48
Deep API Sequence Generation via Golden Solution Samples and API Seeds | 45
Why Do Developers Reject Refactorings in Open-Source Projects? | 44
JavaScript SBST Heuristics to Enable Effective Fuzzing of NodeJS Web APIs | 43
Storage State Analysis and Extraction of Ethereum Blockchain Smart Contracts | 43
FAVDisco: Modeling and Discovering File Access Vulnerabilities | 41
Towards Automating Domain-Specific Data Generation for Text-to-SQL: A Comprehensive Approach | 39
Adopting Two Supervisors for Efficient Use of Large-Scale Remote Deep Neural Networks - RCR Report | 38
Characterizing Deep Learning Package Supply Chains in PyPI: Domains, Clusters, and Disengagement | 38
Assessing and Analyzing the Correctness of GitHub Copilot’s Code Suggestions | 38
PVDetector: Pretrained Vulnerability Detection on Vulnerability-enriched Code Semantic Graph | 38
Toward Interpretable Graph Tensor Convolution Neural Network for Code Semantics Embedding | 37
Do Current Language Models Support Code Intelligence for R Programming Language? | 37
Supporting Emotional Intelligence, Productivity and Team Goals while Handling Software Requirements Changes | 37
FormatFuzzer: Effective Fuzzing of Binary File Formats | 37
Introducing Interactions in Multi-Objective Optimization of Software Architectures | 36
Help Them Understand: Testing and Improving Voice User Interfaces | 35
Fine-Tuning Large Language Models to Improve Accuracy and Comprehensibility of Automated Code Review | 33
An Accurate Identifier Renaming Prediction and Suggestion Approach | 33
Estimating Uncertainty in Labeled Changes by SZZ Tools on Just-In-Time Defect Prediction | 32
I Know What You Are Searching for: Code Snippet Recommendation from Stack Overflow Posts | 32
A Survey of Learning-based Automated Program Repair | 32
Single and Multi-objective Test Cases Prioritization for Self-driving Cars in Virtual Environments | 32
Systematic Literature Review on Software Security Vulnerability Information Extraction | 32
Try with Simpler - An Evaluation of Improved Principal Component Analysis in Log-based Anomaly Detection | 32
When Fine-Tuning LLMs Meets Data Privacy: An Empirical Study of Federated Learning in LLM-Based Program Repair | 31
Enhancing Security and Acuity of Smart Contract Vulnerability Detection based on Federated Learning and BiLSTM-Attention | 31
Towards On-The-Fly Code Performance Profiling | 31
HeMiRCA: Fine-Grained Root Cause Analysis for Microservices with Heterogeneous Data Sources | 31
On-the-fly Generation-Quality Enhancement of Deep Code Models via Model Collaboration | 30
Why Do GitHub Actions Workflows Fail? An Empirical Study | 30
Assessing the Early Bird Heuristic (for Predicting Project Quality) | 30
Vulnerability Repair via Concolic Execution and Code Mutations | 29
APIRO: A Framework for Automated Security Tools API Recommendation | 29
A Systematic Literature Review of Multi-Label Learning in Software Engineering | 28
AutoRIC: Automated Neural Network Repairing Based on Constrained Optimization | 28
Exploring Data-Efficient Adaptation of Large Language Models for Code Generation | 27
SimClone: Detecting Tabular Data Clones Using Value Similarity | 27
An Empirical Study on GitHub Pull Requests’ Reactions | 27
Editorial: Toward the Future with Eight Issues Per Year | 26
Reinforcement Learning Informed Evolutionary Search for Autonomous Systems Testing | 26
GIST: Generated Inputs Sets Transferability in Deep Learning | 26
Mapping the Trust Terrain: LLMs in Software Engineering - Insights and Perspectives | 25
PatchCensor: Patch Robustness Certification for Transformers via Exhaustive Testing | 25
Editorial: ICSE and the Incredible Contradictions of Software Engineering | 25
Assessing and Improving an Evaluation Dataset for Detecting Semantic Code Clones via Deep Learning | 24
An Empirical Study of Retrieval-Augmented Code Generation: Challenges and Opportunities | 24
Contemporary Software Modernization: Strategies, Driving Forces, and Research Opportunities | 24
SimADFuzz: Simulation-Feedback Fuzz Testing for Autonomous Driving Systems | 24
A Survey of Learning-based Method Name Prediction | 24
Simulator-based Explanation and Debugging of Hazard-triggering Events in DNN-based Safety-critical Systems | 23
Industry–Academia Research Collaboration and Knowledge Co-creation: Patterns and Anti-patterns | 23
Revisiting the Identification of the Co-evolution of Production and Test Code | 23
Leveraging Reviewer Experience in Code Review Comment Generation | 23
Towards Learning Generalizable Code Embeddings Using Task-agnostic Graph Convolutional Networks | 23
SCOPE: Performance Testing for Serverless Computing | 23
Security of Language Models for Code: A Systematic Literature Review | 22
Efficient Multivariate Time Series Anomaly Detection through Transfer Learning for Large-Scale Software Systems | 22
Commit Messages Generation Based on Core Changes | 22
SourcererJBF: A Java Build Framework For Large-Scale Compilation | 22
Characterizing Installation- and Run-Time Compatibility Issues in Android Benign Apps and Malware | 22
Graphuzz: Data-driven Seed Scheduling for Coverage-guided Greybox Fuzzing | 22
Learning from Very Little Data: On the Value of Landscape Analysis for Predicting Software Project Health | 22
Monitoring data for Anomaly Detection in Cloud-Based Systems: A Systematic Mapping Study | 21
Actor-Driven Decomposition of Microservices through Multi-level Scalability Assessment | 21
You Don’t Have to Say Where to Edit! jLED—Joint Learning to Localize and Edit Source Code | 21
Beyond Fidelity: Explaining Vulnerability Localization of Learning-Based Detectors | 21
Test Input Prioritization for 3D Point Clouds | 20
Cleaning Up Confounding: Accounting for Endogeneity Using Instrumental Variables and Two-Stage Models | 20
Exploring Fine-Grained Bug Report Categorization with Large Language Models and Prompt Engineering: An Empirical Study | 20
A Characterization Study of Merge Conflicts in Java Projects | 19
Demystifying Hidden Sensitive Operations in Android Apps | 19
Exploring the Capabilities of LLMs for Code-Change-Related Tasks | 19
Battling against Protocol Fuzzing: Protecting Networked Embedded Devices from Dynamic Fuzzers | 19
Automatic Rule Checking for Microservices: Supporting Security Analysis with Explainability | 19
MR-Scout: Automated Synthesis of Metamorphic Relations from Existing Test Cases | 19
Fold2Vec: Towards a Statement-Based Representation of Code for Code Comprehension | 19
Fairness Concerns in App Reviews: A Study on AI-Based Mobile Apps | 19
Demo2Test: Transfer Testing of Agent in Competitive Environment with Failure Demonstrations | 19
A Comprehensive Empirical Study of Bias Mitigation Methods for Machine Learning Classifiers | 19
Evolution-Aware Constraint Derivation Approach for Software Remodularization | 19
Developer Perspectives on Licensing and Copyright Issues Arising from Generative AI for Software Development | 19
Variable Renaming-Based Adversarial Test Generation for Code Model: Benchmark and Enhancement | 18
Automating TODO-missed Methods Detection and Patching | 18
Time-travel Investigation: Toward Building a Scalable Attack Detection Framework on Ethereum | 18
MeDeT: Medical Device Digital Twins Creation with Few-shot Meta-learning | 18
Certified Cost Bounds for Abstract Programs | 18
Improving Deep Assertion Generation via Fine-Tuning Retrieval-Augmented Pre-trained Language Models | 18
Adopting Two Supervisors for Efficient Use of Large-Scale Remote Deep Neural Networks | 18
SPOLRE: Semantic Preserving Object Layout Reconstruction for Image Captioning System Testing | 18
Autonomous Driving System Testing via Diversity-Oriented Driving Scenario Exploration | 17
Bypassing Guardrails: Lessons Learned from Red Teaming ChatGPT | 17
Programming Smart Playtesting | 17
All in One: Design, Verification, and Implementation of SNOW-optimal Read Atomic Transactions | 17
Coverage-directed Differential Testing of X.509 Certificate Validation in SSL/TLS Implementations | 17
PonziHunter: Hunting Ethereum Ponzi Contract via Static Analysis and Contrastive Learning on the Bytecode Level | 17
Enhancing Task In-Progress Time Predictions through Affective and Personality Factors | 17
Stress Testing Control Loops in Cyber-Physical Systems—RCR Report | 17
Testing Causality in Scientific Modelling Software | 17
Interpreting Deep Neural Networks via Relative Activation-Deactivation Abstractions | 17
An In-depth Study of Java Deserialization Remote-Code Execution Exploits and Vulnerabilities | 16
Is It Hard to Generate Holistic Commit Message? | 16
Adaptive Modelling Languages: Abstract Syntax and Model Migration | 16
Efficient Management of Containers for Software Defined Vehicles | 16
A Roadmap for Integrating Sustainability into Software Engineering Education | 16
Measuring and Clustering Heterogeneous Chatbot Designs | 15
An Interleaving Guided Metamorphic Testing Approach for Concurrent Programs | 15
Survey of Code Search Based on Deep Learning | 15
AI for DevSecOps: A Landscape and Future Opportunities | 15
Duplicate Bug Report Detection: How Far Are We? | 15
Differentiable Quantum Programming with Unbounded Loops | 15
Reputation Gaming in Crowd Technical Knowledge Sharing | 15
AdaptiveLog: An Adaptive Log Analysis Framework with the Collaboration of Large and Small Language Model | 15
Refactoring in Computational Notebooks | 15
Generation-based Differential Fuzzing for Deep Learning Libraries | 15
OSS Effort Estimation Using Software Features Similarity and Developer Activity-Based Metrics | 15
Automatic Core-Developer Identification on GitHub: A Validation Study | 15
Can GitHub Issues Help in App Review Classifications? | 15
Exploring Development Methods for Reactive Synthesis Specifications | 14
Inferring Input Grammars from Code with Symbolic Parsing | 14
Input Distribution Coverage: Measuring Feature Interaction Adequacy in Neural Network Testing | 14
Preparation and Utilization of Mixed States for Testing Quantum Programs | 14
The Influence of Human Aspects on Requirements Engineering-related Activities: Software Practitioners’ Perspective | 14
On the Impact of Lower Recall and Precision in Defect Prediction for Guiding Search-based Software Testing | 14
MORepair: Teaching LLMs to Repair Code via Multi-Objective Fine-Tuning | 14
DiPri: Distance-Based Seed Prioritization for Greybox Fuzzing | 14
LogUpdater: Automated Detection and Repair of Specific Defects in Logging Statements | 14
Studying the Impact of TensorFlow and PyTorch Bindings on Machine Learning Software Quality | 14
Visualization Task Taxonomy to Understand the Fuzzing Internals | 14
On the Significance of Category Prediction for Code-Comment Synchronization | 13
Can Coverage Criteria Guide Failure Discovery for Image Classifiers? An Empirical Study | 13
Obfuscated Clone Search in JavaScript based on Reinforcement Subsequence Learning | 13
CITYWALK: Enhancing LLM-Based C++ Unit Test Generation via Project-Dependency Awareness and Language-Specific Knowledge | 13
Mitigating Regression Faults Induced by Feature Evolution in Deep Learning Systems | 13
Rise of Distributed Deep Learning Training in the Big Model Era: From a Software Engineering Perspective | 13
Towards Robustness of Deep Program Processing Models—Detection, Estimation, and Enhancement | 13
A Comparative Study on Method Comment and Inline Comment | 13
Let’s Discover More API Relations: A Large Language Model-Based AI Chain for Unsupervised API Relation Inference | 13
A Hypothesis Testing-based Framework for Software Cross-modal Retrieval in Heterogeneous Semantic Spaces | 13
Data Complexity: A New Perspective for Analyzing the Difficulty of Defect Prediction Tasks | 12
Identifying and Explaining Safety-critical Scenarios for Autonomous Vehicles via Key Features | 12
NSFuzz: Towards Efficient and State-Aware Network Service Fuzzing | 12
Guided Feature Identification and Removal for Resource-constrained Firmware | 12
PanicFI: An Infrastructure for Fixing Panic Bugs in Real-World Rust Programs | 12
Testing RESTful APIs: A Survey | 12
How Do Successful and Failed Projects Differ? A Socio-Technical Analysis | 12
Towards Practical Binary Code Similarity Detection: Vulnerability Verification via Patch Semantic Analysis | 12
Large Language Models for Cyber Security: A Systematic Literature Review | 12
Fast, Fine-Grained Equivalence Checking for Neural Decompilers | 12
Sustainability of Machine Learning-Enabled Systems: The Machine Learning Practitioner’s Perspective | 12
Understanding Real-Time Collaborative Programming: A Study of Visual Studio Live Share | 12
Software Engineering by and for Humans in an AI Era | 12
The IDEA of Us: An Identity-Aware Architecture for Autonomous Systems | 12
A Systematic Literature Review on the Use of Deep Learning in Software Engineering Research | 12
SemMT: A Semantic-Based Testing Approach for Machine Translation Systems | 12
Grammar Mutation for Testing Input Parsers | 11
BiRD: Race Detection in Software Binaries under Relaxed Memory Models | 11
Fairness Testing of Machine Translation Systems | 11
Decision Support Model for Selecting the Optimal Blockchain Oracle Platform: An Evaluation of Key Factors | 11
Revealing the Unseen: AI Chain on LLMs for Predicting Implicit Dataflows to Generate Dataflow Graphs in Dynamically Typed Code | 11
What Constitutes the Deployment and Runtime Configuration System? An Empirical Study on OpenStack Projects | 11
Exploring JVM Garbage Collector Testing with Event-Coverage | 11
Some Seeds Are Strong: Seeding Strategies for Search-based Test Case Selection | 11
Simulating Software Evolution to Evaluate the Reliability of Early Decision-making among Design Alternatives toward Maintainability | 11
Test Oracle Generation for REST APIs | 11
Automated Abstract Transformer Synthesis for Reduced Product Domains | 10
Mobile Application Online Cross-Project Just-in-Time Software Defect Prediction Framework | 10
The Havoc Paradox in Generator-Based Fuzzing — RCR Report | 10
Toward Better Comprehension of Breaking Changes in the NPM Ecosystem | 10
Large Language Model for Vulnerability Detection and Repair: Literature Review and the Road Ahead | 10
How the Quality of Maintenance Tasks is Affected by Criteria for Selecting Engineers for Collaboration | 10
Recommending Variable Names for Extract Local Variable Refactorings | 10
Identifying Performance Issues in Cloud Service Systems Based on Relational-Temporal Features | 10
Divide-and-Conquer: Automating Code Revisions via Localization-and-Revision | 10
An Empirical Study on the Relationship between Defects and Source Code’s Unnaturalness | 10
Analysis of EMF meta-model duplication in open-source repositories | 10
Representation Learning for Stack Overflow Posts: How Far Are We? | 10
CCIHunter: Enhancing Smart Contract Code-Comment Inconsistencies Detection via Two-Stage Pre-training | 10
Editorial: The End of the Journey | 10
What You See is What it Means! Semantic Representation Learning of Code based on Visualization and Transfer Learning | 10
Verification Witnesses | 10
Revisiting Sentiment Analysis for Software Engineering in the Era of Large Language Models | 10
Addressing OSS Community Managers’ Challenges in Contributor Retention | 10
Open Problems in Fuzzing RESTful APIs: A Comparison of Tools | 10
Less Is More: Unlocking Semi-Supervised Deep Learning for Vulnerability Detection | 10
Understanding Vulnerability Inducing Commits of the Linux Kernel | 10
Benchmarking and Categorizing the Performance of Neural Program Repair Systems for Java | 10
Large Language Model-Aware In-Context Learning for Code Generation | 9
From Triumph to Uncertainty: The Journey of Software Engineering in the AI Era | 9
Model Driven Engineering, Artificial Intelligence, and DevOps for Software and Systems Engineering: A Systematic Mapping Study of Synergies and Challenges | 9
Prompt-based Code Completion via Multi-Retrieval Augmented Generation | 9
DRIVE: Dockerfile Rule Mining and Violation Detection | 9
Making Software Development More Diverse and Inclusive: Key Themes, Challenges, and Future Directions | 9
A Review of Learning-based Smart Contract Vulnerability Detection: A Perspective on Code Representation | 9
FQN Inference in Partial Code by Prompt-tuned Language Model of Code | 9
My Fuzzers Won’t Build: An Empirical Study of Fuzzing Build Failures | 9
Digital Twin-based Anomaly Detection with Curriculum Learning in Cyber-physical Systems | 9
Influential Global and Local Contexts Guided Trace Representation for Fault Localization | 9
Finding Information Leaks with Information Flow Fuzzing | 9
Learning Software Bug Reports: A Systematic Literature Review | 9
Automated Identification of Toxic Code Reviews Using ToxiCR | 9
Just-in-Time Detection of Silent Security Patches | 9
Software Security Analysis in 2030 and Beyond: A Research Roadmap | 9
Enumerating Valid Non-Alpha-Equivalent Programs for Interpreter Testing | 9
AceCoder: An Effective Prompting Technique Specialized in Code Generation | 9
Leveraging Symmetry in GR(1) Synthesis | 9
Identifying Affected Third-Party Java Libraries from Textual Descriptions of Vulnerabilities and Libraries | 9
Sustainability in the Field of Software Engineering: A Tertiary Study | 9
VexIR2Vec: An Architecture-Neutral Embedding Framework for Binary Similarity | 9
AcTracer: Active Testing of Large Language Model via Multi-Stage Sampling | 9
Microservice Security Metrics for Secure Communication, Identity Management, and Observability | 9
Learning-based Relaxation of Completeness Requirements for Data Entry Forms | 9
Improving Code Reviewer Recommendation: Accuracy, Latency, Workload, and Bystanders | 9
Rise of the Planet of Serverless Computing: A Systematic Review | 9