IEEE Transactions on Software Engineering

Papers
(The median citation count of IEEE Transactions on Software Engineering is 5. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-11-01 to 2024-11-01.)
ArticleCitations
Machine Learning Testing: Survey, Landscapes and Horizons374
Smart Contract Development: Challenges and Opportunities349
The Art, Science, and Engineering of Fuzzing: A Survey219
Deep Learning Based Vulnerability Detection: Are We There Yet?207
Fault Analysis and Debugging of Microservice Systems: Industrial Survey, Benchmark System, and Empirical Study183
Deep Semantic Feature Learning for Software Defect Prediction178
The Impact of Class Rebalancing Techniques on the Performance and Interpretation of Defect Prediction Models177
An Empirical Study of Fault Localization Families and Their Combinations135
SEQUENCER: Sequence-to-Sequence Learning for End-to-End Program Repair129
Defining Smart Contract Defects on Ethereum100
Perceptions, Expectations, and Challenges in Defect Prediction95
RefactoringMiner 2.087
Automatic Feature Learning for Predicting Vulnerable Software Components84
Data Quality Matters: A Case Study on Data Label Correctness for Security Bug Report Prediction75
An Empirical Study of Model-Agnostic Techniques for Defect Prediction Models75
Checking Smart Contracts With Structural Code Embedding73
Comparing Methods for Large-Scale Agile Software Development: A Systematic Literature Review71
Service Candidate Identification from Monolithic Systems Based on Execution Traces71
Mining Fix Patterns for FindBugs Violations66
Cognitive Biases in Software Engineering: A Systematic Mapping Study64
Socio-Technical Grounded Theory for Software Engineering63
A Survey on Adaptive Random Testing62
DefectChecker: Automated Smart Contract Defect Detection by Analyzing EVM Bytecode59
The Impact of Mislabeled Changes by SZZ on Just-in-Time Defect Prediction58
Predicting Defective Lines Using a Model-Agnostic Technique58
Beyond Technical Aspects: How Do Community Smells Influence the Intensity of Code Smells?57
Finding Critical Scenarios for Automated Driving Systems: A Systematic Mapping Study53
Reinforcement-Learning-Guided Source Code Summarization Using Hierarchical Attention53
Toxic Code Snippets on Stack Overflow53
The Impact of Feature Importance Methods on the Interpretation of Defect Classifiers53
Automatically Assessing Code Understandability49
Revisiting Supervised and Unsupervised Methods for Effort-Aware Cross-Project Defect Prediction48
Deep Learning Based Code Smell Detection48
A Comparison of Natural Language Understanding Platforms for Chatbots in Software Engineering48
The Impact of Correlated Metrics on the Interpretation of Defect Models48
An Empirical Study of Boosting Spectrum-Based Fault Localization via PageRank47
CODIT: Code Editing With Tree-Based Neural Models47
Just-In-Time Defect Identification and Localization: A Two-Phase Framework46
DeepLineDP: Towards a Deep Learning Approach for Line-Level Defect Prediction46
Neural Transfer Learning for Repairing Security Vulnerabilities in C Code44
Today Was a Good Day: The Daily Life of Software Developers44
Historical Spectrum Based Fault Localization44
How to Evaluate Solutions in Pareto-Based Search-Based Software Engineering: A Critical Review and Methodological Guidance43
Methodological Principles for Reproducible Performance Evaluation in Cloud Computing43
Rebooting Research on Detecting Repackaged Android Apps: Literature Review and Benchmark42
Exploring Community Smells in Open-Source: An Automated Approach42
A Systematic Literature Review on Bad Smells–5 W's: Which, When, What, Who, Where42
Using K-core Decomposition on Class Dependency Networks to Improve Bug Prediction Model's Practical Performance41
Specification Patterns for Robotic Missions41
A Qualitative Study of the Benefits and Costs of Logging From Developers’ Perspectives41
A Large Scale Study of Long-Time Contributor Prediction for GitHub Projects41
App Store Effects on Software Engineering Practices41
An Empirical Study of Obsolete Answers on Stack Overflow41
RefDiff 2.0: A Multi-Language Refactoring Detection Tool40
The Effectiveness of Supervised Machine Learning Algorithms in Predicting Software Refactoring40
Towards Security Threats of Deep Learning Systems: A Survey40
ElementRank: Ranking Java Software Classes and Packages using a Multilayer Complex Network-Based Approach40
Deep Transfer Bug Localization40
Reinforcement Learning for Test Case Prioritization38
Search-Based Crash Reproduction and Its Impact on Debugging38
What Do Package Dependencies Tell Us About Semantic Versioning?38
Companies’ Participation in OSS Development–An Empirical Study of OpenStack37
SEGRESS: Software Engineering Guidelines for REporting Secondary Studies37
Mining Likely Analogical APIs Across Third-Party Libraries via Large-Scale Unsupervised API Semantics Embedding37
An Empirical Study on Heterogeneous Defect Prediction Approaches37
What Predicts Software Developers’ Productivity?36
A Longitudinal Study of Application Structure and Behaviors in Android35
Understanding and Detecting Fragmentation-Induced Compatibility Issues for Android Apps35
The State of Serverless Applications: Collection, Characterization, and Community Consensus34
An Empirical Study on the Usage of Transformer Models for Code Completion34
Too Many User-Reviews! What Should App Developers Look at First?34
A Machine Learning Approach to Improve the Detection of CI Skip Commits34
ATOM: Commit Message Generation Based on Abstract Syntax Tree and Hybrid Ranking33
Automatic Mining of Opinions Expressed About APIs in Stack Overflow33
Towards a Theory of Software Developer Job Satisfaction and Perceived Productivity33
Why Do Software Developers Use Static Analysis Tools? A User-Centered Study of Developer Needs and Motivations33
A Taxonomy of Inter-Team Coordination Mechanisms in Large-Scale Agile32
Platform-Independent Dynamic Taint Analysis for JavaScript32
Which Commits Can Be CI Skipped?31
What Makes Agile Software Development Agile?31
The Effects of Human Aspects on the Requirements Engineering Process: A Systematic Literature Review30
Diversified Third-Party Library Prediction for Mobile App Development30
Architecture Anti-Patterns: Automatically Detectable Violations of Design Principles30
Human Values in Software Engineering: Contrasting Case Studies of Practice30
What Do Programmers Discuss About Blockchain? A Case Study on the Use of Balanced LDA and the Reference Architecture of a Domain to Capture Online Discussions About Blockchain Platforms Across Stack E30
Why My App Crashes? Understanding and Benchmarking Framework-Specific Exceptions of Android Apps30
Mind the Gap! A Study on the Transferability of Virtual Versus Physical-World Testing of Autonomous Driving Systems30
oo7: Low-Overhead Defense Against Spectre Attacks via Program Analysis30
The Secret Life of Software Vulnerabilities: A Large-Scale Empirical Study29
A Progression Model of Software Engineering Goals, Challenges, and Practices in Start-Ups29
Real World Scrum A Grounded Theory of Variations in Practice29
Accessible or Not? An Empirical Investigation of Android App Accessibility29
Lightweight Assessment of Test-Case Effectiveness Using Source-Code-Quality Indicators29
Quality of Automated Program Repair on Real-World Defects29
How Well Do Change Sequences Predict Defects? Sequence Learning from Software Changes29
Automated Classification of Overfitting Patches With Statically Extracted Code Features28
What's Wrong with My Benchmark Results? Studying Bad Practices in JMH Benchmarks28
Holistic Combination of Structural and Textual Code Information for Context Based API Recommendation28
Emotions and Perceived Productivity of Software Developers at the Workplace28
Challenges and Common Solutions in Smart Contract Development28
Revisiting Binary Code Similarity Analysis Using Interpretable Feature Engineering and Lessons Learned28
Automatic Generation of Acceptance Test Cases From Use Case Specifications: An NLP-Based Approach28
Practical Mutation Testing at Scale: A view from Google27
Review Dynamics and Their Impact on Software Quality27
On the Validity of Pre-Trained Transformers for Natural Language Processing in the Software Engineering Domain27
Within-Project Defect Prediction of Infrastructure-as-Code Using Product and Process Metrics27
Codee: A Tensor Embedding Scheme for Binary Code Search27
Falsification of Cyber-Physical Systems Using Deep Reinforcement Learning26
Dealing with Non-Functional Requirements in Model-Driven Development: A Survey26
An Empirical Evaluation of Using Large Language Models for Automated Unit Test Generation26
Easy-to-Deploy API Extraction by Multi-Level Feature Embedding and Transfer Learning26
BiasFinder: Metamorphic Test Generation to Uncover Bias for Sentiment Analysis Systems26
Nighthawk: Fully Automated Localizing UI Display Issues via Visual Understanding25
Boosting API Recommendation With Implicit Feedback25
A Survey of Performance Optimization for Mobile Applications25
CrySL: An Extensible Approach to Validating the Correct Usage of Cryptographic APIs25
SQAPlanner: Generating Data-Informed Software Quality Improvement Plans25
How to “DODGE” Complex Software Analytics24
Effects of Personality Traits on Pull Request Acceptance24
The Effect of Work Environments on Productivity and Satisfaction of Software Engineers24
Understanding How and Why Developers Seek and Analyze API-Related Opinions24
On the Costs and Profit of Software Defect Prediction24
Coverage Prediction for Accelerating Compiler Testing24
Enabling Good Work Habits in Software Developers through Reflective Goal-Setting24
GPT2SP: A Transformer-Based Agile Story Point Estimation Approach24
SOSRepair: Expressive Semantic Search for Real-World Program Repair24
Gender Differences in Personality Traits of Software Engineers24
PPChecker: Towards Accessing the Trustworthiness of Android Apps’ Privacy Policies23
Using Transfer Learning for Code-Related Tasks23
Characterizing Crowds to Better Optimize Worker Recommendation in Crowdsourced Testing23
Dependency Smells in JavaScript Projects23
An Empirical Study of Dependency Downgrades in the npm Ecosystem23
Black-Box Testing of Deep Neural Networks through Test Case Diversity22
SeqTrans: Automatic Vulnerability Fix Via Sequence to Sequence Learning22
Uncovering the Benefits and Challenges of Continuous Integration Practices22
On the Value of Oversampling for Deep Learning in Software Defect Prediction22
A Fast Clustering Algorithm for Modularization of Large-Scale Software Systems22
Construct Validity in Software Engineering22
Modeling Functional Similarity in Source Code With Graph-Based Siamese Networks22
Neural Network Guided Evolutionary Fuzzing for Finding Traffic Violations of Autonomous Vehicles22
The ORIS Tool: Quantitative Evaluation of Non-Markovian Systems22
Machine/Deep Learning for Software Engineering: A Systematic Literature Review21
An Empirical Study of C++ Vulnerabilities in Crowd-Sourced Code Examples21
Evaluation of Static Vulnerability Detection Tools With Java Cryptographic API Benchmarks21
Reading Answers on Stack Overflow: Not Enough!21
Decomposition of Monolith Applications Into Microservices Architectures: A Systematic Review21
Vuln4Real: A Methodology for Counting Actually Vulnerable Dependencies21
How Gender-Biased Tools Shape Newcomer Experiences in OSS Projects20
AI-Enabled Automation for Completeness Checking of Privacy Policies20
Multi-Objective Software Effort Estimation: A Replication Study20
CBGA-ES+: A Cluster-Based Genetic Algorithm with Non-Dominated Elitist Selection for Supporting Multi-Objective Test Optimization20
Systematic Evaluation and Usability Analysis of Formal Methods Tools for Railway Signaling System Design20
Metamorphic Robustness Testing: Exposing Hidden Defects in Citation Statistics and Journal Impact Factors20
User Review-Based Change File Localization for Mobile Applications19
Scalable and Accurate Test Case Prioritization in Continuous Integration Contexts19
How Developers Choose Names19
Data Preparation for Software Vulnerability Prediction: A Systematic Literature Review19
Can Clean New Code Reduce Technical Debt Density?19
Chatbot4QR: Interactive Query Refinement for Technical Question Retrieval18
A Deep Dive into the Impact of COVID-19 on Software Development18
Capabilities and Practices in DevOps: A Multivocal Literature Review18
Containerization for High Performance Computing Systems: Survey and Prospects18
Are You Still Working on This? An Empirical Study on Pull Request Abandonment18
Evaluating Automatic Program Repair Capabilities to Repair API Misuses18
Improving Vulnerability Inspection Efficiency Using Active Learning18
Detecting Software Security Vulnerabilities Via Requirements Dependency Analysis18
Deep Learning Based Program Generation From Requirements Text: Are We There Yet?18
Watch Out for Extrinsic Bugs! A Case Study of Their Impact in Just-In-Time Bug Prediction Models on the OpenStack Project18
Software Module Clustering: An In-Depth Literature Analysis18
PerfJIT: Test-Level Just-in-Time Prediction for Performance Regression Introducing Commits18
Software Testing With Large Language Models: Survey, Landscape, and Vision18
Characterizing the Usage, Evolution and Impact of Java Annotations in Practice18
Learning Configurations of Operating Environment of Autonomous Vehicles to Maximize their Collisions18
Identifying Self-Admitted Technical Debts With Jitterbug: A Two-Step Approach17
A Study of Feature Scattering in the Linux Kernel17
IntRepair: Informed Repairing of Integer Overflows17
CTOS: Compiler Testing for Optimization Sequences of LLVM17
A Study of C/C++ Code Weaknesses on Stack Overflow17
Will Dependency Conflicts Affect My Program's Semantics?17
Better Data Labelling With EMBLEM (and how that Impacts Defect Prediction)17
A Multi-Armed Bandit Approach for Test Case Prioritization in Continuous Integration Environments17
PatchNet: Hierarchical Deep Learning-Based Stable Patch Identification for the Linux Kernel17
Theoretical and Empirical Analyses of the Effectiveness of Metamorphic Relation Composition17
GraphSearchNet: Enhancing GNNs via Capturing Global Dependencies for Semantic Code Search16
A Search-Based Testing Approach for Deep Reinforcement Learning Agents16
Enriching API Documentation with Code Samples and Usage Scenarios from Crowd Knowledge16
Impact of Discretization Noise of the Dependent Variable on Machine Learning Classifiers in Software Engineering16
Specification-based Autonomous Driving System Testing16
Pluto: Exposing Vulnerabilities in Inter-Contract Scenarios16
PackerGrind: An Adaptive Unpacking System for Android Apps16
Finding Trends in Software Research16
Towards Reliable Online Just-in-Time Software Defect Prediction16
Modeling and Recommending Open Source Licenses with findOSSLicense16
How Developers Engineer Test Cases: An Observational Study16
Pathidea: Improving Information Retrieval-Based Bug Localization by Re-Constructing Execution Paths Using Logs16
Factors Affecting On-Time Delivery in Large-Scale Agile Software Development16
Research on Third-Party Libraries in Android Apps: A Taxonomy and Systematic Literature Review16
A Chaos Engineering System for Live Analysis and Falsification of Exception-Handling in the JVM16
A3: Assisting Android API Migrations Using Code Examples16
Key Stakeholders’ Value Propositions for Feature Selection in Software-Intensive Products: An Industrial Case Study16
Software Engineering for Internet of Things: The Practitioners' Perspective15
Context-Aware Personalized Crowdtesting Task Recommendation15
A Faceted Taxonomy of Requirements Changes in Agile Contexts15
The Mutation and Injection Framework: Evaluating Clone Detection Tools with Mutation Analysis15
Interlocking Safety Cases for Unmanned Autonomous Systems in Shared Airspaces15
Locating Latent Design Information in Developer Discussions: A Study on Pull Requests15
Automatic Test Case and Test Oracle Generation Based on Functional Scenarios in Formal Specifications for Conformance Testing15
In Search of Socio-Technical Congruence: A Large-Scale Longitudinal Study15
Post2Vec: Learning Distributed Representations of Stack Overflow Posts15
Pull Request Decisions Explained: An Empirical Overview15
Automatic Detection of Java Cryptographic API Misuses: Are We There Yet?15
ARTE: Automated Generation of Realistic Test Inputs for Web APIs15
Continuously Managing NFRs: Opportunities and Challenges in Practice14
Including Everyone, Everywhere: Understanding Opportunities and Challenges of Geographic Gender-Inclusion in OSS14
Comparative Analysis of Constraint Handling Techniques for Constrained Combinatorial Testing14
How Can Human Values Be Addressed in AgileMethods A Case Study on SAFe14
LogAssist: Assisting Log Analysis Through Log Summarization14
Cross-Project Online Just-In-Time Software Defect Prediction14
API-Related Developer Information Needs in Stack Overflow14
Recommending API Function Calls and Code Snippets to Support Software Development14
On Company Contributions to Community Open Source Software Projects14
Forecasting Architectural Decay From Evolutionary History14
Exploiting Natural Language Structures in Software Informal Documentation14
FairMask: Better Fairness via Model-Based Rebalancing of Protected Attributes14
A Comprehensive Investigation of the Impact of Class Overlap on Software Defect Prediction14
The Impact of Code Review on Architectural Changes14
A Survey on the Adoption of Patterns for Engineering Software for the Cloud14
Revisiting, Benchmarking and Exploring API Recommendation: How Far Are We?14
An Empirical Study of Release Note Production and Usage in Practice13
Code Cloning in Smart Contracts on the Ethereum Platform: An Extended Replication Study13
SigRec: Automatic Recovery of Function Signatures in Smart Contracts13
Simpler Hyperparameter Optimization for Software Analytics: Why, How, When?13
Transfer Learning Across Variants and Versions: The Case of Linux Kernel Size13
Dealing With Data Challenges When Delivering Data-Intensive Software Solutions13
On the Relationship Between the Developer’s Perceptible Race and Ethnicity and the Evaluation of Contributions in OSS13
Accelerating Continuous Integration by Caching Environments and Inferring Dependencies13
Inputs From Hell:13
MultiPL-E: A Scalable and Polyglot Approach to Benchmarking Neural Code Generation13
Testing with Fewer Resources: An Adaptive Approach to Performance-Aware Test Case Generation13
Aroc: An Automatic Repair Framework for On-Chain Smart Contracts13
VulExplainer: A Transformer-Based Hierarchical Distillation for Explaining Vulnerability Types13
The Impact of Data Merging on the Interpretation of Cross-Project Just-In-Time Defect Models13
Behavior Trees and State Machines in Robotics Applications13
RNN-Test: Towards Adversarial Testing for Recurrent Neural Network Systems13
How Software Developers Mitigate Their Errors When Developing Code13
Dynamic Update of Discrete Event Controllers12
Requirements Framing Affects Design Creativity12
Redundancy, Context, and Preference: An Empirical Study of Duplicate Pull Requests in OSS Projects12
ConEx: Efficient Exploration of Big-Data System Configurations for Better Performance12
Comments on "Using k-core Decomposition on Class Dependency Networks to Improve Bug Prediction Model's Practical Performance"12
0.022445917129517