IEEE Transactions on Parallel and Distributed Systems

Papers
(The H4-Index of IEEE Transactions on Parallel and Distributed Systems is 52. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-05-01 to 2025-05-01.)
Article | Citations
2020 Reviewers List | 300
Enabling Large Scale Simulations for Particle Accelerators | 264
Critique of “MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization” by SCC Team From Tsinghua University | 219
Building High-throughput Neural Architecture Search Workflows via a Decoupled Fitness Prediction Engine | 203
Jdebug: A Fast, Non-Intrusive and Scalable Fault Locating Tool for Ten-Million-Scale Parallel Applications | 196
EdgeTB: A Hybrid Testbed for Distributed Machine Learning at the Edge With High Fidelity | 159
Design and Implementation of 2D Convolution on x86/x64 Processors | 154
Replicated Versioned Data Structures for Wide-Area Distributed Systems | 149
A Point Cloud Video Recognition Acceleration Framework Based on Tempo-Spatial Information | 128
STR: Hybrid Tensor Re-Generation to Break Memory Wall for DNN Training | 128
An Efficient Bottleneck Planes Exclusion Method for Reconfiguring 3D VLSI Arrays | 124
HRCM: A Hierarchical Regularizing Mechanism for Sparse and Imbalanced Communication in Whole Human Brain Simulations | 113
H5Intent: Autotuning HDF5 With User Intent | 105
Distributed Task Processing Platform for Infrastructure-Less IoT Networks: A Multi-Dimensional Optimization Approach | 102
A Memory-Constraint-Aware List Scheduling Algorithm for Memory-Constraint Heterogeneous Muti-Processor System | 97
On the Message Complexity of Fault-Tolerant Computation: Leader Election and Agreement | 93
QoS-Aware Scheduling of Remote Rendering for Interactive Multimedia Applications in Edge Computing | 91
GeoScale: Microservice Autoscaling With Cost Budget in Geo-Distributed Edge Clouds | 87
IPPTS: An Efficient Algorithm for Scientific Workflow Scheduling in Heterogeneous Computing Systems | 85
AWB+-Tree: a Novel Width-based Index Structure Supporting Hybrid Matching for Large-scale Content-based Pub/Sub Systems | 81
Improving I/O Performance for Exascale Applications Through Online Data Layout Reorganization | 80
Joint Task Scheduling and Containerizing for Efficient Edge Computing | 80
Federated Learning With Nesterov Accelerated Gradient | 77
Coflow Scheduling in Data Centers: Routing and Bandwidth Allocation | 77
A Case for Pricing Bandwidth: Sharing Datacenter Networks With Cost Dominant Fairness | 76
Multi-Swarm Co-Evolution Based Hybrid Intelligent Optimization for Bi-Objective Multi-Workflow Scheduling in the Cloud | 76
Accelerating Data Delivery of Latency-Sensitive Applications in Container Overlay Network | 74
Critique of “Planetary Normal Mode Computation: Parallel Algorithms, Performance, and Reproducibility” by SCC Team From University of Washington | 74
Graph-Centric Performance Analysis for Large-Scale Parallel Applications | 72
Securing Fine-Grained Data Sharing and Erasure in Outsourced Storage Systems | 71
Agile Cache Replacement in Edge Computing via Offline-Online Deep Reinforcement Learning | 69
Simple, Fast and Widely Applicable Concurrent Memory Reclamation via Neutralization | 66
A Pessimistic Fault Diagnosability of Large-Scale Connected Networks via Extra Connectivity | 64
LB-Chain: Load-Balanced and Low-Latency Blockchain Sharding via Account Migration | 64
Joint Model Pruning and Topology Construction for Accelerating Decentralized Machine Learning | 62
Efficient and Automated Deployment Architecture for OpenStack in TianHe SuperComputing Environment | 62
DyLaClass: Dynamic Labeling Based Classification for Optimal Sparse Matrix Format Selection in Accelerating SpMV | 62
GreenFlow: A Carbon-Efficient Scheduler for Deep Learning Workloads | 62
High-Level Data Abstraction and Elastic Data Caching for Data-Intensive AI Applications on Cloud-Native Platforms | 61
A Novel Parallel Algorithm for Sparse Tensor Matrix Chain Multiplication via TCU-Acceleration | 60
Improving the Scalability of GPU Synchronization Primitives | 59
Burst Load Evacuation Based on Dispatching and Scheduling In Distributed Edge Networks | 59
Asynchronous Algorithms for Decentralized Resource Allocation Over Directed Networks | 59
BARM: A Batch-Aware Resource Manager for Boosting Multiple Neural Networks Inference on GPUs With Memory Oversubscription | 58
Tag-Sharer-Fusion Directory: A Scalable Coherence Directory With Flexible Entry Formats | 57
Improved MPC Algorithms for Edit Distance and Ulam Distance | 57
Error-Compensated Sparsification for Communication-Efficient Decentralized Training in Edge Environment | 56
Coordinating Fast Concurrency Adapting With Autoscaling for SLO-Oriented Web Applications | 55
AESM2 Attribute-Based Encrypted Search for Multi-Owner and Multi-User Distributed Systems | 54
Coarse Grained FPGA Overlay for Rapid Just-In-Time Accelerator Compilation | 53
Efficient Distributed Approaches to Core Maintenance on Large Dynamic Graphs | 53
vPipe: A Virtualized Acceleration System for Achieving Efficient and Scalable Pipeline Parallel DNN Training | 52
Accelerating Sparse Tensor Decomposition Using Adaptive Linearized Representation | 52
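
The page does not define the H4-Index. Assuming it is the usual h-index (the largest h such that at least h papers have h or more citations) restricted to the four-year window, the stated value can be checked against the citation counts in the table. The following is a minimal sketch under that assumption; the function name `h_index` is illustrative and is not taken from the source.

```python
def h_index(citations):
    """Return the largest h such that at least h papers have >= h citations.

    Assumption: this is the standard h-index definition; the page itself
    does not say how its H4-Index is computed.
    """
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank  # the top `rank` papers all have at least `rank` citations
        else:
            break
    return h


# Citation counts copied from the table, in the order listed.
counts = [
    300, 264, 219, 203, 196, 159, 154, 149, 128, 128,
    124, 113, 105, 102, 97, 93, 91, 87, 85, 81,
    80, 80, 77, 77, 76, 76, 74, 74, 72, 71,
    69, 66, 64, 64, 62, 62, 62, 62, 61, 60,
    59, 59, 59, 58, 57, 57, 56, 55, 54, 53,
    53, 52, 52,
]

print(h_index(counts))  # prints 52
```

The table holds 53 papers, but the 53rd has only 52 citations, so the index stops at 52. Papers with fewer than 52 citations are not listed and would not change the result.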