IEEE Transactions on Parallel and Distributed Systems

Papers
(The H4-Index of IEEE Transactions on Parallel and Distributed Systems is 50. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-11-01 to 2025-11-01.)
ArticleCitations
Critique of “MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization” by SCC Team From Tsinghua University339
Jdebug: A Fast, Non-Intrusive and Scalable Fault Locating Tool for Ten-Million-Scale Parallel Applications245
H5Intent: Autotuning HDF5 With User Intent240
Replicated Versioned Data Structures for Wide-Area Distributed Systems189
A Point Cloud Video Recognition Acceleration Framework Based on Tempo-Spatial Information181
Distributed Task Processing Platform for Infrastructure-Less IoT Networks: A Multi-Dimensional Optimization Approach164
STR: Hybrid Tensor Re-Generation to Break Memory Wall for DNN Training161
On the Message Complexity of Fault-Tolerant Computation: Leader Election and Agreement116
Mapping Large-Scale Spiking Neural Network on Arbitrary Meshed Neuromorphic Hardware114
Optimizing Data Locality by Integrating Intermediate Data Partitioning and Reduce Task Scheduling in Spark Framework111
Enabling Large Scale Simulations for Particle Accelerators108
A Memory-Constraint-Aware List Scheduling Algorithm for Memory-Constraint Heterogeneous Muti-Processor System100
An Efficient Bottleneck Planes Exclusion Method for Reconfiguring 3D VLSI Arrays100
Building High-throughput Neural Architecture Search Workflows via a Decoupled Fitness Prediction Engine96
Design and Implementation of 2D Convolution on x86/x64 Processors94
AWB+-Tree: A Novel Width-Based Index Structure Supporting Hybrid Matching for Large-Scale Content-Based Pub/Sub Systems93
IRHunter: Universal Detection of Instruction Reordering Vulnerabilities for Enhanced Concurrency in Distributed and Parallel Systems91
QoS-Aware Scheduling of Remote Rendering for Interactive Multimedia Applications in Edge Computing90
GeoScale: Microservice Autoscaling With Cost Budget in Geo-Distributed Edge Clouds86
Federated Learning With Nesterov Accelerated Gradient85
Improving I/O Performance for Exascale Applications Through Online Data Layout Reorganization85
HRCM: A Hierarchical Regularizing Mechanism for Sparse and Imbalanced Communication in Whole Human Brain Simulations85
EdgeTB: A Hybrid Testbed for Distributed Machine Learning at the Edge With High Fidelity81
RHINO: An Efficient Serverless Container System for Small-Scale HPC Applications75
Graph-Centric Performance Analysis for Large-Scale Parallel Applications74
Building Accurate and Interpretable Online Classifiers on Edge Devices72
GreenFlow: A Carbon-Efficient Scheduler for Deep Learning Workloads71
Simple, Fast and Widely Applicable Concurrent Memory Reclamation via Neutralization71
Accelerating Data Delivery of Latency-Sensitive Applications in Container Overlay Network71
Critique of “Planetary Normal Mode Computation: Parallel Algorithms, Performance, and Reproducibility” by SCC Team From University of Washington70
Tag-Sharer-Fusion Directory: A Scalable Coherence Directory With Flexible Entry Formats69
BARM: A Batch-Aware Resource Manager for Boosting Multiple Neural Networks Inference on GPUs With Memory Oversubscription69
Improved MPC Algorithms for Edit Distance and Ulam Distance67
Coordinating Fast Concurrency Adapting With Autoscaling for SLO-Oriented Web Applications63
Cannikin: No Lagger of SLO in Concurrent Multiple LoRA LLM Serving63
PreTrans: Enabling Efficient CGRA Multi-Task Context Switch Through Config Pre-Mapping and Data Transceiving62
Coflow Scheduling in Data Centers: Routing and Bandwidth Allocation61
Agile Cache Replacement in Edge Computing via Offline-Online Deep Reinforcement Learning60
Error-Compensated Sparsification for Communication-Efficient Decentralized Training in Edge Environment60
DyLaClass: Dynamic Labeling Based Classification for Optimal Sparse Matrix Format Selection in Accelerating SpMV59
A Pessimistic Fault Diagnosability of Large-Scale Connected Networks via Extra Connectivity58
Multi-Swarm Co-Evolution Based Hybrid Intelligent Optimization for Bi-Objective Multi-Workflow Scheduling in the Cloud57
High-Level Data Abstraction and Elastic Data Caching for Data-Intensive AI Applications on Cloud-Native Platforms57
CiMBA: Accelerating Genome Sequencing Through On-Device Basecalling via Compute-in-Memory56
Improving the Scalability of GPU Synchronization Primitives55
Efficient and Automated Deployment Architecture for OpenStack in TianHe SuperComputing Environment55
A Novel Parallel Algorithm for Sparse Tensor Matrix Chain Multiplication via TCU-Acceleration53
PHIDE: A Parallel Hybrid Direct-Iterative Eigensolver for Hermitian Eigenvalue Problems51
AESM2 Attribute-Based Encrypted Search for Multi-Owner and Multi-User Distributed Systems51
Scalable Hybrid Learning Techniques for Scientific Data Compression50
LB-Chain: Load-Balanced and Low-Latency Blockchain Sharding via Account Migration50
Joint Model Pruning and Topology Construction for Accelerating Decentralized Machine Learning50
Securing Fine-Grained Data Sharing and Erasure in Outsourced Storage Systems50
0.12942385673523