IEEE Transactions on Parallel and Distributed Systems

Papers
(The median citation count of IEEE Transactions on Parallel and Distributed Systems is 4. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-04-01 to 2024-04-01.)
ArticleCitations
Fast Adaptive Task Offloading in Edge Computing Based on Meta Reinforcement Learning191
Accelerating Federated Learning via Momentum Gradient Descent190
A Scalable Multi-Layer PBFT Consensus for Blockchain185
Self-Balancing Federated Learning With Global Imbalanced Data in Mobile Systems183
Online Collaborative Data Caching in Edge Computing150
Decentralized Edge Intelligence: A Dynamic Resource Allocation Framework for Hierarchical Federated Learning125
Kokkos 3: Programming Model Extensions for the Exascale Era125
Towards Fair and Privacy-Preserving Federated Deep Models124
Biscotti: A Blockchain System for Private and Secure Federated Learning118
Cost-Effective App Data Distribution in Edge Computing118
Energy-Efficient Offloading for DNN-Based Smart IoT Systems in Cloud-Edge Environments103
GRP-HEFT: A Budget-Constrained Resource Provisioning Scheme for Workflow Scheduling in IaaS Clouds102
Recent Advances of Resource Allocation in Network Function Virtualization102
Multi-Task Federated Learning for Personalised Deep Neural Networks in Edge Computing99
Towards Accurate Prediction for High-Dimensional and Highly-Variable Cloud Workloads with Deep Learning97
Distributed and Dynamic Service Placement in Pervasive Edge Computing Networks87
Multi-Agent Imitation Learning for Pervasive Edge Computing: A Decentralized Computation Offloading Algorithm86
The Deep Learning Compiler: A Comprehensive Survey84
Auditing Cache Data Integrity in the Edge Computing Environment83
Energy-Aware Application Placement in Mobile Edge Computing: A Stochastic Optimization Approach80
Online Deadline-Aware Task Dispatching and Scheduling in Edge Computing77
Communication-Efficient Federated Learning With Compensated Overlap-FedAvg75
Distributed Task Migration Optimization in MEC by Extending Multi-Agent Deep Reinforcement Learning Approach71
Modeling and Optimization of Performance and Cost of Serverless Applications71
CASpMV: A Customized and Accelerative SpMV Framework for the Sunway TaihuLight71
Blockchain Assisted Decentralized Federated Learning (BLADE-FL): Performance Analysis and Resource Allocation69
Distributed and Collective Deep Reinforcement Learning for Computation Offloading: A Practical Perspective67
AUCTION: Automated and Quality-Aware Client Selection Framework for Efficient Federated Learning63
Multi-Hop Multi-Task Partial Computation Offloading in Collaborative Edge Computing62
POCLib: A High-Performance Framework for Enabling Near Orthogonal Processing on Compression61
FRATO: Fog Resource Based Adaptive Task Offloading for Delay-Minimizing IoT Service Provisioning61
Elastic Scheduling for Microservice Applications in Clouds60
Blockchain at the Edge: Performance of Resource-Constrained IoT Networks59
Min-Max Cost Optimization for Efficient Hierarchical Federated Learning in Wireless Edge Networks58
Offloading Tasks With Dependency and Service Caching in Mobile Edge Computing57
A Potential Game Theoretic Approach to Computation Offloading Strategy Optimization in End-Edge-Cloud Computing55
CSEdge: Enabling Collaborative Edge Storage for Multi-Access Edge Computing Based on Blockchain55
Reliability-Aware Network Service Provisioning in Mobile Edge-Cloud Networks55
Proof of Federated Learning: A Novel Energy-Recycling Consensus Algorithm54
Efficient Algorithms for Delay-Aware NFV-Enabled Multicasting in Mobile Edge Clouds With Resource Sharing53
COSCO: Container Orchestration Using Co-Simulation and Gradient Based Optimization for Fog Computing Environments52
Energy-Aware Inference Offloading for DNN-Driven Applications in Mobile Edge Clouds52
Privacy-Preserving Multi-Keyword Searchable Encryption for Distributed Systems52
On-Edge Multi-Task Transfer Learning: Model and Practice With Data-Driven Task Allocation50
On Consortium Blockchain Consistency: A Queueing Network Model Approach49
Cryptomining Detection in Container Clouds Using System Calls and Explainable Machine Learning48
Evaluation of Stream Processing Frameworks46
Towards Efficient Scheduling of Federated Mobile Devices Under Computational and Statistical Heterogeneity46
VQL: Efficient and Verifiable Cloud Query Services for Blockchain Systems46
Faster Parallel Core Maintenance Algorithms in Dynamic Graphs46
Taskflow: A Lightweight Parallel and Heterogeneous Task Graph Computing System45
Thermal Prediction for Efficient Energy Management of Clouds Using Machine Learning45
Performance and Cost-Efficient Spark Job Scheduling Based on Deep Reinforcement Learning in Cloud Computing Environments44
Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model44
Transformations of High-Level Synthesis Codes for High-Performance Computing43
e-PoS: Making Proof-of-Stake Decentralized and Fair43
Distributed Training of Deep Learning Models: A Taxonomic Perspective43
Task Scheduling for Energy Consumption Constrained Parallel Applications on Heterogeneous Computing Systems42
Elastic and Reliable Bandwidth Reservation Based on Distributed Traffic Monitoring and Control42
Joint Task Scheduling and Containerizing for Efficient Edge Computing42
The Design of Fast Content-Defined Chunking for Data Deduplication Based Storage Systems41
Adaptive Resource Efficient Microservice Deployment in Cloud-Edge Continuum41
IPPTS: An Efficient Algorithm for Scientific Workflow Scheduling in Heterogeneous Computing Systems41
Data, User and Power Allocations for Caching in Multi-Access Edge Computing39
DeepSlicing: Collaborative and Adaptive CNN Inference With Low Latency38
TODG: Distributed Task Offloading With Delay Guarantees for Edge Computing38
Towards Distributed SDN: Mobility Management and Flow Scheduling in Software Defined Urban IoT37
A Game-Based Approach for Cost-Aware Task Assignment With QoS Constraint in Collaborative Edge and Cloud Environments37
A Quantum Approach Towards the Adaptive Prediction of Cloud Workloads37
Heterogeneous Edge Offloading With Incomplete Information: A Minority Game Approach36
ADRL: A Hybrid Anomaly-Aware Deep Reinforcement Learning-Based Resource Scaling in Clouds35
Location-Aware and Budget-Constrained Service Deployment for Composite Applications in Multi-Cloud Environment35
Automated Fine-Grained CPU Cap Control in Serverless Computing Platform34
Congestion-Balanced and Welfare-Maximized Charging Strategies for Electric Vehicles34
DL2: A Deep Learning-Driven Scheduler for Deep Learning Clusters34
Algorithm-Based Fault Tolerance for Convolutional Neural Networks33
Hierarchical Multi-Agent Optimization for Resource Allocation in Cloud Computing33
Burst Load Evacuation Based on Dispatching and Scheduling In Distributed Edge Networks33
On the Effective Parallelization and Near-Optimal Deployment of Service Function Chains33
Mechanisms for Resource Allocation and Pricing in Mobile Edge Computing Systems33
VPIC 2.0: Next Generation Particle-in-Cell Simulations32
PQC Acceleration Using GPUs: FrodoKEM, NewHope, and Kyber32
FedGraph: Federated Graph Learning With Intelligent Sampling32
Online Learning for Distributed Computation Offloading in Wireless Powered Mobile Edge Computing Networks32
Horus: Interference-Aware and Prediction-Based Scheduling in Deep Learning Systems31
Customer Perceived Value- and Risk-Aware Multiserver Configuration for Profit Maximization31
A Ubiquitous Machine Learning Accelerator With Automatic Parallelization on FPGA29
Network-Aware Locality Scheduling for Distributed Data Operators in Data Centers29
GPU-Accelerated Real-Time Stereo Estimation With Binary Neural Network29
Cost-Efficient Workflow Scheduling Algorithm for Applications With Deadline Constraint on Heterogeneous Clouds28
Large-Scale Analysis of Docker Images and Performance Implications for Container Storage Systems28
Distributed Adaptive Consensus Tracking Control for Multi-Agent System With Communication Constraints28
Maximizing User Service Satisfaction for Delay-Sensitive IoT Applications in Edge Computing28
Efficient Compute-Intensive Job Allocation in Data Centers via Deep Reinforcement Learning28
Deep Reinforcement Learning for Load-Balancing Aware Network Control in IoT Edge Systems28
LightChain: Scalable DHT-Based Blockchain27
Multi-Swarm Co-Evolution Based Hybrid Intelligent Optimization for Bi-Objective Multi-Workflow Scheduling in the Cloud27
Constructing Completely Independent Spanning Trees in Data Center Network Based on Augmented Cube27
VeriML: Enabling Integrity Assurances and Fair Payments for Machine Learning as a Service27
Optimizing Depthwise Separable Convolution Operations on GPUs27
Liquid: Intelligent Resource Estimation and Network-Efficient Scheduling for Deep Learning Jobs on Distributed GPU Clusters27
Adaptive and Efficient Resource Allocation in Cloud Datacenters Using Actor-Critic Deep Reinforcement Learning26
Multi-GPU Design and Performance Evaluation of Homomorphic Encryption on GPU Clusters26
T-Caching: Enhancing Feasibility of In-Network Caching in ICN26
Learning Spatiotemporal Failure Dependencies for Resilient Edge Computing Services26
Dependent Function Embedding for Distributed Serverless Edge Computing26
Minority Disk Failure Prediction Based on Transfer Learning in Large Data Centers of Heterogeneous Disk Systems26
Quantum Supremacy Circuit Simulation on Sunway TaihuLight26
Energy-Efficient Parallel Real-Time Scheduling on Clustered Multi-Core26
O3BNN-R: An Out-of-Order Architecture for High-Performance and Regularized BNN Inference26
An In-Depth Study of Microservice Call Graph and Runtime Performance25
Towards Higher Performance and Robust Compilation for CGRA Modulo Scheduling25
Reliability Aware Energy Optimized Scheduling of Non-Preemptive Periodic Real-Time Tasks on Heterogeneous Multiprocessor System25
ERA-LSTM: An Efficient ReRAM-Based Architecture for Long Short-Term Memory25
Rusty: Runtime Interference-Aware Predictive Monitoring for Modern Multi-Tenant Systems25
A Decentralized Federated Learning Framework via Committee Mechanism With Convergence Guarantee25
Completely Independent Spanning Trees on BCCC Data Center Networks With an Application to Fault-Tolerant Routing24
GPGPU Performance Estimation With Core and Memory Frequency Scaling24
Topology-Aware Neural Model for Highly Accurate QoS Prediction24
Endurance-Aware Mapping of Spiking Neural Networks to Neuromorphic Hardware24
Elastic Resource Allocation Against Imbalanced Transaction Assignments in Sharding-Based Permissioned Blockchains23
Efficient Distributed Approaches to Core Maintenance on Large Dynamic Graphs23
An Efficient Parallel Secure Machine Learning Framework on GPUs22
Monodirectional Evolutional Symport Tissue P Systems With Promoters and Cell Division22
EdgeDR: An Online Mechanism Design for Demand Response in Edge Clouds22
Flexible Clustered Federated Learning for Client-Level Data Distribution Shift21
Joint SFC Deployment and Resource Management in Heterogeneous Edge for Latency Minimization21
Cooperative Edge Caching Based on Temporal Convolutional Networks21
High-Performance Routing With Multipathing and Path Diversity in Ethernet and HPC Networks21
Coordinated Batching and DVFS for DNN Inference on GPU Accelerators21
Scalable and Adaptive Data Replica Placement for Geo-Distributed Cloud Storages21
Elastic Deep Learning in Multi-Tenant GPU Clusters20
High-Quality Shared-Memory Graph Partitioning20
COOPER-SCHED: A Cooperative Scheduling Framework for Mobile Edge Computing with Expected Deadline Guarantee20
Efficient Virtual Network Embedding of Cloud-Based Data Center Networks into Optical Networks20
LOCUS: User-Perceived Delay-Aware Service Placement and User Allocation in MEC Environment20
K-Athena: A Performance Portable Structured Grid Finite Volume Magnetohydrodynamics Code20
An Approximate Communication Framework for Network-on-Chips20
GPU Tensor Cores for Fast Arithmetic Reductions20
An Optimal Locality-Aware Task Scheduling Algorithm Based on Bipartite Graph Modelling for Spark Applications20
Towards Revenue-Driven Multi-User Online Task Offloading in Edge Computing20
Bi-Objective Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Performance and Energy Through Workload Distribution20
Parallel Blockwise Knowledge Distillation for Deep Neural Network Compression19
Parallelization and Optimization of NSGA-II on Sunway TaihuLight System19
Scalable, Multi-Constraint, Complex-Objective Graph Partitioning19
Context-Aware Online Client Selection for Hierarchical Federated Learning19
aeSpTV: An Adaptive and Efficient Framework for Sparse Tensor-Vector Product Kernel on a High-Performance Computing Platform19
MG-WFBP: Merging Gradients Wisely for Efficient Communication in Distributed Deep Learning19
Petrel: Heterogeneity-Aware Distributed Deep Learning Via Hybrid Synchronization19
Anomaly Detection and Anticipation in High Performance Computing Systems19
Design and Performance Characterization of RADICAL-Pilot on Leadership-Class Platforms18
SLEEF: A Portable Vectorized Library of C Standard Mathematical Functions18
A Survey of System Architectures and Techniques for FPGA Virtualization18
Performance-Aware Speculative Resource Oversubscription for Large-Scale Clusters18
Exploring Data Analytics Without Decompression on Embedded GPU Systems18
Decentralized Application Placement in Fog Computing18
Efficient Parallelism of Post-Quantum Signature Scheme SPHINCS18
Decentralized Utility- and Locality-Aware Replication for Heterogeneous DHT-Based P2P Cloud Storage Systems18
CURE: A High-Performance, Low-Power, and Reliable Network-on-Chip Design Using Reinforcement Learning18
Accelerating Gossip-Based Deep Learning in Heterogeneous Edge Computing Platforms18
Towards Efficient and Stable K-Asynchronous Federated Learning With Unbounded Stale Gradients on Non-IID Data18
HRHS: A High-Performance Real-Time Hardware Scheduler17
A Bifactor Approximation Algorithm for Cloudlet Placement in Edge Computing17
Differentially Private Byzantine-Robust Federated Learning17
P-PFC: Reducing Tail Latency with Predictive PFC in Lossless Data Center Networks17
BOSSA: A Decentralized System for Proofs of Data Retrievability and Replication17
Incentive Mechanism Design for Joint Resource Allocation in Blockchain-Based Federated Learning17
Hamiltonian Paths of -cubes Avoiding Faulty Links and Passing Through Prescribed Linear Forests16
Mobility-Aware Offloading and Resource Allocation for Distributed Services Collaboration16
Hotspot-Aware Hybrid Memory Management for In-Memory Key-Value Stores16
RENDA: Resource and Network Aware Data Placement Algorithm for Periodic Workloads in Cloud16
QShield: Protecting Outsourced Cloud Data Queries With Multi-User Access Control Based on SGX16
Endpoint-Flexible Coflow Scheduling Across Geo-Distributed Datacenters16
Scheduling Periodical Multi-Stage Jobs With Fuzziness to Elastic Cloud Resources16
A Practical and Efficient Bidirectional Access Control Scheme for Cloud-Edge Data Sharing16
An Event-Driven Approach to Serverless Seismic Imaging in the Cloud16
cCUDA: Effective Co-Scheduling of Concurrent Kernels on GPUs16
Towards Usable Cloud Storage Auditing16
Efficient Data Loader for Fast Sampling-Based GNN Training on Large Graphs15
Cost-Efficient Server Configuration and Placement for Mobile Edge Computing15
Fine-Grained Multi-Query Stream Processing on Integrated Architectures15
Accelerating Geostatistical Modeling and Prediction With Mixed-Precision Computations: A High-Productivity Approach With PaRSEC15
Achieving Fine-Grained Flow Management Through Hybrid Rule Placement in SDNs15
Cuttlefish: Neural Configuration Adaptation for Video Analysis in Live Augmented Reality15
Microservice Deployment in Edge Computing Based on Deep Q Learning15
Co-Active: A Workload-Aware Collaborative Cache Management Scheme for NVMe SSDs15
Modeling Analysis and Cost-Performance Ratio Optimization of Virtual Machine Scheduling in Cloud Computing14
GOSH: Task Scheduling Using Deep Surrogate Models in Fog Computing Environments14
HiTDL: High-Throughput Deep Learning Inference at the Hybrid Mobile Edge14
Trust: Triangle Counting Reloaded on GPUs14
Error-Compensated Sparsification for Communication-Efficient Decentralized Training in Edge Environment14
Deterministic Data Distribution for Efficient Recovery in Erasure-Coded Storage Systems14
GossipFL: A Decentralized Federated Learning Framework With Sparsified and Adaptive Communication14
Octans: Optimal Placement of Service Function Chains in Many-Core Systems14
Combinatorial BLAS 2.0: Scaling Combinatorial Algorithms on Distributed-Memory Systems14
E2bird: Enhanced Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services14
The Workflow Trace Archive: Open-Access Data From Public and Private Computing Infrastructures14
Silent-PIM: Realizing the Processing-in-Memory Computing With Standard Memory Requests14
TherMa-MiCs: Thermal-Aware Scheduling for Fault-Tolerant Mixed-Criticality Systems14
WindFlow: High-Speed Continuous Stream Processing With Parallel Building Blocks14
GML: Efficiently Auto-Tuning Flink's Configurations Via Guided Machine Learning14
Improving Federated Learning With Quality-Aware User Incentive and Auto-Weighted Model Aggregation14
Dynamic Load Balancing in Parallel Execution of Cellular Automata13
The Case for Strong Scaling in Deep Learning: Training Large 3D CNNs with Hybrid Parallelism13
MCDS: AI Augmented Workflow Scheduling in Mobile Edge Cloud Computing Systems13
Decentralized Dual Proximal Gradient Algorithms for Non-Smooth Constrained Composite Optimization Problems13
Overlapping Communication With Computation in Parameter Server for Scalable DL Training13
SF-Sketch: A Two-Stage Sketch for Data Streams13
Efficient Function Queryable and Privacy Preserving Data Aggregation Scheme in Smart Grid13
A Generic Stochastic Model for Resource Availability in Fog Computing Environments13
A Runtime and Non-Intrusive Approach to Optimize EDP by Tuning Threads and CPU Frequency for OpenMP Applications13
Privacy-Preserving Efficient Federated-Learning Model Debugging13
libEnsemble: A Library to Coordinate the Concurrent Evaluation of Dynamic Ensembles of Calculations13
Phase-Aware Cache Partitioning to Target Both Turnaround Time and System Performance13
Eiffel: Efficient and Fair Scheduling in Adaptive Federated Learning13
Resilient Real-Valued Consensus in Spite of Mobile Malicious Agents on Directed Graphs13
Accelerating Restarted GMRES With Mixed Precision Arithmetic13
Improving Restore Performance for In-Line Backup System Combining Deduplication and Delta Compression13
Learning-Driven Interference-Aware Workload Parallelization for Streaming Applications in Heterogeneous Cluster12
Parallel and Asynchronous Smart Contract Execution12
DONE: Distributed Approximate Newton-type Method for Federated Edge Learning12
cuNH: Efficient GPU Implementations of Post-Quantum KEM NewHope12
Replica Exchange MCMC Hardware With Automatic Temperature Selection and Parallel Trial12
ESetStore: An Erasure-Coded Storage System With Fast Data Recovery12
Collaborative Heterogeneity-Aware OS Scheduler for Asymmetric Multicore Processors12
Investigating the Adoption of Hybrid Encrypted Cloud Data Deduplication With Game Theory12
Reliability and Confidentiality Co-Verification for Parallel Applications in Distributed Systems12
DTransE: Distributed Translating Embedding for Knowledge Graph12
Adaptive Federated Deep Reinforcement Learning for Proactive Content Caching in Edge Computing12
Power-Aware Allocation of Graph Jobs in Geo-Distributed Cloud Networks12
Efficient Forwarding Anomaly Detection in Software-Defined Networks12
Energy-Efficient Hardware-Accelerated Synchronization for Shared-L1-Memory Multiprocessor Clusters12
Merak: An Efficient Distributed DNN Training Framework With Automated 3D Parallelism for Giant Foundation Models12
High Performance Simulation of Spiking Neural Network on GPGPUs12
Benzene: Scaling Blockchain With Cooperation-Based Sharding12
vPipe: A Virtualized Acceleration System for Achieving Efficient and Scalable Pipeline Parallel DNN Training12
Preemptive and Low Latency Datacenter Scheduling via Lightweight Containers12
Millimeter-Scale and Billion-Atom Reactive Force Field Simulation on Sunway Taihulight12
Coflow Scheduling in Data Centers: Routing and Bandwidth Allocation11
Simplified Workflow Simulation on Clouds based on Computation and Communication Noisiness11
Auction-Based Cluster Federated Learning in Mobile Edge Computing Systems11
A Pattern-Based SpGEMM Library for Multi-Core and Many-Core Architectures11
A Black-Box Fork-Join Latency Prediction Model for Data-Intensive Applications11
Accelerating Deep Learning Inference via Model Parallelism and Partial Computation Offloading11
Optimizing Streaming Parallelism on Heterogeneous Many-Core Architectures11
HiFlash: Communication-Efficient Hierarchical Federated Learning With Adaptive Staleness Control and Heterogeneity-Aware Client-Edge Association11
Performance Optimization for Relative-Error-Bounded Lossy Compression on Scientific Data11
A Pessimistic Fault Diagnosability of Large-Scale Connected Networks via Extra Connectivity11
NITI: Training Integer Neural Networks Using Integer-Only Arithmetic11
Why Dataset Properties Bound the Scalability of Parallel Machine Learning Training Algorithms11
0.034308195114136