IEEE Transactions on Parallel and Distributed Systems

Papers
(The TQCC of IEEE Transactions on Parallel and Distributed Systems is 12. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-11-01 to 2024-11-01.)
ArticleCitations
A Scalable Multi-Layer PBFT Consensus for Blockchain243
Fast Adaptive Task Offloading in Edge Computing Based on Meta Reinforcement Learning241
Self-Balancing Federated Learning With Global Imbalanced Data in Mobile Systems231
Kokkos 3: Programming Model Extensions for the Exascale Era183
Online Collaborative Data Caching in Edge Computing172
Biscotti: A Blockchain System for Private and Secure Federated Learning163
Decentralized Edge Intelligence: A Dynamic Resource Allocation Framework for Hierarchical Federated Learning161
Towards Fair and Privacy-Preserving Federated Deep Models151
Multi-Task Federated Learning for Personalised Deep Neural Networks in Edge Computing148
Cost-Effective App Data Distribution in Edge Computing132
Energy-Efficient Offloading for DNN-Based Smart IoT Systems in Cloud-Edge Environments125
Recent Advances of Resource Allocation in Network Function Virtualization124
Blockchain Assisted Decentralized Federated Learning (BLADE-FL): Performance Analysis and Resource Allocation109
Distributed and Dynamic Service Placement in Pervasive Edge Computing Networks107
The Deep Learning Compiler: A Comprehensive Survey102
Communication-Efficient Federated Learning With Compensated Overlap-FedAvg101
Multi-Agent Imitation Learning for Pervasive Edge Computing: A Decentralized Computation Offloading Algorithm99
Auditing Cache Data Integrity in the Edge Computing Environment98
AUCTION: Automated and Quality-Aware Client Selection Framework for Efficient Federated Learning92
Modeling and Optimization of Performance and Cost of Serverless Applications84
Distributed Task Migration Optimization in MEC by Extending Multi-Agent Deep Reinforcement Learning Approach80
Multi-Hop Multi-Task Partial Computation Offloading in Collaborative Edge Computing80
Distributed and Collective Deep Reinforcement Learning for Computation Offloading: A Practical Perspective79
Offloading Tasks With Dependency and Service Caching in Mobile Edge Computing77
CASpMV: A Customized and Accelerative SpMV Framework for the Sunway TaihuLight75
Proof of Federated Learning: A Novel Energy-Recycling Consensus Algorithm74
A Potential Game Theoretic Approach to Computation Offloading Strategy Optimization in End-Edge-Cloud Computing70
FRATO: Fog Resource Based Adaptive Task Offloading for Delay-Minimizing IoT Service Provisioning70
Elastic Scheduling for Microservice Applications in Clouds68
POCLib: A High-Performance Framework for Enabling Near Orthogonal Processing on Compression67
Min-Max Cost Optimization for Efficient Hierarchical Federated Learning in Wireless Edge Networks67
Energy-Aware Inference Offloading for DNN-Driven Applications in Mobile Edge Clouds66
On Consortium Blockchain Consistency: A Queueing Network Model Approach64
Blockchain at the Edge: Performance of Resource-Constrained IoT Networks64
Taskflow: A Lightweight Parallel and Heterogeneous Task Graph Computing System63
CSEdge: Enabling Collaborative Edge Storage for Multi-Access Edge Computing Based on Blockchain62
COSCO: Container Orchestration Using Co-Simulation and Gradient Based Optimization for Fog Computing Environments61
VQL: Efficient and Verifiable Cloud Query Services for Blockchain Systems61
Towards Efficient Scheduling of Federated Mobile Devices Under Computational and Statistical Heterogeneity60
Privacy-Preserving Multi-Keyword Searchable Encryption for Distributed Systems60
Distributed Training of Deep Learning Models: A Taxonomic Perspective58
Adaptive Resource Efficient Microservice Deployment in Cloud-Edge Continuum57
TODG: Distributed Task Offloading With Delay Guarantees for Edge Computing56
Thermal Prediction for Efficient Energy Management of Clouds Using Machine Learning56
Cryptomining Detection in Container Clouds Using System Calls and Explainable Machine Learning55
Performance and Cost-Efficient Spark Job Scheduling Based on Deep Reinforcement Learning in Cloud Computing Environments55
Mechanisms for Resource Allocation and Pricing in Mobile Edge Computing Systems54
IPPTS: An Efficient Algorithm for Scientific Workflow Scheduling in Heterogeneous Computing Systems50
e-PoS: Making Proof-of-Stake Decentralized and Fair50
Joint Task Scheduling and Containerizing for Efficient Edge Computing50
A Game-Based Approach for Cost-Aware Task Assignment With QoS Constraint in Collaborative Edge and Cloud Environments50
Transformations of High-Level Synthesis Codes for High-Performance Computing49
FedGraph: Federated Graph Learning With Intelligent Sampling48
DeepSlicing: Collaborative and Adaptive CNN Inference With Low Latency47
A Quantum Approach Towards the Adaptive Prediction of Cloud Workloads47
Elastic and Reliable Bandwidth Reservation Based on Distributed Traffic Monitoring and Control46
Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model44
Deep Reinforcement Learning for Load-Balancing Aware Network Control in IoT Edge Systems44
Horus: Interference-Aware and Prediction-Based Scheduling in Deep Learning Systems44
Data, User and Power Allocations for Caching in Multi-Access Edge Computing44
DL2: A Deep Learning-Driven Scheduler for Deep Learning Clusters43
PQC Acceleration Using GPUs: FrodoKEM, NewHope, and Kyber43
Online Learning for Distributed Computation Offloading in Wireless Powered Mobile Edge Computing Networks42
Algorithm-Based Fault Tolerance for Convolutional Neural Networks42
ADRL: A Hybrid Anomaly-Aware Deep Reinforcement Learning-Based Resource Scaling in Clouds41
Adaptive and Efficient Resource Allocation in Cloud Datacenters Using Actor-Critic Deep Reinforcement Learning41
Maximizing User Service Satisfaction for Delay-Sensitive IoT Applications in Edge Computing41
Context-Aware Online Client Selection for Hierarchical Federated Learning39
On the Effective Parallelization and Near-Optimal Deployment of Service Function Chains39
Congestion-Balanced and Welfare-Maximized Charging Strategies for Electric Vehicles38
VeriML: Enabling Integrity Assurances and Fair Payments for Machine Learning as a Service38
Network-Aware Locality Scheduling for Distributed Data Operators in Data Centers38
A Decentralized Federated Learning Framework via Committee Mechanism With Convergence Guarantee38
Cost-Efficient Workflow Scheduling Algorithm for Applications With Deadline Constraint on Heterogeneous Clouds37
Flexible Clustered Federated Learning for Client-Level Data Distribution Shift37
Distributed Adaptive Consensus Tracking Control for Multi-Agent System With Communication Constraints37
Burst Load Evacuation Based on Dispatching and Scheduling In Distributed Edge Networks37
Hierarchical Multi-Agent Optimization for Resource Allocation in Cloud Computing37
Dependent Function Embedding for Distributed Serverless Edge Computing36
Coordinated Batching and DVFS for DNN Inference on GPU Accelerators35
Topology-Aware Neural Model for Highly Accurate QoS Prediction35
Liquid: Intelligent Resource Estimation and Network-Efficient Scheduling for Deep Learning Jobs on Distributed GPU Clusters35
Incentive Mechanism Design for Joint Resource Allocation in Blockchain-Based Federated Learning35
Learning Spatiotemporal Failure Dependencies for Resilient Edge Computing Services35
VPIC 2.0: Next Generation Particle-in-Cell Simulations35
Large-Scale Analysis of Docker Images and Performance Implications for Container Storage Systems34
Multi-Swarm Co-Evolution Based Hybrid Intelligent Optimization for Bi-Objective Multi-Workflow Scheduling in the Cloud34
GossipFL: A Decentralized Federated Learning Framework With Sparsified and Adaptive Communication34
Optimizing Depthwise Separable Convolution Operations on GPUs33
An In-Depth Study of Microservice Call Graph and Runtime Performance32
Completely Independent Spanning Trees on BCCC Data Center Networks With an Application to Fault-Tolerant Routing32
Multi-GPU Design and Performance Evaluation of Homomorphic Encryption on GPU Clusters31
GPU-Accelerated Real-Time Stereo Estimation With Binary Neural Network31
LightChain: Scalable DHT-Based Blockchain30
Accelerating Deep Learning Inference via Model Parallelism and Partial Computation Offloading30
Elastic Resource Allocation Against Imbalanced Transaction Assignments in Sharding-Based Permissioned Blockchains30
GPGPU Performance Estimation With Core and Memory Frequency Scaling30
Towards Revenue-Driven Multi-User Online Task Offloading in Edge Computing29
Towards Efficient and Stable K-Asynchronous Federated Learning With Unbounded Stale Gradients on Non-IID Data29
LOCUS: User-Perceived Delay-Aware Service Placement and User Allocation in MEC Environment28
Differentially Private Byzantine-Robust Federated Learning28
Constructing Completely Independent Spanning Trees in Data Center Network Based on Augmented Cube28
Rusty: Runtime Interference-Aware Predictive Monitoring for Modern Multi-Tenant Systems28
GPU Tensor Cores for Fast Arithmetic Reductions27
BOSSA: A Decentralized System for Proofs of Data Retrievability and Replication27
Endurance-Aware Mapping of Spiking Neural Networks to Neuromorphic Hardware27
Cooperative Edge Caching Based on Temporal Convolutional Networks27
O3BNN-R: An Out-of-Order Architecture for High-Performance and Regularized BNN Inference27
Efficient Virtual Network Embedding of Cloud-Based Data Center Networks into Optical Networks27
K-Athena: A Performance Portable Structured Grid Finite Volume Magnetohydrodynamics Code27
Improving Federated Learning With Quality-Aware User Incentive and Auto-Weighted Model Aggregation27
Adaptive Federated Deep Reinforcement Learning for Proactive Content Caching in Edge Computing26
Efficient Distributed Approaches to Core Maintenance on Large Dynamic Graphs26
EdgeDR: An Online Mechanism Design for Demand Response in Edge Clouds26
Joint SFC Deployment and Resource Management in Heterogeneous Edge for Latency Minimization26
High-Performance Routing With Multipathing and Path Diversity in Ethernet and HPC Networks25
Exploring Data Analytics Without Decompression on Embedded GPU Systems24
An Efficient Parallel Secure Machine Learning Framework on GPUs24
Petrel: Heterogeneity-Aware Distributed Deep Learning Via Hybrid Synchronization24
Elastic Deep Learning in Multi-Tenant GPU Clusters24
Decentralized Application Placement in Fog Computing24
Benzene: Scaling Blockchain With Cooperation-Based Sharding23
Efficient Parallelism of Post-Quantum Signature Scheme SPHINCS23
Monodirectional Evolutional Symport Tissue P Systems With Promoters and Cell Division23
Auction-Based Cluster Federated Learning in Mobile Edge Computing Systems23
Parallel Blockwise Knowledge Distillation for Deep Neural Network Compression23
A Bifactor Approximation Algorithm for Cloudlet Placement in Edge Computing23
Cost-Efficient Server Configuration and Placement for Mobile Edge Computing22
HiFlash: Communication-Efficient Hierarchical Federated Learning With Adaptive Staleness Control and Heterogeneity-Aware Client-Edge Association22
Merak: An Efficient Distributed DNN Training Framework With Automated 3D Parallelism for Giant Foundation Models22
Parallelization and Optimization of NSGA-II on Sunway TaihuLight System22
Bi-Objective Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Performance and Energy Through Workload Distribution22
Microservice Deployment in Edge Computing Based on Deep Q Learning22
Privacy-Preserving Efficient Federated-Learning Model Debugging22
Personalized Edge Intelligence via Federated Self-Knowledge Distillation21
A Survey of System Architectures and Techniques for FPGA Virtualization21
RENDA: Resource and Network Aware Data Placement Algorithm for Periodic Workloads in Cloud21
COOPER-SCHED: A Cooperative Scheduling Framework for Mobile Edge Computing with Expected Deadline Guarantee21
High-Quality Shared-Memory Graph Partitioning21
Efficient Data Loader for Fast Sampling-Based GNN Training on Large Graphs20
MCDS: AI Augmented Workflow Scheduling in Mobile Edge Cloud Computing Systems20
vPipe: A Virtualized Acceleration System for Achieving Efficient and Scalable Pipeline Parallel DNN Training20
Mobility-Aware Offloading and Resource Allocation for Distributed Services Collaboration20
Parallel and Asynchronous Smart Contract Execution20
A Practical and Efficient Bidirectional Access Control Scheme for Cloud-Edge Data Sharing20
Anomaly Detection and Anticipation in High Performance Computing Systems20
Design and Performance Characterization of RADICAL-Pilot on Leadership-Class Platforms20
Scalable, Multi-Constraint, Complex-Objective Graph Partitioning20
Leveraging Deep Reinforcement Learning With Attention Mechanism for Virtual Network Function Placement and Routing20
MG-WFBP: Merging Gradients Wisely for Efficient Communication in Distributed Deep Learning19
Efficient Function Queryable and Privacy Preserving Data Aggregation Scheme in Smart Grid19
A Runtime and Non-Intrusive Approach to Optimize EDP by Tuning Threads and CPU Frequency for OpenMP Applications19
Accelerating Geostatistical Modeling and Prediction With Mixed-Precision Computations: A High-Productivity Approach With PaRSEC19
Hamiltonian Paths of -cubes Avoiding Faulty Links and Passing Through Prescribed Linear Forests19
Accelerating Gossip-Based Deep Learning in Heterogeneous Edge Computing Platforms19
Eiffel: Efficient and Fair Scheduling in Adaptive Federated Learning19
HiTDL: High-Throughput Deep Learning Inference at the Hybrid Mobile Edge19
CNNPC: End-Edge-Cloud Collaborative CNN Inference With Joint Model Partition and Compression19
GOSH: Task Scheduling Using Deep Surrogate Models in Fog Computing Environments18
Trust: Triangle Counting Reloaded on GPUs18
Performance Analysis of Machine Learning Centered Workload Prediction Models for Cloud18
Cost-Effective Web Application Replication and Deployment in Multi-Cloud Environment18
cuNH: Efficient GPU Implementations of Post-Quantum KEM NewHope18
Co-Active: A Workload-Aware Collaborative Cache Management Scheme for NVMe SSDs18
Dissecting Tensor Cores via Microbenchmarks: Latency, Throughput and Numeric Behaviors18
Joint Coverage-Reliability for Budgeted Edge Application Deployment in Mobile Edge Computing Environment18
Scheduling Periodical Multi-Stage Jobs With Fuzziness to Elastic Cloud Resources18
Reliability-Aware Multi-Objective Memetic Algorithm for Workflow Scheduling Problem in Multi-Cloud System18
WindFlow: High-Speed Continuous Stream Processing With Parallel Building Blocks17
Data-Centric Client Selection for Federated Learning Over Distributed Edge Networks17
QShield: Protecting Outsourced Cloud Data Queries With Multi-User Access Control Based on SGX17
Towards Usable Cloud Storage Auditing17
FedMDS: An Efficient Model Discrepancy-Aware Semi-Asynchronous Clustered Federated Learning Framework17
Cuttlefish: Neural Configuration Adaptation for Video Analysis in Live Augmented Reality16
Joint Application Placement and Request Routing Optimization for Dynamic Edge Computing Service Management16
Practice of Streaming Processing of Dynamic Graphs: Concepts, Models, and Systems16
NITI: Training Integer Neural Networks Using Integer-Only Arithmetic16
HierFedML: Aggregator Placement and UE Assignment for Hierarchical Federated Learning in Mobile Edge Computing16
Silent-PIM: Realizing the Processing-in-Memory Computing With Standard Memory Requests16
TherMa-MiCs: Thermal-Aware Scheduling for Fault-Tolerant Mixed-Criticality Systems16
Overlapping Communication With Computation in Parameter Server for Scalable DL Training16
Reputation-aware Hedonic Coalition Formation for Efficient Serverless Hierarchical Federated Learning16
Adaptive DRL-based Virtual Machine Consolidation in Energy-Efficient Cloud Data Center16
Combinatorial BLAS 2.0: Scaling Combinatorial Algorithms on Distributed-Memory Systems16
Error-Compensated Sparsification for Communication-Efficient Decentralized Training in Edge Environment16
TDFL: Truth Discovery Based Byzantine Robust Federated Learning16
Incentive-Aware Autonomous Client Participation in Federated Learning15
CIA: A Collaborative Integrity Auditing Scheme for Cloud Data With Multi-Replica on Multi-Cloud Storage Providers15
Collaborative Intrusion Detection System for SDVN: A Fairness Federated Deep Learning Approach15
Achieving Fine-Grained Flow Management Through Hybrid Rule Placement in SDNs15
libEnsemble: A Library to Coordinate the Concurrent Evaluation of Dynamic Ensembles of Calculations15
funcX: Federated Function as a Service for Science15
Octans: Optimal Placement of Service Function Chains in Many-Core Systems15
Reliability and Confidentiality Co-Verification for Parallel Applications in Distributed Systems15
PISTIS: An Event-Triggered Real-Time Byzantine-Resilient Protocol Suite15
A Pessimistic Fault Diagnosability of Large-Scale Connected Networks via Extra Connectivity15
Efficient Forwarding Anomaly Detection in Software-Defined Networks15
CAMIG: Concurrency-Aware Live Migration Management of Multiple Virtual Machines in SDN-Enabled Clouds15
High Performance Simulation of Spiking Neural Network on GPGPUs15
Learning-Driven Interference-Aware Workload Parallelization for Streaming Applications in Heterogeneous Cluster15
Resilient Real-Valued Consensus in Spite of Mobile Malicious Agents on Directed Graphs15
GML: Efficiently Auto-Tuning Flink's Configurations Via Guided Machine Learning15
CoopEdge+: Enabling Decentralized, Secure and Cooperative Multi-Access Edge Computing Based on Blockchain15
Dynamic Load Balancing in Parallel Execution of Cellular Automata15
Fine-Grained Multi-Query Stream Processing on Integrated Architectures15
Online Pricing and Trading of Private Data in Correlated Queries15
Performant, Multi-Objective Scheduling of Highly Interleaved Task Graphs on Heterogeneous System on Chip Devices15
Millimeter-Scale and Billion-Atom Reactive Force Field Simulation on Sunway Taihulight14
Addictive Incentive Mechanism in Crowdsensing From the Perspective of Behavioral Economics14
TensorOpt: Exploring the Tradeoffs in Distributed DNN Training With Auto-Parallelism14
Decentralized Dual Proximal Gradient Algorithms for Non-Smooth Constrained Composite Optimization Problems14
Coflow Scheduling in Data Centers: Routing and Bandwidth Allocation14
Collaborative Heterogeneity-Aware OS Scheduler for Asymmetric Multicore Processors14
iMLBench: A Machine Learning Benchmark Suite for CPU-GPU Integrated Architectures14
Intermittent Fault Diagnosis of Split-Star Networks and its Applications14
DTransE: Distributed Translating Embedding for Knowledge Graph14
E2bird: Enhanced Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services14
TridentKV: A Read-Optimized LSM-Tree Based KV Store via Adaptive Indexing and Space-Efficient Partitioning14
Distributed Approaches to Butterfly Analysis on Large Dynamic Bipartite Graphs14
The Case for Strong Scaling in Deep Learning: Training Large 3D CNNs with Hybrid Parallelism14
Bridging the Gap between Deep Learning and Frustrated Quantum Spin System for Extreme-Scale Simulations on New Generation of Sunway Supercomputer14
A Generic Stochastic Model for Resource Availability in Fog Computing Environments14
Fairness-Aware VNF Sharing and Rate Coordination for High Efficient Service Scheduling14
LB-Chain: Load-Balanced and Low-Latency Blockchain Sharding via Account Migration13
Adaptive Vertical Federated Learning on Unbalanced Features13
Federated Learning With Nesterov Accelerated Gradient13
Improving the Performance of Deduplication-Based Storage Cache via Content-Driven Cache Management Methods13
LightFed: An Efficient and Secure Federated Edge Learning System on Model Splitting13
The PetscSF Scalable Communication Layer13
A GPU Acceleration Framework for Motif and Discord Based Pattern Mining13
Bandwidth-Aware Scheduling Repair Techniques in Erasure-Coded Clusters: Design and Analysis13
On the Benefits of Multiple Gossip Steps in Communication-Constrained Decentralized Federated Learning13
Accelerating Restarted GMRES With Mixed Precision Arithmetic13
Accelerating the Bron-Kerbosch Algorithm for Maximal Clique Enumeration Using GPUs13
Auto-GNAS: A Parallel Graph Neural Architecture Search Framework13
Parallel Training of Pre-Trained Models via Chunk-Based Dynamic Memory Management13
Phase-Aware Cache Partitioning to Target Both Turnaround Time and System Performance13
Securing Deployed Smart Contracts and DeFi With Distributed TEE Cluster13
Investigating the Adoption of Hybrid Encrypted Cloud Data Deduplication With Game Theory13
ReHy: A ReRAM-based Digital/Analog Hybrid PIM Architecture for Accelerating CNN Training13
ProScale: Proactive Autoscaling for Microservice With Time-Varying Workload at the Edge13
Preemptive and Low Latency Datacenter Scheduling via Lightweight Containers13
AESM2 Attribute-Based Encrypted Search for Multi-Owner and Multi-User Distributed Systems13
A Pattern-Based SpGEMM Library for Multi-Core and Many-Core Architectures13
Why Dataset Properties Bound the Scalability of Parallel Machine Learning Training Algorithms13
Energy-Efficient Hardware-Accelerated Synchronization for Shared-L1-Memory Multiprocessor Clusters13
FenceKV: Enabling Efficient Range Query for Key-Value Separation13
Look-up-Table Based Processing-in-Memory Architecture With Programmable Precision-Scaling for Deep Learning Applications13
HSA-Net: Hidden-State-Aware Networks for High-Precision QoS Prediction13
Evaluating Spatial Accelerator Architectures with Tiled Matrix-Matrix Multiplication13
0.056416034698486