Parallel Computing

Papers
(The TQCC of Parallel Computing is 4. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2022-06-01 to 2026-06-01.)
Article | Citations
Editorial Board | 151
Parallel multi-view HEVC for heterogeneously embedded cluster system | 45
Heterogeneous sparse matrix–vector multiplication via compressed sparse row format | 36
Integrating FPGA-based hardware acceleration with relational databases | 29
NPDP benchmark suite for the evaluation of the effectiveness of automatic optimizing compilers | 24
A parallel non-convex approximation framework for risk parity portfolio design | 18
Editorial Board | 16
Mobilizing underutilized storage nodes via job path: A job-aware file striping approach | 14
LSHDP: Locally sharded heterogeneous data parallel for distributed deep learning | 14
C-Lop: Accurate contention-based modeling of MPI concurrent communication | 11
EESF: Energy-efficient scheduling framework for deadline-constrained workflows with computation speed estimation method in cloud | 11
Evaluating SYCL as a unified programming model for heterogeneous systems | 11
Distributed consensus-based estimation of the leading eigenvalue of a non-negative irreducible matrix | 11
Editorial on Advances in High Performance Programming | 11
Task graph-based performance analysis of parallel-in-time methods | 10
Adaptively parallel runtime verification based on distributed network for temporal properties | 9
New YARN sharing GPU based on graphics memory granularity scheduling | 7
Routing brain traffic through the von Neumann bottleneck: Efficient cache usage in spiking neural network simulation code on general purpose computers | 7
ShyLU-node: On-node scalable solvers and preconditioners: Recent progress and current performance | 7
Editorial Board | 7
OF-WFBP: A near-optimal communication mechanism for tensor fusion in distributed deep learning | 7
Parallel optimization and application of unstructured sparse triangular solver on new generation of Sunway architecture | 7
Special issue of Selected Papers from EuroMPI/USA 2020 | 6
Optimizing convolutional neural networks on multi-core vector accelerator | 6
ParVoro++: A scalable parallel algorithm for constructing 3D Voronoi tessellations based on kd-tree decomposition | 6
Efficient parallel reduction of bandwidth for symmetric matrices | 6
Editorial Board | 5
Multi-level parallelism optimization for two-dimensional convolution vectorization method on multi-core vector accelerator | 5
PPS: Fair and efficient black-box scheduling for multi-tenant GPU clusters | 5
Using Java to create and analyze models of parallel computing systems | 5
GPU acceleration of Levenshtein distance computation between long strings | 5
Targeting performance and user-friendliness: GPU-accelerated finite element computation with automated code generation in FEniCS | 5
Tausch: A halo exchange library for large heterogeneous computing systems using MPI, OpenCL, and CUDA | 5
A survey of software techniques to emulate heterogeneous memory systems in high-performance computing | 4
Byzantine-tolerant detection of causality: There is no holy grail | 4
A lightweight semi-centralized strategy for the massive parallelization of branching algorithms | 4
Spatial- and time-division multiplexing in CNN accelerator | 4
Editorial Board | 4
Distributed software defined network-based fog to fog collaboration scheme | 4
A sleek lock-free hash map in an ERA of safe memory reclamation methods | 4
Lifeline-based load balancing schemes for Asynchronous Many-Task runtimes in clusters | 4
Accelerating the scheduling of the network resources of the next-generation optical data centers | 4