Parallel Computing

Papers
(The TQCC of Parallel Computing is 5. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-11-01 to 2025-11-01.)
ArticleCitations
Parallel multi-view HEVC for heterogeneously embedded cluster system109
Heterogeneous sparse matrix–vector multiplication via compressed sparse row format42
Editorial Board39
GPU accelerated parallel reliability-guided digital volume correlation with automatic seed selection based on 3D SIFT39
Porting hypre to heterogeneous computer architectures: Strategies and experiences34
Using long vector extensions for MPI reductions30
Measurement and analysis of GPU-accelerated applications with HPCToolkit30
Integrating FPGA-based hardware acceleration with relational databases28
On revisiting energy and performance in microservices applications: A cloud elasticity-driven approach26
Enabling GPU accelerated computing in the SUNDIALS time integration library24
Toward performance-portable PETSc for GPU-based exascale systems23
NPDP benchmark suite for the evaluation of the effectiveness of automatic optimizing compilers20
A parallel non-convex approximation framework for risk parity portfolio design18
Editorial Board18
Octopus-DF: Unified DataFrame-based cross-platform data analytic system17
Mobilizing underutilized storage nodes via job path: A job-aware file striping approach16
Implementation and evaluation of MPI 4.0 partitioned communication libraries15
Distributed consensus-based estimation of the leading eigenvalue of a non-negative irreducible matrix14
Editorial on Advances in High Performance Programming12
Task graph-based performance analysis of parallel-in-time methods12
Adaptively parallel runtime verification based on distributed network for temporal properties10
Evaluating MPI resource usage summary statistics9
OF-WFBP: A near-optimal communication mechanism for tensor fusion in distributed deep learning9
EESF: Energy-efficient scheduling framework for deadline-constrained workflows with computation speed estimation method in cloud8
Parallel optimization and application of unstructured sparse triangular solver on new generation of Sunway architecture8
C-Lop: Accurate contention-based modeling of MPI concurrent communication8
New YARN sharing GPU based on graphics memory granularity scheduling8
Editorial Board8
ParVoro++: A scalable parallel algorithm for constructing 3D Voronoi tessellations based on kd-tree decomposition7
Optimizing convolutional neural networks on multi-core vector accelerator7
Routing brain traffic through the von Neumann bottleneck: Efficient cache usage in spiking neural network simulation code on general purpose computers7
Special issue of Selected Papers from EuroMPI/USA 20207
Efficient parallel reduction of bandwidth for symmetric matrices7
Using Java to create and analyze models of parallel computing systems6
GPU acceleration of Levenshtein distance computation between long strings6
Multi-level parallelism optimization for two-dimensional convolution vectorization method on multi-core vector accelerator6
The BondMachine, a moldable computer architecture6
PPS: Fair and efficient black-box scheduling for multi-tenant GPU clusters6
Editorial Board6
A survey of software techniques to emulate heterogeneous memory systems in high-performance computing5
Tausch: A halo exchange library for large heterogeneous computing systems using MPI, OpenCL, and CUDA5
Byzantine-tolerant detection of causality: There is no holy grail5
Spatial- and time- division multiplexing in CNN accelerator5
Targeting performance and user-friendliness: GPU-accelerated finite element computation with automated code generation in FEniCS5
A sleek lock-free hash map in an ERA of safe memory reclamation methods5
Editorial Board5
0.37416887283325