OOIR: Observatory of International Research

Papers

(The TQCC of Parallel Computing is 4. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-06-01 to 2026-06-01.)

Article	Citations
Editorial Board	151
Parallel multi-view HEVC for heterogeneously embedded cluster system	45
Heterogeneous sparse matrix–vector multiplication via compressed sparse row format	36
Integrating FPGA-based hardware acceleration with relational databases	29
NPDP benchmark suite for the evaluation of the effectiveness of automatic optimizing compilers	24
A parallel non-convex approximation framework for risk parity portfolio design	18
Editorial Board	16
Mobilizing underutilized storage nodes via job path: A job-aware file striping approach	14
LSHDP: Locally sharded heterogeneous data parallel for distributed deep learning	14
C-Lop: Accurate contention-based modeling of MPI concurrent communication	11
EESF: Energy-efficient scheduling framework for deadline-constrained workflows with computation speed estimation method in cloud	11
Evaluating SYCL as a unified programming model for heterogeneous systems	11
Distributed consensus-based estimation of the leading eigenvalue of a non-negative irreducible matrix	11
Editorial on Advances in High Performance Programming	11
Task graph-based performance analysis of parallel-in-time methods	10
Adaptively parallel runtime verification based on distributed network for temporal properties	9
New YARN sharing GPU based on graphics memory granularity scheduling	7
Routing brain traffic through the von Neumann bottleneck: Efficient cache usage in spiking neural network simulation code on general purpose computers	7
ShyLU-node: On-node scalable solvers and preconditioners: Recent progress and current performance	7
Editorial Board	7
OF-WFBP: A near-optimal communication mechanism for tensor fusion in distributed deep learning	7
Parallel optimization and application of unstructured sparse triangular solver on new generation of Sunway architecture	7
Special issue of Selected Papers from EuroMPI/USA 2020	6
Optimizing convolutional neural networks on multi-core vector accelerator	6
ParVoro++: A scalable parallel algorithm for constructing 3D Voronoi tessellations based on kd-tree decomposition	6

Efficient parallel reduction of bandwidth for symmetric matrices	6
Editorial Board	5
Multi-level parallelism optimization for two-dimensional convolution vectorization method on multi-core vector accelerator	5
PPS: Fair and efficient black-box scheduling for multi-tenant GPU clusters	5
Using Java to create and analyze models of parallel computing systems	5
GPU acceleration of Levenshtein distance computation between long strings	5
Targeting performance and user-friendliness: GPU-accelerated finite element computation with automated code generation in FEniCS	5
Tausch: A halo exchange library for large heterogeneous computing systems using MPI, OpenCL, and CUDA	5
A survey of software techniques to emulate heterogeneous memory systems in high-performance computing	4
Byzantine-tolerant detection of causality: There is no holy grail	4
A lightweight semi-centralized strategy for the massive parallelization of branching algorithms	4
Spatial- and time- division multiplexing in CNN accelerator	4
Editorial Board	4
Distributed software defined network-based fog to fog collaboration scheme	4
A sleek lock-free hash map in an ERA of safe memory reclamation methods	4
Lifeline-based load balancing schemes for Asynchronous Many-Task runtimes in clusters	4
Accelerating the scheduling of the network resources of the next-generation optical data centers	4