International Journal of High Performance Computing Applications

Papers
(The median citation count of International Journal of High Performance Computing Applications is 1. The table below lists the papers above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-02-01 to 2025-02-01.)
Article	Citations
Accelerated dynamic data reduction using spatial and temporal properties	100
Advances in ArborX to support exascale applications	78
MAGMA: Enabling exascale performance with accelerated BLAS and LAPACK for diverse GPU architectures	63
Accelerated execution via eager-release of dependencies in task-based workflows	57
Non-smooth Bayesian optimization in tuning scientific applications	37
An integrated three-dimensional aeromechanical analysis for the prediction of stresses on modern coaxial rotors	36
Versatile software-defined HPC and cloud clusters on Alps supercomputer for diverse workflows	34
Employing artificial intelligence to steer exascale workflows with colmena	32
Cache-optimized and low-overhead implementations of additive Schwarz methods for high-order FEM multigrid computations	30
Guest Editor’s note: Special issue on challenges and solutions for porting applications to next-generation high performance computing systems	29
INDIANA—In-Network Distributed Infrastructure for Advanced Network Applications	27
A study on the performance of distributed training of data-driven CFD simulations	25
Graph neural networks for detecting anomalies in scientific workflows	21
AMReX: Block-structured adaptive mesh refinement for multiphysics applications	21
TwoFold: Highly accurate structure and affinity prediction for protein-ligand complexes from sequences	19
Malleability techniques applications in high-performance computing	19
Very fast finite element Poisson solvers on lower precision accelerator hardware: A proof of concept study for Nvidia Tesla V100	19
Performance analysis of relaxation Runge–Kutta methods	17
UMap: An application-oriented user level memory mapping library	16
AI-driven multiscale simulations illuminate mechanisms of SARS-CoV-2 spike dynamics	15
PeleC: An adaptive mesh refinement solver for compressible reacting flows	14
Dynamic spawning of MPI processes applied to malleability	12
Advancements of PAPI for the exascale generation	12
Visualization at exascale: Making it all work with VTK-m	11
Enhancing data locality of the conjugate gradient method for high-order matrix-free finite-element implementations	11
Guest editors note: Special issue on clusters, clouds, and data for scientific computing	11
Special issue: Introduction	10
Highly efficient lattice Boltzmann multiphase simulations of immiscible fluids at high-density ratios on CPUs and GPUs through code generation	9
Heterogeneous programming using OpenMP and CUDA/HIP for hybrid CPU-GPU scientific applications	8
PaRSEC: Scalability, flexibility, and hybrid architecture support for task-based applications in ECP	8
Fast truncated SVD of sparse and dense matrices on graphics processors	7
Acceleration of a parallel BDDC solver by using graphics processing units on subdomains	6
Special Issue Introduction: The Gordon Bell Special Prize for HPC-Based COVID-19 Research Finalists	6
Portable, heterogeneous ensemble workflows at scale using libEnsemble	6
ZFP: A compressed array representation for numerical computations	6
The ECP ALPINE project: In situ and post hoc visualization infrastructure and analysis capabilities for exascale	6
Exploiting mesh structure to improve multigrid performance for saddle-point problems	5
Running ahead of evolution—AI-based simulation for predicting future high-risk SARS-CoV-2 variants	5
Automatizing the creation of specialized high-performance computing containers	5
Refining HPCToolkit for application performance analysis at exascale	5
Black-box statistical prediction of lossy compression ratios for scientific data	5
Enhancing scalability of a matrix-free eigensolver for studying many-body localization	5
ECP libraries and tools: An overview	5
Combining multitask and transfer learning with deep Gaussian processes for autotuning-based performance engineering	5
Large-scale ab initio simulation of light–matter interaction at the atomic scale in Fugaku	5
Recovering single precision accuracy from Tensor Cores while surpassing the FP32 theoretical peak performance	5
Intelligent resolution: Integrating Cryo-EM with AI-driven multi-resolution simulations to observe the severe acute respiratory syndrome coronavirus-2 replication-transcription machinery in action	5
Accelerating atmospheric physics parameterizations using graphics processing units	4
Experience and analysis of scalable high-fidelity computational fluid dynamics on modular supercomputing architectures	4
Unprecedented cloud resolution in a GPU-enabled full-physics atmospheric climate simulation on OLCF’s summit supercomputer	4
Development of NCL equivalent serial and parallel python routines for meteorological data analysis	4
Efficient exascale discretizations: High-order finite element methods	4
Globus service enhancements for exascale applications and facilities	4
Compressed basis GMRES on high-performance graphics processing units	3
Orchestration of materials science workflows for heterogeneous resources at large scale	3
Parallel encryption of input and output data for HPC applications	3
A massively parallel time-domain coupled electrodynamics–micromagnetics solver	3
Increased space-parallelism via time-simultaneous Newton-multigrid methods for nonstationary nonlinear PDE problems	3
HipBone: A performance-portable graphics processing unit-accelerated C++ version of the NekBone benchmark	3
Performance comparison of the A-grid and C-grid shallow-water models on icosahedral grids	3
IO-aware Job-Scheduling: Exploiting the Impacts of Workload Characterizations to select the Mapping Strategy	3
Online data analysis and reduction: An important Co-design motif for extreme-scale computers	3
Coupling of regional geophysics and local soil-structure models in the EQSIM fault-to-structure earthquake simulation framework	3
An analytical performance model of generalized hierarchical scheduling	2
Designing and prototyping extensions to the Message Passing Interface in MPICH	2
A survey of numerical linear algebra methods utilizing mixed-precision arithmetic	2
Experiences with nested parallelism in task-parallel applications using malleable BLAS on multicore processors	2
Efficiency and scalability of fully-resolved fluid-particle simulations on heterogeneous CPU-GPU architectures	2
HDF5 in the exascale era: Delivering efficient and scalable parallel I/O for exascale applications	2
Resiliency in numerical algorithm design for extreme scale simulations	2
Data-driven scalable pipeline using national agent-based models for real-time pandemic response and decision support	2
High-throughput virtual laboratory for drug discovery using massive datasets	2
Coarsening and parallelism with reduction multigrids for hyperbolic Boltzmann transport	2
Myths and legends in high-performance computing	2
Numerical eigen-spectrum slicing, accurate orthogonal eigen-basis, and mixed-precision eigenvalue refinement using OpenMP data-dependent tasks and accelerator offload	2
A Survey of Graph Comparison Methods with Applications to Nondeterminism in High-Performance Computing	2
Modeling, evaluating, and orchestrating heterogeneous environmental leverages for large-scale data center management	2
Towards exascale for wind energy simulations	2
GenSLMs: Genome-scale language models reveal SARS-CoV-2 evolutionary dynamics	2
A GPU-accelerated adaptive FSAI preconditioner for massively parallel simulations	2
Corrigendum to large-scale direct numerical simulations of turbulence using GPUs and modern Fortran	1
Batched sparse direct solver design and evaluation in SuperLU_DIST	1
Preparing MPICH for exascale	1
Accelerating physics simulations with tensor processing units: An inundation modeling example	1
Clacc: OpenACC for C/C++ in Clang	1
NUMA-aware parallel sparse LU factorization for SPICE-based circuit simulators on ARM multi-core processors	1
Corrigendum to ‘Unprecedented cloud resolution in a GPU-enabled full-physics atmospheric climate simulation on OLCF’s summit supercomputer’	1
Enabling particle applications for exascale computing platforms	1
A survey of software implementations used by application codes in the Exascale Computing Project	1
Predicting optimal sparse general matrix-matrix multiplication algorithm on GPUs	1
Resilience and fault tolerance in high-performance computing for numerical weather and climate prediction	1
Efficient implementation of low-order-precision smoothed particle hydrodynamics	1
General framework for re-assuring numerical reliability in parallel Krylov solvers: A case of bi-conjugate gradient stabilized methods	1
A runtime based comparison of highly tuned lattice Boltzmann and finite difference solvers	1
Detecting interference between applications and improving the scheduling using malleable application clones	1