IEEE Computer Architecture Letters

Papers
(The TQCC of IEEE Computer Architecture Letters is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-05-01 to 2025-05-01.)
ArticleCitations
Learned Performance Model for SSD29
Speculative Multi-Level Access in LSM Tree-Based KV Store26
Accelerating Programmable Bootstrapping Targeting Contemporary GPU Microarchitecture17
Toward Practical 128-Bit General Purpose Microarchitectures13
Characterization and Analysis of Text-to-Image Diffusion Models12
A Characterization of Generative Recommendation Models: Study of Hierarchical Sequential Transduction Unit11
SCALES: SCALable and Area-Efficient Systolic Accelerator for Ternary Polynomial Multiplication11
Decoupled SSD: Reducing Data Movement on NAND-Based Flash SSD11
SoCurity: A Design Approach for Enhancing SoC Security10
2021 Index IEEE Computer Architecture Letters Vol. 2010
Straw: A Stress-Aware WL-Based Read Reclaim Technique for High-Density NAND Flash-Based SSDs10
Scale-Model Simulation10
In-Memory Versioning (IMV)9
RouteReplies: Alleviating Long Latency in Many-Chip-Module GPUs9
Improving Energy-Efficiency of Capsule Networks on Modern GPUs9
A Flexible Embedding-Aware Near Memory Processing Architecture for Recommendation System9
A Case for In-Memory Random Scatter-Gather for Fast Graph Processing8
Reorder Buffer Contention: A Forward Speculative Interference Attack for Speculation Invariant Instructions8
Exploring the DIMM PIM Architecture for Accelerating Time Series Analysis8
Security Helper Chiplets: A New Paradigm for Secure Hardware Monitoring7
Mitigating Timing-Based NoC Side-Channel Attacks With LLC Remapping7
Exploiting Intel Advanced Matrix Extensions (AMX) for Large Language Model Inference7
QuArch: A Question-Answering Dataset for AI Agents in Computer Architecture7
DeMM: A Decoupled Matrix Multiplication Engine Supporting Relaxed Structured Sparsity6
NoHammer: Preventing Row Hammer With Last-Level Cache Management6
Accelerating Deep Reinforcement Learning via Phase-Level Parallelism for Robotics Applications6
Managing Prefetchers With Deep Reinforcement Learning5
LADIO: Leakage-Aware Direct I/O for I/O-Intensive Workloads5
High-Performance Winograd Based Accelerator Architecture for Convolutional Neural Network5
Data-Aware Compression of Neural Networks5
Memory-Centric MCM-GPU Architecture5
SparseLeakyNets: Classification Prediction Attack Over Sparsity-Aware Embedded Neural Networks Using Timing Side-Channel Information5
A Flexible Hybrid Interconnection Design for High-Performance and Energy-Efficient Chiplet-Based Systems4
Chopping off the Tail: Bounded Non-Determinism for Real-Time Accelerators4
Primate: A Framework to Automatically Generate Soft Processors for Network Applications4
Fast Performance Prediction for Efficient Distributed DNN Training4
Understanding the Implication of Non-Volatile Memory for Large-Scale Graph Neural Network Training4
Enhancing the Reach and Reliability of Quantum Annealers by Pruning Longer Chains4
Adaptive Web Browsing on Mobile Heterogeneous Multi-cores4
FPGA-Accelerated Data Preprocessing for Personalized Recommendation Systems4
PreGNN: Hardware Acceleration to Take Preprocessing Off the Critical Path in Graph Neural Networks4
ZoneBuffer: An Efficient Buffer Management Scheme for ZNS SSDs4
Guard Cache: Creating Noisy Side-Channels4
A Quantum Computer Trusted Execution Environment3
Energy-Efficient Bayesian Inference Using Bitstream Computing3
PINSim: A Processing In- and Near-Sensor Simulator to Model Intelligent Vision Sensors3
Accelerators & Security: The Socket Approach3
Exploring Volatile FPGAs Potential for Accelerating Energy-Harvesting IoT Applications3
T-CAT: Dynamic Cache Allocation for Tiered Memory Systems With Memory Interleaving3
Characterization and Analysis of Deep Learning for 3D Point Cloud Analytics3
SSE: Security Service Engines to Accelerate Enclave Performance in Secure Multicore Processors3
Architectural Implications of GNN Aggregation Programming Abstractions3
Enhancing DNN Training Efficiency Via Dynamic Asymmetric Architecture2
A Case Study of a DRAM-NVM Hybrid Memory Allocator for Key-Value Stores2
Analyzing and Exploiting Memory Hierarchy Parallelism With MLP Stacks2
Unleashing the Potential of PIM: Accelerating Large Batched Inference of Transformer-Based Generative Models2
Redundant Array of Independent Memory Devices2
IntervalSim++: Enhanced Interval Simulation for Unbalanced Processor Designs2
Reducing the Silicon Area Overhead of Counter-Based Rowhammer Mitigations2
SEMS: Scalable Embedding Memory System for Accelerating Embedding-Based DNNs2
Characterization and Implementation of Radar System Applications on a Reconfigurable Dataflow Architecture2
Hungarian Qubit Assignment for Optimized Mapping of Quantum Circuits on Multi-Core Architectures2
Computational CXL-Memory Solution for Accelerating Memory-Intensive Applications2
Cost-Effective Extension of DRAM-PIM for Group-Wise LLM Quantization2
Halis: A Hardware-Software Co-designed Near-Cache Accelerator for Graph Pattern Mining2
Accelerating Page Migrations in Operating Systems With Intel DSA2
DRAM-CAM: General-Purpose Bit-Serial Exact Pattern Matching2
gem5-accel: A Pre-RTL Simulation Toolchain for Accelerator Architecture Validation2
Overcoming Memory Capacity Wall of GPUs With Heterogeneous Memory Stack2
R.I.P. Geomean Speedup Use Equal-Work (Or Equal-Time) Harmonic Mean Speedup Instead2
Approximate Multiplier Design With LFSR-Based Stochastic Sequence Generators for Edge AI2
Characterization and Analysis of the 3D Gaussian Splatting Rendering Pipeline2
HBM3 RAS: Enhancing Resilience at Scale2
FullPack: Full Vector Utilization for Sub-Byte Quantized Matrix-Vector Multiplication on General Purpose CPUs2
A First-Order Model to Assess Computer Architecture Sustainability2
FPGA-Based AI Smart NICs for Scalable Distributed AI Training Systems2
Direct-Coding DNA With Multilevel Parallelism2
0.039930105209351