International Journal of Computer Vision

Papers
(The H4-Index of International Journal of Computer Vision is 47. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-03-01 to 2024-03-01.)
ArticleCitations
Knowledge Distillation: A Survey919
The Open Images Dataset V4768
FairMOT: On the Fairness of Detection and Re-identification in Multiple Object Tracking570
BiSeNet V2: Bilateral Network with Guided Aggregation for Real-Time Semantic Segmentation535
Image Matching from Handcrafted to Deep Features: A Survey483
Deep Image Prior338
HOTA: A Higher Order Metric for Evaluating Multi-object Tracking315
Learning to Prompt for Vision-Language Models311
Rectifying Pseudo Label Learning via Uncertainty Estimation for Domain Adaptive Semantic Segmentation251
Beyond Brightening Low-light Images248
Scene Text Detection and Recognition: The Deep Learning Era211
Image Matching Across Wide Baselines: From Paper to Practice173
Human Action Recognition and Prediction: A Survey160
SDNet: A Versatile Squeeze-and-Decomposition Network for Real-Time Image Fusion158
The MVTec Anomaly Detection Dataset: A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection145
Weakly-supervised Semantic Guided Hashing for Social Image Retrieval135
MOTChallenge: A Benchmark for Single-Camera Multiple Target Tracking130
Attention Guided Low-Light Image Enhancement with a Large Scale Low-Light Simulation Dataset127
OCNet: Object Context for Semantic Segmentation118
EfficientPS: Efficient Panoptic Segmentation112
You Only Look Yourself: Unsupervised and Untrained Single Image Dehazing Neural Network101
Benchmarking Low-Light Image Enhancement and Beyond100
Deep Image Deblurring: A Survey98
Rescaling Egocentric Vision: Collection, Pipeline and Challenges for EPIC-KITCHENS-10096
Unsupervised Scale-Consistent Depth Learning from Video88
Semantic Hierarchy Emerges in Deep Generative Representations for Scene Synthesis82
Comparison of Full-Reference Image Quality Models for Optimization of Image Processing Systems81
Pix2Vox++: Multi-scale Context-aware 3D Object Reconstruction from Single and Multiple Images79
PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection77
Pixel-Wise Crowd Understanding via Synthetic Data73
LaSOT: A High-quality Large-scale Single Object Tracking Benchmark71
Unsupervised Deep Representation Learning for Real-Time Tracking69
Unified Quality Assessment of in-the-Wild Videos with Mixed Datasets Training69
On the Arbitrary-Oriented Object Detection: Classification Based Approaches Revisited67
JÂA-Net: Joint Facial Action Unit Detection and Face Alignment Via Adaptive Attention67
A Comprehensive Analysis of Weakly-Supervised Semantic Segmentation in Different Image Domains61
Curriculum Learning: A Survey61
Reference Pose Generation for Long-term Visual Localization via Learned Features and View Synthesis58
Pixel-in-Pixel Net: Towards Efficient Facial Landmark Detection in the Wild53
CLIP-Adapter: Better Vision-Language Models with Feature Adapters52
Adaptive Channel Selection for Robust Visual Object Tracking with Discriminative Correlation Filters52
Structure-Measure: A New Way to Evaluate Foreground Maps49
GhostNets on Heterogeneous Devices via Cheap Operations49
Explainability of Deep Vision-Based Autonomous Driving Systems: Review and Challenges49
Deformable Kernel Networks for Joint Image Filtering49
VPR-Bench: An Open-Source Visual Place Recognition Evaluation Framework with Quantifiable Viewpoint and Appearance Change48
ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond48
Learning Adaptive Attribute-Driven Representation for Real-Time RGB-T Tracking47
An Exploration of Embodied Visual Exploration47
AutoScale: Learning to Scale for Crowd Counting47
0.032564163208008