International Journal of Computer Vision

Papers
(The median citation count of International Journal of Computer Vision is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-11-01 to 2024-11-01.)
ArticleCitations
Knowledge Distillation: A Survey1385
BiSeNet V2: Bilateral Network with Guided Aggregation for Real-Time Semantic Segmentation819
FairMOT: On the Fairness of Detection and Re-identification in Multiple Object Tracking796
Learning to Prompt for Vision-Language Models667
Beyond Brightening Low-light Images392
Rectifying Pseudo Label Learning via Uncertainty Estimation for Domain Adaptive Semantic Segmentation304
SDNet: A Versatile Squeeze-and-Decomposition Network for Real-Time Image Fusion250
Human Action Recognition and Prediction: A Survey224
The MVTec Anomaly Detection Dataset: A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection208
Attention Guided Low-Light Image Enhancement with a Large Scale Low-Light Simulation Dataset168
MOTChallenge: A Benchmark for Single-Camera Multiple Target Tracking165
Benchmarking Low-Light Image Enhancement and Beyond153
CLIP-Adapter: Better Vision-Language Models with Feature Adapters150
OCNet: Object Context for Semantic Segmentation149
Deep Image Deblurring: A Survey146
Rescaling Egocentric Vision: Collection, Pipeline and Challenges for EPIC-KITCHENS-100146
You Only Look Yourself: Unsupervised and Untrained Single Image Dehazing Neural Network142
EfficientPS: Efficient Panoptic Segmentation142
PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection140
Unsupervised Scale-Consistent Depth Learning from Video126
Comparison of Full-Reference Image Quality Models for Optimization of Image Processing Systems115
Curriculum Learning: A Survey112
Semantic Hierarchy Emerges in Deep Generative Representations for Scene Synthesis97
On the Arbitrary-Oriented Object Detection: Classification Based Approaches Revisited93
Unified Quality Assessment of in-the-Wild Videos with Mixed Datasets Training86
ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond85
Explainability of Deep Vision-Based Autonomous Driving Systems: Review and Challenges83
3D-FUTURE: 3D Furniture Shape with TextURE81
VPR-Bench: An Open-Source Visual Place Recognition Evaluation Framework with Quantifiable Viewpoint and Appearance Change75
GhostNets on Heterogeneous Devices via Cheap Operations74
Pixel-in-Pixel Net: Towards Efficient Facial Landmark Detection in the Wild74
Context Autoencoder for Self-supervised Representation Learning73
Reference Pose Generation for Long-term Visual Localization via Learned Features and View Synthesis73
Learning Adaptive Attribute-Driven Representation for Real-Time RGB-T Tracking72
AutoScale: Learning to Scale for Crowd Counting69
Structure-Measure: A New Way to Evaluate Foreground Maps66
Vis-MVSNet: Visibility-Aware Multi-view Stereo Network66
3D Object Detection for Autonomous Driving: A Comprehensive Survey64
Countering Malicious DeepFakes: Survey, Battleground, and Horizon60
A Survey on Long-Tailed Visual Recognition59
Scale-Aware Domain Adaptive Faster R-CNN56
Adaptive Channel Selection for Robust Visual Object Tracking with Discriminative Correlation Filters55
Synthetic Humans for Action Recognition from Unseen Viewpoints55
Bridging Composite and Real: Towards End-to-End Deep Image Matting55
Occluded Video Instance Segmentation: A Benchmark54
Twin Contrastive Learning for Online Clustering53
An Exploration of Embodied Visual Exploration52
AdaFuse: Adaptive Multiview Fusion for Accurate Human Pose Estimation in the Wild52
Low-light Image Enhancement via Breaking Down the Darkness52
Towards High Performance Human Keypoint Detection51
Beyond Dents and Scratches: Logical Constraints in Unsupervised Anomaly Detection and Localization49
A Comprehensive Benchmark Analysis of Single Image Deraining: Current Challenges and Future Perspectives49
The Fishyscapes Benchmark: Measuring Blind Spots in Semantic Segmentation48
Compositional Convolutional Neural Networks: A Robust and Interpretable Model for Object Recognition Under Occlusion47
Multi-level Motion Attention for Human Motion Prediction46
SensatUrban: Learning Semantics from Urban-Scale Photogrammetric Point Clouds46
NormAttention-PSN: A High-frequency Region Enhanced Photometric Stereo Network with Normalized Attention45
Learning JPEG Compression Artifacts for Image Manipulation Detection and Localization44
Underwater Camera: Improving Visual Perception Via Adaptive Dark Pixel Prior and Color Correction42
3D Semantic Scene Completion: A Survey42
Viewpoint and Scale Consistency Reinforcement for UAV Vehicle Re-Identification41
Semantic Edge Detection with Diverse Deep Supervision40
Deep Nets: What have They Ever Done for Vision?40
Learning Adaptive Classifiers Synthesis for Generalized Few-Shot Learning40
MADAN: Multi-source Adversarial Domain Aggregation Network for Domain Adaptation39
Multi-Modal 3D Object Detection in Autonomous Driving: A Survey38
Successive Graph Convolutional Network for Image De-raining38
Mitigating Demographic Bias in Facial Datasets with Style-Based Multi-attribute Transfer38
Quo Vadis, Skeleton Action Recognition?38
Learning Self-supervised Low-Rank Network for Single-Stage Weakly and Semi-supervised Semantic Segmentation37
Progressive DARTS: Bridging the Optimization Gap for NAS in the Wild37
Continuous 3D Multi-Channel Sign Language Production via Progressive Transformers and Mixture Density Networks36
Pyramid Attention Network for Image Restoration36
Learning to Reconstruct HDR Images from Events, with Applications to Depth and Flow Prediction35
Hierarchical Domain-Adapted Feature Learning for Video Saliency Prediction35
Beyond Monocular Deraining: Parallel Stereo Deraining Network Via Semantic Prior34
PhysFormer++: Facial Video-Based Physiological Measurement with SlowFast Temporal Difference Transformer34
SportsCap: Monocular 3D Human Motion Capture and Fine-Grained Understanding in Challenging Sports Videos34
Manhattan Room Layout Reconstruction from a Single $$360^{\circ }$$ Image: A Comparative Study of State-of-the-Art Methods33
Mimetics: Towards Understanding Human Actions Out of Context33
RePCD-Net: Feature-Aware Recurrent Point Cloud Denoising Network33
Polysemy Deciphering Network for Robust Human–Object Interaction Detection33
Memory-Augmented Deep Unfolding Network for Guided Image Super-resolution32
Semantics-to-Signal Scalable Image Compression with Learned Revertible Representations32
Zero-Shot Learning on 3D Point Cloud Objects and Beyond31
Dual Convolutional Neural Networks for Low-Level Vision31
Parallel Single-Pixel Imaging: A General Method for Direct–Global Separation and 3D Shape Reconstruction Under Strong Global Illumination29
LAMP-HQ: A Large-Scale Multi-pose High-Quality Database and Benchmark for NIR-VIS Face Recognition29
Selective Wavelet Attention Learning for Single Image Deraining29
Weakly-Supervised Semantic Segmentation with Visual Words Learning and Hybrid Pooling28
Feature Matching via Motion-Consistency Driven Probabilistic Graphical Model27
RIConv++: Effective Rotation Invariant Convolutions for 3D Point Clouds Deep Learning27
A Coarse-to-Fine Framework for Resource Efficient Video Recognition27
Unsupervised Domain Adaptation with Background Shift Mitigating for Person Re-Identification27
Exploiting Diffusion Prior for Real-World Image Super-Resolution27
Joint Classification and Regression for Visual Tracking with Fully Convolutional Siamese Networks27
REVISE: A Tool for Measuring and Mitigating Bias in Visual Datasets26
Distribution-Sensitive Information Retention for Accurate Binary Neural Network26
On Measuring and Controlling the Spectral Bias of the Deep Image Prior26
Separating Content from Style Using Adversarial Learning for Recognizing Text in the Wild26
A Survey on Intrinsic Images: Delving Deep into Lambert and Beyond26
SODA: Weakly Supervised Temporal Action Localization Based on Astute Background Response and Self-Distillation Learning25
Intra-Camera Supervised Person Re-Identification25
Revisiting Consistency Regularization for Semi-Supervised Learning24
Generalized Out-of-Distribution Detection: A Survey24
SRT3D: A Sparse Region-Based 3D Object Tracking Approach for the Real World24
Dual-Attention-Guided Network for Ghost-Free High Dynamic Range Imaging24
Spatial–Temporal Relation Reasoning for Action Prediction in Videos23
Learning Deep Patch representation for Probabilistic Graphical Model-Based Face Sketch Synthesis23
EAN: Event Adaptive Network for Enhanced Action Recognition23
Multi-adversarial Faster-RCNN with Paradigm Teacher for Unrestricted Object Detection23
Pre-Training Without Natural Images22
Delving Deeper into Anti-Aliasing in ConvNets22
Few-Shot Segmentation via Divide-and-Conquer Proxies22
Delving into Inter-Image Invariance for Unsupervised Visual Representations22
Adaptive Deep Disturbance-Disentangled Learning for Facial Expression Recognition22
GLENet: Boosting 3D Object Detectors with Generative Label Uncertainty Estimation22
Artificial Intelligence for Dunhuang Cultural Heritage Protection: The Project and the Dataset22
Evaluation Metrics for Conditional Image Generation22
Attribute Prototype Network for Any-Shot Learning21
OASIS: Only Adversarial Supervision for Semantic Image Synthesis21
Dual-Constrained Deep Semi-Supervised Coupled Factorization Network with Enriched Prior20
One-Shot Object Affordance Detection in the Wild20
Context-Enhanced Representation Learning for Single Image Deraining20
Perspectives and Prospects on Transformer Architecture for Cross-Modal Tasks with Language and Vision20
DeMoCap: Low-Cost Marker-Based Motion Capture20
Visual Object Tracking in First Person Vision20
Towards Balanced Learning for Instance Recognition20
Cascaded Split-and-Aggregate Learning with Feature Recombination for Pedestrian Attribute Recognition20
A Shape Transformation-based Dataset Augmentation Framework for Pedestrian Detection19
Learning to Detect Instance-Level Salient Objects Using Complementary Image Labels19
Going Deeper than Tracking: A Survey of Computer-Vision Based Recognition of Animal Pain and Emotions19
Enhanced 3D Human Pose Estimation from Videos by Using Attention-Based Neural Network with Dilated Convolutions19
Exploring the Capacity of an Orderless Box Discretization Network for Multi-orientation Scene Text Detection18
Real-World Video Deblurring: A Benchmark Dataset and an Efficient Recurrent Neural Network17
Vote-Based 3D Object Detection with Context Modeling and SOB-3DNMS17
DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval17
ShadingNet: Image Intrinsics by Fine-Grained Shading Decomposition17
Deep Memory-Augmented Proximal Unrolling Network for Compressive Sensing17
I3CL: Intra- and Inter-Instance Collaborative Learning for Arbitrary-Shaped Scene Text Detection17
Class-Difficulty Based Methods for Long-Tailed Visual Recognition17
Guided Attention in CNNs for Occluded Pedestrian Detection and Re-identification17
Deep Bingham Networks: Dealing with Uncertainty and Ambiguity in Pose Estimation16
Improving Semi-Supervised and Domain-Adaptive Semantic Segmentation with Self-Supervised Depth Estimation16
A Comprehensive Survey on Test-Time Adaptation Under Distribution Shifts16
DLOW: Domain Flow and Applications16
H-SegMed: A Hybrid Method for Prostate Segmentation in TRUS Images via Improved Closed Principal Curve and Improved Enhanced Machine Learning16
Continuous and Diverse Image-to-Image Translation via Signed Attribute Vectors16
Label-Free Robustness Estimation of Object Detection CNNs for Autonomous Driving Applications16
CDTD: A Large-Scale Cross-Domain Benchmark for Instance-Level Image-to-Image Translation and Domain Adaptive Object Detection16
AutoDet: Pyramid Network Architecture Search for Object Detection16
Learning Regression and Verification Networks for Robust Long-term Tracking16
Delving into Calibrated Depth for Accurate RGB-D Salient Object Detection15
Spatial Monitoring and Insect Behavioural Analysis Using Computer Vision for Precision Pollination15
Semi-supervised Visual Tracking of Marine Animals Using Autonomous Underwater Vehicles15
Domain-Specific Bias Filtering for Single Labeled Domain Generalization15
DESC: Domain Adaptation for Depth Estimation via Semantic Consistency15
Sparse Black-Box Video Attack with Reinforcement Learning15
CDistNet: Perceiving Multi-domain Character Distance for Robust Text Recognition15
Learning to Adapt to Light15
Disentangled Inference for GANs With Latently Invertible Autoencoder15
Multi-Object Tracking and Segmentation Via Neural Message Passing15
CoCoNet: Coupled Contrastive Learning Network with Multi-level Feature Ensemble for Multi-modality Image Fusion15
Deep Unfolding for Snapshot Compressive Imaging15
Incremental Rotation Averaging14
Investigating the Role of Image Retrieval for Visual Localization14
NAS-FCOS: Efficient Search for Object Detection Architectures14
Object Priors for Classifying and Localizing Unseen Actions14
Facial Kinship Verification: A Comprehensive Review and Outlook14
CRCNet: Few-Shot Segmentation with Cross-Reference and Region–Global Conditional Networks14
Consensus-Based Optimization for 3D Human Pose Estimation in Camera Coordinates13
Shape My Face: Registering 3D Face Scans by Surface-to-Surface Translation13
SMG: A Micro-gesture Dataset Towards Spontaneous Body Gestures for Emotional Stress State Analysis13
Visual Structure Constraint for Transductive Zero-Shot Learning in the Wild13
HiEve: A Large-Scale Benchmark for Human-Centric Video Analysis in Complex Events13
Bilevel Fast Scene Adaptation for Low-Light Image Enhancement13
Open-Set Adversarial Defense with Clean-Adversarial Mutual Learning13
DOVE: Learning Deformable 3D Objects by Watching Videos13
Adversarial Learning Domain-Invariant Conditional Features for Robust Face Anti-spoofing13
Guided Hyperspectral Image Denoising with Realistic Data13
CNN-Based RGB-D Salient Object Detection: Learn, Select, and Fuse13
A Benchmark and Evaluation of Non-Rigid Structure from Motion13
Semantic Contrastive Embedding for Generalized Zero-Shot Learning13
Spectral Shape Recovery and Analysis Via Data-driven Connections13
DDPNAS: Efficient Neural Architecture Search via Dynamic Distribution Pruning13
A Numerical Framework for Elastic Surface Matching, Comparison, and Interpolation13
Surgical Tool Datasets for Machine Learning Research: A Survey13
Adaptive Deep PnP Algorithm for Video Snapshot Compressive Imaging13
Instance Segmentation in the Dark12
Deep Maximum a Posterior Estimator for Video Denoising12
AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation12
Scene Reconstruction with Functional Objects for Robot Autonomy12
Multi-target Knowledge Distillation via Student Self-reflection12
Accurate Fine-Grained Object Recognition with Structure-Driven Relation Graph Networks12
Elastic Shape Analysis of Surfaces with Second-Order Sobolev Metrics: A Comprehensive Numerical Framework12
Saliency Detection Inspired by Topological Perception Theory12
Full-Spectrum Out-of-Distribution Detection12
Context and Structure Mining Network for Video Object Detection11
Invertible Rescaling Network and Its Extensions11
Learning Scene Dynamics from Point Cloud Sequences11
Letter-Level Online Writer Identification11
Joint Bilateral-Resolution Identity Modeling for Cross-Resolution Person Re-Identification11
Action2video: Generating Videos of Human 3D Actions11
Learning Contrastive Representation for Semantic Correspondence11
Rectified Binary Convolutional Networks with Generative Adversarial Learning11
Meta Attention-Generation Network for Cross-Granularity Few-Shot Learning11
Hierarchical Visual-Textual Knowledge Distillation for Life-Long Correlation Learning11
ACTNET: End-to-End Learning of Feature Activations and Multi-stream Aggregation for Effective Instance Image Retrieval11
Adaptive Dimension-Discriminative Low-Rank Tensor Recovery for Computational Hyperspectral Imaging11
Wide-Angle Image Rectification: A Survey10
Through Hawks’ Eyes: Synthetically Reconstructing the Visual Field of a Bird in Flight10
Just Recognizable Distortion for Machine Vision Oriented Image and Video Coding10
Exposing Semantic Segmentation Failures via Maximum Discrepancy Competition10
Learning 3D Semantic Scene Graphs with Instance Embeddings10
Nonblind Image Deconvolution via Leveraging Model Uncertainty in An Untrained Deep Neural Network10
Inferring Bias and Uncertainty in Camera Calibration10
Deep Trajectory Post-Processing and Position Projection for Single & Multiple Camera Multiple Object Tracking10
Object-Based Visual Camera Pose Estimation From Ellipsoidal Model and 3D-Aware Ellipse Prediction10
Excitation Dropout: Encouraging Plasticity in Deep Neural Networks10
Unsupervised Multi-View CNN for Salient View Selection and 3D Interest Point Detection10
3D Scene Reconstruction with an Un-calibrated Light Field Camera10
Pluralistic Free-Form Image Completion10
A CNN Based Approach for the Point-Light Photometric Stereo Problem10
Distribution-Aware Margin Calibration for Semantic Segmentation in Images9
SoftPool++: An Encoder–Decoder Network for Point Cloud Completion9
AnimalTrack: A Benchmark for Multi-Animal Tracking in the Wild9
Learning Extremal Representations with Deep Archetypal Analysis9
Weakly Supervised Moment Localization with Decoupled Consistent Concept Prediction9
Norm-Aware Embedding for Efficient Person Search and Tracking9
Improved 3D Markerless Mouse Pose Estimation Using Temporal Semi-supervision9
Real-Time Semantic Segmentation via Auto Depth, Downsampling Joint Decision and Feature Aggregation9
Entrack: Probabilistic Spherical Regression with Entropy Regularization for Fiber Tractography9
A Survey on Global LiDAR Localization: Challenges, Advances and Open Problems9
Cross-Domain Gated Learning for Domain Generalization9
Minimal Solvers for Relative Pose Estimation of Multi-Camera Systems using Affine Correspondences9
Towards Compact 1-bit CNNs via Bayesian Learning9
InterGen: Diffusion-Based Multi-human Motion Generation Under Complex Interactions9
Visual Attention Consistency for Human Attribute Recognition9
Segmentation by Continuous Latent Semantic Analysis for Multi-structure Model Fitting9
A Shape-Aware Retargeting Approach to Transfer Human Motion and Appearance in Monocular Videos9
A Deep Learning Framework for Infrared and Visible Image Fusion Without Strict Registration9
Visual Interestingness Prediction: A Benchmark Framework and Literature Review9
DeepFlux for Skeleton Detection in the Wild9
Learning Geometric Transformation for Point Cloud Completion9
CODON: On Orchestrating Cross-Domain Attentions for Depth Super-Resolution9
Deep Learning Geometry Compression Artifacts Removal for Video-Based Point Cloud Compression9
Deep Human-Interaction and Association by Graph-Based Learning for Multiple Object Tracking in the Wild8
AdaStereo: An Efficient Domain-Adaptive Stereo Matching Approach8
The Isowarp: The Template-Based Visual Geometry of Isometric Surfaces8
Semi-Supervised and Long-Tailed Object Detection with CascadeMatch8
0.067328929901123