Computer Vision and Image Understanding

Papers
(The H4-Index of Computer Vision and Image Understanding is 27. The table below lists the papers at or above that threshold, based on CrossRef citation counts (max. 250 papers). It covers publications from the past four years, i.e., from 2021-08-01 to 2025-08-01.)
| Article | Citations |
| --- | --- |
| Editorial Board | 273 |
| Editorial Board | 257 |
| Editorial Board | 217 |
| Editorial Board | 140 |
| Editorial Board | 101 |
| Improving the planarity and sharpness of monocularly estimated depth images using the Phong reflection model | 100 |
| Exploring using jigsaw puzzles for out-of-distribution detection | 83 |
| Feature reconstruction and metric based network for few-shot object detection | 82 |
| RetSeg3D: Retention-based 3D semantic segmentation for autonomous driving | 80 |
| Luminance prior guided Low-Light 4C catenary image enhancement | 68 |
| Siamese self-supervised learning for fine-grained visual classification | 68 |
| Robust Teacher: Self-correcting pseudo-label-guided semi-supervised learning for object detection | 65 |
| Twin-SegNet: Dynamically coupled complementary segmentation networks for generalized medical image segmentation | 48 |
| Deducing health cues from biometric data | 45 |
| Emerging image generation with flexible control of perceived difficulty | 39 |
| 3D semantic segmentation based on spatial-aware convolution and shape completion for augmented reality applications | 39 |
| CRML-Net: Cross-Modal Reasoning and Multi-Task Learning Network for tooth image segmentation | 38 |
| MATTE: Multi-task multi-scale attention | 34 |
| Extending function mixture network for improved spectral super-resolution | 34 |
| Modality adaptation via feature difference learning for depth human parsing | 33 |
| Lightweight feature point detection network with channel enhancement | 32 |
| Convolutional neural network framework for deepfake detection: A diffusion-based approach | 31 |
| Efficient cross-information fusion decoder for semantic segmentation | 30 |
| Decoupled appearance and motion learning for efficient anomaly detection in surveillance video | 29 |
| Editorial Board | 27 |
| Implicit and explicit commonsense for multi-sentence video captioning | 27 |
| View-aligned pixel-level feature aggregation for 3D shape classification | 27 |
| Exploring the differences in adversarial robustness between ViT- and CNN-based models using novel metrics | 27 |
| Editorial Board | 27 |
| 3D object feature extraction and classification using 3D MF-DFA | 27 |
| Syntactically and semantically enhanced captioning network via hybrid attention and POS tagging prompt | 27 |
| Adaptive CNN filter pruning using global importance metric | 27 |
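
As a rough check on the stated threshold, the sketch below recomputes an h-index from the citation counts in the table above. It assumes that the H4-Index is the conventional h-index (the largest h such that at least h items have h or more citations) restricted to the four-year window; the page itself does not spell out the definition. The function name `h_index` and the `counts` list are illustrative, with the counts copied by hand from the table.

```python
def h_index(citations):
    """Return the largest h such that at least h items have >= h citations."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank  # the top `rank` items all have at least `rank` citations
        else:
            break
    return h


# Citation counts copied from the table above (32 rows, "Editorial Board" rows included).
counts = [
    273, 257, 217, 140, 101, 100, 83, 82, 80, 68, 68, 65,
    48, 45, 39, 39, 38, 34, 34, 33, 32, 31, 30, 29,
    27, 27, 27, 27, 27, 27, 27, 27,
]

print(h_index(counts))  # prints 27
```

Items omitted from the table have fewer than 27 citations, so they cannot raise the result; under the assumed definition the listed rows alone are enough to reproduce the value of 27.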