Computer Vision and Image Understanding

Papers
(The H4-Index of Computer Vision and Image Understanding is 28. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-11-01 to 2025-11-01.)
ArticleCitations
Luminance prior guided Low-Light 4C catenary image enhancement296
Editorial Board239
Feature reconstruction and metric based network for few-shot object detection153
Efficient cross-information fusion decoder for semantic segmentation114
Convolutional neural network framework for deepfake detection: A diffusion-based approach107
Lightweight feature point detection network with channel enhancement96
3D semantic segmentation based on spatial-aware convolution and shape completion for augmented reality applications95
Editorial Board84
Robust Teacher: Self-correcting pseudo-label-guided semi-supervised learning for object detection79
Siamese self-supervised learning for fine-grained visual classification71
Emerging image generation with flexible control of perceived difficulty51
Improving the planarity and sharpness of monocularly estimated depth images using the Phong reflection model50
Editorial Board43
Exploring using jigsaw puzzles for out-of-distribution detection40
Extending function mixture network for improved spectral super-resolution38
MATTE: Multi-task multi-scale attention37
Deducing health cues from biometric data36
Editorial Board34
Editorial Board34
CRML-Net: Cross-Modal Reasoning and Multi-Task Learning Network for tooth image segmentation33
Modality adaptation via feature difference learning for depth human parsing33
Exploring the differences in adversarial robustness between ViT- and CNN-based models using novel metrics32
RetSeg3D: Retention-based 3D semantic segmentation for autonomous driving31
Twin-SegNet: Dynamically coupled complementary segmentation networks for generalized medical image segmentation31
RelFormer: Advancing contextual relations for transformer-based dense captioning30
GaitBranch: A multi-branch refinement model combined with frame-channel attention mechanism for gait recognition29
Implicit and explicit commonsense for multi-sentence video captioning29
PConvSRGAN: Real-world super-resolution reconstruction with pure convolutional networks29
Embedding AI ethics into the design and use of computer vision technology for consumer’s behaviour understanding28
Syntactically and semantically enhanced captioning network via hybrid attention and POS tagging prompt28
Reverse Stable Diffusion: What prompt was used to generate this image?28
0.069382905960083