Image and Vision Computing

Papers
(The TQCC of Image and Vision Computing is 6. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-05-01 to 2025-05-01.)
ArticleCitations
HPD-Depth: High performance decoding network for self-supervised monocular depth estimation134
Learning diverse and deep clues for person reidentification122
RGB-T tracking by modality difference reduction and feature re-selection119
Synthetic lidar point cloud generation using deep generative models for improved driving scene object recognition118
Alignment and fusion for adaptive domain nighttime semantic segmentation113
DeepArUco++: Improved detection of square fiducial markers in challenging lighting conditions90
Active domain adaptation for semantic segmentation via dynamically balancing domainness and uncertainty67
Modeling content-attribute preference for personalized image esthetics assessment66
Privacy-preserving explainable AI enable federated learning-based denoising fingerprint recognition model54
Multi-information guided camouflaged object detection54
Editorial Board51
ABC: Aligning binary centers for single-stage monocular 3D object detection48
Cross-scale global attention feature pyramid network for person search47
GAN-BodyPose: Real-time 3D human body pose data key point detection and quality assessment assisted by generative adversarial network43
BF3D: Bi-directional fusion 3D detector with semantic sampling and geometric mapping42
Feature decoupling and interaction network for defending against adversarial examples39
PU-GACNet: Graph Attention Convolution Network for Point Cloud Upsampling38
Hourglass cascaded recurrent stereo matching network38
Single stage architecture for improved accuracy real-time object detection on mobile devices34
Learning an augmentation strategy for sparse datasets34
Accurate and efficient salient object detection via position prior attention33
G-TRACE: Grouped temporal recalibration for video object segmentation32
Background debiased class incremental learning for video action recognition31
FMD-Yolo: An efficient face mask detection method for COVID-19 prevention and control in public31
Few-shot classification with multisemantic information fusion network31
AI-powered trustable and explainable fall detection system using transfer learning30
MVPCC-Net: Multi-View Based Point Cloud Completion Network for MLS data29
1D kernel distillation network for efficient image super-resolution29
ST-VTON: Self-supervised vision transformer for image-based virtual try-on28
Recent advances in deterministic human motion prediction: A review28
Certifiable relative pose estimation27
SAFENet: Semantic-Aware Feature Enhancement Network for unsupervised cross-domain road scene segmentation27
Utilizing Inherent Bias for Memory Efficient Continual Learning: A Simple and Robust Baseline27
Spatial likelihood voting with self-knowledge distillation for weakly supervised object detection26
Flow guided mutual attention for person re-identification26
Multi-view dynamic facial action unit detection26
Learning accurate monocular 3D voxel representation via bilateral voxel transformer26
Deep learning with adaptive convolutions for classification of retinal diseases via optical coherence tomography25
Dual subspace clustering for spectral-spatial hyperspectral image clustering25
Underwater bubble plume image generative model based on noise prior and multi conditional labels24
Memory-MambaNav: Enhancing object-goal navigation through integration of spatial–temporal scanning with state space models24
Frequency and content dual stream network for image dehazing23
FSBI: Deepfake detection with frequency enhanced self-blended images23
Two-stream transformer tracking with messengers23
Depth assisted novel view synthesis using few images23
A Point-2s reinforcement learning biomimetic model for estimating and analyzing human 3D motion posture23
Self-supervised Vision Transformers for 3D pose estimation of novel objects23
Enhanced residual network for burst image super-resolution using simple base frame guidance22
DeepSegment: Segmentation of motion capture data using deep convolutional neural network22
Visionary vigilance: Optimized YOLOV8 for fallen person detection with large-scale benchmark dataset22
CMS-net: Edge-aware multimodal MRI feature fusion for brain tumor segmentation22
STAFFormer: Spatio-temporal adaptive fusion transformer for efficient 3D human pose estimation21
Intelligent deep learning based ethnicity recognition and classification using facial images21
Object tracking based on temporal and spatial context information21
PatchMixer: Rethinking network design to boost generalization for 3D point cloud understanding21
Editorial Board21
Matte anything: Interactive natural image matting with segment anything model20
PAGML: Precise Alignment Guided Metric Learning for sketch-based 3D shape retrieval20
Feature alignment via mutual mapping for few-shot fine-grained visual classification19
TransMix: Crafting highly transferable adversarial examples to evade face recognition models19
FastNet: Fast high-resolution network for human pose estimation19
Multi-view self-supervised learning for 3D facial texture reconstruction from single image19
RFSC-net: Re-parameterization forward semantic compensation network in low-light environments18
Short-term anchor linking and long-term self-guided attention for video object detection18
Robust visual tracking via modified Harris hawks optimization18
A deep-shallow and global–local multi-feature fusion network for photometric stereo18
Intelligent facial expression recognition and classification using optimal deep transfer learning model18
A multi-branch dual attention segmentation network for epiphyte drone images17
NPVForensics: Learning VA correlations in non-critical phoneme–viseme regions for deepfake detection17
A new multi-picture architecture for learned video deinterlacing and demosaicing with parallel deformable convolution and self-attention blocks17
AGSAM-Net: UAV route planning and visual guidance model for bridge surface defect detection17
Mixup Mask Adaptation: Bridging the gap between input saliency and representations via attention mechanism in feature mixup17
SADGFeat: Learning local features with layer spatial attention and domain generalization17
Underwater image restoration based on light attenuation prior and color-contrast adaptive correction17
SDMNet: Spatially dilated multi-scale network for object detection for drone aerial imagery17
EMA-GS: Improving sparse point cloud rendering with EMA gradient and anchor upsampling17
Face deidentification with controllable privacy protection17
A spatial-frequency domain multi-branch decoder method for real-time semantic segmentation17
Editorial Board16
Contrast enhancement of region of interest of backlit image for surveillance systems based on multi-illumination fusion16
Dual-branch adaptive attention transformer for occluded person re-identification16
Landmark-in-facial-component: Towards occlusion-robust facial landmark localization16
Detection of anomaly in surveillance videos using quantum convolutional neural networks16
DFG-HCEN: A distinctive-feature guided and hierarchical channel enhanced network-based infrared and visible image fusion16
Enhancing consistency in virtual try-on: A novel diffusion-based approach16
Mitigating human fall injuries: A novel system utilizing 3D 4-stream convolutional neural networks and image fusion16
Social robot in service of the cognitive therapy of elderly people: Exploring robot acceptance in a real-world scenario16
SAMNet: Adapting segment anything model for accurate light field salient object detection15
Editorial Board15
TQRFormer: Tubelet query recollection transformer for action detection15
AHA-track: Aggregating hierarchical awareness features for single15
WRGPruner: A new model pruning solution for tiny salient object detection15
Real-time human-centric segmentation for complex video scenes15
WPE: Weighted prototype estimation for few-shot learning15
CollaborativeBEV: Collaborative bird eye view for reconstructing crowded environment15
Adaptive graph reasoning network for object detection15
Attentive spatial-temporal contrastive learning for self-supervised video representation15
Self-knowledge distillation based on knowledge transfer from soft to hard examples14
OFACD: An end-to-end change detection network for small UAVs remote sensing with viewpoint differences14
Adaptive and fast image superpixel segmentation approach14
Self-trained prediction model and novel anomaly score mechanism for video anomaly detection14
Anchor-based discriminative dual distribution calibration for transductive zero-shot learning14
CRFormer: A cross-region transformer for shadow removal14
Source domain prior-assisted segment anything model for single domain generalization in medical image segmentation13
A novel facial expression recognition model based on harnessing complementary features in multi-scale network with attention fusion13
Adaptive scale matching for remote sensing object detection based on aerial images13
Online multi-object tracking with δ-GLMB filter based on occlusion and identity switch handling13
Multi-axis interactive multidimensional attention network for vehicle re-identification13
H-net: Unsupervised domain adaptation person re-identification network based on hierarchy13
Class-discriminative domain generalization for semantic segmentation13
Deep learning-based efficient diagnosis of periapical diseases with dental X-rays13
Video anomaly detection based on a multi-layer reconstruction autoencoder with a variance attention strategy13
FgbCNN: A unified bilinear architecture for learning a fine-grained feature representation in facial expression recognition13
PW-NeRF: Progressive wavelet-mask guided neural radiance fields view synthesis13
M2VAD: Multiview multi13
Real-time gait biometrics for surveillance applications: A review13
Editorial Board12
Incremental human action recognition with dual memory12
Few-shot class incremental learning via prompt transfer and knowledge distillation12
Corrigendum to “A novel framework for diverse video generation from a single video using frame-conditioned denoising diffusion probabilistic model and ConvNeXt-V2” [Image and Vision Computing 154 (20212
Semantic-aware for point cloud domain adaptation with self-distillation learning12
Data-driven 2D-EWT based diabetic retinopathy identification using hybrid neural network12
Enhancing small object tracking with reversible rescaling networks12
Optimal deep transfer learning based ethnicity recognition on face images12
Stacked graph bone region U-net with bone representation for hand pose estimation and semi-supervised training12
Face and body-shape integration model for cloth-changing person re-identification12
Enhancing single-view 3D mesh reconstruction with the aid of implicit surface learning12
Perceiving local relative motion and global correlations for weakly supervised group activity recognition12
An edge-aware high-resolution framework for camouflaged object detection12
CVAD-GAN: Constrained video anomaly detection via generative adversarial network12
Dynamic semantic prototype perception for text–video retrieval11
Learning auto-scale representations for person re-identification11
Editorial Board11
Flexible multi-objective particle swarm optimization clustering with game theory to address human activity discovery fully unsupervised11
Bridging efficiency and interpretability: Explainable AI for multi-classification of pulmonary diseases utilizing modified lightweight CNNs11
Guest Editorial : Learning with Manifolds in Computer Vision11
Multi-object tracking with adaptive measurement noise and information fusion11
Editorial Board11
Resource-aware strategies for real-time multi-person pose estimation11
Exploiting spatial and temporal context for online tracking with improved transformer11
Synthetic multi-view clustering with missing relationships and instances11
Multimodal assessment of apparent personality using feature attention and error consistency constraint10
SDE-RAE:CLIP-based realistic image reconstruction and editing network using stochastic differential diffusion10
Monocular contextual constraint for stereo matching with adaptive weights assignment10
Black-box reversible adversarial examples with invertible neural network10
GFFT: Global-local feature fusion transformers for facial expression recognition in the wild10
Qualitative failures of image generation models and their application in detecting deepfakes10
Robust ensemble person reidentification via orthogonal fusion with occlusion handling10
Speaker independent VSR: A systematic review and futuristic applications10
UIR-ES: An unsupervised underwater image restoration framework with equivariance and stein unbiased risk estimator10
External knowledge-assisted Transformer for image captioning10
Transformer-based feature interactor for person re-identification with margin self-punishment loss10
Twin relaxed least squares regression with classwise mean constraint for image classification10
Gait recognition via View-aware Part-wise Attention and Multi-scale Dilated Temporal Extractor10
Intelligent video anomaly detection and classification using faster RCNN with deep reinforcement learning model10
Deep hybrid learning for facial expression binary classifications and predictions10
Geometric feature statistics histogram for both real-valued and binary feature representations of 3D local shape10
An analytical proof on suitability of Cauchy-Schwarz Divergence as the aggregation criterion in Region Growing Algorithm10
Multi-granularity for knowledge distillation10
Weather-degraded image semantic segmentation with multi-task knowledge distillation10
A decision support system for acute lymphoblastic leukemia detection based on explainable artificial intelligence10
Parameter efficient finetuning of text-to-image models with trainable self-attention layer10
ECT: Fine-grained edge detection with learned cause tokens9
Editorial Board9
A dual-channel network based on occlusion feature compensation for human pose estimation9
Video object segmentation by multi-scale attention using bidirectional strategy9
Drone-NeRF: Efficient NeRF based 3D scene reconstruction for large-scale drone survey9
Boosting semi-supervised face recognition with raw faces9
Does explainable machine learning uncover the black box in vision applications?9
RGB road scene material segmentation9
Feature extraction and fusion algorithm for infrared visible light images based on residual and generative adversarial network9
Learning language to symbol and language to vision mapping for visual grounding9
Knowledge graph construction in hyperbolic space for automatic image annotation9
A dedicated benchmark for contour-based corner detection evaluation9
Effective hybrid attention network based on pseudo-color enhancement in ultrasound image segmentation9
ASF-YOLO: A novel YOLO model with attentional scale sequence fusion for cell instance segmentation9
OCUCFormer: An Over-Complete Under-Complete Transformer Network for accelerated MRI reconstruction9
Unified Volumetric Avatar: Enabling flexible editing and rendering of neural human representations9
Editorial Board9
Contrastive learning based facial action unit detection in children with hearing impairment for a socially assistive robot platform9
Fuzzy set-based Bernoulli Random Noise Weighted Loss for unsupervised person re-identification9
DiPS: Discriminative pseudo-label sampling with self-supervised transformers for weakly supervised object localization9
FEANet: Foreground-edge-aware network with DenseASPOC for human parsing9
Continual coarse-to-fine domain adaptation in semantic segmentation9
Depth awakens: A depth-perceptual attention fusion network for RGB-D camouflaged object detection9
Learning to disentangle scenes for person re-identification8
EFDCNet: Encoding fusion and decoding correction network for RGB-D indoor semantic segmentation8
Three dimensional tracking of rigid objects in motion using 2D optical flows8
Hierarchical spatiotemporal Feature Interaction Network for video saliency prediction8
Mobile-friendly and multi-feature aggregation via transformer for human pose estimation8
Universal domain adaptation from multiple black-box sources8
Machine learning applications in breast cancer prediction using mammography8
SAKD: Sparse attention knowledge distillation8
Noisy label facial expression recognition via face-specific label distribution learning8
Person re-identification: A taxonomic survey and the path ahead8
Part-aware distillation and aggregation network for human parsing8
Attention guided multi-level feature aggregation network for camouflaged object detection8
CF-SOLT: Real-time and accurate traffic accident detection using correlation filter-based tracking8
Cross-modal hybrid architectures for gastrointestinal tract image analysis: A systematic review and futuristic applications8
LELD: Learn enhancement by learning degradation8
A supervised approach for the detection of AM-FM signals’ interference regions in spectrogram images8
Dense open-set recognition based on training with noisy negative images8
Transferable dual multi-granularity semantic excavating for partially relevant video retrieval8
RBGAN: Realistic-generation and balanced-utility GAN for face de-identification8
Multiscale parallel deep CNN (mpdCNN) architecture for the real low-resolution face recognition for surveillance8
Combining complementary trackers for enhanced long-term visual object tracking8
A lightweight hash-directed global perception and self-calibrated multiscale fusion network for image super-resolution8
Federated learning based nonlinear two-stage framework for full-reference image quality assessment: An application for biometric8
Efficient masked feature and group attention network for stereo image super-resolution8
Improving defocus blur detection via adaptive supervision prior-tokens8
Multi-level feature disentanglement network for cross-dataset face forgery detection8
AES-Net: An adapter and enhanced self-attention guided network for multi-stage glaucoma classification using fundus images8
Image–text feature learning for unsupervised visible–infrared person re-identification8
Advances in deep learning-based image recognition of product packaging7
Open-set face recognition with maximal entropy and Objectosphere loss7
A locally weighted, correlated subdomain adaptive network employed to facilitate transfer learning7
GW-net: An efficient grad-CAM consistency neural network with weakening of random erasing features for semi-supervised person re-identification7
FRoundation: Are foundation models ready for face recognition?7
Editorial Board7
Editorial to special issue on novel insights on ocular biometrics7
Grassmann manifold based framework for automated fall detection from a camera7
Corrigendum to “STAFFormer: Spatio-temporal adaptive fusion transformer for efficient 3D human pose estimation” [Journal of Image and Vision Computing volume 149 (2024) 105142]7
Markerless multi-view 3D human pose estimation: A survey7
Language conditioned multi-scale visual attention networks for visual grounding7
JGULF: Joint global and unilateral local feature network for micro-expression recognition7
Robust visual tracking based on modified mayfly optimization algorithm7
Video prediction by efficient transformers7
VQA as a factoid question answering problem: A novel approach for knowledge-aware and explainable visual question answering7
Improving multi-focus image fusion through Noisy image and feature difference network7
The triple refinement of self-paced learning style for unsupervised cross-domain person re-identification7
An Active Transfer Learning framework for image classification based on Maximum Differentiation Classifier7
Ricci curvature based volumetric segmentation7
POSER: POsed vs Spontaneous Emotion Recognition using fractal encoding7
Editorial Board7
IRPE: Instance-level reconstruction-based 6D pose estimator7
EMNet: Edge-guided multi-level network for salient object detection in low-light images7
Generative feature-driven image replay for continual learning7
Text-augmented Multi-Modality contrastive learning for unsupervised visible-infrared person re-identification7
Editorial Board7
LTST: Long-term segmentation tracker with memory attention network7
Object detection via inner-inter relational reasoning network6
Editorial Board6
Complementary characteristics fusion network for weakly supervised salient object detection6
Multi-stream slowFast graph convolutional networks for skeleton-based action recognition6
Transparency and privacy measures of biometric patterns for data processing with synthetic data using explainable artificial intelligence6
EESSO: Exploiting Extreme and Smooth Signals via Omni-frequency learning for Text-based Person Retrieval6
Dual-path CNN with Max Gated block for text-based person re-identification6
I3Net: Intensive information interaction network for RGB-T salient object detection6
Distribution regularized self-supervised learning for domain adaptation of semantic segmentation6
Editorial Board6
0.057536125183105