Image and Vision Computing

Papers
(The TQCC of Image and Vision Computing is 5. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-02-01 to 2025-02-01.)
ArticleCitations
Accurate and efficient salient object detection via position prior attention271
Facial expression recognition using densely connected convolutional neural network and hierarchical spatial attention148
CFENet: Context-aware Feature Enhancement Network for efficient few-shot object counting111
AwareTrack: Object awareness for visual tracking via templates interaction103
Grad-CAM based explanations for multiocular disease detection using Xception net102
Privacy-preserving explainable AI enable federated learning-based denoising fingerprint recognition model95
Feature extraction and fusion algorithm for infrared visible light images based on residual and generative adversarial network93
EDCAANet: A lightweight COD network based on edge detection and coordinate attention assistance67
Unmasking deepfakes: Eye blink pattern analysis using a hybrid LSTM and MLP-CNN model58
CLBSR: A deep curriculum learning-based blind image super resolution network using geometrical prior51
HPD-Depth: High performance decoding network for self-supervised monocular depth estimation49
Spatial–temporal sequential network for anomaly detection based on long short-term magnitude representation47
Integrating prior knowledge into a bibranch pyramid network for medical image segmentation43
Exploring global context and position-aware representation for group activity recognition42
A polar-edge context-aware (PECA) network for mirror segmentation41
SalFBNet: Learning pseudo-saliency distribution via feedback convolutional networks38
Continual coarse-to-fine domain adaptation in semantic segmentation37
Multi parallel U-net encoder network for effective polyp image segmentation37
Single image dehazing using extended local dark channel prior36
Fine-grained bidirectional attentional generation and knowledge-assisted networks for cross-modal retrieval36
BF3D: Bi-directional fusion 3D detector with semantic sampling and geometric mapping34
Effective hybrid attention network based on pseudo-color enhancement in ultrasound image segmentation30
A study on attention-based LSTM for abnormal behavior recognition with variable pooling30
Authenticating and securing healthcare records: A deep learning-based zero watermarking approach29
Language and vision based person re-identification for surveillance systems using deep learning with LIP layers29
Attribute discrimination combined with selected sample dropout for unsupervised domain adaptive person re-identification27
Knowledge distillation methods for efficient unsupervised adaptation across multiple domains26
Video object segmentation by multi-scale attention using bidirectional strategy25
Cross-modal hybrid architectures for gastrointestinal tract image analysis: A systematic review and futuristic applications25
Modality interactive attention for cross-modality person re-identification24
Visual tracking based on spatiotemporal transformer and fusion sequences24
Underwater image enhancement based on global features and prior distribution guided24
Localization of diffusion model-based inpainting through the inter-intra similarity of frequency features24
Active domain adaptation for semantic segmentation via dynamically balancing domainness and uncertainty23
Editorial Board23
ATOM: Self-supervised human action recognition using atomic motion representation learning22
Fuzzy set-based Bernoulli Random Noise Weighted Loss for unsupervised person re-identification22
Visual question answering model based on graph neural network and contextual attention21
RGB-T tracking by modality difference reduction and feature re-selection21
Exploring holistic discriminative representation for micro-expression recognition via contrastive learning21
Edge-aware salient object detection network via context guidance20
Modeling content-attribute preference for personalized image esthetics assessment20
Multi-layer capsule network with joint dynamic routing for fire recognition20
: Robust real-time shape-from-template, a C ++ library19
Semantic segmentation of large-scale point clouds by integrating attention mechanisms and transformer models19
Alignment and fusion for adaptive domain nighttime semantic segmentation19
View knowledge transfer network for multi-view action recognition19
DiPS: Discriminative pseudo-label sampling with self-supervised transformers for weakly supervised object localization19
A 3D multi-scale CycleGAN framework for generating synthetic PETs from MRIs for Alzheimer's disease diagnosis19
LSTM with bio inspired algorithm for action recognition in sports videos19
Proactive hybrid learning framework for real-time multi-vehicle detection in unregulated traffic environments18
C2F: An effective coarse-to-fine network for video summarization18
Editorial Board18
Hourglass cascaded recurrent stereo matching network18
Editorial Board18
G-TRACE: Grouped temporal recalibration for video object segmentation18
Attention-guided aggregation stereo matching network17
Cuepervision: self-supervised learning for continuous domain adaptation without catastrophic forgetting17
Improving eye movement biometrics in low frame rate eye-tracking devices using periocular and eye blinking features17
Special issue on role of computer vision in smart cities17
A robust image representation method against illumination and occlusion variations17
Editorial Board16
Feature decoupling and interaction network for defending against adversarial examples16
Image-based human re-identification: Which covariates are actually (the most) important?16
Cross-scale global attention feature pyramid network for person search16
FEANet: Foreground-edge-aware network with DenseASPOC for human parsing16
ECT: Fine-grained edge detection with learned cause tokens16
Editorial Board16
Distance metric-based learning for long-tail object detection16
Weakly supervised moment localization with natural language based on semantic reconstruction15
Dense graph convolutional neural networks on 3D meshes for 3D object segmentation and classification15
Boosting semi-supervised face recognition with raw faces15
Editorial Board15
A lightweight network for monocular depth estimation with decoupled body and edge supervision15
MFC-Net : Multi-feature fusion cross neural network for salient object detection15
Hyperspherically regularized networks for self-supervision14
IAC-ReCAM: Two-dimensional attention modulation and category label guidance for weakly supervised semantic segmentation14
Faster and finer pose estimation for multiple instance objects in a single RGB image14
Editorial Board14
LDWS-net: A learnable deep wavelet scattering network for RGB salient object detection14
LocalFace: Learning significant local features for deep face recognition14
Learning language to symbol and language to vision mapping for visual grounding14
A dedicated benchmark for contour-based corner detection evaluation14
Exploiting recollection effects for memory-based video object segmentation13
Transformer-based feature interactor for person re-identification with margin self-punishment loss13
μPEWFace: Parallel ensemble of weighted deep convolutional neural networks with novel loss functions for face-based authentication13
ABC: Aligning binary centers for single-stage monocular 3D object detection13
3D human body modeling with orthogonal human mask image based on multi-channel Swin transformer architecture13
MetaPix: Domain transfer for semantic segmentation by meta pixel weighting12
Improving distinctiveness in video captioning with text-video similarity12
Multi-label recognition in open driving scenarios based on bipartite-driven superimposed dynamic graph12
An instance-level data balancing method for object detection via contextual information alignment12
Drone-NeRF: Efficient NeRF based 3D scene reconstruction for large-scale drone survey12
Robust ensemble person reidentification via orthogonal fusion with occlusion handling12
Few-shot classification with multisemantic information fusion network12
CFFNet: Coordinated feature fusion network for crowd counting12
GAN-BodyPose: Real-time 3D human body pose data key point detection and quality assessment assisted by generative adversarial network12
Artificial immune systems for data augmentation12
A few-shot learning-based ischemic stroke segmentation system using weighted MRI fusion12
Batch feature standardization network with triplet loss for weakly-supervised video anomaly detection12
AI-powered trustable and explainable fall detection system using transfer learning12
PTPFusion: A progressive infrared and visible image fusion network based on texture preserving11
Knowledge graph construction in hyperbolic space for automatic image annotation11
A new deepfake detection model for responding to perception attacks in embodied artificial intelligence11
Depth awakens: A depth-perceptual attention fusion network for RGB-D camouflaged object detection11
Efficient masked feature and group attention network for stereo image super-resolution11
Multi-branch residual image semantic segmentation combined with inverse weight gated-control11
SCTrans: Self-align and cross-align transformer for few-shot segmentation11
Learning weakly supervised audio-visual violence detection in hyperbolic space11
3D-ISRNet:Generating 3D point clouds through image similarity retrieval in a complex background from a single image10
Single stage architecture for improved accuracy real-time object detection on mobile devices10
Whether normalized or not? Towards more robust iris recognition using dynamic programming10
FSformer: Fast-Slow Transformer for video action recognition10
Image captioning: Semantic selection unit with stacked residual attention10
Detecting adversarial samples by noise injection and denoising10
Pyramid quaternion discrete cosine transform based ConvNet for cancelable face recognition10
CRENet: Crowd region enhancement network for multi-person 3D pose estimation10
UTR: A UNet-like transformer for efficient unsupervised medical image registration10
PU-GACNet: Graph Attention Convolution Network for Point Cloud Upsampling10
Learning diverse and deep clues for person reidentification10
Exploring cross-video matching for few-shot video classification via dual-hierarchy graph neural network learning10
Generous teacher: Good at distilling knowledge for student learning10
Machine learning based video segmentation of moving scene by motion index using IO detector and shot segmentation10
Synthetic lidar point cloud generation using deep generative models for improved driving scene object recognition10
Background debiased class incremental learning for video action recognition10
A dual-channel network based on occlusion feature compensation for human pose estimation10
Object aspect classification and 6DoF pose estimation10
Person search over security video surveillance systems using deep learning methods: A review10
RGB road scene material segmentation10
Real-time 3D human pose estimation without skeletal a priori structures10
Visible thermal person re-identification via multi-branch modality residual complementary learning10
OCUCFormer: An Over-Complete Under-Complete Transformer Network for accelerated MRI reconstruction10
Parameter efficient finetuning of text-to-image models with trainable self-attention layer10
RAMT-GAN: Realistic and accurate makeup transfer with generative adversarial network9
A data augmentation approach that ensures the reliability of foregrounds in medical image segmentation9
Adaptive weight based on overlapping blocks network for facial expression recognition9
FMD-Yolo: An efficient face mask detection method for COVID-19 prevention and control in public9
Integration of ultrasound and mammogram for multimodal classification of breast cancer using hybrid residual neural network and machine learning9
Does explainable machine learning uncover the black box in vision applications?9
Learning an augmentation strategy for sparse datasets9
Attention guided contextual feature fusion network for salient object detection9
Multiscale features integration based multiple-in-single-out network for object detection9
An automated hyperparameter tuned deep learning model enabled facial emotion recognition for autonomous vehicle drivers9
Acute lymphocytic leukemia detection and subtype classification via extended wavelet pooling based-CNNs and statistical-texture features9
Contrastive learning based facial action unit detection in children with hearing impairment for a socially assistive robot platform9
Unsupervised person re-identification by dynamic hybrid contrastive learning9
STRFormer: Spatial–Temporal–ReTemporal Transformer for 3D human pose estimation9
A lightweight depth completion network with spatial efficient fusion9
ASF-YOLO: A novel YOLO model with attentional scale sequence fusion for cell instance segmentation9
Unified Volumetric Avatar: Enabling flexible editing and rendering of neural human representations9
Multi-view daily action recognition based on Hooke balanced matrix and broad learning system8
MLRMV: Multi-layer representation for multi-view action recognition8
Pose-guided part matching network via shrinking and reweighting for occluded person re-identification8
TransWild: Enhancing 3D interacting hands recovery in the wild with IoU-guided Transformer8
Bilateral regularized optimization model for edge-preserving image smoothing8
Editorial Board8
Feature attention fusion network for occluded person re-identification8
Recent advances in deterministic human motion prediction: A review8
Three dimensional tracking of rigid objects in motion using 2D optical flows8
Lightweight and computationally faster Hypermetropic Convolutional Neural Network for small size object detection8
WITHDRAWN: Lips-SpecFormer: Non-linear interpolable transformer for spectral reconstruction using adjacent channel coupling8
SAVE: Encoding spatial interactions for vision transformers8
Spatial likelihood voting with self-knowledge distillation for weakly supervised object detection8
Noisy label facial expression recognition via face-specific label distribution learning8
Bidirectional Attentional Interaction Networks for RGB-D salient object detection8
Boundary guidance network for camouflage object detection8
DeepArUco++: Improved detection of square fiducial markers in challenging lighting conditions8
Adversarial attacks and defenses in person search: A systematic mapping study and taxonomy8
R2-trans: Fine-grained visual categorization with redundancy reduction8
Extended neighborhood-based road and median filter for impulse noise removal from depth map7
Improving defocus blur detection via adaptive supervision prior-tokens7
Task-based parameter isolation for foreground segmentation without catastrophic forgetting using multi-scale region and edges fusion network7
Universal domain adaptation from multiple black-box sources7
Combining complementary trackers for enhanced long-term visual object tracking7
Semantic and edge-based visual odometry by joint minimizing semantic and edge distance error7
Dictionary-enabled efficient training of ConvNets for image classification7
RBGAN: Realistic-generation and balanced-utility GAN for face de-identification7
Flow guided mutual attention for person re-identification7
Editorial Board7
Editorial Board7
A semantic fusion based approach for express bill detection in complex scenes7
Dual-scale point cloud completion network based on high-frequency feature fusion7
Accurate video saliency prediction via hierarchical fusion and temporal recurrence7
Research on efficient detection network method for remote sensing images based on self attention mechanism7
A novel micro-expression detection algorithm based on BERT and 3DCNN7
A Point-2s reinforcement learning biomimetic model for estimating and analyzing human 3D motion posture7
Multi-scale feature aggregation and boundary awareness network for salient object detection7
Detection of dental periapical lesions using retinex based image enhancement and lightweight deep learning model6
Moment preserving tomographic image reconstruction model6
Pedestrian detection in low-light conditions: A comprehensive survey6
Deep learning with adaptive convolutions for classification of retinal diseases via optical coherence tomography6
Transferable dual multi-granularity semantic excavating for partially relevant video retrieval6
Self-supervised Vision Transformers for 3D pose estimation of novel objects6
Tackling multiple object tracking with complicated motions — Re-designing the integration of motion and appearance6
Online-adaptive classification and regression network with sample-efficient meta learning for long-term tracking6
Activity guided multi-scales collaboration based on scaled-CNN for saliency prediction6
2D progressive fusion module for action recognition6
RTDOD: A large-scale RGB-thermal domain-incremental object detection dataset for UAVs6
TENet: Accurate light-field salient object detection with a transformer embedding network6
A supervised approach for the detection of AM-FM signals’ interference regions in spectrogram images6
Starting from the structure: A review of small object detection based on deep learning6
DeepSegment: Segmentation of motion capture data using deep convolutional neural network6
GW-net: An efficient grad-CAM consistency neural network with weakening of random erasing features for semi-supervised person re-identification6
A framework of human action recognition using length control features fusion and weighted entropy-variances based feature selection6
Pose-guided counterfactual inference for occluded person re-identification6
Editorial Board6
Human–object interaction detection with missing objects6
Mobile-friendly and multi-feature aggregation via transformer for human pose estimation6
DGSN: Learning how to segment pedestrians from other datasets for occluded person re-identification6
VAE-GAN3D: Leveraging image-based semantics for 3D zero-shot recognition6
A multi-scale perceptual polyp segmentation network based on boundary guidance6
A review on 2D instance segmentation based on deep neural networks6
Multi-view dynamic facial action unit detection6
Regularization by denoising diffusion process meets deep relaxation in phase6
Attention guided multi-level feature aggregation network for camouflaged object detection6
Loss reweight in scale dimension: A simple while effective feature selection strategy for anchor-free detectors6
A motion model based on recurrent neural networks for visual object tracking6
Editorial Board6
Interactive multi-scale feature representation enhancement for small object detection6
A Survey on Object Detection for the Internet of Multimedia Things (IoMT) using Deep Learning and Event-based Middleware: Approaches, Challenges, and Future Directions6
Lightweight multi-scale attention-guided network for real-time semantic segmentation5
Lightweight boundary refinement module based on point supervision for semantic segmentation5
You look so different! Haven’t I seen you a long time ago?5
MINet: Modality interaction network for unified multi-modal tracking5
Multiscale segmentation net for segregating heterogeneous brain tumors: Gliomas on multimodal MR images5
Environmentally adaptive fast object detection in UAV images5
Dual subspace clustering for spectral-spatial hyperspectral image clustering5
Frequency and content dual stream network for image dehazing5
Consistent camera-invariant and noise-tolerant learning for unsupervised person re-identification5
MVPCC-Net: Multi-View Based Point Cloud Completion Network for MLS data5
A three-dimensional human motion pose recognition algorithm based on graph convolutional networks5
Certifiable relative pose estimation5
Alleviating the generalization issue in adversarial domain adaptation networks5
Occlusion-aware deep convolutional neural network via homogeneous Tanh-transforms for face parsing5
Top-tuning: A study on transfer learning for an efficient alternative to fine tuning for image classification with fast kernel methods5
Generative feature-driven image replay for continual learning5
Video object segmentation based on dynamic perception update and feature fusion5
Cross-view action recognition with small-scale datasets5
Underwater bubble plume image generative model based on noise prior and multi conditional labels5
Improved real-time three-dimensional stereo matching with local consistency5
Rich global feature guided network for monocular depth estimation5
Novel approach for fast structured light framework using deep learning5
Multiscale parallel deep CNN (mpdCNN) architecture for the real low-resolution face recognition for surveillance5
RAD-BNN: Regulating activation distribution for accurate binary neural network5
Federated learning based nonlinear two-stage framework for full-reference image quality assessment: An application for biometric5
Person re-identification: A taxonomic survey and the path ahead5
Semantic-aligned reinforced attention model for zero-shot learning5
AGA-GAN: Attribute Guided Attention Generative Adversarial Network with U-Net for face hallucination5
SAKD: Sparse attention knowledge distillation5
SGF3D: Similarity-guided fusion network for 3D object detection5
0.0903160572052