Image and Vision Computing

Papers
(The TQCC of Image and Vision Computing is 8. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-04-01 to 2024-04-01.)
ArticleCitations
Recent advances in small object detection based on deep learning: A review271
Weighted boxes fusion: Ensembling boxes from different object detection models214
Deep learning-based object detection in low-altitude UAV datasets: A survey137
A comprehensive review on deep learning-based methods for video anomaly detection117
Application of the best evacuation model of deep learning in the design of public structures113
IoU-aware single-stage object detector for accurate localization99
FMD-Yolo: An efficient face mask detection method for COVID-19 prevention and control in public97
A framework of human action recognition using length control features fusion and weighted entropy-variances based feature selection96
Deep multimodal fusion for semantic image segmentation: A survey86
Intelligent video anomaly detection and classification using faster RCNN with deep reinforcement learning model75
A review on object pose recovery: From 3D bounding box detectors to full 6D pose estimators63
A review on 2D instance segmentation based on deep neural networks61
Anomaly detection in surveillance video based on bidirectional prediction59
Deep learning-based detection from the perspective of small or tiny objects: A survey53
ReMOT: A model-agnostic refinement for multiple object tracking50
Deep learning-based person re-identification methods: A survey and outlook of recent works48
Cross-resolution learning for Face Recognition44
Intelligent detection of building cracks based on deep learning40
Visual question answering model based on graph neural network and contextual attention39
Intelligent deep learning based ethnicity recognition and classification using facial images38
Motion saliency based multi-stream multiplier ResNets for action recognition37
Person search: New paradigm of person re-identification: A survey and outlook of recent works36
A review of deep learning techniques for 2D and 3D human pose estimation36
A Survey on Object Detection for the Internet of Multimedia Things (IoMT) using Deep Learning and Event-based Middleware: Approaches, Challenges, and Future Directions35
Exploring region relationships implicitly: Image captioning with visual relationship attention33
Optimization of face recognition algorithm based on deep learning multi feature fusion driven by big data32
LSTM with bio inspired algorithm for action recognition in sports videos30
Iris and periocular biometrics for head mounted displays: Segmentation, recognition, and synthetic data generation29
An improved YOLOv5 method for large objects detection with multi-scale feature cross-layer fusion network28
A survey of iris datasets25
R4 Det: Refined single-stage detector with feature recursion and refinement for rotating object detection in aerial images24
Generative adversarial networks and their application to 3D face generation: A survey24
A survey of micro-expression recognition24
Multimodal facial biometrics recognition: Dual-stream convolutional neural networks with multi-feature fusion layers23
Robust biometric authentication system with a secure user template23
Projection-dependent input processing for 3D object recognition in human robot interaction systems23
A two-stage real-time YOLOv2-based road marking detector with lightweight spatial transformation-invariant classification23
Facial expression recognition using human machine interaction and multi-modal visualization analysis for healthcare applications22
Application of 3D laser scanning technology for image data processing in the protection of ancient building sites through deep learning22
CrossATNet - a novel cross-attention based framework for sketch-based image retrieval21
Efficient pedestrian detection in top-view fisheye images using compositions of perspective view patches21
PCANet: Pyramid convolutional attention network for semantic segmentation21
EDS pooling layer20
RoI Tanh-polar transformer network for face parsing in the wild20
Attention-guided chained context aggregation for semantic segmentation19
A survey of methods, datasets and evaluation metrics for visual question answering19
Development of an embedded road boundary detection system based on deep learning19
An unsupervised domain adaptation scheme for single-stage artwork recognition in cultural sites19
Cluster adaptation networks for unsupervised domain adaptation19
Feedback-driven loss function for small object detection18
Generalizable deep features for ocular biometrics18
MEmoR: A Multimodal Emotion Recognition using affective biomarkers for smart prediction of emotional health for people analytics in smart industries18
Improving image captioning with Pyramid Attention and SC-GAN18
Energy clustering for unsupervised person re-identification18
Improved generative adversarial network and its application in image oil painting style transfer18
FastNet: Fast high-resolution network for human pose estimation18
Learning to disentangle scenes for person re-identification18
Few-Shot learning for face recognition in the presence of image discrepancies for limited multi-class datasets17
Facial expression recognition using densely connected convolutional neural network and hierarchical spatial attention17
Multi-stream slowFast graph convolutional networks for skeleton-based action recognition17
Investigating bias in deep face analysis: The KANFace dataset and empirical study17
Revisiting crowd counting: State-of-the-art, trends, and future perspectives16
Multiscale parallel deep CNN (mpdCNN) architecture for the real low-resolution face recognition for surveillance16
Multi-information-based convolutional neural network with attention mechanism for pedestrian trajectory prediction16
A deep-shallow and global–local multi-feature fusion network for photometric stereo16
Lightweight and computationally faster Hypermetropic Convolutional Neural Network for small size object detection16
Unsupervised face Frontalization for pose-invariant face recognition16
Multi-view dynamic facial action unit detection16
Synthetic data for face recognition: Current state and future prospects15
SalFBNet: Learning pseudo-saliency distribution via feedback convolutional networks15
Cross-Correlated Attention Networks for Person Re-Identification15
Point cloud completion using multiscale feature fusion and cross-regional attention15
A high-efficiency energy and storage approach for IoT applications of facial recognition15
Dense convolutional feature histograms for robust visual object tracking15
Cross-database and cross-attack Iris presentation attack detection using micro stripes analyses15
Zero-sum game theory model for segmenting skin regions15
An efficient foreign objects detection network for power substation14
Collaborative representation of blur invariant deep sparse features for periocular recognition from smartphones14
Synergetic reconstruction from 2D pose and 3D motion for wide-space multi-person video motion capture in the wild14
Bald eagle search optimization with deep transfer learning enabled age-invariant face recognition model14
SalED: Saliency prediction with a pithy encoder-decoder architecture sensing local and global information14
Attention guided contextual feature fusion network for salient object detection14
IRANet: Identity-relevance aware representation for cloth-changing person re-identification14
An attention-based deep learning model for multiple pedestrian attributes recognition14
A neural network aided attuned scheme for gun detection in video surveillance images13
CrossFusion net: Deep 3D object detection based on RGB images and point clouds in autonomous driving13
Dual-path CNN with Max Gated block for text-based person re-identification13
Beyond modality alignment: Learning part-level representation for visible-infrared person re-identification13
CAM: A fine-grained vehicle model recognition method based on visual attention model13
The effect of image recognition traffic prediction method under deep learning and naive Bayes algorithm on freeway traffic safety13
Self-trained prediction model and novel anomaly score mechanism for video anomaly detection13
Certifiable relative pose estimation13
ERF-YOLO: A YOLO algorithm compatible with fewer parameters and higher accuracy13
Face anti-spoofing detection based on multi-scale image quality assessment13
Multi-level refinement enriched feature pyramid network for object detection13
Convolutional prototype learning for zero-shot recognition13
PU-GACNet: Graph Attention Convolution Network for Point Cloud Upsampling13
HPRNet: Hierarchical point regression for whole-body human pose estimation12
Digital video intrusion intelligent detection method based on narrowband Internet of Things and its application12
A new perceptual hashing method for verification and identity classification of occluded faces12
Explaining VQA predictions using visual grounding and a knowledge base12
Fusion of iris and sclera using phase intensive rubbersheet mutual exclusion for periocular recognition12
Feature based video stabilization based on boosted HAAR Cascade and representative point matching algorithm12
A calibration method of computer vision system based on dual attention mechanism12
Multi-level prediction Siamese network for real-time UAV visual tracking12
Dense open-set recognition based on training with noisy negative images12
MFC-Net : Multi-feature fusion cross neural network for salient object detection12
Multimodal assessment of apparent personality using feature attention and error consistency constraint12
Variance-guided attention-based twin deep network for cross-spectral periocular recognition12
I-SOCIAL-DB: A labeled database of images collected from websites and social media for Iris recognition12
Boundary guidance network for camouflage object detection12
A novel co-attention computation block for deep learning based image co-segmentation12
Cancelable Iris template generation by aggregating patch level ordinal relations with its holistically extended performance and security analysis11
Expression recognition with deep features extracted from holistic and part-based models11
Combining complementary trackers for enhanced long-term visual object tracking11
An automated hyperparameter tuned deep learning model enabled facial emotion recognition for autonomous vehicle drivers11
Double anchor embedding for accurate multi-person 2D pose estimation11
Real-time semantic segmentation with local spatial pixel adjustment11
Spatiotemporal module for video saliency prediction based on self-attention11
Pose-guided part matching network via shrinking and reweighting for occluded person re-identification11
A study on attention-based LSTM for abnormal behavior recognition with variable pooling11
Novel features for art movement classification of portrait paintings11
Improved YOLOX-X based UAV aerial photography object detection algorithm11
From known to the unknown: Transferring knowledge to answer questions about novel visual and semantic concepts11
Dense graph convolutional neural networks on 3D meshes for 3D object segmentation and classification11
Intelligent multimodal pedestrian detection using hybrid metaheuristic optimization with deep learning model10
Real-time semantic segmentation with weighted factorized-depthwise convolution10
Gender based face aging with cycle-consistent adversarial networks10
Detection of panoramic vision pedestrian based on deep learning10
Few-shot object detection via baby learning10
Image captioning via proximal policy optimization10
Co-occurrence of deep convolutional features for image search10
Using synthetic data for person tracking under adverse weather conditions10
Point cloud classification with deep normalized Reeb graph convolution10
Detection of anomaly in surveillance videos using quantum convolutional neural networks10
Deep hybrid learning for facial expression binary classifications and predictions10
E2E-VSDL: End-to-end video surveillance-based deep learning model to detect and prevent criminal activities10
Face mask detection using deep convolutional neural network and multi-stage image processing9
PDA: Proxy-based domain adaptation for few-shot image recognition9
Tracking fiducial markers with discriminative correlation filters9
Viewpoint constrained and unconstrained Cricket stroke localization from untrimmed videos9
Demographic classification through pupil analysis9
Joint detection and tracking in videos with identification features9
Edge supervision and multi-scale cost volume for stereo matching9
Dual-branch adaptive attention transformer for occluded person re-identification9
A motion model based on recurrent neural networks for visual object tracking9
Interactive multi-scale feature representation enhancement for small object detection9
E2E-V2SResNet: Deep residual convolutional neural networks for end-to-end video driven speech synthesis9
Composite recurrent network with internal denoising for facial alignment in still and video images in the wild9
Transformer models for enhancing AttnGAN based text to image generation9
Video prediction by efficient transformers9
Does explainable machine learning uncover the black box in vision applications?9
Lightweight boundary refinement module based on point supervision for semantic segmentation8
Geometry consistency aware confidence evaluation for feature matching8
Whether normalized or not? Towards more robust iris recognition using dynamic programming8
MDCS with fully encoding the information of local shape description for 3D Rigid Data matching8
Boundary graph convolutional network for temporal action detection8
A pooling-based feature pyramid network for salient object detection8
Video-based person re-identification by intra-frame and inter-frame graph neural network8
How robust are discriminatively trained zero-shot learning models?8
Triangulate geometric constraint combined with visual-flow fusion network for accurate 6DoF pose estimation8
Activity guided multi-scales collaboration based on scaled-CNN for saliency prediction8
Single stage architecture for improved accuracy real-time object detection on mobile devices8
Knowledge distillation methods for efficient unsupervised adaptation across multiple domains8
RAMT-GAN: Realistic and accurate makeup transfer with generative adversarial network8
Towards generalized morphing attack detection by learning residuals8
ASPset: An outdoor sports pose video dataset with 3D keypoint annotations8
Texture classification-based feature processing for violence-based anomaly detection in crowded environments8
Handcrafted localized phase features for human action recognition8
Multistage temporal convolution transformer for action segmentation8
Short-term anchor linking and long-term self-guided attention for video object detection8
Clothing generation by multi-modal embedding: A compatibility matrix-regularized GAN model8
View knowledge transfer network for multi-view action recognition8
Improving eye movement biometrics in low frame rate eye-tracking devices using periocular and eye blinking features8
Edge-aware salient object detection network via context guidance8
Crowd density detection method based on crowd gathering mode and multi-column convolutional neural network8
Adversarial sliced Wasserstein domain adaptation networks8
Emotion detection and face recognition of drivers in autonomous vehicles in IoT platform8
SiaTrans: Siamese transformer network for RGB-D salient object detection with depth image classification8
Cross-modal feature extraction and integration based RGBD saliency detection8
Context-based image explanations for deep neural networks8
Pose-guided counterfactual inference for occluded person re-identification8
0.036936044692993