Machine Vision and Applications

Papers
(The TQCC of Machine Vision and Applications is 5. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-06-01 to 2026-06-01.)
ArticleCitations
Class-aware cross-domain target detection based on cityscape in fog74
StyleDemorpher: high-quality face demorphing via StyleGAN2’s latent space73
Text-driven object affordance for guiding grasp-type recognition in multimodal robot teaching60
Development of a robust cascaded architecture for intelligent robot grasping using limited labelled data47
Non-contact SpO2 monitoring via multi-channel pulse signals from facial videos using machine learning36
Medtransnet: advanced gating transformer network for medical image classification23
Obs-tackle: an obstacle detection system to assist navigation of visually impaired using smartphones22
ECM: arbitrary style transfer via Enhanced-Channel Module21
A method for high dynamic range 3D color modeling of objects through a color camera21
Real estate pricing prediction via textual and visual features21
A hybrid overlapping group sparsity denoising model with fractional-order total variation and non-convex regularizer20
DMU-Net: a dual stream multi-scale U-Net for image splicing forgery localization20
End-to-end unsupervised learning of latent-space clustering for image segmentation via fully dense-UNet and fuzzy C-means loss20
Using breast density for hybrid region and pixel-level loss function19
An integration of deep network with random forests framework for image quality assessment in real-time19
Lightweight image dehazing via physics-guided neural networks19
Innovative surface roughness detection method based on white light interference images18
Enforced clustering for zero-to-one-shot texture anomaly detection17
Global-guided cross-reference network for co-salient object detection17
A stereo vision SLAM with moving vehicles tracking in outdoor environment17
Editing implicit and explicit representations of radiance fields: a survey16
MSPKD: multi spatial projectors for knowledge distillation in semantic segmentation15
LOID: Lane Occlusion Inpainting and Detection for Enhanced Autonomous Driving Systems15
Enhancing hyperspectral image classification: DeepXTE for efficient semantic feature extraction14
A motion direction detecting model for colored images based on the Hassenstein–Reichardt model14
Tcdgnet: a texture and chromaticity dual-guided network for color document super-resolution13
Modeling driving task-relevant attention for intelligent vehicles using triplet ranking13
Axes-aligned non-linear optimized PnP algorithm13
L-VAE: variational auto-encoder with learnable beta for disentangled representation13
Generalized few-shot learning under large scope by using episode-wise regularizing imprinting13
Motion-region annotation for complex videos via label propagation across occluders13
Discriminant distance template matching for image recognition12
CGA-Net: channel-wise gated attention network for improved super-resolution in remote sensing imagery12
Ubiquitous vision of transformers for person re-identification12
AFC-Net: adjacent feature complementary for crowded pedestrian detection12
Multi-feature fusion network based on wavelet transform and multi-scale cross-response for hyperspectral image classification12
Generation of realistic synthetic cable images to train deep learning segmentation models12
Alternate guidance network for boundary-aware camouflaged object detection12
Novel Cauchy mixture modeling combined with the Sparse-RCNN architecture for enhanced multi-person pose estimation11
A multi-modal framework for continuous and isolated hand gesture recognition utilizing movement epenthesis detection11
Kernel based local matching network for video object segmentation11
Correction: Unsupervised single-shot depth estimation using perceptual reconstruction11
A dual progressive strategy for long-tailed visual recognition11
Specular Surface Detection with Deep Static Specular Flow and Highlight11
Redundancy-free label space and dual-feature collaboration for multi-label feature selection10
Improving knowledge distillation via pseudo-multi-teacher network10
A general two-stage framework of tensor low-rank representation for enhanced image denoising and clustering10
Twinned attention network for occlusion-aware facial expression recognition10
Online continual learning with saliency-guided experience replay using tiny episodic memory10
Two-stage structural information enhancement for source-free domain adaptation10
RPIM-net: residual channel prior-driven interaction multi-scale network for stereo image deraining10
Traversing the subspace of adversarial patches10
Benchmarking large and small MLLMs10
3D face parsing based on 2D CPFNet: conformal parameterized face parsing network10
SGL-SLAM: a semantic and geometric RGB-D visual SLAM enhanced with line features for dynamic environments10
Camera-based mapping in search-and-rescue via flying and ground robot teams10
CAMTrack: a combined appearance-motion method for multiple-object tracking10
Enhanced hyperspectral image reconstruction via parallel 2D/3D convolution with global layer purification and multiscale pooling fusion9
Generating comprehensive scene graphs with integrated multiple attribute detection9
Audio-visual localization based on spatial relative sound order9
Chfnet: a coarse-to-fine hierarchical refinement model for monocular depth estimation9
MÆIDM: multi-scale anomaly embedding inpainting and discrimination for surface anomaly detection9
Enhanced point cloud processing through geometric affine transformations and curvature-based sampling9
Adversarial imitation learning-based network for category-level 6D object pose estimation9
X-Align++: cross-modal cross-view alignment for Bird’s-eye-view segmentation9
Correction: Real estate pricing prediction via textual and visual features9
Thin section analysis for ceramic petrography using motion analysis and segmentation techniques9
LDNet: low-light image enhancement with joint lighting and denoising9
Shape related unknown object one-shot learning grasping9
OmniGlasses: an optical aid for stereo vision CNNs to enable omnidirectional image processing9
Explainable interactive projections of images9
GOA-net: generic occlusion aware networks for visual tracking9
Automatic cables segmentation from a substation device based on 3D point cloud8
Shape description losses for medical image segmentation8
IoU-aware feature fusion R-CNN for dense object detection8
A camera style-invariant learning and channel interaction enhancement fusion network for visible-infrared person re-identification8
A comprehensive survey on SLAM and machine learning approaches for indoor autonomous navigation of mobile robots8
An efficient ground segmentation approach for LiDAR point cloud utilizing adjacent grids8
Multi-scale convolution underwater image restoration network8
Overcoming occlusions in AR, via multi-view, real-time 3D human pose estimation8
Reflection removal using recurrent polarization-to-polarization network8
Real-time pedestrian pose estimation, tracking and localization for social distancing8
EAF-Net: an enhancement and aggregation–feedback network for RGB-T salient object detection8
Fusing bilinear multi-channel gated vector for fine-grained classification8
FOCUS: Frequency-Optimized Conditioning of diffUSion models for mitigating catastrophic forgetting during test-time adaptation8
Pakistan sign language recognition: leveraging deep learning models with limited dataset8
YG-SLAM: dynamic environment-based geometric constraint point-line fusion visual SLAM system7
Cross-dataset video deepfake detection using Transformer and CNN architectures7
Visual-inertial SLAM with line segment merging and efficient feature tracking method7
DisRot: boosting the generalization capability of few-shot learning via knowledge distillation and self-supervised learning7
Integrating visual-semantic relational reasoning for fake news detection on video platforms7
Mobgazenet: robust gaze estimation mobile network based on progressive attention mechanisms7
Tensor-guided learning for image denoising using anisotropic PDEs7
Parametric loss-based super-resolution for scene text recognition7
An adaptive interpolation and 3D reconstruction algorithm for underwater images7
Enhancing object SLAM for outdoor environments: robust reconstruction and relocalization7
Meta-learning enhanced global–local feature fusion for image quality assessment7
Actions as points: a simple and efficient detector for skeleton-based temporal action detection7
Evolution algorithm of parametric active contour model based on Gaussian smoothing filter7
Evolving brain tumor segmentation: differential evolution-optimized ensemble deep learning for multi-modal MRI analysis7
Multi-view dynamic reconstruction with cross-view smoothing based on surfel6
Welding splash and arc noise reduction imaging model based on computationally efficient pairwise response serving welding process library6
Semi-supervised metric learning incorporating weighted triplet constraint and Riemannian manifold optimization for classification6
ConsInstancy: learning instance representations for semi-supervised panoptic segmentation of concrete aggregate particles6
TFF-temporal fusion framework for advancing video retrieval through long-range dependencies and multi-modal intent6
Accelerated fixed-point iterations for image deblurring and defiltering6
A review of adaptable conventional image processing pipelines and deep learning on limited datasets6
Text-to-face synthesis based on facial landmarks prediction6
Logit scaling for out-of-distribution detection6
A novel multi-feature fusion deep neural network using HOG and VGG-Face for facial expression classification6
PGA6D: 6D pose estimation for grasping and assemblying based on keypoints voting6
Boosting few-shot learning via selective patch embedding by comprehensive sample analysis6
A dual-path U-Net for pulmonary vessel segmentation method based on lightweight 3D attention6
Environmental factors-aware two-stream GCN for skeleton-based behavior recognition6
Robust semantic segmentation method of urban scenes in snowy environment6
Residual shuffle attention network for image super-resolution6
Quality assessment of synthetic images via spatial distortion recognition6
PTDS CenterTrack: pedestrian tracking in dense scenes with re-identification and feature enhancement6
Guest editorial: special issue on human pose estimation and its applications6
Enhanced normal estimation of point clouds via fine-grained geometric information learning6
Improving change detection using conditional discriminative adversarial regularization6
Distortion diminishing with vulnerability filters pruning6
Kinematic calibration of a hexapod robot based on monocular vision6
Enhancing adversarial transferability via importance-aware pixel-level mask6
Tree-managed network ensembles for video prediction6
Diffusion-leveraged GAN dehazing driven by classification: a two-stage framework for real-world monitoring imagery6
Block-recurrent visual transformer for enhanced human detection in thermal imaging6
Multiple object tracking using weighted graph convolutional neural networks6
Swin transformer with part-level tokenization for occluded person re-identification5
React: recognize every action everywhere all at once5
Local region-learning modules for point cloud classification5
VGT-MOT: visibility-guided tracking for online multiple-object tracking5
YOLOMH: you only look once for multi-task driving perception with high efficiency5
Unsupervised single-shot depth estimation using perceptual reconstruction5
Human pose estimation based on lightweight basicblock5
Residual feature learning with hierarchical calibration for gaze estimation5
Edge-aware dual path network for medical image classification5
Cascaded attention-guided multi-granularity feature learning for person re-identification5
Regional filtering distillation for object detection5
Fine-grained 3D vehicle shape manipulation via latent space editing5
Naturally constrained reject option classification5
Rid-slam: a robust illumination-and-dynamics-aware RGB-D SLAM framework for indoor environments5
Optimized hand pose estimation CrossInfoNet-based architecture for embedded devices5
Toward phytoplankton parasite detection using autoencoders5
Beyond Kalman filters: deep learning-based filters for improved object tracking5
A collaborative SLAM method for dual payload-carrying UAVs in denied environments5
Personvit: large-scale self-supervised vision transformer for person re-identification5
BiTransformer: augmenting semantic context in video captioning via bidirectional decoder5
FLAVR: flow-free architecture for fast video frame interpolation5
Detecting violent deepfakes: dataset and a compact attention network with multi-scale supervision5
0.52271008491516