Machine Vision and Applications

Papers
(The TQCC of Machine Vision and Applications is 5. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-01-01 to 2026-01-01.)
ArticleCitations
A method for high dynamic range 3D color modeling of objects through a color camera70
Class-aware cross-domain target detection based on cityscape in fog57
Real estate pricing prediction via textual and visual features53
Development of a robust cascaded architecture for intelligent robot grasping using limited labelled data43
Text-driven object affordance for guiding grasp-type recognition in multimodal robot teaching41
ECM: arbitrary style transfer via Enhanced-Channel Module38
DMU-Net: a dual stream multi-scale U-Net for image splicing forgery localization37
StyleDemorpher: high-quality face demorphing via StyleGAN2’s latent space34
Triple attention and global reasoning Siamese networks for visual tracking32
A hybrid overlapping group sparsity denoising model with fractional-order total variation and non-convex regularizer28
Non-contact SpO2 monitoring via multi-channel pulse signals from facial videos using machine learning21
End-to-end unsupervised learning of latent-space clustering for image segmentation via fully dense-UNet and fuzzy C-means loss20
Medtransnet: advanced gating transformer network for medical image classification20
Obs-tackle: an obstacle detection system to assist navigation of visually impaired using smartphones19
Using breast density for hybrid region and pixel-level loss function19
Global-guided cross-reference network for co-salient object detection18
Motion-region annotation for complex videos via label propagation across occluders17
Enforced clustering for zero-to-one-shot texture anomaly detection17
MSPKD: multi spatial projectors for knowledge distillation in semantic segmentation16
A stereo vision SLAM with moving vehicles tracking in outdoor environment16
Editing implicit and explicit representations of radiance fields: a survey16
LOID: Lane Occlusion Inpainting and Detection for Enhanced Autonomous Driving Systems16
Innovative surface roughness detection method based on white light interference images15
Specular Surface Detection with Deep Static Specular Flow and Highlight14
L-VAE: variational auto-encoder with learnable beta for disentangled representation14
Ubiquitous vision of transformers for person re-identification14
A motion direction detecting model for colored images based on the Hassenstein–Reichardt model14
Alternate guidance network for boundary-aware camouflaged object detection13
Generalized few-shot learning under large scope by using episode-wise regularizing imprinting12
Axes-aligned non-linear optimized PnP algorithm12
Correction: Unsupervised single-shot depth estimation using perceptual reconstruction12
Generation of realistic synthetic cable images to train deep learning segmentation models12
Modeling driving task-relevant attention for intelligent vehicles using triplet ranking12
Discriminant distance template matching for image recognition12
AFC-Net: adjacent feature complementary for crowded pedestrian detection11
RPIM-net: residual channel prior-driven interaction multi-scale network for stereo image deraining11
Real-World super-resolution under the guidance of optimal transport11
Kernel based local matching network for video object segmentation11
A multi-modal framework for continuous and isolated hand gesture recognition utilizing movement epenthesis detection11
Novel Cauchy mixture modeling combined with the Sparse-RCNN architecture for enhanced multi-person pose estimation11
CGA-Net: channel-wise gated attention network for improved super-resolution in remote sensing imagery11
Traversing the subspace of adversarial patches10
Improving knowledge distillation via pseudo-multi-teacher network10
A dual progressive strategy for long-tailed visual recognition10
LDNet: low-light image enhancement with joint lighting and denoising10
3D face parsing based on 2D CPFNet: conformal parameterized face parsing network10
Two-stage structural information enhancement for source-free domain adaptation10
Enhanced hyperspectral image reconstruction via parallel 2D/3D convolution with global layer purification and multiscale pooling fusion10
Twinned attention network for occlusion-aware facial expression recognition9
SGL-SLAM: a semantic and geometric RGB-D visual SLAM enhanced with line features for dynamic environments9
Shape related unknown object one-shot learning grasping9
Generating comprehensive scene graphs with integrated multiple attribute detection9
Camera-based mapping in search-and-rescue via flying and ground robot teams9
Benchmarking large and small MLLMs9
Redundancy-free label space and dual-feature collaboration for multi-label feature selection9
Shape description losses for medical image segmentation9
CAMTrack: a combined appearance-motion method for multiple-object tracking9
Online continual learning with saliency-guided experience replay using tiny episodic memory9
Adversarial imitation learning-based network for category-level 6D object pose estimation9
Correction: Real estate pricing prediction via textual and visual features9
Cross-validation of a semantic segmentation network for natural history collection specimens8
MÆIDM: multi-scale anomaly embedding inpainting and discrimination for surface anomaly detection8
Chfnet: a coarse-to-fine hierarchical refinement model for monocular depth estimation8
Fusing bilinear multi-channel gated vector for fine-grained classification8
Audio-visual localization based on spatial relative sound order8
Thin section analysis for ceramic petrography using motion analysis and segmentation techniques8
EAF-Net: an enhancement and aggregation–feedback network for RGB-T salient object detection8
IoU-aware feature fusion R-CNN for dense object detection8
Explainable interactive projections of images8
X-Align++: cross-modal cross-view alignment for Bird’s-eye-view segmentation8
GOA-net: generic occlusion aware networks for visual tracking8
A camera style-invariant learning and channel interaction enhancement fusion network for visible-infrared person re-identification8
Pakistan sign language recognition: leveraging deep learning models with limited dataset8
OmniGlasses: an optical aid for stereo vision CNNs to enable omnidirectional image processing8
Multi-scale convolution underwater image restoration network8
An anisotropic non-local attention network for image segmentation7
YG-SLAM: dynamic environment-based geometric constraint point-line fusion visual SLAM system7
DisRot: boosting the generalization capability of few-shot learning via knowledge distillation and self-supervised learning7
Identification of facial skin diseases from face phenotypes using FSDNet in uncontrolled environment7
Integrating visual-semantic relational reasoning for fake news detection on video platforms7
Clarity method of fog and dust image in fully mechanized mining face7
Automatic cables segmentation from a substation device based on 3D point cloud7
Pattern recognition methodologies for pollen grain image classification: a survey7
Evolution algorithm of parametric active contour model based on Gaussian smoothing filter7
A comprehensive survey on SLAM and machine learning approaches for indoor autonomous navigation of mobile robots7
Real-time pedestrian pose estimation, tracking and localization for social distancing7
Attention-based global context network for driving maneuvers prediction7
An adaptive interpolation and 3D reconstruction algorithm for underwater images7
An efficient ground segmentation approach for LiDAR point cloud utilizing adjacent grids7
Enhanced normal estimation of point clouds via fine-grained geometric information learning6
A novel multi-feature fusion deep neural network using HOG and VGG-Face for facial expression classification6
Semi-supervised metric learning incorporating weighted triplet constraint and Riemannian manifold optimization for classification6
Parametric loss-based super-resolution for scene text recognition6
Mobgazenet: robust gaze estimation mobile network based on progressive attention mechanisms6
Actions as points: a simple and efficient detector for skeleton-based temporal action detection6
PGA6D: 6D pose estimation for grasping and assemblying based on keypoints voting6
Block-recurrent visual transformer for enhanced human detection in thermal imaging6
Tree-managed network ensembles for video prediction6
Environmental factors-aware two-stream GCN for skeleton-based behavior recognition6
Kinematic calibration of a hexapod robot based on monocular vision6
Visual-inertial SLAM with line segment merging and efficient feature tracking method6
Tensor-guided learning for image denoising using anisotropic PDEs6
ConsInstancy: learning instance representations for semi-supervised panoptic segmentation of concrete aggregate particles6
Improving change detection using conditional discriminative adversarial regularization6
Robust semantic segmentation method of urban scenes in snowy environment6
TFF-temporal fusion framework for advancing video retrieval through long-range dependencies and multi-modal intent6
Multi-view dynamic reconstruction with cross-view smoothing based on surfel6
Distortion diminishing with vulnerability filters pruning6
A dual-path U-Net for pulmonary vessel segmentation method based on lightweight 3D attention6
Meta-learning enhanced global–local feature fusion for image quality assessment6
Welding splash and arc noise reduction imaging model based on computationally efficient pairwise response serving welding process library6
Boosting few-shot learning via selective patch embedding by comprehensive sample analysis6
Quality assessment of synthetic images via spatial distortion recognition6
VGT-MOT: visibility-guided tracking for online multiple-object tracking6
YOLOMH: you only look once for multi-task driving perception with high efficiency5
FLAVR: flow-free architecture for fast video frame interpolation5
Fine-grained 3D vehicle shape manipulation via latent space editing5
React: recognize every action everywhere all at once5
Regional filtering distillation for object detection5
A review of adaptable conventional image processing pipelines and deep learning on limited datasets5
Residual shuffle attention network for image super-resolution5
Multiple object tracking using weighted graph convolutional neural networks5
Cascaded attention-guided multi-granularity feature learning for person re-identification5
Personvit: large-scale self-supervised vision transformer for person re-identification5
Toward phytoplankton parasite detection using autoencoders5
BiTransformer: augmenting semantic context in video captioning via bidirectional decoder5
Beyond Kalman filters: deep learning-based filters for improved object tracking5
Guest editorial: special issue on human pose estimation and its applications5
Text-to-face synthesis based on facial landmarks prediction5
Logit scaling for out-of-distribution detection5
Accelerated fixed-point iterations for image deblurring and defiltering5
Unsupervised single-shot depth estimation using perceptual reconstruction5
Human pose estimation based on lightweight basicblock5
Naturally constrained reject option classification5
Swin transformer with part-level tokenization for occluded person re-identification5
Delaunay walk for fast nearest neighbor: accelerating correspondence matching for ICP5
A collaborative SLAM method for dual payload-carrying UAVs in denied environments5
PTDS CenterTrack: pedestrian tracking in dense scenes with re-identification and feature enhancement5
Local region-learning modules for point cloud classification5
0.18626093864441