Machine Vision and Applications

Papers
(The median citation count of Machine Vision and Applications is 1. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-05-01 to 2025-05-01.)
ArticleCitations
End-to-end unsupervised learning of latent-space clustering for image segmentation via fully dense-UNet and fuzzy C-means loss110
Real estate pricing prediction via textual and visual features86
ECM: arbitrary style transfer via Enhanced-Channel Module64
Medtransnet: advanced gating transformer network for medical image classification59
Multi-shot person re-identification based on appearance and spatial-temporal cues in a large camera network50
A method for high dynamic range 3D color modeling of objects through a color camera48
Text-driven object affordance for guiding grasp-type recognition in multimodal robot teaching32
Development of a robust cascaded architecture for intelligent robot grasping using limited labelled data30
Class-aware cross-domain target detection based on cityscape in fog29
Obs-tackle: an obstacle detection system to assist navigation of visually impaired using smartphones27
Triple attention and global reasoning Siamese networks for visual tracking25
A hybrid overlapping group sparsity denoising model with fractional-order total variation and non-convex regularizer25
Adaptive fast scale estimation, with accurate online model update based on kernelized correlation filter22
Enforced clustering for zero-to-one-shot texture anomaly detection22
Global-guided cross-reference network for co-salient object detection21
Motion-region annotation for complex videos via label propagation across occluders19
A motion direction detecting model for colored images based on the Hassenstein–Reichardt model19
A stereo vision SLAM with moving vehicles tracking in outdoor environment19
Innovative surface roughness detection method based on white light interference images18
CGA-Net: channel-wise gated attention network for improved super-resolution in remote sensing imagery17
Specular Surface Detection with Deep Static Specular Flow and Highlight17
A multi-modal framework for continuous and isolated hand gesture recognition utilizing movement epenthesis detection17
Alternate guidance network for boundary-aware camouflaged object detection16
Real-World super-resolution under the guidance of optimal transport14
An unsupervised approach for thermal to visible image translation using autoencoder and generative adversarial network14
Discriminant distance template matching for image recognition14
Modeling driving task-relevant attention for intelligent vehicles using triplet ranking13
Correction: Unsupervised single-shot depth estimation using perceptual reconstruction13
Ubiquitous vision of transformers for person re-identification13
Generalized few-shot learning under large scope by using episode-wise regularizing imprinting12
Axes-aligned non-linear optimized PnP algorithm12
AFC-Net: adjacent feature complementary for crowded pedestrian detection12
3D face parsing based on 2D CPFNet: conformal parameterized face parsing network12
Generation of realistic synthetic cable images to train deep learning segmentation models12
Kernel based local matching network for video object segmentation12
Two-stage structural information enhancement for source-free domain adaptation11
A dual progressive strategy for long-tailed visual recognition11
Online continual learning with saliency-guided experience replay using tiny episodic memory11
Improving knowledge distillation via pseudo-multi-teacher network10
Twinned attention network for occlusion-aware facial expression recognition10
Generating comprehensive scene graphs with integrated multiple attribute detection10
Traversing the subspace of adversarial patches10
LDNet: low-light image enhancement with joint lighting and denoising10
CAMTrack: a combined appearance-motion method for multiple-object tracking10
GOA-net: generic occlusion aware networks for visual tracking10
Images denoising for COVID-19 chest X-ray based on multi-resolution parallel residual CNN10
Camera-based mapping in search-and-rescue via flying and ground robot teams10
Chfnet: a coarse-to-fine hierarchical refinement model for monocular depth estimation10
Cross-validation of a semantic segmentation network for natural history collection specimens9
Thin section analysis for ceramic petrography using motion analysis and segmentation techniques9
Explainable interactive projections of images9
EAF-Net: an enhancement and aggregation–feedback network for RGB-T salient object detection9
Adversarial imitation learning-based network for category-level 6D object pose estimation9
X-Align++: cross-modal cross-view alignment for Bird’s-eye-view segmentation8
OmniGlasses: an optical aid for stereo vision CNNs to enable omnidirectional image processing8
Correction: Real estate pricing prediction via textual and visual features8
Fusing bilinear multi-channel gated vector for fine-grained classification8
MÆIDM: multi-scale anomaly embedding inpainting and discrimination for surface anomaly detection8
Shape related unknown object one-shot learning grasping8
Shape description losses for medical image segmentation8
IoU-aware feature fusion R-CNN for dense object detection8
Pakistan sign language recognition: leveraging deep learning models with limited dataset8
Clarity method of fog and dust image in fully mechanized mining face7
A comprehensive survey on SLAM and machine learning approaches for indoor autonomous navigation of mobile robots7
Attention-based global context network for driving maneuvers prediction7
An anisotropic non-local attention network for image segmentation7
Pattern recognition methodologies for pollen grain image classification: a survey7
Multi-scale convolution underwater image restoration network7
Real-time pedestrian pose estimation, tracking and localization for social distancing7
An adaptive interpolation and 3D reconstruction algorithm for underwater images7
An efficient ground segmentation approach for LiDAR point cloud utilizing adjacent grids7
A camera style-invariant learning and channel interaction enhancement fusion network for visible-infrared person re-identification7
DisRot: boosting the generalization capability of few-shot learning via knowledge distillation and self-supervised learning7
Automatic cables segmentation from a substation device based on 3D point cloud7
Sparse representation with enhanced nonlocal self-similarity for image denoising6
Tensor-guided learning for image denoising using anisotropic PDEs6
A novel multi-feature fusion deep neural network using HOG and VGG-Face for facial expression classification6
Enhanced normal estimation of point clouds via fine-grained geometric information learning6
Identification of facial skin diseases from face phenotypes using FSDNet in uncontrolled environment6
On the safety of vulnerable road users by cyclist detection and tracking6
Actions as points: a simple and efficient detector for skeleton-based temporal action detection6
Environmental factors-aware two-stream GCN for skeleton-based behavior recognition6
Deep-plane sweep generative adversarial network for consistent multi-view depth estimation6
Lesion-aware attention with neural support vector machine for retinopathy diagnosis6
Evolution algorithm of parametric active contour model based on Gaussian smoothing filter6
Boosting few-shot learning via selective patch embedding by comprehensive sample analysis6
A robust information hiding algorithm based on lossless encryption and NSCT-HD-SVD6
A review of adaptable conventional image processing pipelines and deep learning on limited datasets5
Distortion diminishing with vulnerability filters pruning5
Kinematic calibration of a hexapod robot based on monocular vision5
Cascaded attention-guided multi-granularity feature learning for person re-identification5
Regional filtering distillation for object detection5
Text-to-face synthesis based on facial landmarks prediction5
Welding splash and arc noise reduction imaging model based on computationally efficient pairwise response serving welding process library5
A dual-path U-Net for pulmonary vessel segmentation method based on lightweight 3D attention5
ConsInstancy: learning instance representations for semi-supervised panoptic segmentation of concrete aggregate particles5
Tree-managed network ensembles for video prediction5
Saliency detection based on color descriptor and high-level prior5
PTDS CenterTrack: pedestrian tracking in dense scenes with re-identification and feature enhancement5
The effect of camera settings on image noise and accuracy of subpixel image registration5
Parametric loss-based super-resolution for scene text recognition5
Semi-supervised metric learning incorporating weighted triplet constraint and Riemannian manifold optimization for classification5
Guest editorial: special issue on human pose estimation and its applications5
VGT-MOT: visibility-guided tracking for online multiple-object tracking5
Robust semantic segmentation method of urban scenes in snowy environment5
Swin transformer with part-level tokenization for occluded person re-identification4
Rock segmentation visual system for assisting driving in TBM construction4
Delaunay walk for fast nearest neighbor: accelerating correspondence matching for ICP4
Automatic high fidelity foot contact location and timing for elite sprinting4
Local region-learning modules for point cloud classification4
CCTV-Calib: a toolbox to calibrate surveillance cameras around the globe4
Residual feature learning with hierarchical calibration for gaze estimation4
Naturally constrained reject option classification4
Human pose estimation based on lightweight basicblock4
A collaborative SLAM method for dual payload-carrying UAVs in denied environments4
TFF-temporal fusion framework for advancing video retrieval through long-range dependencies and multi-modal intent4
Multi-planar geometry and latent image recovery from a single motion-blurred image4
FLAVR: flow-free architecture for fast video frame interpolation4
Toward phytoplankton parasite detection using autoencoders4
Symmetry-induced ambiguity in orientation estimation from RGB images4
React: recognize every action everywhere all at once4
YOLOMH: you only look once for multi-task driving perception with high efficiency4
Personvit: large-scale self-supervised vision transformer for person re-identification4
Residual shuffle attention network for image super-resolution4
Computer-aided automatic detection of acrylamide in deep-fried carbohydrate-rich food items using deep learning4
BiTransformer: augmenting semantic context in video captioning via bidirectional decoder4
Structure–texture decomposition-based dehazing of a single image with large sky area4
An image quality assessment method based on edge extraction and singular value for blurriness4
Unsupervised single-shot depth estimation using perceptual reconstruction4
Beyond Kalman filters: deep learning-based filters for improved object tracking4
Enhanced keypoint information and pose-weighted re-ID features for multi-person pose estimation and tracking3
Gait recognition using free-area transformer networks3
Online camera auto-calibration appliable to road surveillance3
A robust vehicle tracking in low-altitude UAV videos3
Image dataset creation and networks improvement method based on CAD model and edge operator for object detection in the manufacturing industry3
Spatial-temporal graph-guided global attention network for video-based person re-identification3
SiamCAR-Kal: anti-occlusion tracking algorithm for infrared ground targets based on SiamCAR and Kalman filter3
Superpixel-based foreground-preserving image stitching3
Removing cloud shadows from ground-based solar imagery3
Ising granularity image analysis on VAE–GAN3
Wavelet and PCA-based glaucoma classification through novel methodological enhanced retinal images3
Unsupervised domain adaptation by cross-domain consistency learning for CT body composition3
Normalized margin loss for action unit detection3
Entangled appearance and motion structures network for multi-object tracking and segmentation3
Abnormal event detection by variation matching3
Bidirectional cascaded multimodal attention for multiple choice visual question answering3
Hierarchical contrastive adaptation for cross-domain object detection3
Vision-based power line cables and pylons detection for low flying aircraft3
Depthwise grouped convolution for object detection3
Mitigating adversarial perturbations via weakly supervised object location and regions recombination3
ViCap-AD: video caption-based weakly supervised video anomaly detection3
The general framework for few-shot learning by kernel HyperNetworks3
RCA-IUnet: a residual cross-spatial attention-guided inception U-Net model for tumor segmentation in breast ultrasound imaging3
FESAR: SAR ship detection model based on local spatial relationship capture and fused convolutional enhancement3
Gabor capsule network with preprocessing blocks for the recognition of complex images3
Pixel representations, sampling, and label correction for semantic part detection3
A deep Retinex network for underwater low-light image enhancement3
Ipdm: identity preserving diffusion model for face sketch and photo synthesis3
Real-time 3D reconstruction using point-dependent pose graph optimization framework3
Self-attention network for few-shot learning based on nearest-neighbor algorithm3
Optimized hand pose estimation CrossInfoNet-based architecture for embedded devices3
Supervised contrastive learning with multi-scale interaction and integrity learning for salient object detection3
Multimodal dance style transfer3
Knowledge-based hybrid connectionist models for morphologic reasoning3
FERGCN: facial expression recognition based on graph convolution network3
Ssman: self-supervised masked adaptive network for 3D human pose estimation3
Continuous sign language recognition based on motor attention mechanism and frame-level self-distillation2
Exploring filter placement in convolutional layer topologies based on ResNet for image classification2
Addressing the generalization of 3D registration methods with a featureless baseline and an unbiased benchmark2
Material classification of polishing and convex surface objects based on photon accumulation point spread function (PAPSF) from imaging model of binocular pulsed time-of-flight camera2
Few-shot object detection via data augmentation and distribution calibration2
Position Puzzle Network and Augmentation: localizing human keypoints beyond the bounding box2
Utilizing incremental branches on a one-stage object detection framework to avoid catastrophic forgetting2
Synergizing LiDAR and Augmented Reality for precise real-time interior distance measurements for mobile devices2
Interpretable visual transmission lines inspections using pseudo-prototypical part network2
An annotated image database of building facades categorized into land uses for object detection using deep learning2
Motioninsights: real-time object tracking in streaming video2
MYFED: a dataset of affective face videos for investigation of emotional facial dynamics as a soft biometric for person identification2
Pixel-wise confidence estimation for segmentation in Bayesian Convolutional Neural Networks2
Consensus similarity learning based on tensor nuclear norm2
Zero-shot action recognition by clustered representation with redundancy-free features2
Ensemble learning with advanced fast image filtering features for semi-global matching2
Squeezed fire binary segmentation model using convolutional neural network for outdoor images on embedded device2
Efficient abnormality detection using patch-based 3D convolution with recurrent model2
Dynamically throttleable neural networks2
Discriminative feature learning through feature distance loss2
Multi-level receptive field feature reuse for multi-focus image fusion2
Unsupervised learning of probabilistic subspaces for multi-spectral and multi-temporal image-based disaster mapping2
SGBGAN: minority class image generation for class-imbalanced datasets2
Learning more discriminative local descriptors with parameter-free weighted attention for few-shot learning2
Interpretability of fingerprint presentation attack detection systems: a look at the “representativeness” of samples against never-seen-before attacks2
Generating quality grasp rectangle using Pix2Pix GAN for intelligent robot grasping2
Foreground enhancement network for object detection in sonar images2
FDT − Dr2T: a unified Dense Radiology Report Generation Transformer framework for X-ray images2
Wide-baseline multi-camera calibration from a room filled with people2
Contextual Guided Segmentation Framework for Semi-supervised Video Instance Segmentation2
Cmf-transformer: cross-modal fusion transformer for human action recognition2
SNFR: salient neighbor decoding and text feature refining for scene text recognition2
Rocnet: 3D robust registration of points clouds using deep learning2
Human–object interaction detection based on disentangled axial attention transformer2
Region gradient-guided diffusion model for underwater image enhancement2
A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation2
MFMANet: a multispectral pedestrian detection network using multi-resolution RGB feature reuse with multi-scale FIR attentions2
Multi-person 3D pose estimation from unlabelled data2
Using synthesized facial views for active face recognition2
Enhanced machine perception by a scalable fusion of RGB–NIR image pairs in diverse exposure environments2
Trusted 3D self-supervised representation learning with cross-modal settings2
Multimodal fine-grained grocery product recognition using image and OCR text2
Accurate IoU computation for rotated bounding boxes in $${\mathbb {R}}^2$$ and $${\mathbb {R}}^3$$2
Similarity contrastive estimation for image and video soft contrastive self-supervised learning2
CMNet: a novel model and design rationale based on comparison studies and synergy of CNN and MetaFormer2
Virtual home staging and relighting from a single panorama under natural illumination2
Correction to: A compressed matrix sequence method for solving normal equations of bundle adjustment2
Visible-infrared person re-identification model based on feature consistency and modal indistinguishability1
Study on defect detection of metal castings based on supervised enhancement and attention distillation1
Teacher–student training and triplet loss to reduce the effect of drastic face occlusion1
SiamMMF: multi-modal multi-level fusion object tracking based on Siamese networks1
A novel method for 3D knee anatomical landmark localization by combining global and local features1
Single image dehazing based on multi-scale segmentation and deep learning1
AP-TransNet: a polarized transformer based aerial human action recognition framework1
Effective triplet mining improves training of multi-scale pooled CNN for image retrieval1
Estimating human body orientation from image depth data and its implementation1
IAFPN: interlayer enhancement and multilayer fusion network for object detection1
Global attention guided multi-scale network for face image super-resolution1
A transformer-based neural ODE for dense prediction1
PerSnake: a real-time pedestrian instance segmentation network using contour representation1
A pothole can be seen with two eyes: an ensemble approach to pothole detection1
Underwater image object detection based on multi-scale feature fusion1
RAU-Net: U-Net network based on residual multi-scale fusion and attention skip layer for overall spine segmentation1
Designing effective power law-based loss function for faster and better bounding box regression1
Closing the gap in domain adaptation for semantic segmentation: a time-aware method1
Two-stream lightweight sign language transformer1
BoostTrack: boosting the similarity measure and detection confidence for improved multiple object tracking1
Integration of 2D iteration and a 3D CNN-based model for multi-type artifact suppression in C-arm cone-beam CT1
WideCaps: a wide attention-based capsule network for image classification1
A concept-aware explainability method for convolutional neural networks1
LTM: efficient learning with triangular topology constraint for feature matching with heavy outliers1
Projection model-driven image stitching: a novel warping method using epipolar displacement field1
Deep learning-based object recognition in multispectral satellite imagery for real-time applications1
Graph convolutional networks and LSTM for first-person multimodal hand action recognition1
Editor’s Note: Special Issue from Winter Conference on Applications of Computer Vision - WACV 20231
A review of recent techniques for person re-identification1
Inflated 3D ConvNet context analysis for violence detection1
Hyperspectral image dynamic range reconstruction using deep neural network-based denoising methods1
Adversarial learning for unguided single depth map completion of indoor scenes1
End-to-end optimized image compression with the frequency-oriented transform1
Keyframe-based RGB-D dense visual SLAM fused semantic cues in dynamic scenes1
Persistent animal identification leveraging non-visual markers1
Automated diagnosis of diverse coffee leaf images through a stage-wise aggregated triple deep convolutional neural network1
A deep learning framework for finding illicit images/videos of children1
0.084357976913452