OOIR: Observatory of International Research

Papers

(The median citation count of Machine Vision and Applications is 1. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-08-01 to 2025-08-01.)
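
The selection rule described above can be sketched roughly as follows. This is a minimal illustration, not the site's actual code: the function name `select_papers` and the `(title, citations, published)` tuple shape are hypothetical. It assumes that the median is taken over the papers inside the date window and that the cut is inclusive, since the table includes papers with exactly 1 citation.

```python
from datetime import date
from statistics import median


def select_papers(papers, start, end, limit=250):
    """Return papers at or above the median citation count, most cited first.

    `papers` is a list of (title, citations, published) tuples; this shape
    is an assumption made for the sketch, not a documented format.
    """
    # Keep only papers published inside the reporting window (inclusive).
    in_window = [p for p in papers if start <= p[2] <= end]
    if not in_window:
        return []

    # Assumption: the threshold is the median over the windowed papers.
    threshold = median(citations for _, citations, _ in in_window)

    # Inclusive cut, then sort by citations descending (stable for ties).
    kept = [p for p in in_window if p[1] >= threshold]
    kept.sort(key=lambda p: p[1], reverse=True)

    # Cap the listing at `limit` rows (250 in the note above).
    return kept[:limit]
```

Calling `select_papers(papers, date(2021, 8, 1), date(2025, 8, 1))` on such a list would return at most 250 rows in the same order as the table: highest citation count first.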

Article	Citations
End-to-end unsupervised learning of latent-space clustering for image segmentation via fully dense-UNet and fuzzy C-means loss	133
ECM: arbitrary style transfer via Enhanced-Channel Module	103
A method for high dynamic range 3D color modeling of objects through a color camera	61
Text-driven object affordance for guiding grasp-type recognition in multimodal robot teaching	58
Development of a robust cascaded architecture for intelligent robot grasping using limited labelled data	50
Class-aware cross-domain target detection based on cityscape in fog	34
Real estate pricing prediction via textual and visual features	32
A hybrid overlapping group sparsity denoising model with fractional-order total variation and non-convex regularizer	31
Obs-tackle: an obstacle detection system to assist navigation of visually impaired using smartphones	29
Triple attention and global reasoning Siamese networks for visual tracking	29
Medtransnet: advanced gating transformer network for medical image classification	28
DMU-Net: a dual stream multi-scale U-Net for image splicing forgery localization	26
Global-guided cross-reference network for co-salient object detection	25
Motion-region annotation for complex videos via label propagation across occluders	24
Enforced clustering for zero-to-one-shot texture anomaly detection	22
A stereo vision SLAM with moving vehicles tracking in outdoor environment	21
A motion direction detecting model for colored images based on the Hassenstein–Reichardt model	18
MSPKD: multi spatial projectors for knowledge distillation in semantic segmentation	17
Innovative surface roughness detection method based on white light interference images	17
A multi-modal framework for continuous and isolated hand gesture recognition utilizing movement epenthesis detection	16
Using breast density for hybrid region and pixel-level loss function	16
Modeling driving task-relevant attention for intelligent vehicles using triplet ranking	14
Discriminant distance template matching for image recognition	14
Generalized few-shot learning under large scope by using episode-wise regularizing imprinting	14
Specular Surface Detection with Deep Static Specular Flow and Highlight	14
Correction: Unsupervised single-shot depth estimation using perceptual reconstruction	14
Generation of realistic synthetic cable images to train deep learning segmentation models	14
Real-World super-resolution under the guidance of optimal transport	14
AFC-Net: adjacent feature complementary for crowded pedestrian detection	14
Axes-aligned non-linear optimized PnP algorithm	14
Alternate guidance network for boundary-aware camouflaged object detection	12
L-VAE: variational auto-encoder with learnable beta for disentangled representation	12
Ubiquitous vision of transformers for person re-identification	11
Two-stage structural information enhancement for source-free domain adaptation	11
CGA-Net: channel-wise gated attention network for improved super-resolution in remote sensing imagery	11
Kernel based local matching network for video object segmentation	11
RPIM-net: residual channel prior-driven interaction multi-scale network for stereo image deraining	10
A dual progressive strategy for long-tailed visual recognition	10
Traversing the subspace of adversarial patches	10
CAMTrack: a combined appearance-motion method for multiple-object tracking	10
LDNet: low-light image enhancement with joint lighting and denoising	10
3D face parsing based on 2D CPFNet: conformal parameterized face parsing network	10
Twinned attention network for occlusion-aware facial expression recognition	10
Online continual learning with saliency-guided experience replay using tiny episodic memory	10
Camera-based mapping in search-and-rescue via flying and ground robot teams	10
Improving knowledge distillation via pseudo-multi-teacher network	10
Novel Cauchy mixture modeling combined with the Sparse-RCNN architecture for enhanced multi-person pose estimation	9
Explainable interactive projections of images	9
EAF-Net: an enhancement and aggregation–feedback network for RGB-T salient object detection	9
Generating comprehensive scene graphs with integrated multiple attribute detection	9
Adversarial imitation learning-based network for category-level 6D object pose estimation	9
Fusing bilinear multi-channel gated vector for fine-grained classification	9
GOA-net: generic occlusion aware networks for visual tracking	9
Cross-validation of a semantic segmentation network for natural history collection specimens	9
IoU-aware feature fusion R-CNN for dense object detection	8
Pakistan sign language recognition: leveraging deep learning models with limited dataset	8
A comprehensive survey on SLAM and machine learning approaches for indoor autonomous navigation of mobile robots	8
Shape related unknown object one-shot learning grasping	8
OmniGlasses: an optical aid for stereo vision CNNs to enable omnidirectional image processing	8
Thin section analysis for ceramic petrography using motion analysis and segmentation techniques	8
MÆIDM: multi-scale anomaly embedding inpainting and discrimination for surface anomaly detection	8
Correction: Real estate pricing prediction via textual and visual features	8
Shape description losses for medical image segmentation	8
A camera style-invariant learning and channel interaction enhancement fusion network for visible-infrared person re-identification	8
Multi-scale convolution underwater image restoration network	8
Integrating visual-semantic relational reasoning for fake news detection on video platforms	7
Real-time pedestrian pose estimation, tracking and localization for social distancing	7
Clarity method of fog and dust image in fully mechanized mining face	7
DisRot: boosting the generalization capability of few-shot learning via knowledge distillation and self-supervised learning	7
Chfnet: a coarse-to-fine hierarchical refinement model for monocular depth estimation	7
An efficient ground segmentation approach for LiDAR point cloud utilizing adjacent grids	7
Automatic cables segmentation from a substation device based on 3D point cloud	7
An anisotropic non-local attention network for image segmentation	7
X-Align++: cross-modal cross-view alignment for Bird’s-eye-view segmentation	7
Attention-based global context network for driving maneuvers prediction	7
Pattern recognition methodologies for pollen grain image classification: a survey	7
Sparse representation with enhanced nonlocal self-similarity for image denoising	7
Audio-visual localization based on spatial relative sound order	7
An adaptive interpolation and 3D reconstruction algorithm for underwater images	7
YG-SLAM: dynamic environment-based geometric constraint point-line fusion visual SLAM system	7
Environmental factors-aware two-stream GCN for skeleton-based behavior recognition	6
Distortion diminishing with vulnerability filters pruning	6
Semi-supervised metric learning incorporating weighted triplet constraint and Riemannian manifold optimization for classification	6
ConsInstancy: learning instance representations for semi-supervised panoptic segmentation of concrete aggregate particles	6
Guest editorial: special issue on human pose estimation and its applications	6
Actions as points: a simple and efficient detector for skeleton-based temporal action detection	6
Tensor-guided learning for image denoising using anisotropic PDEs	6
Deep-plane sweep generative adversarial network for consistent multi-view depth estimation	6
Identification of facial skin diseases from face phenotypes using FSDNet in uncontrolled environment	6
Tree-managed network ensembles for video prediction	6
A novel multi-feature fusion deep neural network using HOG and VGG-Face for facial expression classification	6
Boosting few-shot learning via selective patch embedding by comprehensive sample analysis	6
Saliency detection based on color descriptor and high-level prior	6
Mobgazenet: robust gaze estimation mobile network based on progressive attention mechanisms	6
Evolution algorithm of parametric active contour model based on Gaussian smoothing filter	6
On the safety of vulnerable road users by cyclist detection and tracking	6
Welding splash and arc noise reduction imaging model based on computationally efficient pairwise response serving welding process library	6
A dual-path U-Net for pulmonary vessel segmentation method based on lightweight 3D attention	6
Kinematic calibration of a hexapod robot based on monocular vision	6
A review of adaptable conventional image processing pipelines and deep learning on limited datasets	6
Parametric loss-based super-resolution for scene text recognition	6
Enhanced normal estimation of point clouds via fine-grained geometric information learning	6
Meta-learning enhanced global–local feature fusion for image quality assessment	6
Lesion-aware attention with neural support vector machine for retinopathy diagnosis	6
PTDS CenterTrack: pedestrian tracking in dense scenes with re-identification and feature enhancement	5
Multi-planar geometry and latent image recovery from a single motion-blurred image	5
TFF-temporal fusion framework for advancing video retrieval through long-range dependencies and multi-modal intent	5
Cascaded attention-guided multi-granularity feature learning for person re-identification	5
A collaborative SLAM method for dual payload-carrying UAVs in denied environments	5
Robust semantic segmentation method of urban scenes in snowy environment	5
Delaunay walk for fast nearest neighbor: accelerating correspondence matching for ICP	5
Regional filtering distillation for object detection	5
Quality assessment of synthetic images via spatial distortion recognition	5
Automatic high fidelity foot contact location and timing for elite sprinting	5
VGT-MOT: visibility-guided tracking for online multiple-object tracking	5
Text-to-face synthesis based on facial landmarks prediction	5
Residual shuffle attention network for image super-resolution	5
Multiple object tracking using weighted graph convolutional neural networks	5
Block-recurrent visual transformer for enhanced human detection in thermal imaging	5
FLAVR: flow-free architecture for fast video frame interpolation	4
Residual feature learning with hierarchical calibration for gaze estimation	4
Self-attention network for few-shot learning based on nearest-neighbor algorithm	4
React: recognize every action everywhere all at once	4
Naturally constrained reject option classification	4
Unsupervised single-shot depth estimation using perceptual reconstruction	4
Toward phytoplankton parasite detection using autoencoders	4
Mitigating adversarial perturbations via weakly supervised object location and regions recombination	4
Human pose estimation based on lightweight basicblock	4
YOLOMH: you only look once for multi-task driving perception with high efficiency	4
Personvit: large-scale self-supervised vision transformer for person re-identification	4
BiTransformer: augmenting semantic context in video captioning via bidirectional decoder	4
Removing cloud shadows from ground-based solar imagery	4
Symmetry-induced ambiguity in orientation estimation from RGB images	4
Optimized hand pose estimation CrossInfoNet-based architecture for embedded devices	4
Beyond Kalman filters: deep learning-based filters for improved object tracking	4
Local region-learning modules for point cloud classification	4
Swin transformer with part-level tokenization for occluded person re-identification	4
Gait recognition using free-area transformer networks	3
Entangled appearance and motion structures network for multi-object tracking and segmentation	3
Bidirectional cascaded multimodal attention for multiple choice visual question answering	3
Multi-person 3D pose estimation from unlabelled data	3
Consensus similarity learning based on tensor nuclear norm	3
Knowledge-based hybrid connectionist models for morphologic reasoning	3
Supervised contrastive learning with multi-scale interaction and integrity learning for salient object detection	3
FERGCN: facial expression recognition based on graph convolution network	3
ViCap-AD: video caption-based weakly supervised video anomaly detection	3
FESAR: SAR ship detection model based on local spatial relationship capture and fused convolutional enhancement	3
Enhanced keypoint information and pose-weighted re-ID features for multi-person pose estimation and tracking	3
Spatial-temporal graph-guided global attention network for video-based person re-identification	3
Real-time 3D reconstruction using point-dependent pose graph optimization framework	3
CCTV-Calib: a toolbox to calibrate surveillance cameras around the globe	3
Online camera auto-calibration appliable to road surveillance	3
Ising granularity image analysis on VAE–GAN	3
Utilizing incremental branches on a one-stage object detection framework to avoid catastrophic forgetting	3
A deep Retinex network for underwater low-light image enhancement	3
MYFED: a dataset of affective face videos for investigation of emotional facial dynamics as a soft biometric for person identification	3
Contextual Guided Segmentation Framework for Semi-supervised Video Instance Segmentation	3
CMNet: a novel model and design rationale based on comparison studies and synergy of CNN and MetaFormer	3
The general framework for few-shot learning by kernel HyperNetworks	3
Ssman: self-supervised masked adaptive network for 3D human pose estimation	3
A robust vehicle tracking in low-altitude UAV videos	3
Normalized margin loss for action unit detection	3
Vision-based power line cables and pylons detection for low flying aircraft	3
Depthwise grouped convolution for object detection	3
Superpixel-based foreground-preserving image stitching	3
Structure–texture decomposition-based dehazing of a single image with large sky area	3
Ipdm: identity preserving diffusion model for face sketch and photo synthesis	3
Addressing the generalization of 3D registration methods with a featureless baseline and an unbiased benchmark	3
Pixel representations, sampling, and label correction for semantic part detection	3
SNFR: salient neighbor decoding and text feature refining for scene text recognition	3
RCA-IUnet: a residual cross-spatial attention-guided inception U-Net model for tumor segmentation in breast ultrasound imaging	3
Multimodal dance style transfer	3
Unsupervised domain adaptation by cross-domain consistency learning for CT body composition	3
Wavelet and PCA-based glaucoma classification through novel methodological enhanced retinal images	3
Image dataset creation and networks improvement method based on CAD model and edge operator for object detection in the manufacturing industry	3
Hierarchical contrastive adaptation for cross-domain object detection	3
SiamCAR-Kal: anti-occlusion tracking algorithm for infrared ground targets based on SiamCAR and Kalman filter	3
An image quality assessment method based on edge extraction and singular value for blurriness	3
Motioninsights: real-time object tracking in streaming video	2
Few-shot object detection via data augmentation and distribution calibration	2
Interpretable visual transmission lines inspections using pseudo-prototypical part network	2
Generating quality grasp rectangle using Pix2Pix GAN for intelligent robot grasping	2
Trusted 3D self-supervised representation learning with cross-modal settings	2
Learning more discriminative local descriptors with parameter-free weighted attention for few-shot learning	2
Editor’s Note: Special Issue on Advances in Visual Computing 2023	2
Pothole segmentation and area estimation with thermal imaging using deep neural networks and unmanned aerial vehicles	2
Zero-shot action recognition by clustered representation with redundancy-free features	2
Group attention retention network for co-salient object detection	2
Foreground enhancement network for object detection in sonar images	2
Multi-scene low-light remote physiological measurement database	2
Squeezed fire binary segmentation model using convolutional neural network for outdoor images on embedded device	2
Accurate IoU computation for rotated bounding boxes in $\mathbb{R}^2$ and $\mathbb{R}^3$	2
Using synthesized facial views for active face recognition	2
Dynamically throttleable neural networks	2
Unsupervised learning of probabilistic subspaces for multi-spectral and multi-temporal image-based disaster mapping	2
Interpretability of fingerprint presentation attack detection systems: a look at the “representativeness” of samples against never-seen-before attacks	2
3D multi-object tracking based on parallel multimodal data association	2
Exploring filter placement in convolutional layer topologies based on ResNet for image classification	2
Multimodal fine-grained grocery product recognition using image and OCR text	2
Cmf-transformer: cross-modal fusion transformer for human action recognition	2
Active perception based on deep reinforcement learning for autonomous robotic damage inspection	2
A global activated feature pyramid network for tiny pest detection in the wild	2
Self-supervised monocular depth estimation via joint attention and intelligent mask loss	2
Virtual home staging and relighting from a single panorama under natural illumination	2
FPANet: Feature-enhanced position attention network for semantic segmentation	2
Region gradient-guided diffusion model for underwater image enhancement	2
A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation	2
Pixel-wise confidence estimation for segmentation in Bayesian Convolutional Neural Networks	2
Material classification of polishing and convex surface objects based on photon accumulation point spread function (PAPSF) from imaging model of binocular pulsed time-of-flight camera	2
Position Puzzle Network and Augmentation: localizing human keypoints beyond the bounding box	2
Human–object interaction detection based on disentangled axial attention transformer	2
An annotated image database of building facades categorized into land uses for object detection using deep learning	2
Efficient abnormality detection using patch-based 3D convolution with recurrent model	2
Multi-level receptive field feature reuse for multi-focus image fusion	2
SGBGAN: minority class image generation for class-imbalanced datasets	2
Discriminative feature learning through feature distance loss	2
Synergizing LiDAR and Augmented Reality for precise real-time interior distance measurements for mobile devices	2
That’s BAD: blind anomaly detection by implicit local feature clustering	2
Semantic scene upgrades for trajectory prediction	2
Wide-baseline multi-camera calibration from a room filled with people	2
Multi-core token mixer: a novel approach for underwater image enhancement	2
Rocnet: 3D robust registration of points clouds using deep learning	2
Synergetic proto-pull and reciprocal points for open set recognition	2
Similarity contrastive estimation for image and video soft contrastive self-supervised learning	2
Improving visual odometry pipeline with feedback from forward and backward motion estimates	2
Continuous sign language recognition based on motor attention mechanism and frame-level self-distillation	2
FDT − Dr2T: a unified Dense Radiology Report Generation Transformer framework for X-ray images	2
MFMANet: a multispectral pedestrian detection network using multi-resolution RGB feature reuse with multi-scale FIR attentions	2
IAFPN: interlayer enhancement and multilayer fusion network for object detection	1
Underwater image object detection based on multi-scale feature fusion	1
AP-TransNet: a polarized transformer based aerial human action recognition framework	1
Adversarial learning for unguided single depth map completion of indoor scenes	1
Projection model-driven image stitching: a novel warping method using epipolar displacement field	1
A three-level benchmark dataset for spatial and temporal forensic analysis of videos	1
Performance analysis of various deep learning models based on Max-Min CNN for lung nodule classification on CT images	1
Evaluation of data augmentation techniques on subjective tasks	1
SiamMMF: multi-modal multi-level fusion object tracking based on Siamese networks	1
RAU-Net: U-Net network based on residual multi-scale fusion and attention skip layer for overall spine segmentation	1
Simultaneous tracking of objects with loose context constraints from multiple views: human–human interaction paradigm	1
Integration of 2D iteration and a 3D CNN-based model for multi-type artifact suppression in C-arm cone-beam CT	1
WideCaps: a wide attention-based capsule network for image classification	1
Effective triplet mining improves training of multi-scale pooled CNN for image retrieval	1
A computer vision system for recognition and defect detection for reusable containers	1
Keyframe-based RGB-D dense visual SLAM fused semantic cues in dynamic scenes	1
BoostTrack: boosting the similarity measure and detection confidence for improved multiple object tracking	1
A transformer-based neural ODE for dense prediction	1
End-to-end optimized image compression with the frequency-oriented transform	1
Estimating human body orientation from image depth data and its implementation	1
Automated diagnosis of diverse coffee leaf images through a stage-wise aggregated triple deep convolutional neural network	1
Hyperspectral image dynamic range reconstruction using deep neural network-based denoising methods	1