OOIR: Observatory of International Research

Papers

(The TQCC of Machine Vision and Applications is 4. The table below lists the papers that meet or exceed that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-08-01 to 2025-08-01.)

Article	Citations
End-to-end unsupervised learning of latent-space clustering for image segmentation via fully dense-UNet and fuzzy C-means loss	133
ECM: arbitrary style transfer via Enhanced-Channel Module	103
A method for high dynamic range 3D color modeling of objects through a color camera	61
Text-driven object affordance for guiding grasp-type recognition in multimodal robot teaching	58
Development of a robust cascaded architecture for intelligent robot grasping using limited labelled data	50
Class-aware cross-domain target detection based on cityscape in fog	34
Real estate pricing prediction via textual and visual features	32
A hybrid overlapping group sparsity denoising model with fractional-order total variation and non-convex regularizer	31
Triple attention and global reasoning Siamese networks for visual tracking	29
Obs-tackle: an obstacle detection system to assist navigation of visually impaired using smartphones	29
Medtransnet: advanced gating transformer network for medical image classification	28
DMU-Net: a dual stream multi-scale U-Net for image splicing forgery localization	26
Global-guided cross-reference network for co-salient object detection	25
Motion-region annotation for complex videos via label propagation across occluders	24
Enforced clustering for zero-to-one-shot texture anomaly detection	22
A stereo vision SLAM with moving vehicles tracking in outdoor environment	21
A motion direction detecting model for colored images based on the Hassenstein–Reichardt model	18
Innovative surface roughness detection method based on white light interference images	17
MSPKD: multi spatial projectors for knowledge distillation in semantic segmentation	17
A multi-modal framework for continuous and isolated hand gesture recognition utilizing movement epenthesis detection	16
Using breast density for hybrid region and pixel-level loss function	16
Specular Surface Detection with Deep Static Specular Flow and Highlight	14
Correction: Unsupervised single-shot depth estimation using perceptual reconstruction	14
Generation of realistic synthetic cable images to train deep learning segmentation models	14
Real-World super-resolution under the guidance of optimal transport	14
AFC-Net: adjacent feature complementary for crowded pedestrian detection	14
Axes-aligned non-linear optimized PnP algorithm	14
Modeling driving task-relevant attention for intelligent vehicles using triplet ranking	14
Discriminant distance template matching for image recognition	14
Generalized few-shot learning under large scope by using episode-wise regularizing imprinting	14
L-VAE: variational auto-encoder with learnable beta for disentangled representation	12
Alternate guidance network for boundary-aware camouflaged object detection	12
Two-stage structural information enhancement for source-free domain adaptation	11
CGA-Net: channel-wise gated attention network for improved super-resolution in remote sensing imagery	11
Kernel based local matching network for video object segmentation	11
Ubiquitous vision of transformers for person re-identification	11
CAMTrack: a combined appearance-motion method for multiple-object tracking	10
LDNet: low-light image enhancement with joint lighting and denoising	10
3D face parsing based on 2D CPFNet: conformal parameterized face parsing network	10
Twinned attention network for occlusion-aware facial expression recognition	10
Online continual learning with saliency-guided experience replay using tiny episodic memory	10
Camera-based mapping in search-and-rescue via flying and ground robot teams	10
Improving knowledge distillation via pseudo-multi-teacher network	10
RPIM-net: residual channel prior-driven interaction multi-scale network for stereo image deraining	10
A dual progressive strategy for long-tailed visual recognition	10
Traversing the subspace of adversarial patches	10
Generating comprehensive scene graphs with integrated multiple attribute detection	9
Adversarial imitation learning-based network for category-level 6D object pose estimation	9
Fusing bilinear multi-channel gated vector for fine-grained classification	9
GOA-net: generic occlusion aware networks for visual tracking	9
Cross-validation of a semantic segmentation network for natural history collection specimens	9
Novel Cauchy mixture modeling combined with the Sparse-RCNN architecture for enhanced multi-person pose estimation	9
Explainable interactive projections of images	9
EAF-Net: an enhancement and aggregation–feedback network for RGB-T salient object detection	9
Shape related unknown object one-shot learning grasping	8
OmniGlasses: an optical aid for stereo vision CNNs to enable omnidirectional image processing	8
Thin section analysis for ceramic petrography using motion analysis and segmentation techniques	8
MÆIDM: multi-scale anomaly embedding inpainting and discrimination for surface anomaly detection	8
Correction: Real estate pricing prediction via textual and visual features	8
Shape description losses for medical image segmentation	8
A camera style-invariant learning and channel interaction enhancement fusion network for visible-infrared person re-identification	8
Multi-scale convolution underwater image restoration network	8
IoU-aware feature fusion R-CNN for dense object detection	8
Pakistan sign language recognition: leveraging deep learning models with limited dataset	8
A comprehensive survey on SLAM and machine learning approaches for indoor autonomous navigation of mobile robots	8
An efficient ground segmentation approach for LiDAR point cloud utilizing adjacent grids	7
Automatic cables segmentation from a substation device based on 3D point cloud	7
An anisotropic non-local attention network for image segmentation	7
X-Align++: cross-modal cross-view alignment for Bird’s-eye-view segmentation	7
Attention-based global context network for driving maneuvers prediction	7
Pattern recognition methodologies for pollen grain image classification: a survey	7
Sparse representation with enhanced nonlocal self-similarity for image denoising	7
Audio-visual localization based on spatial relative sound order	7
An adaptive interpolation and 3D reconstruction algorithm for underwater images	7
YG-SLAM: dynamic environment-based geometric constraint point-line fusion visual SLAM system	7
Integrating visual-semantic relational reasoning for fake news detection on video platforms	7
Real-time pedestrian pose estimation, tracking and localization for social distancing	7
Clarity method of fog and dust image in fully mechanized mining face	7
DisRot: boosting the generalization capability of few-shot learning via knowledge distillation and self-supervised learning	7
Chfnet: a coarse-to-fine hierarchical refinement model for monocular depth estimation	7
Actions as points: a simple and efficient detector for skeleton-based temporal action detection	6
Tensor-guided learning for image denoising using anisotropic PDEs	6
Deep-plane sweep generative adversarial network for consistent multi-view depth estimation	6
Identification of facial skin diseases from face phenotypes using FSDNet in uncontrolled environment	6
A novel multi-feature fusion deep neural network using HOG and VGG-Face for facial expression classification	6
Tree-managed network ensembles for video prediction	6
Boosting few-shot learning via selective patch embedding by comprehensive sample analysis	6
Saliency detection based on color descriptor and high-level prior	6
Mobgazenet: robust gaze estimation mobile network based on progressive attention mechanisms	6
Evolution algorithm of parametric active contour model based on Gaussian smoothing filter	6
On the safety of vulnerable road users by cyclist detection and tracking	6
Welding splash and arc noise reduction imaging model based on computationally efficient pairwise response serving welding process library	6
Kinematic calibration of a hexapod robot based on monocular vision	6
A dual-path U-Net for pulmonary vessel segmentation method based on lightweight 3D attention	6
A review of adaptable conventional image processing pipelines and deep learning on limited datasets	6
Parametric loss-based super-resolution for scene text recognition	6
Enhanced normal estimation of point clouds via fine-grained geometric information learning	6
Meta-learning enhanced global–local feature fusion for image quality assessment	6
Lesion-aware attention with neural support vector machine for retinopathy diagnosis	6
Environmental factors-aware two-stream GCN for skeleton-based behavior recognition	6
Distortion diminishing with vulnerability filters pruning	6
ConsInstancy: learning instance representations for semi-supervised panoptic segmentation of concrete aggregate particles	6
Semi-supervised metric learning incorporating weighted triplet constraint and Riemannian manifold optimization for classification	6
Guest editorial: special issue on human pose estimation and its applications	6
Cascaded attention-guided multi-granularity feature learning for person re-identification	5
A collaborative SLAM method for dual payload-carrying UAVs in denied environments	5
Robust semantic segmentation method of urban scenes in snowy environment	5
Delaunay walk for fast nearest neighbor: accelerating correspondence matching for ICP	5
Regional filtering distillation for object detection	5
Quality assessment of synthetic images via spatial distortion recognition	5
Automatic high fidelity foot contact location and timing for elite sprinting	5
VGT-MOT: visibility-guided tracking for online multiple-object tracking	5
Text-to-face synthesis based on facial landmarks prediction	5
Residual shuffle attention network for image super-resolution	5
Multiple object tracking using weighted graph convolutional neural networks	5
Block-recurrent visual transformer for enhanced human detection in thermal imaging	5
PTDS CenterTrack: pedestrian tracking in dense scenes with re-identification and feature enhancement	5
Multi-planar geometry and latent image recovery from a single motion-blurred image	5
TFF-temporal fusion framework for advancing video retrieval through long-range dependencies and multi-modal intent	5
React: recognize every action everywhere all at once	4
Naturally constrained reject option classification	4
Unsupervised single-shot depth estimation using perceptual reconstruction	4
Toward phytoplankton parasite detection using autoencoders	4
Mitigating adversarial perturbations via weakly supervised object location and regions recombination	4
Human pose estimation based on lightweight basicblock	4
YOLOMH: you only look once for multi-task driving perception with high efficiency	4
Personvit: large-scale self-supervised vision transformer for person re-identification	4
BiTransformer: augmenting semantic context in video captioning via bidirectional decoder	4
Removing cloud shadows from ground-based solar imagery	4
Symmetry-induced ambiguity in orientation estimation from RGB images	4
Optimized hand pose estimation CrossInfoNet-based architecture for embedded devices	4
Beyond Kalman filters: deep learning-based filters for improved object tracking	4
Local region-learning modules for point cloud classification	4
Swin transformer with part-level tokenization for occluded person re-identification	4
FLAVR: flow-free architecture for fast video frame interpolation	4
Residual feature learning with hierarchical calibration for gaze estimation	4
Self-attention network for few-shot learning based on nearest-neighbor algorithm	4