Computational Visual Media

Papers
(The median citation count of Computational Visual Media is 3. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2020-03-01 to 2024-03-01.)
Article | Citations
PCT: Point cloud transformer | 665
Attention mechanisms in computer vision: A survey | 654
PVT v2: Improved baselines with Pyramid Vision Transformer | 493
RGB-D salient object detection: A survey | 153
A survey of visual analytics techniques for machine learning | 118
Visual attention network | 90
A survey on deep geometry learning: From a representation perspective | 61
Transformers in computational visual media: A survey | 59
VR content creation and exploration with deep learning: A survey | 45
A survey of recent interactive image segmentation methods | 39
View planning in robot active vision: A survey of systems, algorithms, and applications | 37
High-quality indoor scene 3D reconstruction with RGB-D cameras: A brief review | 31
A survey on deep learning-based Monte Carlo denoising | 29
EfficientPose: Efficient human pose estimation with neural architecture search | 24
Light field salient object detection: A review and benchmark | 23
Saliency-based image correction for colorblind patients | 21
DualFace: Two-stage drawing guidance for freehand portrait sketching | 21
Learning conditional photometric stereo with high-resolution features | 19
iSeeBetter: Spatio-temporal video super-resolution using recurrent generative back-projection networks | 18
S4Net: Single stage salient-instance segmentation | 17
A new dataset of dog breed images and a benchmark for finegrained classification | 17
Detecting human-object interaction with multi-level pairwise feature network | 16
Inversion-free geometric mapping construction: A survey | 16
Deep image synthesis from intuitive user input: A review and perspectives | 16
Improved fuzzy clustering for image segmentation based on a low-rank prior | 15
Kernel-blending connection approximated by a neural network for image classification | 15
Scene text removal via cascaded text stroke detection and erasing | 15
Image resizing by reconstruction from deep features | 15
Progressive edge-sensing dynamic scene deblurring | 14
An end-to-end convolutional network for joint detecting and denoising adversarial perturbations in vehicle classification | 14
Image smoothing based on global sparsity decomposition and a variable parameter | 14
A survey of urban visual analytics: Advances and future directions | 13
Learning to assess visual aesthetics of food images | 12
Learning local shape descriptors for computing non-rigid dense correspondence | 12
Low and non-uniform illumination color image enhancement using weighted guided image filtering | 11
What and where: A context-based recommendation system for object insertion | 10
Jittor-GAN: A fast-training generative adversarial network model zoo based on Jittor | 10
Joint 3D facial shape reconstruction and texture completion from a single image | 9
Joint specular highlight detection and removal in single images via Unet-Transformer | 9
ClusterSLAM: A SLAM backend for simultaneous rigid body clustering and motion estimation | 8
WGI-Net: A weighted group integration network for RGB-D salient object detection | 8
Foveated rendering: A state-of-the-art survey | 8
Can attention enable MLPs to catch up with CNNs? | 8
Reference-guided structure-aware deep sketch colorization for cartoons | 8
Multi-scale joint feature network for micro-expression recognition | 8
A detail preserving neural network model for Monte Carlo denoising | 8
Multispectral image denoising using sparse and graph Laplacian Tucker decomposition | 7
JMNet: A joint matting network for automatic human matting | 7
Self-supervised coarse-to-fine monocular depth estimation using a lightweight attention module | 7
Trajectory distributions: A new description of movement for trajectory prediction | 7
Mask-aware photorealistic facial attribute manipulation | 7
Unsupervised random forest for affinity estimation | 7
Message from the Editor-in-Chief | 7
Joint regression and learning from pairwise rankings for personalized image aesthetic assessment | 6
Machine learning for digital try-on: Challenges and progress | 6
Light field super-resolution using complementary-view feature attention | 6
WaterNet: An adaptive matching pipeline for segmenting water with volatile appearance | 6
3D hypothesis clustering for cross-view matching in multi-person motion capture | 6
Deep unfolding multi-scale regularizer network for image denoising | 5
A practical path guiding method for participating media | 5
Neighborhood co-occurrence modeling in 3D point cloud segmentation | 5
Real-time per-pixel focusing method for light field rendering | 5
Towards uniform point distribution in feature-preserving point cloud filtering | 5
Inferring object properties from human interaction and transferring them to new motions | 5
Computing knots by quadratic and cubic polynomial curves | 5
Point cloud completion via structured feature maps using a feedback network | 5
Efficient fall activity recognition by combining shape and motion features | 4
Temporally consistent video colorization with deep feature propagation and self-regularization learning | 4
HDR-Net-Fusion: Real-time 3D dynamic scene reconstruction with a hierarchical deep reinforcement network | 4
SiamCPN: Visual tracking with the Siamese center-prediction network | 4
3D face recognition: A comprehensive survey in 2022 | 4
D2ANet: Difference-aware attention network for multi-level change detection from satellite imagery | 4
Automatic object annotation in streamed and remotely explored large 3D reconstructions | 4
ARM3D: Attention-based relation module for indoor 3D object detection | 4
Weight asynchronous update: Improving the diversity of filters in a deep convolutional network | 4
Flow-aware synthesis: A generic motion model for video frame interpolation | 4
A survey on rendering homogeneous participating media | 4
Erroneous pixel prediction for semantic image segmentation | 4
AOGAN: A generative adversarial network for screen space ambient occlusion | 4
Semi-supervised 3D shape segmentation with multilevel consistency and part substitution | 4
NPRportrait 1.0: A three-level benchmark for non-photorealistic rendering of portraits | 4
Full-duplex strategy for video object segmentation | 3
Towards natural object-based image recoloring | 3
Rendering discrete participating media using geometrical optics approximation | 3
A survey of deep learning-based 3D shape generation | 3
Coherent video generation for multiple hand-held cameras with dynamic foreground | 3
Rectangling irregular videos by optimal spatio-temporal warping | 3
Pyramid-VAE-GAN: Transferring hierarchical latent variables for image inpainting | 3
3D corrective nose reconstruction from a single image | 3
Efficient fastest-path computations for road maps | 3
Simple primitive recognition via hierarchical face clustering | 3
Non-dominated sorting based multi-page photo collage | 3
Designing planar cubic B-spline curves with monotonic curvature for curve interpolation | 3
Visual exploration of Internet news via sentiment score and topic models | 3
Recent advances in glinty appearance rendering | 3
Towards harmonized regional style transfer and manipulation for facial images | 3
A GAN-based temporally stable shading model for fast animation of photorealistic hair | 3
MusicFace: Music-driven expressive singing face synthesis | 3
Image-guided color mapping for categorical data visualization | 3