Computational Visual Media

Papers
(The median citation count of Computational Visual Media is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-11-01 to 2024-11-01.)
ArticleCitations
Attention mechanisms in computer vision: A survey1033
PCT: Point cloud transformer927
PVT v2: Improved baselines with Pyramid Vision Transformer817
Visual attention network229
RGB-D salient object detection: A survey171
A survey of visual analytics techniques for machine learning147
Transformers in computational visual media: A survey80
High-quality indoor scene 3D reconstruction with RGB-D cameras: A brief review44
EfficientPose: Efficient human pose estimation with neural architecture search36
A survey on deep learning-based Monte Carlo denoising35
Light field salient object detection: A review and benchmark34
DualFace: Two-stage drawing guidance for freehand portrait sketching28
A survey of urban visual analytics: Advances and future directions28
Learning conditional photometric stereo with high-resolution features22
Deep image synthesis from intuitive user input: A review and perspectives20
Improved fuzzy clustering for image segmentation based on a low-rank prior18
Joint specular highlight detection and removal in single images via Unet-Transformer18
Inversion-free geometric mapping construction: A survey18
An end-to-end convolutional network for joint detecting and denoising adversarial perturbations in vehicle classification17
Scene text removal via cascaded text stroke detection and erasing16
Foveated rendering: A state-of-the-art survey16
Image smoothing based on global sparsity decomposition and a variable parameter16
Low and non-uniform illumination color image enhancement using weighted guided image filtering16
Image resizing by reconstruction from deep features15
Learning to assess visual aesthetics of food images14
Progressive edge-sensing dynamic scene deblurring14
Reference-guided structure-aware deep sketch colorization for cartoons14
Towards uniform point distribution in feature-preserving point cloud filtering13
Joint 3D facial shape reconstruction and texture completion from a single image12
D2ANet: Difference-aware attention network for multi-level change detection from satellite imagery12
Jittor-GAN: A fast-training generative adversarial network model zoo based on Jittor12
Facial optical flow estimation via neural non-rigid registration11
Multi-modal visual tracking: Review and experimental comparison11
ARM3D: Attention-based relation module for indoor 3D object detection11
Mask-aware photorealistic facial attribute manipulation11
Multi-scale joint feature network for micro-expression recognition10
Unsupervised random forest for affinity estimation10
Can attention enable MLPs to catch up with CNNs?10
ClusterSLAM: A SLAM backend for simultaneous rigid body clustering and motion estimation10
Self-supervised coarse-to-fine monocular depth estimation using a lightweight attention module9
WGI-Net: A weighted group integration network for RGB-D salient object detection9
Erroneous pixel prediction for semantic image segmentation9
Temporally consistent video colorization with deep feature propagation and self-regularization learning9
Semi-supervised 3D shape segmentation with multilevel consistency and part substitution8
Real-time per-pixel focusing method for light field rendering8
Point cloud completion via structured feature maps using a feedback network8
Trajectory distributions: A new description of movement for trajectory prediction8
Light field super-resolution using complementary-view feature attention7
Message from the Editor-in-Chief7
3D face recognition: A comprehensive survey in 20227
Joint regression and learning from pairwise rankings for personalized image aesthetic assessment7
NPRportrait 1.0: A three-level benchmark for non-photorealistic rendering of portraits7
Foundation models meet visualizations: Challenges and opportunities7
Neighborhood co-occurrence modeling in 3D point cloud segmentation7
Stroke-GAN Painter: Learning to paint artworks using stroke-style generative adversarial networks6
HDR-Net-Fusion: Real-time 3D dynamic scene reconstruction with a hierarchical deep reinforcement network6
Recent advances in glinty appearance rendering6
A survey of deep learning-based 3D shape generation6
Full-duplex strategy for video object segmentation6
A two-step surface-based 3D deep learning pipeline for segmentation of intracranial aneurysms6
Automatic object annotation in streamed and remotely explored large 3D reconstructions6
Image-guided color mapping for categorical data visualization6
Towards harmonized regional style transfer and manipulation for facial images6
Deep unfolding multi-scale regularizer network for image denoising5
Rectangling irregular videos by optimal spatio-temporal warping5
Sphere Face Model: A 3D morphable model with hypersphere manifold latent space using joint 2D/3D training5
SiamCPN: Visual tracking with the Siamese center-prediction network5
Inferring object properties from human interaction and transferring them to new motions5
A survey on rendering homogeneous participating media5
Flow-aware synthesis: A generic motion model for video frame interpolation5
Discriminative feature encoding for intrinsic image decomposition4
JNeRF: An efficient heterogeneous NeRF model zoo based on Jittor4
Recent advances in 3D Gaussian splatting4
AOGAN: A generative adversarial network for screen space ambient occlusion4
Specificity-preserving RGB-D saliency detection4
A Voronoi diagram approach for detecting defects in 3D printed fiber-reinforced polymers from microscope images4
An efficient algorithm for approximate Voronoi diagram construction on triangulated surfaces4
Sequential interactive image segmentation4
Towards natural object-based image recoloring4
6DOF pose estimation of a 3D rigid object based on edge-enhanced point pair features4
MusicFace: Music-driven expressive singing face synthesis4
An attention-embedded GAN for SVBRDF recovery from a single image3
A GAN-based temporally stable shading model for fast animation of photorealistic hair3
Pyramid-VAE-GAN: Transferring hierarchical latent variables for image inpainting3
Efficient fastest-path computations for road maps3
Simple primitive recognition via hierarchical face clustering3
Unsupervised image translation with distributional semantics awareness3
Non-dominated sorting based multi-page photo collage3
Active self-training for weakly supervised 3D scene semantic segmentation3
Rendering discrete participating media using geometrical optics approximation3
Let’s all dance: Enhancing amateur dance motions3
CF-DAN: Facial-expression recognition based on cross-fusion dual-attention network3
3D corrective nose reconstruction from a single image3
Joint self-supervised and reference-guided learning for depth inpainting2
Temporal and spatial anti-aliasing for rendering reflections on water waves2
Co-occurrence based texture synthesis2
A causal convolutional neural network for multi-subject motion modeling and generation2
Neural 3D reconstruction from sparse views using geometric priors2
Deep panoramic depth prediction and completion for indoor scenes2
Temporal scatterplots2
Shape correspondence for cel animation based on a shape association graph and spectral matching2
Improved image denoising via RAISR with fewer filters2
Robust and efficient edge-based visual odometry2
Imposing temporal consistency on deep monocular body shape and pose estimation2
BLNet: Bidirectional learning network for point clouds2
0.06444787979126