Computational Visual Media

Papers
(The median citation count of Computational Visual Media is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-02-01 to 2025-02-01.)
ArticleCitations
PVT v2: Improved baselines with Pyramid Vision Transformer1174
Attention mechanisms in computer vision: A survey1034
Robust and efficient edge-based visual odometry950
Flow-aware synthesis: A generic motion model for video frame interpolation313
Transformers in computational visual media: A survey85
Deep image synthesis from intuitive user input: A review and perspectives48
RecStitchNet: Learning to stitch images with rectangular boundaries40
Improved fuzzy clustering for image segmentation based on a low-rank prior36
Efficient fastest-path computations for road maps36
Message from the Editor-in-Chief30
Towards harmonized regional style transfer and manipulation for facial images29
Temporal and spatial anti-aliasing for rendering reflections on water waves24
APF-GAN: Exploring asymmetric pre-training and fine-tuning strategy for conditional generative adversarial network21
Neural 3D reconstruction from sparse views using geometric priors20
Improved image denoising via RAISR with fewer filters20
Multi-task learning and joint refinement between camera localization and object detection19
Geometry-aware 3D pose transfer using transformer autoencoder17
Continual few-shot patch-based learning for anime-style colorization16
Illuminator: Image-based illumination editing for indoor scene harmonization16
EFECL: Feature encoding enhancement with contrastive learning for indoor 3D object detection16
Rendering discrete participating media using geometrical optics approximation15
Deep panoramic depth prediction and completion for indoor scenes14
A Voronoi diagram approach for detecting defects in 3D printed fiber-reinforced polymers from microscope images14
3D face recognition: A comprehensive survey in 202213
DualSmoke: Sketch-based smoke illustration design with two-stage generative model13
Physics-based fluid simulation in computer graphics: Survey, research trends, and challenges13
[Front Cover]12
Delving into high-quality SVBRDF acquisition: A new setup and method12
Class-conditional domain adaptation for semantic segmentation12
Adaptive sampling and reconstruction for gradient-domain rendering12
3D corrective nose reconstruction from a single image11
Towards natural object-based image recoloring11
A two-step surface-based 3D deep learning pipeline for segmentation of intracranial aneurysms11
Neighborhood co-occurrence modeling in 3D point cloud segmentation11
Autocompletion of repetitive stroking with image guidance10
Joint self-supervised and reference-guided learning for depth inpainting10
JNeRF: An efficient heterogeneous NeRF model zoo based on Jittor9
Controllable multi-domain semantic artwork synthesis9
Can attention enable MLPs to catch up with CNNs?9
NPRportrait 1.0: A three-level benchmark for non-photorealistic rendering of portraits9
AdaPIP: Adaptive picture-in-picture guidance for 360° film watching8
Mask-aware photorealistic facial attribute manipulation8
Message from the Editor-in-Chief8
Recent advances in glinty appearance rendering8
Low and non-uniform illumination color image enhancement using weighted guided image filtering8
SAM-driven MAE pre-training and background-aware meta-learning for unsupervised vehicle re-identification7
Automatic location and semantic labeling of landmarks on 3D human body models7
Polygonal finite element-based content-aware image warping7
Multi-scale hash encoding based neural geometry representation7
DeepFaceReshaping: Interactive deep face reshaping via landmark manipulation7
Dance2MIDI: Dance-driven multi-instrument music generation7
Noise4Denoise: Leveraging noise for unsupervised point cloud denoising7
Discriminative feature encoding for intrinsic image decomposition6
Specificity-preserving RGB-D saliency detection6
HoLens: A visual analytics design for higher-order movement modeling and visualization6
SGformer: Boosting transformers for indoor lighting estimation from a single image6
Shape embedding and retrieval in multi-flow deformation6
Learning physically based material and lighting decompositions for face editing6
PCT: Point cloud transformer5
CLIP-Flow: Decoding images encoded in CLIP space5
Learning accurate template matching with differentiable coarse-to-fine correspondence refinement5
Message from the Editor-in-Chief5
EfficientPose: Efficient human pose estimation with neural architecture search5
3D hand pose and shape estimation from monocular RGB via efficient 2D cues5
Learning conditional photometric stereo with high-resolution features5
Hierarchical vectorization for facial images5
Dynamic ocean inverse modeling based on differentiable rendering4
Full-duplex strategy for video object segmentation4
Learning layout generation for virtual worlds4
Temporal vectorized visibility for direct illumination of animated models4
Practical construction of globally injective parameterizations with positional constraints4
Multi-granularity sequence generation for hierarchical image classification4
Central similarity consistency hashing for asymmetric image retrieval4
A causal convolutional neural network for multi-subject motion modeling and generation3
Symmetrization of quasi-regular patterns with periodic tilting of regular polygons3
Image-guided color mapping for categorical data visualization3
Bin-scanning: Segmentation of X-ray CT volume of binned parts using Morse skeleton graph of distance transform3
AR assistance for efficient dynamic target search3
CF-DAN: Facial-expression recognition based on cross-fusion dual-attention network3
Towards robustness and generalization of point cloud representation: A geometry coding method and a large-scale object-level dataset3
Semi-supervised 3D shape segmentation with multilevel consistency and part substitution3
Point cloud completion via structured feature maps using a feedback network3
Shape correspondence for cel animation based on a shape association graph and spectral matching3
TrafPS: A shapley-based visual analytics approach to interpret traffic3
CLIP-SP: Vision-language model with adaptive prompting for scene parsing3
Sequential interactive image segmentation3
Multi-modal visual tracking: Review and experimental comparison3
SiamCPN: Visual tracking with the Siamese center-prediction network2
ARM3D: Attention-based relation module for indoor 3D object detection2
Real-time face view correction for front-facing cameras2
Temporally consistent video colorization with deep feature propagation and self-regularization learning2
PMSSC: Parallelizable multi-subset based self-expressive model for subspace clustering2
Message from the Best Paper Award Committee2
Pyramid-VAE-GAN: Transferring hierarchical latent variables for image inpainting2
Active self-training for weakly supervised 3D scene semantic segmentation2
CTSN: Predicting cloth deformation for skeleton-based characters with a two-stream skinning network2
High-quality indoor scene 3D reconstruction with RGB-D cameras: A brief review2
See clearly on rainy days: Hybrid multiscale loss guided multi-feature fusion network for single image rain removal2
Rectangling irregular videos by optimal spatio-temporal warping2
Non-dominated sorting based multi-page photo collage2
An efficient algorithm for approximate Voronoi diagram construction on triangulated surfaces2
High fidelity virtual try-on network via semantic adaptation and distributed componentization2
0.050576210021973