ACM Transactions on Multimedia Computing Communications and Applicatio

Papers
(The TQCC of ACM Transactions on Multimedia Computing Communications and Applicatio is 6. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-04-01 to 2024-04-01.)
ArticleCitations
Dual-path Convolutional Image-Text Embeddings with Instance Loss237
Depth Image Denoising Using Nuclear Norm and Learning Graph Model144
DenseNet-201-Based Deep Neural Network with Composite Learning Factor and Precomputation for Multiple Sclerosis Classification107
Understanding and Creating Art with AI: Review and Outlook102
Learning Adaptive Spatial-Temporal Context-Aware Correlation Filters for UAV Tracking92
Age-Invariant Face Recognition by Multi-Feature Fusionand Decomposition with Self-attention84
Chinese Image Captioning via Fuzzy Attention-based DenseNet-BiLSTM79
A Fast Defogging Image Recognition Algorithm Based on Bilateral Hybrid Filtering77
Securing Multimedia by Using DNA-Based Encryption in the Cloud Computing Environment72
Precise No-Reference Image Quality Evaluation Based on Distortion Identification71
TripRes67
Fine-Grained Visual Textual Alignment for Cross-Modal Retrieval Using Transformer Encoders66
A Weakly Supervised Semantic Segmentation Network by Aggregating Seed Cues: The Multi-Object Proposal Generation Perspective57
Survey on Deep Multi-modal Data Analytics: Collaboration, Rivalry, and Fusion55
Deep Learning-based Smart Predictive Evaluation for Interactive Multimedia-enabled Smart Healthcare54
Smart City Construction and Management by Digital Twins and BIM Big Data in COVID-19 Scenario52
Compatibility-Aware Web API Recommendation for Mashup Creation via Textual Description Mining49
Conditional LSTM-GAN for Melody Generation from Lyrics49
Attention-Based Modality-Gated Networks for Image-Text Sentiment Analysis46
Data Hiding43
Trust Mechanism of Feedback Trust Weight in Multimedia Network43
Perceptual Quality Assessment of Low-light Image Enhancement41
A Survey on Healthcare Data: A Security Perspective41
Knowledge-aware Multi-modal Adaptive Graph Convolutional Networks for Fake News Detection40
Is the Reign of Interactive Search Eternal? Findings from the Video Browser Showdown 202039
Wavelength-based Attributed Deep Neural Network for Underwater Image Restoration38
Towards Integrating Image Encryption with Compression: A Survey36
A Multimodal, Multimedia Point-of-Care Deep Learning Framework for COVID-19 Diagnosis36
Recurrent Attention Network with Reinforced Generator for Visual Dialog35
Blind Image Quality Assessment by Natural Scene Statistics and Perceptual Characteristics35
Cross-modal Graph Matching Network for Image-text Retrieval34
Uncertainty-Aware Semi-Supervised Method Using Large Unlabeled and Limited Labeled COVID-19 Data33
Music2Dance: DanceNet for Music-Driven Dance Generation32
Local Correlation Ensemble with GCN Based on Attention Features for Cross-domain Person Re-ID32
Multitarget Tracking Using Siamese Neural Networks32
EGroupNet31
Human Activity Recognition from Multiple Sensors Data Using Multi-fusion Representations and CNNs31
An Effective Forest Fire Detection Framework Using Heterogeneous Wireless Multimedia Sensor Networks31
Security and Privacy of Patient Information in Medical Systems Based on Blockchain Technology30
A Benchmark Dataset and Comparison Study for Multi-modal Human Action Analytics30
Region-Level Visual Consistency Verification for Large-Scale Partial-Duplicate Image Search29
Fine-Grained Visual Computing Based on Deep Learning29
Integrating Scene Semantic Knowledge into Image Captioning28
Deep Triplet Neural Networks with Cluster-CCA for Audio-Visual Cross-Modal Retrieval28
Automatic Assessment of Depression and Anxiety through Encoding Pupil-wave from HCI in VR Scenes28
Part-wise Spatio-temporal Attention Driven CNN-based 3D Human Action Recognition28
Market2Dish: Health-aware Food Recommendation27
Constrained LSTM and Residual Attention for Image Captioning26
Privacy-preserving Decentralized Learning Framework for Healthcare System26
Few-shot Food Recognition via Multi-view Representation Learning25
LogoDet-3K: A Large-scale Image Dataset for Logo Detection25
Scenario-Aware Recurrent Transformer for Goal-Directed Video Captioning24
SDN-Assisted DDoS Defense Framework for the Internet of Multimedia Things24
UID2021: An Underwater Image Dataset for Evaluation of No-Reference Quality Assessment Metrics23
Analysis of the Security of Internet of Multimedia Things23
Label Consistent Flexible Matrix Factorization Hashing for Efficient Cross-modal Retrieval22
Zero-shot Cross-modal Retrieval by Assembling AutoEncoder and Generative Adversarial Network22
An LSH-based Offloading Method for IoMT Services in Integrated Cloud-Edge Environment22
Multi-Tier CloudVR22
Global-Local Enhancement Network for NMF-Aware Sign Language Recognition21
Privacy Protection for Medical Data Sharing in Smart Healthcare21
HCMSL: Hybrid Cross-modal Similarity Learning for Cross-modal Retrieval21
A Multi-agent Feature Selection and Hybrid Classification Model for Parkinson's Disease Diagnosis20
Point Cloud Quality Assessment: Dataset Construction and Learning-based No-reference Metric20
A Review on Methods and Applications in Multimodal Deep Learning20
Bottom-up and Layerwise Domain Adaptation for Pedestrian Detection in Thermal Images20
eDiaPredict: An Ensemble-based Framework for Diabetes Prediction19
Cross-Modal Hybrid Feature Fusion for Image-Sentence Matching19
Lightweight Multi-party Authentication and Key Agreement Protocol in IoT-based E-Healthcare Service19
xCos: An Explainable Cosine Metric for Face Verification Task19
Bi-Directional Co-Attention Network for Image Captioning19
Shuffled ImageNet Banks for Video Event Detection and Search19
Spherical Convolution Empowered Viewport Prediction in 360 Video Multicast with Limited FoV Feedback19
Exploring Image Enhancement for Salient Object Detection in Low Light Images19
Pinball Loss Twin Support Vector Clustering18
Explanation-Driven HCI Model to Examine the Mini-Mental State for Alzheimer’s Disease18
A Semi-supervised Learning Approach Based on Adaptive Weighted Fusion for Automatic Image Annotation18
An Adaptive Two-Layer Light Field Compression Scheme Using GNN-Based Reconstruction18
3D Tooth Instance Segmentation Learning Objectness and Affinity in Point Cloud18
An Explainable Deep Learning Ensemble Model for Robust Diagnosis of Diabetic Retinopathy Grading18
Attribute-wise Explainable Fashion Compatibility Modeling17
QoE-Fair DASH Video Streaming Using Server-side Reinforcement Learning17
A Deep Multi-level Attentive Network for Multimodal Sentiment Analysis17
A Multimodal Framework for Large-Scale Emotion Recognition by Fusing Music and Electrodermal Activity Signals17
Do Users Behave Similarly in VR? Investigation of the User Influence on the System Design17
A Novel ( t , s , k , n )-Threshold Visual Secret Sharing Scheme Based on Ac16
Spatio-temporal Saliency-based Motion Vector Refinement for Frame Rate Up-conversion16
Medical Image Classification based on an Adaptive Size Deep Learning Model16
A Sorting Fuzzy Min-Max Model in an Embedded System for Atrial Fibrillation Detection16
Disentangling Features for Fashion Recommendation16
Robust Secret Image Sharing Resistant to Noise in Shares16
A DNA Based Colour Image Encryption Scheme Using A Convolutional Autoencoder15
Correlation Discrepancy Insight Network for Video Re-identification15
Controlling Neural Learning Network with Multiple Scales for Image Splicing Forgery Detection15
Deepfake Video Detection via Predictive Representation Learning15
Explainable AI: A Multispectral Palm-Vein Identification System with New Augmentation Features15
ECCNAS: Efficient Crowd Counting Neural Architecture Search14
Listen, Look, and Find the One14
Fog-based Secure Service Discovery for Internet of Multimedia Things14
Kernel Attention Network for Single Image Super-Resolution14
Fully Unsupervised Person Re-Identification via Selective Contrastive Learning14
Decoupled Low-Light Image Enhancement14
Moment is Important: Language-Based Video Moment Retrieval via Adversarial Learning14
Requet14
Part-based Structured Representation Learning for Person Re-identification14
Fine-grained Image Classification via Multi-scale Selective Hierarchical Biquadratic Pooling14
Knowledge-driven Egocentric Multimodal Activity Recognition14
Sketch-guided Deep Portrait Generation13
Performance Analysis of ACTE13
Clustering Matters: Sphere Feature for Fully Unsupervised Person Re-identification13
Double Attention Based on Graph Attention Network for Image Multi-Label Classification13
Detection of AI-Manipulated Fake Faces via Mining Generalized Features13
A Novel Multi-Sample Generation Method for Adversarial Attacks13
RD-IOD: Two-Level Residual-Distillation-Based Triple-Network for Incremental Object Detection13
Deep Semantic and Attentive Network for Unsupervised Video Summarization12
A Comprehensive Study of Deep Learning-based Covert Communication12
Image Captioning with a Joint Attention Mechanism by Visual Concept Samples12
Entropy Slicing Extraction and Transfer Learning Classification for Early Diagnosis of Alzheimer Diseases with sMRI12
Meta-path Augmented Sequential Recommendation with Contextual Co-attention Network12
Gaussian Mixture Model Clustering with Incomplete Data12
Introduction to the Special Issue on Recent Trends in Medical Data Security for e-Health Applications12
Learning to Fool the Speaker Recognition12
Smart Director: An Event-Driven Directing System for Live Broadcasting12
A Densely Connected Network Based on U-Net for Medical Image Segmentation11
Revisiting Local Descriptor for Improved Few-Shot Classification11
JoT-GAN: A Framework for Jointly Training GAN and Person Re-Identification Model11
Deep Illumination-Enhanced Face Super-Resolution Network for Low-Light Images11
EiMOL: A Secure Medical Image Encryption Algorithm based on Optimization and the Lorenz System11
GuessUNeed11
SDN Enabled QoE and Security Framework for Multimedia Applications in 5G Networks11
Deep Self-Supervised Hyperspectral Image Reconstruction11
Towards Accurate Oriented Object Detection in Aerial Images with Adaptive Multi-level Feature Fusion11
Dynamic Graph Learning Convolutional Networks for Semi-supervised Classification11
Shuffle-invariant Network for Action Recognition in Videos11
MV2Flow11
Skeleton Sequence and RGB Frame Based Multi-Modality Feature Fusion Network for Action Recognition11
Multi-human Parsing with a Graph-based Generative Adversarial Model11
Distribution Aligned Multimodal and Multi-domain Image Stylization11
Causal Inference with Knowledge Distilling and Curriculum Learning for Unbiased VQA11
Less Is More: Learning from Synthetic Data with Fine-Grained Attributes for Person Re-Identification10
Hierarchical Multi-Attention Transfer for Knowledge Distillation10
An Explainable Framework for Diagnosis of COVID-19 Pneumonia via Transfer Learning and Discriminant Correlation Analysis10
Answer Questions with Right Image Regions: A Visual Attention Regularization Approach10
A Format-compatible Searchable Encryption Scheme for JPEG Images Using Bag-of-words10
Full-reference Screen Content Image Quality Assessment by Fusing Multilevel Structure Similarity10
3D Facial Similarity Measurement and Its Application in Facial Organization10
A Novel GAPG Approach to Automatic Property Generation for Formal Verification: The GAN Perspective10
Exploring Relations in Untrimmed Videos for Self-Supervised Learning10
Secure Chaff-less Fuzzy Vault for Face Identification Systems10
Binary Representation via Jointly Personalized Sparse Hashing10
Blockchain-Based Audio Watermarking Technique for Multimedia Copyright Protection in Distribution Networks10
Single-shot Semantic Matching Network for Moment Localization in Videos10
Transform, Warp, and Dress: A New Transformation-guided Model for Virtual Try-on10
Output-Bounded and RBFNN-Based Position Tracking and Adaptive Force Control for Security Tele-Surgery10
Egocentric Early Action Prediction via Adversarial Knowledge Distillation10
Dilated Convolution-based Feature Refinement Network for Crowd Localization10
Am I Done? Predicting Action Progress in Videos9
Motion-Aware Structured Matrix Factorization for Foreground Detection in Complex Scenes9
Where Are They Going? Predicting Human Behaviors in Crowded Scenes9
Spatio-Temporal Deep Residual Network with Hierarchical Attentions for Video Event Recognition9
Generative Metric Learning for Adversarially Robust Open-world Person Re-Identification9
RDH-DES: Reversible Data Hiding over Distributed Encrypted-Image Servers Based on Secret Sharing9
Deep Q Network–Driven Task Offloading for Efficient Multimedia Data Analysis in Edge Computing–Assisted IoV9
SADnet: Semi-supervised Single Image Dehazing Method Based on an Attention Mechanism9
Harmonious Multi-branch Network for Person Re-identification with Harder Triplet Loss9
Deep Convolutional Pooling Transformer for Deepfake Detection9
Deep Unsupervised Key Frame Extraction for Efficient Video Classification9
Smart Diagnosis9
GreyReID: A Novel Two-stream Deep Framework with RGB-grey Information for Person Re-identification9
Self-supervised Calorie-aware Heterogeneous Graph Networks for Food Recommendation9
Structure-aware Meta-fusion for Image Super-resolution9
Visual Semantic-Based Representation Learning Using Deep CNNs for Scene Recognition9
ART-UP: A Novel Method for Generating Scanning-Robust Aesthetic QR Codes9
MILL: Channel Attention–based Deep Multiple Instance Learning for Landslide Recognition8
Voice-Face Homogeneity Tells Deepfake8
Rectified Meta-learning from Noisy Labels for Robust Image-based Plant Disease Classification8
Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based Action Recognition8
PPNet8
Deep Uncoupled Discrete Hashing via Similarity Matrix Decomposition8
A Security and Privacy Validation Methodology for e-Health Systems8
iDAM: Iteratively Trained Deep In-loop Filter with Adaptive Model Selection8
Hyper-node Relational Graph Attention Network for Multi-modal Knowledge Graph Completion8
A Unified Tensor Framework for Clustering and Simultaneous Reconstruction of Incomplete Imaging Data8
FIN8
Differentially Private Tensor Train Deep Computation for Internet of Multimedia Things8
Hybrid Modality Metric Learning for Visible-Infrared Person Re-Identification8
Scribble-Supervised Meibomian Glands Segmentation in Infrared Images8
Generation of Realistic Synthetic Financial Time-series8
FasterPose: A Faster Simple Baseline for Human Pose Estimation8
MMFN8
Introduction to the Special Issue on Trustworthy Multimedia Computing and Applications in Urban Scenes7
SAMAF7
Mimicking Individual Media Quality Perception with Neural Network based Artificial Observers7
Optimizing Performance of Federated Person Re-identification: Benchmarking and Analysis7
Adaptive Attention-based High-level Semantic Introduction for Image Caption7
Multi-feature Fusion VoteNet for 3D Object Detection7
Efficient Light Field Image Compression with Enhanced Random Access7
Rank-in-Rank Loss for Person Re-identification7
From Coarse to Fine: Hierarchical Structure-aware Video Summarization7
Tell, Imagine, and Search: End-to-end Learning for Composing Text and Image to Image Retrieval7
Spatial-temporal Regularized Multi-modality Correlation Filters for Tracking with Re-detection7
Toward Intelligent Fashion Design: A Texture and Shape Disentangled Generative Adversarial Network7
Sensor-based Human Activity Recognition Using Graph LSTM and Multi-task Classification Model7
Adversarial Multi-Grained Embedding Network for Cross-Modal Text-Video Retrieval7
HCNCT: A Cross-chain Interaction Scheme for the Blockchain-based Metaverse7
Interactive Search vs. Automatic Search7
Perceptual Image Compression with Block-Level Just Noticeable Difference Prediction7
An l ½ and Graph Regularized Subspace Clustering Method for Robust Image Segmentation7
Exploiting Attention-Consistency Loss For Spatial-Temporal Stream Action Recognition7
Synthesising Privacy by Design Knowledge Toward Explainable Internet of Things Application Designing in Healthcare7
Alignment Enhancement Network for Fine-grained Visual Categorization7
MMSUM Digital Twins: A Multi-view Multi-modality Summarization Framework for Sporting Events7
Image to Modern Chinese Poetry Creation via a Constrained Topic-aware Model7
Interactive Re-ranking via Object Entropy-Guided Question Answering for Cross-Modal Image Retrieval6
A Convolutional Neural Network Model Using Weighted Loss Function to Detect Diabetic Retinopathy6
NR-CNN: Nested-Residual Guided CNN In-loop Filtering for Video Coding6
Hypomimia Recognition in Parkinson’s Disease With Semantic Features6
A Survey on Temporal Sentence Grounding in Videos6
MIS: A Multi-Identifier Management and Resolution System in the Metaverse6
Learning Transferable Perturbations for Image Captioning6
A Fast View Synthesis Implementation Method for Light Field Applications6
Multi-granularity Brushstrokes Network for Universal Style Transfer6
JDAN: Joint Detection and Association Network for Real-Time Online Multi-Object Tracking6
Urban Perception: Sensing Cities via a Deep Interactive Multi-task Learning Framework6
Perturbation-enabled Deep Federated Learning for Preserving Internet of Things-based Social Networks6
Semantics and Non-fungible Tokens for Copyright Management on the Metaverse and Beyond6
Mask or Non-Mask? Robust Face Mask Detector via Triplet-Consistency Representation Learning6
Doctor's Dilemma: Evaluating an Explainable Subtractive Spatial Lightweight Convolutional Neural Network for Brain Tumor Diagnosis6
Accelerating Transform Algorithm Implementation for Efficient Intra Coding of 8K UHD Videos6
TT-TSVD: A Multi-modal Tensor Train Decomposition with Its Application in Convolutional Neural Networks for Smart Healthcare6
Evaluation of Shared Resource Allocation Using SAND for ABR Streaming6
A Multi-feature and Time-aware-based Stress Evaluation Mechanism for Mental Status Adjustment6
Learning Semantic Representation on Visual Attribute Graph for Person Re-identification and Beyond6
Hierarchical and Progressive Image Matting6
Adaptive Compression for Online Computer Vision: An Edge Reinforcement Learning Approach6
Lightweight Single Image Super-resolution with Dense Connection Distillation Network6
Upgrading the Newsroom6
AMSA: Adaptive Multimodal Learning for Sentiment Analysis6
Posed and Spontaneous Expression Distinction Using Latent Regression Bayesian Networks6
WTRPNet: An Explainable Graph Feature Convolutional Neural Network for Epileptic EEG Classification6
Cross-Domain Brain CT Image Smart Segmentation via Shared Hidden Space Transfer FCM Clustering6
Detection of Moving Object Using Superpixel Fusion Network6
Perceptual Hashing of Deep Convolutional Neural Networks for Model Copy Detection6
Leveraging Deep Statistics for Underwater Image Enhancement6
SPGAN: Face Forgery Using Spoofing Generative Adversarial Networks6
0.049622058868408