IEEE Transactions on Big Data

Papers
(The TQCC of IEEE Transactions on Big Data is 10. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-11-01 to 2025-11-01.)
ArticleCitations
A Structured Approach Towards Big Data Identification254
A Query-Aware Method for Approximate Range Search in Hamming Space126
Using Construction Waste Hauling Trucks' GPS Data to Classify Earthwork-Related Locations: A Chengdu Case Study108
Mining Stable Communities in Temporal Networks by Density-Based Clustering98
Aligning Crowd-Sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models86
A Lightweight Matrix Factorization for Recommendation With Local Differential Privacy in Big Data85
Bi-Selection of Instances and Features Based on Neighborhood Importance Degree84
Multiple Riemannian Manifold-Valued Descriptors Based Image Set Classification With Multi-Kernel Metric Learning83
Don’t Be Misled by Emotion! Disentangle Emotions and Semantics for Cross-Language and Cross-Domain Rumor Detection74
FIFA: A Forest-based Sliding Window Aggregation Scheme for Out-of-Order Data Streams72
Towards Fair and Scalable Trial Assignment in Federated Bandits: A Shapley Value Approach68
Meta-Learning Based Classification for Moving Object Trajectories in Mobile IoT68
FLAG: Faster Learning on Anchor Graph with Label Predictor Optimization61
LGM-GNN: A Local and Global Aware Memory-Based Graph Neural Network for Fraud Detection57
A Novel Concept-Cognitive Learning Model Oriented to Three-Way Concept for Knowledge Acquisition53
FedFAIM: A Model Performance-Based Fair Incentive Mechanism for Federated Learning52
NTFormer: A Composite Node Tokenized Graph Transformer for Node Classification51
On the Convergence of Federated Learning Algorithms Without Data Similarity50
Verifiable and Privacy-Preserving k-NN Query Scheme With Multiple Keys49
DTWformer: a DTW-Based Transformer for Multivariate Time Series Forecasting48
Higher-Order Community Detection by Motif-Based Modularity Optimization46
Guest Editorial TBD Special Issue on Graph Machine Learning for Recommender Systems45
System Identification With Fourier Transformation for Long-Term Time Series Forecasting45
Cosine Multilinear Principal Component Analysis for Recognition44
Towards Fraud Detection via Fine-Grained Classification of User Behavior44
Determination of the Number of Clusters in High-Dimensional Data with Subspace Clusters44
Edge-DPSDG: An Edge-Based Differential Privacy Protection Model for Smart Healthcare43
Parallel Graph Learning With Temporal Stamp Encoding for Fraudulent Transactions Detections43
Privacy-Preserving Public Release of Datasets for Support Vector Machine Classification42
Differential Modal Multistage Adaptive Fusion Networks via Knowledge Distillation for RGB-D Mirror Segmentation39
IEMask R-CNN: Information-Enhanced Mask R-CNN39
Automatic Recognition of Cyberbullying in the Web of Things and social media using Deep Learning Framework39
Natural Language Processing for Arabic Sentiment Analysis: A Systematic Literature Review39
Research of Federated Learning Application Methods and Social Responsibility39
BiLSTM-SSVM: Training the BiLSTM with a Structured Hinge Loss for Named-Entity Recognition39
BitAnalysis: A Visualization System for Bitcoin Wallet Investigation38
Taylor-Sensus Network: Embracing Noise to Enlighten Uncertainty for Scientific Data37
Time Series Anomaly Detection for Trustworthy Services in Cloud Computing Systems37
Casformer: Information Popularity Prediction With Adaptive Cascade Sampling and Graph Transformer in Social Networks37
Joint Multi-Feature Information Entity Alignment for Cross-Lingual Temporal Knowledge Graph With BERT35
Risk-Constrained Reinforcement Learning With Augmented Lagrangian Multiplier for Portfolio Optimization35
Feature Subspace Learning-Based Binary Differential Evolution Algorithm for Unsupervised Feature Selection35
SCOREH+: A High-Order Node Proximity Spectral Clustering on Ratios-of-Eigenvectors Algorithm for Community Detection35
Multi-view Spectral Clustering on the Grassmannian Manifold With Hypergraph Representation34
Label-Weighted Graph-Based Learning for Semi-Supervised Classification Under Label Noise34
Efficient Forward and Backward Private Conjunctive Searchable Encryption With Comprehensive Verification32
Adapt Anything: Tailor Any Image Classifier Across Domains and Categories Using Text-to-Image Diffusion Models32
AnesFormer: An End-to-End Framework for EEG-Based Anesthetic State Classification32
Spatial-Temporal Contrasting for Fine-Grained Urban Flow Inference32
Data-Driven Web APIs Recommendation for Building Web Applications32
Learning Balanced Bayesian Classifiers From Labeled and Unlabeled Data31
Use of Transfer Learning for Affordable In-Context Fake Review Generation30
A Comprehensive Trustworthy Data Collection Approach in Sensor-Cloud Systems30
Data Exchange for the Metaverse With Accountable Decentralized TTPs and Incentive Mechanisms29
Hierarchical Lifelong Machine Learning With “Watchdog”29
Attention-Based Complex Logical Query on Temporal Knowledge Graph via Graph Neural Network29
Intent-Driven Semantic Query: An Effective Approach for Temporal Knowledge Graph Query28
Multi-View Clustering With Self-Representation and Structural Constraint28
A Comprehensive Image Protection Framework Based on High-Capacity Adversarial Data Hiding28
Efficient Event Inference and Context-Awareness in Internet of Things Edge Systems27
SIESTA: A Scalable Infrastructure of Sequential Pattern Analysis27
Decentralized Federated Learning: A Survey on Security and Privacy26
Boosting Encrypted Traffic Classification Using Feature-Enhanced Recurrent Neural Network With Angle Constraint26
Fine-Tuned Personality Federated Learning for Graph Data26
A Composable Generative Framework Based on Prompt Learning for Various Information Extraction Tasks25
STGAN: Spatio-Temporal Generative Adversarial Network for Traffic Data Imputation25
A Federated Convolution Transformer for Fake News Detection24
Efficiently Transfer User Profile Across Networks24
Efficient Asynchronous Multi-Participant Vertical Federated Learning24
Streaming Local Community Detection Through Approximate Conductance24
Two-Step Nyström Sampling for Large-scale Kernel Approximation22
RGSE: Robust Graph Structure Embedding for Anomalous Link Detection22
Weak Supervision Learning for Object Co-Segmentation22
Heterogeneous Social Event Detection via Hyperbolic Graph Representations21
Discovering and Understanding Geographical Video Viewing Patterns in Urban Neighborhoods21
SGAMF: Sparse Gated Attention-Based Multimodal Fusion Method for Fake News Detection21
Topology-Based Node-Level Membership Inference Attacks on Graph Neural Networks21
Graph Prompt Learning Method for the Demand-Responsive Transport Routing Problem21
Information Switching Patterns of Risk Communication in Social Media During Disasters21
Practical Attribute Reconstruction Attack Against Federated Learning20
An Effective 2-Dimension Graph Partitioning for Work Stealing Assisted Graph Processing on Multi-FPGAs20
Computing Significant Cliques in Large Labeled Networks20
A Generalized Deep Learning Algorithm Based on NMF for Multi-View Clustering19
Metagraph-Based Life Pattern Clustering With Big Human Mobility Data19
PPHOPCM: Privacy-Preserving High-Order Possibilistic c-Means Algorithm for Big Data Clustering with Cloud Computing19
Fast Multi-View Outlier Detection via Deep Encoder19
AugGPT: Leveraging ChatGPT for Text Data Augmentation19
Multi-Dimensional Data Recovery via Feature-Based Fully-Connected Tensor Network Decomposition19
Linear Time Community Detection by a Novel Modularity Gain Acceleration in Label Propagation18
Heterogeneous Daily Living Activity Learning Through Domain Invariant Feature Subspace18
Multi-Label Graph Convolutional Network Representation Learning18
GGNN: Graph-Based GPU Nearest Neighbor Search18
eBoF: Interactive Temporal Correlation Analysis for Ensemble Data Based on Bag-of-Features18
Self-Attention Graph Convolution Residual Network for Traffic Data Completion18
Expertise or Hallucination? A Comprehensive Evaluation of ChatGPT's Aptitude in Clinical Genetics18
Adaptive Graph Structure Learning Neural Rough Differential Equations for Multivariate Time Series Forecasting18
Distributed Sparse Class-Imbalance Learning and Its Applications18
Uncovering Local Hierarchical Overlapping Communities at Scale17
SZ3: A Modular Framework for Composing Prediction-Based Error-Bounded Lossy Compressors17
CAPTOR: Cyber Attack Protection via Temporal Online Graph Representation Learning17
Optimizing Deduplication Parameters via a Change-Estimation Analytical Model17
Revocable DSSE in Healthcare Systems With Range Query Support17
A Survey on the Methods and Results of Data-Driven Koopman Analysis in the Visualization of Dynamical Systems17
A Survey on Spatio-Temporal Big Data Analytics Ecosystem: Resource Management, Processing Platform, and Applications16
PViTGAtt-IP: Severity Quantification of Lung Infections in Chest X-Rays and CT Scans via Parallel and Cross-Attended Encoders16
Mining Hierarchical Information of CNNs for Scene Classification of VHR Remote Sensing Images16
Adaptively-Accelerated Parallel Stochastic Gradient Descent for High-Dimensional and Incomplete Data Representation Learning16
Higher-Order Smoothness Enhanced Graph Collaborative Filtering16
Efficient and Privacy-Preserving Aggregate Query Over Public Property Graphs16
Efficient Interactive Global Cellular Signal Strength Visualization16
GTPool: Graph Transformer Pooling with Diverse Sampling16
Efficient Learned Spatial Index With Interpolation Function Based Learned Model15
An Overall Evaluation on Benefits of Competitive Influence Diffusion15
Portraying Fine-Grained Tenant Portrait for Churn Prediction Using Semi-Supervised Graph Convolution and Attention Network15
GCLNet: Generalized Contrastive Learning for Weakly Supervised Temporal Action Localization15
Regression Analysis of Predictions and Forecasts of Cloud Data Center KPIs Using the Boosted Decision Tree Algorithm15
Editorial Emerging Horizons: The Rise of Large Language Models and Cross-Modal Generative AI15
Outsourced Privacy-Preserving Data Alignment on Vertically Partitioned Database15
CTDI: CNN-Transformer-Based Spatial-Temporal Missing Air Pollution Data Imputation15
Core Maintenance on Dynamic Graphs: A Distributed Approach Built on H-Index14
Identity-Based Dynamic Data Auditing for Big Data Storage14
Towards the Inference of Travel Purpose with Heterogeneous Urban Data14
Improved Gradient Inversion Attacks and Defenses in Federated Learning14
Spatio-Temporal Transformer Network for Weather Forecasting14
Mobile Network Traffic Prediction Based on Seasonal Adjacent Windows Sampling and Conditional Probability Estimation14
Dynamic Entity-Based Named Entity Recognition Under Unconstrained Tagging Schemes14
Multiscale Feature-Guided Adversarial Examples Quality Assessment via Hierarchical Perception of Human Visual System14
Adaptive Superpixel Segmentation With Non-Uniform Seed Initialization13
Cepe-FL: Communication-Efficient and Privacy-Enhanced Federated Learning Via Adaptive Compressive Sensing13
Online Non-Stationary Pricing Incentives for Budget-Limited Crowdsensing13
TAG: Triple Alignment with Rationale Generation for Knowledge-based Visual Question Answering13
Graph-Enhanced Multi-Scale Contrastive Learning for Graph Anomaly Detection with Adaptive Diffusion Models13
A Fast and Robust Attention-Free Heterogeneous Graph Convolutional Network12
GE-GNN: Gated Edge-Augmented Graph Neural Network for Fraud Detection12
OpinionRank: Trustworthy Website Detection Using Three Valued Subjective Logic12
Identification of Communities With Multi-Semantics via Bayesian Generative Model12
Distributed Dual Averaging Based Data Clustering12
Boosting Nonnegative Matrix Factorization Based Community Detection With Graph Attention Auto-Encoder12
CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity Quantification11
Generative-Contrastive Heterogeneous Graph Neural Network11
Denoising Neural Relation Extraction for Spatio-Temporal Recommendation System11
Big Data for the Social Good: The Drought Early-Warning Experience Report11
Restoration of Recaptured Screen Images With a Divide and Conquer Strategy11
A Multi-Modal Hypergraph Neural Network via Parametric Filtering and Feature Sampling11
ATLAS: GAN-Based Differentially Private Multi-Party Data Sharing11
LongArms: Fraud Prediction in Online Lending Services Using Sparse Knowledge Graph11
Towards Enhancing Inter-Domain Routing Security With Visualization and Visual Analytics11
Adaptive Powerball Stochastic Conjugate Gradient for Large-Scale Learning11
High-Ratio Lossy Compression: Exploring the Autoencoder to Compress Scientific Data11
Spatial-Attention and Demographic-Augmented Generative Adversarial Imputation Network for Population Health Data Reconstruction10
HEART: Historically Information Embedding and Subspace Re-Weighting Transformer-Based Tracking10
Robust Low Transformed Multi-Rank Tensor Completion With Deep Prior Regularization for Multi-Dimensional Image Recovery10
A Survey of Data Pricing for Data Marketplaces10
Many Hands Make Light Work: Group Influence Maximization in Evolving Social Networks10
Efficient FCTN Decomposition with Structural Sparsity for Noisy Tensor Completion10
Parallel Overlapping Community Detection Algorithm on GPU10
Unlocking Large Language Model Power in Industry: Privacy-Preserving Collaborative Creation of Knowledge Graph10
Understanding the Users and Videos by Mining a Novel Danmu Dataset10
Unsupervised Cross-View Subspace Clustering via Adaptive Contrastive Learning10
Efficient and Tractable Conflict Detection in Knowledge Graphs via Path Analysis10
Exploring New Frontiers in Agricultural NLP: Investigating the Potential of Large Language Models for Food Applications10
EvoSets: Tracking the Sensitivity of Dimensionality Reduction Results Across Subspaces10
Deep Convolutional Neural Network Based Medical Concept Normalization10
Mix2SFL: Two-Way Mixup for Scalable, Accurate, and Communication-Efficient Split Federated Learning10
Cost-Aware Triangle Counting Over Geo-Distributed Datacenters10
0.14983582496643