IEEE Transactions on Big Data

Papers
(The median citation count of IEEE Transactions on Big Data is 2. The table below lists the papers above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-05-01 to 2025-05-01.)
Article | Citations
A Query-Aware Method for Approximate Range Search in Hamming Space | 1303
FLAG: Faster Learning on Anchor Graph with Label Predictor Optimization | 293
A Structured Approach Towards Big Data Identification | 189
Don’t Be Misled by Emotion! Disentangle Emotions and Semantics for Cross-Language and Cross-Domain Rumor Detection | 104
Bi-Selection of Instances and Features Based on Neighborhood Importance Degree | 100
Towards Fair and Scalable Trial Assignment in Federated Bandits: A Shapley Value Approach | 85
A Lightweight Matrix Factorization for Recommendation With Local Differential Privacy in Big Data | 82
Meta-Learning Based Classification for Moving Object Trajectories in Mobile IoT | 81
A novel concept-cognitive learning model oriented to three-way concept for knowledge acquisition | 80
Multiple Riemannian Manifold-Valued Descriptors Based Image Set Classification With Multi-Kernel Metric Learning | 62
Mining Stable Communities in Temporal Networks by Density-Based Clustering | 62
FedFAIM: A Model Performance-Based Fair Incentive Mechanism for Federated Learning | 60
FIFA: A Forest-based Sliding Window Aggregation Scheme for Out-of-Order Data Streams | 60
Aligning Crowd-Sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models | 55
LGM-GNN: A Local and Global Aware Memory-Based Graph Neural Network for Fraud Detection | 45
Billion-Scale Similarity Search with GPUs | 44
Cosine Multilinear Principal Component Analysis for Recognition | 42
On the Convergence of Federated Learning Algorithms Without Data Similarity | 42
Edge-DPSDG: An Edge-Based Differential Privacy Protection Model for Smart Healthcare | 42
Time Series Anomaly Detection for Trustworthy Services in Cloud Computing Systems | 42
IEMask R-CNN: Information-Enhanced Mask R-CNN | 39
BiLSTM-SSVM: Training the BiLSTM with a Structured Hinge Loss for Named-Entity Recognition | 39
Natural Language Processing for Arabic Sentiment Analysis: A Systematic Literature Review | 39
Research of Federated Learning Application Methods and Social Responsibility | 37
Privacy-Preserving Public Release of Datasets for Support Vector Machine Classification | 37
Guest Editorial TBD Special Issue on Graph Machine Learning for Recommender Systems | 36
Verifiable and Privacy-Preserving $k$-NN Query Scheme with Multiple Keys | 36
Towards Fraud Detection Via Fine-Grained Classification of User Behavior | 35
Parallel Graph Learning with Temporal Stamp Encoding for Fraudulent Transactions Detections | 34
SCOREH+: A High-Order Node Proximity Spectral Clustering on Ratios-of-Eigenvectors Algorithm for Community Detection | 33
Casformer: Information Popularity Prediction With Adaptive Cascade Sampling and Graph Transformer in Social Networks | 33
Risk-Constrained Reinforcement Learning With Augmented Lagrangian Multiplier for Portfolio Optimization | 33
Automatic Recognition of Cyberbullying in the Web of Things and social media using Deep Learning Framework | 32
Differential Modal Multistage Adaptive Fusion Networks via Knowledge Distillation for RGB-D Mirror Segmentation | 32
System Identification With Fourier Transformation for Long-Term Time Series Forecasting | 32
Higher-order Community Detection by Motif-based Modularity Optimization | 31
BitAnalysis: A Visualization System for Bitcoin Wallet Investigation | 31
Feature Subspace Learning-Based Binary Differential Evolution Algorithm for Unsupervised Feature Selection | 30
Label-Weighted Graph-Based Learning for Semi-Supervised Classification Under Label Noise | 30
Adapt Anything: Tailor Any Image Classifier across Domains And Categories Using Text-to-Image Diffusion Models | 29
Joint Multi-Feature Information Entity Alignment for Cross-Lingual Temporal Knowledge Graph With BERT | 29
A New Approach of Exploiting Self-Adjoint Matrix Polynomials of Large Random Matrices for Anomaly Detection and Fault Location | 29
Use of transfer learning for affordable in-context fake review generation | 28
A Comprehensive Trustworthy Data Collection Approach in Sensor-Cloud Systems | 27
Data-Driven Web APIs Recommendation for Building Web Applications | 27
Spatial-Temporal Contrasting for Fine-Grained Urban Flow Inference | 27
Learning Balanced Bayesian Classifiers From Labeled and Unlabeled Data | 25
Efficient Forward and Backward Private Conjunctive Searchable Encryption with Comprehensive Verification | 24
AnesFormer: An End-to-End Framework for EEG-Based Anesthetic State Classification | 24
Data Exchange for the Metaverse with Accountable Decentralized TTPs and Incentive Mechanisms | 23
A Composable Generative Framework Based on Prompt Learning for Various Information Extraction Tasks | 23
Fine-Tuned Personality Federated Learning for Graph Data | 23
Efficient Event Inference and Context-Awareness in Internet of Things Edge Systems | 23
Hierarchical Lifelong Machine Learning With “Watchdog” | 23
ScaleJoin: A Deterministic, Disjoint-Parallel and Skew-Resilient Stream Join | 23
Attention-Based Complex Logical Query on Temporal Knowledge Graph via Graph Neural Network | 23
STGAN: Spatio-Temporal Generative Adversarial Network for Traffic Data Imputation | 22
Decentralized Federated Learning: A Survey on Security and Privacy | 22
Multi-View Clustering With Self-Representation and Structural Constraint | 22
SIESTA: A Scalable Infrastructure of Sequential Pattern Analysis | 22
Boosting Encrypted Traffic Classification Using Feature-Enhanced Recurrent Neural Network with Angle Constraint | 22
Semantic-Based and Entity-Resolution Fusion to Enhance Quality of Big RDF Data | 22
Metagraph-Based Life Pattern Clustering With Big Human Mobility Data | 21
Information Switching Patterns of Risk Communication in Social Media during Disasters | 21
Heterogeneous Social Event Detection via Hyperbolic Graph Representations | 21
PPHOPCM: Privacy-Preserving High-Order Possibilistic c-Means Algorithm for Big Data Clustering with Cloud Computing | 21
Weak Supervision Learning for Object Co-Segmentation | 20
Discovering and Understanding Geographical Video Viewing Patterns in Urban Neighborhoods | 20
An Effective 2-Dimension Graph Partitioning for Work Stealing Assisted Graph Processing on Multi-FPGAs | 20
RGSE: Robust Graph Structure Embedding for Anomalous Link Detection | 19
Efficient Asynchronous Multi-Participant Vertical Federated Learning | 19
Streaming Local Community Detection Through Approximate Conductance | 19
Practical Attribute Reconstruction Attack Against Federated Learning | 19
Computing Significant Cliques in Large Labeled Networks | 19
Graph Prompt Learning Method for the Demand-Responsive Transport Routing Problem | 18
Efficiently Transfer User Profile Across Networks | 18
Multi-Dimensional Data Recovery via Feature-Based Fully-Connected Tensor Network Decomposition | 18
SGAMF: Sparse Gated Attention-Based Multimodal Fusion Method for Fake News Detection | 18
Fast Multi-View Outlier Detection via Deep Encoder | 18
A Federated Convolution Transformer for Fake News Detection | 18
Heterogeneous Daily Living Activity Learning Through Domain Invariant Feature Subspace | 17
AugGPT: Leveraging ChatGPT for Text Data Augmentation | 17
eBoF: Interactive Temporal Correlation Analysis for Ensemble Data Based on Bag-of-Features | 17
Adaptive Graph Structure Learning Neural Rough Differential Equations for Multivariate Time Series Forecasting | 17
Topology-based Node-level Membership Inference Attacks on Graph Neural Networks | 17
Efficient and Privacy-Preserving Aggregate Query Over Public Property Graphs | 17
A Generalized Deep Learning Algorithm Based on NMF for Multi-View Clustering | 17
Expertise or Hallucination? A Comprehensive Evaluation of ChatGPT's Aptitude in Clinical Genetics | 16
Uncovering Local Hierarchical Overlapping Communities at Scale | 16
Distributed Sparse Class-Imbalance Learning and Its Applications | 16
Linear Time Community Detection by a Novel Modularity Gain Acceleration in Label Propagation | 16
Revocable DSSE in Healthcare Systems with Range Query Support | 16
Self-Attention Graph Convolution Residual Network for Traffic Data Completion | 16
PViTGAtt-IP: Severity Quantification of Lung Infections in Chest X-rays and CT Scans via Parallel and Cross-Attended Encoders | 16
SZ3: A Modular Framework for Composing Prediction-Based Error-Bounded Lossy Compressors | 15
Multi-Label Graph Convolutional Network Representation Learning | 15
Multiple Distance-Based Coding: Toward Scalable Feature Matching for Large-Scale Web Image Search | 15
A Survey on the Methods and Results of Data-Driven Koopman Analysis in the Visualization of Dynamical Systems | 15
A Survey on Spatio-Temporal Big Data Analytics Ecosystem: Resource Management, Processing Platform, and Applications | 15
GGNN: Graph-Based GPU Nearest Neighbor Search | 15
Adaptively-Accelerated Parallel Stochastic Gradient Descent for High-Dimensional and Incomplete Data Representation Learning | 15
Managing Big Interval Data with CINTIA: The Checkpoint INTerval Array | 15
Higher-Order Smoothness Enhanced Graph Collaborative Filtering | 15
An Overall Evaluation on Benefits of Competitive Influence Diffusion | 14
Core Maintenance on Dynamic Graphs: A Distributed Approach Built on H-Index | 14
Mobile Network Traffic Prediction Based on Seasonal Adjacent Windows Sampling and Conditional Probability Estimation | 14
Regression Analysis of Predictions and Forecasts of Cloud Data Center KPIs Using the Boosted Decision Tree Algorithm | 14
GCLNet: Generalized Contrastive Learning for Weakly Supervised Temporal Action Localization | 13
Efficient Learned Spatial Index With Interpolation Function Based Learned Model | 13
CTDI: CNN-Transformer-Based Spatial-Temporal Missing Air Pollution Data Imputation | 13
Efficient Interactive Global Cellular Signal Strength Visualization | 13
Portraying Fine-grained Tenant Portrait for Churn Prediction using Semi-supervised Graph Convolution and Attention Network | 13
Mining Hierarchical Information of CNNs for Scene Classification of VHR Remote Sensing Images | 13
Modeling and Visualizing Student Flow | 12
Improved Gradient Inversion Attacks and Defenses in Federated Learning | 12
Outsourced Privacy-Preserving Data Alignment on Vertically Partitioned Database | 12
OpinionRank: Trustworthy Website Detection Using Three Valued Subjective Logic | 11
A Fast and Robust Attention-Free Heterogeneous Graph Convolutional Network | 11
Cepe-FL: Communication-efficient and Privacy-enhanced Federated Learning via Adaptive Compressive Sensing | 11
Adaptive Superpixel Segmentation With Non-Uniform Seed Initialization | 11
Multiscale Feature-guided Adversarial Examples Quality Assessment via Hierarchical Perception of Human Visual System | 11
Identity-Based Dynamic Data Auditing for Big Data Storage | 11
Online Non-Stationary Pricing Incentives for Budget-Limited Crowdsensing | 11
GE-GNN: Gated Edge-augmented Graph Neural Network for Fraud Detection | 10
Distributed Dual Averaging Based Data Clustering | 10
Cost-Aware Triangle Counting over Geo-Distributed Datacenters | 10
Spatio-Temporal Transformer Network for Weather Forecasting | 10
Boosting Nonnegative Matrix Factorization Based Community Detection With Graph Attention Auto-Encoder | 10
Spatial-Attention and Demographic-Augmented Generative Adversarial Imputation Network for Population Health Data Reconstruction | 10
Dynamic Entity-Based Named Entity Recognition Under Unconstrained Tagging Schemes | 10
Towards the Inference of Travel Purpose with Heterogeneous Urban Data | 10
Towards Enhancing Inter-Domain Routing Security with Visualization and Visual Analytics | 10
Restoration of Recaptured Screen Images with A Divide and Conquer Strategy | 10
Identification of Communities With Multi-Semantics via Bayesian Generative Model | 9
Adaptive Powerball Stochastic Conjugate Gradient for Large-Scale Learning | 9
Denoising Neural Relation Extraction for Spatio-temporal Recommendation System | 9
Unsupervised Cross-View Subspace Clustering via Adaptive Contrastive Learning | 9
A Multi-Modal Hypergraph Neural Network via Parametric Filtering and Feature Sampling | 9
CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity Quantification | 9
Efficient Graph Processing with Invalid Update Filtration | 9
Many Hands Make Light Work: Group Influence Maximization in Evolving Social Networks | 9
Data-Driven Digital Advertising with Uncertain Demand Model in Metro Networks | 9
Parallel Overlapping Community Detection Algorithm on GPU | 9
Mix2SFL: Two-Way Mixup for Scalable, Accurate, and Communication-Efficient Split Federated Learning | 8
LSTM Based Phishing Detection for Big Email Data | 8
HEART: Historically Information Embedding and Subspace Re-Weighting Transformer-Based Tracking | 8
ATLAS: GAN-Based Differentially Private Multi-Party Data Sharing | 8
LongArms: Fraud Prediction in Online Lending Services Using Sparse Knowledge Graph | 8
Deep Convolutional Neural Network Based Medical Concept Normalization | 8
Robust Low Transformed Multi-Rank Tensor Completion With Deep Prior Regularization for Multi-Dimensional Image Recovery | 8
A Survey of Blockchain-Based Schemes for Data Sharing and Exchange | 8
Towards Efficient Synchronous Federated Training: A Survey on System Optimization Strategies | 8
Blockchain-empowered Federated Learning: Benefits, Challenges, and Solutions | 8
Exploring New Frontiers in Agricultural NLP: Investigating the Potential of Large Language Models for Food Applications | 8
A Survey of Data Pricing for Data Marketplaces | 8
GCN-ST-MDIR: Graph Convolutional Network-Based Spatial-Temporal Missing Air Pollution Data Pattern Identification and Recovery | 8
Task Allocation Under Geo-Indistinguishability via Group-Based Noise Addition | 8
High-Ratio Lossy Compression: Exploring the Autoencoder to Compress Scientific Data | 8
Big Data for the Social Good: The Drought Early-Warning Experience Report | 8
Personalized Recommendation in P2P Lending Based on Risk-Return Management: A Multi-Objective Perspective | 8
Unlocking Large Language Model Power in Industry: Privacy-Preserving Collaborative Creation of Knowledge Graph | 8
MiSTR: A Multiview Structural-Temporal Learning Framework for Rumor Detection | 7
Scalable Evidential K-Nearest Neighbor Classification on Big Data | 7
Unsupervised Projected Sample Selector for Active Learning | 7
Dynamic Radio Map Construction With Minimal Manual Intervention: A State Space Model-Based Approach With Imitation Learning | 7
FIG: Feature-Weighted Information Granules With High Consistency Rate | 7
A Multi-Aspect Neural Tensor Factorization Framework for Patent Litigation Prediction | 7
Memory Scaling of Cloud-Based Big Data Systems: A Hybrid Approach | 7
Reputation-Aware Federated Learning Client Selection Based on Stochastic Integer Programming | 7
EvoSets: Tracking the Sensitivity of Dimensionality Reduction Results Across Subspaces | 7
Robust Semi-Supervised Deep Nonnegative Matrix Factorization with Constraint Propagation for Data Representation | 7
Practical Vertical Federated Learning With Unsupervised Representation Learning | 7
MultiTec: A Data-Driven Multimodal Short Video Detection Framework for Healthcare Misinformation on TikTok | 7
Heterogeneous Device Collaboration Based Federated Learning for Big Data Applications | 7
Epidemic Spread Modeling for COVID-19 Using Cross-Fertilization of Mobility Data | 7
Using App Usage Data From Mobile Devices to Improve Activity-Based Travel Demand Models | 7
Trust Based Incentive Scheme to Allocate Big Data Tasks with Mobile Social Cloud | 7
Understanding the Users and Videos by Mining a Novel Danmu Dataset | 7
FEVERLESS: Fast and Secure Vertical Federated Learning Based on XGBoost for Decentralized Labels | 7
Multi-Objective Graph Contrastive Learning for Recommendation | 7
Denoised Graph Collaborative Filtering via Neighborhood Similarity and Dynamic Thresholding | 7
AFS-FCM With Memory: A Model for Air Quality Multi-Dimensional Prediction With Interpretability | 6
Utility-driven Data Analytics Algorithm for Transaction Modifications Using Pre-large Concept with Single Database Scan | 6
Multiple-Perspective Clustering of Passive Wi-Fi Sensing Trajectory Data | 6
SBPA: Sybil-Based Backdoor Poisoning Attacks for Distributed Big Data in AIoT-Based Federated Learning System | 6
DCLCSE: Dynamic Curriculum Learning Based Contrastive Learning of Sentence Embeddings | 6
Supervised Discrete Multiple-Length Hashing for Image Retrieval | 6
On Security of an Identity-Based Dynamic Data Auditing Protocol for Big Data Storage | 6
Towards an Energy Complexity Model for Distributed Data Processing Algorithms | 6
GGraph: An Efficient Structure-Aware Approach for Iterative Graph Processing | 6
Data Reconstruction and Protection in Federated Learning for Fine-Tuning Large Language Models | 6
A Scalable Algorithm for Large-Scale Unsupervised Multi-View Partial Least Squares | 6
Resource-Aware Federated Neural Architecture Search over Heterogeneous Mobile Devices | 6
Augmented Multi-Party Computation Against Gradient Leakage in Federated Learning | 6
PR3: Reversible and Usability-Enhanced Visual Privacy Protection via Thumbnail Preservation and Data Hiding | 6
A Distributed Generative Adversarial Network for Data Augmentation Under Vertical Federated Learning | 6
Robust Joint Graph Learning for Multi-View Clustering | 6
Training Large-Scale Graph Neural Networks via Graph Partial Pooling | 6
A Survey on Truth Discovery: Concepts, Methods, Applications, and Opportunities | 6
MNL: A Highly-Efficient Model for Large-scale Dynamic Weighted Directed Network Representation | 6
Link Prediction in Knowledge Graphs: A Hierarchy-Constrained Approach | 6
Self-Guided Graph Refinement with Progressive Fusion for Multiplex Graph Contrastive Representation Learning | 5
Towards Scalable Multi-View Clustering via Joint Learning of Many Bipartite Graphs | 5
A Drift-Sensitive Distributed LSTM Method for Short Text Stream Classification | 5
Emulating Reader Behaviors for Fake News Detection | 5
Multilevel Stochastic Optimization for Imputation in Massive Medical Data Records | 5
TgStore: An Efficient Storage System for Large Time-Evolving Graphs | 5
zkFL: Zero-Knowledge Proof-Based Gradient Aggregation for Federated Learning | 5
A Multiplex Hypergraph Attribute-based Graph Collaborative Filtering for Cold-start POI Recommendation | 5
NAGphormer+: A Tokenized Graph Transformer With Neighborhood Augmentation for Node Classification in Large Graphs | 5
Data-Free Knowledge Filtering and Distillation in Federated Learning | 5
Noiseless Privacy: Definition, Guarantees, and Applications | 5
A Privacy-preserving Large-scale Image Retrieval Framework with Vision GNN Hashing | 5
How to Protect Ourselves From Overlapping Community Detection in Social Networks | 5
Tailored Definitions With Easy Reach: Complexity-Controllable Definition Generation | 5
Stable Learning via Dual Feature Learning | 5
Large Language Models for Link Stealing Attacks Against Graph Neural Networks | 5
Deep Incremental Hashing for Semantic Image Retrieval With Concept Drift | 5
Data-Centric Graph Learning: A Survey | 5
3A Multi-Classification Division-Aggregation Framework for Fake News Detection | 5
Learning From Crowds Using Graph Neural Networks With Attention Mechanism | 5
TUSQ: Targeted High-Utility Sequence Querying | 4
Comment on “Efficient Secure Outsourcing of Large-Scale Sparse Linear Systems of Equations” | 4
Citywide Traffic Volume Inference with Surveillance Camera Records | 4
Cross-Region Courier Displacement for On-Demand Delivery With Multi-Agent Reinforcement Learning | 4
Rethinking Embedded Unsupervised Feature Selection: A Simple Joint Approach | 4
A Multi-Branch Decoder Network Approach to Adaptive Temporal Data Selection and Reconstruction for Big Scientific Simulation Data | 4
DIGDUG: Scalable Separable Dense Graph Pruning and Join Operations in MapReduce | 4
BiG-Fed: Bilevel Optimization Enhanced Graph-Aided Federated Learning | 4
Denoising Implicit Feedback for Graph Collaborative Filtering via Causal Intervention | 4
Cost-Efficient Heterogeneous Worker Recruitment under Coverage Requirement in Spatial Crowdsourcing | 4
Look Closer to Your Enemy: Learning to Attack via Teacher-Student Mimicking | 4
MarS-FL: Enabling Competitors to Collaborate in Federated Learning | 4
LBFL: A Lightweight Blockchain-based Federated Learning Framework with Proof-of-Contribution Committee Consensus | 4
AMDECDA: Attention Mechanism Combined With Data Ensemble Strategy for Predicting CircRNA-Disease Association | 4
Weakly-Supervised Cross-Domain Segmentation of Electron Microscopy With Sparse Point Annotation | 4
An Adaptive Pattern Learning Framework to Personalize Online Seizure Prediction | 4
Visualization of Big Spatial Data Using Coresets for Kernel Density Estimates | 4
NPP: A New Privacy-Aware Public Auditing Scheme for Cloud Data Sharing with Group Users | 4
Deep Residual Coupled Prompt Learning for Zero-Shot Sketch-Based Image Retrieval | 4
Physical Black-Box Adversarial Attacks Through Transformations | 4
Accelerating Vertical Federated Learning | 4
Dual Graph Convolutional Networks for Social Network Alignment | 4
Deep Umbra: A Generative Approach for Sunlight Access Computation in Urban Spaces | 4
Bi-Directional Feature Fixation-Based Particle Swarm Optimization for Large-Scale Feature Selection | 3
Moving Conditional GAN Close to Data: Synthetic Tabular Data Generation and its Experimental Evaluation | 3
Large Language Model-informed ECG Dual Attention Network for Heart Failure Risk Prediction | 3
Combine the Growth of Cascades and Impact of Users for Diffusion Prediction | 3
Model-Agnostic Method: Exposing Deepfake Using Pixel-Wise Spatial and Temporal Fingerprints | 3
A Black-Box Adversarial Attack Method via Nesterov Accelerated Gradient and Rewiring Towards Attacking Graph Neural Networks | 3