Big Data Research

Papers
(The median citation count of Big Data Research is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-12-01 to 2025-12-01.)
ArticleCitations
Optimizing image captioning: The effectiveness of vision transformers and VGG networks for remote sensing129
Improved Tesseract optical character recognition performance on Thai document datasets86
Quantitative analysis of big data for land resource classification and zoning at the township level in Northern Shaanxi75
ML-aVAT: A Novel 2-Stage Machine-Learning Approach for Automatic Clustering Tendency Assessment68
Automatic Prediction of T2/T3 Staging of Rectal Cancer Based on Radiomics and Machine Learning67
Data Stream Classification Based on Extreme Learning Machine: A Review64
Early Pathogen Prediction in Crops Using Nano Biosensors and Neural Network-Based Feature Extraction and Classification56
Deep attention dynamic representation learning networks for recommender system review modeling54
A multiscale electricity theft detection model based on feature engineering53
Development of an integrated data system for regional tourism analysis in Italy: A microdata perspective47
Similarity Measurement for Graph Data: An Improved Centrality and Geometric Perspective-Based Approach47
Hourglass pattern matching for deep aware neural network text recommendation model32
Big Data in organizations: Exploring the adoption of Big Data applications and their impact on organizations in China and the Netherlands25
Visual Analytics of Trajectories with RoseTrajVis25
A methodology to assess and evaluate sites with high potential for stormwater harvesting in Dehradun, India24
Analysis of Occupational Profiles in the Brazilian Workforce Based on Non-Negative Matrix Factorization23
Towards Efficient Energy Utilization Using Big Data Analytics in Smart Cities for Electricity Theft Detection20
Complex data in tourism analysis: A stochastic approach to price competition19
Editorial Board18
Unleashing the Power of Digital Twin and Big Data as a New Frontier for Smart Mobility: An Ecosystem Perspective17
Fog-Computing Based Healthcare Framework for Predicting Encephalitis Outbreak17
Big Data in Forecasting Research: A Literature Review16
Editorial Board16
A Multi-View Filter for Relation-Free Knowledge Graph Completion16
Spatio-Temporal Characteristics of Influenza Burden and Its Influence Factors in Japan in the Past Three Decades: An Influenza Disease Burden Data-Based Modeling Study16
A Novel Big Data Approach for Record and Represent Compliance in the Covid-19 Era15
A novel approach for job matching and skill recommendation using transformers and the O*NET database15
A Segmented PageRank-Based Value Compensation Method for Personal Data in Alliance Blockchains14
Augmented Functional Analysis of Variance (A-fANOVA): Theory and Application to Google Trends for Detecting Differences in Abortion Drugs Queries12
Visual Exploratory Data Analysis for Copy Number Variation Studies in Biomedical Research10
What Is a Multi-Modal Knowledge Graph: A Survey10
Meta-Learning Based Dynamic Adaptive Relation Learning for Few-Shot Knowledge Graph Completion10
Deep semantics-preserving cross-modal hashing10
Investigating Influence of Google-Play Application Titles on Success10
BETM: A new pre-trained BERT-guided embedding-based topic model9
Research on the characteristics of information propagation dynamic on the weighted multiplex Weibo networks9
A Facial Expression Recognition Approach for Social IoT Frameworks9
Techniques for interactive visual examination of vessel performance9
Editorial Board9
Correcting inconsistencies in knowledge graphs with correlated knowledge9
Distributed Heterogeneous Transfer Learning9
A novel study of kernel graph regularized semi-non-negative matrix factorization with orthogonal subspace for clustering9
Editorial Board8
Intelligent geological interpretation of AMT data based on machine learning8
GeoYCSB: A Benchmark Framework for the Performance and Scalability Evaluation of Geospatial NoSQL Databases8
Epidemic Spreading in Trajectory Networks8
MLPQ: A Dataset for Path Question Answering over Multilingual Knowledge Graphs8
ImDMI: Improved Distributed M-Invariance model to achieve privacy continuous big data publishing using Apache Spark7
Machine Learning for Tsunami Waves Forecasting Using Regression Trees7
Adaptive spectral GNN and frequency enhanced self-attention for traffic forecasting7
Multi-granularity enhanced graph convolutional network for aspect sentiment triplet extraction6
A Full-Sample Clustering Model Considering Whole Process Optimization of Data6
Integrating Models and Fusing Data in a Deep Ensemble Learning Method for Predicting Epidemic Diseases Outbreak6
Positional-attention based bidirectional deep stacked AutoEncoder for aspect based sentimental analysis6
A Twig-Based Algorithm for Top-k Subgraph Matching in Large-Scale Graph Data6
Editorial Board6
Correlation Expert Tuning System for Performance Acceleration6
The Predictability of Stock Price: Empirical Study on Tick Data in Chinese Stock Market5
Real-Time Traffic Speed Estimation for Smart Cities with Spatial Temporal Data: A Gated Graph Attention Network Approach5
Heterogeneous Graph Convolutional Network Based on Correlation Matrix5
Compression of big data collected in wind farm based on tensor train decomposition5
Deep Learning Techniques for Enhanced Mangrove Land use and Land change from Remote Sensing Imagery: A Blue Carbon Perspective4
Scalable Diversified Top-k Pattern Matching in Big Graphs4
Evaluating Standard Feature Sets Towards Increased Generalisability and Explainability of ML-Based Network Intrusion Detection4
Explanation-Guided Adversarial Example Attacks4
Evolving Dynamic Bayesian Networks by an Analytical Threshold for Dealing with Data Imputation in Time Series Dataset4
Unmasking hate in the pandemic: A cross-platform study of the COVID-19 infodemic4
Big Data Analytics and Visualization in Traffic Monitoring4
Editorial Board4
Downsampling for Binary Classification with a Highly Imbalanced Dataset Using Active Learning4
E-word of mouth in sales volume forecasting: Toyota Camry case study3
Chlorophyll-a concentration variations in Bohai sea: Impacts of environmental complexity and human activities based on remote sensing technologies3
NoSQL data warehouse optimizing models: A comparative study of column-oriented approaches3
Study on the Temporal and Spatial Evolution Characteristics of Chinese Public's Cognition and Attitude to “Double Reduction” Policy Based on Big Data3
Editorial Board3
Exogenous variable driven cotton prices prediction: comparison of statistical model with sequence based deep learning models3
The influence of China's exchange rate market on the Belt and Road trade market: Based on temporal two-layer networks3
Accelerating Columnar Storage Based on Asynchronous Skipping Strategy3
Explainable malware detection through integrated graph reduction and learning techniques3
Neural Topic Modeling with Deep Mutual Information Estimation3
A Large Comparison of Normalization Methods on Time Series3
Remote sensing-enhanced transfer learning approach for agricultural damage and change detection: A deep learning perspective3
Special Issue on Real-Time Intelligent Systems3
Scheduling critical periodic jobs with selective partial computations along with gang jobs3
Faster Multidimensional Data Queries on Infrastructure Monitoring Systems3
Settlement patterns, official statistics and geo-economic dynamics: Evidence from a LADISC approach to Italy3
Data-Efficient Performance Modeling for Configurable Big Data Frameworks by Reducing Information Overlap Between Training Examples3
An Embedding Model for Knowledge Graph Completion Based on Graph Sub-Hop Convolutional Network3
A dual algorithmic approach to deal with multiclass imbalanced classification problems3
Modeling meaningful volatility events to classify monetary policy announcements2
Efficiently Mining Colocation Patterns for Range Query2
Corrigendum to “High Frequency Analysis of Macro News Releases on the Foreign Exchange Market: A Survey of Literature” [Big Data Res. 2 (1) (2015) 33–48]2
TE-PADN: A poisoning attack defense model based on temporal margin samples2
Airspace situation analysis of terminal area traffic flow prediction based on big data and machine learning methods2
PATH: A discrete-sequence dataset for evaluating online unsupervised anomaly detection approaches for multivariate time series2
An Integration visual navigation algorithm for urban air mobility2
Predicting option prices: From the Black-Scholes model to machine learning methods2
Graph Waves2
NGCU: A New RNN Model for Time-Series Data Prediction2
Tangible progress: Employing visual metaphors and physical interfaces in AI-based English language learning2
0.03533411026001