Big Data Research

Papers
(The median citation count of Big Data Research is 3. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-05-01 to 2025-05-01.)
ArticleCitations
Automatic Prediction of T2/T3 Staging of Rectal Cancer Based on Radiomics and Machine Learning96
ML-aVAT: A Novel 2-Stage Machine-Learning Approach for Automatic Clustering Tendency Assessment81
Improved Tesseract optical character recognition performance on Thai document datasets64
Data Stream Classification Based on Extreme Learning Machine: A Review52
Optimizing image captioning: The effectiveness of vision transformers and VGG networks for remote sensing51
Early Pathogen Prediction in Crops Using Nano Biosensors and Neural Network-Based Feature Extraction and Classification51
Quantitative analysis of big data for land resource classification and zoning at the township level in Northern Shaanxi51
Similarity Measurement for Graph Data: An Improved Centrality and Geometric Perspective-Based Approach50
A multiscale electricity theft detection model based on feature engineering50
Deep attention dynamic representation learning networks for recommender system review modeling50
Visual Analytics of Trajectories with RoseTrajVis49
Analysis of Occupational Profiles in the Brazilian Workforce Based on Non-Negative Matrix Factorization49
Big Data in organizations: Exploring the adoption of Big Data applications and their impact on organizations in China and the Netherlands46
Performance Benchmarking of Parallel Hyperparameter Tuning for Deep Learning Based Tornado Predictions45
A Framework for Pandemic Prediction Using Big Data Analytics41
Image Classification Approach Using Machine Learning and an Industrial Hadoop Based Data Pipeline41
Hourglass pattern matching for deep aware neural network text recommendation model38
A methodology to assess and evaluate sites with high potential for stormwater harvesting in Dehradun, India37
Towards Efficient Energy Utilization Using Big Data Analytics in Smart Cities for Electricity Theft Detection31
Complex data in tourism analysis: A stochastic approach to price competition31
Editorial Board29
Fog-Computing Based Healthcare Framework for Predicting Encephalitis Outbreak28
A Multi-View Filter for Relation-Free Knowledge Graph Completion27
Spatio-Temporal Characteristics of Influenza Burden and Its Influence Factors in Japan in the Past Three Decades: An Influenza Disease Burden Data-Based Modeling Study25
A Novel Big Data Approach for Record and Represent Compliance in the Covid-19 Era25
Dependency Visualization in Data Stream Profiling23
Editorial Board22
Big Data in Forecasting Research: A Literature Review21
A Segmented PageRank-Based Value Compensation Method for Personal Data in Alliance Blockchains21
Meta-Learning Based Dynamic Adaptive Relation Learning for Few-Shot Knowledge Graph Completion20
A novel approach for job matching and skill recommendation using transformers and the O*NET database20
Visual Exploratory Data Analysis for Copy Number Variation Studies in Biomedical Research19
Augmented Functional Analysis of Variance (A-fANOVA): Theory and Application to Google Trends for Detecting Differences in Abortion Drugs Queries19
Investigating Influence of Google-Play Application Titles on Success17
Federated Geo-Distributed Clouds: Optimizing Resource Allocation Based on Request Type Using Autonomous and Multi-objective Resource Sharing Model17
Theodolite: Scalability Benchmarking of Distributed Stream Processing Engines in Microservice Architectures16
Few-Shot Relation Extraction Towards Special Interests16
Deep semantics-preserving cross-modal hashing15
Research on the characteristics of information propagation dynamic on the weighted multiplex Weibo networks14
Distributed Heterogeneous Transfer Learning14
Visual Exploration of Anomalies in Cyclic Time Series Data with Matrix and Glyph Representations14
Retrofitting Soft Rules for Knowledge Representation Learning14
A Facial Expression Recognition Approach for Social IoT Frameworks14
What Is a Multi-Modal Knowledge Graph: A Survey13
Route Search and Planning: A Survey13
Correcting inconsistencies in knowledge graphs with correlated knowledge12
Epidemic Spreading in Trajectory Networks12
An Efficient Algorithm for Spatio-Textual Object Cluster Join12
Editorial Board12
ViewSeeker: An Interactive View Recommendation Framework12
A novel study of kernel graph regularized semi-non-negative matrix factorization with orthogonal subspace for clustering12
LSDDL: Layer-Wise Sparsification for Distributed Deep Learning11
Cost Optimization for Big Data Workloads Based on Dynamic Scheduling and Cluster-Size Tuning11
Scalable and Flexible Two-Phase Ensemble Algorithms for Causality Discovery11
GeoYCSB: A Benchmark Framework for the Performance and Scalability Evaluation of Geospatial NoSQL Databases11
Positional-attention based bidirectional deep stacked AutoEncoder for aspect based sentimental analysis10
Multi-granularity enhanced graph convolutional network for aspect sentiment triplet extraction10
Intelligent geological interpretation of AMT data based on machine learning10
ImDMI: Improved Distributed M-Invariance model to achieve privacy continuous big data publishing using Apache Spark9
CardioNet: An Efficient ECG Arrhythmia Classification System Using Transfer Learning9
A Twig-Based Algorithm for Top-k Subgraph Matching in Large-Scale Graph Data8
Machine Learning for Tsunami Waves Forecasting Using Regression Trees8
Correlation Expert Tuning System for Performance Acceleration8
Data Science Methodologies: Current Challenges and Future Approaches8
A Full-Sample Clustering Model Considering Whole Process Optimization of Data8
MLPQ: A Dataset for Path Question Answering over Multilingual Knowledge Graphs8
Big Data Analytics and Visualization in Traffic Monitoring7
Real-Time Traffic Speed Estimation for Smart Cities with Spatial Temporal Data: A Gated Graph Attention Network Approach7
Hierarchical Multiresolution Representation of Streaming Time Series7
Evolving Dynamic Bayesian Networks by an Analytical Threshold for Dealing with Data Imputation in Time Series Dataset7
Editorial Board7
FDN-learning: Urban PM2.5-concentration Spatial Correlation Prediction Model Based on Fusion Deep Neural Network7
Deep Learning Techniques for Enhanced Mangrove Land use and Land change from Remote Sensing Imagery: A Blue Carbon Perspective7
Heterogeneous Graph Convolutional Network Based on Correlation Matrix7
Integrating Models and Fusing Data in a Deep Ensemble Learning Method for Predicting Epidemic Diseases Outbreak7
The Predictability of Stock Price: Empirical Study on Tick Data in Chinese Stock Market7
Unmasking hate in the pandemic: A cross-platform study of the COVID-19 infodemic6
A dual algorithmic approach to deal with multiclass imbalanced classification problems6
Editorial Board6
On Divide&Conquer in Image Processing of Data Monster6
Evaluating Standard Feature Sets Towards Increased Generalisability and Explainability of ML-Based Network Intrusion Detection6
Explanation-Guided Adversarial Example Attacks6
Analytical Confidence Intervals for the Number of Different Objects in Data Streams6
Editorial Board6
An Embedding Model for Knowledge Graph Completion Based on Graph Sub-Hop Convolutional Network6
Scalable Diversified Top-k Pattern Matching in Big Graphs6
Downsampling for Binary Classification with a Highly Imbalanced Dataset Using Active Learning6
Special Issue on Real-Time Intelligent Systems6
Multi-Temperate Logical Data Warehouse Design for Large-Scale Healthcare Data6
Remote sensing-enhanced transfer learning approach for agricultural damage and change detection: A deep learning perspective5
Scheduling critical periodic jobs with selective partial computations along with gang jobs5
Tracing the Pace of COVID-19 Research: Topic Modeling and Evolution5
Accelerating Columnar Storage Based on Asynchronous Skipping Strategy5
Babel: A Generic Benchmarking Platform for Big Data Architectures5
A Big Data Analytics Architecture for Smart Cities and Smart Companies5
Chlorophyll-a concentration variations in Bohai sea: Impacts of environmental complexity and human activities based on remote sensing technologies5
Faster Multidimensional Data Queries on Infrastructure Monitoring Systems5
Educational Big Data: Predictions, Applications and Challenges5
CSIP: Enhanced Link Prediction with Context of Social Influence Propagation5
NoSQL data warehouse optimizing models: A comparative study of column-oriented approaches4
Data-Efficient Performance Modeling for Configurable Big Data Frameworks by Reducing Information Overlap Between Training Examples4
Editorial to the Special Issue on Big Data in Industrial and Commercial Applications4
Settlement patterns, Official statistics and geo-economic dynamics: Evidence from a LADISC approach to Italy4
Study on the Temporal and Spatial Evolution Characteristics of Chinese Public's Cognition and Attitude to “Double Reduction” Policy Based on Big Data4
A Large Comparison of Normalization Methods on Time Series4
Neural Topic Modeling with Deep Mutual Information Estimation4
Intelligent Narrative Summaries: From Indicative to Informative Summarization3
Using Big Data to Improve Safety Performance: An Application of Process Mining to Enhance Data Visualisation3
Predicting option prices: From the Black-Scholes model to machine learning methods3
Efficiently Mining Colocation Patterns for Range Query3
TE-PADN: A poisoning attack defense model based on temporal margin samples3
Corrigendum to “High Frequency Analysis of Macro News Releases on the Foreign Exchange Market: A Survey of Literature” [Big Data Res. 2 (1) (2015) 33–48]3
Graph Waves3
Enhancing Precision Medicine: A Big Data-Driven Approach for the Management of Genomic Data3
Scaling the Growing Neural Gas for Visual Cluster Analysis3
ExplorerTree: A Focus+Context Exploration Approach for 2D Embeddings3
Editorial Board3
Airspace situation analysis of terminal area traffic flow prediction based on big data and machine learning methods3
Editorial Board3
Modeling meaningful volatility events to classify monetary policy announcements3
Risk Prediction of Renal Failure for Chronic Disease Population Based on Electronic Health Record Big Data3
A Semantic Approach for Big Data Exploration in Industry 4.03
NGCU: A New RNN Model for Time-Series Data Prediction3
0.0330970287323