Big Data Research

Papers
(The median citation count of Big Data Research is 2. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2022-05-01 to 2026-05-01.)
Article | Citations
Automatic Prediction of T2/T3 Staging of Rectal Cancer Based on Radiomics and Machine Learning | 115
Data Stream Classification Based on Extreme Learning Machine: A Review | 100
Early Pathogen Prediction in Crops Using Nano Biosensors and Neural Network-Based Feature Extraction and Classification | 86
ML-aVAT: A Novel 2-Stage Machine-Learning Approach for Automatic Clustering Tendency Assessment | 80
MIND: A metadata-driven INgestion design pattern for efficient data ingestion | 76
Quantitative analysis of big data for land resource classification and zoning at the township level in Northern Shaanxi | 53
Improved Tesseract optical character recognition performance on Thai document datasets | 39
Optimizing image captioning: The effectiveness of vision transformers and VGG networks for remote sensing | 29
Deep attention dynamic representation learning networks for recommender system review modeling | 26
A multiscale electricity theft detection model based on feature engineering | 25
Similarity Measurement for Graph Data: An Improved Centrality and Geometric Perspective-Based Approach | 24
An integrating multi-scale and multi-deep architectures for facial expression recognition system to predict mental health status | 24
Hourglass pattern matching for deep aware neural network text recommendation model | 23
Analysis of Occupational Profiles in the Brazilian Workforce Based on Non-Negative Matrix Factorization | 22
Big Data in organizations: Exploring the adoption of Big Data applications and their impact on organizations in China and the Netherlands | 20
Editorial Board | 18
A methodology to assess and evaluate sites with high potential for stormwater harvesting in Dehradun, India | 18
Development of an integrated data system for regional tourism analysis in Italy: A microdata perspective | 18
Fog-Computing Based Healthcare Framework for Predicting Encephalitis Outbreak | 16
Unleashing the power of digital twin and big data as a new frontier for smart mobility: An ecosystem perspective | 16
Complex data in tourism analysis: A stochastic approach to price competition | 16
Spatio-Temporal Characteristics of Influenza Burden and Its Influence Factors in Japan in the Past Three Decades: An Influenza Disease Burden Data-Based Modeling Study | 15
A Multi-View Filter for Relation-Free Knowledge Graph Completion | 13
Editorial Board | 12
A Segmented PageRank-Based Value Compensation Method for Personal Data in Alliance Blockchains | 11
A novel approach for job matching and skill recommendation using transformers and the O*NET database | 11
Meta-Learning Based Dynamic Adaptive Relation Learning for Few-Shot Knowledge Graph Completion | 11
Augmented Functional Analysis of Variance (A-fANOVA): Theory and Application to Google Trends for Detecting Differences in Abortion Drugs Queries | 10
A novel study of kernel graph regularized semi-non-negative matrix factorization with orthogonal subspace for clustering | 10
Deep semantics-preserving cross-modal hashing | 10
Research on the characteristics of information propagation dynamic on the weighted multiplex Weibo networks | 10
Editorial Board | 10
Investigating Influence of Google-Play Application Titles on Success | 10
A Facial Expression Recognition Approach for Social IoT Frameworks | 9
Techniques for interactive visual examination of vessel performance | 9
Distributed Heterogeneous Transfer Learning | 9
What Is a Multi-Modal Knowledge Graph: A Survey | 8
BETM: A new pre-trained BERT-guided embedding-based topic model | 8
Editorial Board | 8
Correcting inconsistencies in knowledge graphs with correlated knowledge | 8
Positional-attention based bidirectional deep stacked AutoEncoder for aspect based sentimental analysis | 7
GeoYCSB: A Benchmark Framework for the Performance and Scalability Evaluation of Geospatial NoSQL Databases | 7
ImDMI: Improved Distributed M-Invariance model to achieve privacy continuous big data publishing using Apache Spark | 7
Machine Learning for Tsunami Waves Forecasting Using Regression Trees | 7
Intelligent geological interpretation of AMT data based on machine learning | 7
Multi-granularity enhanced graph convolutional network for aspect sentiment triplet extraction | 6
Editorial Board | 6
Efficient time series forecasting with gated attention and patched data: A transformer-based approach | 6
A Twig-Based Algorithm for Top-k Subgraph Matching in Large-Scale Graph Data | 6
Adaptive spectral GNN and frequency enhanced self-attention for traffic forecasting | 6
MLPQ: A Dataset for Path Question Answering over Multilingual Knowledge Graphs | 6
Heterogeneous Graph Convolutional Network Based on Correlation Matrix | 5
A Full-Sample Clustering Model Considering Whole Process Optimization of Data | 5
Compression of big data collected in wind farm based on tensor train decomposition | 5
Correlation Expert Tuning System for Performance Acceleration | 5
The Predictability of Stock Price: Empirical Study on Tick Data in Chinese Stock Market | 5
Real-Time Traffic Speed Estimation for Smart Cities with Spatial Temporal Data: A Gated Graph Attention Network Approach | 5
Editorial Board | 4
Downsampling for Binary Classification with a Highly Imbalanced Dataset Using Active Learning | 4
Editorial Board | 4
Explanation-Guided Adversarial Example Attacks | 4
Unmasking hate in the pandemic: A cross-platform study of the COVID-19 infodemic | 4
A dual algorithmic approach to deal with multiclass imbalanced classification problems | 4
The influence of China's exchange rate market on the Belt and Road trade market: Based on temporal two-layer networks | 4
Scalable Diversified Top-k Pattern Matching in Big Graphs | 4
Mitigating skill measurement bias in occupational systems: An encoder-Decoder framework cooperated with income prediction | 4
Evolving Dynamic Bayesian Networks by an Analytical Threshold for Dealing with Data Imputation in Time Series Dataset | 4
VertexLocater: PIM-enabled dynamic offloading for graph computing | 4
Evaluating Standard Feature Sets Towards Increased Generalisability and Explainability of ML-Based Network Intrusion Detection | 4
Special Issue on Real-Time Intelligent Systems | 4
Deep Learning Techniques for Enhanced Mangrove Land use and Land change from Remote Sensing Imagery: A Blue Carbon Perspective | 4
Editorial Board | 4
An Embedding Model for Knowledge Graph Completion Based on Graph Sub-Hop Convolutional Network | 3
Accelerating Columnar Storage Based on Asynchronous Skipping Strategy | 3
Chlorophyll-a concentration variations in Bohai sea: Impacts of environmental complexity and human activities based on remote sensing technologies | 3
Settlement patterns, official statistics and geo-economic dynamics: Evidence from a LADISC approach to Italy | 3
Study on the Temporal and Spatial Evolution Characteristics of Chinese Public's Cognition and Attitude to “Double Reduction” Policy Based on Big Data | 3
Remote sensing-enhanced transfer learning approach for agricultural damage and change detection: A deep learning perspective | 3
E-word of mouth in sales volume forecasting: Toyota Camry case study | 3
Data-Efficient Performance Modeling for Configurable Big Data Frameworks by Reducing Information Overlap Between Training Examples | 3
NoSQL data warehouse optimizing models: A comparative study of column-oriented approaches | 3
Exogenous variable driven cotton prices prediction: comparison of statistical model with sequence based deep learning models | 3
Asymmetric deviation entropy regularization for semi-supervised fuzzy C-means clustering and its fast Algorithm | 3
Scheduling critical periodic jobs with selective partial computations along with gang jobs | 3
Neural Topic Modeling with Deep Mutual Information Estimation | 3
Explainable malware detection through integrated graph reduction and learning techniques | 3
A Large Comparison of Normalization Methods on Time Series | 2
Meteorological and satellite remote sensing-driven machine learning-based approach for large-scale winter oilseed rape prediction and inversion | 2
Realistic image-to-image machine unlearning via decoupling and knowledge retention | 2
TE-PADN: A poisoning attack defense model based on temporal margin samples | 2
PATH: A discrete-sequence dataset for evaluating online unsupervised anomaly detection approaches for multivariate time series | 2
Tangible progress: Employing visual metaphors and physical interfaces in AI-based English language learning | 2
Modeling meaningful volatility events to classify monetary policy announcements | 2
Technology topic identification and trend prediction of new energy vehicle based on knowledge graph | 2
Predicting option prices: From the Black-Scholes model to machine learning methods | 2
Editorial Board | 2
An Integration visual navigation algorithm for urban air mobility | 2
Crop monitoring using remote sensing land use and land change data: Comparative analysis of deep learning methods using pre-trained CNN models | 2
Classifier-Based Nonuniform Time Slicing Method for Local Community Evolution Analysis | 2
Graph Waves | 2
Airspace situation analysis of terminal area traffic flow prediction based on big data and machine learning methods | 2
Efficiently Mining Colocation Patterns for Range Query | 2
Opinion fraud detection on massive datasets by spark | 2
Efficient training: Federated learning cost analysis | 2
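
The selection rule stated in the note above the table (publication window, median threshold, cap of 250 rows) can be sketched roughly as follows. This is a minimal illustration, not the site's actual procedure: the function name `select_papers`, the tuple layout, and the placeholder records are all hypothetical, the median is assumed to be taken over papers inside the window, and the comparison is assumed to be "at or above" the median because the table includes entries with exactly 2 citations.

```python
from datetime import date
from statistics import median


def select_papers(papers, start, end, max_rows=250):
    """Return papers at or above the median citation count, most cited first.

    papers: list of (title, citations, published) tuples (hypothetical layout).
    start, end: inclusive publication window.
    """
    # Keep only papers published inside the window.
    in_window = [p for p in papers if start <= p[2] <= end]
    if not in_window:
        return []
    # Assumption: the threshold is the median over the windowed papers.
    threshold = median(p[1] for p in in_window)
    # Assumption: ties with the median are kept (>=), as the table suggests.
    kept = [p for p in in_window if p[1] >= threshold]
    kept.sort(key=lambda p: p[1], reverse=True)
    return kept[:max_rows]


# Placeholder records, not real entries from the table.
sample = [
    ("Paper A", 10, date(2023, 1, 1)),
    ("Paper B", 2, date(2024, 6, 1)),
    ("Paper C", 0, date(2025, 3, 1)),
    ("Paper D", 50, date(2020, 1, 1)),  # outside the window
]
selected = select_papers(sample, date(2022, 5, 1), date(2026, 5, 1))
for title, citations, _ in selected:
    print(f"{title} | {citations}")
```

With the placeholder records, "Paper D" falls outside the window, the median of the remaining counts is 2, and "Paper C" is dropped for being below it.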