Journal of Big Data

Papers
(The H4-Index of Journal of Big Data is 44. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-04-01 to 2025-04-01.)
Article | Citations
DD-KARB: data-driven compliance to quality by rule based benchmarking | 504
Fine grain algorithm parallelization on a hybrid control-flow and dataflow processor | 322
Machine learning-based prediction of elliptical double steel columns under compression loading | 318
SIMER: an accurate and intelligent tool for simulating customizable population data across species in complex scenarios | 265
Sub-spatial prediction of votes integrating socioeconomic, educational, and age strata with machine learning and topological data analysis | 245
Unsupervised label generation for severely imbalanced fraud data | 223
Federated learning-driven IoT system for automated freshness monitoring in resource-constrained vending carts | 204
A scheduling algorithm to maximize storm throughput in heterogeneous cluster | 164
Dual channel and multi-scale adaptive morphological methods for infrared small targets | 161
Efficient pollen grain classification using pre-trained Convolutional Neural Networks: a comprehensive study | 153
Gaussian transformation enhanced semi-supervised learning for sleep stage classification | 143
Supervised contrastive pre-training models for mammography screening | 127
A new dimensionality reduction technique based on the Wavelet Transform for cancer classification | 124
Data analysis for vague contingency data | 101
Towards a folksonomy graph-based context-aware recommender system of annotated books | 98
Apply machine learning techniques to detect malicious network traffic in cloud computing | 93
PCJ Java library as a solution to integrate HPC, Big Data and Artificial Intelligence workloads | 87
A data value metric for quantifying information content and utility | 84
Accuracy improvements for cold-start recommendation problem using indirect relations in social networks | 82
Real-time spatio-temporal event detection on geotagged social media | 80
Dissimilarity space reinforced with manifold learning and latent space modeling for improved pattern classification | 76
Remote patient monitoring and classifying using the internet of things platform combined with cloud computing | 74
An empirical study on the evaluation of the RDF storage systems | 72
Detection of fickle trolls in large-scale online social networks | 71
Machine learning concepts for correlated Big Data privacy | 71
Diabetes emergency cases identification based on a statistical predictive model | 71
Exploring the form of big data products and the supporting systems | 70
Operationalizing and automating Data Governance | 68
Domain-relevance of influence: characterizing variations in online influence across multiple domains on social media | 61
Defining user spectra to classify Ethereum users based on their behavior | 61
Title2Vec: a contextual job title embedding for occupational named entity recognition and other applications | 60
Classification of long-term clinical course of Parkinson’s disease using clustering algorithms on social support registry database | 58
Prognostic stratification based on HIF-1α signaling for evaluating hypoxia status and immune landscape in hepatocellular carcinoma | 55
Free trade as domestic, economic, and strategic issues: a big data analytics approach | 54
Social media analysis of car parking behavior using similarity based clustering | 54
Tabular and latent space synthetic data generation: a literature review | 53
Context-aware prediction of active and passive user engagement: Evidence from a large online social platform | 52
Detecting unregistered users through semi-supervised anomaly detection with similarity datasets | 51
ASENN: attention-based selective embedding neural networks for road distress prediction | 48
New custom rating for improving recommendation system performance | 47
A survey of graph convolutional networks (GCNs) in FPGA-based accelerators | 46
Memetic multilabel feature selection using pruned refinement process | 46
CTGAN-ENN: a tabular GAN-based hybrid sampling method for imbalanced and overlapped data in customer churn prediction | 45
Sentiment analysis of Indonesian datasets based on a hybrid deep-learning strategy | 44
Quality assurance strategies for machine learning applications in big data analytics: an overview | 44
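
The threshold of 44 can be checked against the citation counts listed above. The sketch below assumes that the H4-Index is the standard h-index (the largest h such that h papers have at least h citations each) restricted to the four-year window; the page itself does not spell out the formula, and the function name `h_index` is illustrative only.

```python
def h_index(citations):
    """Return the largest h such that h papers have at least h citations each."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


# Citation counts copied from the table above, in listed order.
counts = [
    504, 322, 318, 265, 245, 223, 204, 164, 161, 153,
    143, 127, 124, 101, 98, 93, 87, 84, 82, 80,
    76, 74, 72, 71, 71, 71, 70, 68, 61, 61,
    60, 58, 55, 54, 54, 53, 52, 51, 48, 47,
    46, 46, 45, 44, 44,
]

print(h_index(counts))  # 44 for the counts listed here
```

Under that assumption, the 44th-ranked paper has 44 citations and the 45th also has 44, which is below its rank of 45, so the index stops at 44. Papers with fewer citations than the threshold are not shown in the table and would not change the result.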