Journal of Big Data

Papers
(The median citation count of Journal of Big Data is 6. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-05-01 to 2026-05-01.)
ArticleCitations
Prognostic stratification based on HIF-1α signaling for evaluating hypoxia status and immune landscape in hepatocellular carcinoma733
Long-term survival prediction in patients with acute brain lesions using ensemble machine learning algorithms: a cohort study with combined national health insurance service and its self-run hospital 473
A new dimensionality reduction technique based on the Wavelet Transform for cancer classification459
A universal approach for multi-model schema inference456
Data provider research overview from a public management perspective: a bibliometric analysis utilizing CiteSpace456
Context-aware prediction of active and passive user engagement: Evidence from a large online social platform393
FONDUE—Fine-Tuned Optimization: Nurturing Data Usability & Efficiency278
Integrating deep learning and transfer learning: optimizing white blood cells classification in medical educational institutions267
COA_DNN: a hybrid crayfish optimization with deep neural network for detection of rapid eye movement behaviour disorder210
A multi-granular hybrid neural architecture for detecting abusive content in online social networks (OSNs) with contextual awareness198
GB-AFS: graph-based automatic feature selection for multi-class classification via Mean Simplified Silhouette193
Value-at-risk student prescription trees for price personalization186
Breast cancer prediction using gated attentive multimodal deep learning183
Gene selection via improved nuclear reaction optimization algorithm for cancer classification in high-dimensional data177
An architecture for tactical intention recognition of aerial targets based on unsupervised momentum contrast and transformer175
A proposed hybrid framework to improve the accuracy of customer churn prediction in telecom industry143
Identification of tumor antigens and anoikis-based molecular subtypes in the hepatocellular carcinoma immune microenvironment: implications for mRNA vaccine development and precision treatment141
Domain-relevance of influence: characterizing variations in online influence across multiple domains on social media127
Comprehensive study of driver behavior monitoring systems using computer vision and machine learning techniques125
Efficient pollen grain classification using pre-trained Convolutional Neural Networks: a comprehensive study114
An artificial intelligence platform for predicting postoperative complications in metastatic spinal surgery: development and validation study104
Hybrid beluga whale optimization algorithm with multi-strategy for functions and engineering optimization problems104
Designing and evaluating a big data analytics approach for predicting students’ success factors103
Traffic and road conditions monitoring system using extracted information from Twitter100
Social media analysis of Twitter tweets related to ASD in 2019–2020, with particular attention to COVID-19: topic modelling and sentiment analysis99
Exploring differential privacy in CNNs, LSTMs, GRUs, and RNNs for heartbeat detection from multimodal data97
An efficient binary spider wasp optimizer for multi-dimensional knapsack instances: experimental validation and analysis93
Enhancing AQI forecasting accuracy: integrating ARIMA, ANN, and regression techniques with the development of HM4AQI web application90
Artificial intelligence models for prediction of monthly rainfall without climatic data for meteorological stations in Ethiopia90
DiabSense: early diagnosis of non-insulin-dependent diabetes mellitus using smartphone-based human activity recognition and diabetic retinopathy analysis with Graph Neural Network89
The adaptive community-response (ACR) method for collecting misinformation on social media89
Big data in human behavior research: a contextual turn81
A model for investment type recommender system based on the potential investors based on investors and experts feedback using ANFIS and MNN78
Distributed fuzzy clustering algorithm for mixed-mode data in Apache SPARK77
Novel transformer models for wheat disease classification with explainable insights74
Risk and UCON-based access control model for healthcare big data71
A unified IoT architectural model for smart hospitals: enhancing interoperability, security, and efficiency through clinical information systems (CIS)70
Deep-Eware: spatio-temporal social event detection using a hybrid learning model69
Artificial intelligence for improving Nitrogen Dioxide forecasting of Abu Dhabi environment agency ground-based stations66
Advancing multimodal emotion recognition in big data through prompt engineering and deep adaptive learning65
An adaptive k-means clustering algorithm based on grid and domain centroid weights for digital twins in the context of digital transformation65
Survey on terminology extraction from texts63
Review of deep learning methods for remote sensing satellite images classification: experimental survey and comparative analysis61
Developing insights from the collective voice of target users in Twitter60
Scalable approach for high-resolution land cover: a case study in the Mediterranean Basin59
Modeling the impact of BDA-AI on sustainable innovation ambidexterity and environmental performance56
Xai-driven knowledge distillation of large language models for efficient deployment on low-resource devices55
Contrastive self-supervised representation learning framework for metal surface defect detection54
Hajj pilgrimage abnormal crowd movement monitoring using optical flow and FCNN54
SMT efficiency in supervised ML methods: a throughput and interference analysis54
Disaggregating IMERG satellite precipitation over Czech Republic: an innovative approach using hybrid Extreme Gradient Boosting based on Fuzzy Spatial-Temporal Multivariate Clustering54
Predicting startup success using two bias-free machine learning: resolving data imbalance using generative adversarial networks53
Novel mathematical model for the classification of music and rhythmic genre using deep neural network53
GEN-SECHEALTH: an AI-powered generative architecture for self-scalable cybersecurity and flexible data privacy protection in intricate healthcare systems53
Efficient surface crack segmentation for industrial and civil applications based on an enhanced YOLOv8 model53
Hybrid wrapper feature selection method based on genetic algorithm and extreme learning machine for intrusion detection52
Traffic flow prediction based on depthwise separable convolution fusion network52
Fast agglomerative clustering using approximate traveling salesman solutions50
Blockchain with improved deep residual shrinking network for ensuring cybersecurity in IoT-driven healthcare systems48
Meta-transformer: leveraging metaheuristic algorithms for agricultural commodity price forecasting48
Advancing hospital healthcare: achieving IoT-based secure health monitoring through multilayer machine learning47
From distributed machine to distributed deep learning: a comprehensive survey46
Machine learning-based network intrusion detection for big and imbalanced data using oversampling, stacking feature embedding and feature extraction46
Advancing stock price prediction through the development of hybrid ensembles: a comprehensive comparative analysis of machine learning approaches46
Short-term photovoltaic power production forecasting based on novel hybrid data-driven models46
Surface defect detection on bolt surface using a real-time fine-tuned YOLOv6 model45
Machine learning based customer churn prediction in home appliance rental business45
Optimizing IoT intrusion detection system: feature selection versus feature extraction in machine learning44
Metamorphosing forex: advancements in volatility forecasting using a modified fuzzy time series framework43
Efficient spatial data partitioning for distributed $$k$$NN joins43
Air particulate matter (PM2.5) concentration prediction in Kuwait using vision transformer43
The use of class imbalanced learning methods on ULSAM data to predict the case–control status in genome-wide association studies43
Churn management in hospitality43
PoLYTC: a novel BERT-based classifier to detect political leaning of YouTube videos based on their titles42
Comparative analysis of binary and one-class classification techniques for credit card fraud data42
Keratin-net: a lightweight self supervised fusion framework for simultaneous classification and localization of keratinization in oral cancer histopathology40
Fuzzy deep learning architecture for cucumber plant disease detection and classification39
Empowering sentiment analysis in social media: a comprehensive approach to enhance the classification of abusive Tamil comments using transformer models39
Advanced multilevel feature fusion framework for enhanced image retrieval using convolutional neural network and benchmark datasets39
Emotion AWARE: an artificial intelligence framework for adaptable, robust, explainable, and multi-granular emotion analysis39
Self-organizing maps to evaluate optimal strategies for balancing binary class distributions: a methodological approach38
Multi combination pattern labeling by using deep learning for chameleon rotary machine environment38
Enhancing cardiac diagnostics: a deep learning ensemble approach for precise ECG image classification37
Governance and sustainability of distributed continuum systems: a big data approach37
A novel ST-iTransformer model for spatio-temporal ambient air pollution forecasting37
Optimization of credit marketing strategy based on customer demand and RNN37
Enhanced ransomware attacks detection using feature selection, sensitivity analysis, and optimized hybrid model37
Helformer: an attention-based deep learning model for cryptocurrency price forecasting36
CardiaTics: An explainable AI integrated heart disease diagnosis model with feature engineering and stacked ensemble approach36
Siamese Graph Convolutional Split-Attention Network with NLP based Social Sentimental Data for enhanced stock price predictions36
Enhancing academic performance prediction with temporal graph networks for massive open online courses36
Ramifications of incorrect image segmentations; emphasizing on the potential effects on deep learning methods failure36
Machine learning model for malaria risk prediction based on mutation location of large-scale genetic variation data36
A deep contrastive learning-based image retrieval system for automatic detection of infectious cattle diseases35
Liquid biopsy-based identification of prognostic and immunotherapeutically relevant gene signatures in lower grade glioma34
Determinating clusters with a higher proportion of long-term care discharges from hospitals: a nationwide Portuguese study using clustering and decision tree methods34
Uncertainty-aware approach for multiple imputation using conventional and machine learning models: a real-world data study33
Privacy preserved incremental record linkage33
A deep learning-based framework for large-scale plant disease detection using big data analytics in precision agriculture32
Data augmentation for dense passage retrieval using corpus-passage frequency-based token deletion32
Optimizing group utility in itinerary planning: a strategic and crowd-aware approach31
Multi-sample $$\zeta $$-mixup: richer, more realistic synthetic samples from a p-series interpolant31
Research on sentiment analysis method of opinion mining based on multi-model fusion transfer learning31
Data-driven ticket pricing in football: leveraging customer perceived value for revenue optimization30
Deep features fusion for KCF-based moving object tracking30
Big Data Analytics-based life cycle sustainability assessment for sustainable manufacturing enterprises evaluation30
A patent push method and system based on the product life cycle multi-dimensional classification30
Data pipeline approaches in serverless computing: a taxonomy, review, and research trends29
Generative AI in depth: A survey of recent advances, model variants, and real-world applications29
HepScope: CNN-based single-cell discrimination of malignant hepatocytes29
Comprehensive review of artificial intelligence applications in renewable energy systems: current implementations and emerging trends29
A systematic review on big data applications and scope for industrial processing and healthcare sectors28
Optical electrocardiogram based heart disease prediction using hybrid deep learning28
AI crypto trading: multi-class multi-granular analysis for boosting high-frequency trade predictions with fibonacci and hybrid convolutional neural networks28
Tumor antigens and immune subtypes of glioblastoma: the fundamentals of mRNA vaccine and individualized immunotherapy development28
Enhancing data discovery with contextual pre-filtering28
Identification of key drought-tolerant genes in soybean using an integrative data-driven feature engineering pipeline28
Quantifying football player value: a novel combined rating and ML ensemble approach for predicting top players28
A novel sub-network level ensemble deep neural network with a regularized loss function to improve prediction performance28
Federated Freeze BERT for text classification28
Text summarization based on semantic graphs: an abstract meaning representation graph-to-text deep learning approach27
Online variational Gaussian process for time series data27
Distinguishing novel coronavirus influenza A virus pneumonia with CT radiomics and clinical features27
Sentiment analysis classification system using hybrid BERT models27
Deep learning for component fault detection in electricity transmission lines27
Plant disease detection and classification techniques: a comparative study of the performances26
The differences in gastric cancer epidemiological data between SEER and GBD: a joinpoint and age-period-cohort analysis26
A systematic review of AI-enhanced techniques in credit card fraud detection26
Extended version of decision making model for industrial robot selection via fractional continuous fuzzy information26
Data analysis for vague contingency data26
An enhanced machine learning framework for accurate diagnosis of tuberculous pleural effusion25
Decoding defensive performance: a machine learning approach to football player valuation25
De-occlusion and recognition of frontal face images: a comparative study of multiple imputation methods25
Deep learning enhancing banking services: a hybrid transaction classification and cash flow prediction approach25
Block-level masking and feature importance-based adversarial example generation25
Big data processing using hybrid Gaussian mixture model with salp swarm algorithm24
The evolution of the European football transfer network24
Spatial heterogeneities in acute lower respiratory infections prevalence and determinants across Ethiopian administrative zones23
FEL-FRN: fusion ECA long-CLIP feature reconstruction network for few-shot classification23
A fuel consumption-based method for developing local-specific CO2 emission rate database using open-source big data23
An enhanced random forest approach using CoClust clustering: MIMIC-III and SMS spam collection application22
Text-to-video generators: a comprehensive survey22
Unsupervised hyperspectral image segmentation of films: a hierarchical clustering-based approach22
Data reduction techniques for highly imbalanced medicare Big Data22
Atsa: a novel augmented low-discrepancy sequence initialized tunicate swarm-based exponential local escaping operator for engineering design applications22
A machine learning-based credit risk prediction engine system using a stacked classifier and a filter-based feature selection method21
Iterative cleaning and learning of big highly-imbalanced fraud data using unsupervised learning21
Evaluation is key: a survey on evaluation measures for synthetic time series21
Utilizing AI models to identify and predict phase transition patterns of bipolar disorder patients21
Unsupervised label generation for severely imbalanced fraud data21
The state of metaverse research: a bibliometric visual analysis based on CiteSpace21
Venue-aware researcher impact assessment using genetic programming21
Hemorrhage semantic segmentation in fundus images for the diagnosis of diabetic retinopathy by using a convolutional neural network21
Hyperdimensional computing: a framework for stochastic computation and symbolic AI21
Operationalizing and automating Data Governance21
Multi strategy Horned Lizard Optimization Algorithm for complex optimization and advanced feature selection problems20
An optimized hybrid ensemble machine learning model combining multiple classifiers for detecting advanced persistent threats in networks20
Photograph-based machine learning approach for automated detection and differentiation of aerial blight disease in soybean crops20
Evaluation of predictive performance of modeling hyperuricemia using medical big data: comparison of data preprocessing methods20
Main memory controller with multiple media technologies for big data workloads20
Axial compressive behavior of reinforced concrete-filled circular steel tubular columns: finite element and machine learning modelling20
Comparative evaluation of generative audio models for precise disease classification via sound-based datasets20
Data analysis for sequential contingencies under uncertainty20
Towards a deep learning-based outlier detection approach in the context of streaming data20
A real-time predicting online tool for detection of people’s emotions from Arabic tweets based on big data platforms20
Big data: an optimized approach for cluster initialization19
Breast cancer diagnosis with MFF-HistoNet: a multi-modal feature fusion network integrating CNNs and quantum tensor networks19
IGRF-RFE: a hybrid feature selection method for MLP-based network intrusion detection on UNSW-NB15 dataset19
Deep reinforcement learning for job scheduling on load-aware heterogeneous cluster19
Multi-level lag scheme significantly improves training efficiency in deep learning: a case study in air quality alert service over sub-tropical area19
An efficient weighted slime mould algorithm for engineering optimization19
RILS-ROLS: robust symbolic regression via iterated local search and ordinary least squares19
Deep reinforcement learning for data-efficient weakly supervised business process anomaly detection19
Machine learning-based interactive dynamic resilience assessment for complex hydropower systems19
Information preservation-based hashing for image retrieval19
Scalable and space-efficient Robust Matroid Center algorithms19
Bilingual hate speech detection on social media: Amharic and Afaan Oromo19
Transfer learning approach based on satellite image time series for the crop classification problem19
Readers’ affect: predicting and understanding readers’ emotions with deep learning19
Combining review elements for modelling various multi-criteria collaborative recommendation models19
Application of supervised machine learning models in human emotion classification using Tsallis entropy as a feature18
Blind Federated Learning without initial model18
Computational methods for predicting the outcome of thoracic transplantation18
Capturing research literature attitude towards sustainable development goals: an LLM-based topic modeling approach18
Enhanced EEG-based detection of major depressive disorder using maximum likelihood estimation and machine learning18
Learning manifolds from non-stationary streams18
Data engineering for sustainable agriculture: developments, challenges, and case studies of a novel IoRT architecture18
Intelligent average utility pattern analysis using pre-large concept in dynamic stream data18
Fitcam: detecting and counting repetitive exercises with deep learning18
Introducing Mplots: scaling time series recurrence plots to massive datasets18
A computational analysis of aspect-based sentiment analysis research through bibliometric mapping and topic modeling17
Student academic performance prediction via hypergraph and TabNet17
The wisdom of the lexicon crowds: leveraging on decades of lexicon-based sentiment analysis for improved results17
Artifact-free fat-water separation in Dixon MRI using deep learning17
Leveraging ensemble learning-based stock preselection with multiobjective investment optimization for stepwise decision-supported portfolio management17
Integrated multiomics analysis and advanced machine learning techniques to refine molecular subtypes, stratify prognosis, characterize tumor microenvironment, and identify distinct sensitivity pattern17
Enhancing public art communication through emotional intelligence based on type-2 fractional fuzzy sets17
A unified representation and transformation of multi-model data using category theory17
Decision support system for handling control decisions and decision-maker related to supply chain17
Cyberattack detection in wireless sensor networks using a hybrid feature reduction technique with AI and machine learning methods17
DLA-E: a deep learning accelerator for endoscopic images classification16
AI-driven biomedical perspectives on mental fatigue in the post-COVID-19 Era: trends, research gaps, and future directions16
Exploring AI-driven approaches for unstructured document analysis and future horizons16
Retinal photograph-based deep learning system for detection of hyperthyroidism: a multicenter, diagnostic study16
The community discovery method in heterogeneous networks: a survey16
Application of deep learning technique in next generation sequence experiments16
Big social data as a service (BSDaaS): a service composition framework for social media analysis16
Advanced machine learning techniques for cardiovascular disease early detection and diagnosis16
Facial emotion recognition using deep Siamese neural networks: multi-classifier fusion for single-emotion and multi-emotion models across age groups16
Robust metaheuristic algorithms with sequential replacement-improved dynamic population for optimizing energy consumption in a UAV-empowered IoT data collection system16
Multi-omics Mendelian randomization revealing SEMA7A as potential drug target for facial skin aging16
Multivariate response directional regression: a projective resampling approach16
Architecture for determining the cleanliness in shared vehicles using an integrated machine vision and indoor air quality-monitoring system16
Optimal Markowitz portfolio using returns forecasted with time series and machine learning models15
An AI-driven multimodal developmental disability detection and intervention framework using enhanced speech and behavioral analysis with BioNeuroFusionNet classification15
A novel approach for detecting deep fake videos using graph neural network15
B-CAT: a model for detecting botnet attacks using deep attack behavior analysis on network traffic flows15
Improving the identification of influential nodes in complex networks using semi-local structures and shortest-path-based centralities15
Optimization-based convolutional neural model for the classification of white blood cells15
A bibliometric analysis of the first decade of the Journal of Big Data15
Feature selection strategies: a comparative analysis of SHAP-value and importance-based methods15
A multi-manifold learning based instance weighting and under-sampling for imbalanced data classification problems15
Awareness routing algorithm in vehicular ad-hoc networks (VANETs)15
Task-aware conditional GAN with multi-objective loss for realistic and efficient industrial time series generation15
Guardians of digital safety: benchmarking large language models in the fight against online toxicity15
Advances in ECG and PCG-based cardiovascular disease classification: a review of deep learning and machine learning methods15
Unified platform for storing, retrieving, and analysing biomechanical applications data using graph database15
Developing a negative speech emotion recognition model for safety systems using deep learning14
Bilingual video captioning model for enhanced video retrieval14
A toolbox for volleyball data analytics: a case study on the italian women’s league14
Investigating the role of conversational agents in augmented cultural heritage tours14
Simulating imprecise data: sine–cosine and convolution methods with neutrosophic normal distribution14
Comparative analysis of AI techniques in pavement preservation materials: ensemble learning, deep learning, and simplified variance-matching diffusion model based data augmentation14
Implementation of novel loss and activation function-based deep network in big data analytics for pattern discovery across multiple data sources14
Data-driven correlation mapping of H-index and its variants in mathematics14
An ensemble method for estimating the number of clusters in a big data set using multiple random samples14
Unsupervised feature selection and class labeling for credit card fraud13
Trends in real-time artificial intelligence methods in sports: a systematic review13
Attribute annotation and bias evaluation in visual datasets for autonomous driving13
Identifying the determinants of soccer coach turnover by competing risks models: a case study on Italian League Serie A13
Toward a smart health: big data analytics and IoT for real-time miscarriage prediction13
DAPS diagrams for defining Data Science projects13
‘Everything is data’: towards one big data ecosystem using multiple sources of data on higher education in Indonesia13
CMMamba: channel mixing Mamba for time series forecasting13
Identifying key genetic variants in Alzheimer’s disease progression using Graph Convolutional Networks (GCN) and biological impact analysis13
Survey of transformers and towards ensemble learning using transformers for natural language processing13
The power of big data mining to improve the health care system in the United Arab Emirates13
Aspect-level sentiment classification with fused local and global context13
Adapting transformer-based language models for heart disease detection and risk factors extraction13
Personalized region of interest recommendation through adaptive fusion of multi-dimensional user preferences13
0.32057285308838