Journal of Big Data

Papers
(The median citation count of Journal of Big Data is 4. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-04-01 to 2024-04-01.)
ArticleCitations
Review of deep learning: concepts, CNN architectures, challenges, applications, future directions2276
CatBoost for big data: an interdisciplinary review370
Selecting critical features for data classification based on machine learning methods337
Cybersecurity data science: an overview from machine learning perspective238
Survey on categorical data for neural networks232
A survey on missing data in machine learning228
Performance Analysis of Intrusion Detection Systems Using a Feature Selection Method on the UNSW-NB15 Dataset201
Deep Learning applications for COVID-19188
Text Data Augmentation for Deep Learning184
A comprehensive survey of anomaly detection techniques for high dimensional big data164
Short-term stock market price trend prediction using a comprehensive deep learning system162
Boosting methods for multi-class imbalanced data classification: an experimental review160
Comparative analysis of deep learning image detection algorithms130
Predictive big data analytics for supply chain demand forecasting: methods, applications, and research opportunities119
A survey on data‐efficient algorithms in big data era112
Resampling imbalanced data for network intrusion detection datasets110
A survey and analysis of intrusion detection models based on CSE-CIC-IDS2018 Big Data108
A survey on generative adversarial networks for imbalance problems in computer vision tasks107
SICE: an improved missing data imputation technique99
A systematic review and research perspective on recommender systems94
The use of Big Data Analytics in healthcare93
A survey on deep learning tools dealing with data scarcity: definitions, challenges, solutions, tips, and applications91
A novel community detection based genetic algorithm for feature selection86
Analysis and best parameters selection for person recognition based on gait model using CNN algorithm and image augmentation82
Intrusion detection systems using long short-term memory (LSTM)78
Prediction of probable backorder scenarios in the supply chain using Distributed Random Forest and Gradient Boosting Machine learning techniques71
Part of speech tagging: a systematic review of deep learning and machine learning approaches66
A machine learning based credit card fraud detection using the GA algorithm for feature selection64
Air-pollution prediction in smart city, deep learning approach64
A literature review on one-class classification and its potential applications in big data63
Transfer learning: a friendly introduction56
Data science approach to stock prices forecasting in Indonesia during Covid-19 using Long Short-Term Memory (LSTM)52
Machine learning techniques to predict daily rainfall amount52
A survey on artificial intelligence assurance51
IGRF-RFE: a hybrid feature selection method for MLP-based network intrusion detection on UNSW-NB15 dataset50
A novel multi-source information-fusion predictive framework based on deep neural networks for accuracy enhancement in stock market prediction44
Text based personality prediction from multiple social media data sources using pre-trained language model and model averaging41
A novel method of constrained feature selection by the measurement of pairwise constraints uncertainty40
A comprehensive performance analysis of Apache Hadoop and Apache Spark for large scale data sets using HiBench39
Stress detection using natural language processing and machine learning over social interactions38
Designing a Permissioned Blockchain Network for the Halal Industry using Hyperledger Fabric with multiple channels and the raft consensus mechanism38
Deep anomaly detection through visual attention in surveillance videos37
Machine learning-based mathematical modelling for prediction of social media consumer behavior using big data analytics37
Detecting web attacks using random undersampling and ensemble learners37
Application of big data analytics and organizational performance: the mediating role of knowledge management practices36
Big data quality framework: a holistic approach to continuous quality management36
Flight delay prediction based on deep learning and Levenberg-Marquart algorithm35
Sentiment analysis of online product reviews using DLMNN and future prediction of online product using IANFIS34
Detecting cybersecurity attacks across different network features and learners34
A deep learning-based model using hybrid feature extraction approach for consumer sentiment analysis33
Enhanced credit card fraud detection based on attention mechanism and LSTM deep model33
Predictive analytics using big data for increased customer loyalty: Syriatel Telecom Company case study32
An alternative approach to dimension reduction for pareto distributed data: a case study32
Arabic text summarization using deep learning approach32
Machine learning approaches in Covid-19 severity risk prediction in Morocco32
The performance of BERT as data representation of text clustering32
Anomaly detection optimization using big data and deep learning to reduce false-positive31
Time-series analysis with smoothed Convolutional Neural Network31
Image captioning model using attention and object features to mimic human image understanding30
Big data analytics on social networks for real-time depression detection30
Apply machine learning techniques to detect malicious network traffic in cloud computing29
Hemorrhage semantic segmentation in fundus images for the diagnosis of diabetic retinopathy by using a convolutional neural network29
Predictive analytics using Big Data for the real estate market during the COVID-19 pandemic29
Determining threshold value on information gain feature selection to increase speed and prediction accuracy of random forest28
Cyberbullying detection: advanced preprocessing techniques & deep learning architecture for Roman Urdu data28
Unsupervised outlier detection in multidimensional data28
Sleep stage classification using extreme learning machine and particle swarm optimization for healthcare big data28
Real-time monitoring of traffic parameters28
Multi-layered deep learning perceptron approach for health risk prediction28
Stable bagging feature selection on medical data27
Programming big data analysis: principles and solutions26
Machine learning-based identification of patients with a cardiovascular defect26
User profile correlation-based similarity (UPCSim) algorithm in movie recommendation system25
Chronic kidney disease prediction using machine learning techniques25
Survey on RNN and CRF models for de-identification of medical free text24
Querying knowledge graphs in natural language24
Four-class emotion classification in virtual reality using pupillometry23
Investigating the impact of pre-processing techniques and pre-trained word embeddings in detecting Arabic health information on social media23
A novel time efficient learning-based approach for smart intrusion detection system22
A hybrid recommender system based-on link prediction for movie baskets analysis22
Multiclass emotion prediction using heart rate and virtual reality stimuli22
Detecting Denial of Service attacks using machine learning algorithms22
A hybrid machine learning method for increasing the performance of network intrusion detection systems22
A practical Alzheimer’s disease classifier via brain imaging-based deep learning on 85,721 samples22
Convergence of artificial intelligence and high performance computing on NSF-supported cyberinfrastructure22
A set theory based similarity measure for text clustering and classification21
Extending reference architecture of big data systems towards machine learning in edge computing environments21
A novel sensitivity-based method for feature selection21
Mapping and 3D modelling using quadrotor drone and GIS software21
Array databases: concepts, standards, implementations21
Tumor antigens and immune subtypes of glioblastoma: the fundamentals of mRNA vaccine and individualized immunotherapy development21
Gap, techniques and evaluation: traffic flow prediction using machine learning and deep learning20
Social network data analysis to highlight privacy threats in sharing data20
Multivariate cryptocurrency prediction: comparative analysis of three recurrent neural networks approaches20
Exploring big data traits and data quality dimensions for big data analytics application using partial least squares structural equation modelling20
Human behavior in image-based Road Health Inspection Systems despite the emerging AutoML20
Utilizing technologies of fog computing in educational IoT systems: privacy, security, and agility perspective20
Using Big Data-machine learning models for diabetes prediction and flight delays analytics20
Exploring halal tourism tweets on social media20
Context pre-modeling: an empirical analysis for classification based user-centric context-aware predictive modeling19
Minimum threshold determination method based on dataset characteristics in association rule mining19
Modelling customers credit card behaviour using bidirectional LSTM neural networks19
Exploring the efficacy of transfer learning in mining image-based software artifacts19
IoT Big Data provenance scheme using blockchain on Hadoop ecosystem19
A hybrid semantic query expansion approach for Arabic information retrieval19
Big data insight on global mobility during the Covid-19 pandemic lockdown19
Skin-Net: a novel deep residual network for skin lesions classification using multilevel feature extraction and cross-channel correlation with detection of outlier19
Optimized hybrid investigative based dimensionality reduction methods for malaria vector using KNN classifier19
A survey of methods supporting cyber situational awareness in the context of smart cities19
Remote patient monitoring and classifying using the internet of things platform combined with cloud computing18
Using social media for sub-event detection during disasters18
Preventive healthcare policies in the US: solutions for disease management using Big Data Analytics18
Modeling and tracking Covid-19 cases using Big Data analytics on HPCC system platform18
Governance and sustainability of distributed continuum systems: a big data approach18
The forecast of COVID-19 spread risk at the county level17
IDS-attention: an efficient algorithm for intrusion detection systems using attention mechanism17
Plant diseases detection with low resolution data using nested skip connections17
Uncovering trend-based research insights on teaching and learning in big data17
A survey of dimension reduction and classification methods for RNA-Seq data on malaria vector17
Arabic aspect sentiment polarity classification using BERT17
CRNet: a multimodal deep convolutional neural network for customer revisit prediction17
A reconstruction error-based framework for label noise detection17
Predicting LQ45 financial sector indices using RNN-LSTM17
Impact of rail transit station proximity to commercial property prices: utilizing big data in urban real estate17
Group based emotion recognition from video sequence with hybrid optimization based recurrent fuzzy neural network17
Deep learning for emotion analysis in Arabic tweets17
Automatic LIDAR building segmentation based on DGCNN and euclidean clustering17
Analyzing MRI scans to detect glioblastoma tumor using hybrid deep belief networks17
Using meta-learning for automated algorithms selection and configuration: an experimental framework for industrial big data17
The use of generative adversarial networks to alleviate class imbalance in tabular data: a survey17
Hybrid gradient descent spider monkey optimization (HGDSMO) algorithm for efficient resource scheduling for big data processing in heterogenous environment16
Real-time spatio-temporal event detection on geotagged social media16
DV-DVFS: merging data variety and DVFS technique to manage the energy consumption of big data processing16
The effect of feature extraction and data sampling on credit card fraud detection16
Performance evaluation of deep learning techniques for DoS attacks detection in wireless sensor network16
Modeling the public attitude towards organic foods: a big data and text mining approach15
A distributed Content-Based Video Retrieval system for large datasets15
An unsupervised method for social network spammer detection based on user information interests15
Sensing and making sense of tourism flows and urban data to foster sustainability awareness: a real-world experience15
An analytics model for TelecoVAS customers’ basket clustering using ensemble learning approach14
Leveraging machine learning and big data for optimizing medication prescriptions in complex diseases: a case study in diabetes management14
Evaluating classifier performance with highly imbalanced Big Data14
Deep learning-based question answering system for intelligent humanoid robot14
Towards data sharing economy on Internet of Things: a semantic for telemetry data14
The best statistical model to estimate predictors of under-five mortality in Ethiopia14
Developing a mathematical model of the co-author recommender system using graph mining techniques and big data applications13
A scalable association rule learning heuristic for large datasets13
Automatic analysis of social media images to identify disaster type and infer appropriate emergency response13
Investigating the relationship between time and predictive model maintenance13
DaLiF: a data lifecycle framework for data-driven governments12
Multi-criteria collaborative filtering recommender by fusing deep neural network and matrix factorization12
An exploratory content and sentiment analysis of the guardian metaverse articles using leximancer and natural language processing12
An LSTM and GRU based trading strategy adapted to the Moroccan market12
On K-means clustering-based approach for DDBSs design12
Unsupervised feature learning-based encoder and adversarial networks12
Implementation of Long Short-Term Memory and Gated Recurrent Units on grouped time-series data to predict stock prices accurately12
Review of deep learning methods for remote sensing satellite images classification: experimental survey and comparative analysis12
Defining user spectra to classify Ethereum users based on their behavior12
Identification of mRNA vaccines and conserved ferroptosis related immune landscape for individual precision treatment in bladder cancer12
Understanding quality of analytics trade-offs in an end-to-end machine learning-based classification system for building information modeling12
Social media text analytics of Malayalam–English code-mixed using deep learning12
Traffic and road conditions monitoring system using extracted information from Twitter12
Data-centric artificial intelligence in oncology: a systematic review assessing data quality in machine learning models for head and neck cancer11
Identification of distinguishing characteristics of intersections based on statistical analysis and data from video cameras11
Detection of fake news and hate speech for Ethiopian languages: a systematic review of the approaches11
Tabular and latent space synthetic data generation: a literature review11
Data analytics for crop management: a big data view11
Model fusion of deep neural networks for anomaly detection11
Predictors of outpatients’ no-show: big data analytics using apache spark11
Optimization of air traffic management efficiency based on deep learning enriched by the long short-term memory (LSTM) and extreme learning machine (ELM)11
An analysis of the graph processing landscape11
Inferring the votes in a new political landscape: the case of the 2019 Spanish Presidential elections11
Social media analysis of Twitter tweets related to ASD in 2019–2020, with particular attention to COVID-19: topic modelling and sentiment analysis10
SDPSO: Spark Distributed PSO-based approach for feature selection and cancer disease prognosis10
Rating prediction of peer-to-peer accommodation through attributes and topics from customer review10
A comparison of machine learning methods for ozone pollution prediction10
A parallelization model for performance characterization of Spark Big Data jobs on Hadoop clusters10
A new theoretical understanding of big data analytics capabilities in organizations: a thematic analysis10
IoT information theft prediction using ensemble feature selection10
Real-time event detection in social media streams through semantic analysis of noisy terms10
Design matters in patient-level prediction: evaluation of a cohort vs. case-control design when developing predictive models in observational healthcare datasets10
Diabetes emergency cases identification based on a statistical predictive model10
DHPV: a distributed algorithm for large-scale graph partitioning10
Multi Region-Based Feature Connected Layer (RB-FCL) of deep learning models for bone age assessment10
AraXLNet: pre-trained language model for sentiment analysis of Arabic9
The use of knowledge extraction in predicting customer churn in B2B9
Domain randomization for neural network classification9
A novel approach for learning ontology from relational database: from the construction to the evaluation9
Image caption generation using Visual Attention Prediction and Contextual Spatial Relation Extraction9
Predicting clinical outcomes of radiotherapy for head and neck squamous cell carcinoma patients using machine learning algorithms9
Optimizing classification efficiency with machine learning techniques for pattern matching9
An analysis of COVID-19 economic measures and attitudes: evidence from social media mining9
Sentiment analysis classification system using hybrid BERT models9
Adaptive multiple imputations of missing values using the class center9
Big data fuzzy C-means algorithm based on bee colony optimization using an Apache Hbase9
Why polls fail to predict elections9
Cooperative co-evolution for feature selection in Big Data with random feature grouping9
Breast cancer prediction using gated attentive multimodal deep learning8
A brief survey on big data: technologies, terminologies and data-intensive applications8
Big data actionable intelligence architecture8
Towards more efficient CNN-based surgical tools classification using transfer learning8
An intelligent literature review: adopting inductive approach to define machine learning applications in the clinical domain8
The stability of different aggregation techniques in ensemble feature selection8
A data value metric for quantifying information content and utility8
Artificial intelligence paradigm for ligand-based virtual screening on the drug discovery of type 2 diabetes mellitus8
Mining frequent itemsets from streaming transaction data using genetic algorithms8
Threshold optimization and random undersampling for imbalanced credit card data8
Time series big data: a survey on data stream frameworks, analysis and algorithms8
Artificial intelligence models for prediction of monthly rainfall without climatic data for meteorological stations in Ethiopia8
Deep learning enhancing banking services: a hybrid transaction classification and cash flow prediction approach8
Class center-based firefly algorithm for handling missing data8
Twitter sentiment analysis using hybrid gated attention recurrent network8
Opinion mining for national security: techniques, domain applications, challenges and research opportunities8
Vehicle routing problems based on Harris Hawks optimization8
Deep-Eware: spatio-temporal social event detection using a hybrid learning model8
Remote sensing detection enhancement7
Designing a relational model to identify relationships between suspicious customers in anti-money laundering (AML) using social network analysis (SNA)7
Time series modeling of road traffic accidents in Amhara Region7
Development of a regional voice dataset and speaker classification based on machine learning7
Exploration of the investment patterns of potential retail banking customers using two-stage cluster analysis7
Regularized Simple Graph Convolution (SGC) for improved interpretability of large datasets7
Cursor movement detection in brain-computer-interface systems using the K-means clustering method and LSVM7
Deep learning for component fault detection in electricity transmission lines7
Analysis of Bayesian optimization algorithms for big data classification based on Map Reduce framework7
Two-stage credit scoring using Bayesian approach7
Machine learning based customer churn prediction in home appliance rental business7
Comparing traditional news and social media with stock price movements; which comes first, the news or the price change?7
Analyzing Bangkok city taxi ride: reforming fares for profit sustainability using big data driven model7
A survey on bandwidth-aware geo-distributed frameworks for big-data analytics7
Forex market forecasting using machine learning: Systematic Literature Review and meta-analysis7
Healthcare knowledge graph construction: A systematic review of the state-of-the-art, open issues, and opportunities7
Sentiment analysis for cruises in Saudi Arabia on social media platforms using machine learning algorithms7
Comparative analysis of binary and one-class classification techniques for credit card fraud data7
Enabling real time big data solutions for manufacturing at scale7
Machine learning-based turbulence-risk prediction method for the safe operation of aircrafts7
Fast cluster-based computation of exact betweenness centrality in large graphs7
PCJ Java library as a solution to integrate HPC, Big Data and Artificial Intelligence workloads7
NLP-based platform as a service: a brief review7
OsamorSoft: clustering index for comparison and quality validation in high throughput dataset7
Evaluation of the effectiveness and efficiency of state-of-the-art features and models for automatic speech recognition error detection6
Improving efficiency for discovering business processes containing invisible tasks in non-free choice6
Traditional food knowledge of Indonesia: a new high-quality food dataset and automatic recognition system6
A unified representation and transformation of multi-model data using category theory6
Composing high-level stream processing pipelines6
An empirical study on the evaluation of the RDF storage systems6
Estimating runtime of a job in Hadoop MapReduce6
A predictive noise correction methodology for manufacturing process datasets6
Improving lookup and query execution performance in distributed Big Data systems using Cuckoo Filter6
Accuracy improvements for cold-start recommendation problem using indirect relations in social networks6
Social media analysis of car parking behavior using similarity based clustering6
0.044488906860352