Journal of Big Data

Papers
(The median citation count of Journal of Big Data is 4. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-11-01 to 2024-11-01.)
ArticleCitations
Review of deep learning: concepts, CNN architectures, challenges, applications, future directions3128
CatBoost for big data: an interdisciplinary review538
A survey on missing data in machine learning368
Text Data Augmentation for Deep Learning254
Performance Analysis of Intrusion Detection Systems Using a Feature Selection Method on the UNSW-NB15 Dataset253
Deep Learning applications for COVID-19204
A survey on deep learning tools dealing with data scarcity: definitions, challenges, solutions, tips, and applications203
The use of Big Data Analytics in healthcare166
Comparative analysis of deep learning image detection algorithms166
A survey on data‐efficient algorithms in big data era155
A systematic review and research perspective on recommender systems153
A survey on generative adversarial networks for imbalance problems in computer vision tasks139
Resampling imbalanced data for network intrusion detection datasets137
A survey and analysis of intrusion detection models based on CSE-CIC-IDS2018 Big Data129
Intrusion detection systems using long short-term memory (LSTM)113
Transfer learning: a friendly introduction112
A machine learning based credit card fraud detection using the GA algorithm for feature selection109
A novel community detection based genetic algorithm for feature selection103
Part of speech tagging: a systematic review of deep learning and machine learning approaches95
Air-pollution prediction in smart city, deep learning approach94
IGRF-RFE: a hybrid feature selection method for MLP-based network intrusion detection on UNSW-NB15 dataset94
Analysis and best parameters selection for person recognition based on gait model using CNN algorithm and image augmentation92
A literature review on one-class classification and its potential applications in big data86
Machine learning techniques to predict daily rainfall amount76
A survey on artificial intelligence assurance64
Data science approach to stock prices forecasting in Indonesia during Covid-19 using Long Short-Term Memory (LSTM)58
A deep learning-based model using hybrid feature extraction approach for consumer sentiment analysis54
Stress detection using natural language processing and machine learning over social interactions53
A novel multi-source information-fusion predictive framework based on deep neural networks for accuracy enhancement in stock market prediction52
Text based personality prediction from multiple social media data sources using pre-trained language model and model averaging52
Time-series analysis with smoothed Convolutional Neural Network51
Enhanced credit card fraud detection based on attention mechanism and LSTM deep model49
A comprehensive performance analysis of Apache Hadoop and Apache Spark for large scale data sets using HiBench49
Big data quality framework: a holistic approach to continuous quality management45
Machine learning-based mathematical modelling for prediction of social media consumer behavior using big data analytics45
Designing a Permissioned Blockchain Network for the Halal Industry using Hyperledger Fabric with multiple channels and the raft consensus mechanism45
Detecting web attacks using random undersampling and ensemble learners44
The performance of BERT as data representation of text clustering44
Chronic kidney disease prediction using machine learning techniques43
Flight delay prediction based on deep learning and Levenberg-Marquart algorithm43
Cyberbullying detection: advanced preprocessing techniques & deep learning architecture for Roman Urdu data41
Skin-Net: a novel deep residual network for skin lesions classification using multilevel feature extraction and cross-channel correlation with detection of outlier40
A practical Alzheimer’s disease classifier via brain imaging-based deep learning on 85,721 samples40
Big data analytics on social networks for real-time depression detection40
Image captioning model using attention and object features to mimic human image understanding39
Detecting cybersecurity attacks across different network features and learners39
Determining threshold value on information gain feature selection to increase speed and prediction accuracy of random forest37
Apply machine learning techniques to detect malicious network traffic in cloud computing37
An alternative approach to dimension reduction for pareto distributed data: a case study36
Arabic text summarization using deep learning approach36
Programming big data analysis: principles and solutions35
Machine learning approaches in Covid-19 severity risk prediction in Morocco35
A novel sensitivity-based method for feature selection34
Hemorrhage semantic segmentation in fundus images for the diagnosis of diabetic retinopathy by using a convolutional neural network34
Stable bagging feature selection on medical data34
Unsupervised outlier detection in multidimensional data33
Sleep stage classification using extreme learning machine and particle swarm optimization for healthcare big data33
A hybrid machine learning method for increasing the performance of network intrusion detection systems32
Review of deep learning methods for remote sensing satellite images classification: experimental survey and comparative analysis32
User profile correlation-based similarity (UPCSim) algorithm in movie recommendation system31
Predictive analytics using Big Data for the real estate market during the COVID-19 pandemic31
Querying knowledge graphs in natural language30
Mapping and 3D modelling using quadrotor drone and GIS software30
Gap, techniques and evaluation: traffic flow prediction using machine learning and deep learning30
Sentiment analysis classification system using hybrid BERT models29
Performance evaluation of deep learning techniques for DoS attacks detection in wireless sensor network29
Detecting Denial of Service attacks using machine learning algorithms29
Tabular and latent space synthetic data generation: a literature review29
Social network data analysis to highlight privacy threats in sharing data29
Machine learning-based identification of patients with a cardiovascular defect28
Multivariate cryptocurrency prediction: comparative analysis of three recurrent neural networks approaches28
Utilizing technologies of fog computing in educational IoT systems: privacy, security, and agility perspective27
CRNet: a multimodal deep convolutional neural network for customer revisit prediction27
Plant disease detection and classification techniques: a comparative study of the performances26
Exploring big data traits and data quality dimensions for big data analytics application using partial least squares structural equation modelling26
Remote patient monitoring and classifying using the internet of things platform combined with cloud computing26
The effect of feature extraction and data sampling on credit card fraud detection26
The use of generative adversarial networks to alleviate class imbalance in tabular data: a survey26
Array databases: concepts, standards, implementations26
Tumor antigens and immune subtypes of glioblastoma: the fundamentals of mRNA vaccine and individualized immunotherapy development26
IDS-attention: an efficient algorithm for intrusion detection systems using attention mechanism26
Using social media for sub-event detection during disasters26
Arabic aspect sentiment polarity classification using BERT25
Investigating the impact of pre-processing techniques and pre-trained word embeddings in detecting Arabic health information on social media25
Human behavior in image-based Road Health Inspection Systems despite the emerging AutoML25
Automatic LIDAR building segmentation based on DGCNN and euclidean clustering25
Exploring halal tourism tweets on social media25
Minimum threshold determination method based on dataset characteristics in association rule mining24
IoT Big Data provenance scheme using blockchain on Hadoop ecosystem24
A review of graph neural networks: concepts, architectures, techniques, challenges, datasets, applications, and future directions24
An exploratory content and sentiment analysis of the guardian metaverse articles using leximancer and natural language processing24
Multiclass emotion prediction using heart rate and virtual reality stimuli24
Governance and sustainability of distributed continuum systems: a big data approach24
Predicting LQ45 financial sector indices using RNN-LSTM24
A hybrid recommender system based-on link prediction for movie baskets analysis23
Optimized hybrid investigative based dimensionality reduction methods for malaria vector using KNN classifier23
A novel time efficient learning-based approach for smart intrusion detection system23
Modelling customers credit card behaviour using bidirectional LSTM neural networks23
Deep learning for emotion analysis in Arabic tweets22
Modeling and tracking Covid-19 cases using Big Data analytics on HPCC system platform22
A reconstruction error-based framework for label noise detection21
Optimizing classification efficiency with machine learning techniques for pattern matching21
Evaluating classifier performance with highly imbalanced Big Data21
Modeling the public attitude towards organic foods: a big data and text mining approach21
DaLiF: a data lifecycle framework for data-driven governments20
Using meta-learning for automated algorithms selection and configuration: an experimental framework for industrial big data20
Big data insight on global mobility during the Covid-19 pandemic lockdown20
An LSTM and GRU based trading strategy adapted to the Moroccan market20
Automatic analysis of social media images to identify disaster type and infer appropriate emergency response20
Exploration of issues, challenges and latest developments in autonomous cars19
A brief survey on big data: technologies, terminologies and data-intensive applications19
Deep learning enhancing banking services: a hybrid transaction classification and cash flow prediction approach19
A new theoretical understanding of big data analytics capabilities in organizations: a thematic analysis19
Identification of mRNA vaccines and conserved ferroptosis related immune landscape for individual precision treatment in bladder cancer18
Detection of fake news and hate speech for Ethiopian languages: a systematic review of the approaches18
Real-time spatio-temporal event detection on geotagged social media18
The forecast of COVID-19 spread risk at the county level18
A survey of dimension reduction and classification methods for RNA-Seq data on malaria vector18
Social media text analytics of Malayalam–English code-mixed using deep learning18
An unsupervised method for social network spammer detection based on user information interests18
Healthcare knowledge graph construction: A systematic review of the state-of-the-art, open issues, and opportunities17
BEST: a web application for comprehensive biomarker exploration on large-scale data in solid tumors17
A comparison of machine learning methods for ozone pollution prediction17
Twitter sentiment analysis using hybrid gated attention recurrent network17
Image caption generation using Visual Attention Prediction and Contextual Spatial Relation Extraction17
Machine learning-based network intrusion detection for big and imbalanced data using oversampling, stacking feature embedding and feature extraction17
AraXLNet: pre-trained language model for sentiment analysis of Arabic17
A distributed Content-Based Video Retrieval system for large datasets17
DV-DVFS: merging data variety and DVFS technique to manage the energy consumption of big data processing17
Real-time event detection in social media streams through semantic analysis of noisy terms17
Data analytics for crop management: a big data view16
A scalable association rule learning heuristic for large datasets16
Sensing and making sense of tourism flows and urban data to foster sustainability awareness: a real-world experience16
Defining user spectra to classify Ethereum users based on their behavior16
Towards data sharing economy on Internet of Things: a semantic for telemetry data16
Unsupervised feature learning-based encoder and adversarial networks15
Model fusion of deep neural networks for anomaly detection15
Traffic and road conditions monitoring system using extracted information from Twitter15
Social media analysis of Twitter tweets related to ASD in 2019–2020, with particular attention to COVID-19: topic modelling and sentiment analysis14
Developing a mathematical model of the co-author recommender system using graph mining techniques and big data applications14
Machine learning based customer churn prediction in home appliance rental business14
Diabetes emergency cases identification based on a statistical predictive model14
An analytics model for TelecoVAS customers’ basket clustering using ensemble learning approach14
An analysis of the graph processing landscape14
Unsupervised outlier detection for time-series data of indoor air quality using LSTM autoencoder with ensemble method14
Breast cancer prediction using gated attentive multimodal deep learning14
Optimization of air traffic management efficiency based on deep learning enriched by the long short-term memory (LSTM) and extreme learning machine (ELM)14
Time series big data: a survey on data stream frameworks, analysis and algorithms14
NLP-based platform as a service: a brief review13
Implementation of Long Short-Term Memory and Gated Recurrent Units on grouped time-series data to predict stock prices accurately13
IoT information theft prediction using ensemble feature selection13
Predictors of outpatients’ no-show: big data analytics using apache spark13
Forex market forecasting using machine learning: Systematic Literature Review and meta-analysis13
Understanding quality of analytics trade-offs in an end-to-end machine learning-based classification system for building information modeling13
Advanced machine learning techniques for cardiovascular disease early detection and diagnosis13
Vehicle routing problems based on Harris Hawks optimization13
Data-centric artificial intelligence in oncology: a systematic review assessing data quality in machine learning models for head and neck cancer13
The evolution of Big Data in neuroscience and neurology13
Why polls fail to predict elections13
Comparing traditional news and social media with stock price movements; which comes first, the news or the price change?13
Class center-based firefly algorithm for handling missing data12
Predicting clinical outcomes of radiotherapy for head and neck squamous cell carcinoma patients using machine learning algorithms12
A parallelization model for performance characterization of Spark Big Data jobs on Hadoop clusters12
Accuracy improvements for cold-start recommendation problem using indirect relations in social networks12
Blockchain meets machine learning: a survey12
Design matters in patient-level prediction: evaluation of a cohort vs. case-control design when developing predictive models in observational healthcare datasets12
Hybrid beluga whale optimization algorithm with multi-strategy for functions and engineering optimization problems12
Deep learning for component fault detection in electricity transmission lines12
Improved cost-sensitive representation of data for solving the imbalanced big data classification problem12
A novel approach for learning ontology from relational database: from the construction to the evaluation12
An analysis of COVID-19 economic measures and attitudes: evidence from social media mining12
Web crawling based context aware recommender system using optimized deep recurrent neural network11
SDPSO: Spark Distributed PSO-based approach for feature selection and cancer disease prognosis11
Enabling real time big data solutions for manufacturing at scale11
Machine learning-based turbulence-risk prediction method for the safe operation of aircrafts11
Deep learning based deep-sea automatic image enhancement and animal species classification11
The stability of different aggregation techniques in ensemble feature selection11
Rating prediction of peer-to-peer accommodation through attributes and topics from customer review11
A unified representation and transformation of multi-model data using category theory11
A data value metric for quantifying information content and utility11
Analysis of Bayesian optimization algorithms for big data classification based on Map Reduce framework11
The use of knowledge extraction in predicting customer churn in B2B11
Artificial intelligence models for prediction of monthly rainfall without climatic data for meteorological stations in Ethiopia11
Deep-Eware: spatio-temporal social event detection using a hybrid learning model11
Adaptive multiple imputations of missing values using the class center10
Opinion mining for national security: techniques, domain applications, challenges and research opportunities10
Designing a relational model to identify relationships between suspicious customers in anti-money laundering (AML) using social network analysis (SNA)10
Time series modeling of road traffic accidents in Amhara Region10
Exploration of the investment patterns of potential retail banking customers using two-stage cluster analysis10
Artificial intelligence paradigm for ligand-based virtual screening on the drug discovery of type 2 diabetes mellitus10
Threshold optimization and random undersampling for imbalanced credit card data10
Research in computing-intensive simulations for nature-oriented civil-engineering and related scientific fields, using machine learning and big data: an overview of open problems10
A clustering-based topic model using word networks and word embeddings10
An intelligent literature review: adopting inductive approach to define machine learning applications in the clinical domain10
A large-scale sentiment analysis of tweets pertaining to the 2020 US presidential election10
Highly accurate and efficient two phase-intrusion detection system (TP-IDS) using distributed processing of HADOOP and machine learning techniques10
Sentiment analysis for cruises in Saudi Arabia on social media platforms using machine learning algorithms10
Comparative analysis of binary and one-class classification techniques for credit card fraud data10
Domain randomization for neural network classification10
A survey on bandwidth-aware geo-distributed frameworks for big-data analytics10
Design, development and performance analysis of cognitive assisting aid with multi sensor fused navigation for visually impaired people9
Noninvasive identification of Benign and malignant eyelid tumors using clinical images via deep learning system9
New distributed-topsis approach for multi-criteria decision-making problems in a big data context9
Engineering the advances of the artificial neural networks (ANNs) for the security requirements of Internet of Things: a systematic review9
Improving the accuracy of text classification using stemming method, a case of non-formal Indonesian conversation9
Development of a regional voice dataset and speaker classification based on machine learning9
Examining the impact of cross-domain learning on crime prediction9
Big data fuzzy C-means algorithm based on bee colony optimization using an Apache Hbase9
Big data actionable intelligence architecture8
‘Everything is data’: towards one big data ecosystem using multiple sources of data on higher education in Indonesia8
Two-stage credit scoring using Bayesian approach8
Towards more efficient CNN-based surgical tools classification using transfer learning8
CEU-Net: ensemble semantic segmentation of hyperspectral images using clustering8
Robust visual tracking using very deep generative model8
The state of metaverse research: a bibliometric visual analysis based on CiteSpace8
Normalization and outlier removal in class center-based firefly algorithm for missing value imputation8
KAGN:knowledge-powered attention and graph convolutional networks for social media rumor detection8
Evaluation of recent advances in recommender systems on Arabic content8
Optimizing IoT intrusion detection system: feature selection versus feature extraction in machine learning8
Sentiment analysis of Indonesian datasets based on a hybrid deep-learning strategy8
PCJ Java library as a solution to integrate HPC, Big Data and Artificial Intelligence workloads8
Cursor movement detection in brain-computer-interface systems using the K-means clustering method and LSVM8
Remote sensing detection enhancement8
Separable convolutional neural networks for facial expressions recognition8
Big Data and precision agriculture: a novel spatio-temporal semantic IoT data management framework for improved interoperability8
Scalable approach for high-resolution land cover: a case study in the Mediterranean Basin8
Machine learning concepts for correlated Big Data privacy8
Cooperative co-evolution for feature selection in Big Data with random feature grouping8
Analyzing Bangkok city taxi ride: reforming fares for profit sustainability using big data driven model7
Network intrusion detection using data dimensions reduction techniques7
A systematic review on big data applications and scope for industrial processing and healthcare sectors7
Toward a smart health: big data analytics and IoT for real-time miscarriage prediction7
A graph-based big data optimization approach using hidden Markov model and constraint satisfaction problem7
Application of microservices patterns to big data systems7
EXABSUM: a new text summarization approach for generating extractive and abstractive summaries7
Fast cluster-based computation of exact betweenness centrality in large graphs7
Detection of fickle trolls in large-scale online social networks7
Bayesian zero-inflated regression model with application to under-five child mortality7
Survey of transformers and towards ensemble learning using transformers for natural language processing7
Short-term photovoltaic power production forecasting based on novel hybrid data-driven models7
A semi-supervised short text sentiment classification method based on improved Bert model from unlabelled data7
Improving lookup and query execution performance in distributed Big Data systems using Cuckoo Filter7
An empirical study on the evaluation of the RDF storage systems7
Ontology generation for flight safety messages in air traffic management7
A novel Multi-Layer Attention Framework for visual description prediction using bidirectional LSTM7
Runtime prediction of big data jobs: performance comparison of machine learning algorithms and analytical models7
Detection and prevention of SQLI attacks and developing compressive framework using machine learning and hybrid techniques7
Social media analysis of car parking behavior using similarity based clustering6
Towards a folksonomy graph-based context-aware recommender system of annotated books6
A universal approach for multi-model schema inference6
0.058028936386108