Biodata Mining

Papers
(The median citation count of Biodata Mining is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-11-01 to 2024-11-01.)
ArticleCitations
The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation457
ChatGPT and large language models in academia: opportunities and challenges171
The Matthews correlation coefficient (MCC) should replace the ROC AUC as the standard metric for assessing binary classification130
Identification of the active substances and mechanisms of ginger for the treatment of colon cancer based on network pharmacology and molecular docking63
ISLAND: in-silico proteins binding affinity prediction using sequence information45
Gaussian noise up-sampling is better suited than SMOTE and ADASYN for clinical decision making33
Exploring active ingredients and function mechanisms of Ephedra-bitter almond for prevention and treatment of Corona virus disease 2019 (COVID-19) based on network pharmacology32
Indels in SARS-CoV-2 occur at template-switching hotspots27
Acoustic and language analysis of speech for suicidal ideation among US veterans27
Evaluation of different approaches for missing data imputation on features associated to genomic data26
Detection of iron deficiency anemia by medical images: a comparative study of machine learning algorithms25
A comparison of methods for interpreting random forest models of genetic association in the presence of non-additive interactions22
Machine Learning Algorithms for understanding the determinants of under-five Mortality21
Development and validation of a novel blending machine learning model for hospital mortality prediction in ICU patients with Sepsis19
Benchmarking AutoML frameworks for disease prediction using medical claims18
LPI-EnEDT: an ensemble framework with extra tree and decision tree classifiers for imbalanced lncRNA-protein interaction data classification18
A self-inspected adaptive SMOTE algorithm (SASMOTE) for highly imbalanced data classification in healthcare18
eQTpLot: a user-friendly R package for the visualization of colocalization between eQTL and GWAS signals15
Prediction of the risk of developing end-stage renal diseases in newly diagnosed type 2 diabetes mellitus using artificial intelligence algorithms14
Data analytics and clinical feature ranking of medical records of patients with sepsis13
Influenza, dengue and common cold detection using LSTM with fully connected neural network and keywords selection12
Feature selection using distributions of orthogonal PLS regression vectors in spectral data12
Comparative analysis, applications, and interpretation of electronic health record-based stroke phenotyping methods11
Prediction of short-term mortality in acute heart failure patients using minimal electronic health record data11
Mechanistic modeling of the SARS-CoV-2 disease map11
Machine learning approaches for the genomic prediction of rheumatoid arthritis and systemic lupus erythematosus11
PredictPTB: an interpretable preterm birth prediction model using attention-based recurrent neural networks10
Machine learning and statistical approaches for classification of risk of coronary artery disease using plasma cytokines10
LightCUD: a program for diagnosing IBD based on human gut microbiome data10
Robust and rigorous identification of tissue-specific genes by statistically extending tau score10
Interpretable recurrent neural network models for dynamic prediction of the extubation failure risk in patients with invasive mechanical ventilation in the intensive care unit9
Estimating sequencing error rates using families9
Prediction of synergistic drug combinations using PCA-initialized deep learning9
Integrating pathway knowledge with deep neural networks to reduce the dimensionality in single-cell RNA-seq data9
COVID-TRACK: world and USA SARS-COV-2 testing and COVID-19 tracking9
iGlioSub: an integrative transcriptomic and epigenomic classifier for glioblastoma molecular subtypes8
Privacy-preserving chi-squared test of independence for small samples8
Humans and machines in biomedical knowledge curation: hypertrophic cardiomyopathy molecular mechanisms’ representation8
Prescreening and treatment of aortic dissection through an analysis of infinite-dimension data6
An efficient computational method for predicting drug-target interactions using weighted extreme learning machine and speed up robot features6
New neural network classification method for individuals ancestry prediction from SNPs data6
A prognostic model based on seven immune-related genes predicts the overall survival of patients with hepatocellular carcinoma6
Identification of natural selection in genomic data with deep convolutional neural network5
Predicting molecular initiating events using chemical target annotations and gene expression5
mSRFR: a machine learning model using microalgal signature features for ncRNA classification5
Probability calibration-based prediction of recurrence rate in patients with diffuse large B-cell lymphoma5
Comparison of 16S and whole genome dog microbiomes using machine learning5
RASMA: a reverse search algorithm for mining maximal frequent subgraphs5
Prediction of MoRFs based on sequence properties and convolutional neural networks5
Neural network methods for diagnosing patient conditions from cardiopulmonary exercise testing data5
Genetic risk score for ovarian cancer based on chromosomal-scale length variation5
Automated quantitative trait locus analysis (AutoQTL)4
Polygenic risk modeling of tumor stage and survival in bladder cancer4
Biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosis4
Assessment of emerging pretraining strategies in interpretable multimodal deep learning for cancer prognostication4
A multi-feature hybrid classification data mining technique for human-emotion4
Single_cell_GRN: gene regulatory network identification based on supervised learning method and Single-cell RNA-seq data4
Machine-learning based feature selection for a non-invasive breathing change detection4
Learning and visualizing chronic latent representations using electronic health records4
An unsupervised image segmentation algorithm for coronary angiography4
Gene function finding through cross-organism ensemble learning4
A comparative study on the unified model based multifactor dimensionality reduction methods for identifying gene-gene interactions associated with the survival phenotype3
Signature literature review reveals AHCY, DPYSL3, and NME1 as the most recurrent prognostic genes for neuroblastoma3
Personalized single-cell networks: a framework to predict the response of any gene to any drug for any patient3
iU-Net: a hybrid structured network with a novel feature fusion approach for medical image segmentation3
Performance of model-based multifactor dimensionality reduction methods for epistasis detection by controlling population structure3
iSuc-ChiDT: a computational method for identifying succinylation sites using statistical difference table encoding and the chi-square decision table classifier3
Expanding a database-derived biomedical knowledge graph via multi-relation extraction from biomedical abstracts3
Evaluating risk detection methods to uncover ontogenic-mediated adverse drug effect mechanisms in children3
DeepAutoGlioma: a deep learning autoencoder-based multi-omics data integration and classification tools for glioma subtyping3
Colorectal cancer subtype identification from differential gene expression levels using minimalist deep learning3
ParticleChromo3D: a Particle Swarm Optimization algorithm for chromosome 3D structure prediction from Hi-C data3
Machine learning approaches to identify systemic lupus erythematosus in anti-nuclear antibody-positive patients using genomic data and electronic health records3
The biomedical knowledge graph of symptom phenotype in coronary artery plaque: machine learning-based analysis of real-world clinical data2
Comparison of cancer subtype identification methods combined with feature selection methods in omics data analysis2
Ten simple rules for providing bioinformatics support within a hospital2
A new pipeline for structural characterization and classification of RNA-Seq microbiome data2
Neural network-based prognostic predictive tool for gastric cardiac cancer: the worldwide retrospective study2
Assessment of the causal relationship between gut microbiota and cardiovascular diseases: a bidirectional Mendelian randomization analysis2
Effective hybrid feature selection using different bootstrap enhances cancers classification performance2
ScInfoVAE: interpretable dimensional reduction of single cell transcription data with variational autoencoders and extended mutual information regularization2
Towards a potential pan-cancer prognostic signature for gene expression based on probesets and ensemble machine learning2
Novel digital approaches to the assessment of problematic opioid use2
Sparse self-attention aggregation networks for neural sequence slice interpolation2
Development of glaucoma predictive model and risk factors assessment based on supervised models2
Research agenda for using artificial intelligence in health governance: interpretive scoping review and framework2
Polymorphisms in the mTOR-PI3K-Akt pathway, energy balance-related exposures and colorectal cancer risk in the Netherlands Cohort Study2
Bacteria spatial tracking in Urban Park soils with MALDI-TOF Mass Spectrometry and Specific PCR2
Detecting diseases in medical prescriptions using data mining methods2
Correction: Confounding of linkage disequilibrium patterns in large scale DNA based gene-gene interaction studies2
Ten important roles for academic leaders to promote equity, diversity, and inclusion in data science2
Disclosing transcriptomics network-based signatures of glioma heterogeneity using sparse methods2
Algorithm-based detection of acute kidney injury according to full KDIGO criteria including urine output following cardiac surgery: a descriptive analysis2
DASSI: differential architecture search for splice identification from DNA sequences2
0.021030902862549