Biodata Mining

Papers
(The TQCC of Biodata Mining is 6. The table below lists the papers that meet that threshold based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2020-11-01 to 2024-11-01.)
Article | Citations
The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation | 457
ChatGPT and large language models in academia: opportunities and challenges | 171
The Matthews correlation coefficient (MCC) should replace the ROC AUC as the standard metric for assessing binary classification | 130
Identification of the active substances and mechanisms of ginger for the treatment of colon cancer based on network pharmacology and molecular docking | 63
ISLAND: in-silico proteins binding affinity prediction using sequence information | 45
Gaussian noise up-sampling is better suited than SMOTE and ADASYN for clinical decision making | 33
Exploring active ingredients and function mechanisms of Ephedra-bitter almond for prevention and treatment of Corona virus disease 2019 (COVID-19) based on network pharmacology | 32
Indels in SARS-CoV-2 occur at template-switching hotspots | 27
Acoustic and language analysis of speech for suicidal ideation among US veterans | 27
Evaluation of different approaches for missing data imputation on features associated to genomic data | 26
Detection of iron deficiency anemia by medical images: a comparative study of machine learning algorithms | 25
A comparison of methods for interpreting random forest models of genetic association in the presence of non-additive interactions | 22
Machine Learning Algorithms for understanding the determinants of under-five Mortality | 21
Development and validation of a novel blending machine learning model for hospital mortality prediction in ICU patients with Sepsis | 19
A self-inspected adaptive SMOTE algorithm (SASMOTE) for highly imbalanced data classification in healthcare | 18
Benchmarking AutoML frameworks for disease prediction using medical claims | 18
LPI-EnEDT: an ensemble framework with extra tree and decision tree classifiers for imbalanced lncRNA-protein interaction data classification | 18
eQTpLot: a user-friendly R package for the visualization of colocalization between eQTL and GWAS signals | 15
Prediction of the risk of developing end-stage renal diseases in newly diagnosed type 2 diabetes mellitus using artificial intelligence algorithms | 14
Data analytics and clinical feature ranking of medical records of patients with sepsis | 13
Influenza, dengue and common cold detection using LSTM with fully connected neural network and keywords selection | 12
Feature selection using distributions of orthogonal PLS regression vectors in spectral data | 12
Mechanistic modeling of the SARS-CoV-2 disease map | 11
Machine learning approaches for the genomic prediction of rheumatoid arthritis and systemic lupus erythematosus | 11
Comparative analysis, applications, and interpretation of electronic health record-based stroke phenotyping methods | 11
Prediction of short-term mortality in acute heart failure patients using minimal electronic health record data | 11
LightCUD: a program for diagnosing IBD based on human gut microbiome data | 10
Robust and rigorous identification of tissue-specific genes by statistically extending tau score | 10
PredictPTB: an interpretable preterm birth prediction model using attention-based recurrent neural networks | 10
Machine learning and statistical approaches for classification of risk of coronary artery disease using plasma cytokines | 10
Estimating sequencing error rates using families | 9
Prediction of synergistic drug combinations using PCA-initialized deep learning | 9
Integrating pathway knowledge with deep neural networks to reduce the dimensionality in single-cell RNA-seq data | 9
COVID-TRACK: world and USA SARS-COV-2 testing and COVID-19 tracking | 9
Interpretable recurrent neural network models for dynamic prediction of the extubation failure risk in patients with invasive mechanical ventilation in the intensive care unit | 9
Privacy-preserving chi-squared test of independence for small samples | 8
Humans and machines in biomedical knowledge curation: hypertrophic cardiomyopathy molecular mechanisms’ representation | 8
iGlioSub: an integrative transcriptomic and epigenomic classifier for glioblastoma molecular subtypes | 8
New neural network classification method for individuals ancestry prediction from SNPs data | 6
A prognostic model based on seven immune-related genes predicts the overall survival of patients with hepatocellular carcinoma | 6
Prescreening and treatment of aortic dissection through an analysis of infinite-dimension data | 6
An efficient computational method for predicting drug-target interactions using weighted extreme learning machine and speed up robot features | 6