Biodata Mining

Papers
(The median citation count of Biodata Mining is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-12-01 to 2025-12-01.)
ArticleCitations
Investigating potential drug targets for IgA nephropathy and membranous nephropathy through multi-queue plasma protein analysis: a Mendelian randomization study based on SMR and co-localization analys393
Correction: Detection and classification of long terminal repeat sequences in plant LTR-retrotransposons and their analysis using explainable machine learning352
MOCAT: multi-omics integration with auxiliary classifiers enhanced autoencoder90
Exploring the common genetic basis of metabolic syndrome-related diseases and chronic kidney disease: insights from extensive genome-wide cross-trait analyses69
Transcriptome-based network analysis related to regulatory T cells infiltration identified RCN1 as a potential biomarker for prognosis in clear cell renal cell carcinoma55
Processing imbalanced medical data at the data level with assisted-reproduction data as an example37
Deep joint learning diagnosis of Alzheimer’s disease based on multimodal feature fusion33
A simple guide to the use of Student’s t-test, Mann-Whitney U test, Chi-squared test, and Kruskal-Wallis test in biostatistics28
Polygenic risk modeling of tumor stage and survival in bladder cancer27
Skin in the game: a review of computational models of the skin25
Unsupervised clustering based coronary artery segmentation25
circGPAcorr: an integrative tool for functional annotation of circular RNAs using expression data24
Ten simple rules for providing bioinformatics support within a hospital23
Comparing new tools of artificial intelligence to the authentic intelligence of our global health students21
Neural network methods for diagnosing patient conditions from cardiopulmonary exercise testing data21
Decoding dynamic miRNA:ceRNA interactions unveils therapeutic insights and targets across predominant cancer landscapes19
Machine learning approaches to identify systemic lupus erythematosus in anti-nuclear antibody-positive patients using genomic data and electronic health records19
Colorectal cancer subtype identification from differential gene expression levels using minimalist deep learning17
Detection and classification of long terminal repeat sequences in plant LTR-retrotransposons and their analysis using explainable machine learning17
The biomedical knowledge graph of symptom phenotype in coronary artery plaque: machine learning-based analysis of real-world clinical data16
Genetics and precision health: the ecological fallacy and artificial intelligence solutions14
Mapping the evolving trend of research on efferocytosis: a comprehensive data-mining-based study13
Supervised multiple kernel learning approaches for multi-omics data integration13
Advancing preeclampsia prediction: a tailored machine learning pipeline integrating resampling and ensemble models for handling imbalanced medical data12
Using GPT-4 to write a scientific review article: a pilot evaluation study12
FISM: harnessing deep learning and reinforcement learning for precision detection of microaneurysms and retinal exudates for early diabetic retinopathy diagnosis12
Correction: Motif clustering and digital biomarker extraction for free-living physical activity analysis12
Tree-based ensemble learning models for protein-protein interactions detection: a review and experimental evaluation11
From COVID-19 to monkeypox: a novel predictive model for emerging infectious diseases11
m1A-Ensem: accurate identification of 1-methyladenosine sites through ensemble models11
FARFOOD: a database of potential interactions between food compounds and drugs11
Effective hybrid feature selection using different bootstrap enhances cancers classification performance10
Predictive modeling of ALS progression: an XGBoost approach using clinical features10
Assessment of the causal relationship between gut microbiota and cardiovascular diseases: a bidirectional Mendelian randomization analysis10
Machine learning models for reinjury risk prediction using cardiopulmonary exercise testing (CPET) data: optimizing athlete recovery10
Interpreting drug synergy in breast cancer with deep learning using target-protein inhibition profiles10
Motif clustering and digital biomarker extraction for free-living physical activity analysis9
An unsupervised image segmentation algorithm for coronary angiography9
Disclosing transcriptomics network-based signatures of glioma heterogeneity using sparse methods9
Understanding predictions of drug profiles using explainable machine learning models9
Open challenges and opportunities in federated foundation models towards biomedical healthcare9
MediNet: ensemble transfer learning approach for classification of medical drugs-related text reviews using significant combined-embeddings9
Reference-free phylogeny from sequencing data9
Ensemble feature selection and tabular data augmentation with generative adversarial networks to enhance cutaneous melanoma identification and interpretability9
Detection of iron deficiency anemia by medical images: a comparative study of machine learning algorithms8
Saliency-driven explainable deep learning in medical imaging: bridging visual explainability and statistical quantitative analysis8
Computational prediction of cellular elastic modulus from mechanosensitive gene expression at multiple biological levels7
Decoding ancestry-specific genetic risk: interpretable deep feature selection reveals prostate cancer SNP disparities in diverse populations7
Deep learning-based Emergency Department In-hospital Cardiac Arrest Score (Deep EDICAS) for early prediction of cardiac arrest and cardiopulmonary resuscitation in the emergency department7
A machine learning approach using conditional normalizing flow to address extreme class imbalance problems in personal health records7
Identification of immune-associated biomarkers of diabetes nephropathy tubulointerstitial injury based on machine learning: a bioinformatics multi-chip integrated analysis7
Changing word meanings in biomedical literature reveal pandemics and new technologies6
Endoscopy-based IBD identification by a quantized deep learning pipeline6
Optimizing age-related hearing risk predictions: an advanced machine learning integration with HHIE-S6
Optimizing accuracy and dimensionality: a swarm intelligence strategy for robust cancer genomics classification6
Correction: A prognostic model based on seven immune-related genes predicts the overall survival of patients with hepatocellular carcinoma6
6mA-StackingCV: an improved stacking ensemble model for predicting DNA N6-methyladenine site6
ChatGPT and large language models in academia: opportunities and challenges6
The Matthews correlation coefficient (MCC) should replace the ROC AUC as the standard metric for assessing binary classification6
Construction and application of medication reminder system: intelligent generation of universal medication schedule5
An explainable machine learning model for predicting postoperative cholangitis in pediatric surgical patients with pancreaticobiliary maljunction5
MultiChem: predicting chemical properties using multi-view graph attention network5
Using artificial intelligence (AI) to model clinical variant reporting for next generation sequencing (NGS) oncology assays5
Network-based multi-omics integrative analysis methods in drug discovery: a systematic review5
Evaluation of network-guided random forest for disease gene discovery5
Revealing third-order interactions through the integration of machine learning and entropy methods in genomic studies5
Exo-Tox: Identifying Exotoxins from secreted bacterial proteins5
TGNet: tensor-based graph convolutional networks for multimodal brain network analysis5
A Gated Recurrent Unit based architecture for recognizing ontology concepts from biological literature5
A regularized Cox hierarchical model for incorporating annotation information in predictive omic studies4
Machine learning based study for the classification of Type 2 diabetes mellitus subtypes4
Integrating pathway knowledge with deep neural networks to reduce the dimensionality in single-cell RNA-seq data4
Polymorphisms in the mTOR-PI3K-Akt pathway, energy balance-related exposures and colorectal cancer risk in the Netherlands Cohort Study4
Algorithm-based detection of acute kidney injury according to full KDIGO criteria including urine output following cardiac surgery: a descriptive analysis4
Cross-regional radiomics: a novel framework for relationship-based feature extraction with validation in Parkinson’s disease motor subtyping4
The ethics of data mining in healthcare: challenges, frameworks, and future directions4
ScInfoVAE: interpretable dimensional reduction of single cell transcription data with variational autoencoders and extended mutual information regularization4
Machine-learning-based models to predict cardiovascular risk using oculomics and clinic variables in KNHANES4
Enhancing hepatopathy clinical trial efficiency: a secure, large language model-powered pre-screening pipeline4
QIGTD: identifying critical genes in the evolution of lung adenocarcinoma with tensor decomposition4
Analysis of risk factors progression of preterm delivery using electronic health records4
Machine Learning Algorithms for understanding the determinants of under-five Mortality4
Deep vision in agriculture: assessing the function of YOLO in the classification of plant leaf diseases (PLDs)3
mSRFR: a machine learning model using microalgal signature features for ncRNA classification3
Leveraging mixed-effects regression trees for the analysis of high-dimensional longitudinal data to identify the low and high-risk subgroups: simulation study with application to genetic study3
Electronic medical records imputation by temporal Generative Adversarial Network3
Enhanced labor pain monitoring using machine learning and ECG waveform analysis for uterine contraction-induced pain3
AI as an accelerator for defining new problems that transcends boundaries3
Quantum analysis of squiggle data3
PAGER: A novel genotype encoding strategy for modeling deviations from additivity in complex trait association studies3
Comparison of cancer subtype identification methods combined with feature selection methods in omics data analysis3
subMG automates data submission for metagenomics studies3
Can open source large language models be used for tumor documentation in Germany?—An evaluation on urological doctors’ notes3
ParticleChromo3D: a Particle Swarm Optimization algorithm for chromosome 3D structure prediction from Hi-C data3
Deciphering the tissue-specific functional effect of Alzheimer risk SNPs with deep genome annotation3
Distinct network patterns emerge from Cartesian and XOR epistasis models: a comparative network science analysis3
Priority-Elastic net for binary disease outcome prediction based on multi-omics data3
Agenda setting for health equity assessment through the lenses of social determinants of health using machine learning approach: a framework and preliminary pilot study3
Novel digital approaches to the assessment of problematic opioid use3
From prompt engineering to agent engineering: expanding the AI toolbox with autonomous agentic AI collaborators for biomedical discovery3
Towards a potential pan-cancer prognostic signature for gene expression based on probesets and ensemble machine learning3
Short- and long-term weekly patient-reported outcomes prediction undergoing radiotherapy: single-patient time series model vs. transformer-based multi-patient time series model3
Assessing the limitations of relief-based algorithms in detecting higher-order interactions2
Predicting molecular initiating events using chemical target annotations and gene expression2
Deep learning-based approaches for multi-omics data integration and analysis2
A deep learning approach for classifying and predicting children's nutritional status in Ethiopia using LSTM-FC neural networks2
Proteome mining of Yersinia Enterocolitica for drug targets and computational inhibitor identification with ADMET, anti-inflammation potential and formulation characteristics2
Deep learning-driven TCR$$\beta$$ repertoire analysis enhances diagnosis and enables mining of immunological biomarkers in systemic lupus erythematosus2
Learning the therapeutic targets of acute myeloid leukemia through multiscale human interactome network and community analysis2
iSuc-ChiDT: a computational method for identifying succinylation sites using statistical difference table encoding and the chi-square decision table classifier2
Overlapping filter bank convolutional neural network for multisubject multicategory motor imagery brain-computer interface2
Construction and validation of a machine learning-based model predicting early readmission in patients with decompensated cirrhosis: a prospective two-center cohort study2
Private pathological assessment via machine learning and homomorphic encryption2
Automated quantitative trait locus analysis (AutoQTL)2
Influenza, dengue and common cold detection using LSTM with fully connected neural network and keywords selection2
CAUSALRLSTACK: adaptive balancing of deep representation and causal effect estimation with application to HIV-related health data2
Unsupervised encoding selection through ensemble pruning for biomedical classification2
Correction: Predictive modeling of ALS progression: an XGBoost approach using clinical features2
Feature graphs for interpretable unsupervised tree ensembles: centrality, interaction, and application in disease subtyping2
Inverse problem for parameters identification in a modified SIRD epidemic model using ensemble neural networks2
0.040586948394775