BioData Mining

Papers
(The median citation count of BioData Mining is 2. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2022-05-01 to 2026-05-01.)
Article | Citations
Investigating potential drug targets for IgA nephropathy and membranous nephropathy through multi-queue plasma protein analysis: a Mendelian randomization study based on SMR and co-localization analys | 475
MOCAT: multi-omics integration with auxiliary classifiers enhanced autoencoder | 457
CancerHubs Data Explorer: a web application for investigating mutation-enriched protein interaction hubs in human cancers | 171
Processing imbalanced medical data at the data level with assisted-reproduction data as an example | 98
Transcriptome-based network analysis related to regulatory T cells infiltration identified RCN1 as a potential biomarker for prognosis in clear cell renal cell carcinoma | 72
Correction: Detection and classification of long terminal repeat sequences in plant LTR-retrotransposons and their analysis using explainable machine learning | 58
A fairness-aware machine learning framework for maternal health in Ghana: integrating explainability, bias mitigation, and causal inference for ethical AI deployment | 47
Deep joint learning diagnosis of Alzheimer’s disease based on multimodal feature fusion | 45
A simple guide to the use of Student’s t-test, Mann-Whitney U test, Chi-squared test, and Kruskal-Wallis test in biostatistics | 42
Exploring the common genetic basis of metabolic syndrome-related diseases and chronic kidney disease: insights from extensive genome-wide cross-trait analyses | 41
Quantum Angle–distance kernel for ECG classification and anomaly detection: a quantum-inspired framework for biomedical signal analysis | 40
circGPAcorr: an integrative tool for functional annotation of circular RNAs using expression data | 37
Disease- and gene-specific deep learning for pathogenicity prediction of rare missense variants in cancer predisposition genes | 36
Polygenic risk modeling of tumor stage and survival in bladder cancer | 33
Comparing new tools of artificial intelligence to the authentic intelligence of our global health students | 26
Skin in the game: a review of computational models of the skin | 26
Ten simple rules for providing bioinformatics support within a hospital | 23
Unsupervised clustering based coronary artery segmentation | 22
Machine learning approaches to identify systemic lupus erythematosus in anti-nuclear antibody-positive patients using genomic data and electronic health records | 21
Neural network methods for diagnosing patient conditions from cardiopulmonary exercise testing data | 18
Decoding dynamic miRNA:ceRNA interactions unveils therapeutic insights and targets across predominant cancer landscapes | 17
The biomedical knowledge graph of symptom phenotype in coronary artery plaque: machine learning-based analysis of real-world clinical data | 17
AI-Driven SaO2 prediction from pulse oximetry and electronic health records | 17
Detection and classification of long terminal repeat sequences in plant LTR-retrotransposons and their analysis using explainable machine learning | 16
Mapping the evolving trend of research on efferocytosis: a comprehensive data-mining-based study | 16
Genetics and precision health: the ecological fallacy and artificial intelligence solutions | 16
Early differentiation between paroxysmal and persistent atrial fibrillation based on interpretable machine learning: a multicenter retrospective study | 15
Multi-output LSTM-based prediction of postoperative delirium: integrating baseline and perioperative data for enhanced risk stratification in older spine surgery patients | 15
Supervised multiple kernel learning approaches for multi-omics data integration | 14
FISM: harnessing deep learning and reinforcement learning for precision detection of microaneurysms and retinal exudates for early diabetic retinopathy diagnosis | 14
Correction: Motif clustering and digital biomarker extraction for free-living physical activity analysis | 14
FARFOOD: a database of potential interactions between food compounds and drugs | 13
Advancing preeclampsia prediction: a tailored machine learning pipeline integrating resampling and ensemble models for handling imbalanced medical data | 12
Machine learning analysis of Drosophila testis transcriptomic data reveals potential regulatory sequences | 12
A biology-based quality-diversity algorithm for drug repurposing in Alzheimer’s disease using automated machine learning | 12
TLEUDS: a cascade Dual-Transfer learning system with quality- and knowledge-enhanced for precise fetal CHD screening | 12
Tree-based ensemble learning models for protein-protein interactions detection: a review and experimental evaluation | 12
From COVID-19 to monkeypox: a novel predictive model for emerging infectious diseases | 11
Motif clustering and digital biomarker extraction for free-living physical activity analysis | 11
Using GPT-4 to write a scientific review article: a pilot evaluation study | 11
Effective hybrid feature selection using different bootstrap enhances cancers classification performance | 11
MediNet: ensemble transfer learning approach for classification of medical drugs-related text reviews using significant combined-embeddings | 11
Multimodal deep learning for survival prediction and biomarker discovery in non-small cell lung cancer | 11
Interpreting drug synergy in breast cancer with deep learning using target-protein inhibition profiles | 10
m1A-Ensem: accurate identification of 1-methyladenosine sites through ensemble models | 9
Machine learning models for reinjury risk prediction using cardiopulmonary exercise testing (CPET) data: optimizing athlete recovery | 9
Assessment of the causal relationship between gut microbiota and cardiovascular diseases: a bidirectional Mendelian randomization analysis | 9
Predictive modeling of ALS progression: an XGBoost approach using clinical features | 8
Ensemble feature selection and tabular data augmentation with generative adversarial networks to enhance cutaneous melanoma identification and interpretability | 8
Reference-free phylogeny from sequencing data | 8
An unsupervised image segmentation algorithm for coronary angiography | 8
Disclosing transcriptomics network-based signatures of glioma heterogeneity using sparse methods | 8
Understanding predictions of drug profiles using explainable machine learning models | 8
Saliency-driven explainable deep learning in medical imaging: bridging visual explainability and statistical quantitative analysis | 8
Detection of iron deficiency anemia by medical images: a comparative study of machine learning algorithms | 8
Identification of immune-associated biomarkers of diabetes nephropathy tubulointerstitial injury based on machine learning: a bioinformatics multi-chip integrated analysis | 7
A crisis of overconfidence: Why confidence, not accuracy, is the real risk in clinical AI | 7
Open challenges and opportunities in federated foundation models towards biomedical healthcare | 7
6mA-StackingCV: an improved stacking ensemble model for predicting DNA N6-methyladenine site | 7
Decoding ancestry-specific genetic risk: interpretable deep feature selection reveals prostate cancer SNP disparities in diverse populations | 7
Deep learning-based Emergency Department In-hospital Cardiac Arrest Score (Deep EDICAS) for early prediction of cardiac arrest and cardiopulmonary resuscitation in the emergency department | 7
Computational prediction of cellular elastic modulus from mechanosensitive gene expression at multiple biological levels | 7
Changing word meanings in biomedical literature reveal pandemics and new technologies | 6
Light-XAI: a CADx for explainable cervical cancer detection via attention-based lightweight convolutional neural networks and layer-wise feature fusion | 6
Endoscopy-based IBD identification by a quantized deep learning pipeline | 6
Optimizing accuracy and dimensionality: a swarm intelligence strategy for robust cancer genomics classification | 6
Harnessing machine learning with auditory tests and demographic factors to forecast children’s reading abilities in children living with and without HIV | 6
ChatGPT and large language models in academia: opportunities and challenges | 6
Clustering-based low-rank matrix approximation for multimodal medical image compression | 6
A machine learning approach using conditional normalizing flow to address extreme class imbalance problems in personal health records | 6
Optimizing age-related hearing risk predictions: an advanced machine learning integration with HHIE-S | 6
Revealing third-order interactions through the integration of machine learning and entropy methods in genomic studies | 6
The Matthews correlation coefficient (MCC) should replace the ROC AUC as the standard metric for assessing binary classification | 6
Exo-Tox: Identifying Exotoxins from secreted bacterial proteins | 5
MultiChem: predicting chemical properties using multi-view graph attention network | 5
Machine Learning Algorithms for understanding the determinants of under-five Mortality | 5
Construction and application of medication reminder system: intelligent generation of universal medication schedule | 5
Correction: A prognostic model based on seven immune-related genes predicts the overall survival of patients with hepatocellular carcinoma | 5
Using artificial intelligence (AI) to model clinical variant reporting for next generation sequencing (NGS) oncology assays | 5
Network-based multi-omics integrative analysis methods in drug discovery: a systematic review | 5
Analysis of risk factors progression of preterm delivery using electronic health records | 5
An explainable machine learning model for predicting postoperative cholangitis in pediatric surgical patients with pancreaticobiliary maljunction | 5
Evaluation of network-guided random forest for disease gene discovery | 5
A Gated Recurrent Unit based architecture for recognizing ontology concepts from biological literature | 5
A multi-metric evaluation of readability in psychiatric discharge summaries | 5
Benchmarking genomic foundation models for binary classification of gene fusion breakpoints from DNA sequences | 5
TGNet: tensor-based graph convolutional networks for multimodal brain network analysis | 5
Prognostic biomarker discovery in pancreatic cancer through hybrid ensemble feature selection and multi-omics data | 4
Priority-Elastic net for binary disease outcome prediction based on multi-omics data | 4
Enhancing hepatopathy clinical trial efficiency: a secure, large language model-powered pre-screening pipeline | 4
Algorithm-based detection of acute kidney injury according to full KDIGO criteria including urine output following cardiac surgery: a descriptive analysis | 4
Machine-learning-based models to predict cardiovascular risk using oculomics and clinic variables in KNHANES | 4
Cross-regional radiomics: a novel framework for relationship-based feature extraction with validation in Parkinson’s disease motor subtyping | 4
Deciphering the tissue-specific functional effect of Alzheimer risk SNPs with deep genome annotation | 4
Novel digital approaches to the assessment of problematic opioid use | 4
ScInfoVAE: interpretable dimensional reduction of single cell transcription data with variational autoencoders and extended mutual information regularization | 4
QIGTD: identifying critical genes in the evolution of lung adenocarcinoma with tensor decomposition | 4
subMG automates data submission for metagenomics studies | 4
Machine learning based study for the classification of Type 2 diabetes mellitus subtypes | 4
A regularized Cox hierarchical model for incorporating annotation information in predictive omic studies | 4
Deep vision in agriculture: assessing the function of YOLO in the classification of plant leaf diseases (PLDs) | 4
The ethics of data mining in healthcare: challenges, frameworks, and future directions | 4
Towards a potential pan-cancer prognostic signature for gene expression based on probesets and ensemble machine learning | 3
Cross-cohort genetic risk prediction for Alzheimer’s disease: a transfer learning approach using GWAS and deep learning models | 3
Quantum analysis of squiggle data | 3
Distinct network patterns emerge from Cartesian and XOR epistasis models: a comparative network science analysis | 3
Enhanced labor pain monitoring using machine learning and ECG waveform analysis for uterine contraction-induced pain | 3
Learning the therapeutic targets of acute myeloid leukemia through multiscale human interactome network and community analysis | 3
Radio Frequency Tagging–enabled patient monitoring: integrating mobility tracking with early warning systems for enhanced safety | 3
PAGER: A novel genotype encoding strategy for modeling deviations from additivity in complex trait association studies | 3
ParticleChromo3D: a Particle Swarm Optimization algorithm for chromosome 3D structure prediction from Hi-C data | 3
Short- and long-term weekly patient-reported outcomes prediction undergoing radiotherapy: single-patient time series model vs. transformer-based multi-patient time series model | 3
Agenda setting for health equity assessment through the lenses of social determinants of health using machine learning approach: a framework and preliminary pilot study | 3
Leveraging mixed-effects regression trees for the analysis of high-dimensional longitudinal data to identify the low and high-risk subgroups: simulation study with application to genetic study | 3
Comparison of cancer subtype identification methods combined with feature selection methods in omics data analysis | 3
Electronic medical records imputation by temporal Generative Adversarial Network | 3
Network analysis of longitudinal electronic health records using linear mixed models | 3
Deep learning based prediction of RNA 5hmC modifications using composite feature representations and comparative benchmarking with transformer models | 3
AI as an accelerator for defining new problems that transcends boundaries | 3
Deep learning-driven TCRβ repertoire analysis enhances diagnosis and enables mining of immunological biomarkers in systemic lupus erythematosus | 3
A deep learning approach for classifying and predicting children's nutritional status in Ethiopia using LSTM-FC neural networks | 2
KeySDL: sparse dictionary learning for keystone microbe identification from steady-state observations using a dynamical-systems model | 2
Hierarchical machine learning models for predicting antenatal care utilisation among Nigerian women: Identifying actionable insights for health policy | 2
Deep learning-based approaches for multi-omics data integration and analysis | 2
Early prediction of longitudinal treatment adherence in obstructive sleep apnea using machine learning approaches | 2
Inverse problem for parameters identification in a modified SIRD epidemic model using ensemble neural networks | 2
Correction: Predictive modeling of ALS progression: an XGBoost approach using clinical features | 2
DBSCAN applied to EHRs data from patients with glioblastoma clusters patients based on cytosolic Hsp70 protein, sex, and brain subventricular zone | 2
Private pathological assessment via machine learning and homomorphic encryption | 2
Can open source large language models be used for tumor documentation in Germany?—An evaluation on urological doctors’ notes | 2
From prompt engineering to agent engineering: expanding the AI toolbox with autonomous agentic AI collaborators for biomedical discovery | 2
CAUSALRLSTACK: adaptive balancing of deep representation and causal effect estimation with application to HIV-related health data | 2
Proteome mining of Yersinia Enterocolitica for drug targets and computational inhibitor identification with ADMET, anti-inflammation potential and formulation characteristics | 2
Construction and validation of a machine learning-based model predicting early readmission in patients with decompensated cirrhosis: a prospective two-center cohort study | 2
Machine learning-based assessment of the healthy human gut mycobiota landscape using ITS1 DNA metabarcoding data | 2
Assessing the limitations of relief-based algorithms in detecting higher-order interactions | 2
Automated quantitative trait locus analysis (AutoQTL) | 2
Overlapping filter bank convolutional neural network for multisubject multicategory motor imagery brain-computer interface | 2
Feature graphs for interpretable unsupervised tree ensembles: centrality, interaction, and application in disease subtyping | 2