BMC Bioinformatics

Papers
(The median citation count of BMC Bioinformatics is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-02-01 to 2025-02-01.)
ArticleCitations
Locality-sensitive hashing enables efficient and scalable signal classification in high-throughput mass spectrometry raw data1059
Establishment and validation of a predictive model of preeclampsia based on transcriptional signatures of 43 genes in decidua basalis and peripheral blood683
IgMAT: immunoglobulin sequence multi-species annotation tool for any species including those with incomplete antibody annotation or unusual characteristics294
MLM-based typographical error correction of unstructured medical texts for named entity recognition182
Moment estimators of relatedness from low-depth whole-genome sequencing data142
Hybrid generative adversarial network based on frequency and spatial domain for histopathological image synthesis121
Enumeration and comprehensive in-silico modeling of three-helix bundle structures composed of typical αα-hairpins115
Topology preserving stratification of tissue neoplasticity using Deep Neural Maps and microRNA signatures110
scSNPdemux: a sensitive demultiplexing pipeline using single nucleotide polymorphisms for improved pooled single-cell RNA sequencing analysis94
B-LBConA: a medical entity disambiguation model based on Bio-LinkBERT and context-aware mechanism84
Cortexa: a comprehensive resource for studying gene expression and alternative splicing in the murine brain82
MetaCRS: unsupervised clustering of contigs with the recursive strategy of reducing metagenomic dataset’s complexity78
Performance improvement for a 2D convolutional neural network by using SSC encoding on protein–protein interaction tasks70
HAVoC, a bioinformatic pipeline for reference-based consensus assembly and lineage assignment for SARS-CoV-2 sequences68
Development of the International Classification of Diseases Ontology (ICDO) and its application for COVID-19 diagnostic data analysis68
Mixture density networks for the indirect estimation of reference intervals62
CircWalk: a novel approach to predict CircRNA-disease association based on heterogeneous network representation learning60
EnsembleSplice: ensemble deep learning model for splice site prediction57
Impact of adaptive filtering on power and false discovery rate in RNA-seq experiments51
A mixed-effects stochastic model reveals clonal dominance in gene therapy safety studies51
Consensus clustering for Bayesian mixture models50
EDLMFC: an ensemble deep learning framework with multi-scale features combination for ncRNA–protein interaction prediction46
Predicting biological pathways of chemical compounds with a profile-inspired approach46
Assigning protein function from domain-function associations using DomFun45
Multiview clustering of multi-omics data integration by using a penalty model45
Normalization of gene expression data revisited: the three viewpoints of the transcriptome in human skeletal muscle undergoing load-induced hypertrophy and why they matter45
gGATLDA: lncRNA-disease association prediction based on graph-level graph attention network44
Aristotle: stratified causal discovery for omics data44
A comprehensive database for integrated analysis of omics data in autoimmune diseases43
InClust+: the deep generative framework with mask modules for multimodal data integration, imputation, and cross-modal generation42
DiviK: divisive intelligent K-means for hands-free unsupervised clustering in big biological data42
Using dual-network-analyser for communities detecting in dual networks40
A Poisson reduced-rank regression model for association mapping in sequencing data40
Physcraper: a Python package for continually updated phylogenetic trees using the Open Tree of Life38
Identifying host-specific amino acid signatures for influenza A viruses using an adjusted entropy measure38
ReRF-Pred: predicting amyloidogenic regions of proteins based on their pseudo amino acid composition and tripeptide composition38
Transcript annotation tool (TransAT): an R package for retrieving annotations for transcript-specific genetic variants38
Volumetric macromolecule identification in cryo-electron tomograms using capsule networks37
llperm: a permutation of regressor residuals test for microbiome data37
Acute stress reduces population-level metabolic and proteomic variation36
ERStruct: a fast Python package for inferring the number of top principal components from whole genome sequencing data36
AutoDTI++: deep unsupervised learning for DTI prediction by autoencoders35
CysPresso: a classification model utilizing deep learning protein representations to predict recombinant expression of cysteine-dense peptides35
A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis35
GENTLE: a novel bioinformatics tool for generating features and building classifiers from T cell repertoire cancer data34
Statistical inference for a quasi birth–death model of RNA transcription34
Hydropathicity-based prediction of pain-causing NaV1.7 variants33
SpinSPJ: a novel NMR scripting system to implement artificial intelligence and advanced applications33
Identifying biomarkers for breast cancer by gene regulatory network rewiring32
Fusing graph transformer with multi-aggregate GCN for enhanced drug–disease associations prediction32
Overlapping group screening for detection of gene-environment interactions with application to TCGA high-dimensional survival genomic data31
A drug repositioning algorithm based on a deep autoencoder and adaptive fusion31
StackTTCA: a stacking ensemble learning-based framework for accurate and high-throughput identification of tumor T cell antigens31
Critical assessment of on-premise approaches to scalable genome analysis31
Predicting weighted unobserved nodes in a regulatory network using answer set programming31
bsgenova: an accurate, robust, and fast genotype caller for bisulfite-sequencing data30
A novel nonparametric computational strategy for identifying differential methylation regions30
Graph regularized non-negative matrix factorization with prior knowledge consistency constraint for drug–target interactions prediction30
Efficient design of synthetic gene circuits under cell-to-cell variability30
Combining denoising of RNA-seq data and flux balance analysis for cluster analysis of single cells30
Using machine learning to determine the correlation between physiological and environmental parameters and the induction of acute mountain sickness30
Pan-genome de Bruijn graph using the bidirectional FM-index30
Identification and validation of tumor-infiltrating lymphocyte-related prognosis signature for predicting prognosis and immunotherapeutic response in bladder cancer30
QuickPed: an online tool for drawing pedigrees and analysing relatedness30
A global $$Anopheles\ gambiae$$ gene co-expression network constructed from hundreds of experimental conditions with missing values30
adabmDCA: adaptive Boltzmann machine learning for biological sequences29
Improvement of variables interpretability in kernel PCA29
eSPRESSO: topological clustering of single-cell transcriptomics data to reveal informative genes for spatio–temporal architectures of cells29
RNA-clique: a method for computing genetic distances from RNA-seq data29
The metabolomics workbench file status website: a metadata repository promoting FAIR principles of metabolomics data29
CMIC: predicting DNA methylation inheritance of CpG islands with embedding vectors of variable-length k-mers29
Nonnegative matrix factorization analysis and multiple machine learning methods identified IL17C and ACOXL as novel diagnostic biomarkers for atherosclerosis28
MUREN: a robust and multi-reference approach of RNA-seq transcript normalization28
CNN-Siam: multimodal siamese CNN-based deep learning approach for drug‒drug interaction prediction28
A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data28
An uncertainty-based interpretable deep learning framework for predicting breast cancer outcome27
Empowering the discovery of novel target-disease associations via machine learning approaches in the open targets platform27
Ab initio protein structure prediction: the necessary presence of external force field as it is delivered by Hsp40 chaperone27
Enabling personalised disease diagnosis by combining a patient’s time-specific gene expression profile with a biomedical knowledge base27
Benchmarking imputation methods for network inference using a novel method of synthetic scRNA-seq data generation26
SNARER: new molecular descriptors for SNARE proteins classification26
Efficient automatic 3D segmentation of cell nuclei for high-content screening26
Distance correlation application to gene co-expression network analysis26
RGMQL: scalable and interoperable computing of heterogeneous omics big data and metadata in R/Bioconductor26
CamPype: an open-source workflow for automated bacterial whole-genome sequencing analysis focused on Campylobacter25
Using flux theory in dynamic omics data sets to identify differentially changing signals using DPoP25
Supervised topological data analysis for MALDI mass spectrometry imaging applications25
Linear programming based gene expression model (LPM-GEM) predicts the carbon source for Bacillus subtilis25
Early effects of gene duplication on the robustness and phenotypic variability of gene regulatory networks25
scInterpreter: a knowledge-regularized generative model for interpretably integrating scRNA-seq data25
ALGAEFUN with MARACAS, microALGAE FUNctional enrichment tool for MicroAlgae RnA-seq and Chip-seq AnalysiS24
A generalized covariate-adjusted top-scoring pair algorithm with applications to diabetic kidney disease stage classification in the Chronic Renal Insufficiency Cohort (CRIC) Study24
Latent dirichlet allocation for double clustering (LDA-DC): discovering patients phenotypes and cell populations within a single Bayesian framework24
SAPFIR: A webserver for the identification of alternative protein features24
Risk prediction for dermatomyositis-associated hepatocellular carcinoma24
Supervised promoter recognition: a benchmark framework24
Pre-capture multiplexing provides additional power to detect copy number variation in exome sequencing24
PSReliP: an integrated pipeline for analysis and visualization of population structure and relatedness based on genome-wide genetic variant data23
Boosting tissue-specific prediction of active cis-regulatory regions through deep learning and Bayesian optimization techniques23
ELIMINATOR: essentiality analysis using multisystem networks and integer programming23
Deep learning-based classification model for GPR151 activator activity prediction23
A multimodal deep learning model to infer cell-type-specific functional gene networks23
Allele-specific binding (ASB) analyzer for annotation of allele-specific binding SNPs22
PIKE-R2P: Protein–protein interaction network-based knowledge embedding with graph neural network for single-cell RNA to protein prediction22
Proteome-wide analysis of Coxiella burnetii for conserved T-cell epitopes with presentation across multiple host species22
Discovery of moiety preference by Shapley value in protein kinase family using random forest models22
PEPMatch: a tool to identify short peptide sequence matches in large sets of proteins22
Boosting heritability: estimating the genetic component of phenotypic variation with multiple sample splitting22
Simultant: simultaneous curve fitting of functions and differential equations using analytical gradient calculations22
aRNAque: an evolutionary algorithm for inverse pseudoknotted RNA folding inspired by Lévy flights22
Gene co-expression network based on part mutual information for gene-to-gene relationship and gene-cancer correlation analysis22
From a genome assembly to full regulatory network prediction: the case study of Rhodotorula toruloides putative Haa1-regulon22
Injectiondesign: web service of plate design with optimized stratified block randomization for modern GC/LC-MS-based sample preparation22
PIGNON: a protein–protein interaction-guided functional enrichment analysis for quantitative proteomics22
topr: an R package for viewing and annotating genetic association results21
MetaTron: advancing biomedical annotation empowering relation annotation and collaboration21
BreakAlign: a Perl program to align chimaeric (split) genomic NGS reads and allow visual confirmation of novel retroviral integrations21
Shrinkage estimation of gene interaction networks in single-cell RNA sequencing data21
LDAGM: prediction lncRNA-disease asociations by graph convolutional auto-encoder and multilayer perceptron based on multi-view heterogeneous networks21
Joint deep learning for batch effect removal and classification toward MALDI MS based metabolomics21
Fast and accurate genome-wide predictions and structural modeling of protein–protein interactions using Galaxy21
DGDTA: dynamic graph attention network for predicting drug–target binding affinity21
IndelsRNAmute: predicting deleterious multiple point substitutions and indels mutations21
Gene regulatory network inference based on a nonhomogeneous dynamic Bayesian network model with an improved Markov Monte Carlo sampling21
Ideal adaptive control in biological systems: an analysis of $$\mathbb {P}$$-invariance and dynamical compensation properties21
Bayesian inference for identifying tumour-specific cancer dependencies through integration of ex-vivo drug response assays and drug-protein profiling21
EG-TransUNet: a transformer-based U-Net with enhanced and guided models for biomedical image segmentation20
SALON ontology for the formal description of sequence alignments20
Reply: Correspondence on NanoVar’s performance outlined by Jiang T. et al. in ‘Long-read sequencing settings for efficient structural variation detection based on comprehensive evaluation’20
Predictive modeling of gene expression regulation20
Grace-AKO: a novel and stable knockoff filter for variable selection incorporating gene network structures20
Multilayer network alignment based on topological assessment via embeddings20
Dualmarker: a flexible toolset for exploratory analysis of combinatorial dual biomarkers for clinical efficacy20
Prediction of the effects of small molecules on the gut microbiome using machine learning method integrating with optimal molecular features20
Correspondence on NanoVar’s performance outlined by Jiang T. et al. in “Long-read sequencing settings for efficient structural variation detection based on comprehensive evaluation”20
Symptoms are known by their companies: towards association guided disease diagnosis assistant20
Functional glyco-metagenomics elucidates the role of glycan-related genes in environments20
Hitac: a hierarchical taxonomic classifier for fungal ITS sequences compatible with QIIME220
End-to-end learning for compound activity prediction based on binding pocket information20
trioPhaser: using Mendelian inheritance logic to improve genomic phasing of trios19
Prediction of HIV-1 protease cleavage site from octapeptide sequence information using selected classifiers and hybrid descriptors19
CarSite-II: an integrated classification algorithm for identifying carbonylated sites based on K-means similarity-based undersampling and synthetic minority oversampling techniques19
A systematic comparison of human mitochondrial genome assembly tools19
DeepNetBim: deep learning model for predicting HLA-epitope interactions based on network analysis by harnessing binding and immunogenicity information19
RankCompV3: a differential expression analysis algorithm based on relative expression orderings and applications in single-cell RNA transcriptomics19
Tpgen: a language model for stable protein design with a specific topology structure19
scDIOR: single cell RNA-seq data IO software19
Pattern recognition of topologically associating domains using deep learning19
Implementation of machine learning in the clinic: challenges and lessons in prospective deployment from the System for High Intensity EvaLuation During Radiation Therapy (SHIELD-RT) randomized control19
MADGAN: unsupervised medical anomaly detection GAN using multiple adjacent brain MRI slice reconstruction18
CNN-based two-branch multi-scale feature extraction network for retrosynthesis prediction18
Development and validation of a coagulation-related genes prognostic model for hepatocellular carcinoma18
Sketching and sampling approaches for fast and accurate long read classification18
A graph-based algorithm for detecting rigid domains in protein structures18
Paired-end small RNA sequencing reveals a possible overestimation in the isomiR sequence repertoire previously reported from conventional single read data analysis18
Contrastive self-supervised clustering of scRNA-seq data18
Learning self-supervised molecular representations for drug–drug interaction prediction18
kegg_pull: a software package for the RESTful access and pulling from the Kyoto Encyclopedia of Gene and Genomes18
Weighted minimum feedback vertex sets and implementation in human cancer genes detection18
LPI-HyADBS: a hybrid framework for lncRNA-protein interaction prediction integrating feature selection and classification18
ACO:lossless quality score compression based on adaptive coding order18
Accuracy of a machine learning method based on structural and locational information from AlphaFold2 for predicting the pathogenicity of TARDBP and FUS gene variants in ALS18
Lantern: an integrative repository of functional annotations for lncRNAs in the human genome18
RUBic: rapid unsupervised biclustering18
DI2: prior-free and multi-item discretization of biological data and its applications17
Exploration of chemical space with partial labeled noisy student self-training and self-supervised graph embedding17
Employing phylogenetic tree shape statistics to resolve the underlying host population structure17
Noisecut: a python package for noise-tolerant classification of binary data using prior knowledge integration and max-cut solutions17
Concentration optimization of combinatorial drugs using Markov chain-based models17
MQF and buffered MQF: quotient filters for efficient storage of k-mers with their counts and metadata17
In silico drug repositioning based on the integration of chemical, genomic and pharmacological spaces17
MetHoS: a platform for large-scale processing, storage and analysis of metabolomics data17
Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships17
rKOMICS: an R package for processing mitochondrial minicircle assemblies in population-scale genome projects17
Integrated analysis of the voltage-gated potassium channel-associated gene KCNH2 across cancers17
LinkedImm: a linked data graph database for integrating immunological data17
ChiMera: an easy to use pipeline for bacterial genome based metabolic network reconstruction, evaluation and visualization16
Application of convolutional neural networks towards nuclei segmentation in localization-based super-resolution fluorescence microscopy images16
gcMECM: graph clustering of mutual exclusivity of cancer mutations16
Molecular docking analysis reveals the functional inhibitory effect of Genistein and Quercetin on TMPRSS2: SARS-COV-2 cell entry facilitator spike protein16
Multioviz: an interactive platform for in silico perturbation and interrogation of gene regulatory networks16
CMIC: an efficient quality score compressor with random access functionality16
Drug-Online: an online platform for drug-target interaction, affinity, and binding sites identification using deep learning16
CrisprVi: a software for visualizing and analyzing CRISPR sequences of prokaryotes16
Mathematical modelling of SigE regulatory network reveals new insights into bistability of mycobacterial stress response16
RAE1 is a prognostic biomarker and is correlated with clinicopathological characteristics of patients with hepatocellular carcinoma16
Readsynth: short-read simulation for consideration of composition-biases in reduced metagenome sequencing approaches16
A new approach to describe the taxonomic structure of microbiome and its application to assess the relationship between microbial niches16
Computational comparison of common event-based differential splicing tools: practical considerations for laboratory researchers16
CircNetVis: an interactive web application for visualizing interaction networks of circular RNAs16
CrMP-Sol database: classification, bioinformatic analyses and comparison of cancer-related membrane proteins and their water-soluble variant designs16
MTG-Link: leveraging barcode information from linked-reads to assemble specific loci15
Mora: abundance aware metagenomic read re-assignment for disentangling similar strains15
Strain-specific behavior of Mycobacterium tuberculosis in A549 lung cancer cell line15
A unified framework for the integration of multiple hierarchical clusterings or networks from multi-source data15
A machine learning framework that integrates multi-omics data predicts cancer-related LncRNAs15
Quantitative prediction model for affinity of drug–target interactions based on molecular vibrations and overall system of ligand-receptor15
Integrated structure-based protein interface prediction15
PWN: enhanced random walk on a warped network for disease target prioritization15
Asc-Seurat: analytical single-cell Seurat-based web application15
123VCF: an intuitive and efficient tool for filtering VCF files15
Few-shot genes selection: subset of PAM50 genes for breast cancer subtypes classification15
A risk factor attention-based model for cardiovascular disease prediction15
Evaluation of word embedding models to extract and predict surgical data in breast cancer15
Examination of blood samples using deep learning and mobile microscopy15
SCOUR: a stepwise machine learning framework for predicting metabolite-dependent regulatory interactions15
tidyMicro: a pipeline for microbiome data analysis and visualization using the tidyverse in R15
Hierarchical representation for PPI sites prediction15
Statistical image processing quantifies the changes in cytoplasmic texture associated with aging in Caenorhabditis elegans oocytes15
Nfinder: automatic inference of cell neighborhood in 2D and 3D using nuclear markers15
Explainable deep drug–target representations for binding affinity prediction15
tRNA-derived fragments as novel potential biomarkers for relapsed/refractory multiple myeloma15
nf-core/circrna: a portable workflow for the quantification, miRNA target prediction and differential expression analysis of circular RNAs15
Selection of optimal quantile protein biomarkers based on cell-level immunohistochemistry data15
Model selection and robust inference of mutational signatures using Negative Binomial non-negative matrix factorization15
SVDNVLDA: predicting lncRNA-disease associations by Singular Value Decomposition and node2vec15
vcf2fhir: a utility to convert VCF files into HL7 FHIR format for genomics-EHR integration15
Cellograph: a semi-supervised approach to analyzing multi-condition single-cell RNA-sequencing data using graph neural networks14
nPoRe: n-polymer realigner for improved pileup-based variant calling14
Robust optimization of convolutional neural networks with a uniform experiment design method: a case of phonocardiogram testing in patients with heart diseases14
A comparison of embedding aggregation strategies in drug–target interaction prediction14
scEGOT: single-cell trajectory inference framework based on entropic Gaussian mixture optimal transport14
SVcnn: an accurate deep learning-based method for detecting structural variation based on long-read data14
Mabs, a suite of tools for gene-informed genome assembly14
Prop3D: A flexible, Python-based platform for machine learning with protein structural properties and biophysical data14
Cellstitch: 3D cellular anisotropic image segmentation via optimal transport14
Dyport: dynamic importance-based biomedical hypothesis generation benchmarking technique14
SEMplMe: a tool for integrating DNA methylation effects in transcription factor binding affinity predictions14
Multiblock variable influence on orthogonal projections (MB-VIOP) for enhanced interpretation of total, global, local and unique variations in OnPLS models14
Sparse sliced inverse regression for high dimensional data analysis14
Amplidiff: an optimized amplicon sequencing approach to estimating lineage abundances in viral metagenomes14
Polynomial superlevel set representation of the multistationarity region of chemical reaction networks14
Exploring cell-specific miRNA regulation with single-cell miRNA-mRNA co-sequencing data14
PhytoPipe: a phytosanitary pipeline for plant pathogen detection and diagnosis using RNA-seq data14
Atlas of regulated target genes of transcription factors (ART-TF) in human ES cells14
Detecting genomic deletions from high-throughput sequence data with unsupervised learning14
PangenomeNet: a pan-genome-based network reveals functional modules on antimicrobial resistome for Escherichia coli strains13
Using BioPAX-Parser (BiP) to enrich lists of genes or proteins with pathway data13
DM-MOGA: a multi-objective optimization genetic algorithm for identifying disease modules of non-small cell lung cancer13
Understanding and mathematical modelling of cellular resource allocation in microorganisms: a comparative synthesis13
CoQUAD: a COVID-19 question answering dataset system, facilitating research, benchmarking, and practice13
xenoGI 3: using the DTLOR model to reconstruct the evolution of gene families in clades of microbes13
Multi-dimensional data integration algorithm based on random walk with restart13
GMO Genetic Elements Thesaurus (GMO-GET): a controlled vocabulary for the consensus designation of introduced or modified genetic elements in genetically modified organisms13
MRI-based brain tumor segmentation using FPGA-accelerated neural network13
NeuronBridge: an intuitive web application for neuronal morphology search across large data sets13
Prediction of hot spots in protein–DNA binding interfaces based on discrete wavelet transform and wavelet packet transform13
MSPCD: predicting circRNA-disease associations via integrating multi-source data and hierarchical neural network13
Sparse data embedding and prediction by tropical matrix factorization13
ProgPerm: Progressive permutation for a dynamic representation of the robustness of microbiome discoveries13
0.83067488670349