BMC Bioinformatics

Papers
(The TQCC of BMC Bioinformatics is 7. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-01-01 to 2025-01-01.)
ArticleCitations
Locality-sensitive hashing enables efficient and scalable signal classification in high-throughput mass spectrometry raw data1059
Establishment and validation of a predictive model of preeclampsia based on transcriptional signatures of 43 genes in decidua basalis and peripheral blood683
IgMAT: immunoglobulin sequence multi-species annotation tool for any species including those with incomplete antibody annotation or unusual characteristics294
MLM-based typographical error correction of unstructured medical texts for named entity recognition182
Enumeration and comprehensive in-silico modeling of three-helix bundle structures composed of typical αα-hairpins142
Topology preserving stratification of tissue neoplasticity using Deep Neural Maps and microRNA signatures121
scSNPdemux: a sensitive demultiplexing pipeline using single nucleotide polymorphisms for improved pooled single-cell RNA sequencing analysis115
B-LBConA: a medical entity disambiguation model based on Bio-LinkBERT and context-aware mechanism110
Cortexa: a comprehensive resource for studying gene expression and alternative splicing in the murine brain94
MetaCRS: unsupervised clustering of contigs with the recursive strategy of reducing metagenomic dataset’s complexity90
Performance improvement for a 2D convolutional neural network by using SSC encoding on protein–protein interaction tasks84
HAVoC, a bioinformatic pipeline for reference-based consensus assembly and lineage assignment for SARS-CoV-2 sequences82
Development of the International Classification of Diseases Ontology (ICDO) and its application for COVID-19 diagnostic data analysis78
CircWalk: a novel approach to predict CircRNA-disease association based on heterogeneous network representation learning70
Mixture density networks for the indirect estimation of reference intervals70
Impact of adaptive filtering on power and false discovery rate in RNA-seq experiments68
EnsembleSplice: ensemble deep learning model for splice site prediction68
A mixed-effects stochastic model reveals clonal dominance in gene therapy safety studies62
Consensus clustering for Bayesian mixture models60
Predicting biological pathways of chemical compounds with a profile-inspired approach57
EDLMFC: an ensemble deep learning framework with multi-scale features combination for ncRNA–protein interaction prediction54
Normalization of gene expression data revisited: the three viewpoints of the transcriptome in human skeletal muscle undergoing load-induced hypertrophy and why they matter51
Multiview clustering of multi-omics data integration by using a penalty model51
Assigning protein function from domain-function associations using DomFun50
Aristotle: stratified causal discovery for omics data46
gGATLDA: lncRNA-disease association prediction based on graph-level graph attention network46
DiviK: divisive intelligent K-means for hands-free unsupervised clustering in big biological data45
InClust+: the deep generative framework with mask modules for multimodal data integration, imputation, and cross-modal generation45
A comprehensive database for integrated analysis of omics data in autoimmune diseases45
Using dual-network-analyser for communities detecting in dual networks44
A Poisson reduced-rank regression model for association mapping in sequencing data44
Identifying host-specific amino acid signatures for influenza A viruses using an adjusted entropy measure43
ReRF-Pred: predicting amyloidogenic regions of proteins based on their pseudo amino acid composition and tripeptide composition42
Transcript annotation tool (TransAT): an R package for retrieving annotations for transcript-specific genetic variants42
Volumetric macromolecule identification in cryo-electron tomograms using capsule networks40
Physcraper: a Python package for continually updated phylogenetic trees using the Open Tree of Life40
CysPresso: a classification model utilizing deep learning protein representations to predict recombinant expression of cysteine-dense peptides38
llperm: a permutation of regressor residuals test for microbiome data38
ERStruct: a fast Python package for inferring the number of top principal components from whole genome sequencing data38
Acute stress reduces population-level metabolic and proteomic variation38
A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis37
AutoDTI++: deep unsupervised learning for DTI prediction by autoencoders37
Statistical inference for a quasi birth–death model of RNA transcription36
GENTLE: a novel bioinformatics tool for generating features and building classifiers from T cell repertoire cancer data36
Hydropathicity-based prediction of pain-causing NaV1.7 variants35
SpinSPJ: a novel NMR scripting system to implement artificial intelligence and advanced applications35
Identifying biomarkers for breast cancer by gene regulatory network rewiring35
Critical assessment of on-premise approaches to scalable genome analysis34
Fusing graph transformer with multi-aggregate GCN for enhanced drug–disease associations prediction34
Predicting weighted unobserved nodes in a regulatory network using answer set programming33
A drug repositioning algorithm based on a deep autoencoder and adaptive fusion33
Overlapping group screening for detection of gene-environment interactions with application to TCGA high-dimensional survival genomic data32
StackTTCA: a stacking ensemble learning-based framework for accurate and high-throughput identification of tumor T cell antigens32
A global $$Anopheles\ gambiae$$ gene co-expression network constructed from hundreds of experimental conditions with missing values31
Pan-genome de Bruijn graph using the bidirectional FM-index31
Efficient design of synthetic gene circuits under cell-to-cell variability31
Using machine learning to determine the correlation between physiological and environmental parameters and the induction of acute mountain sickness31
Graph regularized non-negative matrix factorization with prior knowledge consistency constraint for drug–target interactions prediction31
CMIC: predicting DNA methylation inheritance of CpG islands with embedding vectors of variable-length k-mers30
QuickPed: an online tool for drawing pedigrees and analysing relatedness30
A novel nonparametric computational strategy for identifying differential methylation regions30
adabmDCA: adaptive Boltzmann machine learning for biological sequences30
Self-organizing maps with variable neighborhoods facilitate learning of chromatin accessibility signal shapes associated with regulatory elements30
Combining denoising of RNA-seq data and flux balance analysis for cluster analysis of single cells30
bsgenova: an accurate, robust, and fast genotype caller for bisulfite-sequencing data30
RNA-clique: a method for computing genetic distances from RNA-seq data30
Improvement of variables interpretability in kernel PCA30
Identification and validation of tumor-infiltrating lymphocyte-related prognosis signature for predicting prognosis and immunotherapeutic response in bladder cancer30
A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data29
CNN-Siam: multimodal siamese CNN-based deep learning approach for drug‒drug interaction prediction29
The metabolomics workbench file status website: a metadata repository promoting FAIR principles of metabolomics data29
MUREN: a robust and multi-reference approach of RNA-seq transcript normalization29
eSPRESSO: topological clustering of single-cell transcriptomics data to reveal informative genes for spatio–temporal architectures of cells29
Nonnegative matrix factorization analysis and multiple machine learning methods identified IL17C and ACOXL as novel diagnostic biomarkers for atherosclerosis29
Data-driven biological network alignment that uses topological, sequence, and functional information28
Ab initio protein structure prediction: the necessary presence of external force field as it is delivered by Hsp40 chaperone28
Enabling personalised disease diagnosis by combining a patient’s time-specific gene expression profile with a biomedical knowledge base28
An uncertainty-based interpretable deep learning framework for predicting breast cancer outcome28
SNARER: new molecular descriptors for SNARE proteins classification27
Efficient automatic 3D segmentation of cell nuclei for high-content screening27
Distance correlation application to gene co-expression network analysis27
RGMQL: scalable and interoperable computing of heterogeneous omics big data and metadata in R/Bioconductor27
Empowering the discovery of novel target-disease associations via machine learning approaches in the open targets platform27
Using flux theory in dynamic omics data sets to identify differentially changing signals using DPoP26
Benchmarking imputation methods for network inference using a novel method of synthetic scRNA-seq data generation26
scInterpreter: a knowledge-regularized generative model for interpretably integrating scRNA-seq data26
Linear programming based gene expression model (LPM-GEM) predicts the carbon source for Bacillus subtilis26
Early effects of gene duplication on the robustness and phenotypic variability of gene regulatory networks26
Supervised promoter recognition: a benchmark framework25
Pre-capture multiplexing provides additional power to detect copy number variation in exome sequencing25
A generalized covariate-adjusted top-scoring pair algorithm with applications to diabetic kidney disease stage classification in the Chronic Renal Insufficiency Cohort (CRIC) Study25
Supervised topological data analysis for MALDI mass spectrometry imaging applications25
Latent dirichlet allocation for double clustering (LDA-DC): discovering patients phenotypes and cell populations within a single Bayesian framework25
SAPFIR: A webserver for the identification of alternative protein features25
CamPype: an open-source workflow for automated bacterial whole-genome sequencing analysis focused on Campylobacter25
Allele-specific binding (ASB) analyzer for annotation of allele-specific binding SNPs24
Risk prediction for dermatomyositis-associated hepatocellular carcinoma24
ELIMINATOR: essentiality analysis using multisystem networks and integer programming24
Deep learning-based classification model for GPR151 activator activity prediction24
PSReliP: an integrated pipeline for analysis and visualization of population structure and relatedness based on genome-wide genetic variant data24
Boosting tissue-specific prediction of active cis-regulatory regions through deep learning and Bayesian optimization techniques24
ALGAEFUN with MARACAS, microALGAE FUNctional enrichment tool for MicroAlgae RnA-seq and Chip-seq AnalysiS24
A multimodal deep learning model to infer cell-type-specific functional gene networks24
From a genome assembly to full regulatory network prediction: the case study of Rhodotorula toruloides putative Haa1-regulon23
Gene co-expression network based on part mutual information for gene-to-gene relationship and gene-cancer correlation analysis23
Boosting heritability: estimating the genetic component of phenotypic variation with multiple sample splitting23
PEPMatch: a tool to identify short peptide sequence matches in large sets of proteins23
PIKE-R2P: Protein–protein interaction network-based knowledge embedding with graph neural network for single-cell RNA to protein prediction23
Proteome-wide analysis of Coxiella burnetii for conserved T-cell epitopes with presentation across multiple host species22
Injectiondesign: web service of plate design with optimized stratified block randomization for modern GC/LC-MS-based sample preparation22
BreakAlign: a Perl program to align chimaeric (split) genomic NGS reads and allow visual confirmation of novel retroviral integrations22
Gene regulatory network inference based on a nonhomogeneous dynamic Bayesian network model with an improved Markov Monte Carlo sampling22
Simultant: simultaneous curve fitting of functions and differential equations using analytical gradient calculations22
Discovery of moiety preference by Shapley value in protein kinase family using random forest models22
Bayesian inference for identifying tumour-specific cancer dependencies through integration of ex-vivo drug response assays and drug-protein profiling22
Fast and accurate genome-wide predictions and structural modeling of protein–protein interactions using Galaxy22
aRNAque: an evolutionary algorithm for inverse pseudoknotted RNA folding inspired by Lévy flights22
PIGNON: a protein–protein interaction-guided functional enrichment analysis for quantitative proteomics22
Joint deep learning for batch effect removal and classification toward MALDI MS based metabolomics22
Ideal adaptive control in biological systems: an analysis of $$\mathbb {P}$$-invariance and dynamical compensation properties22
MetaTron: advancing biomedical annotation empowering relation annotation and collaboration22
Grace-AKO: a novel and stable knockoff filter for variable selection incorporating gene network structures21
Functional glyco-metagenomics elucidates the role of glycan-related genes in environments21
Correspondence on NanoVar’s performance outlined by Jiang T. et al. in “Long-read sequencing settings for efficient structural variation detection based on comprehensive evaluation”21
IndelsRNAmute: predicting deleterious multiple point substitutions and indels mutations21
topr: an R package for viewing and annotating genetic association results21
Symptoms are known by their companies: towards association guided disease diagnosis assistant21
SALON ontology for the formal description of sequence alignments21
EG-TransUNet: a transformer-based U-Net with enhanced and guided models for biomedical image segmentation21
Shrinkage estimation of gene interaction networks in single-cell RNA sequencing data21
LDAGM: prediction lncRNA-disease asociations by graph convolutional auto-encoder and multilayer perceptron based on multi-view heterogeneous networks21
Multilayer network alignment based on topological assessment via embeddings21
Dualmarker: a flexible toolset for exploratory analysis of combinatorial dual biomarkers for clinical efficacy21
DGDTA: dynamic graph attention network for predicting drug–target binding affinity21
Prediction of the effects of small molecules on the gut microbiome using machine learning method integrating with optimal molecular features20
Hitac: a hierarchical taxonomic classifier for fungal ITS sequences compatible with QIIME220
Prediction of HIV-1 protease cleavage site from octapeptide sequence information using selected classifiers and hybrid descriptors20
CarSite-II: an integrated classification algorithm for identifying carbonylated sites based on K-means similarity-based undersampling and synthetic minority oversampling techniques20
trioPhaser: using Mendelian inheritance logic to improve genomic phasing of trios20
End-to-end learning for compound activity prediction based on binding pocket information20
DeepNetBim: deep learning model for predicting HLA-epitope interactions based on network analysis by harnessing binding and immunogenicity information20
RankCompV3: a differential expression analysis algorithm based on relative expression orderings and applications in single-cell RNA transcriptomics20
Tpgen: a language model for stable protein design with a specific topology structure20
Reply: Correspondence on NanoVar’s performance outlined by Jiang T. et al. in ‘Long-read sequencing settings for efficient structural variation detection based on comprehensive evaluation’20
Predictive modeling of gene expression regulation20
Implementation of machine learning in the clinic: challenges and lessons in prospective deployment from the System for High Intensity EvaLuation During Radiation Therapy (SHIELD-RT) randomized control20
Pattern recognition of topologically associating domains using deep learning20
scDIOR: single cell RNA-seq data IO software20
A systematic comparison of human mitochondrial genome assembly tools19
Contrastive self-supervised clustering of scRNA-seq data19
ACO:lossless quality score compression based on adaptive coding order19
MADGAN: unsupervised medical anomaly detection GAN using multiple adjacent brain MRI slice reconstruction19
Development and validation of a coagulation-related genes prognostic model for hepatocellular carcinoma19
Paired-end small RNA sequencing reveals a possible overestimation in the isomiR sequence repertoire previously reported from conventional single read data analysis19
Accuracy of a machine learning method based on structural and locational information from AlphaFold2 for predicting the pathogenicity of TARDBP and FUS gene variants in ALS19
CNN-based two-branch multi-scale feature extraction network for retrosynthesis prediction19
LPI-HyADBS: a hybrid framework for lncRNA-protein interaction prediction integrating feature selection and classification19
Weighted minimum feedback vertex sets and implementation in human cancer genes detection19
Employing phylogenetic tree shape statistics to resolve the underlying host population structure18
Sketching and sampling approaches for fast and accurate long read classification18
A graph-based algorithm for detecting rigid domains in protein structures18
PCirc: random forest-based plant circRNA identification software18
MQF and buffered MQF: quotient filters for efficient storage of k-mers with their counts and metadata18
Integrated analysis of the voltage-gated potassium channel-associated gene KCNH2 across cancers18
Learning self-supervised molecular representations for drug–drug interaction prediction18
kegg_pull: a software package for the RESTful access and pulling from the Kyoto Encyclopedia of Gene and Genomes18
Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships18
rKOMICS: an R package for processing mitochondrial minicircle assemblies in population-scale genome projects18
MetHoS: a platform for large-scale processing, storage and analysis of metabolomics data18
Concentration optimization of combinatorial drugs using Markov chain-based models18
Lantern: an integrative repository of functional annotations for lncRNAs in the human genome18
RUBic: rapid unsupervised biclustering18
DI2: prior-free and multi-item discretization of biological data and its applications18
Exploration of chemical space with partial labeled noisy student self-training and self-supervised graph embedding18
Application of convolutional neural networks towards nuclei segmentation in localization-based super-resolution fluorescence microscopy images17
Drug-Online: an online platform for drug-target interaction, affinity, and binding sites identification using deep learning17
LinkedImm: a linked data graph database for integrating immunological data17
CrisprVi: a software for visualizing and analyzing CRISPR sequences of prokaryotes17
gcMECM: graph clustering of mutual exclusivity of cancer mutations17
In silico drug repositioning based on the integration of chemical, genomic and pharmacological spaces17
A new approach to describe the taxonomic structure of microbiome and its application to assess the relationship between microbial niches17
CMIC: an efficient quality score compressor with random access functionality17
Multioviz: an interactive platform for in silico perturbation and interrogation of gene regulatory networks17
Noisecut: a python package for noise-tolerant classification of binary data using prior knowledge integration and max-cut solutions17
ChiMera: an easy to use pipeline for bacterial genome based metabolic network reconstruction, evaluation and visualization17
Computational comparison of common event-based differential splicing tools: practical considerations for laboratory researchers17
Readsynth: short-read simulation for consideration of composition-biases in reduced metagenome sequencing approaches17
Selection of optimal quantile protein biomarkers based on cell-level immunohistochemistry data16
RAE1 is a prognostic biomarker and is correlated with clinicopathological characteristics of patients with hepatocellular carcinoma16
Model selection and robust inference of mutational signatures using Negative Binomial non-negative matrix factorization16
nf-core/circrna: a portable workflow for the quantification, miRNA target prediction and differential expression analysis of circular RNAs16
tRNA-derived fragments as novel potential biomarkers for relapsed/refractory multiple myeloma16
CircNetVis: an interactive web application for visualizing interaction networks of circular RNAs16
CrMP-Sol database: classification, bioinformatic analyses and comparison of cancer-related membrane proteins and their water-soluble variant designs16
A machine learning framework that integrates multi-omics data predicts cancer-related LncRNAs16
Quantitative prediction model for affinity of drug–target interactions based on molecular vibrations and overall system of ligand-receptor16
Moment estimators of relatedness from low-depth whole-genome sequencing data16
Mathematical modelling of SigE regulatory network reveals new insights into bistability of mycobacterial stress response16
Molecular docking analysis reveals the functional inhibitory effect of Genistein and Quercetin on TMPRSS2: SARS-COV-2 cell entry facilitator spike protein16
Few-shot genes selection: subset of PAM50 genes for breast cancer subtypes classification16
Evaluation of word embedding models to extract and predict surgical data in breast cancer16
A unified framework for the integration of multiple hierarchical clusterings or networks from multi-source data16
Asc-Seurat: analytical single-cell Seurat-based web application15
Statistical image processing quantifies the changes in cytoplasmic texture associated with aging in Caenorhabditis elegans oocytes15
MTG-Link: leveraging barcode information from linked-reads to assemble specific loci15
SEMplMe: a tool for integrating DNA methylation effects in transcription factor binding affinity predictions15
scEGOT: single-cell trajectory inference framework based on entropic Gaussian mixture optimal transport15
Mora: abundance aware metagenomic read re-assignment for disentangling similar strains15
Hierarchical representation for PPI sites prediction15
Integrated structure-based protein interface prediction15
Examination of blood samples using deep learning and mobile microscopy15
A risk factor attention-based model for cardiovascular disease prediction15
tidyMicro: a pipeline for microbiome data analysis and visualization using the tidyverse in R15
Mabs, a suite of tools for gene-informed genome assembly15
Atlas of regulated target genes of transcription factors (ART-TF) in human ES cells15
SVcnn: an accurate deep learning-based method for detecting structural variation based on long-read data15
Multiblock variable influence on orthogonal projections (MB-VIOP) for enhanced interpretation of total, global, local and unique variations in OnPLS models15
PWN: enhanced random walk on a warped network for disease target prioritization15
123VCF: an intuitive and efficient tool for filtering VCF files15
Explainable deep drug–target representations for binding affinity prediction15
SCOUR: a stepwise machine learning framework for predicting metabolite-dependent regulatory interactions15
Nfinder: automatic inference of cell neighborhood in 2D and 3D using nuclear markers15
PhytoPipe: a phytosanitary pipeline for plant pathogen detection and diagnosis using RNA-seq data15
Sparse sliced inverse regression for high dimensional data analysis15
Detecting genomic deletions from high-throughput sequence data with unsupervised learning15
vcf2fhir: a utility to convert VCF files into HL7 FHIR format for genomics-EHR integration15
SVDNVLDA: predicting lncRNA-disease associations by Singular Value Decomposition and node2vec15
Strain-specific behavior of Mycobacterium tuberculosis in A549 lung cancer cell line15
Dyport: dynamic importance-based biomedical hypothesis generation benchmarking technique14
Cellstitch: 3D cellular anisotropic image segmentation via optimal transport14
Prop3D: A flexible, Python-based platform for machine learning with protein structural properties and biophysical data14
In silico drug repositioning using deep learning and comprehensive similarity measures14
xenoGI 3: using the DTLOR model to reconstruct the evolution of gene families in clades of microbes14
Multi-dimensional data integration algorithm based on random walk with restart14
NeuronBridge: an intuitive web application for neuronal morphology search across large data sets14
Robust optimization of convolutional neural networks with a uniform experiment design method: a case of phonocardiogram testing in patients with heart diseases14
A comparison of embedding aggregation strategies in drug–target interaction prediction14
Exploring cell-specific miRNA regulation with single-cell miRNA-mRNA co-sequencing data14
Amplidiff: an optimized amplicon sequencing approach to estimating lineage abundances in viral metagenomes14
Prediction of hot spots in protein–DNA binding interfaces based on discrete wavelet transform and wavelet packet transform14
KATZNCP: a miRNA–disease association prediction model integrating KATZ algorithm and network consistency projection14
PopMLvis: a tool for analysis and visualization of population structure using genotype data from genome-wide association studies14
Polynomial superlevel set representation of the multistationarity region of chemical reaction networks14
Cellograph: a semi-supervised approach to analyzing multi-condition single-cell RNA-sequencing data using graph neural networks14
nPoRe: n-polymer realigner for improved pileup-based variant calling14
PangenomeNet: a pan-genome-based network reveals functional modules on antimicrobial resistome for Escherichia coli strains14
Using BioPAX-Parser (BiP) to enrich lists of genes or proteins with pathway data14
MSPCD: predicting circRNA-disease associations via integrating multi-source data and hierarchical neural network14
Modeling relaxation experiments with a mechanistic model of gene expression13
DrGA: cancer driver gene analysis in a simpler manner13
0.118577003479