Journal of Cheminformatics

Papers
(The median citation count of Journal of Cheminformatics is 5. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-06-01 to 2026-06-01.)
ArticleCitations
Predicting chemical structure using reinforcement learning with a stack-augmented conditional variational autoencoder232
Generating diversity and securing completeness in algorithmic retrosynthesis121
European Registry of Materials: global, unique identifiers for (undisclosed) nanomaterials120
Moldina: a fast and accurate search algorithm for simultaneous docking of multiple ligands100
Explainable uncertainty quantifications for deep learning-based molecular property prediction90
Evolve with your research: stepwise system evolution from document-driven to fact-centric research data management in materials science90
Dimensionally reduced machine learning model for predicting single component octanol–water partition coefficients87
Biosynfoni: a biosynthesis-informed and interpretable lightweight molecular fingerprint76
An explainability framework for deep learning on chemical reactions exemplified by enzyme-catalysed reaction classification70
ANNalog: generation of MedChem-similar molecules68
Equivariant diffusion for structure-based de novo ligand generation with latent-conditioning63
Novel molecular design via a scaffold-aware transformer with multi-scale attention mechanisms63
ExPO: an exposure-conditioned neural operator for L1000 signature prediction61
Exploring the ability of machine learning-based virtual screening models to identify the functional groups responsible for binding60
Assessing interaction recovery of predicted protein-ligand poses58
HERGAI: an artificial intelligence tool for structure-based prediction of hERG inhibitors56
MolPrice: assessing synthetic accessibility of molecules based on market value51
Determining the parent and associated fragment formulae in mass spectrometry via the parent subformula graph51
Paths to cheminformatics: Q&A with Ann M. Richard50
AdapTor: Adaptive Topological Regression for quantitative structure–activity relationship modeling50
VSFlow: an open-source ligand-based virtual screening tool49
Novel molecule design with POWGAN, a policy-optimized Wasserstein generative adversarial network49
Computer-aided pattern scoring (C@PS): a novel cheminformatic workflow to predict ligands with rare modes-of-action48
One chiral fingerprint to find them all46
AutoTemplate: enhancing chemical reaction datasets for machine learning applications in organic chemistry45
Adaptmol: domain adaptation for molecular image recognition with limited supervision44
PMF-CPI: assessing drug selectivity with a pretrained multi-functional model for compound–protein interactions44
MLinvitroTox reloaded for high-throughput hazard-based prioritization of high-resolution mass spectrometry data43
NPBS Atlas: a comprehensive data resource for exploring the biological sources of natural products43
Fifteen years of ChEMBL and its role in cheminformatics and drug discovery43
Privileged structure-based molecular fingerprints for organic electronic materials: towards intuitive machine learning interpretation41
Deep learning of multimodal networks with topological regularization for drug repositioning41
Machine learning to predict food effects during drug development: a comprehensive review40
A quantum chemical dataset of interacting molecular pairs for chemical reaction studies40
APBIO: bioactive profiling of air pollutants through inferred bioactivity signatures and prediction of novel target interactions40
Analysis of the benefits of imputation models over traditional QSAR models for toxicity prediction40
AiZynthFinder 4.0: developments based on learnings from 3 years of industrial application40
The BinDiscover database: a biology-focused meta-analysis tool for 156,000 GC–TOF MS metabolome samples39
NanoBinder: a machine learning assisted nanobody binding prediction tool using Rosetta energy scores38
Crossover operators for molecular graphs with an application to virtual drug screening37
Shinyscreen: mass spectrometry data inspection and quality checking utility36
Implementation of an open chemistry knowledge base with a Semantic Wiki36
Bitter peptide prediction using graph neural networks35
Advancements in thermochemical predictions: a multi-output thermodynamics-informed neural network approach34
Collision-free morgan fingerprints: a principled approach to enhance machine learning performance and interpretability in chemistry34
Explaining compound activity predictions with a substructure-aware loss for graph neural networks33
ProjFusNet: deep neural network for peptide precursor prediction using projection-fused protein language model and structural features33
Prediction model for chemical explosion consequences via multimodal feature fusion33
Enhancing chemical reaction search through contrastive representation learning and human-in-the-loop32
PURE: policy-guided unbiased REpresentations for structure-constrained molecular generation32
PyL3dMD: Python LAMMPS 3D molecular descriptors package32
Structure-based machine learning screening identifies natural product candidates as potential geroprotectors31
The development of the generative adversarial supporting vector machine for molecular property generation31
Accelerated hit identification with target evaluation, deep learning and automated labs: prospective validation in IRAK131
A comprehensive evaluation of advanced methods for identifying structural alerts using extensive toxicity data30
Predictive modeling of visible-light azo-photoswitches’ properties using structural features30
CRAFT: a web-integrated cavity prediction tool based on flow transfer algorithm30
ProQSAR: A modular and reproducible framework for small-data QSAR modeling with fit-and-use models29
On the difficulty of validating molecular generative models realistically: a case study on public and proprietary data29
Papyrus: a large-scale curated dataset aimed at bioactivity predictions29
How quantum-chemical geometry optimization level affects classical 3D descriptors and QSAR performance: a comparative study29
Summarizing relationships between chemicals, genes, proteins, and diseases in PubChem using analysis of their co-occurrences in patents28
A systematic review of deep learning chemical language models in recent era28
GT-NMR: a novel graph transformer-based approach for accurate prediction of NMR chemical shifts27
Ionization efficiency prediction of electrospray ionization mass spectrometry analytes based on molecular fingerprints and cumulative neutral losses27
UMAP-based clustering split for rigorous evaluation of AI models for virtual screening on cancer cell lines*27
Comprehensive benchmarking of computational tools for predicting toxicokinetic and physicochemical properties of chemicals27
Accurate structure-activity relationship prediction of antioxidant peptides using a multimodal deep learning framework27
Rxn-INSIGHT: fast chemical reaction analysis using bond-electron matrices26
New algorithms demonstrate untargeted detection of chemically meaningful changing units and formula assignment for HRMS data of polymeric mixtures in the open-source constellation web application26
VGSC-DB: an online database of voltage-gated sodium channels26
Scaffold Generator: a Java library implementing molecular scaffold functionalities in the Chemistry Development Kit (CDK)26
ScaffoldGVAE: scaffold generation and hopping of drug molecules via a variational autoencoder based on multi-view graph neural networks24
FlavorMiner: a machine learning platform for extracting molecular flavor profiles from structural data24
PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank24
Gromacs MetaDump: a tool for extracting GROMACS simulation metadata24
Infrared spectrum analysis of organic molecules with neural networks using standard reference data sets in combination with real-world data24
Notes on molecular fragmentation and parameter settings for a dissipative particle dynamics study of a C10E4/water mixture with lamellar bilayer formation23
Using test-time augmentation to investigate explainable AI: inconsistencies between method, model and human intuition23
Visualising lead optimisation series using reduced graphs23
Implementation of a soft grading system for chemistry in a Moodle plugin23
Applying atomistic neural networks to bias conformer ensembles towards bioactive-like conformations23
How to crack a SMILES: automatic crosschecked chemical structure resolution across multiple services using MoleculeResolver23
Activity cliff-aware reinforcement learning for de novo drug design22
Subgrapher: visual fingerprinting of chemical structures22
The specification game: rethinking the evaluation of drug response prediction for precision oncology22
Reaction rebalancing: a novel approach to curating reaction databases22
Automatic molecular fragmentation by evolutionary optimisation22
YoDe-Segmentation: automated noise-free retrieval of molecular structures from scientific publications22
AI-powered prediction of critical properties and boiling points: a hybrid ensemble learning and QSPR approach22
Chemical rules for optimization of chemical mutagenicity via matched molecular pairs analysis and machine learning methods21
Novel multi-objective affinity approach allows to identify pH-specific μ-opioid receptor agonists21
Ilm-NMR-P31: an open-access 31P nuclear magnetic resonance database and data-driven prediction of 31P NMR shifts21
Data mining of PubChem bioassay records reveals diverse OXPHOS inhibitory chemotypes as potential therapeutic agents against ovarian cancer21
Achieving well-informed decision-making in drug discovery: a comprehensive calibration study using neural network-based structure-activity models20
Towards a partial order graph for interactive pharmacophore exploration: extraction of pharmacophores activity delta20
DeepSA: a deep-learning driven predictor of compound synthesis accessibility20
AI-driven molecular generation of not-patented pharmaceutical compounds using world open patent data19
Searching chemical databases in the pre-history of cheminformatics19
Deepmol: an automated machine and deep learning framework for computational chemistry19
A transformer based generative chemical language AI model for structural elucidation of organic compounds19
Leveraging computational tools to combat malaria: assessment and development of new therapeutics19
SPARFlow: a KNIME workflow for integrated structure–activity or structure–property relationship analysis19
TCMSID: a simplified integrated database for drug discovery from traditional chinese medicine19
Torsion angular bin strings: algorithmic update and additional validation18
Integrating synthetic accessibility with AI-based generative drug design18
HepatoToxicity Portal (HTP): an integrated database of drug-induced hepatotoxicity knowledgebase and graph neural network-based prediction model18
piscesCSM: prediction of anticancer synergistic drug combinations18
Enhancing molecular property prediction with quantized GNN models18
Multimodal graph fusion with statistically guided parsimonious descriptor selection for molecular property prediction18
Multiscale analysis and optimal glioma therapeutic candidate discovery using the CANDO platform18
Systematic benchmarking of 13 AI methods for predicting cyclic peptide membrane permeability18
Correction: Enhanced Thompson sampling by roulette wheel selection for screening ultralarge combinatorial libraries18
MolNexTR: a generalized deep learning model for molecular image recognition18
UnCorrupt SMILES: a novel approach to de novo design18
Geometric deep learning for molecular property predictions with chemical accuracy across chemical space18
Paths to cheminformatics: Q&A with Phyo Phyo Kyaw Zin18
TB-IECS: an accurate machine learning-based scoring function for virtual screening18
AI/ML methodologies and the future-will they be successful in designing the next generation of new chemical entities?18
Nanodesigner: resolving the complex-CDR interdependency with iterative refinement17
One size does not fit all: revising traditional paradigms for assessing accuracy of QSAR models used for virtual screening17
Decrypting orphan GPCR drug discovery via multitask learning17
Contrastive representation learning and capsule networks enable accurate identification of ferroptosis-related proteins17
Tuning gradient boosting for imbalanced bioassay modelling with custom loss functions17
What makes a reaction network “chemical”?17
Advancing material property prediction: using physics-informed machine learning models for viscosity16
Human-in-the-loop active learning for goal-oriented molecule generation16
LAGNet: better electron density prediction for LCAO-based data and drug-like substances16
rMSIfragment: improving MALDI-MSI lipidomics through automated in-source fragment annotation16
strainedSMILES2xyz: a workflow for reliable 3D structures of strained molecules from SMILES16
Assessing the factors influencing the quality of pocket-conditioned 3D generative models16
Automated molecular structure segmentation from documents using ChemSAM16
VitroBert: modeling DILI by pretraining BERT on in vitro data16
Predicting fire consequences with the transformer model based on multimodal feature fusion16
DECIMER—hand-drawn molecule images dataset16
RGReco: a unified framework for automated R-group recognition in chemical publications16
PROTEOMAS: a workflow enabling harmonized proteomic meta-analysis and proteomic signature mapping15
PIKAChU: a Python-based informatics kit for analysing chemical units15
Solvent flashcards: a visualisation tool for sustainable chemistry15
Correction: Advancements in thermochemical predictions: a multi-output thermodynamics-informed neural network approach15
Nipah Virus Inhibitor Knowledgebase (NVIK): a combined evidence approach to prioritise small molecule inhibitors15
VNFlow: integration of variational autoencoders and normalizing flows for novel molecular design15
Application of machine reading comprehension techniques for named entity recognition in materials science15
Prediction of UGT-mediated phase II metabolism via ligand- and structure-based predictive models15
ReMODE: a deep learning-based web server for target-specific drug design15
Suitability of large language models for extraction of high-quality chemical reaction dataset from patent literature15
Ontologies4Cat: investigating the landscape of ontologies for catalysis research data management15
qHTSWaterfall: 3-dimensional visualization software for quantitative high-throughput screening (qHTS) data15
A novel multitask learning algorithm for tasks with distinct chemical space: zebrafish toxicity prediction as an example15
OWSum: algorithmic odor prediction and insight into structure-odor relationships14
Benchmarking ML in ADMET predictions: the practical impact of feature representations in ligand-based models14
Evaluation of chirality descriptors derived from SMILES heteroencoders14
Llamol: a dynamic multi-conditional generative transformer for de novo molecular design14
SMILES all around: structure to SMILES conversion for transition metal complexes14
Evaluating uncertainty-based active learning for accelerating the generalization of molecular property prediction14
Improving protein-ligand complex generation with force field guidance14
XSMILES: interactive visualization for molecules, SMILES and XAI attribution scores14
Predicting the critical micelle concentration of binary surfactant mixtures using machine learning14
From papers to RDF-based integration of physicochemical data and adverse outcome pathways for nanomaterials14
PromptSMILES: prompting for scaffold decoration and fragment linking in chemical language models13
Chemical characteristics vectors map the chemical space of natural biomes from untargeted mass spectrometry data13
Large-scale comparison of machine learning methods for profiling prediction of kinase inhibitors13
Pretraining graph transformers with atom-in-a-molecule quantum properties for improved ADMET modeling13
PKSmart: an open-source computational model to predict intravenous pharmacokinetics of small molecules13
Enhanced Thompson sampling by roulette wheel selection for screening ultralarge combinatorial libraries13
InterDILI: interpretable prediction of drug-induced liver injury through permutation feature importance and attention mechanism13
Capsule graph networks for accurate and interpretable crystalline materials property prediction13
Uncovering molecular determinants of potency and binding affinity in hit compounds targeting FGF14/Nav1.6 complex13
MDDI-SCL: predicting multi-type drug-drug interactions via supervised contrastive learning13
graphpancake: a Python package for representing organic molecules as molecular graphs utilizing electronic structure theory12
Large language models open new way of AI-assisted molecule design for chemists12
Integrating QSAR modelling with reinforcement learning for Syk inhibitor discovery12
FAME3R: an efficient, practical and reliable open-source tool for predicting phase 1 and phase 2 sites of metabolism12
DrugDiff: small molecule diffusion model with flexible guidance towards molecular properties12
A comparison of approaches to accessing existing biological and chemical relational databases via SPARQL12
kMoL: an open-source machine and federated learning library for drug discovery12
Defining peptides in ChEBI12
Correction: Reconstruction of lossless molecular representations from fingerprints12
Prediction of blood–brain barrier and Caco-2 permeability through the Enalos Cloud Platform: combining contrastive learning and atom-attention message passing neural networks12
E-GuARD: expert-guided augmentation for the robust detection of compounds interfering with biological assays12
Building shape-focused pharmacophore models for effective docking screening12
Naturally-meaningful and efficient descriptors: machine learning of material properties based on robust one-shot ab initio descriptors12
Small molecule autoencoders: architecture engineering to optimize latent space utility and sustainability11
InflamNat: web-based database and predictor of anti-inflammatory natural products11
Benchmarking molecular conformer augmentation with context-enriched training: graph-based transformer versus GNN models11
Physicochemical modelling of the retention mechanism of temperature-responsive polymeric columns for HPLC through machine learning algorithms11
Advantages of two quantum programming platforms in quantum computing and quantum chemistry11
A molecule perturbation software library and its application to study the effects of molecular design constraints11
BitterMatch: recommendation systems for matching molecules with bitter taste receptors11
COMA: efficient structure-constrained molecular generation using contractive and margin losses11
Machine learning-driven generation and screening of potential ionic liquids for cellulose dissolution11
Enhancing atom mapping with multitask learning and symmetry-aware deep graph matching11
Generate what you can make: achieving in-house synthesizability with readily available resources in de novo drug design11
TraceMetrix: a traceable metabolomics interactive analysis platform11
Double-head transformer neural network for molecular property prediction11
Relative molecule self-attention transformer10
PermuteDDS: a permutable feature fusion network for drug-drug synergy prediction10
Matched pairs demonstrate robustness against inter-assay variability10
Exploring QSAR models for activity-cliff prediction10
StreaMD: the toolkit for high-throughput molecular dynamics simulations10
Contrastive explanations for machine learning predictions in chemistry10
Empowering federated learning for robust compound-protein interaction prediction across heterogeneous cross-pharma domains10
PubChem synonym filtering process using crowdsourcing10
A comprehensive comparison of deep learning-based compound-target interaction prediction models to unveil guiding design principles10
Reproducible MS/MS library cleaning pipeline in matchms10
Improving VAE based molecular representations for compound property prediction10
Transfer learning across different chemical domains: virtual screening of organic materials with deep learning models pretrained on small molecule and chemical reaction data10
Classification of battery compounds using structure-free Mendeleev encodings10
Syn-MolOpt: a synthesis planning-driven molecular optimization method using data-derived functional reaction templates10
TUCAN: A molecular identifier and descriptor applicable to the whole periodic table from hydrogen to oganesson10
Sequence-based drug-target binding site pre-training enables cryptic pocket detection and improves binding affinity and kinetics prediction9
Ai derivation and exploration of antibiotic class spaces9
TransExION: a transformer based explainable similarity metric for comparing IONS in tandem mass spectrometry9
PINNED: identifying characteristics of druggable human proteins using an interpretable neural network9
Deep learning for assay nuisance compound detection using a gated co-attention graph embedding model (CAGE-Fusion)9
Principles and requirements for nanomaterial representations to facilitate machine processing and cooperation with nanoinformatics tools9
Combatting over-specialization bias in growing chemical databases9
Barlow Twins deep neural network for advanced 1D drug–target interaction prediction9
In-silico target prediction by ensemble chemogenomic model based on multi-scale information of chemical structures and protein sequences9
Mass-Suite: a novel open-source python package for high-resolution mass spectrometry data analysis9
Enhancing molecular property prediction with auxiliary learning and task-specific adaptation9
Learnable protein representations in computational biology for predicting drug-target affinity9
BioisoIdentifier: an online free tool to investigate local structural replacements from PDB9
DLM-DTI: a dual language model for the prediction of drug-target interaction with hint-based learning9
A pipeline for developing AI-driven models to predict molecular initiating events: a case study on neural tube defects9
Improved estimation of intrinsic solubility of drug-like molecules through multi-task graph transformer9
De novo design of anticancer 4-thiazolidinone derivatives: a generative framework shaped by activity cliffs9
Paths to cheminformatics: Q&A with Rajarshi Guha9
DeepRNA-DTI: a deep learning approach for RNA-compound interaction prediction with binding site interpretability9
xBitterT5: an explainable transformer-based framework with multimodal inputs for identifying bitter-taste peptides8
A novel approach for enhancing the potency of kinase inhibitors using topological water networks8
Unveiling polyphenol-protein interactions: a comprehensive computational analysis8
Box embeddings for extending ontologies: a data-driven and interpretable approach8
Evaluating ligand docking methods for drugging protein–protein interfaces: insights from AlphaFold2 and molecular dynamics refinement8
Graph-based transformer to predict the octanol–water partition coefficient8
Large-scale evaluation of k-fold cross-validation ensembles for uncertainty estimation8
Graph neural processes for molecules: an evaluation on docking scores and strategies to improve generalization8
SMPR: a structure-enhanced multimodal drug‒disease prediction model for drug repositioning and cold start8
ChemScreener: an active learning enabled hit discovery workflow with WDR5 inhibitor case study8
“DompeKeys”: a set of novel substructure-based descriptors for efficient chemical space mapping, development and structural interpretation of machine learning models, and indexing of large databases8
Investigation of the structure-odor relationship using a Transformer model8
Application of the digital annealer unit in optimizing chemical reaction conditions for enhanced production yields8
Multi-fidelity graph neural networks for predicting toluene/water partition coefficients8
Molecular embedding-based algorithm selection in protein-ligand docking8
Integrating concept of pharmacophore with graph neural networks for chemical property prediction and interpretation8
hERGAT: predicting hERG blockers using graph attention mechanism through atom- and molecule-level interaction analyses8
MEF-AlloSite: an accurate and robust Multimodel Ensemble Feature selection for the Allosteric Site identification model7
Context-dependent similarity analysis of analogue series for structure–activity relationship transfer based on a concept from natural language processing7
UmetaFlow: an untargeted metabolomics workflow for high-throughput data processing and analysis7
Group graph: a molecular graph representation with enhanced performance, efficiency and interpretability7
0.12946009635925