Database-The Journal of Biological Databases and Curation

Papers
(The median citation count of Database-The Journal of Biological Databases and Curation is 2. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-05-01 to 2025-05-01.)
Article | Citations
CBGDA: a manually curated resource for gene–disease associations based on genome-wide CRISPR | 124
The importance of graph databases and graph learning for clinical applications | 124
CenhANCER: a comprehensive cancer enhancer database for primary tissues and cell lines | 86
AVPCD: a plant-derived medicine database of antiviral phytochemicals for cancer, Covid-19, malaria and HIV | 73
Collecting and managing in situ banana genetic resources information (Musa spp.) using online resources and citizen science | 68
Building resource-efficient community databases using open-source software | 63
GrameneOryza: a comprehensive resource for Oryza genomes, genetic variation, and functional data | 53
FishTEDB 2.0: an update fish transposable element (TE) database with new functions to facilitate TE research | 50
BioKC: a collaborative platform for curation and annotation of molecular interactions | 48
CCIDB: a manually curated cell–cell interaction database with cell context information | 36
Multi-omics molecular biomarkers and database of osteoarthritis | 33
A roadmap for the functional annotation of protein families: a community perspective | 33
Pathway-based, reaction-specific annotation of disease variants for elucidation of molecular phenotypes | 29
Post-composing ontology terms for efficient phenotyping in plant breeding | 26
Assessing the performance of generative artificial intelligence in retrieving information against manually curated genetic and genomic data | 25
DisGeNet: a disease-centric interaction database among diseases and various associated genes | 25
NLM-Chem-BC7: manually annotated full-text resources for chemical entity annotation and indexing in biomedical articles | 24
CafeteriaSA corpus: scientific abstracts annotated across different food semantic resources | 23
The overview of the BioRED (Biomedical Relation Extraction Dataset) track at BioCreative VIII | 22
OncoCardioDB: a public and curated database of molecular information in onco-cardiology/cardio-oncology | 22
Phosprof: pathway analysis database of drug response based on phosphorylation activity measurements | 21
New approaches in developing medicinal herbs databases | 19
GeniePool: genomic database with corresponding annotated samples based on a cloud data lake architecture | 19
Genotype and phenotype data standardization, utilization and integration in the big data era for agricultural sciences | 19
ESOMIR: a curated database of biomarker genes and miRNAs associated with esophageal cancer | 17
SCANNER: a web platform for annotation, visualization and sharing of single cell RNA-seq data | 17
CO-19 PDB 2.0: A Comprehensive COVID-19 Database with Global Auto-Alerts, Statistical Analysis, and Cancer Correlations | 16
HOFE: an interactive forensic entomological database | 16
COVIDium: a COVID-19 resource compendium | 16
DSDBASE 2.0: updated version of DiSulphide dataBASE, a database on disulphide bonds in proteins | 15
AFTM: a database of transmembrane regions in the human proteome predicted by AlphaFold | 15
OBO Foundry in 2021: operationalizing open data principles to evaluate ontologies | 13
Interactive tools for functional annotation of bacterial genomes | 13
TFLink: an integrated gateway to access transcription factor–target gene interactions for multiple species | 13
BCEDB: a linear B-cell epitopes database for SARS-CoV-2 | 13
CaRPE: the Carbon Reduction Potential Evaluation tool for building climate mitigation scenarios on US agricultural lands | 12
PheNormGPT: a framework for extraction and normalization of key medical findings | 12
HoloFood Data Portal: holo-omic datasets for analysing host–microbiota interactions in animal production | 12
Acupuncture indication knowledge bases: meridian entity recognition and classification based on ACUBERT | 11
Optimized biomedical entity relation extraction method with data augmentation and classification using GPT-4 and Gemini | 11
Localizatome: a database for stress-dependent subcellular localization changes in proteins | 11
Towards discovery: an end-to-end system for uncovering novel biomedical relations | 11
OncoCTMiner: streamlining precision oncology trial matching via molecular profile analysis | 10
Centralizing neurofibromatosis experimental tool knowledge with the NF Research Tools Database | 10
GinkgoDB: an ecological genome database for the living fossil, Ginkgo biloba | 9
Anti-CRISPRdb v2.2: an online repository of anti-CRISPR proteins including information on inhibitory mechanisms, activities and neighbors of curated anti-CRISPR proteins | 9
StopKB: a comprehensive knowledgebase for nonsense suppression therapies | 9
Ontology Development Kit: a toolkit for building, maintaining and standardizing biomedical ontologies | 9
FungiProteomeDB: a database for the molecular weight and isoelectric points of the fungal proteomes | 9
AbAMPdb: a database of Acinetobacter baumannii specific antimicrobial peptides | 9
PharmaKoVariome database for supporting genetic testing | 9
Correction to: The overview of the BioRED (Biomedical Relation Extraction Dataset) track at BioCreative VIII | 8
SingleQ: a comprehensive database of single-cell expression quantitative trait loci (sc-eQTLs) cross human tissues | 8
Rapid automated validation, annotation and publication of SARS-CoV-2 sequences to GenBank | 8
Artificial Intelligence-based database for prediction of protein structure and their alterations in ocular diseases | 8
Visualization and exploration of linked data using virtual reality | 8
CardioHotspots: a database of mutational hotspots for cardiac disorders | 8
LICEDB: light industrial core enzyme database for industrial applications and AI enzyme design | 8
TopEx: topic exploration of COVID-19 corpora - Results from the BioCreative VII Challenge Track 4 | 8
Correction to: An interactive web application for exploring systemic lupus erythematosus blood transcriptomic diversity | 8
The Sickle Cell Disease Ontology: recent development and expansion of the universal sickle cell knowledge representation | 8
Integrated ACMG-approved genes and ICD codes for the translational research and precision medicine | 8
LSD600: the first corpus of biomedical abstracts annotated with lifestyle–disease relations | 8
Data sharing and ontology use among agricultural genetics, genomics, and breeding databases and resources of the Agbiodata Consortium | 8
JTIS: enhancing biomedical document-level relation extraction through joint training with intermediate steps | 8
SKIOME Project: a curated collection of skin microbiome datasets enriched with study-related metadata | 7
ProBioQuest: a database and semantic analysis engine for literature, clinical trials and patents related to probiotics | 7
Aerial Wildlife Image Repository for animal monitoring with drones in the age of artificial intelligence | 7
SMCVdb: a database of experimental cellular toxicity information for drug candidate molecules | 7
Conference report: Biocuration 2021 Virtual Conference | 7
Is metadata of articles about COVID-19 enough for multilabel topic classification task? | 7
Development of marine biodiversity database (BISMaL) to enable estimations past habitat conditions for marine life in the northwestern Pacific | 7
AFED, a comprehensive resource for Aspergillus flavus gene expression profiling | 7
An interactive web application for exploring systemic lupus erythematosus blood transcriptomic diversity | 7
IHM-DB: a curated collection of metagenomics data from the Indian Himalayan Region, and automated pipeline for 16S rRNA amplicon-based analysis (AutoQii2) | 7
Classifying domain-specific text documents containing ambiguous keywords | 6
AcetoBase Version 2: a database update and re-analysis of formyltetrahydrofolate synthetase amplicon sequencing data from anaerobic digesters | 6
TMC-SNPdb 2.0: an ethnic-specific database of Indian germline variants | 6
TRustDB: A comprehensive bioinformatics resource for understanding the complete Wheat—Stem rust host–pathogen interactome | 6
Toward clearer recognition and easier usefulness: development of a cross-lingual atherosclerotic cerebrovascular disease ontology | 6
ImmRNA: a database of RNAs associated with tumor immunity | 6
PASS2.7: a database containing structure-based sequence alignments and associated features of protein domain superfamilies from SCOPe | 6
ELiAH: the atlas of E3 ligases in human tissues for targeted protein degradation with reduced off-target effect | 6
MANUDB: database and application to retrieve and visualize mammalian NUMTs | 6
Automated extraction of genes associated with antibiotic resistance from the biomedical literature | 6
PETCH-DB: a Portal for Exploring Tissue-specific and Complex disease-associated 5-Hydroxymethylcytosines | 6
A review on antimicrobial peptides databases and the computational tools | 6
NbThermo: a new thermostability database for nanobodies | 6
GeMI: interactive interface for transformer-based Genomic Metadata Integration | 5
LitCovid ensemble learning for COVID-19 multi-label classification | 5
BENviewer: a gene interaction network visualization server based on graph embedding model | 5
Correction to: CardioHotspots: a database of mutational hotspots for cardiac disorders | 5
Transformer-based approach for symptom recognition and multilingual linking | 5
Integrating AI-powered text mining from PubTator into the manual curation workflow at the Comparative Toxicogenomics Database | 5
Correction to: The importance of graph databases and graph learning for clinical applications | 5
Pipeline to explore information on genome editing using large language models and genome editing meta-database | 5
Transverse aortic constriction multi-omics analysis uncovers pathophysiological cardiac molecular mechanisms | 5
gymnotoa-db: a database and application to optimize functional annotation in gymnosperms | 5
BCSCdb: a database of biomarkers of cancer stem cells | 5
CPMKG: a condition-based knowledge graph for precision medicine | 5
CancerMHL: the database of integrating key DNA methylation, histone modifications and lncRNAs in cancer | 5
The World Spider Trait database: a centralized global open repository for curated data on spider traits | 5
ProbResist: a database for drug-resistant probiotic bacteria | 5
ForestForward: visualizing and accessing integrated world forest data from the last 50 years | 5
Standardized pipelines support and facilitate integration of diverse datasets at the Rat Genome Database | 5
PlasticDB: a database of microorganisms and proteins linked to plastic biodegradation | 5
Filling knowledge gaps in insect conservation by leveraging genetic data from public archives | 5
Autophagy3D: a comprehensive autophagy structure database | 4
Standardized naming of microbiome samples in Genomes OnLine Database | 4
BPPRC database: a web-based tool to access and analyse bacterial pesticidal proteins | 4
cancercelllines.org—a novel resource for genomic variants in cancer cell lines | 4
MicroRNA childhood cancer catalog (M3Cs): a resource for translational bioinformatics toward health informatics in pediatric cancer | 4
ENCD: a manually curated database of experimentally supported endocrine system disease and lncRNA associations | 4
Peptipedia v2.0: a peptide sequence database and user-friendly web platform. A major update | 4
scBrainMap: a landscape for cell types and associated genetic markers in the brain | 4
TRGdb: a universal resource for the exploration of taxonomically restricted genes in bacteria | 4
ARAapp: filling gaps in the ecological knowledge of spiders using an automated and dynamic approach to analyze systematically collected community data | 4
mPPI: a database extension to visualize structural interactome in a one-to-many manner | 4
AneRBC dataset: a benchmark dataset for computer-aided anemia diagnosis using RBC images | 4
Emati: a recommender system for biomedical literature based on supervised learning | 4
AgingReG: a curated database of aging regulatory relationships in humans | 4
lncHUB2: aggregated and inferred knowledge about human and mouse lncRNAs | 4
TEx-MST: tissue expression profiles of MANE select transcripts | 4
HPVMD-C: a disease-based mutation database of human papillomavirus in China | 4
PYK-SubstitutionOME: an integrated database containing allosteric coupling, ligand affinity and mutational, structural, pathological, bioinformatic and computational information about pyruvate kinase | 4
The landscape of microRNA interaction annotation: analysis of three rare disorders as a case study | 4
LitSumm: large language models for literature summarization of noncoding RNAs | 4
Probe my Pathway (PmP): a portal to explore the chemical coverage of the human Reactome | 4
scEccDNAdb: an integrated single-cell eccDNA resource for human and mouse | 4
KinMod database: a tool for investigating metabolic regulation | 4
ProNet DB: a proteome-wise database for protein surface property representations and RNA-binding profiles | 3
Assessing the use of supplementary materials to improve genomic variant discovery | 3
VariantHunter: a method and tool for fast detection of emerging SARS-CoV-2 variants | 3
EfGD: the Erianthus fulvus genome database | 3
ImmuMethy, a database of DNA methylation plasticity at a single cytosine resolution in human blood and immune cells | 3
Correction to: Acinetobase: the comprehensive database and repository of Acinetobacter strains | 3
RegulaTome: a corpus of typed, directed, and signed relations between biomedical entities in the scientific literature | 3
piOxi database: a web resource of germline and somatic tissue piRNAs identified by chemical oxidation | 3
DrugRepoBank: a comprehensive database and discovery platform for accelerating drug repositioning | 3
Diseases 2.0: a weekly updated database of disease–gene associations from text mining and data integration | 3
AIMedGraph: a comprehensive multi-relational knowledge graph for precision medicine | 3
Do syntactic trees enhance Bidirectional Encoder Representations from Transformers (BERT) models for chemical–drug relation extraction? | 3
ReMeDy: a platform for integrating and sharing published stem cell research data with a focus on iPSC trials | 3
PCRMS: a database of predicted cis-regulatory modules and constituent transcription factor binding sites in genomes | 3
https://invertebratefungi.org/: an expert-curated web-based platform for the identification and classification of invertebrate-associated fungi and fungus-like organisms | 3
MantaID: a machine learning–based tool to automate the identification of biological database IDs | 3
MetaCOXI: an integrated collection of metazoan mitochondrial cytochrome oxidase subunit-I DNA sequences | 3
A combinatorial approach implementing new database structures to facilitate practical data curation management of QTL, association, correlation and heritability data on trait variants | 3
Pre-trained models, data augmentation, and ensemble learning for biomedical information extraction and document classification | 3
ImmuneData: an integrated data discovery system for immunology data repositories | 3
An open-access data set of pig skin anatomy and physiology for modelling purposes | 3
MDDOmics: multi-omics resource of major depressive disorder | 3
Chemical identification and indexing in PubMed full-text articles using deep learning and heuristics | 3
CBPDdb: a curated database of compounds derived from Coumarin–Benzothiazole–Pyrazole | 3
PDC: a highly compact file format to store protein 3D coordinates | 3
IBDTransDB: a manually curated transcriptomic database for inflammatory bowel disease | 3
Helping authors produce FAIR taxonomic data: evaluation of an author-driven phenotype data production prototype | 3
PDB NextGen Archive: centralizing access to integrated annotations and enriched structural information by the Worldwide Protein Data Bank | 3
A comprehensive experimental comparison between federated and centralized learning | 3
RNA-Chrom: a manually curated analytical database of RNA–chromatin interactome | 3
OakRootRNADB—a consolidated RNA-seq database for coding and noncoding RNA in roots of pedunculate oak (Quercus robur) | 3
CiliaMiner: an integrated database for ciliopathy genes and ciliopathies | 3
GeMo: a web-based platform for the visualization and curation of genome ancestry mosaics | 2
Best practices for the manual curation of intrinsically disordered proteins in DisProt | 2
Notes on the data quality of bibliographic records from the MEDLINE database | 2
VarGuideAtlas: a repository of variant interpretation guidelines | 2
MineProt: a stand-alone server for structural proteome curation | 2
FatPlants: a comprehensive information system for lipid-related genes and metabolic pathways in plants | 2
MACSFeD—a database of mosquito acoustic communication and swarming features | 2
RettDb: the Rett syndrome omics database to navigate the Rett syndrome genomic landscape | 2
The landscape of health disparities in the UK Biobank | 2
FibROAD: a manually curated resource for multi-omics level evidence integration of fibrosis research | 2
AI4FoodDB: a database for personalized e-Health nutrition and lifestyle through wearable devices and artificial intelligence | 2
Correction to: A Terpenoids Database with the Chemical Content as A Novel Agronomic Trait | 2
Assessing resource use: a case study with the Human Disease Ontology | 2
The biomedical relationship corpus of the BioRED track at the BioCreative VIII challenge and workshop | 2
The Immunopeptidomics Ontology (ImPO) | 2
Expression of Concern: DisGeNet: a disease-centric interaction database among diseases and various associated genes | 2
The TOXIN knowledge graph: supporting animal-free risk assessment of cosmetics | 2
emiRIT: a text-mining-based resource for microRNA information | 2
Biomedical relation extraction with knowledge base–refined weak supervision | 2
Acinetobase: the comprehensive database and repository of Acinetobacter strains | 2
MyxoPortal: a database of myxobacterial genomic features | 2
MBS: a genome browser annotation track for high-confident microRNA binding sites in whole human transcriptome | 2
PlantIntronDB: a database for plant introns that host functional elements | 2
hCoronavirusesDB: an integrated bioinformatics resource for human coronaviruses | 2
CaviDB: a database of cavities and their features in the structural and conformational space of proteins | 2
Chemical–protein relation extraction with ensembles of carefully tuned pretrained language models | 2
Fisheries data management systems in the NW Mediterranean: from data collection to web visualization | 2
The Genomic SSR Millets Database (GSMDB): enhancing genetic resources for sustainable agriculture | 2
nhanesA: achieving transparency and reproducibility in NHANES research | 2
Task reformulation and data-centric approach for Twitter medication name extraction | 2
Structural signatures: a web server for exploring a database of and generating protein structural features from human cell lines and tissues | 2
PHILM2Web: A high-throughput database of macromolecular host–pathogen interactions on the Web | 2
XePhIR: the zebrafish xenograft phenotype interactive repository | 2
CuPCA: a web server for pan-cancer association analysis of large-scale cuproptosis-related genes | 2
Genome-wide identification of SSR markers from coding regions for endangered Argania spinosa L. skeels and construction of SSR database: AsSSRdb | 2
A dataset of tumour-infiltrating lymphocytes in colorectal cancer patients using limited resources | 2
MiCK: a database of gut microbial genes linked with chemoresistance in cancer patients | 2
ChemBioPort: an online portal to navigate the structure, function and chemical inhibition of the human proteome | 2
Full-text chemical identification with improved generalizability and tagging consistency | 2