Database-The Journal of Biological Databases and Curation

Papers
(The median citation count of Database-The Journal of Biological Databases and Curation is 2. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2020-11-01 to 2024-11-01.)
Article | Citations
Post-translational modifications in proteins: resources, tools and prediction methods | 405
OBO Foundry in 2021: operationalizing open data principles to evaluate ontologies | 105
bc-GenExMiner 4.5: new mining module computes breast cancer differential gene expression analyses | 90
PlasticDB: a database of microorganisms and proteins linked to plastic biodegradation | 88
A review on antimicrobial peptides databases and the computational tools | 60
Diseases 2.0: a weekly updated database of disease–gene associations from text mining and data integration | 57
GrainGenes: a data-rich repository for small grains genetics and genomics | 45
TFLink: an integrated gateway to access transcription factor–target gene interactions for multiple species | 42
UbiNet 2.0: a verified, classified, annotated and updated database of E3 ubiquitin ligase–substrate interactions | 40
BRACS: A Dataset for BReAst Carcinoma Subtyping in H&E Histology Images | 40
The World Spider Trait database: a centralized global open repository for curated data on spider traits | 38
A Simple Standard for Sharing Ontological Mappings (SSSOM) | 38
Curation of over 10 000 transcriptomic studies to enable data reuse | 29
Ontology Development Kit: a toolkit for building, maintaining and standardizing biomedical ontologies | 28
Human IRES Atlas: an integrative platform for studying IRES-driven translational regulation in humans | 28
An overview of graph databases and their applications in the biomedical domain | 25
MetamORF: a repository of unique short open reading frames identified by both experimental and computational approaches for gene and metagene analyses | 24
BPPRC database: a web-based tool to access and analyse bacterial pesticidal proteins | 23
SynLethDB 2.0: a web-based knowledge graph database on synthetic lethality for novel anticancer drug discovery | 23
A roadmap for the functional annotation of protein families: a community perspective | 23
Scaling up oligogenic diseases research with OLIDA: the Oligogenic Diseases Database | 22
HPVMD-C: a disease-based mutation database of human papillomavirus in China | 22
Drugmonizome and Drugmonizome-ML: integration and abstraction of small molecule attributes for drug enrichment analysis and machine learning | 21
Multi-label classification for biomedical literature: an overview of the BioCreative VII LitCovid Track for COVID-19 literature topic annotations | 21
phytochemdb: a platform for virtual screening and computer-aided drug designing | 20
HBFP: a new repository for human body fluid proteome | 20
Peptipedia: a user-friendly web application and a comprehensive database for peptide research supported by Machine Learning approach | 19
An update of KAIKObase, the silkworm genome database | 19
The landscape of nutri-informatics: a review of current resources and challenges for integrative nutrition research | 18
Curation of a reference database of COI sequences for insect identification through DNA metabarcoding: COins | 18
The Progenetix oncogenomic resource in 2021 | 17
COVIDOUTCOME—estimating COVID severity based on mutation signatures in the SARS-CoV-2 genome | 16
SilkBase: an integrated transcriptomic and genomic database for Bombyx mori and related species | 15
Overview of DrugProt task at BioCreative VII: data and methods for large-scale text mining and knowledge graph generation of heterogenous chemical–protein relations | 14
Anti-CRISPRdb v2.2: an online repository of anti-CRISPR proteins including information on inhibitory mechanisms, activities and neighbors of curated anti-CRISPR proteins | 14
A review of the International Seabed Authority database DeepData from a biological perspective: challenges and opportunities in the UN Ocean Decade | 14
CANNUSE, a database of traditional Cannabis uses—an opportunity for new research | 14
NbThermo: a new thermostability database for nanobodies | 13
CMBD: a manually curated cancer metabolic biomarker knowledge database | 12
Automatization and self-maintenance of the O-GlcNAcome catalog: a smart scientific database | 11
GinkgoDB: an ecological genome database for the living fossil, Ginkgo biloba | 11
NEMAR: an open access data, tools and compute resource operating on neuroelectromagnetic data | 11
https://botryosphaeriales.org/, an online platform for up-to-date classification and account of taxa of Botryosphaeriales | 11
Wormicloud: a new text summarization tool based on word clouds to explore the C. elegans literature | 11
The Breeding Information Management System (BIMS): an online resource for crop breeding | 11
BCSCdb: a database of biomarkers of cancer stem cells | 11
Increasing metadata coverage of SRA BioSample entries using deep learning–based named entity recognition | 11
GeMI: interactive interface for transformer-based Genomic Metadata Integration | 10
PCRMS: a database of predicted cis-regulatory modules and constituent transcription factor binding sites in genomes | 10
APICURON: a database to credit and acknowledge the work of biocurators | 10
EpiSurf: metadata-driven search server for analyzing amino acid changes within epitopes of SARS-CoV-2 and other viral species | 10
Functionathon: a manual data mining workflow to generate functional hypotheses for uncharacterized human proteins and its application by undergraduate students | 9
MMV-db: vaccinomics and RNA-based therapeutics database for infectious hemorrhagic fever-causing mammarenaviruses | 9
MGTdb: a web service and database for studying the global and local genomic epidemiology of bacterial pathogens | 9
MetaCOXI: an integrated collection of metazoan mitochondrial cytochrome oxidase subunit-I DNA sequences | 9
TIDB: a comprehensive database of trained immunity | 9
emiRIT: a text-mining-based resource for microRNA information | 9
IBDDB: a manually curated and text-mining-enhanced database of genes involved in inflammatory bowel disease | 9
Challenges and opportunities for mining adverse drug reactions: perspectives from pharma, regulatory agencies, healthcare providers and consumers | 9
COTTONOMICS: a comprehensive cotton multi-omics database | 9
FibROAD: a manually curated resource for multi-omics level evidence integration of fibrosis research | 9
Tripal MegaSearch: a tool for interactive and customizable query and download of big data | 8
Integration of 1:1 orthology maps and updated datasets into Echinobase | 8
H3ABioNet genomic medicine and microbiome data portals hackathon proceedings | 8
Nabe: an energetic database of amino acid mutations in protein–nucleic acid binding interfaces | 8
Extraction of causal relations based on SBEL and BERT model | 7
PearMODB: a multiomics database for pear (Pyrus) genomics, genetics and breeding study | 7
SLOAD: a comprehensive database of cancer-specific synthetic lethal interactions for precision cancer therapy via multi-omics analysis | 7
mPPI: a database extension to visualize structural interactome in a one-to-many manner | 7
Review of databases for experimentally validated human microRNA–mRNA interactions | 7
Chemical identification and indexing in PubMed full-text articles using deep learning and heuristics | 7
GNIFdb: a neoantigen intrinsic feature database for glioma | 7
Chemical–protein relation extraction with ensembles of carefully tuned pretrained language models | 7
PLBD: protein–ligand binding database of thermodynamic and kinetic intrinsic parameters | 7
The United States Swine Pathogen Database: integrating veterinary diagnostic laboratory sequence data to monitor emerging pathogens of swine | 7
Bioinformatics tools developed to support BioCompute Objects | 6
PlantGF: an analysis and annotation platform for plant gene families | 6
NanoLAS: a comprehensive nanobody database with data integration, consolidation and application | 6
CarrotOmics: a genetics and comparative genomics database for carrot (Daucus carota) | 6
Application of beta and gamma carbonic anhydrase sequences as tools for identification of bacterial contamination in the whole genome sequence of inbred Wuzhishan minipig (Sus scrofa) annotated in dat | 6
SalivaDB—a comprehensive database for salivary biomarkers in humans | 6
RNA-Chrom: a manually curated analytical database of RNA–chromatin interactome | 6
Continuous development of the semantic search engine preVIEW: from COVID-19 to long COVID | 6
Prototheca-ID: a web-based application for molecular identification of Prototheca species | 6
Gene Ontology curation of the blood–brain barrier to improve the analysis of Alzheimer’s and other neurological diseases | 6
Genotype and phenotype data standardization, utilization and integration in the big data era for agricultural sciences | 6
scBrainMap: a landscape for cell types and associated genetic markers in the brain | 6
PSINDB: the postsynaptic protein–protein interaction database | 6
Multi-omics molecular biomarkers and database of osteoarthritis | 5
The Sickle Cell Disease Ontology: recent development and expansion of the universal sickle cell knowledge representation | 5
dbBIP: a comprehensive bipolar disorder database for genetic research | 5
CNVIntegrate: the first multi-ethnic database for identifying copy number variations associated with cancer | 5
Standardization of assay representation in the Ontology for Biomedical Investigations | 5
Neodb: a comprehensive neoantigen database and discovery platform for cancer immunotherapy | 5
Acinetobase: the comprehensive database and repository of Acinetobacter strains | 5
Developing TeroENZ and TeroMAP modules for the terpenome research platform TeroKit | 5
COVIDium: a COVID-19 resource compendium | 5
New approaches in developing medicinal herbs databases | 5
PolyQ Database—an integrated database on polyglutamine diseases | 5
CPMCP: a database of Chinese patent medicine and compound prescription | 5
Coronavirus Immunotherapeutic Consortium Database | 5
SwissBioPics—an interactive library of cell images for the visualization of subcellular location data | 5
lncHUB2: aggregated and inferred knowledge about human and mouse lncRNAs | 5
SITVITBovis—a publicly available database and mapping tool to get an improved overview of animal and human cases caused by Mycobacterium bovis | 4
SCANNER: a web platform for annotation, visualization and sharing of single cell RNA-seq data | 4
Evaluating the predictive accuracy of curated biological pathways in a public knowledgebase | 4
Challenges for FAIR-compliant description and comparison of crop phenotype data with standardized controlled vocabularies | 4
Rapid automated validation, annotation and publication of SARS-CoV-2 sequences to GenBank | 4
CenhANCER: a comprehensive cancer enhancer database for primary tissues and cell lines | 4
PYK-SubstitutionOME: an integrated database containing allosteric coupling, ligand affinity and mutational, structural, pathological, bioinformatic and computational information about pyruvate kinase | 4
DGPD: a knowledge database of dense granule proteins of the Apicomplexa | 4
JAMIR-eQTL: Japanese genome-wide identification of microRNA expression quantitative trait loci across dementia types | 4
Food Enzyme Database (FEDA): a web application gathering information about food enzyme preparations available on the European market | 4
QSDB—a graphical Quorum Sensing Database | 4
Automatic Extraction of Medication Mentions from Tweets—Overview of the BioCreative VII Shared Task 3 Competition | 4
Classifying domain-specific text documents containing ambiguous keywords | 4
NLM-Chem-BC7: manually annotated full-text resources for chemical entity annotation and indexing in biomedical articles | 4
Analysis and review of techniques and tools based on machine learning and deep learning for prediction of lysine malonylation sites in protein sequences | 4
Standardized naming of microbiome samples in Genomes OnLine Database | 4
A transcriptome atlas and interactive analysis platform for autoimmune disease | 4
A nomenclature for echinoderm genes | 4
ProbResist: a database for drug-resistant probiotic bacteria | 4
ReMeDy: a platform for integrating and sharing published stem cell research data with a focus on iPSC trials | 4
dbGENVOC: database of GENomic Variants of Oral Cancer, with special reference to India | 4
AFTM: a database of transmembrane regions in the human proteome predicted by AlphaFold | 4
InSexBase: an annotated genomic resource of sex chromosomes and sex-biased genes in insects | 4
circExp database: an online transcriptome platform for human circRNA expressions in cancers | 4
ChemBioPort: an online portal to navigate the structure, function and chemical inhibition of the human proteome | 4
SKIOME Project: a curated collection of skin microbiome datasets enriched with study-related metadata | 4
PSL-LCCL: a resource for subcellular protein localization in liver cancer cell line SK_HEP1 | 4
Mining drug–target and drug–adverse drug reaction databases to identify target–adverse drug reaction relationships | 4
PeptiHub: a curated repository of precisely annotated cancer-related peptides with advanced utilities for peptide exploration and discovery | 4
NetREx: Network-based Rice Expression Analysis Server for abiotic stress conditions | 3
Chemical identification and indexing in full-text articles: an overview of the NLM-Chem track at BioCreative VII | 3
Pre-trained models, data augmentation, and ensemble learning for biomedical information extraction and document classification | 3
AcetoBase Version 2: a database update and re-analysis of formyltetrahydrofolate synthetase amplicon sequencing data from anaerobic digesters | 3
EfGD: the Erianthus fulvus genome database | 3
TopEx: topic exploration of COVID-19 corpora - Results from the BioCreative VII Challenge Track 4 | 3
CropGF: a comprehensive visual platform for crop gene family mining and analysis | 3
A BERT-based ensemble learning approach for the BioCreative VII challenges: full-text chemical identification and multi-label classification in PubMed articles | 3
LeishMANIAdb: a comparative resource for Leishmania proteins | 3
HIR V2: a human interactome resource for the biological interpretation of differentially expressed genes via gene set linkage analysis | 3
ImmuneData: an integrated data discovery system for immunology data repositories | 3
Automated extraction of genes associated with antibiotic resistance from the biomedical literature | 3
ImmuMethy, a database of DNA methylation plasticity at a single cytosine resolution in human blood and immune cells | 3
ESOMIR: a curated database of biomarker genes and miRNAs associated with esophageal cancer | 3
FooDrugs: a comprehensive food–drug interactions database with text documents and transcriptional data | 3
AVPCD: a plant-derived medicine database of antiviral phytochemicals for cancer, Covid-19, malaria and HIV | 3
AI4FoodDB: a database for personalized e-Health nutrition and lifestyle through wearable devices and artificial intelligence | 3
dbAQP-SNP: a database of missense single-nucleotide polymorphisms in human aquaporins | 3
GeMo: a web-based platform for the visualization and curation of genome ancestry mosaics | 3
XePhIR: the zebrafish xenograft phenotype interactive repository | 3
A sequence labeling framework for extracting drug–protein relations from biomedical literature | 3
Translational drug–interaction corpus | 3
PETCH-DB: a Portal for Exploring Tissue-specific and Complex disease-associated 5-Hydroxymethylcytosines | 3
Halophytes.tn: an innovative database for Tunisian halophyte plant identification, distribution and characterization | 3
DSDBASE 2.0: updated version of DiSulphide dataBASE, a database on disulphide bonds in proteins | 3
LiqBioer: a manually curated database of cancer biomarkers in body fluid | 3
hCoronavirusesDB: an integrated bioinformatics resource for human coronaviruses | 3
BC-TFdb: a database of transcription factor drivers in breast cancer | 3
nhanesA: achieving transparency and reproducibility in NHANES research | 3
OUP accepted manuscript | 3
Managing and monitoring a pandemic: showcasing a practical approach for the genomic surveillance of SARS-CoV-2 | 3
covid19census: U.S. and Italy COVID-19 metrics and other epidemiological data | 2
Overview of the COVID-19 text mining tool interactive demonstration track in BioCreative VII | 2
Large-scale regulatory and signaling network assembly through linked open data | 2
LINPS: a database for cancer-cell-specific perturbations of biological networks | 2
A combinatorial approach implementing new database structures to facilitate practical data curation management of QTL, association, correlation and heritability data on trait variants | 2
Emati: a recommender system for biomedical literature based on supervised learning | 2
An open-access data set of pig skin anatomy and physiology for modelling purposes | 2
TEx-MST: tissue expression profiles of MANE select transcripts | 2
SinEx DB 2.0 update 2020: database for eukaryotic single-exon coding sequences | 2
Authors’ attitude toward adopting a new workflow to improve the computability of phenotype publications | 2
MEDFORD: A human- and machine-readable metadata markup language | 2
MicroRNA childhood cancer catalog (M3Cs): a resource for translational bioinformatics toward health informatics in pediatric cancer | 2
SC2sepsis: sepsis single-cell whole gene expression database | 2
RegulaTome: a corpus of typed, directed, and signed relations between biomedical entities in the scientific literature | 2
HCDT: an integrated highly confident drug–target resource | 2
Biomedical relation extraction with knowledge base–refined weak supervision | 2
CaviDB: a database of cavities and their features in the structural and conformational space of proteins | 2
https://www.fungiofpakistan.com: a continuously updated online database of fungi in Pakistan | 2
Notes on the data quality of bibliographic records from the MEDLINE database | 2
CBPDdb: a curated database of compounds derived from Coumarin–Benzothiazole–Pyrazole | 2
HSDatabase—a database of highly similar duplicate genes from plants, animals, and algae | 2
Fisheries data management systems in the NW Mediterranean: from data collection to web visualization | 2
LitCovid ensemble learning for COVID-19 multi-label classification | 2
Tissue-specific transcriptomes reveal potential mechanisms of microbiome heterogeneity in an ancient fish | 2
Reviewing knowledgebase and database grant proposals in the life sciences: the role of innovation | 2
HFIP: an integrated multi-omics data and knowledge platform for the precision medicine of heart failure | 2
Semi-automatic translation of medicine usage data (in Dutch, free-text) from Lifelines COVID-19 questionnaires to ATC codes | 2
https://invertebratefungi.org/: an expert-curated web-based platform for the identification and classification of invertebrate-associated fungi and fungus-like organisms | 2
Assigning species information to corresponding genes by a sequence labeling framework | 2
GeniePool: genomic database with corresponding annotated samples based on a cloud data lake architecture | 2
Which methods are the most effective in enabling novice users to participate in ontology creation? A usability study | 2
SmartWoodID—an image collection of large end-grain surfaces to support wood identification systems | 2
OUP accepted manuscript | 2
The African Human Microbiome Portal: a public web portal of curated metagenomic metadata | 2
LBD: a manually curated database of experimentally validated lymphoma biomarkers | 2
Phosprof: pathway analysis database of drug response based on phosphorylation activity measurements | 2
Development of a biomarker database toward performing disease classification and finding disease interrelations | 2
CausalBuilder: bringing the MI2CAST causal interaction annotation standard to the curator | 2
AIMedGraph: a comprehensive multi-relational knowledge graph for precision medicine | 2
HGFDB: a collective database of helmeted guinea fowl genomics | 2
Maize Feature Store: A centralized resource to manage and analyze curated maize multi-omics features for machine learning applications | 2
ProBioQuest: a database and semantic analysis engine for literature, clinical trials and patents related to probiotics | 2
PASS2.7: a database containing structure-based sequence alignments and associated features of protein domain superfamilies from SCOPe | 2
HumanMine: advanced data searching, analysis and cross-species comparison | 2
CARD*Shark: automated prioritization of literature curation for the Comprehensive Antibiotic Resistance Database | 2
ENCD: a manually curated database of experimentally supported endocrine system disease and lncRNA associations | 2
An online database for einkorn wheat to aid in gene discovery and functional genomics studies | 2