GigaScience

Papers
(The median citation count of GigaScience papers is 4. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-09-01 to 2025-09-01.)
Article	Citations
The curse and blessing of abundance—the evolution of drug interaction databases and their impact on drug network analysis	229
DivBrowse—interactive visualization and exploratory data analysis of variant call matrices	93
ntsm: an alignment-free, ultra-low-coverage, sequencing technology agnostic, intraspecies sample comparison tool for sample swap detection	85
Protein–protein and protein–nucleic acid binding site prediction via interpretable hierarchical geometric deep learning	76
MBGC: Multiple Bacteria Genome Compressor	70
Democratizing data-independent acquisition proteomics analysis on public cloud infrastructures via the Galaxy framework	66
3D-Beacons: decreasing the gap between protein sequences and structures through a federated network of protein structure data resources	62
The probability of edge existence due to node degree: a baseline for network-based predictions	61
Reducing skin microbiome exposure impacts through swine farm biosecurity	59
Current status of global conservation and characterisation of wild and cultivated Brassicaceae genetic resources	49
Hecatomb: an integrated software platform for viral metagenomics	47
CODARFE: Unlocking the prediction of continuous environmental variables based on microbiome	46
Qiber3D—an open-source software package for the quantitative analysis of networks from 3D image stacks	46
Preventing dataset shift from breaking machine-learning biomarkers	38
A high-quality, long-read genome assembly of the endangered ring-tailed lemur (Lemur catta)	37
FAIR data station for lightweight metadata management and validation of omics studies	35
Correction to: Antibiotic resistance genes are differentially mobilized according to resistance mechanism	35
A decade of GigaScience: A perspective on conservation genetics	35
CoVEffect: interactive system for mining the effects of SARS-CoV-2 mutations and variants based on deep learning	35
Dual-Alpha: a large EEG study for dual-frequency SSVEP brain–computer interface	34
Galaxy as a gateway to bioinformatics: Multi-Interface Galaxy Hands-on Training Suite (MIGHTS) for scRNA-seq	32
The Nencki-Symfonia electroencephalography/event-related potential dataset: Multiple cognitive tasks and resting-state data collected in a sample of healthy adults	31
scMAPA: Identification of cell-type–specific alternative polyadenylation in complex tissues	31
X-ray microtomography imaging of craniofacial hard tissues in selected reptile species with different types of dentition	31
Cellsnake: a user-friendly tool for single-cell RNA sequencing analysis	30
A Case for estradiol: younger brains in women with earlier menarche and later menopause	29
Characteristics and filtering of low-frequency artificial short deletion variations based on nanopore sequencing	27
gNOMO2: a comprehensive and modular pipeline for integrated multi-omics analyses of microbiomes	27
WaveSeekerNet: accurate prediction of influenza A virus subtypes and host source using attention-based deep learning	26
A virtual library for behavioral performance in standard conditions—rodent spontaneous activity in an open field during repeated testing and after treatment with drugs or brain lesions	26
ricu: R’s interface to intensive care data	26
Molecular mechanisms underlying hematophagia revealed by comparative analyses of leech genomes	26
Knowledge graph–based thought: a knowledge graph–enhanced LLM framework for pan-cancer question answering	26
vEMstitch: an algorithm for fully automatic image stitching of volume electron microscopy	25
A Decade of GigaScience: Milestones in Open Science	25
Harnessing population diversity: in search of tools of the trade	25
Studying mutation rate evolution in primates—the effects of computational pipelines and parameter choices	25
Early microbial intervention reshapes phenotypes of newborn Bos taurus through metabolic regulations	25
xAtlas: scalable small variant calling across heterogeneous next-generation sequencing experiments	24
CAT Bridge: an efficient toolkit for gene–metabolite association mining from multiomics data	24
Hi-GDT: A Hi-C-based 3D gene domain analysis tool for analyzing local chromatin contacts in plants	23
Genomic insights into endangerment and conservation of the garlic-fruit tree (Malania oleifera), a plant species with extremely small populations	23
External validation of machine learning models—registered models and adaptive sample splitting	23
Computational reproducibility of Jupyter notebooks from biomedical publications	23
Unveiling vertebrate development dynamics in frog Xenopus laevis using micro-CT imaging	23
Multiomics uncovers the epigenomic and transcriptomic response to viral and bacterial stimulation in turbot	23
Population modeling with machine learning can enhance measures of mental health	22
A new haplotype-resolved turkey genome to enable turkey genetics and genomics research	21
Habitat suitability maps for Australian flora and fauna under CMIP6 climate scenarios	21
Hiding in plain sight: a research parasite’s perspective on new lessons in old data	21
An accessible infrastructure for artificial intelligence using a Docker-based JupyterLab in Galaxy	20
Spacemake: processing and analysis of large-scale spatial transcriptomics data	20
Deterministic succession patterns in the rumen and fecal microbiome associate with host metabolic shifts in peripartum dairy cattle	20
GADMA2: more efficient and flexible demographic inference from genetic data	19
High-quality genome assembles from key Hawaiian coral species	19
Disentangling river and swamp buffalo genetic diversity: initial insights from the 1000 Buffalo Genomes Project	19
PEPhub: a database, web interface, and API for editing, sharing, and validating biological sample metadata	19
De novo screening of disease-resistant genes from the chromosome-level genome of rare minnow using CRISPR-cas9 random mutation	18
The whole-genome assembly of an endangered Salicaceae species: Chosenia arbutifolia (Pall.) A. Skv	18
A near telomere-to-telomere phased genome assembly and annotation for the Australian central bearded dragon Pogona vitticeps	18
spatiAlign: an unsupervised contrastive learning model for data integration of spatially resolved transcriptomics	18
Publishing data to support the fight against human vector-borne diseases	18
Lessons learned about the biology and genomics of Diaphorina citri infection with “Candidatus Liberibacter asiaticus” by integrating new and archived organ-specific transcriptome data	18
The Capparis spinosa var. herbacea genome provides the first genomic instrument for a diversity and evolution study of the Capparaceae family	18
Retracted and Replaced: Telomere-to-telomere genome and resequencing of 254 individuals reveal evolution, genomic footprints in Asian icefish, Protosalanx chinensis	18
Resequencing of a Pekin duck breeding population provides insights into the genomic response to short-term artificial selection	17
Data standardization of plant–pollinator interactions	17
Haplogenome assembly reveals structural variation in Eucalyptus interspecific hybrids	17
A Decade of GigaScience: Women in Science: Past, Present, and Future	17
DeePhage: distinguishing virulent and temperate phage-derived sequences in metavirome data with a deep learning approach	17
HaploMaker: An improved algorithm for rapid haplotype assembly of genomic sequences	17
Loop detection using Hi-C data with HiCExplorer	17
Deep learning links localized digital pathology phenotypes with transcriptional subtype and patient outcome in glioblastoma	16
ssMutPA: single-sample mutation-based pathway analysis approach for cancer precision medicine	16
High temporal resolution Nanopore sequencing dataset of SARS-CoV-2 and host cell RNAs	16
LRTK: a platform agnostic toolkit for linked-read analysis of both human genome and metagenome	15
On the benefits of self-taught learning for brain decoding	15
A chromosome-scale genome assembly of the pioneer plant Stylosanthes angustifolia: insights into genome evolution and drought adaptation	15
TF-Prioritizer: a Java pipeline to prioritize condition-specific transcription factors	15
Lifting the curse from high-dimensional data: automated projection pursuit clustering for a variety of biological data modalities	15
NuCLS: A scalable crowdsourcing approach and dataset for nucleus classification and segmentation in breast cancer	15
Long-read and chromosome-scale assembly of the hexaploid wheat genome achieves high resolution for research and breeding	15
Telomere-to-telomere chromosome-scale genome assemblies of black and golden koi carp variants support construction of an ancient karyotype of Cypriniformes	15
Disease classification for whole-blood DNA methylation: Meta-analysis, missing values imputation, and XAI	15
Correction to: Habitat suitability maps for Australian flora and fauna under CMIP6 climate scenarios	15
ToxCodAn-Genome: an automated pipeline for toxin-gene annotation in genome assembly of venomous lineages	15
Construction and analysis of the chromosome-level haplotype-resolved genomes of two Crassostrea oyster congeners: Crassostrea angulata and Crassostrea gigas	15
RWRtoolkit: multi-omic network analysis using random walks on multiplex networks in any species	15
A chromosome-level genome assembly and annotation of the desert horned lizard, Phrynosoma platyrhinos, provides insight into chromosomal rearrangements among reptiles	14
Profiling the baseline performance and limits of machine learning models for adaptive immune receptor repertoire classification	14
Large-scale genomic survey with deep learning-based method reveals strain-level phage specificity determinants	14
Celebrating 30 years of access to NASA Space Life Sciences data	14
Chromosome-level genome and the identification of sex chromosomes in Uloborus diversus	14
The telomere-to-telomere (T2T) genome provides insights into the evolution of specialized centromere sequences in sandalwood	14
learnMSA: learning and aligning large protein families	13
Message in a Bottle—Metabarcoding enables biodiversity comparisons across ecoregions	13
HeteroMRI: Robust white matter abnormality classification across multi-scanner MRI data	13
A novel dataset for nuclei and tissue segmentation in melanoma with baseline nuclei segmentation and tissue segmentation benchmarks	13
A near-complete genome assembly of the bearded dragon Pogona vitticeps provides insights into the origin of Pogona sex chromosomes	13
EAGS: efficient and adaptive Gaussian smoothing applied to high-resolved spatial transcriptomics	13
simAIRR: simulation of adaptive immune repertoires with realistic receptor sequence sharing for benchmarking of immune state prediction methods	13
Telomere-to-telomere genome of common bean (Phaseolus vulgaris L., YP4)	13
The state of Medusozoa genomics: current evidence and future challenges	13
A dataset of ant colonies’ motion trajectories in indoor and outdoor scenes to study clustering behavior	12
Whole-genome sequencing of the invasive golden apple snail Pomacea canaliculata from Asia reveals rapid expansion and adaptive evolution	12
CoCoPyE: feature engineering for learning and prediction of genome quality indices	12
Chromosome-level genome of the venomous snail Kalloconus canariensis: a valuable model for venomics and comparative genomics	12
Exploring the cellular and molecular basis of murine cardiac development through spatiotemporal transcriptome sequencing	12
A near telomere-to-telomere genome assembly of the Jinhua pig: enabling more accurate genetic research	12
Gapless genome assembly and epigenetic profiles reveal gene regulation of whole-genome triplication in lettuce	12
An in vitro whole-cell electrophysiology dataset of human cortical neurons	11
A telomere-to-telomere genome assembly of koi carp (Cyprinus carpio) using long reads and Hi-C technology	11
A high-quality assembly revealing the PMEL gene for the unique plumage phenotype in Liancheng ducks	11
Metaphor—A workflow for streamlined assembly and binning of metagenomes	11
Monash DaCRA fPET-fMRI: A dataset for comparison of radiotracer administration for high temporal resolution functional FDG-PET	11
A Decade of GigaScience: GigaDB and the Open Data Movement	11
Living in darkness: Exploring adaptation of Proteus anguinus in 3 dimensions by X-ray imaging	11
DENTIST—using long reads for closing assembly gaps at high accuracy	11
An overview of the National COVID-19 Chest Imaging Database: data quality and cohort analysis	11
Guidance framework to apply best practices in ecological data analysis: lessons learned from building Galaxy-Ecology	11
M6Allele: a toolkit for detection of allele-specific RNA N6-methyladenosine modifications	11
On the variability of dynamic functional connectivity assessment methods	10
Chromosome-level reference genome of tetraploid Isoetes sinensis provides insights into evolution and adaption of lycophytes	10
Deepdefense: annotation of immune systems in prokaryotes using deep learning	10
Katdetectr: an R/bioconductor package utilizing unsupervised changepoint analysis for robust kataegis detection	10
Near telomere-to-telomere genome assembly of Mongolian cattle: implications for population genetic variation and beef quality	10
Chromosome-level genome assembly of goose provides insight into the adaptation and growth of local goose breeds	10
Retraction and replacement of: Telomere-to-telomere genome and resequencing of 254 individuals reveal evolution, genomic footprints in Asian icefish, Protosalanx chinensis	10
Chromosome-level genome assemblies of two littorinid marine snails indicate genetic basis of intertidal adaptation and ancient karyotype evolved from bilaterian ancestors	10
Evaluation of Swin Transformer and knowledge transfer for denoising of super-resolution structured illumination microscopy data	10
A high-quality pseudo-phased genome for Melaleuca quinquenervia shows allelic diversity of NLR-type resistance genes	10
Defining the characteristics of interferon-alpha–stimulated human genes: insight from expression data and machine learning	9
Accurate gene consensus at low nanopore coverage	9
MOBFinder: a tool for mobilization typing of plasmid metagenomic fragments based on a language model	9
TAMPA: interpretable analysis and visualization of metagenomics-based taxon abundance profiles	9
Chromosome-length genome assembly and linkage map of a critically endangered Australian bird: the helmeted honeyeater	9
d-StructMAn: Containerized structural annotation on the scale from genetic variants to whole proteomes	9
Chromosome-level genome assemblies of Channa argus and Channa maculata and comparative analysis of their temperature adaptability	9
Correction to: The state of Medusozoa genomics: current evidence and future challenges	9
The chromosome-level genome assembly of an endangered herb Bergenia scopulosa provides insights into local adaptation and genomic vulnerability under climate change	9
SODAR: managing multiomics study data and metadata	9
Deciphering cancer genomes with GenomeSpy: a grammar-based visualization toolkit	9
Cerebellocerebral connectivity predicts body mass index: a new open-source Python-based framework for connectome-based predictive modeling	9
Fully resolved assembly of Cryptosporidium parvum	9
A molecular phenotypic map of malignant pleural mesothelioma	9
A decade of GigaScience: 10 years of the evolving genomic and biomedical standards landscape	9
Interspecific hybridization in Brassica species leads to changes in agronomic traits through the regulation of gene expression by chromatin accessibility and DNA methylation	9
A trade-off in evolution: the adaptive landscape of spiders without venom glands	8
On the relationship between research parasites and fairness in machine learning: challenges and opportunities	8
Machine Learning Made Easy (MLme): a comprehensive toolkit for machine learning–driven data analysis	8
The molecular basis of octocoral calcification revealed by genome and skeletal proteome analyses	8
Streamlining remote nanopore data access with slow5curl	8
Fusion transcripts and their genomic breakpoints in polyadenylated and ribosomal RNA–minus RNA sequencing data	8
MuLan-Methyl—multiple transformer-based language models for accurate DNA methylation prediction	8
Corrigendum to: The new COST Action European Venom Network (EUVEN)—synergy and future perspectives of modern venomics	8
Genomic analyses provide insights into the evolution and salinity adaptation of halophyte Tamarix chinensis	8
Chromosome-level genome assembly of the Pacific geoduck Panopea generosa reveals major inter- and intrachromosomal rearrangements and substantial expansion of the copine gene family	8
Accurate and fast clade assignment via deep learning and frequency chaos game representation	8
Telomere-to-telomere gap-free genome assembly of the endangered Yangtze finless porpoise and East Asian finless porpoise	8
ChemChaste: Simulating spatially inhomogeneous biochemical reaction–diffusion systems for modeling cell–environment feedbacks	8
SynProtX: a large-scale proteomics-based deep learning model for predicting synergistic anticancer drug combinations	7
Alignstein: Optimal transport for improved LC-MS retention time alignment	7
Metabarcoding versus mapping unassembled shotgun reads for identification of prey consumed by arthropod epigeal predators	7
Building the mega single-cell transcriptome ocular meta-atlas	7
A chromosome-level reference genome of Ensete glaucum gives insight into diversity and chromosomal and repetitive sequence evolution in the Musaceae	7
Telomere-to-telomere African wild rice (Oryza longistaminata) reference genome reveals segmental and structural variation	7
Korea4K: whole genome sequences of 4,157 Koreans with 107 phenotypes derived from extensive health check-ups	7
Genome size evolution in the diverse insect order Trichoptera	7
RNAVirHost: a machine learning–based method for predicting hosts of RNA viruses through viral genomes	7
Impact of reference design on estimating SARS-CoV-2 lineage abundances from wastewater sequencing data	7
A high-quality genome and comparison of short- versus long-read transcriptome of the palaearctic duck Aythya fuligula (tufted duck)	7
DeePVP: Identification and classification of phage virion proteins using deep learning	7
A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines	7
Comparative maternal protein profiling of mouse biparental and uniparental embryos	6
Chromosome-level genome assembly of Pinus massoniana provides insights into conifer adaptive evolution	6
LED color gradient as a new screening tool for rapid phenotyping of plant responses to light quality	6
epialleleR: an R/Bioconductor package for sensitive allele-specific methylation analysis in NGS data	6
Container Profiler: Profiling resource utilization of containerized big data pipelines	6
Chromosome-level reference genome for the medically important Arabian horned viper (Cerastes gasperettii)	6
The rise of genomics in snake venom research: recent advances and future perspectives	6
Similar, but not the same: multiomics comparison of human valve interstitial cells and osteoblast osteogenic differentiation expanded with an estimation of data-dependent and data-independent PASEF pr	6
Near-chromosomal de novo assembly of Bengal tiger genome reveals genetic hallmarks of apex predation	6
Construction and analysis of telomere-to-telomere genomes for 2 sweet oranges: Longhuihong and Newhall (Citrus sinensis)	6
A ghost moth olfactory prototype of the lepidopteran sex communication	6
Lessons learned to boost a bioinformatics knowledge base reusability, the Bgee experience	6
Analysis-ready VCF at Biobank scale using Zarr	6
Genomic view of the diversity and functional role of archaea and bacteria in the skeleton of the reef-building corals Porites lutea and Isopora palifera	6
A telomere-to-telomere phased genome of an octoploid strawberry reveals a receptor kinase conferring anthracnose resistance	6
T cell receptor repertoire sequencing reveals chemotherapy-driven clonal expansion in colorectal liver metastases	6
Linking big biomedical datasets to modular analysis with Portable Encapsulated Projects	6
PeptideMiner—neuropeptide discovery across the animal kingdom	6
Desiderata for the development of next-generation electronic health record phenotype libraries	6
Imputation method for single-cell RNA-seq data using neural topic model	6
Exploring the cobia (Rachycentron canadum) genome: unveiling putative male heterogametic regions and identification of sex-specific markers	6
Telomere-to-telomere genome and resequencing of 231 individuals reveal evolution, genomic footprints in Asian icefish, Protosalanx chinensis	6
Developing best practices for genotyping-by-sequencing analysis in the construction of linkage maps	6
Statistical quantification of confounding bias in machine learning models	6
Interpretable network propagation with application to expanding the repertoire of human proteins that interact with SARS-CoV-2	6
Genome resequencing reveals independent domestication and breeding improvement of naked oat	6
A high-quality assembly reveals genomic characteristics, phylogenetic status, and causal genes for leucism plumage of Indian peafowl	6
Image segmentation of treated and untreated tumor spheroids by fully convolutional networks	6
ARA: a flexible pipeline for automated exploration of NCBI SRA datasets	6
Deciphering the distinct transcriptomic and gene regulatory map in adult macaque basal ganglia cells	6
Unlocking the power of AI for phenotyping fruit morphology in Arabidopsis	6
Correction to: Scientists without borders: lessons from Ukraine	6
Integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants	6
An interconnected data infrastructure to support large-scale rare disease research	6
A workflow reproducibility scale for automatic validation of biological interpretation results	6
Spatial integration of multi-omics data from serial sections using the novel Multi-Omics Imaging Integration Toolset	6
Myth-busting the provider-user relationship for digital sequence information	5
Contrast subgraphs allow comparing homogeneous and heterogeneous networks derived from omics data	5
The first high-altitude autotetraploid haplotype-resolved genome assembled (Rhododendron nivale subsp. boreale) provides new insights into mountaintop adaptation	5
The haplotype-resolved chromosome pairs of a heterozygous diploid African cassava cultivar reveal novel pan-genome and allele-specific transcriptome features	5
Identifying candidate genetic variants for egg number by analyzing over 1,000 fully sequenced layers	5
DNA methylation analysis to differentiate reference, breed, and parent-of-origin effects in the bovine pangenome era	5
HVSeeker: a deep-learning-based method for identification of host and viral DNA sequences	5
CheRRI—Accurate classification of the biological relevance of putative RNA–RNA interaction sites	5
Efficient real-time selective genome sequencing on resource-constrained devices	5
Euler characteristic curves and profiles: a stable shape invariant for big data problems	5
Pangenome databases improve host removal and mycobacteria classification from clinical metagenomic data	5
A comprehensive water buffalo pangenome reveals extensive structural variation linked to population-specific signatures of selection	5
Integrating comparative genomics and risk classification by assessing virulence, antimicrobial resistance, and plasmid spread in microbial communities with gSpreadComp	5
The complexity landscape of viral genomes	5
Chromosome-level assembly and annotation of the blue catfish Ictalurus furcatus, an aquaculture species for hybrid catfish reproduction, epigenetics, and heterosis studies	5
A telomere-to-telomere gapless genome reveals SlPRR1 control of circadian rhythm and photoperiodic flowering in tomato	5
Diaci v3.0: chromosome-level assembly, de novo transcriptome, and manual annotation of Diaphorina citri, insect vector of Huanglongbing	5
Toward global integration of biodiversity big data: a harmonized metabarcode data generation module for terrestrial arthropods	5
Standardized genome-wide function prediction enables comparative functional genomics: a new application area for Gene Ontologies in plants	5
Modern venomics—Current insights, novel methods, and future perspectives in biological and applied animal venom research	5
A high-quality chromosomal genome assembly of the sea cucumber Chiridota heheva and its hydrothermal adaptation	5
Antibiotic resistance genes are differentially mobilized according to resistance mechanism	5
Association mapping across a multitude of traits collected in diverse environments in maize	5
Delineating regions of interest for mass spectrometry imaging by multimodally corroborated spatial segmentation	5
Mutation impact on mRNA versus protein expression across human cancers	5
Correction to: molecular mechanisms underlying hematophagia revealed by comparative analyses of leech genomes	5
SimFFPE and FilterFFPE: improving structural variant calling in FFPE samples	5
Evolution of complex genome architecture in gymnosperms	5
Single-cell transcriptome analysis illuminating the characteristics of species-specific innate immune responses against viral infections	5
Overture: an open-source genomics data platform	5
Highly accurate whole-genome imputation of SARS-CoV-2 from partial or low-quality sequences	4
Open-source benchmarking of IBD segment detection methods for biobank-scale cohorts	4
Best genome sequencing strategies for annotation of complex immune gene families in wildlife	4
Long-read metagenomic sequencing negates inferred loss of cytosine methylation in Myxosporea (Cnidaria: Myxozoa)	4
AMR-meta: a k-mer and metafeature approach to classify antimicrobial resistance from high-throughput short-read metagenomics data	4
A chromosome-level assembly supports genome-wide investigation of the DMRT gene family in the golden mussel (Limnoperna fortunei)	4
A chromosome-level genome assembly for the eastern fence lizard (Sceloporus undulatus), a reptile model for physiological and evolutionary ecology	4
Efficient phylogenetic tree inference for massive taxonomic datasets: harnessing the power of a server to analyze 1 million taxa	4
Comparing linear and nonlinear finite element models of vertebral strength across the thoracolumbar spine: a benchmark from density-calibrated computed tomography	4
Comparative analysis of common alignment tools for single-cell RNA sequencing	4
The Australasian dingo archetype: de novo chromosome-length genome assembly, DNA methylome, and cranial morphology	4