GigaScience

Papers
(The median citation count of GigaScience papers is 4. The table below lists the papers above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2022-01-01 to 2026-01-01.)
Article | Citations
Reducing skin microbiome exposure impacts through swine farm biosecurity | 338
The curse and blessing of abundance—the evolution of drug interaction databases and their impact on drug network analysis | 109
Protein–protein and protein–nucleic acid binding site prediction via interpretable hierarchical geometric deep learning | 86
ntsm: an alignment-free, ultra-low-coverage, sequencing technology agnostic, intraspecies sample comparison tool for sample swap detection | 76
DivBrowse—interactive visualization and exploratory data analysis of variant call matrices | 72
Democratizing data-independent acquisition proteomics analysis on public cloud infrastructures via the Galaxy framework | 67
Current status of global conservation and characterisation of wild and cultivated Brassicaceae genetic resources | 58
Hecatomb: an integrated software platform for viral metagenomics | 56
MBGC: Multiple Bacteria Genome Compressor | 54
3D-Beacons: decreasing the gap between protein sequences and structures through a federated network of protein structure data resources | 50
The probability of edge existence due to node degree: a baseline for network-based predictions | 46
Correction to: Antibiotic resistance genes are differentially mobilized according to resistance mechanism | 38
A decade of GigaScience: A perspective on conservation genetics | 38
Cellsnake: a user-friendly tool for single-cell RNA sequencing analysis | 38
The Nencki-Symfonia electroencephalography/event-related potential dataset: Multiple cognitive tasks and resting-state data collected in a sample of healthy adults | 38
On the path to reference genomes for all biodiversity: laboratory protocols and lessons learned from processing over 2,000 species in the Sanger Tree of Life | 37
X-ray microtomography imaging of craniofacial hard tissues in selected reptile species with different types of dentition | 37
A high-quality, long-read genome assembly of the endangered ring-tailed lemur (Lemur catta) | 36
TinkerHap—a novel read-based phasing algorithm with integrated multimethod support for enhanced accuracy | 33
Characteristics and filtering of low-frequency artificial short deletion variations based on nanopore sequencing | 32
Qiber3D—an open-source software package for the quantitative analysis of networks from 3D image stacks | 32
CoVEffect: interactive system for mining the effects of SARS-CoV-2 mutations and variants based on deep learning | 32
WaveSeekerNet: accurate prediction of influenza A virus subtypes and host source using attention-based deep learning | 31
A Case for estradiol: younger brains in women with earlier menarche and later menopause | 31
FAIR data station for lightweight metadata management and validation of omics studies | 31
CODARFE: Unlocking the prediction of continuous environmental variables based on microbiome | 31
ricu: R’s interface to intensive care data | 30
Dual-Alpha: a large EEG study for dual-frequency SSVEP brain–computer interface | 29
Datagraphy: toward a systematic approach to dataset discovery | 29
Galaxy as a gateway to bioinformatics: Multi-Interface Galaxy Hands-on Training Suite (MIGHTS) for scRNA-seq | 29
scMAPA: Identification of cell-type–specific alternative polyadenylation in complex tissues | 28
gNOMO2: a comprehensive and modular pipeline for integrated multi-omics analyses of microbiomes | 28
The genomes of five mantises provide insights into sex chromosome evolution and Mantodea phylogeny clarification | 28
Hi-GDT: A Hi-C-based 3D gene domain analysis tool for analyzing local chromatin contacts in plants | 27
A virtual library for behavioral performance in standard conditions—rodent spontaneous activity in an open field during repeated testing and after treatment with drugs or brain lesions | 27
Unveiling vertebrate development dynamics in frog Xenopus laevis using micro-CT imaging | 25
Multiomics uncovers the epigenomic and transcriptomic response to viral and bacterial stimulation in turbot | 25
Early microbial intervention reshapes phenotypes of newborn Bos taurus through metabolic regulations | 25
Genomic insights into endangerment and conservation of the garlic-fruit tree (Malania oleifera), a plant species with extremely small populations | 25
A Decade of GigaScience: Milestones in Open Science | 25
CAT Bridge: an efficient toolkit for gene–metabolite association mining from multiomics data | 24
xAtlas: scalable small variant calling across heterogeneous next-generation sequencing experiments | 24
vEMstitch: an algorithm for fully automatic image stitching of volume electron microscopy | 23
Computational reproducibility of Jupyter notebooks from biomedical publications | 23
Knowledge graph–based thought: a knowledge graph–enhanced LLM framework for pan-cancer question answering | 23
External validation of machine learning models—registered models and adaptive sample splitting | 22
Spacemake: processing and analysis of large-scale spatial transcriptomics data | 22
Harnessing population diversity: in search of tools of the trade | 22
Habitat suitability maps for Australian flora and fauna under CMIP6 climate scenarios | 22
Hiding in plain sight: a research parasite’s perspective on new lessons in old data | 22
Disentangling river and swamp buffalo genetic diversity: initial insights from the 1000 Buffalo Genomes Project | 22
Molecular mechanisms underlying hematophagia revealed by comparative analyses of leech genomes | 22
PEPhub: a database, web interface, and API for editing, sharing, and validating biological sample metadata | 21
Deterministic succession patterns in the rumen and fecal microbiome associate with host metabolic shifts in peripartum dairy cattle | 20
GADMA2: more efficient and flexible demographic inference from genetic data | 20
High-quality genome assembles from key Hawaiian coral species | 20
A new haplotype-resolved turkey genome to enable turkey genetics and genomics research | 20
An accessible infrastructure for artificial intelligence using a Docker-based JupyterLab in Galaxy | 20
First chromosome-level genome assembly of the colonial chordate model Botryllus schlosseri (Tunicata) | 19
Lessons learned about the biology and genomics of Diaphorina citri infection with “Candidatus Liberibacter asiaticus” by integrating new and archived organ-specific transcriptome data | 19
pyRootHair: Machine Learning Accelerated Software for High-Throughput Phenotyping of Plant Root Hair Traits | 19
The whole-genome assembly of an endangered Salicaceae species: Chosenia arbutifolia (Pall.) A. Skv | 19
Segmentation-Based Quality Control of Structural MRI using the CAT12 Toolbox | 19
Resequencing of a Pekin duck breeding population provides insights into the genomic response to short-term artificial selection | 19
Exploring the role of normalization and feature selection in microbiome disease classification pipelines | 18
Retracted and Replaced: Telomere-to-telomere genome and resequencing of 254 individuals reveal evolution, genomic footprints in Asian icefish, Protosalanx chinensis | 18
A near telomere-to-telomere phased genome assembly and annotation for the Australian central bearded dragon Pogona vitticeps | 18
Publishing data to support the fight against human vector-borne diseases | 18
NApy: efficient statistics in Python for large-scale heterogeneous data with enhanced support for missing data | 17
Telomere-to-telomere chromosome-scale genome assemblies of black and golden koi carp variants support construction of an ancient karyotype of Cypriniformes | 17
HaploMaker: An improved algorithm for rapid haplotype assembly of genomic sequences | 17
The Capparis spinosa var. herbacea genome provides the first genomic instrument for a diversity and evolution study of the Capparaceae family | 17
A high-quality reference genome for the Ural Owl (Strix uralensis) enables investigations of cell cultures as a genomic resource for endangered species | 17
spatiAlign: an unsupervised contrastive learning model for data integration of spatially resolved transcriptomics | 17
ssMutPA: single-sample mutation-based pathway analysis approach for cancer precision medicine | 17
High temporal resolution Nanopore sequencing dataset of SARS-CoV-2 and host cell RNAs | 16
A Decade of GigaScience: Women in Science: Past, Present, and Future | 16
NuCLS: A scalable crowdsourcing approach and dataset for nucleus classification and segmentation in breast cancer | 16
The effects of bioinformatics preprocessing on cell-free DNA fragment analysis | 16
TF-Prioritizer: a Java pipeline to prioritize condition-specific transcription factors | 16
Deep learning links localized digital pathology phenotypes with transcriptional subtype and patient outcome in glioblastoma | 16
A chromosome-scale genome assembly of the pioneer plant Stylosanthes angustifolia: insights into genome evolution and drought adaptation | 16
Challenges in structural variant calling in low-complexity regions | 16
On the benefits of self-taught learning for brain decoding | 16
Disease classification for whole-blood DNA methylation: Meta-analysis, missing values imputation, and XAI | 16
Haplogenome assembly reveals structural variation in Eucalyptus interspecific hybrids | 16
Loop detection using Hi-C data with HiCExplorer | 16
Long-read and chromosome-scale assembly of the hexaploid wheat genome achieves high resolution for research and breeding | 16
ToxCodAn-Genome: an automated pipeline for toxin-gene annotation in genome assembly of venomous lineages | 15
HeteroMRI: Robust white matter abnormality classification across multi-scanner MRI data | 15
LRTK: a platform agnostic toolkit for linked-read analysis of both human genome and metagenome | 15
Construction and analysis of the chromosome-level haplotype-resolved genomes of two Crassostrea oyster congeners: Crassostrea angulata and Crassostrea gigas | 15
AEnet: a practical tool to construct the splicing-associated phenotype atlas at a single cell level | 15
Data standardization of plant–pollinator interactions | 15
The telomere-to-telomere (T2T) genome provides insights into the evolution of specialized centromere sequences in sandalwood | 15
Lifting the curse from high-dimensional data: automated projection pursuit clustering for a variety of biological data modalities | 14
RWRtoolkit: multi-omic network analysis using random walks on multiplex networks in any species | 14
A near-complete genome assembly of the bearded dragon Pogona vitticeps provides insights into the origin of Pogona | 14
Profiling the baseline performance and limits of machine learning models for adaptive immune receptor repertoire classification | 14
The state of Medusozoa genomics: current evidence and future challenges | 13
Large-scale genomic survey with deep learning-based method reveals strain-level phage specificity determinants | 13
A telomere-to-telomere genome assembly of koi carp (Cyprinus carpio) using long reads and Hi-C technology | 13
Chromosome-level genome and the identification of sex chromosomes in Uloborus diversus | 13
Celebrating 30 years of access to NASA Space Life Sciences data | 13
A chromosome-level genome assembly and annotation of the desert horned lizard, Phrynosoma platyrhinos, provides insight into chromosomal rearrangements among reptiles | 13
Correction to: Habitat suitability maps for Australian flora and fauna under CMIP6 climate scenarios | 13
EAGS: efficient and adaptive Gaussian smoothing applied to high-resolved spatial transcriptomics | 13
Message in a Bottle—Metabarcoding enables biodiversity comparisons across ecoregions | 13
A near telomere-to-telomere genome assembly of the Jinhua pig: enabling more accurate genetic research | 12
Gapless genome assembly and epigenetic profiles reveal gene regulation of whole-genome triplication in lettuce | 12
Telomere-to-telomere genome of common bean (Phaseolus vulgaris L., YP4) | 12
DENTIST—using long reads for closing assembly gaps at high accuracy | 12
PathoGFAIR: a collection of FAIR and adaptable (meta)genomics workflows for (foodborne) pathogens detection and tracking | 12
simAIRR: simulation of adaptive immune repertoires with realistic receptor sequence sharing for benchmarking of immune state prediction methods | 12
CoCoPyE: feature engineering for learning and prediction of genome quality indices | 12
A dataset of ant colonies’ motion trajectories in indoor and outdoor scenes to study clustering behavior | 12
The Open Pediatric Cancer Project | 12
An in vitro whole-cell electrophysiology dataset of human cortical neurons | 12
Chromosome-level genome of the venomous snail Kalloconus canariensis: a valuable model for venomics and comparative genomics | 12
Exploring the cellular and molecular basis of murine cardiac development through spatiotemporal transcriptome sequencing | 12
A Decade of GigaScience: GigaDB and the Open Data Movement | 12
Monash DaCRA fPET-fMRI: A dataset for comparison of radiotracer administration for high temporal resolution functional FDG-PET | 12
learnMSA: learning and aligning large protein families | 12
PanGIA: A universal framework for identifying association between ncRNAs and diseases | 11
Deepdefense: annotation of immune systems in prokaryotes using deep learning | 11
Retraction and replacement of: Telomere-to-telomere genome and resequencing of 254 individuals reveal evolution, genomic footprints in Asian icefish, Protosalanx chinensis | 11
Evaluation of Swin Transformer and knowledge transfer for denoising of super-resolution structured illumination microscopy data | 11
A high-quality pseudo-phased genome for Melaleuca quinquenervia shows allelic diversity of NLR-type resistance genes | 11
Deciphering cancer genomes with GenomeSpy: a grammar-based visualization toolkit | 11
A high-quality assembly revealing the PMEL gene for the unique plumage phenotype in Liancheng ducks | 11
M6Allele: a toolkit for detection of allele-specific RNA N6-methyladenosine modifications | 11
A novel dataset for nuclei and tissue segmentation in melanoma with baseline nuclei segmentation and tissue segmentation benchmarks | 11
Katdetectr: an R/bioconductor package utilizing unsupervised changepoint analysis for robust kataegis detection | 11
Chromosome-level genome assembly of goose provides insight into the adaptation and growth of local goose breeds | 11
Guidance framework to apply best practices in ecological data analysis: lessons learned from building Galaxy-Ecology | 11
Using synthetic RNA to benchmark poly(A) length inference from direct RNA sequencing | 11
Whole-genome sequencing of the invasive golden apple snail Pomacea canaliculata from Asia reveals rapid expansion and adaptive evolution | 11
Chromosome-level genome assemblies of two littorinid marine snails indicate genetic basis of intertidal adaptation and ancient karyotype evolved from bilaterian ancestors | 10
Metaphor—A workflow for streamlined assembly and binning of metagenomes | 10
SurGen: 1020 H&E-stained whole-slide images with survival and genetic markers | 10
Accurate gene consensus at low nanopore coverage | 10
On the variability of dynamic functional connectivity assessment methods | 10
Near telomere-to-telomere genome assembly of Mongolian cattle: implications for population genetic variation and beef quality | 10
Translating short-form Python exercises to other programming languages using diverse prompting strategies | 10
d-StructMAn: Containerized structural annotation on the scale from genetic variants to whole proteomes | 10
Chromosome-level reference genome of tetraploid Isoetes sinensis provides insights into evolution and adaption of lycophytes | 10
Living in darkness: Exploring adaptation of Proteus anguinus in 3 dimensions by X-ray imaging | 10
Cerebellocerebral connectivity predicts body mass index: a new open-source Python-based framework for connectome-based predictive modeling | 10
Defining the characteristics of interferon-alpha–stimulated human genes: insight from expression data and machine learning | 10
Fully resolved assembly of Cryptosporidium parvum | 9
Improving taxonomic inference from ancient environmental metagenomes by masking microbial-like regions in reference genomes | 9
A molecular phenotypic map of malignant pleural mesothelioma | 9
MOBFinder: a tool for mobilization typing of plasmid metagenomic fragments based on a language model | 9
The chromosome-level genome assembly of an endangered herb Bergenia scopulosa provides insights into local adaptation and genomic vulnerability under climate change | 9
Interspecific hybridization in Brassica species leads to changes in agronomic traits through the regulation of gene expression by chromatin accessibility and DNA methylation | 9
TAMPA: interpretable analysis and visualization of metagenomics-based taxon abundance profiles | 9
Correction to: The state of Medusozoa genomics: current evidence and future challenges | 9
ChemChaste: Simulating spatially inhomogeneous biochemical reaction–diffusion systems for modeling cell–environment feedbacks | 9
Open RGB Imaging Workflow for Morphological and Morphometric Analysis of Fruits using Deep Learning: A Case Study on Almonds | 9
Genomic analyses provide insights into the evolution and salinity adaptation of halophyte Tamarix chinensis | 9
SODAR: managing multiomics study data and metadata | 9
A framework to mine laser microdissection-based omics data and uncover regulators of pancreatic cancer heterogeneity | 9
A decade of GigaScience: 10 years of the evolving genomic and biomedical standards landscape | 9
Chromosome-length genome assembly and linkage map of a critically endangered Australian bird: the helmeted honeyeater | 9
Streamlining remote nanopore data access with slow5curl | 9
A trade-off in evolution: the adaptive landscape of spiders without venom glands | 8
Accurate and fast clade assignment via deep learning and frequency chaos game representation | 8
The molecular basis of octocoral calcification revealed by genome and skeletal proteome analyses | 8
Telomere-to-telomere African wild rice (Oryza longistaminata) reference genome reveals segmental and structural variation | 8
A chromosome-level reference genome of Ensete glaucum gives insight into diversity and chromosomal and repetitive sequence evolution in the Musaceae | 8
Machine Learning Made Easy (MLme): a comprehensive toolkit for machine learning–driven data analysis | 8
Improving the reliability, quality, and maintainability of bioinformatics pipelines with nf-test | 8
MuLan-Methyl—multiple transformer-based language models for accurate DNA methylation prediction | 8
MRanalysis: a comprehensive online platform for integrated, multimethod Mendelian randomization and associated post-GWAS analyses | 8
A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines | 8
SynProtX: a large-scale proteomics-based deep learning model for predicting synergistic anticancer drug combinations | 8
Chromosome-level genome assembly of the Pacific geoduck Panopea generosa reveals major inter- and intrachromosomal rearrangements and substantial expansion of the copine gene family | 8
Telomere-to-telomere gap-free genome assembly of the endangered Yangtze finless porpoise and East Asian finless porpoise | 8
Genos: a human-centric genomic foundation model | 8
Alignstein: Optimal transport for improved LC-MS retention time alignment | 8
Impact of reference design on estimating SARS-CoV-2 lineage abundances from wastewater sequencing data | 8
Developing best practices for genotyping-by-sequencing analysis in the construction of linkage maps | 7
Spatial integration of multi-omics data from serial sections using the novel Multi-Omics Imaging Integration Toolset | 7
Similar, but not the same: multiomics comparison of human valve interstitial cells and osteoblast osteogenic differentiation expanded with an estimation of data-dependent and data-independent PASEF pr | 7
Lessons learned to boost a bioinformatics knowledge base reusability, the Bgee experience | 7
CryoDataBot: a pipeline to curate cryoEM datasets for AI-driven structural biology | 7
Telomere-to-telomere genome and resequencing of 231 individuals reveal evolution, genomic footprints in Asian icefish, Protosalanx chinensis | 7
Analysis-ready VCF at Biobank scale using Zarr | 7
Correction to: Scientists without borders: lessons from Ukraine | 7
Imputation method for single-cell RNA-seq data using neural topic model | 7
Genome size evolution in the diverse insect order Trichoptera | 7
DeePVP: Identification and classification of phage virion proteins using deep learning | 7
Comparative maternal protein profiling of mouse biparental and uniparental embryos | 7
Giant chromosomes of a tiny plant - the complete telomere-to-telomere genome assembly of the simple thalloid liverwort Apopellia endiviifolia (Jungermann | 7
Haplotype-resolved reference genomes of the sea turtle clade unveil ultra-syntenic genomes with hotspots of divergence | 7
Metabarcoding versus mapping unassembled shotgun reads for identification of prey consumed by arthropod epigeal predators | 7
Korea4K: whole genome sequences of 4,157 Koreans with 107 phenotypes derived from extensive health check-ups | 7
Exploring the cobia (Rachycentron canadum) genome: unveiling putative male heterogametic regions and identification of sex-specific markers | 7
Statistical quantification of confounding bias in machine learning models | 7
Genomic view of the diversity and functional role of archaea and bacteria in the skeleton of the reef-building corals Porites lutea and Isopora palifera | 7
Genome resequencing reveals independent domestication and breeding improvement of naked oat | 7
A ghost moth olfactory prototype of the lepidopteran sex communication | 7
An evaluation of computational methods for reconstruction of human viral DNA genomes | 7
An interconnected data infrastructure to support large-scale rare disease research | 7
RNAVirHost: a machine learning–based method for predicting hosts of RNA viruses through viral genomes | 7
The complete genome assembly of Astragalus membranaceus: enabling more accurate genetic research | 7
A workflow reproducibility scale for automatic validation of biological interpretation results | 7
Nanopore- and AI-empowered microbial viability inference | 6
Antibiotic resistance genes are differentially mobilized according to resistance mechanism | 6
HVSeeker: a deep-learning-based method for identification of host and viral DNA sequences | 6
Unlocking the power of AI for phenotyping fruit morphology in Arabidopsis | 6
Container Profiler: Profiling resource utilization of containerized big data pipelines | 6
Chromosome-level genome assembly of Pinus massoniana provides insights into conifer adaptive evolution | 6
LED color gradient as a new screening tool for rapid phenotyping of plant responses to light quality | 6
Integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants | 6
A high-quality assembly reveals genomic characteristics, phylogenetic status, and causal genes for leucism plumage of Indian peafowl | 6
T cell receptor repertoire sequencing reveals chemotherapy-driven clonal expansion in colorectal liver metastases | 6
Improved reference assembly and core collection re-sequencing to facilitate exploration of important agronomical traits for the improvement of oilseed crop, Carthamus tinctorius L. | 6
Comparative genomics and multiomics analyses reveal the evolution and physiological basis of rubber biosynthesis in Hevea species | 6
Diaci v3.0: chromosome-level assembly, de novo transcriptome, and manual annotation of Diaphorina citri, insect vector of Huanglongbing | 6
Standardized genome-wide function prediction enables comparative functional genomics: a new application area for Gene Ontologies in plants | 6
PeptideMiner—neuropeptide discovery across the animal kingdom | 6
epialleleR: an R/Bioconductor package for sensitive allele-specific methylation analysis in NGS data | 6
Deciphering the distinct transcriptomic and gene regulatory map in adult macaque basal ganglia cells | 6
Near-chromosomal de novo assembly of Bengal tiger genome reveals genetic hallmarks of apex predation | 6
The enduring advantages of the SLOW5 file format for raw nanopore sequencing data | 6
ARA: a flexible pipeline for automated exploration of NCBI SRA datasets | 6
Toward a standardized framework for pangenome graph evaluation: assessing crop plant pangenome variation graph construction from multiple assemblies | 6
CheRRI—Accurate classification of the biological relevance of putative RNA–RNA interaction sites | 6
Single-cell transcriptome analysis illuminating the characteristics of species-specific innate immune responses against viral infections | 6
A comprehensive water buffalo pangenome reveals extensive structural variation linked to population-specific signatures of selection | 6
Image segmentation of treated and untreated tumor spheroids by fully convolutional networks | 6
Unsupervised multiscale clustering of single-cell transcriptomes to identify hierarchical structures of cell subtypes | 6
The rise of genomics in snake venom research: recent advances and future perspectives | 6
Chromosome-level reference genome for the medically important Arabian horned viper (Cerastes gasperettii) | 6
Construction and analysis of telomere-to-telomere genomes for 2 sweet oranges: Longhuihong and Newhall (Citrus sinensis) | 6
A telomere-to-telomere phased genome of an octoploid strawberry reveals a receptor kinase conferring anthracnose resistance | 6
Overture: an open-source genomics data platform | 6
Delineating regions of interest for mass spectrometry imaging by multimodally corroborated spatial segmentation | 5
DNA methylation analysis to differentiate reference, breed, and parent-of-origin effects in the bovine pangenome era | 5
RNA-SeqEZPZ: A Point-and-Click Pipeline for Comprehensive Transcriptomics Analysis with Interactive Visualizations | 5
The first high-altitude autotetraploid haplotype-resolved genome assembled (Rhododendron nivale subsp. boreale) provides new insights into mountaintop adaptation | 5
Finding easy regions for short-read variant calling from pangenome data | 5
Chromosome-level assembly and annotation of the blue catfish Ictalurus furcatus, an aquaculture species for hybrid catfish reproduction, epigenetics, and heterosis studies | 5
The complexity landscape of viral genomes | 5
The telomere-to-telomere gap-free reference genome and taxonomic reassessment of Siniperca roulei | 5
Association mapping across a multitude of traits collected in diverse environments in maize | 5
Mutation impact on mRNA versus protein expression across human cancers | 5
The haplotype-resolved chromosome pairs of a heterozygous diploid African cassava cultivar reveal novel pan-genome and allele-specific transcriptome features | 5
V-pipe 3.0: a sustainable pipeline for within-sample viral genetic diversity estimation | 5