GigaScience

Papers
(The median citation count of GigaScience is 4. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-08-01 to 2025-08-01.)
ArticleCitations
Reducing skin microbiome exposure impacts through swine farm biosecurity209
Current status of global conservation and characterisation of wild and cultivated Brassicaceae genetic resources90
DivBrowse—interactive visualization and exploratory data analysis of variant call matrices82
ntsm: an alignment-free, ultra-low-coverage, sequencing technology agnostic, intraspecies sample comparison tool for sample swap detection71
Protein–protein and protein–nucleic acid binding site prediction via interpretable hierarchical geometric deep learning65
MBGC: Multiple Bacteria Genome Compressor62
Democratizing data-independent acquisition proteomics analysis on public cloud infrastructures via the Galaxy framework61
3D-Beacons: decreasing the gap between protein sequences and structures through a federated network of protein structure data resources60
The curse and blessing of abundance—the evolution of drug interaction databases and their impact on drug network analysis59
Hecatomb: an integrated software platform for viral metagenomics48
The probability of edge existence due to node degree: a baseline for network-based predictions47
A high-quality, long-read genome assembly of the endangered ring-tailed lemur (Lemur catta)46
A decade of GigaScience: A perspective on conservation genetics45
Qiber3D—an open-source software package for the quantitative analysis of networks from 3D image stacks38
Cellsnake: a user-friendly tool for single-cell RNA sequencing analysis35
The Nencki-Symfonia electroencephalography/event-related potential dataset: Multiple cognitive tasks and resting-state data collected in a sample of healthy adults35
Dual-Alpha: a large EEG study for dual-frequency SSVEP brain–computer interface34
gNOMO2: a comprehensive and modular pipeline for integrated multi-omics analyses of microbiomes34
Correction to: Antibiotic resistance genes are differentially mobilized according to resistance mechanism34
Galaxy as a gateway to bioinformatics: Multi-Interface Galaxy Hands-on Training Suite (MIGHTS) for scRNA-seq32
scMAPA: Identification of cell-type–specific alternative polyadenylation in complex tissues31
X-ray microtomography imaging of craniofacial hard tissues in selected reptile species with different types of dentition31
Characteristics and filtering of low-frequency artificial short deletion variations based on nanopore sequencing30
CoVEffect: interactive system for mining the effects of SARS-CoV-2 mutations and variants based on deep learning30
CODARFE: Unlocking the prediction of continuous environmental variables based on microbiome29
A Case for estradiol: younger brains in women with earlier menarche and later menopause28
FAIR data station for lightweight metadata management and validation of omics studies26
Knowledge graph–based thought: a knowledge graph–enhanced LLM framework for pan-cancer question answering26
ricu: R’s interface to intensive care data26
Preventing dataset shift from breaking machine-learning biomarkers26
Harnessing population diversity: in search of tools of the trade25
Early microbial intervention reshapes phenotypes of newborn Bos taurus through metabolic regulations25
Genomic insights into endangerment and conservation of the garlic-fruit tree (Malania oleifera), a plant species with extremely small populations25
A virtual library for behavioral performance in standard conditions—rodent spontaneous activity in an open field during repeated testing and after treatment with drugs or brain lesions25
External validation of machine learning models—registered models and adaptive sample splitting25
Molecular mechanisms underlying hematophagia revealed by comparative analyses of leech genomes25
Studying mutation rate evolution in primates—the effects of computational pipelines and parameter choices24
vEMstitch: an algorithm for fully automatic image stitching of volume electron microscopy24
xAtlas: scalable small variant calling across heterogeneous next-generation sequencing experiments23
Hi-GDT: A Hi-C-based 3D gene domain analysis tool for analyzing local chromatin contacts in plants23
CAT Bridge: an efficient toolkit for gene–metabolite association mining from multiomics data23
A Decade of GigaScience: Milestones in Open Science23
Unveiling vertebrate development dynamics in frog Xenopus laevis using micro-CT imaging23
Computational reproducibility of Jupyter notebooks from biomedical publications22
Spacemake: processing and analysis of large-scale spatial transcriptomics data21
A new haplotype-resolved turkey genome to enable turkey genetics and genomics research21
Population modeling with machine learning can enhance measures of mental health21
Disentangling river and swamp buffalo genetic diversity: initial insights from the 1000 Buffalo Genomes Project21
Habitat suitability maps for Australian flora and fauna under CMIP6 climate scenarios21
Deterministic succession patterns in the rumen and fecal microbiome associate with host metabolic shifts in peripartum dairy cattle21
Hiding in plain sight: a research parasite’s perspective on new lessons in old data21
High-quality genome assembles from key Hawaiian coral species20
An accessible infrastructure for artificial intelligence using a Docker-based JupyterLab in Galaxy20
PEPhub: a database, web interface, and API for editing, sharing, and validating biological sample metadata19
GADMA2: more efficient and flexible demographic inference from genetic data19
Resequencing of a Pekin duck breeding population provides insights into the genomic response to short-term artificial selection18
Publishing data to support the fight against human vector-borne diseases18
Lessons learned about the biology and genomics of Diaphorina citri infection with “Candidatus Liberibacter asiaticus” by integrating new and archived organ-specific transcriptome data18
The whole-genome assembly of an endangered Salicaceae species: Chosenia arbutifolia (Pall.) A. Skv18
Retracted and Replaced: Telomere-to-telomere genome and resequencing of 254 individuals reveal evolution, genomic footprints in Asian icefish, Protosalanx chinensis18
The Capparis spinosa var. herbacea genome provides the first genomic instrument for a diversity and evolution study of the Capparaceae family18
DeePhage: distinguishing virulent and temperate phage-derived sequences in metavirome data with a deep learning approach18
ssMutPA: single-sample mutation-based pathway analysis approach for cancer precision medicine17
A Decade of GigaScience: Women in Science: Past, Present, and Future17
Telomere-to-telomere chromosome-scale genome assemblies of black and golden koi carp variants support construction of an ancient karyotype of Cypriniformes17
Haplogenome assembly reveals structural variation in Eucalyptus interspecific hybrids17
Loop detection using Hi-C data with HiCExplorer17
spatiAlign: an unsupervised contrastive learning model for data integration of spatially resolved transcriptomics17
Deep learning links localized digital pathology phenotypes with transcriptional subtype and patient outcome in glioblastoma17
Data standardization of plant–pollinator interactions17
HaploMaker: An improved algorithm for rapid haplotype assembly of genomic sequences17
De novoscreening of disease-resistant genes from the chromosome-level genome of rare minnow using CRISPR-cas9 random mutation17
TF-Prioritizer: a Java pipeline to prioritize condition-specific transcription factors16
On the benefits of self-taught learning for brain decoding16
High temporal resolution Nanopore sequencing dataset of SARS-CoV-2 and host cell RNAs16
Lifting the curse from high-dimensional data: automated projection pursuit clustering for a variety of biological data modalities15
Profiling the baseline performance and limits of machine learning models for adaptive immune receptor repertoire classification15
Disease classification for whole-blood DNA methylation: Meta-analysis, missing values imputation, and XAI15
ToxCodAn-Genome: an automated pipeline for toxin-gene annotation in genome assembly of venomous lineages15
Correction to: Habitat suitability maps for Australian flora and fauna under CMIP6 climate scenarios15
A chromosome-level genome assembly and annotation of the desert horned lizard, Phrynosoma platyrhinos, provides insight into chromosomal rearrangements among reptiles15
A chromosome-scale genome assembly of the pioneer plant Stylosanthes angustifolia: insights into genome evolution and drought adaptation15
Long-read and chromosome-scale assembly of the hexaploid wheat genome achieves high resolution for research and breeding15
The telomere-to-telomere (T2T) genome provides insights into the evolution of specialized centromere sequences in sandalwood15
Chromosome-level genome and the identification of sex chromosomes in Uloborus diversus15
NuCLS: A scalable crowdsourcing approach and dataset for nucleus classification and segmentation in breast cancer15
Construction and analysis of the chromosome-level haplotype-resolved genomes of two Crassostrea oyster congeners: Crassostrea angulata and Crassostrea gigas15
The state of Medusozoa genomics: current evidence and future challenges14
Celebrating 30 years of access to NASA Space Life Sciences data14
RWRtoolkit: multi-omic network analysis using random walks on multiplex networks in any species14
Large-scale genomic survey with deep learning-based method reveals strain-level phage specificity determinants14
EAGS: efficient and adaptive Gaussian smoothing applied to high-resolved spatial transcriptomics14
LRTK: a platform agnostic toolkit for linked-read analysis of both human genome and metagenome14
CoCoPyE: feature engineering for learning and prediction of genome quality indices13
A novel dataset for nuclei and tissue segmentation in melanoma with baseline nuclei segmentation and tissue segmentation benchmarks13
DENTIST—using long reads for closing assembly gaps at high accuracy13
Monash DaCRA fPET-fMRI: A dataset for comparison of radiotracer administration for high temporal resolution functional FDG-PET13
simAIRR: simulation of adaptive immune repertoires with realistic receptor sequence sharing for benchmarking of immune state prediction methods13
Telomere-to-telomere genome of common bean (Phaseolus vulgaris L., YP4)13
Message in a Bottle—Metabarcoding enables biodiversity comparisons across ecoregions13
Whole-genome sequencing of the invasive golden apple snail Pomacea canaliculata from Asia reveals rapid expansion and adaptive evolution13
learnMSA: learning and aligning large protein families13
An overview of the National COVID-19 Chest Imaging Database: data quality and cohort analysis13
A near telomere-to-telomere genome assembly of the Jinhua pig: enabling more accurate genetic research12
Exploring the cellular and molecular basis of murine cardiac development through spatiotemporal transcriptome sequencing12
Chromosome-level genome assemblies of two littorinid marine snails indicate genetic basis of intertidal adaptation and ancient karyotype evolved from bilaterian ancestors12
A Decade of GigaScience: GigaDB and the Open Data Movement12
Chromosome-level genome of the venomous snail Kalloconus canariensis: a valuable model for venomics and comparative genomics12
Near telomere-to-telomere genome assembly of Mongolian cattle: implications for population genetic variation and beef quality12
A dataset of ant colonies’ motion trajectories in indoor and outdoor scenes to study clustering behavior12
An in vitro whole-cell electrophysiology dataset of human cortical neurons12
Gapless genome assembly and epigenetic profiles reveal gene regulation of whole-genome triplication in lettuce12
Metaphor—A workflow for streamlined assembly and binning of metagenomes11
Living in darkness: Exploring adaptation of Proteus anguinus in 3 dimensions by X-ray imaging11
Guidance framework to apply best practices in ecological data analysis: lessons learned from building Galaxy-Ecology11
M6Allele: a toolkit for detection of allele-specific RNA N6-methyladenosine modifications11
A high-quality assembly revealing the PMEL gene for the unique plumage phenotype in Liancheng ducks11
A high-quality pseudo-phased genome for Melaleuca quinquenervia shows allelic diversity of NLR-type resistance genes11
Deciphering cancer genomes with GenomeSpy: a grammar-based visualization toolkit11
On the variability of dynamic functional connectivity assessment methods11
Katdetectr: an R/bioconductor package utilizing unsupervised changepoint analysis for robust kataegis detection11
Chromosome-level genome assembly of goose provides insight into the adaptation and growth of local goose breeds10
Deepdefense: annotation of immune systems in prokaryotes using deep learning10
Evaluation of Swin Transformer and knowledge transfer for denoising of super-resolution structured illumination microscopy data10
d-StructMAn: Containerized structural annotation on the scale from genetic variants to whole proteomes10
Chromosome-level reference genome of tetraploid Isoetes sinensis provides insights into evolution and adaption of lycophytes10
Defining the characteristics of interferon-alpha–stimulated human genes: insight from expression data and machine learning10
Chromosome-length genome assembly and linkage map of a critically endangered Australian bird: the helmeted honeyeater9
ChemChaste: Simulating spatially inhomogeneous biochemical reaction–diffusion systems for modeling cell–environment feedbacks9
Cerebellocerebral connectivity predicts body mass index: a new open-source Python-based framework for connectome-based predictive modeling9
Chromosome-level genome assemblies of Channa argus and Channa maculata and comparative analysis of their temperature adaptability9
Interspecific hybridization in Brassica species leads to changes in agronomic traits through the regulation of gene expression by chromatin accessibility and DNA methylation9
The chromosome-level genome assembly of an endangered herbBergenia scopulosaprovides insights into local adaptation and genomic vulnerability under climate change9
RNAProt: an efficient and feature-rich RNA binding protein binding site predictor9
The molecular basis of octocoral calcification revealed by genome and skeletal proteome analyses9
Accurate gene consensus at low nanopore coverage9
Genomic analyses provide insights into the evolution and salinity adaptation of halophyte Tamarix chinensis9
A molecular phenotypic map of malignant pleural mesothelioma9
A decade of GigaScience: 10 years of the evolving genomic and biomedical standards landscape9
SODAR: managing multiomics study data and metadata9
Accurate and fast clade assignment via deep learning and frequency chaos game representation9
TAMPA: interpretable analysis and visualization of metagenomics-based taxon abundance profiles9
Fully resolved assembly of Cryptosporidium parvum9
Correction to: The state of Medusozoa genomics: current evidence and future challenges9
MOBFinder: a tool for mobilization typing of plasmid metagenomic fragments based on a language model9
On the relationship between research parasites and fairness in machine learning: challenges and opportunities8
Chromosome-level genome assembly of the Pacific geoduck Panopea generosa reveals major inter- and intrachromosomal rearrangements and substantial expansion of the copine gene family8
Machine Learning Made Easy (MLme): a comprehensive toolkit for machine learning–driven data analysis8
A high-quality genome and comparison of short- versus long-read transcriptome of the palaearctic duck Aythya fuligula (tufted duck)8
Fusion transcripts and their genomic breakpoints in polyadenylated and ribosomal RNA–minus RNA sequencing data8
MuLan-Methyl—multiple transformer-based language models for accurate DNA methylation prediction8
Telomere-to-telomere gap-free genome assembly of the endangered Yangtze finless porpoise and East Asian finless porpoise8
RNAVirHost: a machine learning–based method for predicting hosts of RNA viruses through viral genomes8
A chromosome-level reference genome of Ensete glaucum gives insight into diversity and chromosomal and repetitive sequence evolution in the Musaceae8
Corrigendum to: The new COST Action European Venom Network (EUVEN)—synergy and future perspectives of modern venomics8
A trade-off in evolution: the adaptive landscape of spiders without venom glands8
Streamlining remote nanopore data access with slow5curl8
A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines8
A workflow reproducibility scale for automatic validation of biological interpretation results7
Analysis-ready VCF at Biobank scale using Zarr7
Interpretable network propagation with application to expanding the repertoire of human proteins that interact with SARS-CoV-27
Impact of reference design on estimating SARS-CoV-2 lineage abundances from wastewater sequencing data7
DeePVP: Identification and classification of phage virion proteins using deep learning7
Building the mega single-cell transcriptome ocular meta-atlas7
Spatial integration of multi-omics data from serial sections using the novel Multi-Omics Imaging Integration Toolset7
Alignstein: Optimal transport for improved LC-MS retention time alignment7
Comparative maternal protein profiling of mouse biparental and uniparental embryos7
Korea4K: whole genome sequences of 4,157 Koreans with 107 phenotypes derived from extensive health check-ups7
Genome size evolution in the diverse insect order Trichoptera7
Metabarcoding versus mapping unassembled shotgun reads for identification of prey consumed by arthropod epigeal predators7
Myth-busting the provider-user relationship for digital sequence information6
A high-quality assembly reveals genomic characteristics, phylogenetic status, and causal genes for leucism plumage of Indian peafowl6
Chromosome-level genome assembly of Pinus massoniana provides insights into conifer adaptive evolution6
Genomic view of the diversity and functional role of archaea and bacteria in the skeleton of the reef-building corals Porites lutea and Isopora palifera6
Developing best practices for genotyping-by-sequencing analysis in the construction of linkage maps6
Chromosome-level reference genome for the medically important Arabian horned viper (Cerastes gasperettii)6
Container Profiler: Profiling resource utilization of containerized big data pipelines6
A telomere-to-telomere phased genome of an octoploid strawberry reveals a receptor kinase conferring anthracnose resistance6
Imputation method for single-cell RNA-seq data using neural topic model6
Modern venomics—Current insights, novel methods, and future perspectives in biological and applied animal venom research6
epialleleR: an R/Bioconductor package for sensitive allele-specific methylation analysis in NGS data6
The rise of genomics in snake venom research: recent advances and future perspectives6
LED color gradient as a new screening tool for rapid phenotyping of plant responses to light quality6
Statistical quantification of confounding bias in machine learning models6
A ghost moth olfactory prototype of the lepidopteran sex communication6
Genome resequencing reveals independent domestication and breeding improvement of naked oat6
An interconnected data infrastructure to support large-scale rare disease research6
ARA: a flexible pipeline for automated exploration of NCBI SRA datasets6
Near-chromosomal de novo assembly of Bengal tiger genome reveals genetic hallmarks of apex predation6
Correction to: Scientists without borders: lessons from Ukraine6
Standardized genome-wide function prediction enables comparative functional genomics: a new application area for Gene Ontologies in plants6
T cell receptor repertoire sequencing reveals chemotherapy-driven clonal expansion in colorectal liver metastases6
Unlocking the power of AI for phenotyping fruit morphology in Arabidopsis6
Exploring the cobia (Rachycentron canadum) genome: unveiling putative male heterogametic regions and identification of sex-specific markers6
Construction and analysis of telomere-to-telomere genomes for 2 sweet oranges: Longhuihong and Newhall (Citrus sinensis)6
Integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants6
Similar, but not the same: multiomics comparison of human valve interstitial cells and osteoblast osteogenic differentiation expanded with an estimation of data-dependent and data-independent PASEF pr6
Lessons learned to boost a bioinformatics knowledge base reusability, the Bgee experience6
Deciphering the distinct transcriptomic and gene regulatory map in adult macaque basal ganglia cells6
Desiderata for the development of next-generation electronic health record phenotype libraries6
Efficient real-time selective genome sequencing on resource-constrained devices6
Linking big biomedical datasets to modular analysis with Portable Encapsulated Projects6
Image segmentation of treated and untreated tumor spheroids by fully convolutional networks6
The haplotype-resolved chromosome pairs of a heterozygous diploid African cassava cultivar reveal novel pan-genome and allele-specific transcriptome features5
Association mapping across a multitude of traits collected in diverse environments in maize5
Mutation impact on mRNA versus protein expression across human cancers5
HVSeeker: a deep-learning-based method for identification of host and viral DNA sequences5
Evolution of complex genome architecture in gymnosperms5
Toward global integration of biodiversity big data: a harmonized metabarcode data generation module for terrestrial arthropods5
Antibiotic resistance genes are differentially mobilized according to resistance mechanism5
Overture: an open-source genomics data platform5
Integrating comparative genomics and risk classification by assessing virulence, antimicrobial resistance, and plasmid spread in microbial communities with gSpreadComp5
The complexity landscape of viral genomes5
Identifying candidate genetic variants for egg number by analyzing over 1,000 fully sequenced layers5
CNVpytor: a tool for copy number variation detection and analysis from read depth and allele imbalance in whole-genome sequencing5
Diaci v3.0: chromosome-level assembly, de novo transcriptome, and manual annotation of Diaphorina citri, insect vector of Huanglongbing5
CheRRI—Accurate classification of the biological relevance of putative RNA–RNA interaction sites5
SimFFPE and FilterFFPE: improving structural variant calling in FFPE samples5
A high-quality chromosomal genome assembly of the sea cucumber Chiridota heheva and its hydrothermal adaptation5
DNA methylation analysis to differentiate reference, breed, and parent-of-origin effects in the bovine pangenome era5
Delineating regions of interest for mass spectrometry imaging by multimodally corroborated spatial segmentation5
Correction to: molecular mechanisms underlying hematophagia revealed by comparative analyses of leech genomes5
Chromosome-level assembly and annotation of the blue catfishIctalurus furcatus, an aquaculture species for hybrid catfish reproduction, epigenetics, and heterosis studies5
Contrast subgraphs allow comparing homogeneous and heterogeneous networks derived from omics data5
Euler characteristic curves and profiles: a stable shape invariant for big data problems5
Single-cell transcriptome analysis illuminating the characteristics of species-specific innate immune responses against viral infections5
Pangenome databases improve host removal and mycobacteria classification from clinical metagenomic data5
A telomere-to-telomere gapless genome reveals SlPRR1 control of circadian rhythm and photoperiodic flowering in tomato5
The telomere-to-telomere gap-free reference genome and taxonomic reassessment of Siniperca roulei5
Extraction of biological terms using large language models enhances the usability of metadata in the BioSample database4
A chromosome-level assembly supports genome-wide investigation of the DMRT gene family in the golden mussel (Limnoperna fortunei)4
Construction of a new chromosome-scale, long-read reference genome assembly for the Syrian hamster, Mesocricetus auratus4
Highly accurate whole-genome imputation of SARS-CoV-2 from partial or low-quality sequences4
Generation and application of pseudo–long reads for metagenome assembly4
A chromosome-level genome assembly and intestinal transcriptome of Trypoxylus dichotomus (Coleoptera: Scarabaeidae) to understand its lignocellulose digestion ability4
The founding charter of the Omic Biodiversity Observation Network (Omic BON)4
demuxSNP: supervised demultiplexing single-cell RNA sequencing using cell hashing and SNPs4
Karyon: a computational framework for the diagnosis of hybrids, aneuploids, and other nonstandard architectures in genome assemblies4
Fungal and ciliate protozoa are the main rumen microbes associated with methane emissions in dairy cattle4
The Australasian dingo archetype: de novo chromosome-length genome assembly, DNA methylome, and cranial morphology4
AMR-meta: a k-mer and metafeature approach to classify antimicrobial resistance from high-throughput short-read metagenomics data4
A chromosome-level genome assembly for the eastern fence lizard (Sceloporus undulatus), a reptile model for physiological and evolutionary ecology4
Data management strategy for a collaborative research center4
Quantitative monitoring of nucleotide sequence data from genetic resources in context of their citation in the scientific literature4
Design and implementation of a scalable high-performance computing (HPC) cluster for omics data analysis: achievements, challenges and recommendations in LMICs4
Open-source benchmarking of IBD segment detection methods for biobank-scale cohorts4
Correction to: DivBrowse—interactive visualization and exploratory data analysis of variant call matrices4
Best genome sequencing strategies for annotation of complex immune gene families in wildlife4
Long-read metagenomic sequencing negates inferred loss of cytosine methylation in Myxosporea (Cnidaria: Myxozoa)4
What the Phage: a scalable workflow for the identification and analysis of phage sequences4
0.043808937072754