GigaScience

Papers
(The TQCC of GigaScience is 11. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-04-01 to 2025-04-01.)
ArticleCitations
Correction to: Habitat suitability maps for Australian flora and fauna under CMIP6 climate scenarios111
Selection signatures in goats reveal a novel deletion mutant underlying cashmere yield and diameter77
Protein–protein and protein–nucleic acid binding site prediction via interpretable hierarchical geometric deep learning69
Label3DMaize: toolkit for 3D point cloud data annotation of maize shoots59
Driftage: a multi-agent system framework for concept drift detection58
Interpretable network-guided epistasis detection58
A large-scale metagenomic survey dataset of the post-weaning piglet gut lumen55
Erratum to: An overview of the National COVID-19 Chest Imaging Database: data quality and cohort analysis50
A chromosome-level genome assembly and annotation of the desert horned lizard, Phrynosoma platyrhinos, provides insight into chromosomal rearrangements among reptiles50
An improved ovine reference genome assembly to facilitate in-depth functional annotation of the sheep genome49
Triku: a feature selection method based on nearest neighbors for single-cell data47
Democratizing data-independent acquisition proteomics analysis on public cloud infrastructures via the Galaxy framework41
A high-quality genome and comparison of short- versus long-read transcriptome of the palaearctic duck Aythya fuligula (tufted duck)41
Chromosome-level genome of the globe skimmer dragonfly (Pantala flavescens)39
The assembled and annotated genome of the masked palm civet (Paguma larvata)38
A chromosome-level genome of the booklouse, Liposcelis brunnea, provides insight into louse evolution and environmental stress adaptation38
Profiling the baseline performance and limits of machine learning models for adaptive immune receptor repertoire classification38
The state of Medusozoa genomics: current evidence and future challenges35
A chromosome-level reference genome of Ensete glaucum gives insight into diversity and chromosomal and repetitive sequence evolution in the Musaceae35
A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines34
Evaluating long-read de novo assembly tools for eukaryotic genomes: insights and considerations33
DrugSim2DR: systematic prediction of drug functional similarities in the context of specific disease for drug repurposing31
A reference genome of Commelinales provides insights into the commelinids evolution and global spread of water hyacinth (Pontederia crassipes)30
Open Data Governance at the Canadian Open Neuroscience Platform (CONP): From the Walled Garden to the Arboretum30
LRTK: a platform agnostic toolkit for linked-read analysis of both human genome and metagenome30
DriverMP enables improved identification of cancer driver genes29
GSC: efficient lossless compression of VCF files with fast query29
Hecatomb: an integrated software platform for viral metagenomics29
Exploring the role of polymorphic interspecies structural variants in reproductive isolation and adaptive divergence in Eucalyptus28
Large-scale genomic survey with deep learning-based method reveals strain-level phage specificity determinants28
Current status of global conservation and characterisation of wild and cultivated Brassicaceae genetic resources26
Web of venom: exploration of big data resources in animal toxin research26
RNAVirHost: a machine learning–based method for predicting hosts of RNA viruses through viral genomes26
Celebrating 30 years of access to NASA Space Life Sciences data25
The telomere-to-telomere (T2T) genome provides insights into the evolution of specialized centromere sequences in sandalwood25
AltaiR: a C toolkit for alignment-free and temporal analysis of multi-FASTA data25
Genomic exploration of the endangered oriental stork, Ciconia boyciana, sheds light on migration adaptation and future conservation24
Alignstein: Optimal transport for improved LC-MS retention time alignment23
DOME Registry: implementing community-wide recommendations for reporting supervised machine learning in biology23
DivBrowse—interactive visualization and exploratory data analysis of variant call matrices22
Genomes and demographic histories of the endangered Bretschneidera sinensis (Akaniaceae)22
annotate_my_genomes: an easy-to-use pipeline to improve genome annotation and uncover neglected genes by hybrid RNA sequencing22
Chromosome-level genome assembly for the Aldabra giant tortoise enables insights into the genetic health of a threatened population22
The curse and blessing of abundance—the evolution of drug interaction databases and their impact on drug network analysis21
GERONIMO: A tool for systematic retrieval of structural RNAs in a broad evolutionary context21
Correction to: A graph clustering algorithm for detection and genotyping of structural variants from long reads21
Health record hiccups—5,526 real-world time series with change points labelled by crowdsourced visual inspection21
High-quality phenotypic and genotypic dataset of barley genebank core collection to unlock untapped genetic diversity21
An ecosystem for producing and sharing metadata within the web of FAIR Data21
High-throughput imaging of powdery mildew resistance of the winter wheat collection hosted at the German Federal ex situ Genebank for Agricultural and Horticultural Crops21
M2aia—Interactive, fast, and memory-efficient analysis of 2D and 3D multi-modal mass spectrometry imaging data21
The Global Atlas of Bamboo and Rattan (GABR) Phase II: new resources for sustainable development20
Computational prediction of human deep intronic variation20
ntsm: an alignment-free, ultra-low-coverage, sequencing technology agnostic, intraspecies sample comparison tool for sample swap detection19
Message in a Bottle—Metabarcoding enables biodiversity comparisons across ecoregions19
Impact of reference design on estimating SARS-CoV-2 lineage abundances from wastewater sequencing data19
Confound-leakage: confound removal in machine learning leads to leakage19
Genomic and transcriptomic analyses of Heteropoda venatoria reveal the expansion of P450 family for starvation resistance in spiders18
Maternal plasma lipids are involved in the pathogenesis of preterm birth18
Chromosome-level genome and the identification of sex chromosomes in Uloborus diversus18
3D-Beacons: decreasing the gap between protein sequences and structures through a federated network of protein structure data resources18
BrumiR: A toolkit for de novo discovery of microRNAs from sRNA-seq data17
An analysis of performance bottlenecks in MRI preprocessing17
EAGS: efficient and adaptive Gaussian smoothing applied to high-resolved spatial transcriptomics17
scShapes: a statistical framework for identifying distribution shapes in single-cell RNA-sequencing data17
MBGC: Multiple Bacteria Genome Compressor17
The probability of edge existence due to node degree: a baseline for network-based predictions17
Galaxy as a gateway to bioinformatics: Multi-Interface Galaxy Hands-on Training Suite (MIGHTS) for scRNA-seq16
xRead: a coverage-guided approach for scalable construction of read overlapping graph16
A chromosome-level genome assembly for the eastern fence lizard (Sceloporus undulatus), a reptile model for physiological and evolutionary ecology16
Similar, but not the same: multiomics comparison of human valve interstitial cells and osteoblast osteogenic differentiation expanded with an estimation of data-dependent and data-independent PASEF pr16
Preventing dataset shift from breaking machine-learning biomarkers16
Long-read metagenomic sequencing negates inferred loss of cytosine methylation in Myxosporea (Cnidaria: Myxozoa)16
A novel dataset for nuclei and tissue segmentation in melanoma with baseline nuclei segmentation and tissue segmentation benchmarks16
Best genome sequencing strategies for annotation of complex immune gene families in wildlife16
A streamlined workflow for conversion, peer review, and publication of genomics metadata as omics data papers15
Chromosome-level genome of the venomous snail Kalloconus canariensis: a valuable model for venomics and comparative genomics15
The genomes of Dahlia pinnata, Cosmos bipinnatus, and Bidens alba in tribe Coreopsideae provide insights into polyploid evolution and inulin biosynthesis15
Exploring the cellular and molecular basis of murine cardiac development through spatiotemporal transcriptome sequencing15
Correction to: Antibiotic resistance genes are differentially mobilized according to resistance mechanism15
Qiber3D—an open-source software package for the quantitative analysis of networks from 3D image stacks15
The telomere-to-telomere (T2T) genome of Peucedanum praeruptorum Dunn provides insights into the genome evolution and coumarin biosynthesis15
Benchmarking short-read metagenomics tools for removing host contamination15
Vulcan: Improved long-read mapping and structural variant calling via dual-mode alignment15
A ghost moth olfactory prototype of the lepidopteran sex communication15
Dual-Alpha: a large EEG study for dual-frequency SSVEP brain–computer interface14
An interconnected data infrastructure to support large-scale rare disease research14
CoCoPyE: feature engineering for learning and prediction of genome quality indices14
Gapless genome assembly and epigenetic profiles reveal gene regulation of whole-genome triplication in lettuce14
Efficient phylogenetic tree inference for massive taxonomic datasets: harnessing the power of a server to analyze 1 million taxa14
Whole-genome sequencing of the invasive golden apple snail Pomacea canaliculata from Asia reveals rapid expansion and adaptive evolution13
Future-proofing and maximizing the utility of metadata: The PHA4GE SARS-CoV-2 contextual data specification package13
The Nencki-Symfonia electroencephalography/event-related potential dataset: Multiple cognitive tasks and resting-state data collected in a sample of healthy adults13
An analysis of security vulnerabilities in container images for scientific data analysis13
Stratum corneum nanotexture feature detection using deep learning and spatial analysis: a noninvasive tool for skin barrier assessment13
Interpretable network propagation with application to expanding the repertoire of human proteins that interact with SARS-CoV-213
Chromosome-level genome assembly, annotation, and phylogenomics of the gooseneck barnacle Pollicipes pollicipes13
PlasGO: enhancing GO-based function prediction for plasmid-encoded proteins based on genetic structure13
Genome size evolution in the diverse insect order Trichoptera13
Metabarcoding versus mapping unassembled shotgun reads for identification of prey consumed by arthropod epigeal predators13
Monash DaCRA fPET-fMRI: A dataset for comparison of radiotracer administration for high temporal resolution functional FDG-PET12
A high-quality, long-read genome assembly of the endangered ring-tailed lemur (Lemur catta)12
A Decade of GigaScience: GigaDB and the Open Data Movement12
X-ray microtomography imaging of craniofacial hard tissues in selected reptile species with different types of dentition12
scMAPA: Identification of cell-type–specific alternative polyadenylation in complex tissues12
Statistical quantification of confounding bias in machine learning models12
KOREF_S1: phased, parental trio-binned Korean reference genome using long reads and Hi-C sequencing methods12
AMR-meta: a k-mer and metafeature approach to classify antimicrobial resistance from high-throughput short-read metagenomics data12
A decade of GigaScience: A perspective on conservation genetics12
Comparative maternal protein profiling of mouse biparental and uniparental embryos11
Centering inclusivity in the design of online conferences—An OHBM–Open Science perspective11
CORAL: A framework for rigorous self-validated data modeling and integrative, reproducible data analysis11
An overview of the National COVID-19 Chest Imaging Database: data quality and cohort analysis11
The germline mutational process in rhesus macaque and its implications for phylogenetic dating11
Building the mega single-cell transcriptome ocular meta-atlas11
DeePVP: Identification and classification of phage virion proteins using deep learning11
Internet of Samples (iSamples): Toward an interdisciplinary cyberinfrastructure for material samples11
DENTIST—using long reads for closing assembly gaps at high accuracy11
A dataset of ant colonies’ motion trajectories in indoor and outdoor scenes to study clustering behavior11
0.23913788795471