GigaScience

Papers
(The TQCC of GigaScience is 8. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-05-01 to 2026-05-01.)
Article | Citations
Hecatomb: an integrated software platform for viral metagenomics | 459
ntsm: an alignment-free, ultra-low-coverage, sequencing technology agnostic, intraspecies sample comparison tool for sample swap detection | 39
Protein–protein and protein–nucleic acid binding site prediction via interpretable hierarchical geometric deep learning | 38
Characterizing a species-rich and understudied tropical insect fauna using DNA barcoding | 38
The curse and blessing of abundance—the evolution of drug interaction databases and their impact on drug network analysis | 36
DivBrowse—interactive visualization and exploratory data analysis of variant call matrices | 29
Current status of global conservation and characterisation of wild and cultivated Brassicaceae genetic resources | 28
Reducing skin microbiome exposure impacts through swine farm biosecurity | 28
Expression-driven genetic dependency reveals targets for precision oncology | 28
The probability of edge existence due to node degree: a baseline for network-based predictions | 28
Correction to: Antibiotic resistance genes are differentially mobilized according to resistance mechanism | 27
The genomes of 5 mantises provide insights into sex chromosome evolution and Mantodea phylogeny clarification | 26
TinkerHap—a novel read-based phasing algorithm with integrated multimethod support for enhanced accuracy | 25
On the path to reference genomes for all biodiversity: laboratory protocols and lessons learned from processing over 2,000 species in the Sanger Tree of Life | 25
gNOMO2: a comprehensive and modular pipeline for integrated multi-omics analyses of microbiomes | 24
CoVEffect: interactive system for mining the effects of SARS-CoV-2 mutations and variants based on deep learning | 24
Datagraphy: toward a systematic approach to dataset discovery | 24
Dual-Alpha: a large EEG study for dual-frequency SSVEP brain–computer interface | 23
Galaxy as a gateway to bioinformatics: Multi-Interface Galaxy Hands-on Training Suite (MIGHTS) for scRNA-seq | 23
SeuratExtend: streamlining single-cell RNA-seq analysis through an integrated and intuitive framework | 22
CODARFE: Unlocking the prediction of continuous environmental variables based on microbiome | 22
ricu: R’s interface to intensive care data | 22
FAIR data station for lightweight metadata management and validation of omics studies | 22
WaveSeekerNet: accurate prediction of influenza A virus subtypes and host source using attention-based deep learning | 22
Characteristics and filtering of low-frequency artificial short deletion variations based on nanopore sequencing | 21
Cellsnake: a user-friendly tool for single-cell RNA sequencing analysis | 21
xAtlas: scalable small variant calling across heterogeneous next-generation sequencing experiments | 20
Hi-GDT: A Hi-C-based 3D gene domain analysis tool for analyzing local chromatin contacts in plants | 20
A Case for estradiol: younger brains in women with earlier menarche and later menopause | 20
Molecular mechanisms underlying hematophagia revealed by comparative analyses of leech genomes | 19
vEMstitch: an algorithm for fully automatic image stitching of volume electron microscopy | 19
CAT Bridge: an efficient toolkit for gene–metabolite association mining from multiomics data | 19
Knowledge graph–based thought: a knowledge graph–enhanced LLM framework for pan-cancer question answering | 18
Chromosome-level genome assembly and transcriptomes of the leaf insect Cryptophyllium westwoodii provide insights into the evolution of leaf-like masquer | 18
Early microbial intervention reshapes phenotypes of newborn Bos taurus through metabolic regulations | 18
Multiomics uncovers the epigenomic and transcriptomic response to viral and bacterial stimulation in turbot | 17
Harnessing population diversity: in search of tools of the trade | 17
Genomic insights into endangerment and conservation of the garlic-fruit tree (Malania oleifera), a plant species with extremely small populations | 17
Unveiling vertebrate development dynamics in frog Xenopus laevis using micro-CT imaging | 16
External validation of machine learning models—registered models and adaptive sample splitting | 16
Hiding in plain sight: a research parasite’s perspective on new lessons in old data | 16
Computational reproducibility of Jupyter notebooks from biomedical publications | 16
PEPhub: a database, web interface, and API for editing, sharing, and validating biological sample metadata | 16
scDenorm: a denormalisation tool for integrating single-cell transcriptomics data | 15
Habitat suitability maps for Australian flora and fauna under CMIP6 climate scenarios | 15
MBGC2: Boosting compression via efficient encoding of approximate matches in genome collections | 15
Deterministic succession patterns in the rumen and fecal microbiome associate with host metabolic shifts in peripartum dairy cattle | 15
An accessible infrastructure for artificial intelligence using a Docker-based JupyterLab in Galaxy | 15
pyRootHair: Machine learning accelerated software for high-throughput phenotyping of plant root hair traits | 14
GADMA2: more efficient and flexible demographic inference from genetic data | 14
spatiAlign: an unsupervised contrastive learning model for data integration of spatially resolved transcriptomics | 14
Retracted and Replaced: Telomere-to-telomere genome and resequencing of 254 individuals reveal evolution, genomic footprints in Asian icefish, Protosalanx chinensis | 14
Segmentation-based quality control of structural MRI using the CAT12 toolbox | 14
Resequencing of a Pekin duck breeding population provides insights into the genomic response to short-term artificial selection | 14
A near telomere-to-telomere phased genome assembly and annotation for the Australian central bearded dragon Pogona vitticeps | 14
Publishing data to support the fight against human vector-borne diseases | 14
A new haplotype-resolved turkey genome to enable turkey genetics and genomics research | 14
Disentangling river and swamp buffalo genetic diversity: initial insights from the 1000 Buffalo Genomes Project | 14
NApy: efficient statistics in Python for large-scale heterogeneous data with enhanced support for missing data | 14
First chromosome-level genome assembly of the colonial chordate model Botryllus schlosseri (Tunicata) | 14
Haplogenome assembly reveals structural variation in Eucalyptus interspecific hybrids | 13
Exploring the role of normalization and feature selection in microbiome disease classification pipelines | 13
An interpretable Graph-Regularized Optimal Transport Framework for Diagonal Single-Cell Integrative Analysis | 13
ssMutPA: single-sample mutation-based pathway analysis approach for cancer precision medicine | 13
ChronoRoot 2.0: an open AI-powered platform for 2D temporal plant phenotyping | 13
Telomere-to-telomere chromosome-scale genome assemblies of black and golden koi carp variants support construction of an ancient karyotype of Cypriniformes | 13
Interactive analysis of single-cell trajectories in 3D space with Cell Journey | 13
A chromosome-scale genome assembly of the pioneer plant Stylosanthes angustifolia: insights into genome evolution and drought adaptation | 12
ToxCodAn-Genome: an automated pipeline for toxin-gene annotation in genome assembly of venomous lineages | 12
Challenges in structural variant calling in low-complexity regions | 12
On the benefits of self-taught learning for brain decoding | 12
A high-quality reference genome for the Ural Owl (Strix uralensis) enables investigations of cell cultures as a genomic resource for endangered species | 12
TF-Prioritizer: a Java pipeline to prioritize condition-specific transcription factors | 12
The effects of bioinformatics preprocessing on cell-free DNA fragment analysis | 12
Deep learning links localized digital pathology phenotypes with transcriptional subtype and patient outcome in glioblastoma | 12
Construction and analysis of the chromosome-level haplotype-resolved genomes of two Crassostrea oyster congeners: Crassostrea angulata and Crassostrea gigas | 12
AEnet: a practical tool to construct the splicing-associated phenotype atlas at a single cell level | 11
EAGS: efficient and adaptive Gaussian smoothing applied to high-resolved spatial transcriptomics | 11
A near-complete genome assembly of the bearded dragon Pogona vitticeps provides insights into the origin of Pogona | 11
HeteroMRI: Robust white matter abnormality classification across multi-scanner MRI data | 11
Correction to: Habitat suitability maps for Australian flora and fauna under CMIP6 climate scenarios | 11
nf-core/proteinfamilies: a scalable pipeline for the generation of protein families | 11
The telomere-to-telomere (T2T) genome provides insights into the evolution of specialized centromere sequences in sandalwood | 11
Chromosome-level genome and the identification of sex chromosomes in Uloborus diversus | 11
Celebrating 30 years of access to NASA Space Life Sciences data | 11
Stereo-cell deciphers the spatial and functional heterogeneity of polyploid hepatocytes | 10
simAIRR: simulation of adaptive immune repertoires with realistic receptor sequence sharing for benchmarking of immune state prediction methods | 10
RWRtoolkit: multi-omic network analysis using random walks on multiplex networks in any species | 10
CoCoPyE: feature engineering for learning and prediction of genome quality indices | 10
LRTK: a platform agnostic toolkit for linked-read analysis of both human genome and metagenome | 10
Toward total recall: Enhancing data FAIRness through AI-driven metadata standardization | 10
PathoGFAIR: a collection of FAIR and adaptable (meta)genomics workflows for (foodborne) pathogens detection and tracking | 10
A near telomere-to-telomere genome assembly of the Jinhua pig: enabling more accurate genetic research | 10
Lifting the curse from high-dimensional data: automated projection pursuit clustering for a variety of biological data modalities | 10
Large-scale genomic survey with deep learning-based method reveals strain-level phage specificity determinants | 10
Telomere-to-telomere genome of common bean (Phaseolus vulgaris L., YP4) | 10
A telomere-to-telomere genome assembly of koi carp (Cyprinus carpio) using long reads and Hi-C technology | 10
The Open Pediatric Cancer Project | 9
Gapless genome assembly and epigenetic profiles reveal gene regulation of whole-genome triplication in lettuce | 9
Federated knowledge retrieval elevates large language model performance on biomedical benchmarks | 9
PanGIA: A universal framework for identifying association between ncRNAs and diseases | 9
Using synthetic RNA to benchmark poly(A) length inference from direct RNA sequencing | 9
Learning inherent genetic patterns and trait associations with deep generative models for discrete genotype simulation | 9
Exploring the cellular and molecular basis of murine cardiac development through spatiotemporal transcriptome sequencing | 9
Enhanced semantic classification of microbiome sample origins using large language models (LLMs) | 9
Retraction and replacement of: Telomere-to-telomere genome and resequencing of 254 individuals reveal evolution, genomic footprints in Asian icefish, Protosalanx chinensis | 9
Deepdefense: annotation of immune systems in prokaryotes using deep learning | 9
Translating short-form Python exercises to other programming languages using diverse prompting strategies | 9
A novel dataset for nuclei and tissue segmentation in melanoma with baseline nuclei segmentation and tissue segmentation benchmarks | 9
Whole-genome sequencing of the invasive golden apple snail Pomacea canaliculata from Asia reveals rapid expansion and adaptive evolution | 9
Chromosome-level genome of the venomous snail Kalloconus canariensis: a valuable model for venomics and comparative genomics | 9
M6Allele: a toolkit for detection of allele-specific RNA N6-methyladenosine modifications | 9
Katdetectr: an R/bioconductor package utilizing unsupervised changepoint analysis for robust kataegis detection | 9
Near telomere-to-telomere genome assembly of Mongolian cattle: implications for population genetic variation and beef quality | 9
Chromosome-level genome assembly of goose provides insight into the adaptation and growth of local goose breeds | 8
Chromosome-level genome assemblies of two littorinid marine snails indicate genetic basis of intertidal adaptation and ancient karyotype evolved from bilaterian ancestors | 8
Cerebellocerebral connectivity predicts body mass index: a new open-source Python-based framework for connectome-based predictive modeling | 8
Interspecific hybridization in Brassica species leads to changes in agronomic traits through the regulation of gene expression by chromatin accessibility and DNA methylation | 8
Deciphering cancer genomes with GenomeSpy: a grammar-based visualization toolkit | 8
Guidance framework to apply best practices in ecological data analysis: lessons learned from building Galaxy-Ecology | 8
Chromosome-level reference genome of tetraploid Isoetes sinensis provides insights into evolution and adaption of lycophytes | 8
A high-quality assembly revealing the PMEL gene for the unique plumage phenotype in Liancheng ducks | 8
A high-quality pseudo-phased genome for Melaleuca quinquenervia shows allelic diversity of NLR-type resistance genes | 8
Improving taxonomic inference from ancient environmental metagenomes by masking microbial-like regions in reference genomes | 8
On the variability of dynamic functional connectivity assessment methods | 8
Evaluation of Swin Transformer and knowledge transfer for denoising of super-resolution structured illumination microscopy data | 8
Metaphor—A workflow for streamlined assembly and binning of metagenomes | 8
SurGen: 1020 H&E-stained whole-slide images with survival and genetic markers | 8
TAMPA: interpretable analysis and visualization of metagenomics-based taxon abundance profiles | 8