The Registry of Open Data on AWS is now available on AWS Data Exchange
All datasets on the Registry of Open Data are now discoverable on AWS Data Exchange alongside 3,000+ existing data products from category-leading data providers across industries. Explore the catalog to find open, free, and commercial data sets. Learn more about AWS Data Exchange

Alliance of Genome Resources

bioinformatics biology Caenorhabditis elegans Danio rerio Drosophila melanogaster fasta gene expression genetic genome genomic Homo sapiens life sciences Mus musculus protein Rattus norvegicus transcriptomics vcf

Description

The Alliance of Genome Resources is a consortium that integrates genomic, genetic, and molecular data from leading model organism databases including Drosophila melanogaster, Caenorhabditis elegans, Danio rerio (zebrafish), Mus musculus (mouse), Rattus norvegicus (rat), Saccharomyces cerevisiae (yeast), Xenopus laevis and Xenopus tropicalis (frogs), and human reference data. The Alliance provides comprehensive datasets including gene annotations, disease associations, expression data (bulk and single-cell RNA-Seq), protein and genetic interactions, orthology relationships, variants and alleles, and complete genome sequences with annotations. Data is organized into Alliance-wide integrated datasets and organism-specific collections, supporting comparative genomics, disease modeling, and functional genomics research.

Update Frequency

Quarterly releases (every ~3 months)

License

Most Alliance data is available under CC0 1.0 Universal (Public Domain Dedication). Some datasets may use CC-BY 4.0 (attribution required). Full details at https://www.alliancegenome.org/terms-of-use

Documentation

https://github.com/alliance-genome/agr_open_data

Managed By

Alliance of Genome Resources Consortium

See all datasets managed by Alliance of Genome Resources Consortium.

Contact

help@alliancegenome.org

How to Cite

Alliance of Genome Resources was accessed on DATE from https://registry.opendata.aws/alliance-genome-resources. Alliance of Genome Resources Consortium. Alliance of Genome Resources Portal - unified model organism research platform. Nucleic Acids Research (2023). https://doi.org/10.1093/nar/gkac1003

Usage Examples

Tutorials
Tools & Applications
Publications

Resources on AWS

  • Description
    Alliance-wide integrated datasets including disease associations, gene expression, molecular and genetic interactions, orthology relationships, gene descriptions, and variants across all Alliance organisms. Data is organized by release version (8.3.0/, 8.2.0/, etc.), then by data type, with organism-specific collections for FB (FlyBase/Drosophila), MGI (Mouse), RGD (Rat), SGD (Yeast), WB (Worm), XBXL/XBXT (Xenopus), ZFIN (Zebrafish), and HUMAN reference data. Available in TSV, JSON, and VCF formats.
    Resource type
    S3 Bucket
    Amazon Resource Name (ARN)
    arn:aws:s3:::alliance-genome-downloads
    AWS Region
    us-east-1
    AWS CLI Access (No AWS account required)
    aws s3 ls --no-sign-request s3://alliance-genome-downloads/
    Explore
    Browse Bucket
  • Description
    FlyBase-specific data for Drosophila melanogaster and related species, including gene annotations, GO annotations, expression data (bulk RNA-Seq, single-cell RNA-Seq), disease associations, phenotypes, interactions, orthologs, genome sequences (FASTA), and genome annotations (GFF3/GTF). Data organized by release (current/, FB2025_04/, etc.) with precomputed analysis files and complete Chado XML database dumps. Publicly accessible via HTTPS for direct download without AWS credentials.
    Resource type
    S3 Bucket
    Amazon Resource Name (ARN)
    arn:aws:s3:::s3ftp.flybase.org
    AWS Region
    us-east-1
    AWS CLI Access (No AWS account required)
    aws s3 ls --no-sign-request s3://s3ftp.flybase.org/
    Explore
    Browse via HTTPS

Edit this dataset entry on GitHub

Tell us about your project

Home