USearch Molecules

biology chemical biology life sciences pharmaceutical

Description

Collection of 7 billion small molecules in SMILES notation with 28 billion fingerprints, including MACCS, ECFP4, FCFP4, and PubChem, with pre-constructed USearch indexes over them.

Update Frequency

Not updated

License

Apache 2.0

Documentation

https://github.com/ashvardanian/usearch-molecules

Managed By

Ash Vardanian

See all datasets managed by Ash Vardanian.

Contact

ash.vardanian@unum.cloud

How to Cite

USearch Molecules was accessed on DATE from https://registry.opendata.aws/usearch-molecules.

Resources on AWS

  • Description
    Project data files in a public bucket
    Resource type
    S3 Bucket
    Amazon Resource Name (ARN)
    arn:aws:s3:::usearch-molecules
    AWS Region
    us-west-2
    AWS CLI Access (No AWS account required)
    aws s3 ls --no-sign-request s3://usearch-molecules/

Edit this dataset entry on GitHub

Tell us about your project

Home