14 datasets found
  • Federal

    NCBI Virus

    U.S. Department of Health & Human Services —

    NCBI Virus is an integrative, value-added resource designed to support retrieval, display and analysis of a curated collection of virus sequences and large sequence...
  • Federal

    NCBI Datasets

    U.S. Department of Health & Human Services —

    NCBI Datasets is a one-stop shop for finding, browsing, and downloading genomic data. Find and download taxonomy, genome, gene, transcript, protein data, including...
  • Federal

    Structure - Molecular Modeling Database (MMDB)

    U.S. Department of Health & Human Services —

    Three dimensional structures provide a wealth of information on the biological function and the evolutionary history of macromolecules. They can be used to examine...
  • Federal

    PSSM Viewer

    U.S. Department of Health & Human Services —

    Users can display, sort, subset and download position-specific score matrices (PSSMs) either from CDD records or from Position Specific Iterated (PSI)-BLAST protein...
  • Federal

    Protein

    U.S. Department of Health & Human Services —

    The Protein database is a collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as...
  • Federal

    BLAST (Basic Local Alignment Search Tool)

    U.S. Department of Health & Human Services —

    BLAST (Basic Local Alignment Search Tool) finds regions of similarity between biological sequences. BLAST includes several specialized search interfaces: SmartBLAST,...
  • Federal

    Constraint-Based Multiple Alignment Tool (COBALT)

    U.S. Department of Health & Human Services —

    Constraint-Based Multiple Alignment Tool (COBALT) is a protein multiple sequence alignment tool that finds a collection of pairwise constraints derived from conserved...
  • Federal

    Conserved Domain Database (CDD)

    U.S. Department of Health & Human Services —

    Conserved Domain Database (CDD) is a protein annotation resource that consists of a collection of well-annotated multiple sequence alignment models for ancient...
  • Federal

    Multiple Sequence Alignment (MSA) Viewer

    U.S. Department of Health & Human Services —

    An interactive Web application that enables users to visualize multiple alignments created by database search results or other software applications. The MSA Viewer...
  • Federal

    ProSplign

    U.S. Department of Health & Human Services —

    A utility for computing alignment of proteins to genomic nucleotide sequence based on a variation of the Needleman-Wunsch global alignment algorithm and specifically...
  • Federal

    Vector Alignment Search Tool (VAST)

    U.S. Department of Health & Human Services —

    A computer algorithm that identifies similar protein 3-dimensional structures. Structure neighbors for every structure in MMDB are pre-computed and accessible via...
  • Federal

    Consensus CDS (CCDS)

    U.S. Department of Health & Human Services —

    The Consensus CDS (CCDS) project is a collaborative effort to identify a core set of human and mouse protein coding regions that are consistently annotated and of...
  • Federal

    CDTree

    U.S. Department of Health & Human Services —

    CDTree is a stand-alone application for classifying protein sequences and investigating their evolutionary relationships. CDTree can import, analyze and update...
  • Federal

    Protein Clusters

    U.S. Department of Health & Human Services —

    A collection of Reference Sequence (RefSeq) proteins, from the complete genomes of prokaryotes, plasmids, and organelles, that have been grouped and annotated based...

You can also access this registry using the API (see API Docs).
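
As a rough illustration of API access, the sketch below builds a search request and summarizes the response. It assumes the registry exposes a CKAN-style Action API at `https://catalog.data.gov/api/3/action/package_search` with `q`, `rows`, and `start` parameters, and that each result carries `title` and `organization` fields; confirm both against the API Docs before relying on it. The query string `protein alignment` and the helper names are hypothetical.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: CKAN-style Action API endpoint; verify against the API Docs.
BASE_URL = "https://catalog.data.gov/api/3/action/package_search"


def build_search_url(query, rows=20, start=0):
    """Build a package_search URL for a free-text query (hypothetical helper)."""
    params = {"q": query, "rows": rows, "start": start}
    return f"{BASE_URL}?{urlencode(params)}"


def summarize(payload):
    """Return (count, [(title, organization), ...]) from a decoded response.

    Assumes the CKAN response shape: {"result": {"count": ..., "results": [...]}}.
    """
    result = payload["result"]
    entries = [
        (pkg.get("title", ""), (pkg.get("organization") or {}).get("title", ""))
        for pkg in result.get("results", [])
    ]
    return result.get("count", 0), entries


def search(query, rows=20):
    """Fetch one page of results over the network and summarize it."""
    with urlopen(build_search_url(query, rows=rows), timeout=30) as resp:
        return summarize(json.load(resp))
```

Calling `search("protein alignment", rows=5)` would return the total match count and up to five `(title, organization)` pairs, provided the endpoint behaves as assumed.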
