Your browser version may not work well with NCBI's Web applications. More information here...
Protein Clusters

 About the Database

Welcome to Entrez Protein Clusters (ProtClustDB). This collection of related protein sequences (clusters) consists of Reference Sequence proteins encoded by complete genomes. This database contains both curated and non-curated clusters. For release-specific information check the stats page.

The Protein Clusters database provides easy access to annotation information, publications, domains, structures, and external links and analysis tools including multiple alignments, phylogenetic trees, and genomic neighborhoods (ProtMap).

Protein Clusters can be searched like any other Entrez database. For more information on how to use Entrez please examine the Entrez Help Document.

A publication describing ProtClustDB is now available: Klimke et al., 2009. The National Center for Biotechnology Information's Protein Clusters Database. Nucleic Acids Res. 2009 Jan;37(Database issue):D216-23. Epub 2008 Oct 21.

A specialized BLAST service is accessible (Concise Protein BLAST).

Data is available for download via Protein Clusters FTP
 Example Searches

all clusters with ribosomal protein as the curated name

"ribosomal protein"[Protein Name]

all clusters that are encoded by chloroplasts

"source chloroplast"[All Fields]

Check the limits page and the help document for more information.