SysImm Logo

DASH

Database of Aligned Structural Homologs



What is DASH?

DASH is a database of structural alignments for all known structurally homologous protein domains and chains in the PDB.

The processing involves (a) clustering sequence-unique proteins from the PDB using CD-HIT at 99% sequence identity; (b) decomposing the sequence representatives into domains using Protein Domain Parser (Alexandrov N, Shindyalov I; Bioinformatics 2003); (c) aligning all domains against all domains on Google Cloud using RASH (Standley DM, Toh H, Nakamura H; BMC Bioinformatics 2007); (d) building composite chain alignments from individual domain alignments. (Rozewicki, et al.; Nucleic Acids Research 2019 [PubMed])

Database Information


Search for Alignments

By PDB ID:

Individual chains or domains can be searched for by separating them with '_'. Examples: 5VZ0_A -or- 5VZ0_A_01

By Single Sequence:

MAFFT-DASH


REST API


Upcoming Feature Roadmap


FASTA Databases


Browser Compatibility



Bug Reporting & Feedback


Powered by CD-HIT, CentOS, DSSP, Go, Google Cloud, Molmil, MSAViewer, NCBI BLAST+, and PostgreSQL.
© 2019 Department of Genome Informatics; Research Institute for Microbial Diseases; Osaka University