Compute Dictionary of protein secondary structure (DSSP) secondary structure assignments
Parameters: | traj : md.Trajectory
simplified : bool, default=True
|
---|---|
Returns: | assignments : np.ndarray, shape=(n_frames, n_residues), dtype=S1
|
Notes
The DSSP assignment codes are:
- ‘H’ : Alpha helix
- ‘B’ : Residue in isolated beta-bridge
- ‘E’ : Extended strand, participates in beta ladder
- ‘G’ : 3-helix (3/10 helix)
- ‘I’ : 5 helix (pi helix)
- ‘T’ : hydrogen bonded turn
- ‘S’ : bend
- ‘ ‘ : Loops and irregular elements
The simplified DSSP codes are:
- ‘H’ : Helix. Either of the ‘H’, ‘G’, or ‘I’ codes.
- ‘E’ : Strand. Either of the ‘E’, or ‘B’ codes.
- ‘C’ : Coil. Either of the ‘T’, ‘S’ or ‘ ‘ codes.
A special ‘NA’ code will be assigned to each ‘residue’ in the topology which isn’t actually a protein residue (does not contain atoms with the names ‘CA’, ‘N’, ‘C’, ‘O’), such as water molecules that are listed as ‘residue’s in the topology.
Our implementation is based on DSSP-2.2.0, written by Maarten L. Hekkelman and distributed under the Boost Software license.
References
[R30] | Kabsch W, Sander C (1983). “Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features”. Biopolymers 22 (12): 2577-637. dio:10.1002/bip.360221211 |