Loop modeling in YASARA

YASARA Structure features a loop modeler, that can quickly retrieve loop conformations that bridge two sets of anchor residues from the Protein Data Bank. This knowledge-based approach benefits enormously from the steadily growing PDB, and has been shown to be a strong competition for ab-initio loop prediction[1].
- A non-redundant subset of the PDB (90% sequence identity cutoff[2]) forms the database from which the loop conformations can be retrieved. Currently the database contains ~14000 complete protein chains.
- Since complete chains are stored in the database, queries are not limited to loops. It is easily possible to extract e.g. all fragments spanning two selected anchor points that contain 10 helical residues in the middle.
- For loops up to 18 residues long, a database index allows to retrieve the hits within a fraction of a second. Longer loops take around a minute to model.
- The loop conformations extracted from the database are ranked by a combined quality score, that considers sequence similarity, bumps with the rest of the structure, and the fit to the terminal loop anchor points.
- Since loops extracted from a database do normally not fit the anchor points exactly, they are closed using cyclic coordinate descent, an algorithm developed to steer robotic arms[3].
- To identify the best conformation of potentially hundreds of database hits, YASARA analyzes the loops sequentially, optimizing the side-chains of loop- and surrounding residues[4], and calculating interaction and solvation energies, including knowledge based dihedral and packing components.
R E F E R E N C E S
[1] Loops In Proteins (LIP) - a comprehensive loop database for homology modelling
Michalsky E, Goede A and Preissner R (2003) Protein Engineering 16, 979-985
[2] PISCES: a protein sequence culling server
Wang G and Dunbrack RL Jr. (2003) Bioinformatics 19, 1589-1591
[3] Cyclic coordinate descent: A robotics algorithm for protein loop closure
Canutescu AA and Dunbrack RL Jr. (2003)Protein Sci. 12, 963-972
[4] A graph-theory algorithm for rapid protein side-chain prediction
Canutescu AA, Shelenkov AA and Dunbrack RL Jr. (2003), Protein Sci. 12,2001-2014.