Thesis (Index)   <-  Sean Forman   <-  You Are Here



Torsion Angle Selection and Emergent Non-local Secondary Structure in Protein Structure Prediction







Sean Forman
Department of Math and Computer Science
Saint Joseph's University



Working with
Alberto Segre, University of Iowa
Yinyu Ye, University of Iowa
Kenneth Murphy, University of Iowa

What use is a genome?

Why do we care about proteins?

What determines a protein's structure?

What do proteins look like?

What is an amino acid's structure?

\begin{figure}\begin{center}\leavevmode
\begin{picture}(0,0)%
\epsfig{file=GRAPHICS/xfig_amino.pstex}%
\end{picture}%%
\end{center}\end{figure}

\includegraphics[height=2.5in]{GRAPHICS/bonding.pstex}<tex2html_comment_mark>32

\begin{displaymath}
\begin{array}{c c}
\multicolumn{1}{l}{\mbox{\bf Alanine - AL...
...udegraphics[height=2.5in]{GRAPHICS/TRP_tri.ps} \\
\end{array}\end{displaymath}

How can the shape of the amino acid change?

Energetics: How does the long chain become a globular molecule?

Effects of hydrogen bonding and secondary structure

Figure: $ \alpha $-helix hydrogen bonding pattern. These images are produced using Rasmol [#!program-rasmol!#].
\includegraphics[width=5.0in]{GRAPHICS/hbonds.ps}

Figure: Hydrogen bonding patterns for parallel and anti-parallel $ \beta $-sheets.
\begin{displaymath}
\begin{array}{c c}
\multicolumn{1}{l}{\mbox{\bf Anti-Paralle...
...egraphics[height=3.0in]{GRAPHICS/paraPattern.pstex}
\end{array}\end{displaymath}

\begin{displaymath}
\begin{array}{c c}
\multicolumn{1}{l}{\mbox{\bf Carbonic Anh...
...cludegraphics[height=3.0in]{GRAPHICS/fancypdb_2.ps}
\end{array}\end{displaymath}

Problem Description

The protein folding problem is defined as the prediction of the protein's three-dimensional (tertiary) structure from its amino acid sequence.

MTYKLILNGKTLKGETTTEAVDAATAEK
VFKQYANDNGVDGEWTYDDATKTFTVTE






\begin{displaymath}
\begin{array}{c c c}
\includegraphics[width=2.5in]{GRAPHICS/...
...includegraphics[width=2.5in]{GRAPHICS/1PGB.ps} \\
\end{array}\end{displaymath}

Current Techniques in Protein Folding

HOPS-Hybrid Optimizer of Protein Structure

Protein Representation

Searching


\begin{picture}(0,0)%
\includegraphics{GRAPHICS/treeInit.pstex}%
\end{picture}
@font
picture(6316,4765)(301,-3962) (281,-3415)(0,0)[lb]AAcid $ L - 1$% (281,-3939)(0,0)[lb]AAcid $ L$% (5026,-3736)(0,0)[lb]at each level% (5026,-3436)(0,0)[lb]$ n_k$ angle choices% (1051,-286)(0,0)[lb]AAcid $ 2$% (1051, 89)(0,0)[lb]AAcid $ 1$% (2851,464)(0,0)[lb] $ (\phi_1,\psi_1)$% (4876,464)(0,0)[lb] $ (\phi_n,\psi_n)$%

Continuous Optimization

Contributions to Energy Function

\epsfig{file=GRAPHICS/1SHG.eps, height=2.5in}<tex2html_comment_mark>63

\epsfig{file=GRAPHICS/1YMB.eps, height=2.5in}<tex2html_comment_mark>64

\epsfig{file=GRAPHICS/2ST1.eps, height=2.5in}<tex2html_comment_mark>65

A Parallel Programming Method: Nagging

Nagging and Protein Folding

Clustering

\begin{displaymath}
\begin{array}{c c}
\multicolumn{1}{l}{\mbox{\bf Glycine }} &...
...ght=2.8in, angle=-90]{GRAPHICS/TRP_graph.ps} \\
\end{array}
\end{displaymath}

Algorithms

Weber's Solution

Multiple Cluster Points

\begin{displaymath}
\begin{array}{c c c}
\includegraphics[width=2.0in]{GRAPHICS/...
...[width=2.0in]{GRAPHICS/cluster_final.xfig.eps} \\
\end{array}\end{displaymath}

Clustering, other issues

Clustering Samples

\begin{displaymath}
\begin{array}{c c}
\multicolumn{1}{l}{\mbox{\bf Threonine}} ...
...ght=3.5in, angle=-90]{GRAPHICS/ASN_graph_3.ps} \\
\end{array}\end{displaymath}

Tweaking

\begin{displaymath}
\begin{array}{l r}
\includegraphics[width=2.5in]{GRAPHICS/st...
...egraphics[width=3.5in]{GRAPHICS/strandsAligned.eps}
\end{array}\end{displaymath}

Global Position Matrix

\begin{multline}
\mathnormal{P} \left(r_X, r_Y, r_Z, l_X, l_Y, l_Z \right) =  ...
...cos r_X \cos r_Y & l_Z \\
0 & 0 & 0 & 1 \\
\end{array}\right)
\end{multline}

Backbone Transform Matrix

$\displaystyle \mathnormal{T} \mathbf{ \left(\tau, \kappa, L \right) } = \left( ...
...\cos \kappa \sin \tau & \cos \tau & 0   0 & 0 & 0 & 1   \end{array} \right)$ (4)

One atom to another

$\displaystyle \mathnormal{P}_{i,\ensuremath{\mathbf{N}}} = \mathnormal{P}_{i - 1,\ensuremath{\mathbf{C'}}} \mathnormal{T}_{i - 1,\omega}$ (5)

A string of atoms. To align two atoms, we adjust intermediate transform matrices.

$\displaystyle \mathnormal{P}_{j,\ensuremath{\mathbf{C^\prime}}} \mathnormal{T}_...
..... \mathnormal{T}_{k-1,\psi} \mathnormal{T}_{k-1,\omega} = \mathnormal{P}_{k,N}$ (6)

$\displaystyle \mathnormal{T}_{j,\omega} \mathnormal{T}_{j+1,\phi} \mathnormal{T...
...^{-1} \mathnormal{P}_{k,\ensuremath{\mathbf{N}}} \doteqdot \mathnormal{G_\zeta}$ (7)

Each transform matrix is a function of $ \displaystyle{ \mathnormal{T}_{i,\theta} ( \tau_{i,\theta} + \Delta
\tau_{i,\theta})}$. We want to find $ \{ \Delta
\tau_{i,\theta} \}$ that cause an equality in Equation [*].

Problem Specification

$\displaystyle \operatorname{minimize}$ $\displaystyle \qquad \sum_{(i,\theta) = (j+1,\phi) }^{ (k-1,\omega)} \Vert \Delta \tau_{i,\theta} \Vert _2$ (8)
$\displaystyle \operatorname{subject to}$ $\displaystyle \qquad \mathnormal{T}_{j,\omega} ( \tau_{j,\omega}) \prod_{(i,\th...
...eta} ( \tau_{i,\theta} + \Delta \tau_{i,\theta}) \doteqdot \mathnormal{G_\zeta}$    

or restated using vector notation as

$\displaystyle \operatorname{minimize}$ $\displaystyle \qquad \Vert \boldsymbol{\Delta \tau} \Vert _2$ (9)
$\displaystyle \operatorname{subject to}$ $\displaystyle \qquad \mathbf{T} ( \boldsymbol{\tau + \Delta \tau}) \doteqdot \mathnormal{G_\zeta}$ (10)

Solving the System of Equations

$\displaystyle \mathbf{I}\boldsymbol{\Delta \tau}_{i + 1}$ $\displaystyle \quad-$ $\displaystyle \quad\ensuremath{\frac{d\mathbf{T}(\boldsymbol{\tau_{i}})}{d\boldsymbol{\tau}}}^\mathnormal{T} \boldsymbol{\lambda}$ $\displaystyle \quad=$ $\displaystyle \quad\mathbf{0}$    
$\displaystyle \ensuremath{\frac{d\mathbf{T}(\boldsymbol{\tau_{i}})}{d\boldsymbol{\tau}}}\boldsymbol{\Delta \tau}_{i + 1}$ $\displaystyle \quad-$ $\displaystyle \quad\boldsymbol{0 \lambda}$ $\displaystyle \quad=$ $\displaystyle \quad\mathnormal{G_\zeta} - \mathbf{T} (\boldsymbol{\tau}_{i}) + ...
...hbf{T}(\boldsymbol{\tau_{i}})}{d\boldsymbol{\tau}}}\boldsymbol{\Delta \tau_{i}}$    

Tweaking, other issues

\begin{displaymath}
\begin{array}{c c}
\multicolumn{1}{l}{\mbox{\bf Tweaked \mbo...
...6}} &
\multicolumn{1}{l}{\mbox{\bf Score: 75736}}
\end{array}\end{displaymath}

\includegraphics[width=3.0in]{GRAPHICS/tweak_soln.ps}

\begin{displaymath}
\begin{array}{c c c}
\includegraphics[width=2.5in]{GRAPHICS/...
...lumn{1}{c}{\mbox{\bf Prediction Score: 6941 }} \\
\end{array}\end{displaymath}

Future Work

No References!

About this document ...

This document was generated using the LaTeX2HTML translator Version 99.2beta8 (1.46)

Copyright © 1993, 1994, 1995, 1996, Nikos Drakos, Computer Based Learning Unit, University of Leeds.
Copyright © 1997, 1998, 1999, Ross Moore, Mathematics Department, Macquarie University, Sydney.

The command line arguments were:
latex2html -split 8 -dir /home/sforman/thesis-html/defense defense

The translation was initiated by Sean Forman on 2001-11-21



next_inactive up previous
sforman@sju.edu