|
Journal of Computational Biology
Hypergraph Model of Multi-Residue Interactions in Proteins: Sequentially-Constrained Partitioning Algorithms for Optimization of Site-Directed Protein Recombination
To cite this paper:
Xiaoduan Ye, Alan M. Friedman, Chris Bailey-Kellogg.
Journal of Computational Biology.
July 1, 2007,
14(6): 777-790.
doi:10.1089/cmb.2007.R016.
Xiaoduan Ye Department of Computer Science, Dartmouth College, Hanover, New Hampshire. Alan M. Friedman Markey Center for Structural Biology, Department of Biological Sciences and Purdue Cancer Center, Purdue University, West Lafayette, Indiana. Chris Bailey-Kellogg Department of Computer Science, Dartmouth College, Hanover, New Hampshire. Relationships among amino acids determine stability and function and are also constrained by evolutionary history. We develop a probabilistic hypergraph model of residue relationships that generalizes traditional pairwise contact potentials to account for the statistics of multi-residue interactions. Using this model, we detected non-random associations in protein families and in the protein database. We also use this model in optimizing site-directed recombination experiments to preserve significant interactions and thereby increase the frequency of generating useful recombinants. We formulate the optimization as a sequentially-constrained hypergraph partitioning problem; the quality of recombinant libraries with respect to a set of breakpoints is characterized by the total perturbation to edge weights. We prove this problem to be NP-hard in general, but develop exact and heuristic polynomial-time algorithms for a number of important cases. Application to the beta-lactamase family demonstrates the utility of our algorithms in planning site-directed recombination. 
|
|