Genomic Data Analysis Using DNA Structure:  An Analysis of Conserved Nongenic Sequences and Ultraconserved Elements

Eleanor J. Gardiner,* Linda Hirons, Christopher A. Hunter, and Peter Willett
Centre for Chemical Biology, Krebs Institute for Biomolecular Science, Department of Chemistry, University of Sheffield, Sheffield S3 7HF, United Kingdom, and Department of Information Studies, University of Sheffield, Sheffield S1 4DP, United Kingdom
J. Chem. Inf. Model., 2006, 46 (2), pp 753–761
DOI: 10.1021/ci050384i
Publication Date (Web): January 6, 2006
Copyright © 2006 American Chemical Society
*

 Corresponding author. Tel.:  (+44) 0114 2222674; fax:  (+44) 0114 278 0300; e-mail:  e.gardiner@sheffield.ac.uk.

,

 Department of Chemistry, University of Sheffield.

,

 Department of Information Studies, University of Sheffield.

Abstract

Recent comparative studies of the human and mouse genomes have revealed sets of conserved nongenic sequences (CNGs) and sets of ultraconserved elements (UCEs). Both sets of sequences, which exhibit extremely high levels of conservation, extend over hundreds of bases and have no known function. Since there is no detectable sequence homology between paralogous CNGs or UCEs in either of the species, an alignment-free technique is needed for their analysis. We have previously compiled a database of the structural properties of all 32 896 unique DNA octamers, including information on stability, the minimum energy conformation, and flexibility. We have used Fourier techniques to analyze the UCEs and CNGs in terms of their octamer structural properties, to reveal structural correlations which may indicate possible functions for some of these sequences.

Tools

SciFinder Links

SciFinder subscribers:  Click to sign in | Not a SciFinder subscriber? Learn more at www.cas.org

History

  • Published In Issue March 27, 2006
  • Received September 9, 2005

Recommend & Share

Related Content

Other ACS content by these authors: