Molecular Query Language (MQL)A Context-Free Grammar for Substructure Matching

Ewgenij Proschak,* Jörg K. Wegner, Andreas Schüller, Gisbert Schneider, and Uli Fechner
Johann Wolfgang Goethe-University, Institute of Organic Chemistry and Chemical Biology, Chair for Chem- and Bioinformatics, Siesmayerstr. 70, D-60323 Frankfurt am Main, Germany, and Tibotec BVBA, Generaal De Wittelaan L11 B3, 2800 Mechelen, Belgium
J. Chem. Inf. Model., 2007, 47 (2), pp 295–301
DOI: 10.1021/ci600305h
Publication Date (Web): January 11, 2007
Copyright © 2007 American Chemical Society
*

 Corresponding author tel:  +49-69 798 24878; fax:  +49-69 798 24880; e-mail:  proschak@bioinformatik.uni-frankfurt.de.

,

 Johann Wolfgang Goethe-University.

,

 Tibotec BVBA.

Abstract

Abstract Image

We have developed a Java library for substructure matching that features easy-to-read syntax and extensibility. This molecular query language (MQL) is grounded on a context-free grammar, which allows for straightforward modification and extension. The formal description of MQL is provided in this paper. Molecule primitives are atoms, bonds, properties, branching, and rings. User-defined features can be added via a Java interface. In MQL, molecules are represented as graphs. Substructure matching was implemented using the Ullmann algorithm because of favorable run-time performance. The Ullmann algorithm carries out a fast subgraph isomorphism search by combining backtracking with effective forward checking. MQL software design was driven by the aim to facilitate the use of various cheminformatics toolkits. Two Java interfaces provide a bridge from our MQL package to an external toolkit:  the first one provides the matching rules for every feature of a particular toolkit; the second one converts the found match from the internal format of MQL to the format of the external toolkit. We already implemented these interfaces for the Chemistry Development Toolkit.

Tools

SciFinder Links

SciFinder subscribers:  Click to sign in | Not a SciFinder subscriber? Learn more at www.cas.org

History

  • Published In Issue March 26, 2007
  • Received July 19, 2006

Recommend & Share

Related Content

Other ACS content by these authors: