JChemTidy:  A Tool for Converting Chemical Web Document Collections to an XHTML Representation

Georgios V. Gkoutos, Philip R. Kenway, and Henry S. Rzepa*
Department of Chemistry, Imperial College of Science, Technology and Medicine, London, SW7 2AY. Merck Sharp and Dohme Research Laboratories, Neuroscience Research Centre, Terlings Park, Harlow, Essex, CM20 2QR.
J. Chem. Inf. Comput. Sci., 2001, 41 (2), pp 253–258
DOI: 10.1021/ci000396y
Publication Date (Web): March 26, 2001
Copyright © 2001 American Chemical Society
*

In papers with more than one author, the asterisk indicates the name of the author to whom inquiries about the paper should be addressed.

Web Enhanced Object

Abstract

A robot-based procedure is described for traversing a collection of hyperlinked documents written in HTML and converting these to the XML-compliant and well-formed XHTML representation. Transcluded chemical content invoked using <embed> or <applet> HTML calls are converted to the XHTML recommended <object> form. Additional attributes such as title or derived chemical attributes such as a SMILES descriptor are added to improve the indexing of the resulting document collection. Conformance tests for the popular Web browsers are reported.

Tools

SciFinder Links

SciFinder subscribers:  Click to sign in | Not a SciFinder subscriber? Learn more at www.cas.org

History

  • Published In Issue March 26, 2001
  • Received July 26, 2000

Recommend & Share

Related Content

Other ACS content by these authors: