Foreign Language Translation of Chemical Nomenclature by Computer

Roger Sayle*
OpenEye Scientific Software, Santa Fe, New Mexico 87508
J. Chem. Inf. Model., 2009, 49 (3), pp 519–530
DOI: 10.1021/ci800243w
Publication Date (Web): February 24, 2009
Copyright © 2009 American Chemical Society
* Corresponding author e-mail: roger@eyesopen.com.
ACS AuthorChoice

Abstract

Abstract Image

Chemical compound names remain the primary method for conveying molecular structures between chemists and researchers. In research articles, patents, chemical catalogues, government legislation, and textbooks, the use of IUPAC and traditional compound names is universal, despite efforts to introduce more machine-friendly representations such as identifiers and line notations. Fortunately, advances in computing power now allow chemical names to be parsed and generated (read and written) with almost the same ease as conventional connection tables. A significant complication, however, is that although the vast majority of chemistry uses English nomenclature, a significant fraction is in other languages. This complicates the task of filing and analyzing chemical patents, purchasing from compound vendors, and text mining research articles or Web pages. We describe some issues with manipulating chemical names in various languages, including British, American, German, Japanese, Chinese, Spanish, Swedish, Polish, and Hungarian, and describe the current state-of-the-art in software tools to simplify the process.

Citing Articles

View all 1 citing articles

Citation data is made available by participants in CrossRef's Cited-by Linking service. For a more comprehensive list of citations to this article, users are encouraged to perform a search in SciFinder.

This article has been cited by 1 ACS Journal articles (1 most recent appear below).

  • Cover Image

    Improved Chemical Text Mining of Patents with Infinite Dictionaries and Automatic Spelling Correction

    Roger Sayle, Paul Hongxing Xie, and Sorel Muresan
    Journal of Chemical Information and Modeling2012 52 (1), 51-62
    • Improved Chemical Text Mining of Patents with Infinite Dictionaries and Automatic Spelling Correction

      Roger Sayle, Paul Hongxing Xie, and Sorel Muresan
      Journal of Chemical Information and Modeling2012 52 (1), 51-62

      The text mining of patents of pharmaceutical interest poses a number of unique challenges not encountered in other fields of text mining. Unlike fields, such as bioinformatics, where the number of terms of interest is enumerable and essentially static, ...

Tools

SciFinder Links

SciFinder subscribers:  Click to sign in | Not a SciFinder subscriber? Learn more at www.cas.org

Explore by:


History

  • Published In Issue March 23, 2009
  • Article ASAPFebruary 24, 2009
  • Received: July 21, 2008

Recommend & Share

  • Share on ACS NetworkACS Network
  • Add to FacebookFacebook
  • Tweet ThisTweet This
  • Add to CiteULikeCiteULike
  • Add to NewsvineNewsvine
  • Digg ThisDigg This
  • Add to DeliciousDelicious

Related Content

Other ACS content by these authors: