Environ. Sci. Technol., 42 (10), 37323738, 2008. 10.1021/es071301c
Web Release Date: April 16, 2008

This article not subject to U.S. Copyright. Published 2008 by the Americal Chemical Society.

Evaluation of Statistical Treatments of Left-Censored Environmental Data using Coincident Uncensored Data Sets: I. Summary Statistics

Ronald C. Antweiler* and Howard E. Taylor

U.S. Geological Survey, 3215 Marine Street, Boulder, Colorado 80303

Received for review June 1, 2007

Revised manuscript received February 25, 2008

Accepted February 29, 2008

Abstract:

The main classes of statistical treatment of below-detection limit (left-censored) environmental data for the determination of basic statistics that have been used in the literature are substitution methods, maximum likelihood, regression on order statistics (ROS), and nonparametric techniques. These treatments, along with using all instrument-generated data (even those below detection), were evaluated by examining data sets in which the true values of the censored data were known. It was found that for data sets with less than 70% censored data, the best technique overall for determination of summary statistics was the nonparametric Kaplan–Meier technique. ROS and the two substitution methods of assigning one-half the detection limit value to censored data or assigning a random number between zero and the detection limit to censored data were adequate alternatives. The use of these two substitution methods, however, requires a thorough understanding of how the laboratory censored the data. The technique of employing all instrument-generated dataincluding numbers below the detection limitwas found to be less adequate than the above techniques. At high degrees of censoring (greater than 70% censored data), no technique provided good estimates of summary statistics. Maximum likelihood techniques were found to be far inferior to all other treatments except substituting zero or the detection limit value to censored data.

Download the full text: PDF | HTML