Saturday, November 17, 2012

Disclaimer: Even Incomplete Data Are Useful

On Wednesday, SkyTruth released a database created from over 27,000 "chemical disclosure reports" voluntarily submitted by industry to FracFocus.org for gas and oil wells fracked between January 2011 and August 2012. But as a few folks have pointed out, there are some obvious issues with this database: the data hardly represent all of the wells drilled; roughly 21% of the records are concealed as proprietary or "trade secret;" and the data include a wide variety of mistakes, from spelling errors to invalid Chemical Abstract Service (CAS) numbers. 

So why did we publish this database?

In the news release we said, "SkyTruth hopes this data release will promote discussion about effective disclosure, and credible research on this nationally significant issue."

Our objective was to capture the FracFocus dataset as is - warts and all - in hopes that other enterprising researchers and analysts would be able to help clean it up and produce useful analyses to inform the public. Many have complained the data don't rise to the standard of adequate public disclosure (see this article from Bloomberg News).

Like a broken clock, bad data can be fixed.
However, we have to know exactly how it is

broken in order to make it useful.

Photo: Wikimedia Commons, Audrius Meskauskas
By making the data available, these shortcomings can now be systematically analyzed and quantified by anyone with the skills and interest to do so. We think this is tremendously important, because some states have already designated FracFocus as the place for public disclosure of the chemicals used in fracking; and the federal government may be on the brink of doing the same for millions of acres of public land. 

The database has already helped us produce useful information about disclosure rates in West Virginia and Pennsylvania, the large amount of water used in the fracking process (relative to Niagara Falls and Central Park in New York City), and even the continued use of diesel fuels for fracking in spite of the Safe Drinking Water Act. We expect that other NGOs, academics, the general public, and regulators can use the database to calculate disclosure rates in other states, compare the data with public health information, analyze trends in the use of certain chemicals geographically or by company, and much more.

The FracFocus data give us only a partial picture of the chemicals used in fracking.  We hope this data release will help inform the discussion about what true transparency should look like -- and stimulate real, concrete action by government and industry to meet that high standard.


2 comments:

  1. Having worked with a partial constellation of imaging satellites for most of my professional life, I'm well aware of the tendency for data to be incomplete; your headline strikes home the point that without some data, we can't do anything but speculate about the problems we face.

    Well done, SkyTruth.

    ReplyDelete
  2. Thanks Thom! This is a case where some is better than none; but some isn't nearly good enough to qualify as adequate public disclosure. Industry can and should do a whole lot better. Regulators need to step it up.

    ReplyDelete