05 OSCReadiness2008 FIELDS Data Analysis

download 05 OSCReadiness2008 FIELDS Data Analysis

of 67

Transcript of 05 OSCReadiness2008 FIELDS Data Analysis

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    1/67

    Data Analysis

    John Bing-CanarFIELDS Group

    U.S. EPA, Region 5, Superfund

    [email protected]

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    2/67

    Data Analysis topics: overview data requirements data processing statistical analysis secondary sampling cleanup objectives

    software and resources case studies

    Outline

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    3/67

    Stating the obvious:

    Overview

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    4/67

    Why this topic? data, data issues, in-field devices, clean-up

    estimation

    available softwares and where to get them case studies to highlight concepts

    Overview

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    5/67

    Basic data needs

    contaminate concentration(s) locational data (X and Y)

    true coordinates (e.g., UTM)

    latitutde/longitude local coordinates (e.g., northing and easting from one

    corner of site)

    date/time collected

    additional dataelevation (Z)

    Data Requirements

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    6/67

    Exploratory Data Analysis (EDA)

    data error detectioncontaminate, X, Y, Z, date valuesunits (ppm vs. ppb)

    data error detection techniquesdescriptive statistics such as maximum/minimum,

    graphing (boxplots, histograms) negative values, values outside of range (% oxygen > 100)

    mapping (spatial max/min values) are locations far from majority of sample locations?

    outlier tests?not a good idea; high contamination values usually

    indicate a source not an outlier

    Data Processing

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    7/67

    Exploratory Data Analysis (EDA)

    LOD treatment1/2 LOD?

    Duplicates and splits treatments take maximum? take average? take median?

    Summary

    choose a method and document it (transparency)

    Data Processing

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    8/67

    Calibration of field methods with lab methodsGoal: calibration equation, e.g.,

    y = (m)(x) + badjusted value = (slope)(lab value) + intercept

    For example, InnovX XRFadjusted lead = (1.54)(lab value) 14.13Natural log adjusted lead = (1.00)(Natural log of lab

    value) + 0.37

    if lab value = 100ppm, then adjusted lead value = 149.75

    Statistical Analysis

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    9/67

    Calibration of field methods with lab methods problems with a calibration equation, y = (m)(x) + b

    non-linearity

    linear

    curvilinear

    Statistical Analysis

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    10/67

    Calibration of field methods with lab methods problems with a calibration equation, y = (m)(x) + b

    outliers

    Statistical Analysis

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    11/67

    Calibration of field methods with lab methods problems with a calibration equation, y = (m)(x) + b heteroscedasticity of residuals

    Statistical Analysis

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    12/67

    Calibration of field methods with lab methods problems with a calibration equation, y = (m)(x) + b

    heteroscedasticity of residuals

    Statistical Analysis

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    13/67

    Calibration of field methods with lab methods factors to consider when comparing field to lab

    methods:differences in field equipment

    Niton XLp712 with Americium 241 radioactive source versusInnovX 4000 with an X-ray source. (apples to oranges)

    differences in weather conditions weather (dry soil versus wet soil)

    differences in material tested slag or ground slag versus soil residential soil versus cattail marsh (huge differences inorganic matter, i.e., roots) (XRF measures metals in soil notvegetation)

    Statistical Analysis

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    14/67

    Calibration of field methods with lab methods

    basic guide to use when using in-field devices:send a range of samples values to the lab (especially

    want higher values). DO NOT randomly send samplesto the lab, or send every 5th sample to the lab

    try to send at least 20 samples to the lab. (20 labvalues will give the minimum number of values to usefor calibration of field values to lab values.)Method 6200 (SW-846) calls for a minimum of 5% of

    samples sent to the lab.

    Statistical Analysis

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    15/67

    Upper confidence limits use upper confidence limits

    (UCLs) to determine if sitehas met cleanup goals

    EPA-funded software:ProUCL

    Statistical Analysis

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    16/67

    Upper confidence limits

    Statistical Analysis

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    17/67

    Secondary or phased sampling

    given interpretation of previous sampling event,where to continue sampling?

    Secondary Sampling

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    18/67

    Adaptive fill sampling user-specified number of samples in the most poorly

    sampled areas

    Initial locationsSecondary locations

    Kalamazoo River

    Secondary Sampling

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    19/67

    Initial locationsRadial locations

    Radial sampling to meet the need to adequately describe spatial

    variation over distance

    Kalamazoo River

    Secondary Sampling

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    20/67

    Initial locations

    Radial locations

    Radial sampling to meet the need to adequately describe spatial

    variation over distance

    Kalamazoo River

    Secondary Sampling

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    21/67

    Four components:

    2D and 3D data spatial estimation (interpolation)

    identify cleanup areas

    estimate mass/volume

    Cleanup Objectives

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    22/67

    2D and 3D data

    2D data (e.g., mercury levels in sediment surface) 3D data (e.g., downhole gamma logging)

    Cleanup Objectives

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    23/67

    Spatial estimation (interpolation)

    interpolation generates estimates at unsampledlocations

    interpolation

    Scio Pottery

    Cleanup Objectives

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    24/67

    Spatial estimation (interpolation)

    why interpolate?hot spot delineationsmass/volume calculationsdetermine remediation areas

    interpolation

    Cleanup Objectives

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    25/67

    Identify cleanup areas

    goal:average concentration; orany area above a specified value

    Deer Lake

    Cleanup Objectives

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    26/67

    Estimate mass and volume

    what is expected volume of soil (sediment) andmass of contaminant to meet a cleanup goal?

    Deer Lake

    Cleanup Objectives

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    27/67

    Estimate mass and volume

    Deer Lake

    Cleanup Objectives

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    28/67

    Software

    Freeware FIELDS Tools for ArcGIS (http://epa.instepsoftware.com/FIELDS/) F/S Plus [stand-alone] (http://epa.instepsoftware.com/FIELDS/) SADA [stand-alone] (http://www.tiem.utk.edu/~sada/index.shtml)

    GMS [stand-alone] (http://www.ems-i.com/)

    Proprietary Software WinGslib (www.statios.com)

    EVS/MVS (www.ctech.com) SAGE2001 (www.isaaks.com) earthVision (www.dgi.com)

    Software and Resources

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    29/67

    SoftwareFreeware FORMS II Lite (http://www.epa.gov/superfund/programs/clp/f2lite.htm)

    Generates sample labels, bottle tags, and Chain of Custody (COC) forms; Tracks samples from the field to the laboratory; Facilitates electronic capture of sample information into databases; and

    Exports data electronically as .xml, .dbf or .txt files. MARSSIM (http://www.epa.gov/radiation/marssim/index.html)

    The Multi-Agency Radiation Surveys and Site Investigation Manual(MARSSIM) provides detailed guidance for planning, implementing, andevaluating environmental and facility radiological surveys conducted to

    demonstrate compliance with a dose- or risk-based regulation ProUCL (http://www.epa.gov/esd/tsc/software.htm)

    Estimates Upper Confidence Limit (UCL) of the mean using variousparametric and nonparametric methods Statistical methods that can be used to verify the attainment of cleanupstandards

    Software and Resources

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    30/67

    Software

    Freeware VSP (http://dqo.pnl.gov/index.htm)

    Statiscally-based sample design software Supported and financed by the USEPA to meet the DQ) (Data QualityObjectives)

    Proprietary Software TerraSeer (http://www.terraseer.com/index.php)

    Space-Time Intelligence System (STIS) A GIS software that allows for space and time analyses (e.g., clusteranalysis)

    Software and Resources

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    31/67

    End of Talk

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    32/67

    Site Info Management: Decision MakingTools ArcGIS

    Internet Tools Google Earth ArcGIS Explorer

    MS Virtual Earth

    IMAAC

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    33/67

    How Robust are these tools? ArcGIS

    Capable of sophisticated Spatial and Statistical Analysis On the fly Complex Queries Requires extensive knowledge of GIS

    Internet Tools (Google Earth, ArcGIS Explorer, MSVirtual Earth)

    More for Presentation Purposes and general use Less functionality for on the fly or detailed data analysis Designed for non-GIS personnel More extensive analysis would require customized

    programming

    IMAAC Capable of short term, quick response plume mapping More detailed modeling can be conducted

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    34/67

    GIS = Geographic Information Systems

    It is a collection of computer hardware, software, and geographic data for capturing,

    managing, analyzing, and displaying all forms of geographically referenced

    information.

    How does a full-blown GIS software like ArcGIS differ from Google Earth or other

    Internet-based free software?

    ArcGIS can answer the following questions:

    Query and display:Find and display all samples collected in the last week

    Find and display all H2S levels above a criterion

    Find and display the highest Cadmium levels in the top 2 feet of soil

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    35/67

    ArcGIS can answer the following questions:

    Quantification of changes that have occurred over space or time:

    What is the area (yds2) of a proposed clean up?

    How much has the elevation of a landfill decreased (subsided) overtime?

    Where have PCB concentrations increased in the sediment surface from

    last summer?

    What is the mass of Mercury in a proposed removal area?

    Statistical analysis:Is there a statistically significant relationship between residential soil levels for Lead?

    Is there a pattern of Arsenic levels in soil from an airborne release?

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    36/67

    Case StudyNicor Gas Company Response, Park Ridge, IL

    In June 2007, Nicor Gas company (Nicor) discovered PCBs in gas

    meters at four homes in Park Ridge, IL. Nicor cleaned up thehomes and contacted U.S. EPA, whose inspectors then performedfollow-up testing of indoor air, soil, and hard surfaces. In July andAugust, further sampling of other homes in Park Ridge wasconducted in order to determine the extent of contaminated gasmeters.

    In July and August, further sampling of other homes in Park Ridgewas conducted in order to determine the extent of contaminatedgas meters. In total, more than 140 homes were sampled. Nicor,U.S. EPA Region 5 Land and Chemicals Division (LCD), U.S. EPARegion 5 Superfund Emergency Response Branch (ERB), and

    Illinois Environmental Protection Agency (IEPA) contributed to thesampling effort.

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    37/67

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    38/67

    Following the production of thesemaps, new sample data was

    received and added to thedatabase. In preparation foranother site meeting, additionaldata queries were created usingArcMap. For example, querieswere generated that showed theresults data and locations of theoriginal four sampled homes, allcommercial sampled properties,

    and properties at which detectedPCBs were found concurrently inmeter, air, and wipe samples.

    PCB was detected in source samples. Sourcesamples were from gas meters, regulators, and driplegs

    Nicor Gas Company Response

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    39/67

    Decision Making Tools: Internet Tools

    Google Earth

    ArcGIS Explorer

    MS Virtual Earth

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    40/67

    Google Earth Utilizes Google search (address) with satellite imagery, maps, and terrain. Free and license versions with various functionality

    Google Earth (free) Earth Plus ($20)

    Import GPS data

    Earth Pro ($400)-EPA OSCs all have a 1 year license Increased functionality and import/export capabilities Allows GIS coverages to be read directly

    Enterprise

    Limitations Requires Internet Aerials cant be downloaded and used somewhere else. Limited cartographic features Limited GIS functionality

    Advantages Simple Fast Maintained

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    41/67

    Ohio River Tools

    Sample Designs and navigation Map generation with save

    Data saved as ArcView shapefile, DBF,and AutoCAD DXF

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    42/67

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    43/67

    Query Excel Spreadsheet

    Queried data can be saved and exported to

    a csv and kml format

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    44/67

    Google KML Display

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    45/67

    Outfall Point Link

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    46/67

    Kerr-McGee

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    47/67

    ArcGIS Explorer

    Layers Search (address) Imagery Topographic maps historical maps street maps

    Free Pros

    Simple Fast Maintained

    Connect to local data and servicesfrom ArcGIS Server

    Cons Requires Internet Limited GIS functionality

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    48/67

    MW FloodEOC staffed for a week on Midwest

    Floods including GIS:1. Incident coordinates and information reported

    and brought into ARCGIS Explorer

    2. Used ArcGIS Explorer 9.2 and added incidentlabels and coordinates rapidly along with SFsites, flood plains, etc.

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    49/67

    NRC Locations of Interest NPL Locations of Interest

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    50/67

    FEMA 500 Year Flood Plain FEMA 100 Year Flood Plain

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    51/67

    FEMA 100 Year Flood Plain

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    52/67

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    53/67

    MS Virtual Earth

    Build/Develop web page on top of their platform Free next 2 years for EPA Flexibility Pros

    Flexibility and customizability Access all on the web

    Cons Not an out of the box tool Web Security In order to make dynamic need to setup/maintain

    database

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    54/67

    Virtual Earth at EPA

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    55/67

    Virtual Earth at EPA

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    56/67

    The IMAAC (Interagency Modeling andAtmospheric Assessment Center) providesatmospheric hazards predictions in support ofFederal agencies responding to incidents ofairborne releases with national significance.

    The IMAAC leverages existing Federalcapabilities and is responsible for providing

    accurate, reliable estimates of predicted hazardareas, with associated concentrations, thatserve as the foundation for decisions by theauthorized emergency managers.

    What is IMAAC?

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    57/67

    The NARAC (National Atmospheric Release Advisory

    Center), located at the University of CaliforniasLawrence Livermore National Laboratory, has beendesignated as the primary interim provider of IMAACcapabilities and is currently supporting hundreds ofDepartment of Homeland Security stakeholders inaddition to its traditional suite of customers and users

    NARAC is a distributed system, providing modeling andgeographical information tools for deployment to an end

    user's computer system as well as real-time access toglobal meteorological and geographical databases andadvanced three-dimensional model predictions from thenational center

    What is NARAC?

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    58/67

    NARAC Features NARAC provides a suite of multi-scale (local-, regional-,

    continental- and global-scale) atmospheric flow and

    dispersion models for a wide range of hazards. Some ofthe key features of this modeling system are: Automated, and validated real-time 3-D centralized

    modeling system at NARAC that simulates complex

    wind flows, detailed particle dispersion, wet and drydeposition on multiple spatial scales: Local-scale and regional-scale meteorological forecast and

    dispersion models Long-range meteorological forecast and dispersion models

    Models re-locatable anywhere in the world in real-time Nuclear explosion fallout model Fast-running, deployable local-scale dispersion model

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    59/67

    The IMAAC/NARAC supports customers

    through several channels: Direct telephone calls to expert operations

    staff

    Internet interface (NARAC Web) to a high-performance computing center

    Internet-based remote access softwareinstalled on a customer's local computer.

    Types of NARAC Support

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    60/67

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    61/67

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    62/67

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    63/67

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    64/67

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    65/67

    Ardent SentryDatabase Query based on NARAC Plume

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    66/67

  • 8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis

    67/67

    Contact Information

    NARAC - http://narac.llnl.govCustomer SupportEmergency Only: 925-424-6465 (24x7)Non-emergency: 935-424-2722 (daytime)e-mail: [email protected] request account:

    https://naracwebx2.llnl.gov/NaracWeb/jsp/RequestAccount.jsp

    IMAAC Web

    http://imaacweb.llnl.gov925-422-9159 7:30 am - 4:15 pm (PT), M-F925-422-7627

    IMAAC emergency contact dispatcher 925-422-9100 (24x7)