Loading...
HomeMy WebLinkAboutDEQ Comment Response on STATS TM 012020171/20/2017 1 The following informal comments regarding the draft Technical Memorandum, Statistical Methods for Developing Reference Background Concentrations for Groundwater at Coal Ash Facilities, HDR, October 2016 were provided by DEQ through email from Steve Lanter dated 11/22/2016. Responses prepared by Duke Energy consultants HDR and SynTerra are noted below each comment. DEQ Comment #1 Ensure that HDR and SynTerra use the exact same technical approach to determine preliminary provisional background threshold values (PPBTVs) for groundwater. Duke Response #1 HDR and SynTerra will use the same technical approach to determine PPBTVs for groundwater. The draft version of the TM Statistical Methods for Developing Reference Background Concentrations for Groundwater at Coal Ash Facilities was developed jointly by HDR and SynTerra and the revised version was jointly developed by HDR and SynTerra but was developed to be a guidance document issued by Duke to be used to develop PPBTVs for groundwater and soil (Statistical Methods for Developing Reference Background Concentrations for Groundwater and Soil at Coal Ash Facilities, January 2017- DRAFT). Although the same technical approach will be used by both HDR and SynTerra, due to site differences, the methodology was developed to provide for technical flexibility if needed due to site differences. DEQ Comment #2 Low probability, extreme upper tail outliers should not be used to develop reference background concentrations. As ProUCL-V5.0 Technical Guide (page 17, paragraph 1) indicates, since the presence of outliers in a data set tends to yield distorted values of decision making statistics, elevated outliers should not be included in background data sets and estimation of background threshold values. The objective here is to compute background statistics based upon the majority of the data set representing the main dominant background population and not to accommodate a few low probability high outliers that may also be present in the background data set. Duke Response #2 Part II of the methodology describes the process that will be used to identify outliers for groundwater results. While there are several tests available to test for possible outliers, Dixon’s or Rosner’s Outlier tests are specifically identified in the Unified Guidance (USEPA 2009) for providing requisite statistical strength and power necessary to appropriately identify potential outliers. Additionally, visual methods such as Q-Q plots and box plots will be utilized in the identification of outliers. Statistical outliers will be tabulated as part of the background determination report. If statistical outliers have been detected, the project scientist will review the outliers to determine if the values should be removed or if they are representative of background and should be retained for statistical analysis. Reasons as to why a particular statistical outlier should be included or excluded from the background dataset will be documented as part of the final background determination report. 1/20/2017 2 DEQ Comment #3 Duke should adopt and use the protocols (“rules”) that were internally agreed upon by Division staff (no samples with pH > 8.5, no samples with turbidity > 10 NTUs, and no samples with detection or reporting limits that are higher the applicable standard). Duke Response #3 Unless site specific conditions exist that dictate exceptions, Duke will use the following protocol in determining if groundwater sample results are to be considered valid for determining PPBTVs: • Sample pH is less than or equal to 8.5 standard units (S.U.); • Sample turbidity is less than or equal to 10.0 Nephelometric Turbidity Units (NTUs); and • Result is not a statistical outlier identified for background sample data. Professional judgment can be used to retain data that does not meet these criteria (such as pH); however, the decision to retain data that does not satisfy these criteria must be documented. Should sample results with detection or reporting limits greater than the applicable standard impact the data pool such that sufficient data is not available for producing PPBTVs, NCDEQ will be consulted to discuss alternative evaluation options for the particular constituents, such as modeling. DEQ Comment #4 Because it is difficult to know how many comparisons are going to be made to background (that is, is difficult to know the number of down- and side-gradient wells and sample events that will be compared to the computed background concentration), it is recommended that prediction limits not be used since that knowledge is needed to calculate the upper prediction limit. Instead, it is recommended that a 95-95 UTL be used. The log normal distribution should not be used to compute the UTL unless the data are not heavily skewed and the standard deviation is <= 1. Preference should be given to a gamma distribution over that of the log normal distribution as discussed on pages 22-23. Duke Response #4 As described in Part IV of the methodology, the upper tolerance limit (UTL) will be used to develop the PPBTVs. As described in the ProUCL guidance document (Section 3.4), the UTL95-95 is designed to provide coverage for 95 percent of all observations from the background population with a confidence coefficient (CC) of 0.95. A UTL95-95 will be exceeded by all (current and future) values coming from the background population less than 5 percent of the time with a CC of 0.95, that is 5 exceedances per 100 comparisons (of background values) can result just by chance for an overall CC of 0.95. Unlike a UPL95, a UTL95-95 can be used when many, or unknown number of current or future onsite observations need to be compared with a BTV. The actual confidence coefficient for a given sample population is determined by the sample size and the order statistic. For sample sizes as low as 11 samples, the coverage limit may be 85 percent while maintaining a confidence limit of 95 percent. Part IV of the methodology describes the general approach to be utilized when samples sizes are not sufficient to calculate UTL95-95 values. For constituents with insufficient data (i.e., <8 samples), continued sampling of background wells will be performed until sufficient sample size is achieved. 1/20/2017 3 As described on the referenced pages, preference will be given to the gamma distribution over the log normal distribution, with acknowledgement that in the end, the goal is to compute the correct statistics of practical merit representing the data set. As additional data is collected over time, the background data set distribution and UTL will be re-evaluated. DEQ Comment #5 If the UPL method is employed, many comparisons may need to be made in order to make a determination of PPBTVs. Since PPBTVs will be determined with limited data sets, this approach may be difficult to defend. Duke Response #5 As noted in Duke Response #4, PPBTVs will be calculated using UTLs. DEQ Comment #6 As a UPL based upon the Chebyshev Inequality tends to yield higher estimates of BTVs than other methods, HDR proposed to use Chebyshev’s non-parametric estimate at a lower confidence coefficient, 85% or 90% (page 23). If the UPL approach is used, please specify when 85% is used and when 90% is considered. Otherwise, to be conservative, 85% should be considered. Duke Response #6 As noted in Duke Response #4, PPBTVs will be calculated using UTLs. DEQ Comment #7 Page 2. “If the concentration in the future sample is higher than the estimated UPL95, then steps are taken to validate the observation by means of quality assurance checks and/or by re-sampling and/or to investigate for possible natural or anthropogenic reasons for observing a higher concentration than expected.” This implies that we are in a “detection monitoring” stage, which we are not. We are in an “assessment monitoring” stage in which down- and side-gradient wells are compared to 2L or background to map the extent of contaminated groundwater. Duke Response #7 The methodology was developed to provide guidance on statistical methods to use used in calculating PPBTVs, therefore the referenced text describing re-sampling was removed. DEQ Comment #8 Pooling of background data should not be done across flow systems. That is, shallow, deep, and bedrock wells should not be pooled together for purposes of background determination. Duke Response #8 Duke will not pool background data across groundwater flow layers. Separate PPBTVs will be calculated for the site flow layers (i.e., shallow, transition zone, deep or bedrock, or others as appropriate by site). There may be specific sites where geologic and geochemical conditions vary by depth within a flow layer and these conditions may warrant the calculation of separate PPBTVs within a particular flow layer. 1/20/2017 4 For example, at Cape Fear, background well results for the shallow (surficial) and bedrock flow zones show geochemical differences within the flow zones. In the surficial zone at Cape Fear, differences are observed in groundwater geochemistry between the upper and lower surficial zone. Differences in background groundwater geochemistry are also observed in the bedrock layer at Cape Fear between wells screened in sandstone and those screened in mudstone. For those cases, separate PPBTVs may be calculated for separate conditions within those flow layers. DEQ Comment #9 Page 3. “If distinct sub-groups exist as which may be the case across different hydrostratigraphic units, HDR and SynTerra will develop separate background concentrations for each distinct sub- group.” Since background determinations are needed to continue and complete assessments and corrective action plans, sufficient quantities of data for these sub-group tests generally will not be available. Sub-group dataset sizes of generally 8 to 10 are needed to conduct the sub-group tests which include the testing of seasonal effects, etc. Note that on page 15 HDR recommends that sample sizes per sub-group should be at least 30. In many or most cases this would result in significant delays in the background determinations that are not feasible given CAMA deadlines. The Division reserves the right to re-visit sub-group testing once adequate data are available. Duke Response #9 Sub-group analysis will be performed as needed, where sufficient datasets of at least eight samples are available for each potential sub-group . DEQ Comment #10 Under no circumstances should a maximum detection limit be used to establish a background value. Duke Response #10 PPBTVs will be based on UTLs and not based on maximum detection values. DEQ Comment #11 In the meeting, it was agreed that Duke Energy would email the Regional Hydrogeologists a spreadsheet for each facility that contains all background well data and the actual culled data (see comment 2 and 3 above) that they plan to use for background determinations. The Regional Hydrogeologists would then review the dataset to ensure that the Division and Duke Energy are using the same data for the background determinations. Duke Response #11 Duke has provided comprehensive summary spreadsheets of the background groundwater data set by flow layer to DEQ.