--------------------------------------------------------------------- Atmospheric Non Methane Hydrocarbons Mole Fractions from the NOAA GML Carbon Cycle Surface Network Version: 2021-05-04 -------------------------------------------------------------------- CONTENTS 1. Data source and contacts 2. Use of data 2.1 Citation 3. Reciprocity 4. Warnings 5. Update notes 6. Introduction 7. DATA - General Comments 7.1 DATA - Sampling Locations 7.2 DATA - File Name Description 7.3 DATA - Event with single parameter 7.4 DATA - Event with multiple parameters 7.5 DATA - QC Flags 8. Data retrieval 9. References -------------------------------------------------------------------- 1. DATA SOURCE AND CONTACTS National Oceanic and Atmospheric Administration (NOAA) Global Monitoring Laboratory (GML) Carbon Cycle Greenhouse Gases (CCGG) University of Colorado Institute of Arctic and Alpine Research (INSTAAR) Atmospheric Research Laboratory group (ARL) Correspondence concerning these data should be directed to: John Mund NOAA Global Monitoring Laboratory 325 Broadway, R/GML-1 Boulder, CO 80305 U.S.A. email: John.Mund@noaa.gov -------------------------------------------------------------------- 2. USE OF DATA These data are made freely available to the public and the scientific community in the belief that their wide dissemination will lead to greater understanding and new scientific insights. The availability of these data does not constitute publication of the data. NOAA relies on the ethics and integrity of the user to ensure that GML receives fair credit for their work. If the data are obtained for potential use in a publication or presentation, GML should be informed at the outset of the nature of this work. If the GML data are essential to the work, or if an important result or conclusion depends on the GML data, co-authorship may be appropriate. This should be discussed at an early stage in the work. Manuscripts using the GML data should be sent to GML for review before they are submitted for publication so we can ensure that the quality and limitations of the data are accurately represented. 2.1 CITATION Please reference these data as Helmig, D., Hueber, J., Tans, P., University Of Colorado Institute Of Arctic And Alpine Research (INSTAAR), & NOAA GML CCGG Group. (2021). University of Colorado Institute of Arctic and Alpine Research (INSTAAR) Flask-Air Sample Measurements of Atmospheric Non Methane Hydrocarbons Mole Fractions from the NOAA GML Carbon Cycle Surface Network at Global and Regional Background Sites, 2004-2016 (Version 2021.05.04) [Data set]. NOAA Global Monitoring Laboratory. https://doi.org/10.15138/6AV8-GS57 -------------------------------------------------------------------- 3. RECIPROCITY Use of these data implies an agreement to reciprocate. Laboratories making similar measurements agree to make their own data available to the general public and to the scientific community in an equally complete and easily accessible form. Modelers are encouraged to make available to the community, upon request, their own tools used in the interpretation of the GML data, namely well documented model code, transport fields, and additional information necessary for other scientists to repeat the work and to run modified versions. Model availability includes collaborative support for new users of the models. -------------------------------------------------------------------- 4. WARNINGS Every effort is made to produce the most accurate and precise measurements possible. However, we reserve the right to make corrections to the data based on recalibration of standard gases or for other reasons deemed scientifically justified. We are not responsible for results and conclusions based on use of these data without regard to this warning. -------------------------------------------------------------------- 5. UPDATE NOTES +++++++++++++++++++++++++++++++ Lab-wide notes: 2011-10-07 We introduced the term "measurement group", which identifies the group within NOAA and Institute of Arctic and Alpine Research (INSTAAR) University of Colorado Boulder that made the measurement. We can now have multiple groups measuring some of the same trace gas species in our discrete samples. Measurement groups within NOAA and INSTAAR are ccgg: NOAA Carbon Cycle Greenhouse Gases group (CCGG) hats: NOAA Halocarbons and other Atmospheric Trace Species group (HATS) arl: INSTAAR Atmospheric Research Laboratory (ARL) sil: INSTAAR Stable Isotope Laboratory (SIL) curl: INSTAAR Laboratory for Radiocarbon Preparation and Research (CURL) We also changed the file naming convention (see section "DATA - FILE NAME DESCRIPTION"). +++++++++++++++++++++++++++++++ Project-specific notes: 2015-11-20 Updated the content and format of event files to include elevation in meters above sea level (masl) and sample collection intake height in meters above ground level (magl). Elevation plus collection intake height equals altitude, which has always been included in the NOAA distribution. In adding these 2 fields, the event number column has moved. The new format is described in Sections 7.3 and 7.4. 2013-08-27 Coordinates of some of the sample locations have changed. These changes improve the specified location based on new information. Changes tend to be minor and do not necessarily reflect a change in the actual sampling location. 2011-10-01 The data file format has been modified to include the measurement group and, additionally, the sample collection and analysis times now include second information (e.g., 2011 03 15 23 06 12). See section 7.3 for details. 2010-10-01 The format of the NOAA ESRL data records has been changed to include an estimate of the uncertainty associated with each measurement. The determination of the estimate is trace gas specific and described in section 6 (INTRODUCTION). +++++++++++++++++++++++++++++++ Parameter-specific notes: 2021-05-03 Several analysis dates were determined to be in error with data to correct currently unavailable. These dates have been reset to the sample date and time which should be interpreted as meaning that the analysis date is unavailable. This should not impact data quality in other ways as in-depth storage studies did not find a notable influence of the time that has passed between sampling and analysis. -------------------------------------------------------------------- 6. INTRODUCTION All samples in this archive were analyzed for light hydrocarbons at INSTAAR, Atmospheric Research Laboratory (ARL) in Boulder, Colorado by gas chromatography. ARL measured samples from the NOAA GML network from 2004 through 2016, analyzing flasks from a dozen sites at first and then rapidly extending the coverage to more than 40 sites. ARL also made in-situ measurements at Pico (Azores) and at Summit (Greenland) alongside the flask samples. In-situ data from the Deutscher Wetterdienst in Hohenpeissenberg (HPB) were also regularly compared to flask samples collected at this site. The INSTAAR NMHC laboratory was audited by the World Calibration Center (WCC) for Volatile Organic Compounds in 2008. All measurement results passed the quality criteria set by the WCC. ------------------------------------------------------------------- 7. DATA - GENERAL COMMENTS Method: The GC analysis involves drying of approximately 600 cc (STP) of sample gas, then preconcentration of analytes on an adsorbent bed, followed by thermal desorption onto a short capillary guard column and main capillary column (HP-Plot AL2O3/KCl 0.53 mm x 60 m) for separation, and finally detection by a Flame Ionization Detector (FID). The gas chromatograph used is a HP-5890 series II. Drying is accomplished by running the sample gas through a tube cooled to -25°C. The custom-built preconcentration system is cooled to -35°C and uses Carboxen 1000 and Carboxen 1016 as adsorbents. Desorption is controlled at 310°C. The flask analysis system is completely automated, allowing analysis of up to 12 flasks in one sequence. Calibration: Analysis of air samples are compared to analysis of synthetic and compressed air standards. One compressed air standard is run twice a day to check on the stability of the FID response, the other standards are ran at least twice a month to determine the response of the FID. Blanks are prepared by running compressed air through a heated bed of PtO. Blank samples are analyzed twice daily. Results: Results are expressed as mole fractions (pmol/mol) Uncertainty: Reproducibility varies by compound and with mole fraction. It is monitored routinely using the results of the paired flask sampling. Reproducibility is typically better than 5% RSD. Lower RSDs are obtained for the higher abundance compounds (ethane, propane). Accuracy is determined by the calibration standard scale. A minimum of three independently prepared and cross-referenced standards is in use at any given time. Deviations in the response factors from these different standards are smaller than 5%, with results for ethane and propane typically being equal or less than 2-3%. Data correction: After comparing the Isoprene (C5H8) flask data at the German Weather Service Hohenpeissenberg (HPB) site with results from parallel sampling and in-situ GC analysis at the site, a systematic correction was applied to the whole isoprene dataset from 2007 to 2016, to scale the flask data to the in-situ results. For mole fractions ranging from 8 ppt (detection limit) to 450 ppt, a polynomial function was applied and for mole fractions above 450 ppt, a linear function was applied. These correction functions adjust the original flask results for isoprene by no more than the larger of 5 pmol/mol or 10%. The Pacific Ocean Cruise (POC, travelling between the US west coast and New Zealand or Australia) data have been merged and grouped into 5 degree latitude bins. For the South China Sea cruises (SCS) the data are grouped in 3 degree latitude bins. Sampling frequencies are approximately weekly for the fixed sites and average one sample every 3 weeks per latitude zone for POC and about one sample every week per latitude for SCS. Historically, samples have been collected using two general methods: flushing and then pressurizing glass flasks with a pump, or opening a stopcock on an evacuated glass flask; since 28 April 2003, only the former method is used. During each sampling event, a pair of flasks is filled. ------------------------------------------------------------------- 7.1 DATA - SAMPLING LOCATIONS For a summary of sampling locations, please visit http://www.esrl.noaa.gov/gmd/dv/site/site_table.html. IMPORTANT NOTES: 1. Data for all species may not be available for all sites listed in the table. 2. The exact location of a sampling site recorded in our database may change or become better defined over time. The latitude, longitude, and altitude of a sample event is based on the best information available at the time of sample collection. Differences in sample position associated with a particular site may be due to the site moving or changes in technology that permit a more accurate location determination. To view near real-time data, manipulate and compare data, and create custom graphs, please visit http://www.esrl.noaa.gov/gmd/dv/iadv/. ------------------------------------------------------------------- 7.2 DATA - FILE NAME DESCRIPTION Encoded into each file name are the parameter (trace gas identifier); sampling site; sampling project; laboratory ID number; measurement group; and optional qualifiers that further define the file contents. All file names use the following naming scheme: 1 2 3 4 5 [parameter]_[site]_[project]_[lab ID number]_[measurement group]_[optional 6 7 qualifiers].[file type] 1. [parameter] Identifies the measured parameter or trace gas species. (ex) co2 Carbon dioxide ch4 Methane co2c13 d13C (co2) merge more than one parameter 2. [site] Identifies the sampling site code. (ex) brw pocn30 car amt 3. [project] Identifies sampling platform and strategy. (ex) surface-flask surface-pfp surface-insitu aircraft-pfp aircraft-insitu tower-insitu 4. [lab ID number] A numeric field that identifies the sampling laboratory (1,2,3, ...). NOAA GML is lab number 1 (see http://www.esrl.noaa.gov/gmd/ccgg/obspack/labinfo.html). 5. [measurement group] Identifies the group within NOAA GML or INSTAAR that makes the actual measurement. See Section 5 (UPDATE NOTES) for details. (ex) ccgg hats arl sil 6. [optional qualifiers] Optional qualifier(s) may indicate data subsetting or averaging. Multiple qualifiers are delimited by an underscore (_). A more detailed description of the file contents is included within each data file. (ex) event All measurement results for all collected samples (discrete (flask) data only). month Computed monthly averages all collected samples (discrete (flask) data only). hour_#### Computed hourly averages for the specified 4-digit year (quasi-continuous data only) HourlyData Computed hourly averages for entire record (quasi-continuous data only) DailyData Computed daily averages for entire record (quasi-continuous data only) MonthlyData Computed monthly averages for entire record (quasi-continuous data only) 7. [file type] File format (netCDF, ASCII text). (ex) txt ASCII text file nc netCDF4 file ------------------------------------------------------------------- 7.3 DATA - EVENT WITH SINGLE PARAMETER The event data files in ftp://aftp.cmdl.noaa.gov/data/trace_gases/voc/<parameter>/flask/surface/ scheme (see Section 7.2): [parameter]_[site]_[project]_[lab ID number]_[measurement group]_[optional qualifiers].txt (ex) C2H6_brw_surface-flask_1_arl.txt contains C2H6 arl measurement results for all surface flask samples collected at Barrow, Alaska. The data files contain multiple lines of header information followed by one record for each atmospheric measurement of a single parameter or trace gas species. Fields are defined as follows: Field 1: [SITE CODE] The three-character sampling location code (see above). Field 2: [YEAR] The sample collection date and time in UTC. Field 3: [MONTH] Field 4: [DAY] Field 5: [HOUR] Field 6: [MINUTE] Field 7: [SECOND] Field 8: [FLASK ID] The sample container ID. Field 9: [METHOD] A single-character code that identifies the sample collection method. The codes are: P - Sample collected using a portable, battery powered pumping unit. Two flasks are connected in series, flushed with air, and then pressurized to 1.2 - 1.5 times ambient pressure. D - Similar to P but the air passes through a condenser cooled to about 5 deg C to partially dry the sample. G - Similar to D but with a gold-plated condenser. T - Evacuated flask filled by opening an O-ring sealed stopcock. S - Flasks filled at NOAA ESRL observatories by sampling air from the in situ CO2 measurement air intake system. N - Before 1981, flasks filled using a hand-held aspirator bulb. After 1981, flasks filled using a pump different from those used in method P, D, or G. F - Five liter evacuated flasks filled by opening a ground glass, greased stopcock. Field 10: [TRACE GAS NAME] Gas identifier (e.g., co2, co2c13). Field 11: [MEASUREMENT GROUP] Identifies the group within NOAA and INSTAAR making the actual measurement (e.g., ccgg, hats, arl). See Section 5 (UPDATE NOTES) for details. Field 12: [MEASURED VALUE] Dry air mole fraction or isotopic composition. Missing values are denoted by -999.99[9]. Field 13: [ESTIMATED UNCERTAINTY] Estimated uncertainty of the reported measurement value. Missing values are denoted by -999.99[9]. Field 14: [QC FLAG] A three-character field indicating the results of our data rejection and selection process, described in section 7.5. Field 15: [INSTRUMENT] A 2-character code that identifies the instrument used for the measurement. Field 16: [YEAR] The measurement date and time in LT. Field 17: [MONTH] Field 18: [DAY] Field 19: [HOUR] Field 20: [MINUTE] Field 21: [SECOND] Field 22: [LATITUDE] The latitude where the sample was collected, (negative (-) numbers indicate samples collected in the southern hemipshere). Field 23: [LONGITUDE] The longitude where the sample was collected, (negative (-) numbers indicate samples collected in the western hemisphere). Field 24: [ALTITUDE] The altitude of the sample inlet (masl). The reported altitude is the surface elevation plus sample intake height. Field 25: [ELEVATION] Surface elevation (masl). Field 26: [INTAKE HEIGHT] Air sample collection height above ground level (magl). Field 27: [EVENT NUMBER] A long integer that uniquely identifies the sampling event. Fields in each line are delimited by whitespace. (ex) LEF 2005 12 08 17 34 00 847-99 N C3H6 ARL 53.423 -999.990 ... v1 2006 03 14 00 13 00 45.9100 -90.2600 868.00 472.00 396.00 210595 ------------------------------------------------------------------- 7.4 DATA - EVENT WITH MULTIPLE PARAMETERS On special request we can distribute a "merged" file, which includes for each sampling event, measurement results for muliple parameters or trace gas species. A merged file does not include all information found in a single parameter data file. For example, merged files exclude measurement uncertainty, analysis instrument ID and date and time for each parameter. Thus, the single parameter data file is our most comprehensive data archive. The format of a merged file is slightly different from single parameter event file. A "merged" file will have the word "merge" in the parameter field of the file name. The file name does not inform on the number of parameters included in the file. Merged data files use the following naming scheme (see Section 7.2): merge_[site]_[project]_[lab ID number]_[measurement group]_[optional qualifiers].txt (ex) merge_brw_surface-flask_1_arl.txt contains arl measurement results for two or more parameters for all surface flask samples collected at Barrow, Alaska. The data files contain multiple lines of header information followed by one record for each atmospheric measurement of a single parameter or trace gas species. Fields are defined as follows: Field 1: [SITE CODE] The three-character sampling location code (see above). Field 2: [YEAR] The sample collection date and time in UTC. Field 3: [MONTH] Field 4: [DAY] Field 5: [HOUR] Field 6: [MINUTE] Field 7: [SECOND] Field 8: [FLASK ID] The sample container ID. Field 9: [METHOD] A single-character code that identifies the sample collection method. The codes are: P - Sample collected using a portable, battery powered pumping unit. Two flasks are connected in series, flushed with air, and then pressurized to 1.2 - 1.5 times ambient pressure. D - Similar to P but the air passes through a condenser cooled to about 5 deg C to partially dry the sample. G - Similar to D but with a gold-plated condenser. T - Evacuated flask filled by opening an O-ring sealed stopcock. S - Flasks filled at NOAA ESRL observatories by sampling air from the in situ CO2 measurement air intake system. N - Before 1981, flasks filled using a hand-held aspirator bulb. After 1981, flasks filled using a pump different from those used in method P, D, or G. F - Five liter evacuated flasks filled by opening a ground glass, greased stopcock. Field 10: [LATITUDE] The latitude where the sample was collected, (negative (-) numbers indicate samples collected in the southern hemipshere). Field 11: [LONGITUDE] The longitude where the sample was collected, (negative (-) numbers indicate samples collected in the western hemisphere). Field 12: [ALTITUDE] The altitude of the sample inlet (masl). The reported altitude is the surface elevation plus sample intake height. Field 13: [ELEVATION] Surface elevation (masl). Field 14: [INTAKE HEIGHT] Air sample collection height above ground level (magl). Field 15: [EVENT NUMBER] A long integer that uniquely identifies the sampling event. There is a group of 4 fields for each parameter and measurement group included in the merge file. Field ##+1: [TRACE GAS NAME] Gas identifier (e.g., co2, co2c13). Field ##+2: [MEASUREMENT GROUP] Identifies the group within NOAA and INSTAAR making the actual measurement (e.g., ccgg, hats, arl). See Section 5 (UPDATE NOTES) for details. Field ##+3: [MEASURED VALUE] Dry air mole fraction or isotopic composition. Missing values are denoted by -999.99[9]. Field ##+4: [QC FLAG] A three-character field indicating the results of our data rejection and selection process, described in section 7.5. Fields in each line are delimited by whitespace. (ex) LEF 2006 05 17 17 14 00 1382-99 N 45.9100 -90.2600 868.00 472.00 396.00 217438 C2H2 ARL -75.541 ... C2H4 ARL 76.617 ... ------------------------------------------------------------------- 7.5 QC FLAGS NOAA GML uses a 3-column quality control flag where each column is defined as follows: column 1 REJECTION flag. An alphanumeric other than a period (.) in the FIRST column indicates a sample with obvious problems during collection or analysis. This measurement should not be interpreted. column 2 SELECTION flag. An alphanumeric other than a period (.) in the SECOND column indicates a sample that is likely valid but does not meet selection criteria determined by the goals of a particular investigation. column 3 INFORMATION flag. An alphanumeric other than a period (.) in the THIRD column provides additional information about the collection or analysis of the sample. WARNING: A "P" in the 3rd column of the QC flag indicates the measurement result is preliminary and has not yet been carefully examined by the PI. The "P" flag is removed once the quality of the measurement has been assessed. FLAGS DESCRIPTION: '...' is a good sample 1st column flags: 'P..' flag: poor flask pair agreement Since samples are collected in pairs, the pair difference is calculated. If the mole fraction x falls between the below margins, and the difference exceeds abs(dx) the flag is applied. x <= 10 and dx > |5| 11 > x <= 25 and dx > |10| 26 > x <= 63 and dx > |20| 64 > x <= 156 and dx > |40| 157 > x <= 391 and dx > |80| 392 > x <= 977 and dx > |160| 978 > x <= 2441 and dx > |320| 2442 > x <= 6104 and dx > |640| 6105 > x <= 15259 and dx > |1280| 'A..' flag: no data obtained This flag is applied when there was an instrument failure or if the amount of air left in the flask was too low for an analysis '*..' flag: This flag is applied when an unstable baseline made the peak integration impossible 'D..' flag: This flag is applied when this particular compound was below the detection limit of our measurement 'C..' flag: This flag is applied when a contamination was found 3rd column flags: '..C' flag: This flag is applied when the amount of air left in the flask was too low for the instrument to draw the standard volume for analysis. In that case an analysis was performed with a reduced sampling volume. Note: We feel that this data set best represents hydrocarbons distribution in the remote, well-mixed global troposphere. It is possible, and even likely, that some values represent poorly mixed air parcels influenced by local anthropogenic sources. Users of these data should be aware that further data screening is necessary. We will provide advice in selecting the data that will best fit the goals of user's particular investigation. ------------------------------------------------------------------- 8. DATA RETRIEVAL To transfer all files in a directory, it is more efficient to download the tar or zipped files. To transfer a tar file, use the following steps from the ftp prompt: 1. ftp> binary ! set transfer mode to binary 2. ftp> get filename.tar.gz ! transfer the file 3. ftp> bye ! leave ftp 4. $ gunzip filename.tar.gz ! unzip your local copy 5. $ tar xvf filename.tar ! unpack the file To transfer a zipped file, use the following steps from the ftp prompt: 1. ftp> binary ! set transfer mode to binary 2. ftp> get filename.zip ! transfer the file 3. ftp> bye ! leave ftp 4. $ unzip filename.zip ! uncompress your local copy ------------------------------------------------------------------- 9. REFERENCES Helmig, D., Bottenheim J., Galbally I. E., Lewis A., Milton M. J. T., Penkett S., Plass-Duelmer C., Reimann S., Tans P., Theil S. (2009) Volatile Organic Compounds in the Global Atmosphere. Eos Trans. AGU, 90(52), Feature. Pollmann J., Helmig D., Hueber J., Plass-Duelmer C., and Tans P. (2008) Sampling, storage, and analysis of C2-C7 non-methane hydrocarbons from the US National Oceanic and Atmospheric Administration Cooperative Air Sampling Network glass flasks. J. Chromatogr., 1188, 75-87. -------------------------------------------------------------------