This component applies conventional fixed length averaging to input data. Normally it also removes contaminated values from the averages. Note that some types of averaging require inputs beyond the values being averaged. For example, to average raw absorptions, transmittance and path length are required because the average is performed with a difference measurement.

Usage

da.avg [switches…​] [[station] variables times [archive]|[file]]

Switches

--contamination=STRING

This defines the contamination mode name in effect. The contamination mode determines which variables are removed when they are flagged as contaminated. This often corisponds with various profiles for other station specific configuration. For example the "aerosol" mode generally removes scattering, absorption, extinction and concentrations. The special mode "none" or "disable" turns off all contamination removal and values are averaged regardless of their contaminated state.

Default: aerosol

--continuous[=BOOLEAN]

If this option is set then the averages produced are the maximum continuous averages within the binning interval. This allows difference measurements to be repeatable but may cause splitting within the interval.

Default: Disabled

--cover=NUMBERS

This is the fraction of data required to exist for an average bin to be produced at all. For example if this is set to 0.95 then averages will only be produced when the fraction of missing data in a bin is greater than 95% of the total bin. So, for a one hour bin that would require at least 57 minutes of valid data. Setting this to undefined disables the requirement.

Default: Disabled

--gap=INTERVAL

If two values are seperated by this much time then the average is split. Even if there is no gap the values may still be split into separate bins if the interval demands it. An undefined gap allows for infinite separation.

Default: Infinite

Undefined intervals are accepted. The interval must be greater than or equal to zero length.

--interval=INTERVAL

This is width of the bin to average on. All data within each interval is averaged together to produce the output. An undefined interval averages all available data together.

Default: One hour

Undefined intervals are accepted. The interval must be greater than zero in length.

Arguments

If no bare word input specification is supplied then data are read from standard input.

station

This argument is used to specify the station used to look up variables that do not include a station as part of an archive read specification. The station is the three letter GAW station code of the location, such as BND. The argument accepts multiple stations specified as regular expressions and separated by : or ; or ,. For example BND,MLO and (BND|MLO) are two ways of selecting both the BND and MLO stations.

The special value allstations may also be specified to select all stations.

variables

This argument may be split into multiple actual program arguments. Each part consists of one or more variable specifications separated by , (commas). The station and archive used if none are explicitly given are defined in the other arguments or inferred from system defaults. For example simply BsB_S11 selects the blue scattering from the S11 instrument and returns all cut sizes and metadata. If instead ::BsB_S11:pm10 is used then only PM10 scattering is returned. That is, the selection specifies "any" station and archive (the defaults are set in the other arguments) then requires that it has the "PM10" flavor. A specification such as bnd:BsB_S11,thd:BsB_S11 allows for data selection from multiple stations.

This may also be used to select a "record" type alias such as "S11a". In this mode all the variables defined in the alias are returned.

Finally the special value everything can be used to select all available data for the given station(s) and archive(s).

times

This argument may be split into multiple actual program arguments. This full list of time arguments defines a time bounds list that sets the range of data queried. The final resulting data are clipped to the given time bounds but any values that intersect them are returned.

archive

This argument is used to specify the archive used to look up variables that do not include an archive as part of an archive read specification. The archive is the internal name, such as raw or clean_meta. The argument accepts multiple archives specified as regular expressions and separated by : or ; or ,. For example raw(_meta)? selects both the raw and raw_meta archives.

The special value allarchives may also be used to select all available archives.

file

This argument is used to specify the the file to read data from. If it is present and exists then data is read from the given file name instead of from standard input. Alternatively - (a single dash) may be used to explicitly specify standard input.

Examples

Defaults

This will average input data to one hour intervals and remove contamination from the standard set of variables involved in an aerosol system (generally scattering, absorption, extinction and concentrations).

da.avg bnd S11a 2015-05-01 2015-05-03

Or:

da.avg input_file.c3d

Or:

da.get bnd S11a 2015-05-01 2015-05-03 | da.avg
Interval and coverage requirement

This will average data to one day interval requiring at least 95% of data within each day to exist (1368 minutes). Contaminated data is not excluded from the averages.

da.avg --contamination=none --cover=0.95 --interval=1da bnd S11a 2015-05-01 2015-05-03
Infinite continuous averages

This will average all input data to the maximum extent possible but will break up those averages whenever there is a discontinuity in the input data. This also excludes contaminated values for all standard aerosol parameters.

da.avg --continuous --interval= bnd S11a 2015-05-01 2015-05-03