Waveform archive data availability collector
data extents - the absolute earliest and latest times data is available of a particular channel
data segments - continuous data segments sharing the same quality and sampling rate attributes
scardac is intended to be executed periodically, e.g., as a cronjob.
/fdsnws/station (extent information only, see
/fdsnws/ext/availability (extent and segment information provided in different formats)
Scan the SDS archive for new channel IDs and create new
Subsequently process the
threadsnumber of parallel threads. For each
Find all available daily data files
Sort the file list according date
For each data file
DataSegmentsthat do longer exists
update or create
DataSegmentsthat changed or are new
a segment is split if
jitter(difference between previous records end time and current records start time) is exceeded
the quality or sampling rate changed
merge segment information into
Extentssharing the same quality and sample rate information)
merge segment start and end time into overall
Get command line help or execute scardac with default parameters and informative debug output:
scardac -h scardac --debug
scardac -d mysql://sysop:sysop@localhost/seiscomp -a $SEISCOMP_ROOT/var/lib/archive --debug scxmldump -Yf -d mysql://sysop:sysop@localhost/seiscomp -o availability.xml
Update the availability of waveform data files existing in the standard SDS archive to the seiscomp database. Use fdsnws to fetch a flat file containing a list of periods of available data from stations of the CX network sharing the same quality and sampling rate attributes:
scardac -d mysql://sysop:sysop@localhost/seiscomp -a $SEISCOMP_ROOT/var/lib/archive wget -O availability.txt 'http://localhost:8080/fdsnws/ext/availability/1/query?network=CX'
The SeisComP module fdsnws must be running for executing this example.
scardac inherits global options.
Path to MiniSeed waveform archive where all data is stored. The SDS archive structure is defined as YEAR/NET/STA/CHA/NET.STA.LOC.CHA.YEAR.DATEOFYEAR, e.g. 2018/GE/APE/BHZ.D/GE.APE..BHZ.D.2018.125 Default is
Number of threads scanning the archive in parallel Default is
Batch size of database transactions used when updating data availability segments. Allowed range: [1,1000] Default is
Acceptable derivation of end time and start time of successive records in multiples of sample time. Default is
Maximum number of segments per stream. If the limit is reached no more segments are added to the database and the corresponding extent is flagged as to fragmented. Use a negative value to disable any limit. Default is
show help message.
show version information
Use alternative configuration file. When this option is used the loading of all stages is disabled. Only the given configuration file is parsed and used. To use another name for the configuration create a symbolic link of the application or copy it, e.g. scautopick -> scautopick2.
Load given plugins.
Run as daemon. This means the application will fork itself and doesn’t need to be started with &.
Verbosity level [0..4]. 0:quiet, 1:error, 2:warning, 3:info, 4:debug
Increase verbosity level (may be repeated, eg. -vv)
Quiet mode: no logging output
For each log entry print the component right after the log level. By default the component output is enabled for file output but disabled for console output.
Limits the logging to a certain component. This option can be given more than once.
Use syslog logging back end. The output usually goes to /var/lib/messages.
Path to lock file.
Send log output to stdout.
Debug mode: –verbosity=4 –console=1
Trace mode: –verbosity=4 –console=1 –print-component=1 –print-context=1
Use alternative log file.
Overrides configuration parameter
Do not scan the archive but generate test data for each stream in the inventory. Format: days,gaps,gapslen,overlaps,overlaplen. E.g. the following parameter list would generate test data for 100 days (starting from now()-100) which includes 150 gaps with a length of 2.5s followed by 50 overlaps with an overlap of 5s: –generate-test-data=100,150,2.5,50,5