likwid-agent(1) monitoring daemon for hardware performance counters

SYNOPSIS

likwid-agent <config_file>

DESCRIPTION

likwid-agent is a daemon application that uses likwid-perfctr(1) to measure hardware performance counters. The basic configuration is in a global configuration file. The configuration of the hardware event sets is done with extra files suitable for each architecture. Besides the hardware event configuration, the raw data can be transformed using formulas to interested metrics. In order to output to much data, the data can be further filtered or aggregated. likwid-agent provides multiple store backends like logfiles, RRD (Round Robin Database) or gmetric (Ganglia Monitoring System).

CONFIG FILE

The global configuration file has the following options:
GROUPPATH <path>
Path to the group files containing event set and output definitions. See section GROUP FILES for information.
EVENTSET <group1> <group2> ...
Space separated list of groups (without .txt) that should be monitored.
DURATION <time>
Measurement duration in seconds.
LOGPATH <path>
Specify a logfile.
GMETRIC <True/False>
Activates the output to gmetric.
GMETRICPATH <path>
Set path to the gmetric executable.
GMETRICCONFIG <path>
Set a custom configuration file is needed for gmetric.
RRD <True/False>
Activates the output to RRD files (Round Robin Database).
RRDPATH <path>
Output path for the RRD files. The files are named according to the group and each output metric is saved as DS with function GAUGE. The RRD is configured with RRA entries to store average, minimum and maximum of 10 minutes for one hour, of 60 min for one day and daily data for one month.
SYSLOG <True/False>
Activates the output to system log using logger.
SYSLOGPRIO <prio>
Set the priority string for logger, default is 'local0.notice'.

GROUP FILES

The group files are adapted performance group files as used by likwid-perfctr(1). This makes it easy to uses the predefined and often used performance groups as basis for the monitoring. The folder structure of for the groups is <GROUPPATH>/<SHORT_ARCH_NAME>/ with <SHORT_ARCH_NAME> similar to the ones for the performance groups, like 'sandybridge' or 'haswellEP'.
SHORT <string>
A short descriptive information about the group.
EVENTSET
<counter1> <event1>
<counter2>:<option> <event2>
Definition of the eventset similar to the performance groups.
METRICS
<metricname> <formula>
<filter> <metricname> <formula>
Definition of the output metrics. The syntax follows the METRICS definition of the performance groups as used by likwid-perfctr(1). If no function is set at the beginning of the line, <formula> is evaluated for every CPU and send to the output backends. The <metricname> gets the prefix "T<cpuid> ". To avoid writing to much data to the backends, the data can be reduced by <filter>. The possible filter options are MIN, MAX, AVG, SUM, ONCE. The ONCE filter sends only the data from the first CPU to the output backends commonly used for the measurement duration.

AUTHOR

Written by Thomas Roehl <[email protected]>.