telegraf/plugins/inputs/logparser/README.md

# logparser Input Plugin

The logparser plugin streams and parses the given logfiles. Currently it only
has the capability of parsing "grok" patterns from logfiles, which also supports
regex patterns.

### Configuration:

```toml
[[inputs.logparser]]
  ## Log files to parse.
  ## These accept standard unix glob matching rules, but with the addition of
  ## ** as a "super asterisk". ie:
  ##   /var/log/**.log     -> recursively find all .log files in /var/log
  ##   /var/log/*/*.log    -> find all .log files with a parent dir in /var/log
  ##   /var/log/apache.log -> only tail the apache log file
  files = ["/var/log/influxdb/influxdb.log"]
  ## Read file from beginning.
  from_beginning = false

  ## Parse logstash-style "grok" patterns:
  ##   Telegraf builtin parsing patterns: https://goo.gl/dkay10
  [inputs.logparser.grok]
    ## This is a list of patterns to check the given log file(s) for.
    ## Note that adding patterns here increases processing time. The most
    ## efficient configuration is to have one file & pattern per logparser.
    patterns = ["%{INFLUXDB_HTTPD_LOG}"]
    ## Full path(s) to custom pattern files.
    custom_pattern_files = []
    ## Custom patterns can also be defined here. Put one pattern per line.
    custom_patterns = '''
    '''
```

> **Note:** The InfluxDB log pattern in the default configuration only works for Influx versions 1.0.0-beta1 or higher.

## Grok Parser

The grok parser uses a slightly modified version of logstash "grok" patterns,
with the format `%{<capture_syntax>[:<semantic_name>][:<modifier>]}`


Telegraf has many of it's own
[built-in patterns](https://github.com/influxdata/telegraf/blob/master/plugins/inputs/logparser/grok/patterns/influx-patterns),
as well as supporting
[logstash's builtin patterns](https://github.com/logstash-plugins/logstash-patterns-core/blob/master/patterns/grok-patterns).


The best way to get acquainted with grok patterns is to read the logstash docs,
which are available here:
  https://www.elastic.co/guide/en/logstash/current/plugins-filters-grok.html


If you need help building patterns to match your logs,
you will find the http://grokdebug.herokuapp.com application quite useful!


By default all named captures are converted into string fields.
Modifiers can be used to convert captures to other types or tags.
Timestamp modifiers can be used to convert captures to the timestamp of the
 parsed metric.


- Available modifiers:
  - string   (default if nothing is specified)
  - int
  - float
  - duration (ie, 5.23ms gets converted to int nanoseconds)
  - tag      (converts the field into a tag)
  - drop     (drops the field completely)
- Timestamp modifiers:
  - ts-ansic         ("Mon Jan _2 15:04:05 2006")
  - ts-unix          ("Mon Jan _2 15:04:05 MST 2006")
  - ts-ruby          ("Mon Jan 02 15:04:05 -0700 2006")
  - ts-rfc822        ("02 Jan 06 15:04 MST")
  - ts-rfc822z       ("02 Jan 06 15:04 -0700")
  - ts-rfc850        ("Monday, 02-Jan-06 15:04:05 MST")
  - ts-rfc1123       ("Mon, 02 Jan 2006 15:04:05 MST")
  - ts-rfc1123z      ("Mon, 02 Jan 2006 15:04:05 -0700")
  - ts-rfc3339       ("2006-01-02T15:04:05Z07:00")
  - ts-rfc3339nano   ("2006-01-02T15:04:05.999999999Z07:00")
  - ts-httpd         ("02/Jan/2006:15:04:05 -0700")
  - ts-epoch         (seconds since unix epoch)
  - ts-epochnano     (nanoseconds since unix epoch)
  - ts-"CUSTOM"


CUSTOM time layouts must be within quotes and be the representation of the
"reference time", which is `Mon Jan 2 15:04:05 -0700 MST 2006`
See https://golang.org/pkg/time/#Parse for more details.
logparser input plugin closes #102 closes #328 2016-06-02 17:47:15 +00:00			`# logparser Input Plugin`

			`The logparser plugin streams and parses the given logfiles. Currently it only`
			`has the capability of parsing "grok" patterns from logfiles, which also supports`
			`regex patterns.`

			`### Configuration:`

			```toml
			`[[inputs.logparser]]`
			`## Log files to parse.`
			`## These accept standard unix glob matching rules, but with the addition of`
			`## ** as a "super asterisk". ie:`
			`## /var/log/**.log -> recursively find all .log files in /var/log`
			`## /var/log//.log -> find all .log files with a parent dir in /var/log`
			`## /var/log/apache.log -> only tail the apache log file`
			`files = ["/var/log/influxdb/influxdb.log"]`
			`## Read file from beginning.`
			`from_beginning = false`

			`## Parse logstash-style "grok" patterns:`
			`## Telegraf builtin parsing patterns: https://goo.gl/dkay10`
			`[inputs.logparser.grok]`
			`## This is a list of patterns to check the given log file(s) for.`
			`## Note that adding patterns here increases processing time. The most`
			`## efficient configuration is to have one file & pattern per logparser.`
			`patterns = ["%{INFLUXDB_HTTPD_LOG}"]`
			`## Full path(s) to custom pattern files.`
			`custom_pattern_files = []`
			`## Custom patterns can also be defined here. Put one pattern per line.`
			`custom_patterns = '''`
			`'''`
			```

Add note about influxdb compatability (#1465) 2016-07-10 10:11:43 +00:00			`> Note: The InfluxDB log pattern in the default configuration only works for Influx versions 1.0.0-beta1 or higher.`

logparser input plugin closes #102 closes #328 2016-06-02 17:47:15 +00:00			`## Grok Parser`

			`The grok parser uses a slightly modified version of logstash "grok" patterns,`
			with the format `%{<capture_syntax>[:<semantic_name>][:<modifier>]}`


			`Telegraf has many of it's own`
			`[built-in patterns](https://github.com/influxdata/telegraf/blob/master/plugins/inputs/logparser/grok/patterns/influx-patterns),`
			`as well as supporting`
			`[logstash's builtin patterns](https://github.com/logstash-plugins/logstash-patterns-core/blob/master/patterns/grok-patterns).`


			`The best way to get acquainted with grok patterns is to read the logstash docs,`
			`which are available here:`
			`https://www.elastic.co/guide/en/logstash/current/plugins-filters-grok.html`


			`If you need help building patterns to match your logs,`
			`you will find the http://grokdebug.herokuapp.com application quite useful!`


			`By default all named captures are converted into string fields.`
			`Modifiers can be used to convert captures to other types or tags.`
			`Timestamp modifiers can be used to convert captures to the timestamp of the`
			`parsed metric.`


			`- Available modifiers:`
			`- string (default if nothing is specified)`
			`- int`
			`- float`
			`- duration (ie, 5.23ms gets converted to int nanoseconds)`
			`- tag (converts the field into a tag)`
			`- drop (drops the field completely)`
			`- Timestamp modifiers:`
			`- ts-ansic ("Mon Jan _2 15:04:05 2006")`
			`- ts-unix ("Mon Jan _2 15:04:05 MST 2006")`
			`- ts-ruby ("Mon Jan 02 15:04:05 -0700 2006")`
			`- ts-rfc822 ("02 Jan 06 15:04 MST")`
			`- ts-rfc822z ("02 Jan 06 15:04 -0700")`
			`- ts-rfc850 ("Monday, 02-Jan-06 15:04:05 MST")`
			`- ts-rfc1123 ("Mon, 02 Jan 2006 15:04:05 MST")`
			`- ts-rfc1123z ("Mon, 02 Jan 2006 15:04:05 -0700")`
			`- ts-rfc3339 ("2006-01-02T15:04:05Z07:00")`
			`- ts-rfc3339nano ("2006-01-02T15:04:05.999999999Z07:00")`
			`- ts-httpd ("02/Jan/2006:15:04:05 -0700")`
			`- ts-epoch (seconds since unix epoch)`
			`- ts-epochnano (nanoseconds since unix epoch)`
			`- ts-"CUSTOM"`


			`CUSTOM time layouts must be within quotes and be the representation of the`
			"reference time", which is `Mon Jan 2 15:04:05 -0700 MST 2006`
			`See https://golang.org/pkg/time/#Parse for more details.`