Split parser/serializer docs (#4690)

2018-09-17 11:45:08 -07:00
parent 96f3d7def4
commit 41d528c8ce
25 changed files with 1412 additions and 1331 deletions
--- a/docs/DATA_FORMATS_INPUT.md
+++ b/docs/DATA_FORMATS_INPUT.md
--- a/docs/DATA_FORMATS_OUTPUT.md
+++ b/docs/DATA_FORMATS_OUTPUT.md
@@ -4,13 +4,14 @@ In addition to output specific data formats, Telegraf supports a set of
 standard data formats that may be selected from when configuring many output
 plugins.

-1. [InfluxDB Line Protocol](#influx)
-1. [JSON](#json)
-1. [Graphite](#graphite)
-1. [SplunkMetric](../plugins/serializers/splunkmetric/README.md)
+1. [InfluxDB Line Protocol](/plugins/serializers/influx)
+1. [JSON](/plugins/serializers/json)
+1. [Graphite](/plugins/serializers/graphite)
+1. [SplunkMetric](/plugins/serializers/splunkmetric)

 You will be able to identify the plugins with support by the presence of a
 `data_format` config option, for example, in the `file` output plugin:
+
 ```toml
 [[outputs.file]]
  ## Files to write to, "stdout" is a specially handled file.
@@ -22,191 +23,3 @@ You will be able to identify the plugins with support by the presence of a
  ## https://github.com/influxdata/telegraf/blob/master/docs/DATA_FORMATS_OUTPUT.md
  data_format = "influx"
 ```
-
-## Influx
-
-The `influx` data format outputs metrics using
-[InfluxDB Line Protocol](https://docs.influxdata.com/influxdb/latest/write_protocols/line_protocol_tutorial/).
-This is the recommended format unless another format is required for
-interoperability.
-
-### Influx Configuration
-```toml
-[[outputs.file]]
-  ## Files to write to, "stdout" is a specially handled file.
-  files = ["stdout", "/tmp/metrics.out"]
-
-  ## Data format to output.
-  ## Each data format has its own unique set of configuration options, read
-  ## more about them here:
-  ## https://github.com/influxdata/telegraf/blob/master/docs/DATA_FORMATS_OUTPUT.md
-  data_format = "influx"
-
-  ## Maximum line length in bytes.  Useful only for debugging.
-  # influx_max_line_bytes = 0
-
-  ## When true, fields will be output in ascending lexical order.  Enabling
-  ## this option will result in decreased performance and is only recommended
-  ## when you need predictable ordering while debugging.
-  # influx_sort_fields = false
-
-  ## When true, Telegraf will output unsigned integers as unsigned values,
-  ## i.e.: `42u`.  You will need a version of InfluxDB supporting unsigned
-  ## integer values.  Enabling this option will result in field type errors if
-  ## existing data has been written.
-  # influx_uint_support = false
-```
-
-## Graphite
-
-The Graphite data format is translated from Telegraf Metrics using either the
-template pattern or tag support method.  You can select between the two
-methods using the [`graphite_tag_support`](#graphite-tag-support) option.  When set, the tag support
-method is used, otherwise the [`template` pattern](#template-pattern) is used.
-
-#### Template Pattern
-
-The `template` option describes how Telegraf traslates metrics into _dot_
-buckets.  The default template is:
-
-```
-template = "host.tags.measurement.field"
-```
-
-In the above template, we have four parts:
-
-1. _host_ is a tag key. This can be any tag key that is in the Telegraf
-metric(s). If the key doesn't exist, it will be ignored. If it does exist, the
-tag value will be filled in.
-1. _tags_ is a special keyword that outputs all remaining tag values, separated
-by dots and in alphabetical order (by tag key). These will be filled after all
-tag keys are filled.
-1. _measurement_ is a special keyword that outputs the measurement name.
-1. _field_ is a special keyword that outputs the field name.
-
-**Example Conversion**:
-
-```
-cpu,cpu=cpu-total,dc=us-east-1,host=tars usage_idle=98.09,usage_user=0.89 1455320660004257758
-=>
-tars.cpu-total.us-east-1.cpu.usage_user 0.89 1455320690
-tars.cpu-total.us-east-1.cpu.usage_idle 98.09 1455320690
-```
-
-Fields with string values will be skipped.  Boolean fields will be converted
-to 1 (true) or 0 (false).
-
-#### Graphite Tag Support
-
-When the `graphite_tag_support` option is enabled, the template pattern is not
-used.  Instead, tags are encoded using
-[Graphite tag support](http://graphite.readthedocs.io/en/latest/tags.html)
-added in Graphite 1.1.  The `metric_path` is a combination of the optional
-`prefix` option, measurement name, and field name.
-
-The tag `name` is reserved by Graphite, any conflicting tags and will be encoded as `_name`.
-
-**Example Conversion**:
-```
-cpu,cpu=cpu-total,dc=us-east-1,host=tars usage_idle=98.09,usage_user=0.89 1455320660004257758
-=>
-cpu.usage_user;cpu=cpu-total;dc=us-east-1;host=tars 0.89 1455320690
-cpu.usage_idle;cpu=cpu-total;dc=us-east-1;host=tars 98.09 1455320690
-```
-
-### Graphite Configuration
-
-```toml
-[[outputs.file]]
-  ## Files to write to, "stdout" is a specially handled file.
-  files = ["stdout", "/tmp/metrics.out"]
-
-  ## Data format to output.
-  ## Each data format has its own unique set of configuration options, read
-  ## more about them here:
-  ## https://github.com/influxdata/telegraf/blob/master/docs/DATA_FORMATS_OUTPUT.md
-  data_format = "graphite"
-
-  ## Prefix added to each graphite bucket
-  prefix = "telegraf"
-  ## Graphite template pattern
-  template = "host.tags.measurement.field"
-
-  ## Support Graphite tags, recommended to enable when using Graphite 1.1 or later.
-  # graphite_tag_support = false
-```
-
-## JSON
-
-The JSON output data format output for a single metric is in the
-form:
-```json
-{
-    "fields": {
-        "field_1": 30,
-        "field_2": 4,
-        "field_N": 59,
-        "n_images": 660
-    },
-    "name": "docker",
-    "tags": {
-        "host": "raynor"
-    },
-    "timestamp": 1458229140
-}
-```
-
-When an output plugin needs to emit multiple metrics at one time, it may use
-the batch format.  The use of batch format is determined by the plugin,
-reference the documentation for the specific plugin.
-```json
-{
-    "metrics": [
-        {
-            "fields": {
-                "field_1": 30,
-                "field_2": 4,
-                "field_N": 59,
-                "n_images": 660
-            },
-            "name": "docker",
-            "tags": {
-                "host": "raynor"
-            },
-            "timestamp": 1458229140
-        },
-        {
-            "fields": {
-                "field_1": 30,
-                "field_2": 4,
-                "field_N": 59,
-                "n_images": 660
-            },
-            "name": "docker",
-            "tags": {
-                "host": "raynor"
-            },
-            "timestamp": 1458229140
-        }
-    ]
-}
-```
-
-### JSON Configuration
-
-```toml
-[[outputs.file]]
-  ## Files to write to, "stdout" is a specially handled file.
-  files = ["stdout", "/tmp/metrics.out"]
-
-  ## Data format to output.
-  ## Each data format has its own unique set of configuration options, read
-  ## more about them here:
-  ## https://github.com/influxdata/telegraf/blob/master/docs/DATA_FORMATS_OUTPUT.md
-  data_format = "json"
-
-  ## The resolution to use for the metric timestamp.  Must be a duration string
-  ## such as "1ns", "1us", "1ms", "10ms", "1s".  Durations are truncated to
-  ## the power of 10 less than the specified units.
-  json_timestamp_units = "1s"
-```
--- a/docs/METRICS.md
+++ b/docs/METRICS.md
@@ -0,0 +1,22 @@
+# Metrics
+
+Telegraf metrics are the internal representation used to model data during
+processing.  Metrics are closely based on InfluxDB's data model and contain
+four main components:
+
+- **Measurement Name**: Description and namespace for the metric.
+- **Tags**: Key/Value string pairs and usually used to identify the
+  metric.
+- **Fields**: Key/Value pairs that are typed and usually contain the
+  metric data.
+- **Timestamp**: Date and time associated with the fields.
+
+This metric type exists only in memory and must be converted to a concrete
+representation in order to be transmitted or viewed.  To acheive this we
+provide several [output data formats][] sometimes referred to as
+*serializers*.  Our default serializer converts to [InfluxDB Line
+Protocol][line protocol] which provides a high performance and one-to-one
+direct mapping from Telegraf metrics.
+
+[output data formats]: /docs/DATA_FORMATS_OUTPUT.md
+[line protocol]: /plugins/serializers/influx
--- a/docs/README.md
+++ b/docs/README.md
@@ -0,0 +1,21 @@
+# Telegraf
+
+- Concepts
+  - [Metrics][metrics]
+  - [Input Data Formats][parsers]
+  - [Output Data Formats][serializers]
+  - [Aggregators & Processors][aggproc]
+- Administration
+  - [Configuration][conf]
+  - [Profiling][profiling]
+  - [Windows Service][winsvc]
+  - [FAQ][faq]
+
+[conf]: /docs/CONFIGURATION.md
+[metrics]: /docs/METRICS.md
+[parsers]: /docs/DATA_FORMATS_INPUT.md
+[serializers]: /docs/DATA_FORMATS_OUTPUT.md
+[aggproc]: /docs/AGGREGATORS_AND_PROCESSORS.md
+[profiling]: /docs/PROFILING.md
+[winsvc]: /docs/WINDOWS_SERVICE.md
+[faq]: /docs/FAQ.md
--- a/docs/TEMPLATE_PATTERN.md
+++ b/docs/TEMPLATE_PATTERN.md
@@ -0,0 +1,135 @@
+# Template Patterns
+
+Template patterns are a mini language that describes how a dot delimited
+string should be mapped to and from [metrics][].
+
+A template has the form:
+```
+"host.mytag.mytag.measurement.measurement.field*"
+```
+
+Where the following keywords can be set:
+
+1. `measurement`: specifies that this section of the graphite bucket corresponds
+to the measurement name. This can be specified multiple times.
+2. `field`: specifies that this section of the graphite bucket corresponds
+to the field name. This can be specified multiple times.
+3. `measurement*`: specifies that all remaining elements of the graphite bucket
+correspond to the measurement name.
+4. `field*`: specifies that all remaining elements of the graphite bucket
+correspond to the field name.
+
+Any part of the template that is not a keyword is treated as a tag key. This
+can also be specified multiple times.
+
+**NOTE:** `field*` cannot be used in conjunction with `measurement*`.
+
+### Examples
+
+#### Measurement & Tag Templates
+
+The most basic template is to specify a single transformation to apply to all
+incoming metrics. So the following template:
+
+```toml
+templates = [
+    "region.region.measurement*"
+]
+```
+
+would result in the following Graphite -> Telegraf transformation.
+
+```
+us.west.cpu.load 100
+=> cpu.load,region=us.west value=100
+```
+
+Multiple templates can also be specified, but these should be differentiated
+using _filters_ (see below for more details)
+
+```toml
+templates = [
+    "*.*.* region.region.measurement", # <- all 3-part measurements will match this one.
+    "*.*.*.* region.region.host.measurement", # <- all 4-part measurements will match this one.
+]
+```
+
+#### Field Templates
+
+The field keyword tells Telegraf to give the metric that field name.
+So the following template:
+
+```toml
+separator = "_"
+templates = [
+    "measurement.measurement.field.field.region"
+]
+```
+
+would result in the following Graphite -> Telegraf transformation.
+
+```
+cpu.usage.idle.percent.eu-east 100
+=> cpu_usage,region=eu-east idle_percent=100
+```
+
+The field key can also be derived from all remaining elements of the graphite
+bucket by specifying `field*`:
+
+```toml
+separator = "_"
+templates = [
+    "measurement.measurement.region.field*"
+]
+```
+
+which would result in the following Graphite -> Telegraf transformation.
+
+```
+cpu.usage.eu-east.idle.percentage 100
+=> cpu_usage,region=eu-east idle_percentage=100
+```
+
+#### Filter Templates
+
+Users can also filter the template(s) to use based on the name of the bucket,
+using glob matching, like so:
+
+```toml
+templates = [
+    "cpu.* measurement.measurement.region",
+    "mem.* measurement.measurement.host"
+]
+```
+
+which would result in the following transformation:
+
+```
+cpu.load.eu-east 100
+=> cpu_load,region=eu-east value=100
+
+mem.cached.localhost 256
+=> mem_cached,host=localhost value=256
+```
+
+#### Adding Tags
+
+Additional tags can be added to a metric that don't exist on the received metric.
+You can add additional tags by specifying them after the pattern.
+Tags have the same format as the line protocol.
+Multiple tags are separated by commas.
+
+```toml
+templates = [
+    "measurement.measurement.field.region datacenter=1a"
+]
+```
+
+would result in the following Graphite -> Telegraf transformation.
+
+```
+cpu.usage.idle.eu-east 100
+=> cpu_usage,region=eu-east,datacenter=1a idle=100
+```
+
+[metrics]: /docs/METRICS.md