Commit c88354ce authored by Fabian Reinartz's avatar Fabian Reinartz

Add documentation for vector matching for binary operations.

parent cafa0441
...@@ -5,7 +5,13 @@ sort_rank: 2 ...@@ -5,7 +5,13 @@ sort_rank: 2
# Operators # Operators
## Arithmetic binary operators ## Binary operators
Prometheus's query language supports basic logical and arithmetic operators.
For operations between two instant vectors, the [matching behavior](#vector-matching)
can be modified.
### Arithmetic binary operators
The following binary arithmetic operators exist in Prometheus: The following binary arithmetic operators exist in Prometheus:
* `+` (addition) * `+` (addition)
...@@ -25,16 +31,18 @@ value of every data sample in the vector. E.g. if a time series instant vector ...@@ -25,16 +31,18 @@ value of every data sample in the vector. E.g. if a time series instant vector
is multiplied by 2, the result is another vector in which every sample value of is multiplied by 2, the result is another vector in which every sample value of
the original vector is multiplied by 2. the original vector is multiplied by 2.
**Between two instant vectors**, a binary arithmetic operator only applies to **Between two instant vectors**, a binary arithmetic operator is applied to
vector elements that have identical sets of labels between the two vectors. each entry in the left-hand-side vector and its [matching element](#vector-matching)
Vector elements that don't find an exact label match on the other side get in the right hand vector. The result is propagated into the result vector and the metric
dropped from the result. The metric name of the result vector is carried over name is dropped. Entries for which no matching entry in the right-hand vector can be
from the left hand side of the expression. found are not part of the result.
## Comparison / filter binary operators ### Comparison/filter binary operators
The following binary comparison/filter operators exist in Prometheus: The following binary comparison/filter operators exist in Prometheus:
* `==` (equal)
* `!=` (not-equal)
* `>` (greater-than) * `>` (greater-than)
* `<` (less-than) * `<` (less-than)
* `>=` (greater-or-equal) * `>=` (greater-or-equal)
...@@ -50,31 +58,102 @@ either `0` (`false`) or `1` (`true`), depending on the comparison result. ...@@ -50,31 +58,102 @@ either `0` (`false`) or `1` (`true`), depending on the comparison result.
value of every data sample in the vector, and vector elements between which the value of every data sample in the vector, and vector elements between which the
comparison result is `false` get dropped from the result vector. comparison result is `false` get dropped from the result vector.
**Between two instant vectors**, these operators behave as a filter: They apply **Between two instant vectors**, these operators behave as a filter, applied to
to vector elements that have identical sets of labels between the two vectors. matching entries. Vector elements for which the expression is not true or
Vector elements for which the expression evaluates to `false` or which don't which do not find a match on the other side of the expression get dropped from the
find an exact label match on the other side of the expression get dropped from result, while the others are propagated into a result vector with their original
the result, while the others get carried over into a result vector with their (left-hand-side) metric names and label values.
original (left-hand-side) metric names and data values.
# Logical/set binary operators ### Logical/set binary operators
These logical/set binary operators are only defined between instant vectors: These logical/set binary operators are only defined between instant vectors:
* `and` (intersection) * `and` (intersection)
* `or` (union) (**not yet implemented!**) * `or` (union)
`vector1 and vector2` results in a vector consisting of the elements of `vector1 and vector2` results in a vector consisting of the elements of
`vector1` for which there are elements in `vector2` with exactly matching `vector1` for which there are elements in `vector2` with exactly matching
labelsets. Other elements are dropped. The metric name and values are carried label sets. Other elements are dropped. The metric name and values are carried
over from the left-hand-side vector. over from the left-hand-side vector.
`vector1 or vector2` results in a vector that contains all original elements `vector1 or vector2` results in a vector that contains all original elements
(labelsets + values) of `vector1` and additionally all elements of `vector2`, (label sets + values) of `vector1` and additionally all elements of `vector2`
which don't have matching labelsets in `vector1`. The metric name is carried which do not have matching label sets in `vector1`.
over from the left-hand-side vector in both cases.
## Vector matching
Operations between vectors attempt to find a matching element in the right-hand-side
vector for each entry in the left-hand side. There are two basic types of
matching behavior:
**One-to-one** finds a unique pair of entries from each side of the operation.
In the default case, that is an operation following the format `vector1 <operator> vector2`.
Two entries match if they have the exact same set of labels and corresponding values.
The `on` keyword allows reducing the set of considered labels to a provided list:
<vector expr> <bin-op> on(<label list>) <vector expr>
Example input:
method:http_errors:rate5m{method="get", code="500"} 24
method:http_errors:rate5m{method="get", code="404"} 30
method:http_errors:rate5m{method="put", code="500"} 3
method:http_errors:rate5m{method="post", code="500"} 6
method:http_errors:rate5m{method="post", code="404"} 21
method:http_requests:rate5m{domain="example.com", method="get"} 600
method:http_requests:rate5m{domain="example.com", method="del"} 34
method:http_requests:rate5m{domain="example.com", method="post"} 120
Example query:
method:http_errors:rate5m{code="500"} / on(method) method:http_requests:rate5m
This returns a result vector containing the fraction of HTTP requests with status code
of 500 for each method, as measured over the last 5 minutes. Without `on(method)` there
would have been no match as the metrics do not share the same set of labels.
The entries with methods `put` and `del` have no match and will not show up in the result:
{method="get"} 0.04 // 24 / 600
{method="post"} 0.1 // 12 / 120
**Many-to-one** and **one-to-many** matchings refer to the case where each vector element on
the "one"-side can match with multiple elements on the "many"-side. This has to
be explicitly requested using the `group_left` or `group_right` modifier, where
left/right determines which vector has the higher cardinality.
<vector expr> <bin-op> on(<label list>) group_left(<label list>) <vector expr>
<vector expr> <bin-op> on(<label list>) group_right(<label list>) <vector expr>
The label list provided with the group modifier contains additional labels to be included in
the result metrics. A label can only appear in one of the lists. Every time
series of the result vector must be uniquely identifiable by the labels from both lists
combined.
_Grouping modifiers can only be used for [comparison/filtering](#comparison-/-filter-binary-operators)
and [arithmetic](#arithmetic-binary-operators) operations as `and` and `or` operations
match with all possible entries in the right vector by default._
Example query:
method:http_errors:rate5m / on(method) group_left(code,domain) method:http_requests:rate5m
In this case the left vector contains more than one entry per `method` label value. Thus,
we indicate this using `group_left` and provide `code` as the label that ensures that
the result vector entries are unique. The `domain` label name simply requests to include the
label into the result elements. The elements from the right side are now matched with multiple
elements with the same `method` label on the left:
{domain="example.com", method="get", code="500"} 0.04 // 24 / 600
{domain="example.com", method="get", code="404"} 0.05 // 30 / 600
{domain="example.com", method="post", code="500"} 0.1 // 12 / 120
{domain="example.com", method="post", code="404"} 0.175 // 21 / 120
_Many-to-one and one-to-many matching are advanced use cases that should be carefully considered.
Often a proper use of `on(<labels>)` provides the desired outcome._
# Aggregation operators ## Aggregation operators
Prometheus supports the following built-in aggregation operators that can be Prometheus supports the following built-in aggregation operators that can be
used to aggregate the elements of a single instant vector, resulting in a new used to aggregate the elements of a single instant vector, resulting in a new
...@@ -105,7 +184,7 @@ number of seen HTTP requests per application and group over all instances via: ...@@ -105,7 +184,7 @@ number of seen HTTP requests per application and group over all instances via:
sum(http_requests_total) by (application, group) sum(http_requests_total) by (application, group)
If we are just interested in the total of HTTP requests we've seen in **all** If we are just interested in the total of HTTP requests we have seen in **all**
applications, we could simply write: applications, we could simply write:
sum(http_requests_total) sum(http_requests_total)
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment