Welcome to the third and final post of our series about metrics. First, we deep-dived into the four types of Prometheus metrics; then, we examined how metrics work in OpenTelemetry; and finally, we are now putting the two together—explaining the differences, similarities, and integration between the metrics in both systems.
We decided to round off the series by comparing the metrics in both tools because we believe this is a choice you’ll need to make sooner rather than later. While Prometheus has been the current standard for monitoring your systems, OpenTelemetry is quickly gaining ground, especially in the cloud-native landscape, which was traditionally Prometheus’ stronghold. OpenTelemetry’s promise of creating a unified standard among traces, logs, and metrics with enough flexibility to model and interact with other approaches is tempting many developers.
From our partners:
After working with both frameworks, our goal with this blog post is to compare the two, show you how to transform one into the other, and share some thoughts on which may be more appropriate, depending on your use case.
Metrics in Prometheus vs. OpenTelemetry: Common Ground
Both systems allow you to collect and transform metrics (although Open Telemetry is much more flexible as a transformation pipeline). However, looking back at the previous articles, we need to remember an important distinction: Prometheus is an Observability tool (including collection, storage, and query) that uses a metric model designed to suit its own needs.
On the other hand, the metrics component of OpenTelemetry translates from many different data models into one single framework (providing collection with no storage or query). These core differences reflect the systems’ complexity—Prometheus is a straightforward model which is often exposed in text, while OpenTelemetry is a more intricate series of three models which uses a binary format for transmission.
Both systems also allow code instrumentation via an SDK, but OpenTelemetry also focuses on supporting automatic instrumentation, which does not add any code to applications (where possible).
So what’s the overlap between the metrics you can create? Essentially, OpenTelemetry allows the representation of all Prometheus metric types (counters, gauges, summaries, and histograms). Still, Prometheus can’t represent some configurations of OpenTelemetry metrics, including delta representations and exponential histograms (although these will be added to Prometheus soon), as well as integer values.
In other words, Prometheus metrics are a strict subset of OpenTelemetry metrics.
Main Differences
While there are some differences in how the internal models work (read on for more information), the practical differences between the two from a developer’s point of view are more to do with the ecosystem.
Prometheus provides metric collection, storage, and query. It generally gathers metrics via a simple scraping system that pulls data from hosts. The Prometheus database stores that data, which you can then query with the Prometheus query language, PromQL. The Prometheus database can handle a lot of data, but it’s not officially meant to be a long-term storage solution, so data is often sent to another storage solution—like Promscale— after some time but still read back via PromQL.
OpenTelemetry has a much smaller scope. It collects metrics (as well as traces and logs) using a consolidated API via push or pull, potentially transforms them, and sends them onward to other systems for storage or query. By only focusing on the parts of Observability which applications interact with, OpenTelemetry is decoupling the creation of signals from the operational concerns of storing them and querying them. Ironically, this means OpenTelemetry metrics often end up back in Prometheus or a Prometheus-compatible system.
When we are looking at actual metric types, there are several differences:
- OpenTelemetry can represent metrics as deltas rather than as cumulative, storing the difference between each data point rather than the cumulative sum. Prometheus does not allow this by design (although you can calculate the values at query time). This isn’t the default in OpenTelemetry and would mainly be used for metrics that would only ever be expressed as rates.
- OpenTelemetry also allows metric values to be integers rather than floating-point numbers, which Prometheus can not express.
- OpenTelemetry can attach some extra metadata to histograms, allowing you to track the maximum and minimum values.
- Finally, OpenTelemetry has an exponential histogram aggregation type (which uses a formula and a scale to calculate bucket sizings). Prometheus can not represent this today, but does have a fully compatible metric type in the works!
Choosing Between the Two
If you don’t already have an investment in one of the two technologies, the choice between Prometheus and OpenTelemetry might boil down to four questions:
- Are you planning on capturing traces, logs, and metrics? If so, OpenTelemetry will allow you to use the same libraries to instrument across all three signal types, which is a significant benefit. You can even send all three signals to the same backend and use a single language to query across them (for example, Promscale and SQL).
- Do you value stability and battle-tested systems? If so, Prometheus might be the correct answer for a few more years as OpenTelemetry gets production exposure.
- Would you like to use a multi-step routing and transformation pipeline? If so, perhaps OpenTelemetry might be worth a look.
- Do you want to be able to stay as flexible as possible? Then OpenTelemetry is for you, as it doesn’t implement any storage or query, giving you maximum flexibility.
Most organizations will likely mix both standards: Prometheus for infrastructure monitoring, making use of the much more mature ecosystem of integration to extract metrics from hundreds of components, and OpenTelemetry for services that have been developed. Many engineers will probably use Prometheus as a backend to store both Prometheus and OpenTelemetry metrics, and will need to ensure the OpenTelemetry metrics they produce are compatible with Prometheus.
In practice, this means opting for the cumulative aggregation temporality and only using OpenTelemetry metric types supported by Prometheus (leaving aside OpenTelemetry exponential histograms until Prometheus adds support for these).
Converting Between Prometheus and OpenTelemetry
If you want to mix and match the standards, then the good news is that OpenTelemetry provides the OpenTelemetry Collector, which can help with moving in both directions (even casting between types if needed in some cases).
The OpenTelemetry Collector is pluggable, allowing both receivers and exporter components to be enabled using a config file at runtime. We will be using the contrib
package that includes many receivers and exporters. You can download the appropriate binary from the GitHub Release pages. Alternatively, if you’re running the collector in production, you can also compile a version containing just the components you need using the OpenTelemetry Collector Builder.
For the examples in the following sections, we are running the collector with the following config
file which is saved as config.yaml
:
<span class="token key atrule">receivers</span><span class="token punctuation">:</span>
<span class="token key atrule">prometheus</span><span class="token punctuation">:</span>
<span class="token key atrule">config</span><span class="token punctuation">:</span>
<span class="token key atrule">scrape_configs</span><span class="token punctuation">:</span>
<span class="token punctuation">-</span> <span class="token key atrule">job_name</span><span class="token punctuation">:</span> demo
<span class="token key atrule">scrape_interval</span><span class="token punctuation">:</span> 15s
<span class="token key atrule">static_configs</span><span class="token punctuation">:</span>
<span class="token punctuation">-</span> <span class="token key atrule">targets</span><span class="token punctuation">:</span> <span class="token punctuation">[</span><span class="token string">'localhost:9090'</span><span class="token punctuation">]</span>
<span class="token key atrule">exporters</span><span class="token punctuation">:</span>
<span class="token key atrule">prometheus</span><span class="token punctuation">:</span>
<span class="token key atrule">endpoint</span><span class="token punctuation">:</span> <span class="token string">"0.0.0.0:1234"</span>
<span class="token key atrule">logging</span><span class="token punctuation">:</span>
<span class="token key atrule">loglevel</span><span class="token punctuation">:</span> debug
<span class="token key atrule">service</span><span class="token punctuation">:</span>
<span class="token key atrule">telemetry</span><span class="token punctuation">:</span>
<span class="token key atrule">logs</span><span class="token punctuation">:</span>
<span class="token key atrule">level</span><span class="token punctuation">:</span> debug
<span class="token key atrule">pipelines</span><span class="token punctuation">:</span>
<span class="token key atrule">metrics</span><span class="token punctuation">:</span>
<span class="token key atrule">receivers</span><span class="token punctuation">:</span> <span class="token punctuation">[</span>prometheus<span class="token punctuation">]</span>
<span class="token key atrule">processors</span><span class="token punctuation">:</span> <span class="token punctuation">[</span><span class="token punctuation">]</span>
<span class="token key atrule">exporters</span><span class="token punctuation">:</span> <span class="token punctuation">[</span>logging<span class="token punctuation">,</span>prometheus<span class="token punctuation">]</span>
Then we start the collector with:
otelcol –config config.yaml
The collector will use the Prometheus receiver to try to scrape a Prometheus service at http://localhost:9100. If you need something to test with, you could start a local node_exporter that uses this port. As data is scraped from this service, you will see it show up as log output from the collector, and it will also be available from the Prometheus exporter endpoint, which the collector will run on http://localhost:1234.
The Prometheus Remote Write Exporter is also an option, but it’s more limited in scope at this stage, only being able to handle cumulative counters and gauges.
If we look back to our previous post on Prometheus metrics, we covered the four main metric types: counters, gauges, histograms, and summaries. In the Prometheus context, a counter is monotonic (continuously increasing), whereas a gauge is not (it can go up and down).
If you’re really on point, you’ll also remember that the difference between a histogram and a summary is that a summary reports quantiles and doesn’t require as much client computation. A histogram, on the other hand, is more flexible, providing the raw bucket widths and counts.
OpenTelemetry, in contrast, has five metric types: sums, gauges, summaries, histograms, and exponential histograms. If you’ve been following along with the how metrics work in OpenTelemetry post, you will have a question at this stage—are these different types from what we have previously seen? The answer to this question lies in the three OpenTelemetry models.
If you are a developer creating OpenTelemetry metrics, you deal with the Event model, which is then translated (for transmission) to the OpenTelemetry Protocol (OTLP) Stream Model by the OpenTelemetry SDK. The types we are referencing here are part of this model, which the Prometheus receiver translates directly into.
Luckily, these new metric types are self-explanatory and map directly onto the Prometheus metric types (summary is implemented only for Prometheus compatibility, and you won’t see it used elsewhere). The exception is the exponential histogram, which can’t be converted to Prometheus today (but will be able to be converted in the future). The diagram below describes how the mappings work in each direction.
OpenTelemetry promises lossless conversions to and from Prometheus metrics, giving users the ability to convert as they need without worrying about data loss.
Converting From Prometheus to OpenTelemetry
Let’s explore how the Prometheus to OpenTelemetry conversions work by looking at examples of Prometheus scrapes and OpenTelemetry metrics. While OpenTelemetry doesn’t have a text representation like Prometheus, we can use the Logging Exporter to emit a text dump of the metrics captured.
OpenTelemetry will extract some information from the scrape itself and store this, producing the following output, which defines Resource labels that will be attached to all metrics.
<span class="token key atrule">Resource SchemaURL</span><span class="token punctuation">:</span>
<span class="token key atrule">Resource labels</span><span class="token punctuation">:</span>
<span class="token key atrule">-> service.name</span><span class="token punctuation">:</span> STRING(node<span class="token punctuation">-</span>exporter)
<span class="token key atrule">-> service.instance.id</span><span class="token punctuation">:</span> STRING(127.0.0.1<span class="token punctuation">:</span>9100)
<span class="token key atrule">-> net.host.port</span><span class="token punctuation">:</span> STRING(9100)
<span class="token key atrule">-> http.scheme</span><span class="token punctuation">:</span> STRING(http)
Counters
If we take the following Prometheus scrape data and point the collector at it:
<span class="token comment"># HELP http_requests_total Total HTTP requests served</span>
<span class="token comment"># TYPE http_requests_total counter</span>
http_requests_total<span class="token punctuation">{</span>method="post"<span class="token punctuation">,</span>code="200"<span class="token punctuation">}</span> 1028
http_requests_total<span class="token punctuation">{</span>method="post"<span class="token punctuation">,</span>code="400"<span class="token punctuation">}</span> 5
Then the collector would output the following metric from the Logging Exporter.
Metric <span class="token comment">#0</span>
<span class="token key atrule">Descriptor</span><span class="token punctuation">:</span>
<span class="token key atrule">-> Name</span><span class="token punctuation">:</span> http_requests_total
<span class="token key atrule">-> Description</span><span class="token punctuation">:</span> Total HTTP requests served
<span class="token key atrule">-> Unit</span><span class="token punctuation">:</span>
<span class="token key atrule">-> DataType</span><span class="token punctuation">:</span> Sum
<span class="token key atrule">-> IsMonotonic</span><span class="token punctuation">:</span> <span class="token boolean important">true</span>
<span class="token key atrule">-> AggregationTemporality</span><span class="token punctuation">:</span> AGGREGATION_TEMPORALITY_CUMULATIVE
NumberDataPoints <span class="token comment">#0</span>
<span class="token key atrule">Data point attributes</span><span class="token punctuation">:</span>
<span class="token key atrule">-> code</span><span class="token punctuation">:</span> STRING(200)
<span class="token key atrule">-> method</span><span class="token punctuation">:</span> STRING(post)
<span class="token key atrule">StartTimestamp</span><span class="token punctuation">:</span> 2022<span class="token punctuation">-</span>06<span class="token punctuation">-</span>16 11<span class="token punctuation">:</span>49<span class="token punctuation">:</span>22.117 +0000 UTC
<span class="token key atrule">Timestamp</span><span class="token punctuation">:</span> 2022<span class="token punctuation">-</span>06<span class="token punctuation">-</span>16 11<span class="token punctuation">:</span>49<span class="token punctuation">:</span>22.117 +0000 UTC
<span class="token key atrule">Value</span><span class="token punctuation">:</span> <span class="token number">1028.000000</span>
NumberDataPoints <span class="token comment">#1</span>
<span class="token key atrule">Data point attributes</span><span class="token punctuation">:</span>
<span class="token key atrule">-> code</span><span class="token punctuation">:</span> STRING(400)
<span class="token key atrule">-> method</span><span class="token punctuation">:</span> STRING(post)
<span class="token key atrule">StartTimestamp</span><span class="token punctuation">:</span> 2022<span class="token punctuation">-</span>06<span class="token punctuation">-</span>16 11<span class="token punctuation">:</span>49<span class="token punctuation">:</span>22.117 +0000 UTC
<span class="token key atrule">Timestamp</span><span class="token punctuation">:</span> 2022<span class="token punctuation">-</span>06<span class="token punctuation">-</span>16 11<span class="token punctuation">:</span>49<span class="token punctuation">:</span>22.117 +0000 UTC
<span class="token key atrule">Value</span><span class="token punctuation">:</span> <span class="token number">5.000000</span>
We can see that the metric type ( DataType
) is Sum
and the AggregationTemporality
is Cumulative
(the only aggregation that Prometheus supports). There are two timestamps per data point to track counter resets: Timestamp
is the time of the recording, and StartTimestamp
is either the time the first sample was received or the time of the last counter reset. There is no Unit
specified. This is because Prometheus specifies units by including them as part of the textual metric name, which can’t be accurately decoded by the OpenTelemetry Collector. Interestingly, using the compatible OpenMetrics format to add a unit does not work either.
Gauges
If we take a Prometheus gauge and scrape it:
<span class="token comment"># HELP node_filesystem_avail_bytes Available bytes in filesystems</span>
<span class="token comment"># TYPE node_filesystem_avail_bytes gauge</span>
node_filesystem_avail_bytes<span class="token punctuation">{</span>method="/data"<span class="token punctuation">,</span>fstype="ext4"<span class="token punctuation">}</span> 250294
We would see the following output from the collector.
Metric <span class="token comment">#0</span>
<span class="token key atrule">Descriptor</span><span class="token punctuation">:</span>
<span class="token key atrule">-> Name</span><span class="token punctuation">:</span> node_filesystem_avail_bytes
<span class="token key atrule">-> Description</span><span class="token punctuation">:</span> Available bytes in filesystems
<span class="token key atrule">-> Unit</span><span class="token punctuation">:</span>
<span class="token key atrule">-> DataType</span><span class="token punctuation">:</span> Gauge
NumberDataPoints <span class="token comment">#0</span>
<span class="token key atrule">Data point attributes</span><span class="token punctuation">:</span>
<span class="token key atrule">-> fstype</span><span class="token punctuation">:</span> STRING(ext4)
<span class="token key atrule">-> method</span><span class="token punctuation">:</span> STRING(/data)
<span class="token key atrule">StartTimestamp</span><span class="token punctuation">:</span> 1970<span class="token punctuation">-</span>01<span class="token punctuation">-</span>01 00<span class="token punctuation">:</span>00<span class="token punctuation">:</span>00 +0000 UTC
<span class="token key atrule">Timestamp</span><span class="token punctuation">:</span> 2022<span class="token punctuation">-</span>06<span class="token punctuation">-</span>23 07<span class="token punctuation">:</span>42<span class="token punctuation">:</span>07.117 +0000 UTC
<span class="token key atrule">Value</span><span class="token punctuation">:</span> <span class="token number">250294.000000</span>
Here, the DataType
is set to Gauge
. Gauge
is the default metric type OpenTelemetry will convert into, so the lack of a # TYPE line in the Prometheus scrape data will result in a gauge. Prometheus doesn’t actually use the type information itself (it doesn’t differentiate between counters and gauges internally), so some exporters will forgo the two comment lines to make the scrape more efficient. This would result in all OpenTelemetry metrics being gauges.
Histograms
A Prometheus histogram which was scraped as:
<span class="token comment"># HELP http_request_duration_seconds Histogram of latencies for HTTP requests.</span>
<span class="token comment"># TYPE http_request_duration_seconds histogram</span>
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>le="0.1"<span class="token punctuation">}</span> 25547
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>le="0.2"<span class="token punctuation">}</span> 26688
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>le="0.4"<span class="token punctuation">}</span> 27760
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>le="1"<span class="token punctuation">}</span> 28641
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>le="3"<span class="token punctuation">}</span> 28782
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>le="8"<span class="token punctuation">}</span> 28844
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>le="20"<span class="token punctuation">}</span> 28855
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>le="60"<span class="token punctuation">}</span> 28860
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>le="120"<span class="token punctuation">}</span> 28860
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>le="+Inf"<span class="token punctuation">}</span> 28860
http_request_duration_seconds_sum<span class="token punctuation">{</span>handler="/"<span class="token punctuation">}</span> 1863.80491025699
http_request_duration_seconds_count<span class="token punctuation">{</span>handler="/"<span class="token punctuation">}</span> 28860
Would present in OpenTelemetry through the Logging Exporter as:
Metric <span class="token comment">#0</span>
<span class="token key atrule">Descriptor</span><span class="token punctuation">:</span>
<span class="token key atrule">-> Name</span><span class="token punctuation">:</span> prometheus_http_request_duration_seconds
<span class="token key atrule">-> Description</span><span class="token punctuation">:</span> Histogram of latencies for HTTP requests.
<span class="token key atrule">-> Unit</span><span class="token punctuation">:</span>
<span class="token key atrule">-> DataType</span><span class="token punctuation">:</span> Histogram
<span class="token key atrule">-> AggregationTemporality</span><span class="token punctuation">:</span> AGGREGATION_TEMPORALITY_CUMULATIVE
HistogramDataPoints <span class="token comment">#0</span>
<span class="token key atrule">Data point attributes</span><span class="token punctuation">:</span>
<span class="token key atrule">-> handler</span><span class="token punctuation">:</span> STRING(/)
<span class="token key atrule">StartTimestamp</span><span class="token punctuation">:</span> 2022<span class="token punctuation">-</span>06<span class="token punctuation">-</span>23 07<span class="token punctuation">:</span>54<span class="token punctuation">:</span>07.117 +0000 UTC
<span class="token key atrule">Timestamp</span><span class="token punctuation">:</span> 2022<span class="token punctuation">-</span>06<span class="token punctuation">-</span>23 07<span class="token punctuation">:</span>54<span class="token punctuation">:</span>07.117 +0000 UTC
<span class="token key atrule">Count</span><span class="token punctuation">:</span> <span class="token number">28860</span>
<span class="token key atrule">Sum</span><span class="token punctuation">:</span> <span class="token number">1863.804910</span>
ExplicitBounds <span class="token comment">#0: 0.100000</span>
ExplicitBounds <span class="token comment">#1: 0.200000</span>
ExplicitBounds <span class="token comment">#2: 0.400000</span>
ExplicitBounds <span class="token comment">#3: 1.000000</span>
ExplicitBounds <span class="token comment">#4: 3.000000</span>
ExplicitBounds <span class="token comment">#5: 8.000000</span>
ExplicitBounds <span class="token comment">#6: 20.000000</span>
ExplicitBounds <span class="token comment">#7: 60.000000</span>
ExplicitBounds <span class="token comment">#8: 120.000000</span>
Buckets <span class="token comment">#0, Count: 25547</span>
Buckets <span class="token comment">#1, Count: 1141</span>
Buckets <span class="token comment">#2, Count: 1072</span>
Buckets <span class="token comment">#3, Count: 881</span>
Buckets <span class="token comment">#4, Count: 141</span>
Buckets <span class="token comment">#5, Count: 62</span>
Buckets <span class="token comment">#6, Count: 11</span>
Buckets <span class="token comment">#7, Count: 5</span>
Buckets <span class="token comment">#8, Count: 0</span>
Buckets <span class="token comment">#9, Count: 0</span>
We can see a histogram is created in OpenTelemetry, but one thing we can’t do is include minimum and maximum values (which OpenTelemetry supports, but Prometheus doesn’t).
Summaries
A Prometheus summary which is scraped as the following:
<span class="token comment"># HELP prometheus_rule_evaluation_duration_seconds The duration for a rule to execute.</span>
<span class="token comment"># TYPE prometheus_rule_evaluation_duration_seconds summary</span>
prometheus_rule_evaluation_duration_seconds<span class="token punctuation">{</span>quantile="0.5"<span class="token punctuation">}</span> 6.4853e<span class="token punctuation">-</span><span class="token number">05</span>
prometheus_rule_evaluation_duration_seconds<span class="token punctuation">{</span>quantile="0.9"<span class="token punctuation">}</span> 0.00010102
prometheus_rule_evaluation_duration_seconds<span class="token punctuation">{</span>quantile="0.99"<span class="token punctuation">}</span> 0.000177367
prometheus_rule_evaluation_duration_seconds_sum 1.623860968846092e+06
prometheus_rule_evaluation_duration_seconds_count 1.112293682e+09
Would result in an OpenTelemetry metric which outputs via the Logging Exporter as:
Metric <span class="token comment">#3</span>
<span class="token key atrule">Descriptor</span><span class="token punctuation">:</span>
<span class="token key atrule">-> Name</span><span class="token punctuation">:</span> prometheus_rule_evaluation_duration_seconds
<span class="token key atrule">-> Description</span><span class="token punctuation">:</span> The duration for a rule to execute.
<span class="token key atrule">-> Unit</span><span class="token punctuation">:</span>
<span class="token key atrule">-> DataType</span><span class="token punctuation">:</span> Summary
SummaryDataPoints <span class="token comment">#0</span>
<span class="token key atrule">StartTimestamp</span><span class="token punctuation">:</span> 2022<span class="token punctuation">-</span>06<span class="token punctuation">-</span>23 07<span class="token punctuation">:</span>50<span class="token punctuation">:</span>22.117 +0000 UTC
<span class="token key atrule">Timestamp</span><span class="token punctuation">:</span> 2022<span class="token punctuation">-</span>06<span class="token punctuation">-</span>23 07<span class="token punctuation">:</span>50<span class="token punctuation">:</span>22.117 +0000 UTC
<span class="token key atrule">Count</span><span class="token punctuation">:</span> <span class="token number">1112293682</span>
<span class="token key atrule">Sum</span><span class="token punctuation">:</span> <span class="token number">1623860.968846</span>
QuantileValue <span class="token comment">#0: Quantile 0.500000, Value 0.000065</span>
QuantileValue <span class="token comment">#1: Quantile 0.900000, Value 0.000101</span>
QuantileValue <span class="token comment">#2: Quantile 0.990000, Value 0.000177</span>
We can see that the OpenTelemetry Summary
metric type has been selected here—remember that this was explicitly created for Prometheus integration and should not be used anywhere else. It’s similar to the histogram output but lists quantiles rather than explicit buckets and bucket counts. In this case, it looks like we are losing some precision, but fear not. This is just the Logging Exporter pretty-printing, as we will see in the next section.
Converting From OpenTelemetry to Prometheus
Using either the Prometheus Exporter to allow scraping or the Prometheus Remote Write Exporter to push directly to another Prometheus instance, we can transmit metrics from OpenTelemetry to Prometheus. There aren’t any surprises on this side: if the conversion is supported, it happens without any loss of precision.
One thing to remember is that there are some configurations of OpenTelemetry metrics that we can’t translate directly into Prometheus metrics because Prometheus has a much more constrained model.
Any metrics with an AggregationTemporality
of DELTA
will be converted back into CUMULATIVE
by the Prometheus Exporter (and will be rejected by the Prometheus Remote Write Exporter). The Prometheus Remote Write Exporter will also reject summary and histogram metrics, but these are managed perfectly by the Prometheus Exporter. OpenTelemetry metrics with an integer value will be converted into float values.
For instance, a scrape from the Prometheus Exporter (when all the examples from the above sections have been ingested) would produce the following results. You will see that each value is exactly the same as the input Prometheus value from the previous sections, with no loss of fidelity.
<span class="token comment"># HELP http_request_duration_seconds Histogram of latencies for HTTP requests.</span>
<span class="token comment"># TYPE http_request_duration_seconds histogram</span>
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">,</span>le="0.1"<span class="token punctuation">}</span> 25547
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">,</span>le="0.2"<span class="token punctuation">}</span> 26688
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">,</span>le="0.4"<span class="token punctuation">}</span> 27760
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">,</span>le="1"<span class="token punctuation">}</span> 28641
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">,</span>le="3"<span class="token punctuation">}</span> 28782
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">,</span>le="8"<span class="token punctuation">}</span> 28844
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">,</span>le="20"<span class="token punctuation">}</span> 28855
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">,</span>le="60"<span class="token punctuation">}</span> 28860
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">,</span>le="120"<span class="token punctuation">}</span> 28860
http_request_duration_seconds_bucket<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">,</span>le="+Inf"<span class="token punctuation">}</span> 28860
http_request_duration_seconds_sum<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">}</span> 1863.80491025699
http_request_duration_seconds_count<span class="token punctuation">{</span>handler="/"<span class="token punctuation">,</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">}</span> 28860
<span class="token comment"># HELP http_requests_total Total HTTP requests served</span>
<span class="token comment"># TYPE http_requests_total counter</span>
http_requests_total<span class="token punctuation">{</span>code="200"<span class="token punctuation">,</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">,</span>method="post"<span class="token punctuation">}</span> 907
http_requests_total<span class="token punctuation">{</span>code="400"<span class="token punctuation">,</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">,</span>method="post"<span class="token punctuation">}</span> 5
<span class="token comment"># HELP node_filesystem_avail_bytes Available bytes in filesystems</span>
<span class="token comment"># TYPE node_filesystem_avail_bytes gauge</span>
node_filesystem_avail_bytes<span class="token punctuation">{</span>fstype="ext4"<span class="token punctuation">,</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">,</span>method="/data"<span class="token punctuation">}</span> 250294
<span class="token comment"># HELP prometheus_rule_evaluation_duration_seconds The duration for a rule to execute.</span>
<span class="token comment"># TYPE prometheus_rule_evaluation_duration_seconds summary</span>
prometheus_rule_evaluation_duration_seconds<span class="token punctuation">{</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">,</span>quantile="0.5"<span class="token punctuation">}</span> 6.4853e<span class="token punctuation">-</span><span class="token number">05</span>
prometheus_rule_evaluation_duration_seconds<span class="token punctuation">{</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">,</span>quantile="0.9"<span class="token punctuation">}</span> 0.00010102
prometheus_rule_evaluation_duration_seconds<span class="token punctuation">{</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">,</span>quantile="0.99"<span class="token punctuation">}</span> 0.000177367
prometheus_rule_evaluation_duration_seconds_sum<span class="token punctuation">{</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">}</span> 1.623860968846092e+06
prometheus_rule_evaluation_duration_seconds_count<span class="token punctuation">{</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">}</span> 1.112293682e+09
<span class="token comment"># HELP scrape_duration_seconds Duration of the scrape</span>
<span class="token comment"># TYPE scrape_duration_seconds gauge</span>
scrape_duration_seconds<span class="token punctuation">{</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">}</span> 0.003231334
<span class="token comment"># HELP scrape_samples_post_metric_relabeling The number of samples remaining after metric relabeling was applied</span>
<span class="token comment"># TYPE scrape_samples_post_metric_relabeling gauge</span>
scrape_samples_post_metric_relabeling<span class="token punctuation">{</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">}</span> 20
<span class="token comment"># HELP scrape_samples_scraped The number of samples the target exposed</span>
<span class="token comment"># TYPE scrape_samples_scraped gauge</span>
scrape_samples_scraped<span class="token punctuation">{</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">}</span> 20
<span class="token comment"># HELP scrape_series_added The approximate number of new series in this scrape</span>
<span class="token comment"># TYPE scrape_series_added gauge</span>
scrape_series_added<span class="token punctuation">{</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">}</span> 20
<span class="token comment"># HELP up The scraping was successful</span>
<span class="token comment"># TYPE up gauge</span>
up<span class="token punctuation">{</span>instance="127.0.0.1<span class="token punctuation">:</span>9100"<span class="token punctuation">,</span>job="node<span class="token punctuation">-</span>exporter"<span class="token punctuation">}</span> 1
When converting to Prometheus, keep in mind that we only have two options currently: one is using the Prometheus Exporter, which will mean all our data is exposed as a single scrape that won’t scale well if you have a large volume of series. The second option is using the Prometheus Remote Write Exporter, which we expect to scale better but is limited to counters and gauges and won’t perform DELTA
to CUMULATIVE
conversions (it will drop these metrics).
Decision Time
Summing up, Prometheus and OpenTelemetry provide metrics implementations with slightly different angles. While Prometheus is the de facto standard, covering metrics creation, storage, and query, OpenTelemetry is newer, covering only the generation of metrics. Still, it also supports traces and logs with the same SDK.
You can mainly convert between the two without any loss of precision—but it pays to know that some metric types will change slightly. For example, all OpenTelemetry DELTA
metrics will be converted to CUMULATIVE
before export as Prometheus metrics, and Prometheus cannot represent OpenTelemetry exponential histograms until they add support (which will hopefully be soon).
Most organizations will likely mix standards, but if you’re wondering which one to adopt, we recommend you weigh your priorities: do you value stability or flexibility? Are you planning to capture traces, logs, and metrics, or are you okay with metrics only?
Prometheus will give you a battle-tested system. OpenTelemetry, a more expansive and flexible standard. The final decision depends only on you, but we hope this blog post has given you some helpful clues.
And if you are looking for a long-term store for your Prometheus metrics, check out Promscale, the observability backend built on PostgreSQL and TimescaleDB. It seamlessly integrates with Prometheus, with 100% PromQL compliance, multitenancy, and OpenMetrics exemplars support.
- Promscale is an open-source project, and you can use it for free. Check out our docs for install instructions in Kubernetes, Docker, or your virtual machine.
Guest post originally published on Timescale’s blog by James Blackwood-Sewell
Source CNCF
For enquiries, product placements, sponsorships, and collaborations, connect with us at [email protected]. We'd love to hear from you!
Our humans need coffee too! Your support is highly appreciated, thank you!