One of the most common questions we get at Honeycomb is how to control costs while still achieving the level of observability needed to debug, troubleshoot, and understand what is happening in production. Historically, the answer from most vendors has been to aggregate your data: to offer you calculated medians, means, and other aggregates rather than the deep context you get from access to the actual events coming out of your production environment.
This is exactly what it sounds like: a poor trade, made in the name of performance. With classic metrics and APM tools, you can never get back to the raw events that are your source of truth, which means you’ll regret that choice when debugging a complex, distributed system. When you’re working with metrics, the data must be numeric, and anything else must be stored as metadata, either attached to the datapoints themselves or out of band (“tags”, “dimensions”, etc.). In other words: more limits on what you can store and retrieve.
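To make that difference concrete, here is a minimal sketch of the two data shapes. It is not any vendor’s actual data model; the type names and fields are illustrative only.

```go
package main

import "fmt"

// MetricPoint is a classic metric datapoint: the value must be numeric,
// and everything else is squeezed into a small set of string tags.
type MetricPoint struct {
	Name  string
	Value float64
	Tags  map[string]string
}

// Event is a wide structured event: any field, any type, per request.
type Event map[string]interface{}

func main() {
	m := MetricPoint{
		Name:  "http.request.duration_ms",
		Value: 142.0,
		Tags:  map[string]string{"service": "checkout", "status": "500"},
	}

	e := Event{
		"duration_ms": 142.0,
		"service":     "checkout",
		"status_code": 500,
		"user_id":     "u-48291", // high-cardinality fields fit fine
		"trace_id":    "abc123",
		"error":       "upstream timeout",
	}

	fmt.Println(m)
	fmt.Println(e)
}
```

The metric keeps one number per datapoint and pushes everything else into tags; the event keeps the whole request’s context together, so you can slice by any field later.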
Honeycomb’s answer is: Sample your data.
But, you say, sampling means I’m throwing away some (or a lot) of my data. How is that OK? I won’t know what I am not seeing, right?
What if you had more flexibility? What if sampling offered a greater breadth of options than just “send a percentage of my data”?
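Here is a rough sketch of what that extra flexibility can look like: sampling decisions based on the content of each event rather than a flat percentage. This is not Honeycomb’s SDK or a recommended configuration; the struct, function, and rates are illustrative assumptions.

```go
package main

import (
	"fmt"
	"math/rand"
)

// Event is a simplified stand-in for one structured event.
type Event struct {
	StatusCode int
	DurationMs float64
	SampleRate int // how many original events this kept event represents
}

// shouldKeep makes a content-aware sampling decision instead of applying a
// flat percentage: every error is kept, successes are kept at 1 in 20.
func shouldKeep(e *Event) bool {
	if e.StatusCode >= 500 {
		e.SampleRate = 1 // errors are always kept
		return true
	}
	e.SampleRate = 20
	return rand.Intn(e.SampleRate) == 0
}

func main() {
	events := []Event{
		{StatusCode: 200, DurationMs: 12},
		{StatusCode: 500, DurationMs: 250},
		{StatusCode: 200, DurationMs: 9},
	}
	for i := range events {
		if shouldKeep(&events[i]) {
			// Because the sample rate travels with the event, the backend can
			// weight it back up, so counts and distributions stay accurate.
			fmt.Printf("send event: status=%d sample_rate=%d\n",
				events[i].StatusCode, events[i].SampleRate)
		}
	}
}
```

The key idea is that each kept event carries its sample rate, so the interesting traffic (errors, slow requests) stays at full fidelity while the repetitive bulk is thinned out.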
Find out what’s possible in The New Rules of Sampling.