mpm + observability   17

Co-evolving Tracing and Fault Injection with Box of Pain
Distributed systems are hard to reason about largely because of uncertainty about what may go wrong in a particular execution, and about whether the system will mitigate those faults. Tools that perturb executions can help test whether a system is robust to faults, while tools that observe executions can help better understand their system-wide effects. We present Box of Pain, a tracer and fault injector for unmodified distributed systems that addresses both concerns by interposing at the system call level and dynamically reconstructing the partial order of communication events based on causal relationships. Box of Pain’s lightweight approach to tracing and focus on simulating the effects of partial failures on communication rather than the failures themselves sets it apart from other tracing and fault injection systems. We present evidence of the promise of Box of Pain and its approach to lightweight observation and perturbation of distributed systems.
testing  observability 
4 weeks ago by mpm
Ten Platform Commandments
Your observability must center your users’ perspective, not your own. The health of the system doesn’t matter. The health of every request, and every high-cardinality grouping of requests — those are what matter
november 2018 by mpm
BPFtrace is a high-level tracing language for Linux enhanced Berkeley Packet Filter (eBPF) available in recent Linux kernels
ebpf  linux  observability 
october 2018 by mpm
Introducing ebpf_exporter
In this blog post we'll talk about how we measure low level metrics and share a tool that can help you to get similar understanding of your systems
linux  ebpf  observability 
october 2018 by mpm
Tracing a packet journey using Linux tracepoints, perf and eBPF
At the end of this post, we’ll have a simple and easy to use low level packet tracer
ebpf  observability  networking 
may 2018 by mpm
DTrace-style dynamic tracing tool for Linux
ebpf  linux  observability 
march 2018 by mpm
A single distribution of libraries that automatically collects traces and metrics from your app, displays them locally, and sends them to any analysis tool.
monitoring  observability 
january 2018 by mpm
libstapsdt is a library which allows creating and firing Systemtap's USDT probes at runtime. It's inspired on chrisa/libusdt. The goal of this library is to add USDT probes functionality to dynamic languages.
monitoring  observability 
november 2017 by mpm
Jaeger, inspired by Dapper and OpenZipkin, is a distributed tracing system released as open source by Uber Technologies. It can be used for monitoring microservice-based architectures
september 2017 by mpm
Linux Load Averages: Solving the Mystery
When Linux load averages increase, you know you have higher demand for resources (CPUs, disks, and some locks), but you aren't sure which. You can use other metrics for clarification
linux  metrics  observability 
september 2017 by mpm
Dynamic tracing in Linux user and kernel space
Did you forget to insert probe points in your code? No problem. Learn how to insert them dynamically with uprobe and kprobe
observability  linux  debugging  performance 
july 2017 by mpm
Wingtips is a distributed tracing solution for Java based on the Google Dapper paper.
java  managability  protocol  observability 
december 2016 by mpm
WireShark-Specific Dissector Language
protocol  observability  lua 
october 2016 by mpm
A vendor-neutral open standard for distributed tracing.
monitoring  observability 
october 2016 by mpm
intelsdi-x/snap: The open telemetry framework
Snap is an open telemetry framework designed to simplify the collection, processing and publishing of system data through a single API
monitoring  managability  observability 
may 2016 by mpm
ktap is a new script-based dynamic tracing tool for Linux, it uses a scripting language and lets users trace the Linux kernel dynamically. ktap is designed to give operational insights with interoperability that allows users to tune, troubleshoot and extend kernel and application. It's similar with Linux Systemtap and Solaris Dtrace
linux  performance  observability  lua 
november 2015 by mpm
LTTng Project
The LTTng project aims at providing highly efficient tracing tools for Linux. Its tracers help tracking down performance issues and debugging problems involving multiple concurrent processes and threads. Tracing across multiple systems is also possible.

Apart from LTTng's kernel tracer and userspace tracer, viewing and analysis tools are part of the project. The LTTV viewer permits to analyze and show traces, both in text format and graphically.
linux  managability  maintainability  observability  performance 
may 2013 by mpm

Copy this bookmark: