Dev Infrastructure | Internal Alpha | Updated January 20, 2026

AgentOps

The 'Black Box' recorder for autonomous coding agents.

Core Stack
Rust, ClickHouse, React Flow, Claude 3.5 Sonnet, eBPF
Project Links

Access is available on request for partners.

// Foundation

The Thesis

Market Context

With the rise of agentic engineering (Devin, Cursor), there is a gap in 'Agent APM': application performance monitoring for autonomous agents. Traditional APM tools such as Datadog are not designed to trace non-deterministic reasoning chains. AgentOps fills this gap.

Hypothesis

As we deploy autonomous engineers, debugging their non-deterministic behavior becomes impractical without a replay engine. We needed a way to 'rewind' an agent's thought process step by step.
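
The 'rewind' idea can be illustrated with a minimal sketch. It assumes each recorded step carries the state changes it caused, so the state at any point can be rebuilt by re-applying steps in order. The `Step` schema, the `Replay` class, and the sample trace are hypothetical names chosen for illustration, not the AgentOps data model.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class Step:
    """One recorded agent step (hypothetical schema)."""
    index: int
    kind: str       # e.g. "thought", "tool_call", "observation"
    payload: str
    state_delta: dict[str, Any] = field(default_factory=dict)


class Replay:
    """Rebuilds agent state at any step by re-applying recorded deltas."""

    def __init__(self, steps: list[Step]) -> None:
        self.steps = sorted(steps, key=lambda s: s.index)

    def state_at(self, index: int) -> dict[str, Any]:
        """State after applying every step up to and including `index`."""
        state: dict[str, Any] = {}
        for step in self.steps:
            if step.index > index:
                break
            state.update(step.state_delta)
        return state

    def rewind(self, index: int) -> list[Step]:
        """The steps up to and including `index`, in order."""
        return [s for s in self.steps if s.index <= index]


# Hypothetical three-step trace.
trace = [
    Step(0, "thought", "Plan: run the tests first", {"phase": "plan"}),
    Step(1, "tool_call", "pytest -q", {"phase": "test", "last_tool": "pytest"}),
    Step(2, "observation", "2 failed", {"failures": 2}),
]
replay = Replay(trace)
print(replay.state_at(1))
```

Storing deltas rather than full snapshots keeps each record small, at the cost of a linear scan on every rewind. A real system would likely add periodic snapshots.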

// Engineering

Technical Challenges

High-Throughput Log Ingestion

Agents generate large volumes of verbose logs (their internal monologue). Standard SQL databases were too slow for this write load, so we migrated to ClickHouse, which handles 1 GB/s log ingestion streams from multiple concurrent agents.
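
ClickHouse generally performs best when rows arrive in large batches rather than one at a time. A common pattern on the ingestion side is therefore a buffer that flushes once it reaches a size threshold. The sketch below shows that pattern only: `BatchBuffer`, the `sink` callable, and the row fields are assumptions for illustration, and the sink here is a plain list standing in for a real bulk insert.

```python
from typing import Callable

Row = dict[str, object]


class BatchBuffer:
    """Collects log rows and hands them to a sink in batches."""

    def __init__(self, sink: Callable[[list[Row]], None], max_rows: int = 10_000) -> None:
        self.sink = sink            # hypothetical: would wrap a ClickHouse bulk insert
        self.max_rows = max_rows
        self._rows: list[Row] = []

    def add(self, row: Row) -> None:
        """Buffer one row and flush when the batch is full."""
        self._rows.append(row)
        if len(self._rows) >= self.max_rows:
            self.flush()

    def flush(self) -> None:
        """Send the buffered rows to the sink, if there are any."""
        if self._rows:
            batch, self._rows = self._rows, []
            self.sink(batch)


# Stand-in sink: record each batch instead of writing to a database.
batches: list[list[Row]] = []
buf = BatchBuffer(sink=batches.append, max_rows=3)
for i in range(7):
    buf.add({"agent_id": "agent-1", "seq": i, "text": f"thought {i}"})
buf.flush()  # push the final partial batch
print([len(b) for b in batches])
```

For scale: if 1 GB/s were sustained around the clock (the figure above may equally describe a peak), it would amount to 86,400 GB, or about 86.4 TB, per day.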

Visualizing Non-Linear Logic

Agents often branch or loop, so a linear log view is not enough. We built a custom DAG (Directed Acyclic Graph) visualizer with React Flow that supports real-time streaming updates over WebSockets.
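
One way to keep a graph acyclic when the agent loops is to unroll the loop, so that each revisit of a logical step becomes a new node. The sketch below assumes that approach. The source does not say how AgentOps handles loops, and `TraceGraph`, the event fields, and the `name#visit` id scheme are all hypothetical. Events are applied one at a time, as they would be when arriving over a stream.

```python
from collections import defaultdict


class TraceGraph:
    """Incrementally builds an acyclic step graph from streamed events."""

    def __init__(self) -> None:
        self.nodes: dict[str, dict] = {}
        self.edges: list[dict] = []
        self._visits: dict[str, int] = defaultdict(int)
        self._latest: dict[str, str] = {}   # logical step name -> latest node id

    def apply(self, event: dict) -> str:
        """Apply one event and return the id of the node it created."""
        name = event["step"]
        self._visits[name] += 1
        node_id = f"{name}#{self._visits[name]}"    # a revisit gets a fresh node
        self.nodes[node_id] = {"id": node_id, "label": event.get("label", name)}
        parent = event.get("parent")
        if parent is not None:
            source = self._latest[parent]           # raises KeyError for an unknown parent
            self.edges.append(
                {"id": f"{source}->{node_id}", "source": source, "target": node_id}
            )
        self._latest[name] = node_id
        return node_id


# Hypothetical stream: the agent edits, tests, then loops back to edit again.
graph = TraceGraph()
for event in [
    {"step": "plan"},
    {"step": "edit", "parent": "plan"},
    {"step": "test", "parent": "edit"},
    {"step": "edit", "parent": "test"},
    {"step": "test", "parent": "edit"},
]:
    graph.apply(event)

print([(e["source"], e["target"]) for e in graph.edges])
```

Because every edge points at a node that was just created, no edge can close a cycle. A frontend could map these `id`/`source`/`target` dictionaries onto its graph library's node and edge objects and append them as events arrive.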

// Architecture

System Design

  • 01. Ingestion: Rust High-Performance Sidecar
  • 02. Storage: ClickHouse (Time-Series Logs)
  • 03. Analysis: Background Workers (Python/Pandas)
  • 04. Frontend: Next.js + React Flow + WebSockets
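
As an illustration of the analysis stage, the sketch below aggregates token usage per agent from stored log rows. The source does not describe what the background workers compute, so the row schema and the `tokens_per_agent` helper are assumptions. The sketch uses only the standard library, whereas the workers listed above use Python with Pandas.

```python
from collections import Counter

# Hypothetical log rows, as they might be read back from storage.
rows = [
    {"agent_id": "agent-1", "kind": "thought", "tokens": 120},
    {"agent_id": "agent-2", "kind": "thought", "tokens": 80},
    {"agent_id": "agent-1", "kind": "tool_call", "tokens": 40},
    {"agent_id": "agent-2", "kind": "observation", "tokens": 15},
]


def tokens_per_agent(rows: list[dict]) -> dict[str, int]:
    """Sum the `tokens` field for each `agent_id`."""
    totals: Counter[str] = Counter()
    for row in rows:
        totals[row["agent_id"]] += row["tokens"]
    return dict(totals)


print(tokens_per_agent(rows))
```
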
// Results

Outcomes

AgentOps reduced agent regression testing time by 65%. It now monitors all Mavik Labs production workloads, processing more than 50M tokens per day.

Research Roadmap

  • Prototype (Q1 2026): Single-agent tracing
  • Internal Beta (Q2 2026): Multi-agent swarm support
  • SaaS Launch (Q3 2026): Public API & SDK release

Build This With Us

We license our research stacks to select partners.
