Dev Infrastructure | Internal Alpha | Updated January 20, 2026

AgentOps

The 'Black Box' recorder for autonomous coding agents.

Core Stack

Rust, ClickHouse, React Flow, Claude 3.5 Sonnet, eBPF

Project Links

Access is available on request for partners.

// Foundation

The Thesis

Market Context

With the rise of agentic engineering tools such as Devin and Cursor, there is a gap in 'Agent APM' (Application Performance Monitoring for agents). Traditional tools like Datadog were not designed to trace non-deterministic reasoning chains. AgentOps fills this gap.

Hypothesis

As we deploy autonomous engineers, debugging their non-deterministic behavior becomes nearly impossible without a replay engine. We need a way to 'rewind' an agent's thought process step by step.
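
A minimal sketch of what 'rewinding' could look like, assuming each agent step is recorded as an immutable event with a sequence number. The `Step` fields and the `ReplayCursor` class are hypothetical names chosen for illustration; they are not the AgentOps API.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Step:
    """One recorded agent step (hypothetical schema)."""
    seq: int       # position of the step within the run
    kind: str      # e.g. "thought", "tool_call", "observation"
    content: str   # raw text captured at that step


class ReplayCursor:
    """Walks a recorded run forwards and backwards without re-running the agent."""

    def __init__(self, steps: List[Step]) -> None:
        # Sort defensively: recorded events may arrive out of order.
        self._steps = sorted(steps, key=lambda s: s.seq)
        self._pos = -1  # -1 means "before the first step"

    def forward(self) -> Step:
        if self._pos + 1 >= len(self._steps):
            raise IndexError("already at the last recorded step")
        self._pos += 1
        return self._steps[self._pos]

    def rewind(self) -> Step:
        if self._pos <= 0:
            raise IndexError("already at the first recorded step")
        self._pos -= 1
        return self._steps[self._pos]

    def history(self) -> List[Step]:
        """Everything the agent had produced up to the current position."""
        return self._steps[: self._pos + 1]
```

Because the cursor only reads stored events, stepping backwards is deterministic even though the original run was not.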

// Engineering

Technical Challenges

High-Throughput Log Ingestion

Agents generate massive, verbose logs, including their internal monologue. Standard row-oriented SQL databases were too slow for this write volume, so we migrated to ClickHouse to handle 1 GB/s log ingestion streams from multiple concurrent agents.
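
Columnar stores such as ClickHouse generally perform best when rows arrive in large batches rather than one at a time. The sketch below shows one way to buffer log rows and flush them by size or age. It is written in Python for brevity (the production sidecar is Rust), and `BatchBuffer`, its parameters, and the `sink` callable are assumptions for illustration; `sink` stands in for whatever client call performs the actual insert.

```python
import time
from typing import Callable, List, Optional

Row = dict
Sink = Callable[[List[Row]], None]


class BatchBuffer:
    """Collects rows and hands them to `sink` in batches (hypothetical helper)."""

    def __init__(self, sink: Sink, max_rows: int = 10_000, max_age_s: float = 1.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._sink = sink            # stand-in for a real bulk-insert call
        self._max_rows = max_rows    # flush when this many rows are buffered
        self._max_age_s = max_age_s  # or when the oldest row is this old
        self._clock = clock          # injectable so tests can control time
        self._rows: List[Row] = []
        self._first_at: Optional[float] = None

    def add(self, row: Row) -> None:
        if not self._rows:
            self._first_at = self._clock()
        self._rows.append(row)
        too_big = len(self._rows) >= self._max_rows
        too_old = self._clock() - self._first_at >= self._max_age_s
        if too_big or too_old:
            self.flush()

    def flush(self) -> None:
        """Send whatever is buffered; a no-op when the buffer is empty."""
        if self._rows:
            batch, self._rows = self._rows, []
            self._first_at = None
            self._sink(batch)
```

Note that this sketch only checks the age limit when a new row arrives; a real sidecar would likely also flush from a background timer and call `flush()` on shutdown.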

Visualizing Non-Linear Logic

Agent runs often branch or loop, so a linear log view is not enough to follow them. We built a custom DAG (directed acyclic graph) visualizer on React Flow that supports real-time streaming updates via WebSockets.
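
A loop is a cycle, so it cannot appear in a DAG as-is. One common way to reconcile the two is to unroll loops: every visit to a step becomes its own node, so a return to "plan" shows up as a second "plan" node rather than a back edge. The sketch below assumes that approach, which the text above does not confirm, and emits JSON messages of the kind a frontend could receive over a WebSocket. `TraceGraph` and the message shape are hypothetical.

```python
import json
from collections import defaultdict
from typing import Dict, List, Optional, Set, Tuple


class TraceGraph:
    """Builds an acyclic trace graph incrementally by unrolling repeated steps."""

    def __init__(self) -> None:
        self._visits: Dict[str, int] = defaultdict(int)
        self._nodes: Set[str] = set()
        self.edges: List[Tuple[str, str]] = []

    def add_step(self, name: str, parent: Optional[str] = None) -> Tuple[str, List[str]]:
        """Record one executed step; return its node id and the messages to push."""
        if parent is not None and parent not in self._nodes:
            raise KeyError(f"unknown parent node: {parent}")
        self._visits[name] += 1
        node_id = f"{name}#{self._visits[name]}"  # "plan#2" is the second visit to "plan"
        self._nodes.add(node_id)
        messages = [json.dumps({"type": "node", "id": node_id, "label": name})]
        if parent is not None:
            # Edges always point from an existing node to a brand-new one,
            # so the graph stays acyclic by construction.
            self.edges.append((parent, node_id))
            messages.append(json.dumps({"type": "edge", "source": parent, "target": node_id}))
        return node_id, messages


graph = TraceGraph()
plan1, _ = graph.add_step("plan")
edit1, _ = graph.add_step("edit", parent=plan1)
test1, _ = graph.add_step("test", parent=plan1)         # a branch from the same parent
plan2, updates = graph.add_step("plan", parent=edit1)   # a loop, unrolled into a new node
```

Each call returns only the new node and edge, so the client can append them to the existing graph instead of re-rendering the whole run.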

// Architecture

System Design

01. Ingestion: Rust High-Performance Sidecar
02. Storage: ClickHouse (Time-Series Logs)
03. Analysis: Background Workers (Python/Pandas)
04. Frontend: Next.js + React Flow + WebSockets
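
As a rough illustration of the analysis stage, the sketch below rolls stored log rows up into per-run totals. It uses only the standard library so it runs anywhere; the actual workers use Pandas, where the same roll-up would typically be a group-by. The `run_id` and `tokens` field names and the `summarize_runs` function are assumptions for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable


def summarize_runs(rows: Iterable[dict]) -> Dict[str, dict]:
    """Aggregate step count and token usage per run_id (hypothetical fields)."""
    summary: Dict[str, dict] = defaultdict(lambda: {"steps": 0, "tokens": 0})
    for row in rows:
        entry = summary[row["run_id"]]
        entry["steps"] += 1
        entry["tokens"] += row.get("tokens", 0)  # rows without a count add zero
    return dict(summary)


rows = [
    {"run_id": "run-a", "tokens": 120},
    {"run_id": "run-a", "tokens": 80},
    {"run_id": "run-b", "tokens": 40},
]
totals = summarize_runs(rows)
```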

// Results

Outcomes

AgentOps reduced agent regression-testing time by 65%. It now monitors all Mavik Labs production workloads, processing more than 50M tokens per day.

Research Roadmap

Prototype (Q1 2026): Single-agent tracing

Internal Beta (Q2 2026): Multi-agent swarm support

SaaS Launch (Q3 2026): Public API & SDK release

Build This With Us

We license our research stacks to select partners.

Partner on AgentOps
