Research

Arguments about agent improvement that can be tested against traces, gates, and runtime behavior.

Traces Are The Training Data

Why self-improving agents need full trajectories, tool spans, analyst findings, provenance, and replay instead of final scores alone.

Each note should make a claim about how agent systems improve, fail, or prove progress.

A series map for self-improving agent systems, from optimization theory and prompt search to runtime topology, traces, memory, and governance.

Where GEPA, DSPy, MIPRO, AxLLM, and related prompt optimizers fit inside a larger self-improving agent stack.

How SkillOpt, Voyager-style skill libraries, and agent skills turn durable procedure into an optimization surface.

A compact map from hill climbing and Bayesian search to GEPA, SkillOpt, Frontier Tuning, agent runtimes, and noisy promotion gates.

How SFT, RLHF, process supervision, tool-use RL, and Microsoft Frontier Tuning differ from public prompt, skill, and harness loops.

How episodic memory, knowledge gates, retrieval evals, negative knowledge, and production trace mining fit into self-improving agent systems.

Why held-out promotion, judge reliability, failure taxonomies, cost ceilings, and confidence intervals decide whether self-improvement is real.

Why best-of-N, self-consistency, verifier reranking, and compute-matched controls are the baseline for agent topology claims.

Why meta-harness, AlphaEvolve-style code search, worktree isolation, and architecture frontiers matter after prompt and skill tuning plateau.

Why multi-agent self-improvement needs explicit runtime primitives for fanout, refine, select, parallelism, supervision, budgets, and replay.

How driver, worker, selector, reviewer, analyst, and coordinator roles become reliable multi-agent systems instead of roleplay.

Why prompt injection, sandbox boundaries, eval poisoning, provenance, compliance, and release gates are core to any real self-improving agent stack.