AgentSight is for the moments when an agent run becomes too expensive, slow, risky, or hard to review.
Start from a real run, not a synthetic dashboard. AgentSight gives you a system-level profile and skillset you can use for debugging, review, tool evaluation, and incident investigation.
Profile slow or expensive agent runs
Pain
The agent ran for 25 minutes and burned through tokens, but the transcript does not explain where the time went.
Profile outcome
Break the run down by model calls, shell time, repeated scans, long-running subprocesses, network waits, CPU, and memory.
LLM turns and token volumeRepeated file readsFailed commands and retriesCPU and memory samples
Review AI-generated PRs faster
Pain
A diff tells you what changed, not how the agent got there or whether it actually ran the checks it claimed.
Profile outcome
Attach a run profile to an AI-generated PR with commands, tests, failures, touched files, network calls, and run cost.
Commands and exit statusFiles written and deletedTest retriesRepo-external access
Trace closed-source agent CLIs
Pain
Claude Code, Codex, Gemini CLI, and other agents expose different logs and hooks, and some behavior happens in child processes.
Profile outcome
Trace the agent process tree from outside the application without SDKs, proxies, or vendor-specific integrations.
Process lineageTLS model trafficShell commandsNetwork destinations
Generate shareable insight pages
Pain
Raw traces are too detailed for reviewers, managers, or tool authors. They need findings, evidence, and next actions in one page.
Profile outcome
Use an AgentSight skill to drive the profiling workflow and guide the agent from raw evidence to a self-contained HTML artifact.