Selected work

Orion-OS

A self-correcting runtime for autonomous code generation — it writes, tests, and repairs its own output.

Problem

LLM code generation fails silently. Plausible-looking code, no tests, no recovery path, no way to know what happened.

Approach

A deterministic pipeline wrapped around the model — coherence gate, real pytest execution, a failure taxonomy, and a self-repair loop — with every run traced and queryable.

Result

average cost per generated-and-tested feature
~$0.001
self-repair retries to recover a failure, on a fixed budget
≤3
unattended on a VPS, with a nightly benchmark
24/7

Stack

  • Python
  • FastAPI
  • Postgres
  • DeepSeek