The AI That Ships Your Code
Gilfoyle orchestrates a network of 8 AI agents that investigate, plan, implement, test, and ship production code autonomously.
Meet the Agents
Orchestrator
Gilfoyle
Receives alerts, triages severity, delegates to the right agents, and ensures every fix ships to production.
[gilfoyle] Alert: SENTRY-4521 NullPointerException
[gilfoyle] → Assigning anton for investigation
[gilfoyle] → Investigation complete. Root cause identified.
[gilfoyle] → Assigning son-of-anton for architecture plan
[gilfoyle] → Plan approved. Spawning carla for implementation.Investigation
Anton
Deep-dives into Sentry traces, stack traces, and logs to identify the exact root cause before anyone writes a line of code.
$ sentry:investigate PROD-4521
Error: Cannot read property 'subscription' of null
File: billing/services.py:247
Trace: handle_webhook → process_payment → get_subscription
Root Cause: Race condition in webhook handler.
Customer subscriptions created async — webhook fires
before record commits. Fix: add retry with backoff.Architect
Son of Anton
Designs the implementation plan — which files to change, what to create, estimated scope — before a single line is written.
## Plan: Fix PROD-4521 (Race Condition)
1. Add retry decorator to webhook handler
→ billing/services.py:process_payment()
2. Implement exponential backoff (max 3 retries)
→ billing/utils/retry.py (new file)
3. Add integration test for concurrent webhook
→ tests/billing/test_webhook_race.py
Estimated: 45 lines changed, 2 files modified, 1 newCode
Carla
Writes production code — clean diffs, proper error handling, following existing patterns in the codebase.
def process_payment(self, webhook_data):
- subscription = self.get_subscription(customer_id)
- if not subscription:
- raise SubscriptionNotFound(customer_id)
+ subscription = self.get_subscription_with_retry(
+ customer_id,
+ max_retries=3,
+ backoff_base=0.5
+ )
return self.charge(subscription, webhook_data.amount)Code
Holden
Writes comprehensive test suites — unit tests, integration tests, edge cases — ensuring every fix is verified before merge.
async def test_concurrent_webhook_handling():
"""Verify payment processes even when
subscription creation is delayed."""
customer = await create_test_customer()
# Fire webhook before subscription commits
webhook = simulate_webhook(customer.id)
result = await process_payment(webhook)
assert result.status == "succeeded"
assert result.retries == 2Code
Devon
Operates in isolated git worktrees — parallel branches, zero conflicts, clean PRs ready for review.
$ claude --worktree fix-billing-race
Creating worktree: fix-billing-race
Branch: fix/PROD-4521-webhook-race
Modified: billing/services.py
Created: billing/utils/retry.py
Tests: 3 added, all passing
→ PR #847 created: "fix: webhook race condition"QA
Big Head
Runs the full test matrix — unit, integration, browser, regression, and load — before any code merges.
QA Report: PR #847
✓ Unit tests — 34/34 passing
✓ Integration — 12/12 passing
✓ Browser QA — checkout flow verified
✓ Regression — no side effects detected
✓ Load test — 500 concurrent webhooks OK
Verdict: SHIP ITOps
Jared
Handles the last mile — merges PRs, monitors deploys, verifies error rates drop to zero in production.
Deploy: fix/PROD-4521-webhook-race
[14:23] PR #847 merged to main
[14:24] CI pipeline: build → test → deploy
[14:26] Staging: healthy ✓
[14:27] Production: rolling deploy 0/3...
[14:28] Production: 3/3 instances healthy ✓
[14:28] Sentry: error rate → 0. Resolved.From Alert to Shipped PR
Capabilities
Sentry → PR
Automatically investigates Sentry alerts and ships the fix as a pull request.
Code Review
Reviews every PR for logic errors, security issues, and style.
Browser QA
Runs headless browser tests to verify UI flows end-to-end.
CI/CD Watch
Monitors pipelines and auto-fixes broken builds.
Customer Triage
Routes support issues to the right agent for investigation.
PR Pipeline
Manages the full lifecycle from branch creation to production deploy.
Sprint Ops
Picks up tickets and executes them autonomously from Linear.
Infra Guard
Watches infrastructure health and patches issues proactively.
PRs authored in 4 weeks
min fastest bug fix
total PRs merged
hr median PR to merge
Built on Claude
Claude Opus 4
Powers every agent
Claude Code
Universal codebase interface
Git Worktrees
Parallel isolation, zero conflicts