AgentOps

A proof-of-concept agent framework demonstrating durable AI agents with human-in-the-loop approval.

This repository showcases architectural patterns for building autonomous agent systems that can pause for human approval and resume after server restarts. It's intended as a reference implementation and learning resource, not a production deployment.

What This Demonstrates

This project illustrates key patterns for durable agent systems:

Human-in-the-Loop (HITL) - Dangerous operations pause for human approval before executing
Event Sourcing - All agent state derived from an append-only journal for crash recovery
Multi-agent Orchestration - Route tasks to specialized agents based on capabilities
Real-time Dashboard - Watch agent progress and approve actions via web UI

See AGENT_ARCHITECTURE.md for detailed architecture and patterns.

Core Technologies

Agent Framework:

Vercel AI SDK - LLM orchestration with tool calling
Hono - Lightweight HTTP server
TypeORM - Database ORM for journal persistence
Zod - Schema validation

Dashboard:

React - UI framework
Tailwind CSS - Styling
Server-Sent Events - Real-time event streaming

Infrastructure:

Docker Compose - Service orchestration
PostgreSQL - Journal persistence
Loki + Grafana - Log aggregation and visualization
Traefik - Reverse proxy

Sample Application

The repository includes a distributed bookstore system for testing agent capabilities:

Store API (orders, catalog, inventory)
Warehouse APIs (fulfillment, shipments)
React UI (customer + admin interfaces)
PostgreSQL databases
Background jobs (health checks, reconciliation)

This application provides realistic complexity for testing agents: debugging TypeScript services, analyzing distributed logs, and troubleshooting distributed system issues.

See BOOKSTORE_ARCHITECTURE.md for details.

Quick Start

Prerequisites

Node.js 20+ - Runtime for TypeScript
Docker & Docker Compose - Container orchestration
Anthropic API Key - Get from console.anthropic.com

Installation

# Clone repository
git clone https://github.com/yourusername/agentops.git
cd agentops

# Create environment files
cp .env.example .env
cp ops/packages/agent-server/.env.example ops/packages/agent-server/.env

# Edit .env files and set your Anthropic API key
nano ops/packages/agent-server/.env  # Set ANTHROPIC_API_KEY=sk-ant-...
nano .env                            # Review and adjust if needed

See comments in .env files for variable descriptions.

Start Services

Option 1: Automated (Recommended)

./scripts/start.sh --build

Option 2: Manual

# Install Docker Loki logging plugin
./scripts/setup-logging.sh

# Start all services
docker compose up --build

Access the Dashboard

Open http://localhost:3001 in your browser.

Enter a task (e.g., "The store-api is returning 500 errors. Fix it.")
Watch the agent investigate and propose solutions
Approve or reject dangerous operations (file writes, shell commands)

Access Points

Service	URL	Purpose
Agent Dashboard	http://localhost:3001	Agent UI with approval flow
Bookstore UI	http://localhost	Customer/admin interface
Store API	http://api.localhost/store	Orders and catalog
Warehouse Alpha	http://api.localhost/warehouses/alpha	Fulfillment
Warehouse Beta	http://api.localhost/warehouses/beta	Fulfillment
Grafana (Logs)	http://grafana.localhost	Log visualization
Loki	http://loki.localhost	Log aggregation
Traefik Dashboard	http://localhost:8080	Routing and services
Agent Server API	http://api.localhost/agents	Agent HTTP API

Test Credentials:

Customer: alice@customer.com:alice123
Admin: admin@bookstore.com:admin123
Warehouse Staff: staff@warehouse-alpha.com:staff123

Example Tasks

Orchestration Agent (recommended):

"The store-api is returning 500 errors. Fix it."
"Debug the inventory sync job and check recent logs"
"Fix the bug in auth service and verify no errors in logs"

Coding Agent:

"Fix all TypeScript errors in ops/test-cases/"
"Debug why the authentication module is failing"

Log Analyzer Agent:

"Why is warehouse-alpha returning 500 errors?"
"Show me all failed order processing attempts"

Project Structure

agentops/
├── ops/                           # Agent framework (monorepo)
│   ├── packages/
│   │   ├── agent-server/          # Hono HTTP server + agents
│   │   ├── dashboard/             # React dashboard with approval UI
│   │   └── shared/                # Common types and utilities
│   └── test-cases/                # Sample code for testing agents
├── services/                      # Bookstore microservices
│   ├── store-api/                 # Store backend (TypeScript + Koa)
│   ├── warehouse-api/             # Warehouse backend (TypeScript + Koa)
│   └── bookstore-ui/              # React frontend
├── infra/                         # Infrastructure configs
│   ├── traefik/                   # Reverse proxy
│   ├── loki/                      # Log aggregation
│   └── grafana/                   # Log visualization
├── AGENT_ARCHITECTURE.md          # Agent framework details
├── BOOKSTORE_ARCHITECTURE.md      # Sample app architecture
└── docker-compose.yaml            # Service orchestration

Documentation

Agent Architecture - Framework design, patterns, and extensibility
Bookstore Architecture - Sample application details
User Guide - Using the bookstore application
Logging Guide - Log aggregation with Loki and Grafana

Development

Agent Development

cd ops
npm install              # Install dependencies
npm run build            # Build all packages
npm run dev:server       # Run agent server (port 3200)
npm run dev:dashboard    # Run dashboard (port 3001)

Bookstore Development

# Rebuild specific service
docker compose up --build store-api

# Access database
docker compose exec store-db psql -U storeuser -d store_db

# View logs
docker compose logs -f store-api

Proof of Concept Limitations

This is a reference implementation for learning and experimentation. It is not intended for production use and lacks:

Security hardening - No HTTPS, uses basic auth, shared secrets in env vars
Error recovery - Limited retry logic and failure handling
Scalability - Single-instance services, no horizontal scaling
Monitoring - Basic logging only, no metrics or alerting
Testing - Minimal test coverage
Authentication - Simple auth suitable for demos only

If adapting this for real-world use, implement proper security, observability, and reliability patterns.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
docs		docs
infra		infra
ops		ops
plans		plans
scripts		scripts
services		services
shared		shared
.env.example		.env.example
.gitignore		.gitignore
AGENT_ARCHITECTURE.md		AGENT_ARCHITECTURE.md
BLOG_POST.md		BLOG_POST.md
BOOKSTORE_ARCHITECTURE.md		BOOKSTORE_ARCHITECTURE.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
PLAN.md		PLAN.md
README.md		README.md
docker-compose.yaml		docker-compose.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AgentOps

What This Demonstrates

Core Technologies

Sample Application

Quick Start

Prerequisites

Installation

Start Services

Access the Dashboard

Access Points

Example Tasks

Project Structure

Documentation

Development

Agent Development

Bookstore Development

Proof of Concept Limitations

License

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

tonyfosterdev/agent-ops

Folders and files

Latest commit

History

Repository files navigation

AgentOps

What This Demonstrates

Core Technologies

Sample Application

Quick Start

Prerequisites

Installation

Start Services

Access the Dashboard

Access Points

Example Tasks

Project Structure

Documentation

Development

Agent Development

Bookstore Development

Proof of Concept Limitations

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages