A zero-code, configuration-driven LLM task gateway. Simply submit a YAML configuration file to automatically generate a production-grade API endpoint with input validation, structured output, automatic retry, and multi-model routing capabilities.
## Features

- Zero-Code Configuration: Define tasks in YAML, no coding required
- Dynamic Type Generation: JSON Schema → Pydantic models for validation (see the sketch after this list)
- Structured Output: Enforced output schema using Instructor
- Multi-Model Support: OpenAI, Anthropic, vLLM, Ollama, and more
- Built-in Security: API key authentication, rate limiting, concurrency control
- Observability: Prometheus metrics integration
- Startup Validation: Config validation before the service starts
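To make the dynamic type generation concrete, here is a minimal sketch of how a JSON Schema fragment can be turned into a Pydantic model with `create_model`. This is illustrative only, not the project's actual `src/models.py`, and it ignores constraint keywords such as `minLength` and `enum` for brevity:

```python
from pydantic import create_model

# Map JSON Schema primitive types to Python types (illustrative subset).
TYPE_MAP = {"string": str, "number": float, "integer": int, "boolean": bool}

def model_from_schema(name: str, schema: dict):
    """Build a Pydantic model from a flat JSON Schema object node."""
    required = set(schema.get("required", []))
    fields = {}
    for prop, spec in schema.get("properties", {}).items():
        py_type = TYPE_MAP.get(spec.get("type"), str)
        # Ellipsis marks a field as required; None makes it optional.
        fields[prop] = (py_type, ... if prop in required else None)
    return create_model(name, **fields)

# Example: the input schema of the sentiment task defined below.
SentimentInput = model_from_schema(
    "SentimentInput",
    {
        "type": "object",
        "properties": {"content": {"type": "string"}},
        "required": ["content"],
    },
)

SentimentInput(content="This product is amazing!")  # passes validation
# SentimentInput() would raise a ValidationError: `content` is required.
```

A model generated this way from a task's `output_schema` is the kind of object a library like Instructor accepts as its `response_model` to enforce structured output.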
## Quick Start

```bash
# Clone the repository
git clone <repository-url>
cd llm-task-v3

# Install dependencies using uv
uv sync

# Copy environment variables
cp .env.example .env

# Edit .env with your API keys
nano .env
```

- Configure Models (`config/models.yaml`):
```yaml
model_list:
  - model_name: gpt-4o-mini
    provider: openai
    model: openai/gpt-4o-mini
    api_key: os.environ/OPENAI_API_KEY
```
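The `os.environ/OPENAI_API_KEY` value follows the LiteLLM-style convention of referencing an environment variable rather than embedding the secret in the file. A minimal sketch of how such a reference could be resolved (the helper name is hypothetical; the gateway's actual logic lives in its config loader):

```python
import os

def resolve_secret(value: str) -> str:
    """Resolve `os.environ/VAR` references to the variable's value.

    Hypothetical helper, shown only to illustrate the convention.
    """
    prefix = "os.environ/"
    if value.startswith(prefix):
        return os.environ[value[len(prefix):]]
    return value  # literal keys pass through unchanged

# resolve_secret("os.environ/OPENAI_API_KEY") -> contents of $OPENAI_API_KEY
```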
- Define a Task (`tasks/sentiment_analysis.yaml`):
```yaml
meta:
  id: "sentiment_analysis"
  version: "1.0.0"
  description: "Analyze sentiment of text"

input_schema:
  type: object
  properties:
    content:
      type: string
      minLength: 10
  required: ["content"]

output_schema:
  type: object
  properties:
    sentiment:
      type: string
      enum: ["POSITIVE", "NEGATIVE", "NEUTRAL"]
    confidence:
      type: number
      minimum: 0.0
      maximum: 1.0
  required: ["sentiment", "confidence"]

strategy:
  primary_model: "gpt-4o-mini"
  max_retries: 2
  timeout: 15

prompt:
  system: "You are a sentiment analyst."
  user: "Analyze: {{ content }}"
```

- Start the server:

```bash
uv run uvicorn src.main:app --reload
```

The API will be available at http://localhost:8000.
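A note on the task file above: the `{{ content }}` placeholder indicates template-based prompt rendering. Assuming Jinja2-style templating, which the syntax suggests, fields from the request payload are substituted into the user prompt at request time:

```python
from jinja2 import Template

# Illustrative only -- assumes Jinja2-style templating, which the
# `{{ content }}` syntax in the task file suggests.
task_prompt = "Analyze: {{ content }}"
payload = {"content": "This product is amazing!"}

print(Template(task_prompt).render(**payload))  # Analyze: This product is amazing!
```

The `strategy` fields then bound execution: presumably `max_retries` caps retry attempts on provider or validation failures and `timeout` caps each attempt in seconds; the exact semantics are defined by the executor, not this sketch.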
### Example Request

```bash
curl -X POST "http://localhost:8000/api/v1/run" \
  -H "X-API-Key: your-secret-api-key-here" \
  -H "Content-Type: application/json" \
  -d '{
    "task_id": "sentiment_analysis",
    "payload": {
      "content": "This product is amazing!"
    }
  }'
```

Response:

```json
{
  "code": 200,
  "status": "success",
  "data": {
    "sentiment": "POSITIVE",
    "confidence": 0.95
  },
  "meta": {
    "task_id": "sentiment_analysis",
    "model_used": "gpt-4o-mini",
    "latency_ms": 850
  }
}
```

## API Endpoints

| Endpoint | Method | Description |
|---|---|---|
| `/health` | GET | Health check |
| `/api/v1/tasks` | GET | List all tasks |
| `/api/v1/tasks/{task_id}` | GET | Get task details |
| `/api/v1/run` | POST | Execute a task |
| `/metrics` | GET | Prometheus metrics |
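Any HTTP client works; for example, the same request from Python with `httpx` (the API key value is a placeholder, as in the curl example above):

```python
import httpx

response = httpx.post(
    "http://localhost:8000/api/v1/run",
    headers={"X-API-Key": "your-secret-api-key-here"},
    json={
        "task_id": "sentiment_analysis",
        "payload": {"content": "This product is amazing!"},
    },
    timeout=30.0,
)
response.raise_for_status()
print(response.json()["data"])  # {'sentiment': 'POSITIVE', 'confidence': 0.95}
```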
## Testing

```bash
# Run all tests
uv run pytest

# Run unit tests only
uv run pytest tests/unit/ -v

# Run with coverage
uv run pytest --cov=src --cov-report=html
```
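As a starting point for new tests, here is a minimal sketch using FastAPI's `TestClient` against the health endpoint (assuming `src.main:app` as in the run command above, and that `/health` is not behind API-key auth):

```python
from fastapi.testclient import TestClient

from src.main import app

client = TestClient(app)

def test_health():
    # Assumes /health is unauthenticated; adjust if the gateway protects it.
    response = client.get("/health")
    assert response.status_code == 200
```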
## Development

```bash
# Linting
uv run ruff check .

# Format code
uv run ruff format .
```

## Project Structure

```
llm-task-v3/
├── config/
│   └── models.yaml              # Model configurations
├── tasks/
│   └── *.yaml                   # Task definitions
├── src/
│   ├── main.py                  # FastAPI application
│   ├── config.py                # Configuration loading
│   ├── models.py                # Dynamic model builder
│   ├── executor.py              # Task executor
│   ├── gateway.py               # Auth, rate limiting, concurrency
│   ├── metrics.py               # Prometheus metrics
│   └── llm/
│       └── provider_registry.py # LLM client registry
├── tests/
│   ├── unit/                    # Unit tests
│   ├── integration/             # Integration tests
│   └── e2e/                     # End-to-end tests
└── doc/
    ├── prd.md                   # Product requirements
    └── research.md              # Dependency research
```
## License

MIT License