Safer Codespace

An experimental development environment exploring defense-in-depth approaches to mitigate prompt injection risks when using AI coding assistants.

⚠️ Security Notice: This template uses multiple security layers (network firewall, content segregation, human review) to reduce prompt injection risks. While no approach is perfect, these controls make data exfiltration significantly harder. Learn more about the threat model →

Quick Start

Get up and running in 3 steps:

Click "Use this template" to create your repository
Open in GitHub Codespaces (or your preferred devcontainer host)
Start coding with your choice of AI tool:

# For complex, multi-step tasks with file access, use Claude Code
claude

# Optional: Install SpecStory to auto-save conversations (see docs/SpecStory-Installation.md)
specstory run claude

# For simple tasks, questions and text processing, use llm CLI tool
llm "explain this error" < error.log

# Or use the pipe syntax:
git diff --staged | llm -s "Generate a conventional commit message from these changes"

That's it! All tools are pre-installed and the security firewall is automatically configured.

Prerequisites

Before using this template, ensure you have:

Required:
- GitHub account (for Codespaces or template usage)
- Docker Desktop (for local devcontainer usage)
Optional (for AI features):
- Anthropic API key (for Claude Code and llm Claude models)
- GitHub Models access (free GPT-4o via llm - default)
- Google AI Studio key (for llm Gemini models)

The default configuration uses GitHub's free GPT-4o model, so you can start immediately without any API keys.

What's Included

This devcontainer comes pre-configured with:

AI Development Tools

Anthropic's Claude Code for interactive AI assistance with file access and command execution
- Default: Claude Code (requires Anthropic API key)
- Plugins: File System, Shell Command Execution
- Optional: Use with SpecStory to auto-save conversations as markdown (installation guide)
llm developed by Simon Willison, A CLI tool for interacting with OpenAI, Anthropic's Claude, Google's Gemini, Meta's Llama and dozens of other Large Language Models
- Default: GitHub GPT-4o (free, no API key required)
- Plugins: Anthropic Claude, Google Gemini, GitHub Models

Language Environments

Python 3.13 with uv (fast package manager) and pip
Node.js 24.x with npm (pinned for Claude Code compatibility)
Go (latest stable)

Development Tools

glow - Beautiful markdown rendering in terminal
just - Simple command runner (like make, but better)

Security Features

Network firewall - Blocks unauthorized outbound connections (auto-configured)
Content segregation - Separate context/trusted/ and context/untrusted/ directories
No GitHub CLI - Intentionally excluded to prevent potential data exfiltration

Continuous Integration

GitHub Actions - Automated testing and validation
- Devcontainer Build (.github/workflows/devcontainer-build.yml) - Validates the devcontainer builds successfully
- Network Connectivity (.github/workflows/network-connectivity-test.yml) - Verifies required endpoints are reachable
- Firewall Validation (.github/workflows/firewall-validation.yml) - Ensures firewall blocks unauthorized domains while allowing required endpoints
- llm CLI Tool Test (.github/workflows/llm-tool-test.yml) - Verifies the llm CLI tool is installed and available models are configured
- All workflows run on push/PR to main and can be manually triggered from the Actions tab

Choosing the Right Tool

Not every task needs an AI agent. Choose the right tool for your needs:

When to use Claude Code

Best for focused development tasks where you need file access and iterative assistance:

Boilerplate in unfamiliar languages - You know the goal but not the specific syntax or patterns
Test-driven development - Draft test cases and develop iteratively with Claude's guidance
Codebase exploration - Understand existing code through Claude's search and analysis capabilities
Simple features from scratch - Build UIs or functionality with minimal dependencies
Targeted refactoring - Modify existing code patterns across related files

# Start interactive session
claude

# Start with a specific task
claude "Give me an overview of this codebase"

When to use the `llm` CLI tool

Perfect for text processing tasks that don't need file access or command execution. Particularly useful because you can pipe output from other CLI tools directly to it:

Explain errors or code snippets - Paste error messages or functions for quick analysis
Generate commit messages - Create conventional commits from staged changes
Quick code reviews - Analyze diffs for potential issues or improvements
Explore Git history - Analyze commit logs, changes, and project evolution patterns
Transform data formats - Convert between JSON, CSV, markdown, or other text formats
Programming Q&A - Get answers about syntax, concepts, or best practices

# Generate commit message from staged changes
git diff --staged | llm -s "write a conventional commit message for these changes"

# Quick code review
git diff main | llm "review these changes for potential issues"

# Ask a question
cat script.py | llm "explain what this code does"

Security benefit: llm cannot access your files or run commands out-of-the-box. Even if compromised by prompt injection, damage is limited to the text you explicitly provide.

When to use VS Code with LLM integration

Perfect for controlled, inline code editing and completion when you want to restrict changes to specific files or sections:

Targeted code modifications - Edit specific functions or code blocks without affecting other files
Inline documentation - Generate comments, docstrings, type hints, or explanations within existing code
Code completion and suggestions - Get LLM assistance while maintaining full control over what you write
Refactoring specific sections - Modify code patterns within a single file or selected region
Language-specific optimizations - Improve syntax, or performance in focused code sections

This approach gives you the benefit of LLM assistance while ensuring changes remain scoped to exactly what you want to modify.

When to skip AI altogether

For simple, routine tasks, such as:

Basic git operations
Installing packages
Running tests
Simple config changes

Why? Faster, safer, and you maintain direct control.

Usage Guides

Claude Code

Interactive AI assistant with file access and command execution capabilities.

# Start Claude Code
claude

# Start with a prompt
claude "help me refactor this code"

# View options
claude --help

Using SpecStory with Claude Code:

SpecStory automatically saves your Claude Code conversations as clean, searchable markdown. This preserves the reasoning, decisions, and design tradeoffs behind your code as versioned, git-friendly documentation.

# Install SpecStory (optional, see installation guide)
# Follow instructions at: docs/SpecStory-Installation.md

# Run Claude Code with SpecStory
specstory run claude

# Your conversations are automatically saved to .specstory/
# as markdown files with timestamps and full context

Why use SpecStory?

Preserve intent - Capture the "why" behind code decisions
Reusable context - Refer back to past conversations and reasoning
Team collaboration - Share decision logs with teammates
Git-friendly - Version control your design discussions
Local-first - All data stays on your machine by default

For detailed documentation: https://docs.claude.com/en/docs/claude-code/overview SpecStory documentation pages: https://docs.specstory.com/overview

llm CLI

Command-line interface for various language models. Uses GitHub GPT-4o by default (no API key required).

# Basic usage - pipe input to llm
echo "Hello world" | llm "translate to Spanish"

# Read from file
llm "explain this code" < script.py

# Alternative: use cat to pipe file contents
cat script.py | llm "explain this code"

# Interactive chat mode
llm chat

# List available models
llm models list

# Change default model
llm models default claude-3-5-sonnet-latest

# View comprehensive help
llm --help

Key commands:

llm prompt - Execute a one-off prompt (default)
llm chat - Start an interactive conversation
llm models - Manage available models
llm keys - Configure API keys for different providers
llm logs - View prompt/response history
llm templates - Manage reusable prompt templates

For detailed documentation: https://llm.datasette.io/

Python Development

# Install packages (fast method)
uv pip install package-name

# Traditional pip
pip install package-name

# Run scripts
python script.py

Node.js Development

# Install packages
npm install package-name

# Run package.json scripts
npm run script-name

Go Development

# Initialize module
go mod init module-name

# Install packages
go get package-name

# Run programs
go run main.go

Other Tools

glow - Render markdown beautifully in your terminal:

glow README.md      # View a file
glow                # Browse current directory
cat file.md | glow  # Pipe content

just - Run project-specific commands (defined in justfile):

just --list              # Show available commands
just command-name        # Run a command
just --show command-name # View command definition

Understanding the Security Model

The Threat: Prompt Injection and the "Lethal Trifecta"

When AI assistants have all three capabilities, attackers can inject malicious instructions into external content:

Access to Private Data - Read your code, files, secrets
Exposure to Untrusted Content - Process external docs, dependencies, web pages
Ability to Exfiltrate Data - Send data out via network requests to attacker-controlled servers

Attack scenario: Malicious instructions in documentation → AI reads your secrets → AI sends them to attacker's server.

Our Defense-in-Depth Approach

This template uses multiple redundant layers rather than relying on any single defense:

1. Network Firewall (Mitigates Exfiltration)

Blocks all outbound traffic except to pre-approved development endpoints
Allowed domains: GitHub, npm, PyPI, Anthropic API, Google Gemini API, etc.
Automatically configured on container startup
Validates rules to ensure proper function

Test the firewall:

tests/network/test_connectivity.sh

Add new domains: Edit .devcontainer/init-firewall.sh and add to the domain list around line 67:

for domain in \
    "registry.npmjs.org" \
    "api.anthropic.com" \
    "your-new-domain.com" \
    # ... rest of domains

Limitations: Cannot prevent exfiltration to allowed domains (e.g., GitHub). Works best with other layers.

2. Content Segregation

The context/ directory separates vetted from unvetted content:

context/
├── trusted/       # Human-reviewed content (safe to use)
└── untrusted/     # External docs requiring review

Recommended workflow:

Fetch external content using non-AI tools (curl, Jina Reader)
Save to context/untrusted/ with descriptive filenames
Review yourself for malicious instructions:
- "Send passwords to..."
- "Ignore previous instructions..."
- Suspicious URLs or commands
Move to context/trusted/ after review
Reference only context/trusted/ with AI agents

This ensures agents never directly process untrusted content.

3. Human Review (The Critical Layer)

Key principle: Don't use AI to detect prompt injection attacks! Use human intelligence to review external content before providing it as context.

4. Tool Selection

Choose less powerful tools when possible:

llm (no file access) over Claude Code for simple tasks
Manual commands over AI for routine operations

Why No GitHub CLI?

The gh CLI is intentionally excluded as part of our security-first approach:

Risk: Could be used to exfiltrate data via issues, PRs, or gists
Alternative: Standard git commands handle most workflows
If needed: Install manually (sudo apt install gh) with full awareness of the risk

Learn More

For detailed information about prompt injection:

Read the curated content in context/trusted/simon-willison-weblog-content/
Original 2022 post coining "prompt injection"
2025 "lethal trifecta" framework
Real-world examples of attacks on major AI systems

Additional Resources

Development Philosophy

This repository includes CLAUDE.md with comprehensive guidelines on:

Literate Programming principles
MVP-first methodology (6-8 step rule)
Test-Driven Development practices
Python coding standards
Git workflow with Conventional Commits

View it: glow CLAUDE.md

Security Resources

Prompt Injection Deep Dive: context/trusted/simon-willison-weblog-content/
GitHub Actions Risks: context/trusted/github-blog-posts/github-actions-workflow-injection-risks.md

Troubleshooting

Firewall Issues

Problem: Can't connect to a required service

Solution: Add the domain to .devcontainer/init-firewall.sh and rebuild:

sudo /workspaces/claude-codespace/.devcontainer/init-firewall.sh

Claude Code Issues

Problem: "Claude Code not found"

Solution: Node.js 25+ is not supported. This template uses Node.js 24.x. Rebuild the container.

llm Issues

Problem: "No API key configured"

Solution: The default GitHub GPT-4o model should work without keys. If using other models:

# Set API keys
llm keys set anthropic
llm keys set openai

# Verify models are available
llm models list

Problem: llm command not found

Solution: Rebuild the devcontainer. The llm tool is installed during container build.

Firewall Issues

Problem: Cannot connect to a service or API not in the allow-list

Solution: The network firewall blocks unauthorized outbound connections by default. To allow a new endpoint:

Add the domain/IP to .devcontainer/init-firewall.sh in the allowed domains list
Rebuild the devcontainer to apply changes

For temporary debugging: Disable the firewall (resets on container restart):

sudo iptables -F
sudo iptables -X

General Container Issues

Problem: Tools not installed or outdated

Solution: Rebuild the devcontainer:

In VS Code: Command Palette → "Dev Containers: Rebuild Container"
In GitHub Codespaces: Rebuild from the Codespaces menu

Contributing

This is an experimental repository exploring prompt injection mitigations. We welcome:

Bug reports - File an issue describing the problem
Security feedback - Share your perspective on the defense-in-depth approach
Tool suggestions - Propose additions that align with our security model
Documentation improvements - Help make this more accessible

Note: We do not claim to have "solved" prompt injection. This template explores practical mitigation strategies using defense-in-depth principles.

Providing Feedback

GitHub Issues - For bugs, feature requests, or questions
Pull Requests - For documentation or tooling improvements
Discussions - For broader conversations about security approaches

License

MIT License - See LICENSE file for details

Built by the security-conscious developer community. Use at your own risk and always review external content before exposing it to AI agents.

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
.devcontainer		.devcontainer
.github/workflows		.github/workflows
cli-tools		cli-tools
context		context
docs		docs
tests/network		tests/network
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
justfile		justfile
package-lock.json		package-lock.json
package.json		package.json
requirements.txt		requirements.txt

License

nicomarr/safer-codespace

Folders and files

Latest commit

History

Repository files navigation

Safer Codespace

Table of Contents

Quick Start

Prerequisites

What's Included

AI Development Tools

Language Environments

Development Tools

Security Features

Continuous Integration

Choosing the Right Tool

When to use Claude Code

When to use the llm CLI tool

When to use VS Code with LLM integration

When to skip AI altogether

Usage Guides

Claude Code

llm CLI

Other Tools

Understanding the Security Model

The Threat: Prompt Injection and the "Lethal Trifecta"

Our Defense-in-Depth Approach

1. Network Firewall (Mitigates Exfiltration)

2. Content Segregation

3. Human Review (The Critical Layer)

4. Tool Selection

Why No GitHub CLI?

Learn More

Additional Resources

Development Philosophy

Security Resources

Troubleshooting

Firewall Issues

Claude Code Issues

llm Issues

Firewall Issues

General Container Issues

Contributing

Providing Feedback

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

When to use the `llm` CLI tool

Packages