A framework for managing, testing, and evaluating Large Language Model (LLM) prompts. Part of the StrongAI Gym Assistant suite. Built to help you create more reliable and robust AI applications through systematic prompt engineering, validation, and testing.
The repo also demonstrates a technique I think will become much more common - bundling a framework with prompts to generate code to use the framework.
Inspired by two main things:
- Anthropic's guidance on writing LLM applications, which is in essence 'keep it simple': use the basic API for as long as it works, and don't leap straight to heavy frameworks like LangChain, Rivet, etc. "We suggest that developers start by using LLM APIs directly: many patterns can be implemented in a few lines of code." The frameworks are great, but they can divert you into learning the framework rather than learning more directly how to use the LLM productively.
- Chris Benson/Practical AI's work on the equivalent of Unit Tests for Prompts. This gives you a solid base to build from as your apps come to contain many embedded prompts and target responses for which you need to assure quality.
- The motivation is to allow your prompts to be easily separated from your code, so you can define a distinct lifecycle for them: testing a prompt variant when you change the model is not the same as re-testing your code; it is a subset. (You do need to retest the whole thing at the end, though.)
- Simple JSON schema for prompts, initially for TypeScript but allowing portability for a Python client in future.
- Prompts can be loaded from JSON files at runtime or included as TypeScript variables at build time.
The Prompt format includes:
- an ID and version for tracking changes
- user and system prompt values
- required and optional parameters
- default values for optional parameters
- automatic validation of required parameters during template expansion
The format is compatible with Model Context Protocol (MCP), which I envisage using as soon as Cursor adds support for Prompts.
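As an illustration of the build-time option, here is a minimal sketch of a prompt defined as a TypeScript variable and served from memory. It is not taken from the package's own examples: it assumes the PromptInMemoryRepository constructor accepts an array of prompts and mirrors the PromptFileRepository API shown under Usage below, and the parameter 'type' values may be enum members rather than strings, so check the IPrompt typings.
import { PromptInMemoryRepository } from '@jonverrier/prompt-repository';
// Illustrative sketch only: a prompt held as a build-time variable, using the same fields as the JSON schema under Usage.
const sentimentPrompt = {
  id: 'sentiment-prompt-001',
  version: '1.0.0',
  name: 'Sentiment Classifier',
  systemPrompt: 'Given some text, classify it as POSITIVE or NEGATIVE.',
  systemPromptParameters: [],
  userPrompt: 'Classify the following text: {TEXT}',
  userPromptsParameters: [
    { name: 'TEXT', description: 'The text to classify', type: 'kString', required: true }
  ]
};
// Assumption: the in-memory repository takes an array of prompts and exposes the same getPrompt/expand* API as PromptFileRepository.
const repo = new PromptInMemoryRepository([sentimentPrompt]);
const prompt = await repo.getPrompt('sentiment-prompt-001');
// Required parameters are validated during expansion; expansion is assumed to fail if TEXT is omitted.
const userText = repo.expandUserPrompt(prompt!, { TEXT: 'I am happy' });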
- Standard prompts for generating unit tests (mainly covering the presence of required parameters and permutations of optional ones) and unit evals for your prompts.
- Unit testing is pretty standard, but Evals are quite new thinking. For a given prompt in your application, the 'generateEvals' prompt creates three evals:
- A base function eval: the simplest thing you can ask your model, and what you expect must appear in the response.
- Amended input to the base function that provides the same output.
- Amended input to the base function that should provide different output.
These are all very basic tests: you expect them to work 100% of the time, like unit tests, so you run them as often as you run unit tests, i.e. whenever you change your prompts.
This approach has been described by Chris Benson, an AI Strategist for Lockheed Martin, in the Practical AI Podcast. It's about 40 minutes in.
Example:
PROMPT = "Given some text, classify it as POSITIVE or NEGATIVE"
# Base Test (simplest possible test, must pass)
TEST basic_sentiment:
INPUT = "I am happy"
RESULT = evaluate(INPUT)
EXPECT RESULT = POSITIVE
# Same Result Test (different input, same expected output)
TEST similar_sentiment:
INPUT = "I am joyful"
RESULT = evaluate(INPUT)
EXPECT RESULT = POSITIVE
# Different Result Test (different input, different output)
TEST opposite_sentiment:
INPUT = "I am sad"
RESULT = evaluate(INPUT)
EXPECT RESULT = NEGATIVE
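As a rough sketch of how these evals might look once generated, here is a TypeScript version of the three cases above. It assumes Vitest as the test runner and uses the ChatDriverFactory/getModelResponse API shown in the tool-calling section below; the system prompt wording and the model's output format are assumptions, so treat this as a shape rather than a definitive implementation.
import { describe, it, expect } from 'vitest'; // assumption: Vitest as the test runner
import { ChatDriverFactory, EModel, EModelProvider, EVerbosity } from '@jonverrier/prompt-repository';

const SYSTEM_PROMPT = 'Given some text, classify it as POSITIVE or NEGATIVE. Reply with a single word.';
const chatDriver = new ChatDriverFactory().create(EModel.kLarge, EModelProvider.kOpenAI);

describe('sentiment prompt evals', () => {
  it('base eval: simplest input, expected content present', async () => {
    // Signature as in the tool-calling example below: (system, user, verbosity, history, tools)
    const response = await chatDriver.getModelResponse(SYSTEM_PROMPT, 'I am happy', EVerbosity.kMedium, [], []);
    expect(response).toContain('POSITIVE');
  });
  it('same-output eval: amended input, same expected content', async () => {
    const response = await chatDriver.getModelResponse(SYSTEM_PROMPT, 'I am joyful', EVerbosity.kMedium, [], []);
    expect(response).toContain('POSITIVE');
  });
  it('different-output eval: amended input, different expected content', async () => {
    const response = await chatDriver.getModelResponse(SYSTEM_PROMPT, 'I am sad', EVerbosity.kMedium, [], []);
    expect(response).toContain('NEGATIVE');
  });
});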
The Eval prompt:
Given the following prompt: <prompt>{prompt}</prompt>, generate a set of evaluations for the prompt in
{language} using the {framework} framework. Use the PromptInMemoryRepository API to load the prompt and
expand variables. Use the getModelResponse API to call the model. You should include three cases:
1) Very simple input and output containing known content.
2) A small change to the input that should return the same output.
3) A small change to the input that should produce different output.
Use the domain of motorsports to generate the evaluation cases.
The Test prompt:
Given the following prompt: <prompt>{prompt}</prompt>, generate a full set of unit tests for the prompt in
{language} using the {framework} framework. Cover all permutations of missing required parameters, and
variant values for both required and optional parameters.
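And here is a sketch of the kind of unit test the Test prompt aims to generate, again assuming Vitest, the PromptFileRepository API shown under Usage below, and that template expansion fails (here, throws) when a required parameter is missing:
import { describe, it, expect } from 'vitest'; // assumption: Vitest as the test runner
import path from 'path';
import { PromptFileRepository, IPrompt } from '@jonverrier/prompt-repository';

describe('template prompt - required parameter permutations', () => {
  const repo = new PromptFileRepository(path.join(__dirname, 'template-prompt.json'));

  it('fails to expand when TASK_DESCRIPTION is missing', async () => {
    const prompt = await repo.getPrompt('template-prompt-001') as IPrompt;
    // Assumption: expandSystemPrompt throws when a required parameter is omitted.
    expect(() => repo.expandSystemPrompt(prompt, { ROLE_DESCRIPTION: 'Teaches French' })).toThrow();
  });

  it('expands when all required parameters are supplied', async () => {
    const prompt = await repo.getPrompt('template-prompt-001') as IPrompt;
    const result = repo.expandSystemPrompt(prompt, {
      ROLE_DESCRIPTION: 'Teaches French',
      TASK_DESCRIPTION: "Teach the user how to say 'Hello' in French"
    });
    expect(result).toContain('Teaches French');
  });
});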
This approach (bundling a framework with prompts to generate code to use the framework) is still work in progress. I am using the prompts to generate test cases for another LLM-based application I am building, and will continue to evolve the prompts.
PromptRepository now supports tool calling functionality following the OpenAI Responses API pattern. This allows the model to call functions in a single interaction, enabling complex workflows and comprehensive responses.
- Tool Calls: The model can call one or more functions in one response
- Automatic Tool Execution: Functions are executed automatically and results fed back to the model
- Streaming Support: Tool calls work with both regular and streaming responses
- Error Handling: Robust error handling for function validation and execution
- Loop Prevention: Intelligent detection and prevention of infinite tool call loops
import { ChatDriverFactory, EModelProvider, EModel, EVerbosity } from '@jonverrier/prompt-repository';
import { IFunction, EDataType } from '@jonverrier/prompt-repository';
// Create functions following OpenAI pattern
const weatherFunction: IFunction = {
name: 'get_weather',
description: 'Get current weather for a city',
inputSchema: {
type: EDataType.kObject,
properties: {
city: { type: EDataType.kString, description: 'City name' }
},
required: ['city']
},
outputSchema: {
type: EDataType.kObject,
properties: {
temperature: { type: EDataType.kNumber, description: 'Temperature in Celsius' },
condition: { type: EDataType.kString, description: 'Weather condition' }
},
required: ['temperature', 'condition']
},
validateArgs: (args) => {
if (!args.city) throw new Error('City is required');
return args;
},
execute: async (args) => {
// Your weather API logic here
return { temperature: 22, condition: 'sunny' };
}
};
const horoscopeFunction: IFunction = {
name: 'get_horoscope',
description: "Get today's horoscope for an astrological sign",
inputSchema: {
type: EDataType.kObject,
properties: {
sign: { type: EDataType.kString, description: 'Astrological sign like Aquarius' }
},
required: ['sign']
},
outputSchema: {
type: EDataType.kObject,
properties: {
horoscope: { type: EDataType.kString, description: 'Horoscope text' }
},
required: ['horoscope']
},
validateArgs: (args) => {
if (!args.sign) throw new Error('Sign is required');
return args;
},
execute: async (args) => {
return { horoscope: `${args.sign} - Next Tuesday you will befriend a baby otter.` };
}
};
// Use tools in one interaction
const chatDriver = new ChatDriverFactory().create(EModel.kLarge, EModelProvider.kOpenAI);
const response = await chatDriver.getModelResponse(
'You are a helpful assistant with access to weather and horoscope data.',
'I am an Aquarius planning a trip to London. Can you give me the weather and my horoscope?',
EVerbosity.kMedium,
[], // message history
[weatherFunction, horoscopeFunction] // Available tools
);
// The model will automatically:
// 1. Call get_weather with city: "London"
// 2. Call get_horoscope with sign: "Aquarius"
// 3. Combine results into a comprehensive response
console.log(response);
// Streaming also supports tool calls
const iterator = chatDriver.getStreamedModelResponse(
'You are a travel assistant.',
'What\'s the weather in London, Paris, and Tokyo?',
EVerbosity.kMedium,
[],
[weatherFunction]
);
// The model will call get_weather three times (once for each city)
// and stream the final response
for await (const chunk of iterator) {
process.stdout.write(chunk);
}
// Force the model to use at least one tool
const response = await chatDriver.getModelResponseWithForcedTools(
'You must use available tools to answer questions.',
'Tell me about the weather somewhere.',
EVerbosity.kMedium,
[],
[weatherFunction]
);
// Model will be required to call get_weather before responding
The tool calling implementation follows the official OpenAI Responses API pattern:
- Input List Pattern: Messages are converted to input_list format
- Function Call Detection: Response output is scanned for function_call items
- Parallel Execution: Multiple function calls are executed in parallel
- Result Integration: Function results are added as function_call_output items
- Continuation: The process repeats until no more function calls are needed
This ensures compatibility with OpenAI's latest API patterns while providing a clean, TypeScript-native interface.
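To make the pattern concrete, here is an illustrative sketch of that loop written directly against the OpenAI Responses API. This is not the package's internals (those stay behind getModelResponse); the model name and the executors map are placeholders.
import OpenAI from 'openai';

// Illustrative sketch of the Responses API tool-call loop that the package wraps for you.
async function runToolLoop(
  client: OpenAI,
  tools: any[],
  executors: Record<string, (args: any) => Promise<any>>,
  userMessage: string
): Promise<string> {
  // Input List Pattern: messages are kept as a growing list of input items
  const input: any[] = [{ role: 'user', content: userMessage }];

  while (true) {
    const response = await client.responses.create({ model: 'gpt-4.1', input, tools });

    // Function Call Detection: scan the output for function_call items
    const calls = response.output.filter((item: any) => item.type === 'function_call');
    if (calls.length === 0) return response.output_text; // Continuation ends when no more calls are requested

    // Parallel Execution: run every requested function at once
    const results = await Promise.all(
      calls.map((call: any) => executors[call.name](JSON.parse(call.arguments)))
    );

    // Result Integration: append each call and its function_call_output to the input list, then loop
    calls.forEach((call: any, i: number) => {
      input.push(call);
      input.push({ type: 'function_call_output', call_id: call.call_id, output: JSON.stringify(results[i]) });
    });
  }
}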
PromptRepository supports file attachments and structured table JSON data for enhanced document processing. This is particularly useful when working with PDFs and other documents that contain tabular data.
- File Attachments: Upload PDF files directly to OpenAI/Azure OpenAI for document analysis
- Table JSON: Provide structured table data extracted from documents (e.g., via LlamaParse) for better fidelity than PDF extraction
- Automatic Cleanup: Files are automatically deleted after use (configurable)
- Provider Support: Works with both OpenAI and Azure OpenAI
The attachment API allows you to upload PDF files that the model can analyze. Note that the Responses API only accepts PDF files for attachments.
import { ChatWithAttachmentDriverFactory, EModelProvider, EModel, EVerbosity, IChatAttachmentContent } from '@jonverrier/prompt-repository';
import * as fs from 'fs';
// Create a driver with attachment support
const factory = new ChatWithAttachmentDriverFactory();
const driver = factory.create(EModel.kLarge, EModelProvider.kOpenAI);
// Read a PDF file
const pdfBuffer = fs.readFileSync('report.pdf');
// Create an attachment
const attachment: IChatAttachmentContent = {
filename: 'report.pdf',
mimeType: 'application/pdf',
data: pdfBuffer,
deleteAfterUse: true // Automatically delete after the request (default: true)
};
// Use the attachment in a request
const response = await driver.getModelResponse(
'You are a financial analyst.',
'Summarize the key findings from this report.',
EVerbosity.kMedium,
attachment
);
console.log(response);
If you've already uploaded a file, you can reference it by ID to avoid re-uploading:
import { IChatAttachmentReference } from '@jonverrier/prompt-repository';
// Reference an already uploaded file
const attachmentRef: IChatAttachmentReference = {
id: 'file-abc123',
deleteAfterUse: false // Keep the file for future use
};
const response = await driver.getModelResponse(
'You are a document analyst.',
'What are the main topics in this document?',
EVerbosity.kMedium,
attachmentRef
);
For better fidelity with tabular data, you can extract tables from documents using tools like LlamaParse and provide them as structured JSON. This approach provides better accuracy than PDF extraction for tables.
import { IChatTableJson } from '@jonverrier/prompt-repository';
// Table JSON extracted from a document (e.g., via LlamaParse)
const tableJson: IChatTableJson = {
name: 'Financial Report Tables',
description: 'Tables extracted from Q1 2024 financial report',
data: [
{
table: 'Revenue Summary',
page: 1,
rows: [
{ quarter: 'Q1 2024', revenue: 1250000, growth: '12%' },
{ quarter: 'Q4 2023', revenue: 1115000, growth: '8%' }
]
},
{
table: 'Expense Breakdown',
page: 2,
rows: [
{ category: 'Salaries', amount: 450000, percentage: '36%' },
{ category: 'Marketing', amount: 200000, percentage: '16%' }
]
}
]
};
// Use table JSON in a request
const response = await driver.getModelResponse(
'You are a financial analyst. Analyze the provided table data.',
'What was the revenue for Q1 2024? Calculate the total expenses.',
EVerbosity.kMedium,
undefined, // no file attachment
tableJson // table JSON as separate input
);
console.log(response);
You can use both file attachments and table JSON together for comprehensive document analysis:
// Upload the full PDF
const attachment: IChatAttachmentContent = {
filename: 'financial-report.pdf',
mimeType: 'application/pdf',
data: fs.readFileSync('financial-report.pdf')
};
// Provide extracted tables as JSON for better accuracy
const tableJson: IChatTableJson = {
name: 'Report Tables',
description: 'Tables extracted via LlamaParse',
data: [
// ... table data from LlamaParse
]
};
// Use both together
const response = await driver.getModelResponse(
'You are a financial analyst.',
'Analyze the document and the extracted tables. Provide a comprehensive summary.',
EVerbosity.kMedium,
attachment, // Full document
tableJson // Structured table data
);
- Use File Attachments when:
  - You need the model to analyze the full document context
  - The document contains mostly text with some tables
  - You want the model to understand document structure and formatting
- Use Table JSON when:
  - You need high-fidelity table data extraction
  - Tables are the primary focus of your analysis
  - You've already extracted tables using tools like LlamaParse
  - You need precise numerical calculations from tabular data
- Use Both Together when:
  - You want comprehensive document analysis with accurate table data
  - The document contains important context beyond just tables
  - You need both document understanding and precise table analysis
You can also manage file uploads and deletions manually:
// Upload a file and get a reference
const attachment: IChatAttachmentContent = {
filename: 'document.pdf',
mimeType: 'application/pdf',
data: pdfBuffer
};
const reference = await driver.uploadAttachment(attachment);
console.log(`File uploaded with ID: ${reference.id}`);
// Use the reference in multiple requests
const response1 = await driver.getModelResponse(
'Analyze this document.',
'What is the main topic?',
EVerbosity.kMedium,
reference
);
const response2 = await driver.getModelResponse(
'Analyze this document.',
'What are the key conclusions?',
EVerbosity.kMedium,
reference
);
// Delete when done
await driver.deleteAttachment(reference.id);
- File Format: Only PDF files are supported for attachments (Responses API limitation)
- Automatic Cleanup: Files uploaded inline are deleted by default after use
- Table JSON Format: Any valid JSON structure is supported (arrays, objects, nested structures)
- Provider Support: Both OpenAI and Azure OpenAI implementations support attachments and table JSON
Before installing, ensure you have the required dependencies:
- For OpenAI/Azure OpenAI: the openai package (peer dependency)
- For Google Gemini: the @google/generative-ai package (peer dependency)
Install peer dependencies as needed:
# For OpenAI/Azure OpenAI support
npm install openai
# For Google Gemini support
npm install @google/generative-ai
- Install the package from GitHub Packages:
First, configure npm to use GitHub Packages for the @jonverrier scope by creating or updating your .npmrc file:
echo "@jonverrier:registry=https://npm.pkg.github.com" >> ~/.npmrcThen install the package:
npm install @jonverrier/prompt-repository
Note: You'll need a GitHub personal access token with read:packages permission to install packages from GitHub Packages. Set it as an environment variable:
export NODE_AUTH_TOKEN=your_github_token_here
Or add it to your .npmrc file:
//npm.pkg.github.com/:_authToken=your_github_token_here
Ensure you have set your API keys as environment variables:
- OPENAI_API_KEY - Required for the OpenAI provider
- AZURE_OPENAI_API_KEY and AZURE_OPENAI_ENDPOINT - Required for the Azure OpenAI provider
- GOOGLE_GEMINI_API_KEY - Required for the Google Gemini provider
Note: You only need to set the API keys for the providers you plan to use. The package will work with any combination of providers.
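If you want to fail fast on missing configuration, a small optional check like the sketch below can help; the environment variable names are the ones listed above, and assertEnv is just a local helper, not part of the package.
// Optional local helper (not part of the package): fail fast if a provider's keys are missing.
function assertEnv(names: string[]): void {
  const missing = names.filter((name) => !process.env[name]);
  if (missing.length > 0) {
    throw new Error(`Missing environment variables: ${missing.join(', ')}`);
  }
}

// For the OpenAI provider:
assertEnv(['OPENAI_API_KEY']);
// For Azure OpenAI:
// assertEnv(['AZURE_OPENAI_API_KEY', 'AZURE_OPENAI_ENDPOINT']);
// For Google Gemini:
// assertEnv(['GOOGLE_GEMINI_API_KEY']);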
- Create a JSON file containing your prompts
[{
"id": "template-prompt-001",
"version": "1.0.0",
"name": "Template Prompt",
"systemPrompt": "You are a helpful assistant that {ROLE_DESCRIPTION}. Your task is to {TASK_DESCRIPTION}.",
"systemPromptParameters": [
{
"name": "ROLE_DESCRIPTION",
"description": "Description of the assistant's role",
"type": "kString",
"required": true
},
{
"name": "TASK_DESCRIPTION",
"description": "Description of the main task or objective",
"type": "kString",
"required": true
}
],
"userPrompt": "Please help me with {USER_REQUEST}. Consider the following context: {CONTEXT}",
"userPromptsParameters": [
{
"name": "USER_REQUEST",
"description": "The specific request or question from the user",
"type": "kString",
"required": true
},
{
"name": "CONTEXT",
"description": "Additional context or background information",
"type": "kString",
"required": false
}
]
}]
- Use the prompts in your TypeScript project:
import { PromptFileRepository, IPrompt } from "@jonverrier/prompt-repository";
import path from "path";
const fileName = "template-prompt.json";
const filePath = path.join(__dirname, fileName);
// Initialize repository with test file
const repo = new PromptFileRepository(filePath);
const promptId1 = "template-prompt-001";
// Test loading a specific prompt (expect() here comes from your test framework, e.g. Vitest)
const prompt : IPrompt | undefined = await repo.getPrompt(promptId1);
expect(prompt).toBeDefined();
let result = repo.expandSystemPrompt(prompt, {ROLE_DESCRIPTION: "Teaches French", TASK_DESCRIPTION: "Teach the user how to say 'Hello' in French"});
expect(result).toContain("Teaches French");
expect(result).toContain("Teach the user how to say 'Hello' in French");
let result2 = repo.expandUserPrompt(prompt, {USER_REQUEST: "How do I say 'Hello' in French?", CONTEXT: "The user is a beginner to French"});
expect(result2).toContain("How do I say 'Hello' in French?");
expect(result2).toContain("The user is a beginner to French");for Cursor users:
- Select your prompt and add it to the context.
- Add your codebase to the context.
- Cut and paste the prompt. When Cursor supports MCP prompts, I envisage moving the prompt to a tool.
Run the prompt and Cursor should generate a decent set of unit-test-level evals for you.
This project uses a tiered testing approach to balance comprehensive testing with CI/CD efficiency:
npm test # Run all tests (requires OpenAI API key)
npm run test:integration # Run only integration tests (requires API key)
npm run test:ci # Run unit tests only (no API key required)
Test Categories:
- Unit Tests (test:ci): Core functionality without external dependencies
  - Prompt repository operations
  - Parameter validation and templating
  - String sanitization
  - ID generation utilities
- Integration Tests (test:integration): Full LLM interactions
  - Text responses with OpenAI/Azure OpenAI
  - Embedding generation
  - Function calling
  - Streaming responses with tool support
  - Complex multi-tool scenarios
The GitHub Actions workflow runs only the unit tests to avoid requiring API keys in CI/CD, while developers run the full test suite locally before pushing to main.
Contributions are welcome! Please feel free to submit a Pull Request.
- Fork the repository
- Create your feature branch (git checkout -b feature/AmazingFeature)
- Run the full test suite locally (npm test) to ensure integration tests pass
- Commit your changes (git commit -m 'Add some AmazingFeature')
- Push to the branch (git push origin feature/AmazingFeature)
- Open a Pull Request against the development branch in the main repo
Note: The CI/CD pipeline runs basic unit tests only. Ensure your local integration tests pass before submitting PRs.
This package is automatically published to GitHub Packages when you create a release:
- Update version: npm version patch|minor|major (updates package.json and creates a git tag)
- Push changes: git push && git push --tags
- Automatic Publishing: The GitHub Actions workflow will automatically build, test, and publish the package
The package is only published on releases, not on every commit, to maintain clean version management.
Copyright (c) 2025 Jon Verrier