
[WIP] go-queue-embeddings

A minimal Go service that queues long-running embedding tasks with self hosted inference.

⚠️ Queue implementation is TODO

Live on: go-queue-embeddings.onrender.com

⚠️ This demo runs on Render.com's free tier. It may show a 502 error during initialization after periods of inactivity. If this happens, please wait a few seconds and refresh the page.

Goals:

  • Showcase concurrency patterns in Go using worker queues
  • Provide a working pipeline for document processing and embedding
  • Use modular interfaces for future extensibility
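The queue itself is still TODO, but the worker-pool concurrency pattern the goals describe can be sketched roughly as follows. All names here (`Task`, `startWorkers`) are hypothetical, not the project's actual code:

```go
package main

import (
	"fmt"
	"sync"
)

// Task represents a queued embedding job (hypothetical shape).
type Task struct {
	ID   int
	Text string
}

// startWorkers fans tasks out to n goroutines and closes the results
// channel once every worker has drained the task channel.
func startWorkers(n int, tasks <-chan Task, results chan<- string) {
	var wg sync.WaitGroup
	for i := 0; i < n; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for t := range tasks {
				// Placeholder for the real embedding call.
				results <- fmt.Sprintf("embedded task %d", t.ID)
			}
		}()
	}
	go func() {
		wg.Wait()
		close(results)
	}()
}

func main() {
	tasks := make(chan Task)
	results := make(chan string)
	startWorkers(3, tasks, results)
	go func() {
		for i := 1; i <= 5; i++ {
			tasks <- Task{ID: i, Text: "chunk"}
		}
		close(tasks)
	}()
	for r := range results {
		fmt.Println(r)
	}
}
```

Producing to an unbuffered channel and ranging over the closed results channel keeps the pipeline free of explicit locks.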

Local Development

Using Docker

  1. Build the container:
docker build -t go-queue-embeddings .
  2. Run the container (maps port 8080):
docker run -p 8080:8080 go-queue-embeddings

Prerequisites: Ensure Docker is installed on your system

Key Decisions

  • Gin, because it is popular and easy to use
  • Loosely following a hexagonal architecture approach to ensure extensibility, especially by decoupling the core logic from the embedding provider and the storage
  • Started with Ollama because it has a huge community and is optimized for different hardware out of the box
  • Saving the process state as JSON in a temp folder for this POC, but the code is extensible to save to a database or other storage in the future
  • Plan and progress are tracked in plan.md for clarity and future reference.
  • Added Supervisord to run this in a Hugging Face Space, but managing two ports inside Hugging Face caused issues, so we switched to Render.com instead. We can revisit this later, for example by using tfgo instead of Ollama.
  • Chose HTMX to maintain a lean view implementation, since the focus is the Go service. We can implement server-side React or Next.js later.



[EXPECTED] API Flow

  1. PDF Upload

    • Route: /upload
    • Receives PDF via POST request
    • Returns UUID process ID for tracking
    • Request fields:
      • pdf: PDF file (multipart/form-data)
      • chunk_strategy: Strategy for PDF text chunking
  2. Response Format

    {
      "id": "uuid-process-id",
      "status": "processing",
      "progress": 25
    }
  3. Processing Pipeline

    • PDF is divided into chunks based on strategy
    • Each chunk is sent to embedding service
    • Results saved as JSON file (<process_id>.json)
  4. Output route

    • Route: /process/<process_id>
    • Returns the JSON file with the status and, if completed, the results

Data Model

  1. Process JSON
    {
      "id": "uuid-process-id",
      "status": "processing|completed|failed",
      "progress": 75,
      "data": [
        {
          "id": "uuid-chunk-id",
          "text": "chunk text content",
          "embedding": [0.1, 0.2, ...],
          "metadata": {
            "chunk_size": 512,
            "embedding_model": "model-name"
          }
        }
      ],
      "metadata": {
        "chunk_size": 512,
        "embedding_model": "model-name"
      }
    }
