AXS (Account eXecution Service) is a high-throughput balance engine designed for systems where correctness and performance cannot be compromised—such as cryptocurrency exchanges, trading platforms, payment systems, and real-time financial applications.
It processes balance mutations in-memory, persists them via Kafka-backed event logs, and writes them to storage using batched atomic commits. AXS guarantees idempotency, ordering, and durability while providing gRPC APIs for safe ingestion and Kafka topics for downstream integration.
```mermaid
flowchart TD
    classDef client fill:#ffebee,stroke:#c62828,stroke-width:2px;
    classDef grpc fill:#e3f2fd,stroke:#1565c0,stroke-width:2px;
    classDef consumer fill:#e8f5e9,stroke:#2e7d32,stroke-width:2px;
    classDef persistent fill:#f0f4c3,stroke:#827717,stroke-width:2px;
    classDef cron fill:#fff3e0,stroke:#ef6c00,stroke-width:2px;
    classDef db fill:#f3e5f5,stroke:#6a1b9a,stroke-width:2px;

    Client((External Microservices/Clients)):::client

    subgraph Grpc_Layer [API Gateway/gRPC Layer]
        WriteAPI("Write API (Balance Command)"):::grpc
        ReadAPI("Read API (Balance Query)"):::grpc
        AckAPI("ACK API (Saga Confirmation)"):::grpc
    end

    subgraph Infra_Layer [Infrastructure Layer]
        MQ{{"Kafka Command/Event Log"}}:::persistent
        DB[("PostgreSQL Shards")]:::persistent
        Redis[("Redis Cluster/Read Cache")]:::persistent
    end

    Client -- saga: Consume Result Event --> MQ
    Client -- saga: Process Result, then Call ACK --> AckAPI
    AckAPI -- saga: Update Saga Status/ACK State --> DB
    Client --> ReadAPI
    ReadAPI -- read: Query Cache Data --> Redis
    Client --> WriteAPI
    WriteAPI -- write: Insert Ledger & Check Idempotency --> DB
    WriteAPI -- write: Produce Command Log Asynchronously --> MQ

    subgraph Consumer [Balance Consumer Group]
        subgraph Processor [Synchronous Processing Unit]
            BatchConsumer{{"Shard Consumer"}}:::consumer
            BalanceCache{{"In-Memory Balance State"}}:::consumer
        end
        subgraph AsyncProcessor [Asynchronous Persistence & Cache Update]
            FlushWorker{{"Flush Worker (DB Batcher)"}}:::consumer
            RedisWorkerPool{{"Redis Write Pool"}}:::consumer
        end
        LeaderElect{{"Leader Election (HA/ZDT)"}}:::consumer
    end

    LeaderElect -- 1. Acquire Lock (Get-Then-Set) --> Redis
    LeaderElect -- 2. Init Consumer (Read Offset, Assign Partition) --> BatchConsumer
    BatchConsumer -- 3. Read Command Messages --> MQ
    BatchConsumer -- 4. Apply Change (Single Thread) --> BalanceCache
    BatchConsumer -- 5. Produce Result Event --> MQ
    BatchConsumer -- 6. Send Changes to Flush Queue --> FlushWorker
    BatchConsumer -- 7. Send Snapshot to Redis Queue --> RedisWorkerPool
    FlushWorker -- Aggregate Results, Batch Write DB, Commit Offset --> DB
    RedisWorkerPool -- LWW with Lua Script --> Redis

    subgraph Saga_Monitoring [Saga Monitoring & Compensation]
        Cron{{"Cron Job (Audit/Retry)"}}:::cron
    end

    Cron -- Check Unacknowledged and Unproduced Status (Compensation) --> DB
    Cron -- outbox: Re-produce Failed Messages --> MQ
    Cron -- saga: Initiate Compensation/Retry Call --> Client

    DB ~~~ MQ
    MQ ~~~ BatchConsumer
    LeaderElect ~~~ BatchConsumer
```
The architecture is explained in detail in this article.
AXS provides a deterministic batch consumer built on Kafka’s poll timeout. When idle, it uses a long timeout to avoid excessive polling; once a message arrives, it switches to a short batching delay to quickly accumulate follow-up messages and prevent backlog.
Batches flush when either batchSize is reached or batchingDelay expires, keeping latency low while preserving in-order processing. Because batching happens inside the single consume loop, the design needs no extra goroutines and no locks on shared buffers.
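To make the polling strategy concrete, here is a minimal sketch of the dual-timeout loop in Go. The `poll` callback stands in for the underlying Kafka client's fetch call; its signature and the names here are illustrative, not AXS's actual API.

```go
package consumer

import "time"

// message is a placeholder for a decoded Kafka record.
type message struct{ Key, Value []byte }

// consumeBatches runs the dual-timeout batching loop: a long idleTimeout
// while the batch is empty, a short batchingDelay once a batch is open.
// poll blocks for at most the given timeout and reports whether a message
// arrived; flush applies a batch in order.
func consumeBatches(
	poll func(timeout time.Duration) (message, bool),
	flush func(batch []message),
	batchSize int,
	idleTimeout, batchingDelay time.Duration,
) {
	batch := make([]message, 0, batchSize)
	for {
		timeout := idleTimeout
		if len(batch) > 0 {
			timeout = batchingDelay // batch open: accumulate follow-ups quickly
		}
		msg, ok := poll(timeout)
		if ok {
			batch = append(batch, msg)
		}
		// Flush on a full batch, or when the batching delay expires
		// with messages pending (poll timed out while a batch was open).
		if len(batch) >= batchSize || (!ok && len(batch) > 0) {
			flush(batch)
			batch = batch[:0]
		}
	}
}
```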
## High Performance, High Availability, and Scalability (Zero-GC Cache, Leader Election, Liveness Probe)
AXS updates balances in an in-memory cache first and flushes them to the database asynchronously. It uses a zero-GC cache (BigCache-style byte-array store) that can hold millions of entries without causing garbage-collection overhead, enabling ultra-low-latency balance updates.
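As an illustration of the zero-GC approach, the sketch below uses the allegro/bigcache library, which keeps entries in large pre-allocated byte slices that the garbage collector does not traverse. The key format and fixed-width encoding are assumptions for the example.

```go
package main

import (
	"context"
	"encoding/binary"
	"fmt"
	"time"

	"github.com/allegro/bigcache/v3"
)

func main() {
	// Entries live in flat byte arrays, so millions of balances add
	// almost nothing to GC scan time.
	cache, err := bigcache.New(context.Background(),
		bigcache.DefaultConfig(10*time.Minute))
	if err != nil {
		panic(err)
	}

	// Store the balance as fixed-width bytes (minor units), not as a
	// pointer-bearing struct.
	buf := make([]byte, 8)
	binary.BigEndian.PutUint64(buf, 1_250_000)
	_ = cache.Set("balance:42", buf)

	if raw, err := cache.Get("balance:42"); err == nil {
		fmt.Println("balance:", binary.BigEndian.Uint64(raw))
	}
}
```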
A lightweight leader-election mechanism allows multiple consumers to run for the same partition while ensuring only one actively processes events. This supports zero-downtime rolling upgrades and automatic failover. If a leader detects that another node has taken ownership, it immediately reports unavailable through a liveness probe, triggering a controlled restart.
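A sketch of the lock acquisition using go-redis follows; the key layout, lease TTL, and node-ID scheme are illustrative. A node that returns false here would start failing its liveness probe so the orchestrator restarts it.

```go
package leader

import (
	"context"
	"time"

	"github.com/redis/go-redis/v9"
)

// tryLead acquires or renews leadership for one partition. It returns
// true while this node is the leader; false means another node owns the
// partition and this node should report unavailable.
func tryLead(ctx context.Context, rdb *redis.Client, partition, nodeID string, ttl time.Duration) (bool, error) {
	key := "axs:leader:" + partition

	owner, err := rdb.Get(ctx, key).Result()
	switch {
	case err == redis.Nil:
		// No current leader: claim the lock only if it is still unset.
		return rdb.SetNX(ctx, key, nodeID, ttl).Result()
	case err != nil:
		return false, err
	case owner == nodeID:
		// Still the leader: extend the lease.
		return true, rdb.Expire(ctx, key, ttl).Err()
	default:
		// Another node has taken ownership.
		return false, nil
	}
}
```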
AXS loads a user’s balance from the database on first use and keeps it resident in memory for subsequent updates. This eliminates repeated DB lookups, enabling microsecond-level update performance even under extreme load.
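The first-touch load could look like the following read-through helper; the table, key names, and encoding are hypothetical.

```go
package store

import (
	"context"
	"database/sql"
	"encoding/binary"

	"github.com/allegro/bigcache/v3"
)

type balanceStore struct {
	cache *bigcache.BigCache
	db    *sql.DB
}

// get serves from memory when possible; only the first access for a
// user pays the cost of a database round trip.
func (s *balanceStore) get(ctx context.Context, userID string) (int64, error) {
	if raw, err := s.cache.Get(userID); err == nil {
		return int64(binary.BigEndian.Uint64(raw)), nil
	}
	var amount int64
	err := s.db.QueryRowContext(ctx,
		`SELECT amount FROM balances WHERE user_id = $1`, userID).Scan(&amount)
	if err != nil {
		return 0, err
	}
	buf := make([]byte, 8)
	binary.BigEndian.PutUint64(buf, uint64(amount))
	_ = s.cache.Set(userID, buf) // resident for all later updates
	return amount, nil
}
```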
AXS applies balance updates using periodic batched SQL writes. A dedicated flush worker aggregates update requests (e.g., every 500ms) and commits them in a single atomic SQL statement. This reduces database contention, improves throughput, and guarantees correctness under load.
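A condensed sketch of such a flush worker, assuming a Postgres `balances` table; the interval, schema, and per-user aggregation policy are illustrative.

```go
package flush

import (
	"context"
	"database/sql"
	"fmt"
	"log"
	"strings"
	"time"
)

type delta struct {
	UserID string
	Amount int64
}

// flushLoop drains pending deltas and applies each tick's aggregate in
// one atomic UPDATE ... FROM (VALUES ...) statement.
func flushLoop(ctx context.Context, db *sql.DB, pending <-chan delta, interval time.Duration) {
	ticker := time.NewTicker(interval) // e.g. 500ms
	defer ticker.Stop()
	batch := make(map[string]int64)
	for {
		select {
		case <-ctx.Done():
			return
		case d := <-pending:
			batch[d.UserID] += d.Amount // pre-aggregate per user
		case <-ticker.C:
			if len(batch) == 0 {
				continue
			}
			var (
				rows []string
				args []any
				i    = 1
			)
			for user, amt := range batch {
				rows = append(rows, fmt.Sprintf("($%d::text, $%d::bigint)", i, i+1))
				args = append(args, user, amt)
				i += 2
			}
			query := `UPDATE balances AS b
			          SET amount = b.amount + v.amount
			          FROM (VALUES ` + strings.Join(rows, ",") + `) AS v(user_id, amount)
			          WHERE b.user_id = v.user_id`
			if _, err := db.ExecContext(ctx, query, args...); err != nil {
				log.Printf("flush failed, retrying next tick: %v", err)
				continue // keep the aggregated batch for retry
			}
			batch = make(map[string]int64)
		}
	}
}
```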
AXS follows a Kafka-based event sourcing / WAL design:

- Producer idempotency: implemented via the outbox pattern and DB-level unique keys.
- Consumer idempotency: each event carries an event-status record; the flush worker updates this status using optimistic locking.
- Offset consistency: Kafka offsets are committed inside the same database transaction as the batch flush, ensuring exactly-once semantics at the application level (see the sketch below).
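The offset-consistency point is the crux, so here is a minimal sketch of the pattern; the `consumer_offsets` table and the `apply` callback are assumptions for the example. The batch and its offset commit or roll back together.

```go
package flush

import (
	"context"
	"database/sql"
)

// commitBatch applies a flushed batch and advances the partition offset
// in one transaction. After a crash, the consumer resumes from the stored
// next_offset, so an event is never applied twice or skipped.
func commitBatch(ctx context.Context, db *sql.DB, partition int32, lastOffset int64, apply func(*sql.Tx) error) error {
	tx, err := db.BeginTx(ctx, nil)
	if err != nil {
		return err
	}
	defer tx.Rollback() // no-op once Commit succeeds

	// Batched balance writes plus event-status updates (optimistic lock).
	if err := apply(tx); err != nil {
		return err
	}

	// The offset rides in the same transaction as the data it covers.
	_, err = tx.ExecContext(ctx,
		`INSERT INTO consumer_offsets (partition, next_offset)
		 VALUES ($1, $2)
		 ON CONFLICT (partition) DO UPDATE SET next_offset = EXCLUDED.next_offset`,
		partition, lastOffset+1)
	if err != nil {
		return err
	}
	return tx.Commit()
}
```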
For reads, balances are served from Redis. Redis updates use Lua scripts with Last-Write-Wins (LWW) timestamps to guarantee eventual consistency under concurrent updates across worker threads.
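A sketch of what such a script could look like, invoked through go-redis; the hash layout (`val`/`ts` fields) is an assumption for illustration. The script applies a write only when its timestamp is at least as new as the stored one, so out-of-order writes from the pool cannot regress a balance.

```go
package cache

import (
	"context"

	"github.com/redis/go-redis/v9"
)

// lwwSet applies a balance snapshot only if it is not older than the
// value already stored (Last-Write-Wins on the ts field).
var lwwSet = redis.NewScript(`
local ts = tonumber(redis.call('HGET', KEYS[1], 'ts'))
if ts == nil or tonumber(ARGV[2]) >= ts then
  redis.call('HSET', KEYS[1], 'val', ARGV[1], 'ts', ARGV[2])
  return 1
end
return 0
`)

// WriteSnapshot is safe to call concurrently from the worker pool: the
// script runs atomically inside Redis, so stale writers lose cleanly.
func WriteSnapshot(ctx context.Context, rdb *redis.Client, key, value string, tsMillis int64) (bool, error) {
	n, err := lwwSet.Run(ctx, rdb, []string{key}, value, tsMillis).Int()
	return n == 1, err
}
```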
## Elegant Sharding Strategy (Shard ID for DB Partitioning, Kafka Partitioning, and Redis Hash-Tagging)
AXS uses an explicit shard_id column to drive all sharding dimensions:

- Database range partitions
- Kafka partition keys
- Redis Cluster hash tags (`{shard_id}`)
This unified sharding model provides clear operability, flexible re-sharding, and predictable data locality across storage layers.
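To show how one shard_id flows through every layer, here is an illustrative sketch; the shard count, hash choice, and key formats are assumptions, not AXS's actual values.

```go
package shard

import (
	"fmt"
	"hash/fnv"
	"strconv"
)

const shardCount = 64 // illustrative; the real count is a config choice

// shardID derives a stable shard from the user ID.
func shardID(userID string) uint32 {
	h := fnv.New32a()
	h.Write([]byte(userID))
	return h.Sum32() % shardCount
}

// KafkaKey routes every command for a shard to the same partition,
// preserving per-shard ordering.
func KafkaKey(userID string) []byte {
	return []byte(strconv.FormatUint(uint64(shardID(userID)), 10))
}

// RedisKey hash-tags the shard so all of a shard's keys land on the
// same Redis Cluster slot: {N} is the only part Redis hashes.
func RedisKey(userID string) string {
	return fmt.Sprintf("balance:{%d}:%s", shardID(userID), userID)
}

// For PostgreSQL, the same value fills the shard_id column that drives
// the range-partitioned tables, e.g.:
//   INSERT INTO ledger (shard_id, user_id, ...) VALUES ($1, $2, ...)
```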
```
.
├── cmd/          # Cobra commands: grpc, consumer (placeholders for http/cron)
├── config/       # TOML configs (copy config.example.toml to your env)
├── deployment/   # Includes Docker Compose (local) and Helm charts (Kubernetes) for deployment
├── pb/           # Protobuf definitions + generated Go code (api + event schemas)
├── pkg/          # Application code (app wiring, handlers, repositories, services, utils)
│   ├── app/        # Fx bootstrap helpers for grpc + consumer apps
│   ├── handler/    # gRPC handler and Kafka batch consumer abstraction
│   ├── infra/      # Config loading, DB/Redis/Kafka clients
│   ├── repository/ # dbdao (Postgres), redisdao, cachedao (BigCache)
│   ├── service/    # Event processor, leader election, result publisher
│   └── utils/      # Worker pool, retry helpers, etc.
├── setup/        # goose migrations, Kafka topic specs, seed tool + Dockerfile
└── stresstest/   # k6 based stress test harness (TypeScript + webpack)
```
- Go 1.24.x (per `go.mod`)
- Docker & Docker Compose (for local infra)
- `protoc`, `protoc-gen-go`, `protoc-gen-go-grpc` (only if you need to regenerate protobufs)
- Optional for stress tests: Node 20+, npm, and `go install go.k6.io/xk6@latest`
- Copy `config/config.example.toml` to `config/config.toml` and update the values as needed.
- Use the provided Makefile to build, run, and manage the service.
Stress testing is performed using k6. See the stress test documentation for details.
- Implement robust failure-handling mechanisms (e.g., introduce a Dead Letter Queue and add safeguards for resolving data inconsistencies between the database and in-memory cache when two consumers accidentally process requests for the same user).
- Add comprehensive unit tests across all layers (SQL, Redis, cache, event processing logic, etc.).
- Add request signature verification to prevent unauthorized internal requests and ensure message integrity.
- Add a cron job to resend failed produced messages (for cases where Kafka or network issues prevent successful publishing).
- Introduce an acknowledgment (ACK) mechanism to eliminate the need for separate cron jobs in individual microservices. AXS would centrally manage callback delivery and retry logic when ACKs are lost.
- Provide Helm charts for Kubernetes deployment.
- Add MySQL repository implementation (alternative persistence layer to PostgreSQL).
- Include more comments to enhance clarity and readability.
This project is a beta-stage prototype, designed primarily to demonstrate the performance characteristics of an event-driven architecture. Before using it in any production environment, please review the entire codebase thoroughly and conduct extensive testing to ensure it meets your system’s reliability, safety, and compliance requirements.