Stars
Java-App for creating and validating Factur-X / ZuGFeRD / X-Rechnung invoices conforming with EU-Norm EN 16931.
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…
General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.
The Ultimate Express. Fastest http server with full Express compatibility, based on µWebSockets.
A smart source code globber that collects relevant code and puts it into your agent.md: npx update-agents-md
A modern shell with functional programming synatx.
In fluid dynamics, an eddy is the swirling of a fluid and the reverse current created when the fluid is in a turbulent flow regime.
Self-Hosting Guide. Learn all about locally hosting (on premises & private web servers) and managing software applications by yourself or your organization. Including Cloud, LLMs, WireGuard, Automa…
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
CrisprWhisper for Mac - MLX - inference code and 8b quants
The open source Zapier alternative. Build workflow automation without spending time and money.
Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.
An extremely fast implementation of whisper optimized for Apple Silicon using MLX.
A nearly-live implementation of OpenAI's Whisper.
Simultaneous speech-to-text model
Build your personal knowledge base with Trilium Notes
OCR model that handles complex tables, forms, handwriting with full layout.
Voice Activity Detector (VAD) : low-latency, high-performance and lightweight
Virtual whiteboard for sketching hand-drawn like diagrams





