Speaky

Voice-to-text for macOS, powered by on-device AI
Press a hotkey, speak, and the transcription is pasted at your cursor.

Download Latest Release

Download

Build	Architecture	Default Engine
Speaky-Apple-Silicon.dmg	Apple Silicon (M1/M2/M3/M4)	Parakeet V3
Speaky-Intel.dmg	Intel (x86_64)	Whisper Medium Q5

Installation

Download the DMG for your Mac.
Open the DMG and drag Speaky to the Applications folder.
Before first launch, open Terminal and run:
```
xattr -cr /Applications/Speaky.app
```
Open Speaky from Applications. On first launch, you may need to right-click > Open.

Why is this needed? Speaky is open-source and not notarized with Apple ($99/year requirement). The xattr command tells macOS you trust this app. You only need to do this once.

Features

Customizable Hotkey — Set any keyboard shortcut you want to start/stop recording from anywhere
Push-to-Talk & Hands-Free — Hold to record, or tap to toggle
Local Transcription — The app ships with default models (Parakeet V3 for Apple Silicon, Whisper Medium Q5 for Intel), but you can download any model from the built-in list or import your own custom Whisper model
Cloud Transcription — Optionally use Groq Whisper API for fast cloud-based transcription with your own API key
Auto-Paste — Transcribed text is pasted at your cursor automatically
Smart Text Cleanup — Removes filler words, fixes capitalization and spacing
Sound Effects — Audio cues when recording starts and transcription completes (can be disabled in Settings)
Speaky Mascot — Animated character shows recording/transcribing state in the notch and main window
Dynamic Notch Overlay — Live waveform, timer, and Speaky animation in the macOS notch
System Audio Muting — Optionally mutes system audio while recording
Multi-Model Support — Download and switch between 10+ transcription models
Custom Model Import — Import your own Whisper .bin models

Supported Models

The app comes with a default model pre-downloaded during onboarding, but you can switch to any of these at any time:

Model	Type	Size	Speed	Accuracy	Platform
Parakeet V3	Local (CoreML)	~494 MB	5/5	5/5	Apple Silicon
Whisper Medium Q5	Local (whisper.cpp)	~539 MB	3/5	4/5	Both
Groq Whisper	Cloud (API)	—	5/5	5/5	Both
Whisper Medium	Local (whisper.cpp)	~1.5 GB	2/5	4/5	Both
Whisper Small	Local (whisper.cpp)	~466 MB	4/5	3/5	Both
Whisper Small Q5	Local (whisper.cpp)	~190 MB	4/5	3/5	Both
Whisper Base	Local (whisper.cpp)	~142 MB	5/5	2/5	Both
Whisper Base Q5	Local (whisper.cpp)	~60 MB	5/5	2/5	Both
Whisper Tiny	Local (whisper.cpp)	~75 MB	5/5	1/5	Both
Whisper Large v1	Local (whisper.cpp)	~2.9 GB	1/5	4/5	Both
Whisper Large v2	Local (whisper.cpp)	~2.9 GB	1/5	5/5	Both

You can also import any custom Whisper .bin model via Settings > Advanced > Import Custom Whisper Model.

Build from Source

Speaky uses XcodeGen to generate the Xcode project.

# Install xcodegen
brew install xcodegen

# Generate project and build
xcodegen generate
xcodebuild -project Speaky.xcodeproj -scheme Speaky -configuration Release build

# Or use the build script for release builds
./build.sh              # Universal binary
./build.sh silicon      # Apple Silicon only
./build.sh intel        # Intel only
./build.sh separate     # Both architectures + DMGs

Architecture

Speaky/
├── SpeakyApp.swift                 # App entry point
├── AppState.swift                  # Central state (@Observable, @MainActor)
├── AppDelegate.swift               # Menu bar setup
├── Models/
│   ├── Settings.swift              # App preferences (UserDefaults)
│   ├── Transcription.swift         # SwiftData model
│   └── TranscriptionModel.swift    # Model metadata + availability
├── Services/
│   ├── AudioRecorder.swift         # AVAudioEngine → 16kHz mono WAV
│   ├── AudioControlService.swift   # System volume mute/unmute
│   ├── SoundEffectService.swift    # Start/end recording sound effects
│   ├── PasteService.swift          # CGEvent-based Cmd+V paste
│   ├── HotkeyManager.swift         # Global keyboard shortcuts
│   ├── ModelManager.swift          # Model download + cache
│   ├── TextCleanupService.swift    # Filler word removal
│   ├── CleanupService.swift        # Auto-delete old transcriptions
│   ├── DeviceGuard.swift           # Audio device disconnect protection
│   └── Transcription/
│       ├── TranscriptionEngine.swift   # Protocol
│       ├── WhisperEngine.swift         # Local (whisper.cpp)
│       ├── ParakeetEngine.swift        # Local (CoreML, Apple Silicon)
│       └── GroqEngine.swift            # Cloud (Groq API)
├── Utilities/
│   ├── Constants.swift             # App-wide constants
│   ├── Theme.swift                 # Colors, gradients, button styles
│   ├── KeychainHelper.swift        # Secure API key storage
│   ├── AudioFileLoader.swift       # Audio file loading
│   ├── AudioLevelMonitor.swift     # Real-time audio levels
│   └── WAVWriter.swift             # WAV file encoding
├── Views/
│   ├── MainWindow/
│   │   ├── MainWindowView.swift    # Main app window with Speaky animation
│   │   └── SettingsView.swift      # Settings + model management
│   ├── MenuBar/
│   │   └── MenuBarView.swift       # Menu bar dropdown
│   ├── Onboarding/                 # First-launch setup flow
│   ├── Overlay/
│   │   └── NotchOverlayView.swift  # Dynamic notch recording UI
│   └── Shared/
│       ├── SpeakyAnimation.swift   # Animation state enum
│       ├── SpeakyAnimationView.swift # APNG animation player
│       └── LanguagePicker.swift    # Language selection
└── Resources/
    ├── Speaky/                     # APNG mascot animations
    ├── Sounds/                     # Start/end sound effects
    └── Assets.xcassets/            # App icon + colors

Dependencies

Package	Purpose
SwiftWhisper	whisper.cpp Swift wrapper
KeyboardShortcuts	Global hotkey recording
DynamicNotchKit	Notch overlay UI
FluidAudio	Parakeet engine (Apple Silicon)

Requirements

macOS 15.0+ (Sequoia)
Microphone permission
Accessibility permission (for auto-paste)

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
.github		.github
Speaky		Speaky
SpeakyTests		SpeakyTests
docs		docs
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app-icon.png		app-icon.png
build.sh		build.sh
project.yml		project.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speaky

Download

Installation

Features

Supported Models

Build from Source

Architecture

Dependencies

Requirements

License

About

Uh oh!

Releases 5

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Speaky

Download

Installation

Features

Supported Models

Build from Source

Architecture

Dependencies

Requirements

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 5

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages