Whisper Speech-to-Text Tool

A simple, easy-to-use speech-to-text tool built on OpenAI's Whisper model, with a React frontend and a FastAPI backend.

Built with AI Vibe Coding

Python 3.8+
Node.js 18+
ffmpeg: used for audio processing
- macOS: brew install ffmpeg
- Ubuntu: sudo apt update && sudo apt install ffmpeg
- Windows: download it from the official FFmpeg website, or run choco install ffmpeg
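
The prerequisites above can be checked before setup with a short script. This is a minimal sketch, not part of the repository: `check_prerequisites` is a hypothetical helper, and it only confirms that `node` and `ffmpeg` are on PATH and that the running Python is 3.8 or newer. It does not verify the Node.js 18+ requirement.

```python
import shutil
import sys

# Minimum Python version taken from the prerequisites list above.
MIN_PYTHON = (3, 8)


def check_prerequisites(tools=("node", "ffmpeg")):
    """Return a list of problems; an empty list means every check passed."""
    problems = []
    if sys.version_info < MIN_PYTHON:
        problems.append("Python %d.%d+ is required" % MIN_PYTHON)
    for tool in tools:
        # shutil.which returns None when the executable is not on PATH.
        if shutil.which(tool) is None:
            problems.append(f"{tool} was not found on PATH")
    return problems


if __name__ == "__main__":
    issues = check_prerequisites()
    for issue in issues:
        print("Missing:", issue)
    if not issues:
        print("All prerequisites found.")
```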

Quick Start

Backend Setup

Using uv (recommended):

# Enter the backend directory
cd backend

# Create a virtual environment
uv venv

# Activate the virtual environment
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

# Install dependencies
uv pip install -r requirements.txt

# Start the backend server
python main.py  # Or with uvicorn: uvicorn main:app --reload

Using standard Python tools:

# Enter the backend directory
cd backend

# Create a virtual environment and install dependencies
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate
pip install -r requirements.txt

# Start the backend server
python main.py

Frontend Setup

# Enter the frontend directory
cd frontend

# Install dependencies
npm install

# Start the development server
PORT=4001 npm start

The frontend will run at http://localhost:4001.
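
The `PORT=4001 npm start` form is POSIX shell syntax and does not work as written in the Windows command prompt. As a sketch of one portable alternative, the launcher below sets `PORT` through the process environment instead. `frontend_env` and `start_frontend` are hypothetical helpers, not files in this repository, and the script assumes it is run from the repository root.

```python
import os
import shutil
import subprocess


def frontend_env(port=4001, base_env=None):
    """Copy the environment and set PORT, as `PORT=4001 npm start` does in a POSIX shell."""
    env = dict(os.environ if base_env is None else base_env)
    env["PORT"] = str(port)
    return env


def start_frontend(frontend_dir="frontend", port=4001):
    """Run `npm start` in frontend_dir with PORT set; blocks until the dev server exits."""
    # shutil.which honours PATHEXT, so it resolves npm.cmd on Windows.
    npm = shutil.which("npm")
    if npm is None:
        raise FileNotFoundError("npm was not found on PATH")
    return subprocess.run([npm, "start"], cwd=frontend_dir, env=frontend_env(port), check=True)
```

Calling `start_frontend()` should then be equivalent to the command above on any platform.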

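Usage

Make sure both the backend and the frontend are running
Open a browser and go to http://localhost:4001
Upload an audio file through the interface, or record directly
When recording finishes, transcription starts automatically
The transcription result is displayed on the page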
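
The first step, confirming that both services are up, can be scripted. The sketch below polls a URL until something answers. `wait_until_up` is a hypothetical helper. The frontend URL comes from this README; the backend address is not stated here, so `http://localhost:8000` (uvicorn's default port) is only an assumption.

```python
import time
import urllib.error
import urllib.request


def wait_until_up(url, timeout=30.0, interval=1.0):
    """Poll url until it returns any HTTP response, or give up after timeout seconds."""
    deadline = time.monotonic() + timeout
    while True:
        try:
            with urllib.request.urlopen(url, timeout=interval):
                return True
        except urllib.error.HTTPError:
            # The server answered, just not with a 2xx status, so it is up.
            return True
        except (urllib.error.URLError, OSError):
            # Nothing is listening yet; retry until the deadline passes.
            if time.monotonic() >= deadline:
                return False
            time.sleep(interval)
```

For example, `wait_until_up("http://localhost:4001")` returns True once the frontend dev server responds, and the same call with the assumed backend address checks the API server.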

Features

Upload audio files or record directly
Real-time audio waveform visualization
Fast, accurate speech recognition using the Whisper "turbo" model
Modern UI design (Tailwind CSS + shadcn/ui)
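
For readers curious how the "turbo" model is typically called, here is a minimal sketch using the openai-whisper package. It is not the repository's backend code: `transcribe_file` is a hypothetical helper, and it assumes an openai-whisper release recent enough to ship the "turbo" model, plus ffmpeg on PATH for audio decoding.

```python
def transcribe_file(audio_path, model=None, model_name="turbo"):
    """Return the transcribed text for the audio file at audio_path.

    model may be any object with a Whisper-style transcribe(path) method
    that returns a dict containing a "text" key. When it is omitted, the
    model is loaded through the openai-whisper package.
    """
    if model is None:
        # Imported lazily so the function can be tried with a stub model
        # on machines where openai-whisper is not installed.
        import whisper

        model = whisper.load_model(model_name)
    result = model.transcribe(str(audio_path))
    return result["text"].strip()
```

Loading the model is the slow part, so a long-running server would usually load it once at startup and pass it in through the `model` argument rather than reloading it per request.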

