
aTrain
A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning models.
The Lens
aTrain turns speech recordings into text on your own machine, with no cloud upload and no subscription. It runs OpenAI's Whisper model locally for transcription in 99 languages and adds speaker diarization (working out who said what) through pyannote. It is a real desktop app with installers on the Microsoft Store and Flathub, not a script you have to babysit. AGPL-3.0, fully free.
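To make "who said what" concrete, here is a minimal sketch of one common way to combine a transcript with diarization output: give each transcript segment the speaker whose turn overlaps it most in time. This is an illustration under stated assumptions, not aTrain's or pyannote's actual code; the `Span` type, the `assign_speakers` helper, and the sample data are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Span:
    start: float  # seconds from the beginning of the recording
    end: float    # seconds from the beginning of the recording
    label: str    # transcript text, or a speaker name for a turn


def overlap(a: Span, b: Span) -> float:
    """Seconds of time shared by two spans (0.0 if they do not overlap)."""
    return max(0.0, min(a.end, b.end) - max(a.start, b.start))


def assign_speakers(segments, turns, unknown="UNKNOWN"):
    """Pair each transcript segment with the speaker turn it overlaps most."""
    result = []
    for seg in segments:
        best = max(turns, key=lambda t: overlap(seg, t), default=None)
        # Fall back to a placeholder when no turn overlaps the segment at all.
        speaker = best.label if best and overlap(seg, best) > 0 else unknown
        result.append((speaker, seg.label))
    return result


# Hypothetical data: two transcript segments and two speaker turns.
segments = [
    Span(0.0, 4.0, "How did the project start?"),
    Span(4.2, 9.0, "It began as a side experiment."),
]
turns = [Span(0.0, 4.1, "SPEAKER_00"), Span(4.1, 9.5, "SPEAKER_01")]

for speaker, text in assign_speakers(segments, turns):
    print(f"{speaker}: {text}")
```

Real recordings have overlapping speech and imprecise boundaries, so a maximum-overlap rule like this is only a starting point.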
Because everything runs on your device, nothing you record leaves your computer, which is the whole point for anyone handling interview, medical, or legal audio. Speed depends on hardware: per the project's README, the best model takes roughly three times the audio length on a current laptop CPU, while an NVIDIA GPU with the CUDA toolkit installed cuts that to about 20% of the audio length. It exports straight into MAXQDA, ATLAS.ti, and NVivo, so qualitative researchers are clearly the target audience.
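The "roughly three times the audio length" figure translates directly into a wait time. The sketch below does that arithmetic; the function name and the 45-minute interview are hypothetical, and the factor is a rough planning number rather than a benchmark for any particular machine.

```python
def estimated_processing_minutes(audio_minutes: float, time_factor: float) -> float:
    """Estimate processing time as audio length multiplied by a time factor.

    A time_factor of 3.0 means processing takes three times the audio length.
    """
    if audio_minutes < 0 or time_factor <= 0:
        raise ValueError("audio_minutes must be >= 0 and time_factor must be > 0")
    return audio_minutes * time_factor


# Hypothetical example: a 45-minute interview at three times the audio length.
interview_minutes = 45
print(estimated_processing_minutes(interview_minutes, 3.0))  # 135.0
```

Swap in a factor measured on your own hardware to plan batch jobs more realistically.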
Weigh this against Otter.ai, Rev, and Trint, which are faster and need no setup but send your audio to their servers and bill you monthly. If privacy matters, or you transcribe enough hours that subscriptions add up, aTrain wins outright. Solo researchers and journalists can install it and stop paying per minute. Teams with sensitive recordings may find local processing is the only option compliance allows.
The catch: local means your hardware is the bottleneck. Without a decent GPU, long recordings take real time, and accuracy still depends on audio quality the way every transcription tool does. It trades a monthly bill for your own patience and a CUDA install.
Free vs Self-Hosted vs Paid
Free tier: Everything. AGPL-3.0, no subscription, no per-minute fees.
Self-hosted / local: Runs entirely on your device. GUI installers on the Microsoft Store and Flathub, or pip for headless use. NVIDIA GPU plus CUDA toolkit optional but needed for real speed.
Paid: None. You trade a monthly transcription bill for your own hardware and setup time.
Completely free and open source (AGPL-3.0). Your only cost is hardware, including a GPU if you want speed.
Similar Tools

the subtitle editor :)

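A local-first, all-in-one desktop subtitle tool with 6 built-in ASR engines, GPU acceleration on all platforms, and 17+ translation providers. It covers the full workflow of audio and video transcription, translation, proofreading, and subtitle burn-in and muxing, and runs on Windows, macOS, and Linux.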

Robust Speech Recognition via Large-Scale Weak Supervision

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

Transcribe on your own!
License: GNU Affero General Public License v3.0
Must share source even for SaaS/network use. Strongest copyleft.
Commercial use: ✓ Yes
About
- Owner: aTrain Development Team (Organization)
- Stars: 1,166
- Forks: 87