AI / LLM Settings

Configure the LLM used for summaries, decoration, and Ask AI chat. Echosy supports both an on-device model (MLX, Apple Silicon only) and any OpenAI-compatible cloud provider. You can pick a different model per task: for example, on-device for private summaries and cloud for long-context chat.

On-Device LLM (MLX)

Built into Echosy on Apple Silicon. The model downloads once and runs locally: no API key, no network, no rate limits. Available choices:

| Model | Size | Notes |
| --- | --- | --- |
| Gemma 4 E4B (4-bit) | ~3 GB | Default; fast, solid quality |
| Qwen 3.5 4B (4-bit) | ~3 GB | Strong multilingual, especially Chinese / Japanese |
| Gemma 4 E12B (4-bit) | ~8 GB | Higher quality, slower; needs 16 GB RAM |

Pick a model in Settings → AI / LLM and click Download. Models stream in with a progress bar, and you can cancel and resume the download. Once downloaded, the model stays on disk and is never re-downloaded across launches. You can switch models at any time.

Cloud Provider

Use any OpenAI-compatible endpoint. Each provider auto-fills the API endpoint:

| Provider | Notes |
| --- | --- |
| OpenAI | GPT-4o, GPT-4o-mini, etc. |
| Gemini | Google's Gemini models |
| Claude | Anthropic's Claude models |
| Groq | Fast inference for open-source models |
| DeepSeek | DeepSeek models |
| OpenRouter | Multi-provider gateway; access to many models |
| Ollama | Free, local LLM runner; no API key needed |
| Custom | Any OpenAI-compatible endpoint |
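Every provider above speaks the same OpenAI-compatible chat-completions protocol; only the base URL and API key differ. A minimal sketch of what such a request looks like (the base URLs listed are the providers' commonly documented OpenAI-compatible endpoints and may change, so treat them as assumptions, and the request is only built here, not sent):

```python
import json
import urllib.request

# Commonly documented OpenAI-compatible base URLs (assumptions; verify
# against each provider's own docs before relying on them).
BASE_URLS = {
    "openai": "https://api.openai.com/v1",
    "groq": "https://api.groq.com/openai/v1",
    "deepseek": "https://api.deepseek.com",
    "openrouter": "https://openrouter.ai/api/v1",
    "ollama": "http://localhost:11434/v1",  # local server, no API key
}

def build_chat_request(base_url: str, api_key: str,
                       model: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-compatible /chat/completions request."""
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/chat/completions",
        data=json.dumps({
            "model": model,
            "messages": [{"role": "user", "content": prompt}],
        }).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )

req = build_chat_request(BASE_URLS["ollama"], "unused", "llama3", "Hello")
print(req.full_url)  # http://localhost:11434/v1/chat/completions
```

Because the request shape is identical everywhere, switching providers in Echosy only changes the endpoint, key, and model name.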

API Key

Your API key for the selected cloud provider. Not needed for on-device or Ollama. The key is stored in your local settings file and is never sent anywhere except to the provider's API endpoint.

API Endpoint

Auto-filled based on your selected provider. You can customize this for self-hosted setups, reverse proxies, or custom OpenAI-compatible servers.

Model

The model name to use (e.g., gpt-4o, gemini-pro, llama3). For on-device, pick from the built-in list. For Ollama, a dropdown shows all locally installed models.
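Ollama exposes its locally installed models over a small HTTP API, so a dropdown like Echosy's can be populated with one request to GET /api/tags on the local server. A sketch under that assumption (not Echosy's actual code):

```python
import json
import urllib.request

def parse_model_names(tags_payload: dict) -> list:
    """Extract model names from an Ollama /api/tags response body."""
    return [m["name"] for m in tags_payload.get("models", [])]

def list_ollama_models(base: str = "http://localhost:11434") -> list:
    """Fetch the locally installed models from a running Ollama server."""
    with urllib.request.urlopen(base.rstrip("/") + "/api/tags", timeout=5) as resp:
        return parse_model_names(json.load(resp))
```

If the server is running, list_ollama_models() returns names like "llama3:latest" that can be used directly as the Model value.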

Test Connection

Sends a test request to verify your API key and endpoint work correctly. A success message confirms the connection, or an error message helps you diagnose the issue.

Custom Summary Prompt (Pro)

Customize how the AI generates summaries. The default prompt produces structured meeting minutes. Examples:

  • "Create meeting minutes with action items and decisions"
  • "Write a brief executive summary in bullet points"
  • "Extract key takeaways and follow-up tasks"
  • "Summarize in the same language as the transcript"

Privacy tip: the on-device model runs entirely on your Mac; nothing is sent to any server. Pick it when the content is sensitive (legal, medical, confidential meetings, personal notes).
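Conceptually, the custom prompt takes the place of the system message that accompanies the transcript. A sketch of how such a request might be assembled (the message layout and the default prompt text are assumptions, not Echosy's exact wiring):

```python
import json

# Placeholder for the built-in default prompt (assumption, for illustration).
DEFAULT_PROMPT = "Create structured meeting minutes."

def build_summary_payload(transcript: str, custom_prompt: str = "",
                          model: str = "gpt-4o") -> dict:
    """Assemble an OpenAI-style chat payload: the summary prompt becomes
    the system message, the transcript becomes the user message."""
    return {
        "model": model,
        "messages": [
            {"role": "system", "content": custom_prompt or DEFAULT_PROMPT},
            {"role": "user", "content": transcript},
        ],
    }

payload = build_summary_payload(
    "Alice: let's ship Friday. Bob: I'll update the docs.",
    custom_prompt="Extract key takeaways and follow-up tasks",
)
print(json.dumps(payload, indent=2))
```

Leaving the custom prompt empty falls back to the default, which is why clearing the field restores the standard meeting-minutes style.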
