Voice Notes
Never miss a detail again. Capture, transcribe, label speakers, and summarize meetings, lectures, or ideas with word-level timestamps, with all audio processed on your device.
Recording Audio & Import
Navigate to the AI Voice Note tab and tap the record button to start capturing audio. The app uses either WhisperKit or Apple STT for on-device speech recognition.
- Real-time transcription: Text appears as you speak
- Background recording: On iOS, recording continues when the app is in the background
- Import audio (PRO): Turn your past recordings into searchable, summarizable text with clear, step-by-step progress tracking so you're never left guessing.
All audio processing happens on your device, using WhisperKit or Apple's Speech framework. No audio data is sent to any server.
Transcription
Transcription runs locally with Whisper models or Apple STT, depending on your selected model and workflow. Features include:
- Word-level timestamps: Each word is timestamped for precise navigation
- Multiple languages: Support for many languages including English, Chinese, Japanese, Spanish, French, and more
- Automatic punctuation: The model adds punctuation and sentence structure
Speaker Diarization (PRO)
Automatically label who said what in your voice notes and imported recordings.
- Model download: Download speaker models once directly from the Voice Note UI.
- Label Speakers toggle: Turn on diarization to instantly split transcripts by speaker.
- Precision matching: Advanced tuning controls help improve speaker label accuracy, including when people talk over each other.
- Display Speakers toggle: Switch cleanly between speaker labels and timestamp-only views.
- Persisted labels: Speaker labels are securely saved and instantly restored when you reopen a recording.
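For readers curious how speaker labels can be attached to a transcript, here is a minimal sketch. It assumes the transcriber yields words with start and end times and the diarization model yields speaker segments with time ranges; each word is given the speaker whose segment overlaps it most. The names `Word`, `SpeakerSegment`, `label_words`, and `group_turns` are hypothetical and chosen for illustration. This is not the app's actual implementation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    text: str
    start: float  # seconds from the beginning of the recording
    end: float


@dataclass(frozen=True)
class SpeakerSegment:
    speaker: str
    start: float
    end: float


def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the intersection of two time ranges (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def label_words(words, segments):
    """Pair each word with the speaker whose segment overlaps it most (hypothetical helper)."""
    labeled = []
    for w in words:
        best_speaker, best_overlap = None, 0.0
        for s in segments:
            o = overlap(w.start, w.end, s.start, s.end)
            if o > best_overlap:
                best_speaker, best_overlap = s.speaker, o
        labeled.append((w.text, best_speaker))  # None if no segment overlaps
    return labeled


def group_turns(labeled):
    """Merge consecutive words from the same speaker into (speaker, text) turns."""
    turns = []
    for text, speaker in labeled:
        if turns and turns[-1][0] == speaker:
            turns[-1] = (speaker, turns[-1][1] + " " + text)
        else:
            turns.append((speaker, text))
    return turns


# Illustrative data: the two speaker segments overlap between 0.9 s and 1.0 s.
segments = [
    SpeakerSegment("Speaker 1", 0.0, 1.0),
    SpeakerSegment("Speaker 2", 0.9, 2.0),
]
words = [
    Word("Hello", 0.0, 0.4),
    Word("there", 0.4, 0.95),
    Word("hi", 0.95, 1.3),
]

for speaker, text in group_turns(label_words(words, segments)):
    print(f"{speaker}: {text}")
```

Picking the largest overlap is one simple way to handle moments where two people talk at once; a real diarization pipeline may use different or additional rules.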
Re-Transcription
Use Re-Transcript to regenerate transcript text from existing audio with Whisper or Apple STT.
- Model switching: Re-run with a different transcription model for better results
- Optional speaker labeling: Apply diarization automatically after re-transcription
- Safe overwrite: New transcript and speaker labels replace older results for the same audio
AI Processing
After transcription, you can process the text with AI for:
- Summarization: Get concise summaries of meetings or lectures
- Translation: Translate the transcription to another language
- Key points: Extract action items and key takeaways
- Speaker-aware analysis: When a transcript has speaker labels, the AI assistant can use them. Ask it to "Summarize Sarah's points" or "List action items for John" to get results for a specific speaker.
- Custom processing: Use any prompt to analyze your transcript however you need.
Word-Level Navigation
Tap any word in the transcription to jump to that exact moment in the recording. This makes it easy to:
- Verify specific quotes or statements
- Re-listen to important sections
- Navigate long recordings efficiently
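As a rough illustration of how word-level timestamps make this possible, the sketch below maps a tapped word to a playback position, and maps a playback position back to the word being spoken (useful for highlighting during playback). The `Word` type and the `seek_time_for_word` and `word_index_at` helpers are hypothetical names used only for this example; the app's internals may differ.

```python
from bisect import bisect_right
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    text: str
    start: float  # seconds from the beginning of the recording
    end: float


def seek_time_for_word(words, index):
    """Playback position to jump to when the word at `index` is tapped."""
    return words[index].start


def word_index_at(words, position):
    """Index of the last word that starts at or before `position`, or None.

    Assumes `words` is sorted by start time, as a transcript normally is.
    """
    starts = [w.start for w in words]
    i = bisect_right(starts, position) - 1
    return i if i >= 0 else None


# Illustrative transcript with a short pause before the last word.
words = [
    Word("Ship", 0.00, 0.35),
    Word("the", 0.35, 0.50),
    Word("release", 0.50, 1.10),
    Word("Friday", 1.40, 1.95),
]

print(seek_time_for_word(words, 2))  # tapping "release" seeks to 0.5
print(word_index_at(words, 1.2))     # at 1.2 s the most recent word is index 2
```

Binary search keeps the lookup fast even for recordings with many thousands of words.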
Organization & Renaming
Keep your ideas and meetings organized.
- Custom naming: Rename any recording so you can identify important lectures, interviews, and brainstorms at a glance.
Language Support
Whisper and Apple STT both support transcription in many languages. You can let the model detect spoken language automatically or set it manually for better accuracy with specific languages.
For best transcription quality, record in a quiet environment and speak clearly. An external microphone can also improve accuracy.