Transcription
AI-powered speech-to-text with two processing modes for flexibility and accuracy
Overview
Notably offers two transcription modes to fit different workflows. Real-time transcription shows text as you speak during recording, while post-recording transcription uses more powerful models for higher accuracy after recording ends.
All transcription processing happens locally on your device using Whisper AI models. No audio data is ever sent to the cloud.
Transcription Modes
Real-time Transcription
Live transcription during recording using lightweight Whisper models. Text appears as you speak, providing immediate feedback.
- Fast processing: Uses optimized lightweight models (Tiny, Base, Small)
- Live feedback: See transcription text as recording progresses
- Lower latency: Minimal delay between speech and text display
- Good accuracy: Suitable for most general meetings and notes
Post-recording Transcription
High-accuracy transcription after recording completes using more powerful models. Can be automatic or triggered manually.
- Higher accuracy: Uses larger models (Medium, Large) for better results
- Automatic or manual: Configure to run automatically or generate on demand
- Better punctuation: More accurate capitalization and sentence structure
- Technical terms: Improved recognition of specialized vocabulary
Language Support
Notably supports automatic language detection and 13 specific languages for transcription:
Use auto-detect to let Notably identify the language automatically, or select a specific language for improved accuracy.
Transcript Viewer
View and interact with transcriptions using an advanced segment-based display:
- ✓ Segment display: Transcriptions organized into logical segments with timestamps
- ✓ Word highlighting: Individual words highlight as they're spoken during playback
- ✓ Jump to segment: Click any segment to jump to that point in the recording
- ✓ Sync to playback: Transcript scrolls automatically as recording plays
Re-transcribe Recordings
Generate new transcriptions anytime with different models or settings:
- Switch to a larger model for improved accuracy
- Try a different language setting if auto-detect was incorrect
- Re-process after downloading a new model version
- Keep multiple transcription versions for comparison
Original recordings remain unchanged - re-transcription creates a new transcript without affecting existing ones.
Privacy & Local Processing
All transcription processing happens entirely on your device:
- ✓ Whisper models downloaded and stored locally
- ✓ Audio never leaves your device
- ✓ Transcription text stored in local database
- ✓ No internet connection required after model download
- ✓ Complete data ownership and control
Model Selection Guide
Tiny / Base
Best for real-time transcription. Fast, lightweight, good for general meetings.
Small
Balanced option. Better accuracy than Tiny/Base while still fast enough for real-time use.
Medium
High accuracy for post-recording. Good for important meetings, technical discussions.
Large
Highest accuracy. Best for critical content, difficult audio, or specialized vocabulary.