Speaker Diarization Pro
Pr.Germux
PLN 399.99

Transform any mono or stereo recording into isolated speaker stems, subtitles, and timeline files for podcasts, interviews, post-production, and research workflows. With a single plug-in instance, Speaker Diarization Pro uses embedded diarization model assets to detect speaker boundaries and export per-voice outputs, saving hours of manual editing.


What’s New?

1.0.0

Initial release on MuseHub

Description

Automatically split mixed-speaker audio into separate tracks, right inside your DAW. Transform any mono or stereo recording into isolated speaker stems, subtitles, and timeline files for podcasts, interviews, post-production, and research workflows. With a single plug-in instance, Speaker Diarization Pro uses embedded diarization model assets to detect speaker boundaries and export per-voice outputs, saving hours of manual editing.

Key Features:
• Advanced Speaker Segmentation (1 to 20): Choose the number of speakers from 1 to 20, or enable Auto mode for speaker-count detection.
• Expanded Pro Input Formats: Pro supports WAV, MP3, AIFF/AIF, FLAC, and OGG. Basic supports WAV only.
• Higher Speaker-Identity Accuracy vs. the first Basic version (192-dim): Pro uses full 512-dimensional speaker embeddings. That is about 167% more embedding dimensions (512 vs 192), and it removes the earlier truncation that discarded about 63% of each embedding. In practice, diarization quality is more stable on difficult multi-speaker recordings.
• Pro Controls for Cleaner Turns: Adjust sensitivity, minimum segment length, and merge gap for better speaker boundary behavior.
• Hardware Modes: Run Auto hardware mode (GPU when available, with CPU fallback) or force CPU-only mode.
• Multi-Export Workflow: Export WAV stems, SRT subtitles, and a CSV diarization timeline in one run.
• Fully Local Processing: Runs inside your DAW with no cloud upload and no external app round-trip.
• Pro vs Basic (Quick Contrast):
    Basic:
      Input formats: WAV only
      Max speakers: Up to 10
      Exports: WAV stems
    Pro:
      Input formats: WAV, MP3, AIFF/AIF, FLAC, OGG
      Max speakers: Up to 20 + Auto mode
      Exports: WAV stems, SRT, CSV

How It Works:
1. Open the Speaker Diarization Pro plug-in in your DAW.
2. Browse to your recording file (WAV, MP3, AIFF/AIF, FLAC, or OGG) and set the speaker count.
3. Adjust sensitivity, minimum segment length, merge gap, and hardware mode as needed.
4. Run processing and export the selected outputs automatically.
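
To give a feel for what the minimum segment length and merge gap controls in step 3 typically do, here is a minimal Python sketch. It is not the plug-in's actual implementation: the `Segment` type, the `clean_segments` function, and the default values are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    speaker: str
    start: float  # seconds
    end: float    # seconds


def clean_segments(segments, min_len=0.5, merge_gap=0.3):
    """Merge same-speaker turns split by a short gap, then drop short turns.

    min_len and merge_gap are in seconds; the defaults are assumptions,
    not the plug-in's real defaults.
    """
    merged = []
    for seg in sorted(segments, key=lambda s: s.start):
        prev = merged[-1] if merged else None
        if prev and prev.speaker == seg.speaker and seg.start - prev.end <= merge_gap:
            # Same speaker resumes after a short pause: extend the previous turn.
            prev.end = max(prev.end, seg.end)
        else:
            merged.append(Segment(seg.speaker, seg.start, seg.end))
    # Discard turns that are still shorter than the minimum length.
    return [s for s in merged if s.end - s.start >= min_len]


raw = [
    Segment("S1", 0.0, 2.0),
    Segment("S1", 2.2, 4.0),   # short pause, merged into the first turn
    Segment("S2", 4.1, 4.3),   # too short, dropped
    Segment("S2", 5.0, 7.0),
]
cleaned = clean_segments(raw)
```

In this sketch a larger merge gap produces fewer, longer turns, while a larger minimum length removes brief interjections; the real controls may behave differently.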
What’s Included:
• Speaker Diarization Pro.vst3 (x86, x64, arm64)
• ONNX models (.onnx) pre-optimized for real-time processing
• Runtime components required by the plug-in
• Lifetime license with free minor updates

Licensing & Support:
• Perpetual License: purchase once, use forever

Take your podcast, interview, and post-production workflow to the next level. Use Speaker Diarization Pro to stop chopping audio by hand and let AI do the hard work.
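
The embedding percentages quoted under Key Features can be checked with a few lines of arithmetic. This sketch uses only the two dimensions stated in the listing (512 and 192); the variable names are illustrative.

```python
FULL_DIM = 512   # Pro embedding size, as stated in the listing
BASIC_DIM = 192  # first Basic embedding size, as stated in the listing

# Relative increase in dimensions when going from 192 to 512.
increase_pct = (FULL_DIM - BASIC_DIM) / BASIC_DIM * 100

# Share of a 512-dim embedding that is lost when only 192 dims are kept.
truncated_pct = (FULL_DIM - BASIC_DIM) / FULL_DIM * 100

print(f"{increase_pct:.1f}% more dimensions")        # 166.7% more dimensions
print(f"{truncated_pct:.1f}% of dimensions dropped") # 62.5% of dimensions dropped
```

These round to the 167% and 63% figures in the description. They describe embedding size only and are not a measured accuracy gain.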

Advanced Speaker Segmentation
Expanded Pro Input Formats
Higher Speaker-Identity Accuracy
Pro Controls for Cleaner Turns
GPU Support
Multi-Export Workflow
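
For readers who want to post-process the exported files, the sketch below shows how a list of speaker turns maps onto SRT and CSV text. The SRT timestamp layout (`HH:MM:SS,mmm`) is the standard one; the CSV column names and the use of the speaker label as subtitle text are assumptions, since the listing does not document the plug-in's exact output layout.

```python
import csv
import io


def srt_time(seconds):
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    ms = round(seconds * 1000)
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"


def to_srt(turns):
    """Build SRT text from (speaker, start_sec, end_sec) tuples."""
    blocks = []
    for i, (speaker, start, end) in enumerate(turns, start=1):
        # Assumption: the cue text is just the speaker label.
        blocks.append(f"{i}\n{srt_time(start)} --> {srt_time(end)}\n{speaker}\n")
    return "\n".join(blocks)


def to_csv(turns):
    """Build CSV text; the column names here are hypothetical."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["speaker", "start_sec", "end_sec"])
    writer.writerows(turns)
    return buf.getvalue()


turns = [("Speaker 1", 0.0, 4.0), ("Speaker 2", 5.0, 7.25)]
srt_text = to_srt(turns)
csv_text = to_csv(turns)
```

Keeping both exporters on the same list of turns mirrors the idea of producing every output from a single processing run.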

Reviews

Speaker Diarization Pro hasn’t received any reviews yet.
Formats: VST3
Rating: No Ratings
Version: 1.0.0
Released: 2026
