🎙️ Multilingual Audio Processor

Upload an audio file to transcribe it, and optionally extract word-level timestamps or identify speakers. Powered by faster-whisper.

🤖 Select Whisper Model

Larger models are generally more accurate but slower.

Upload Audio

✅ Transcription will always be performed.

Extract timing for each word

🧭 Word-level Timestamps

Identify different speakers (requires a Hugging Face token)

🗣️ Diarize Speakers

📊 Status

📄 Transcript

🧭 Word-level Timestamps
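
For readers curious how per-word timing might be produced, here is a minimal sketch using faster-whisper's `word_timestamps=True` option. The helper names (`format_word_timestamps`, `transcribe_with_words`), the `"small"` model size, and the CPU/int8 settings are assumptions for illustration, not the app's actual code. The demo at the bottom uses stand-in word objects so the formatting can be tried without downloading a model.

```python
from types import SimpleNamespace


def format_word_timestamps(words):
    """Render one line per word as '[start - end] word', times in seconds."""
    lines = []
    for w in words:
        # faster-whisper word text usually carries a leading space; strip it.
        lines.append(f"[{w.start:.2f}s - {w.end:.2f}s] {w.word.strip()}")
    return "\n".join(lines)


def transcribe_with_words(audio_path, model_size="small"):
    """Hypothetical helper: transcribe a file and return its word objects."""
    # Imported lazily so the formatter above works without the dependency.
    from faster_whisper import WhisperModel

    # Assumed settings; a GPU setup would pick a different device/compute_type.
    model = WhisperModel(model_size, device="cpu", compute_type="int8")
    segments, _info = model.transcribe(audio_path, word_timestamps=True)
    # `segments` is a generator; iterating it runs the actual transcription.
    return [w for seg in segments for w in seg.words]


# Stand-in words with the same attributes (start, end, word) as real results.
demo = [
    SimpleNamespace(start=0.0, end=0.42, word=" Hello"),
    SimpleNamespace(start=0.42, end=0.9, word=" world"),
]
print(format_word_timestamps(demo))
```

With a real file, `format_word_timestamps(transcribe_with_words("audio.mp3"))` would produce the same layout, where `audio.mp3` is a placeholder path.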

🗣️ Speaker Diarization
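
The page does not say how speaker labels are combined with the transcript. One common approach, sketched here under that assumption, is to give each word the speaker whose turn overlaps it the most. The speaker turns are assumed to already exist as `(start, end, speaker)` tuples from some diarization model; the function names and the `SPEAKER_00`-style labels are hypothetical.

```python
def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the intersection of two time intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(words, turns):
    """Label each (start, end, text) word with the best-overlapping speaker."""
    labeled = []
    for w_start, w_end, text in words:
        best_speaker, best_overlap = "UNKNOWN", 0.0
        for t_start, t_end, speaker in turns:
            ov = overlap(w_start, w_end, t_start, t_end)
            if ov > best_overlap:
                best_speaker, best_overlap = speaker, ov
        labeled.append((best_speaker, text))
    return labeled


def group_by_speaker(labeled):
    """Merge consecutive words from the same speaker into one line each."""
    groups = []
    for speaker, text in labeled:
        if groups and groups[-1][0] == speaker:
            groups[-1][1].append(text)
        else:
            groups.append((speaker, [text]))
    return [f"{speaker}: {' '.join(parts)}" for speaker, parts in groups]


# Illustrative data only: word timings and speaker turns in seconds.
words = [(0.0, 0.4, "Hello"), (0.4, 0.9, "there"), (1.1, 1.5, "Hi"), (3.0, 3.2, "bye")]
turns = [(0.0, 1.0, "SPEAKER_00"), (1.0, 2.0, "SPEAKER_01")]
for line in group_by_speaker(assign_speakers(words, turns)):
    print(line)
```

Words that fall outside every turn are labeled `UNKNOWN` rather than being forced onto the nearest speaker, which keeps gaps in the diarization visible in the output.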