Adding realtime diarization to collabora/WhisperLive #178
Unanswered
yehiaabdelm
asked this question in
Q&A
Replies: 1 comment
-
Hi @yehiaabdelm, apart from To combine diart with Whisper you can check this article that I wrote on Medium some time ago. It will give you a head start but I'm sure many improvements can be made. You can also check out this gist for the diart+whisper code. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I'm trying to add diarization to this repo https://github.com/collabora/WhisperLive, which has transcription and also runs a VAD model before passing audio data to the transcriber. I had it working with pyannote-audio, however, the VAD model and the diarization model both run on the CPU so they slow down each other. I was also passing the whole audio file every time to the model so this is obviously not optimal. I was wondering how I can use diart instead of pyannote. Most of the examples I see are directly from microphone. Can anyone please share an example of how I can use it diart with the data being a float 32 numpy array of mono audio instead of a stream from the microphone? Any help is appreciated.
Beta Was this translation helpful? Give feedback.
All reactions