User:Ralsettem: Difference between revisions

Revision as of 06:01, 1 November 2022

Self-proclaimed nerd venturing out into the world of transcription supporting a podcast I love to listen to.

I’m using a local of installation of Whisper by OpenAi on Github to using the large model to transcribe podcast episodes. It’s theoretically possible to have diarization of speakers using Pyannote on Github, however I don’t know python and there is a lack of video tutorials that show the process. Hopefully there will be some kind developers that will create a webUI that is able to transcribe with diarization.

@@ Line 1: / Line 1: @@
 Self-proclaimed nerd venturing out into the world of transcription supporting a podcast I love to listen to.
+I’m using a local of installation of [https://openai.com/blog/whisper/ Whisper] by OpenAi on [https://github.com/openai/whisper Github] to using the large model to transcribe podcast episodes.
+It’s theoretically possible to have [https://github.com/lablab-ai/Whisper-transcription_and_diarization-speaker-identification- diarization] of speakers using [https://github.com/pyannote/pyannote-audio Pyannote on Github], however I don’t know python and there is a lack of video tutorials that show the process.
+Hopefully there will be some kind developers that will create a webUI that is able to transcribe with diarization.

User:Ralsettem: Difference between revisions

Revision as of 06:01, 1 November 2022

Navigation menu

Search