Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Lexicap: Lex Fridman Podcast Whisper Captions by Andrej Karpathy (karpathy.ai)
42 points by ashvardanian on Sept 27, 2022 | hide | past | favorite | 4 comments


Curious and a little off topic of the post (but related), is there a way to detect speakers with Whisper or with a combination of models, similar to Descript?


No, speaker diarization is not part of Whisper. There are open source projects - such as Kaldi [1], but it's hard to get them running if you are not an area expert.

[1] https://kaldi-asr.org/


Highly recommend the five hour John Carmack episode from a few weeks ago.


Yeah, that was great. Here are John's other appearances on other podcasts: https://jkstream.com/#/personality-john-carmack-Q92605




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: