Lexicap: Lex Fridman Podcast Whisper Captions by Andrej Karpathy

throwaway743 · on Sept 27, 2022

Curious and a little off topic of the post (but related), is there a way to detect speakers with Whisper or with a combination of models, similar to Descript?

abhinavkulkarni · on Sept 28, 2022

No, speaker diarization is not part of Whisper. There are open source projects - such as Kaldi [1], but it's hard to get them running if you are not an area expert.

[1] https://kaldi-asr.org/

bitlax · on Sept 28, 2022

Highly recommend the five hour John Carmack episode from a few weeks ago.

abhinavkulkarni · on Sept 28, 2022

Yeah, that was great. Here are John's other appearances on other podcasts: https://jkstream.com/#/personality-john-carmack-Q92605