MacWhisper crashes at about an hour of context.
This uses, smart, invisible regex in the text generation pipe. Makes this fast. + bonus, there is no context limit
I haven't worked in a while with transcription, but whisper.cpp itself (which I assume is the underlying tech behind MacWhisper) does realtime transcription on my MBP with an M1 Pro chip. When I first started writing my last completed novel, I fired it up and just started telling the story to test it out. Realtime.
That was back in 2023. I assume things work better now.
"Smart, invisible regex" sounds like a lot of bs... could you give a more technical explanation?
Also the Whisper model doesn't really have a context window, it already segments the audio with a certain amount of overlap between the chunks, I really have a hard time understanding what you are trying to say here.
This is just plain wrong. I have my own Whisper App in the AppStore (on iOS, with very limited memory capacity) and there are no problems at all with longer Audio / Video files.