Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It's been 3 years since Adobe first showed off their voice impression and editing feature (unreleased): https://www.youtube.com/watch?v=I3l4XLZ59iw

Though this is very impressive, it seems to take longer and longer to make those tiny improvements that make all the difference wrt believability.

N.B. The most convincing TTS I've ever heard (predating Lyre by quite a bit) generated things like this: http://web.archive.org/web/20190803012012/https://instaud.io...



I have real time voice to voice that runs on low end CPUs. You can impersonate celebrities and cartoon characters.

https://drive.google.com/file/d/1zRvJEGJjTpKvvzel-J0agh3fKBn...

I'm integrating it into a "Snapchat filter" type app with lightweight social features just as a means to bring it to market and hopefully attract Facebook or Snapchat or Tencent into buying it. I'm building it to sell, essentially.

I need capital so I can fund my real ambitious start-up of end to end computational filmmaking. Graph-based story language, light field camera optics, tracking and localization in prerendered environments, content-aware shaders, real time storyboard population and automated editing, posture estimation and mistake correction...

With patent protection, I think it could unseat Disney and make more money than they do with Marvel and Star Wars.

I need a lot of capital to build my lab. Optics (good sensors and glass), a modest studio with rigging and tracking set up for experiments, and a handful of engineers.


Hmm. I'd like to test that out, I did film audio for a decade so I feel like I could provide you with useful feedback. NDA is OK, same name at gmail if you want to get in touch.



Oh wow, I didn't do due dilligence.

I checked the Android app store and all the apps used text to speech before vocoding. I had no idea this existed on iPhone.

Thanks.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: