
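Despite the large video memory capacity, its memory bandwidth is very low. Decoding is usually bandwidth-bound, so I'd guess decode speed will be slow for any model that has to read most of its weights on every token. That said, this design is well suited to MoE inference: all the experts have to fit in memory, but only a few of them are read per token.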
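To put rough numbers on that, here is a back-of-envelope sketch. It assumes decode is purely memory-bandwidth-bound and that every active weight is read once per generated token; it ignores KV cache traffic, batching, and compute limits. The bandwidth and model sizes are made-up placeholders, not specs of any real device, and `decode_tokens_per_second` is just an illustrative helper name.

```python
def decode_tokens_per_second(bandwidth_gb_s, active_params_billion, bytes_per_param):
    """Rough upper bound on decode speed, assuming each active weight
    is read from memory exactly once per generated token."""
    bytes_per_token = active_params_billion * 1e9 * bytes_per_param
    return bandwidth_gb_s * 1e9 / bytes_per_token


# Hypothetical numbers, chosen only for illustration.
BANDWIDTH_GB_S = 200.0   # assumed memory bandwidth
BYTES_PER_PARAM = 1.0    # assumed 8-bit weights

# Dense model: all 70B parameters are read for every token.
dense = decode_tokens_per_second(BANDWIDTH_GB_S, 70.0, BYTES_PER_PARAM)

# MoE model: larger in total, but assume only 10B parameters are active per token.
moe = decode_tokens_per_second(BANDWIDTH_GB_S, 10.0, BYTES_PER_PARAM)

print(f"dense: ~{dense:.1f} tok/s upper bound")
print(f"MoE:   ~{moe:.1f} tok/s upper bound")
```

Under these assumptions the bound depends only on the active parameter count, not on the total, which is why large capacity with low bandwidth hurts a dense model far more than an MoE.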

