Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
Show HN: Fine-tune Llama3-8B on 8GB GPU without quantization (github.com/mega4alik)
3 points by anuarsh 5 months ago | past
Show HN: Run Qwen3-Next-80B on 8GB GPU at 1tok/2s throughput (github.com/mega4alik)
123 points by anuarsh 7 months ago | past | 17 comments
Show HN: Run gpt-oss-20b on 8GB GPUs (github.com/mega4alik)
6 points by anuarsh 7 months ago | past
Show HN: oLLM – LLM Inference for large-context tasks on consumer GPUs (github.com/mega4alik)
3 points by anuarsh 7 months ago | past | 7 comments

Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: