coder543 | 52 days ago | on: GLM-4.7: Advancing the Coding Capability
The Spark has more compute, so it should be faster for prefill (prompt processing).
The M4 Max has double the memory bandwidth, so it should be faster for decode (token generation).
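A rough back-of-the-envelope sketch of that reasoning, assuming prefill is compute-bound (about 2 FLOPs per active parameter per prompt token) and decode is bandwidth-bound (every active weight is read once per generated token). The function names and the two machine profiles are made up for illustration; the numbers are placeholders, not the real specs of either device.

```python
def prefill_seconds(prompt_tokens, active_params_billion, compute_tflops):
    """Lower bound on prompt processing time, assuming it is compute-bound.

    Uses the common approximation of 2 FLOPs per active parameter per token.
    """
    flops_needed = 2 * active_params_billion * 1e9 * prompt_tokens
    return flops_needed / (compute_tflops * 1e12)


def decode_tokens_per_second(bandwidth_gb_s, active_weight_gb):
    """Upper bound on generation speed, assuming it is bandwidth-bound.

    Each generated token requires reading all active weights from memory once.
    """
    return bandwidth_gb_s / active_weight_gb


# Hypothetical machines: A has twice the compute, B has twice the bandwidth.
# These values are placeholders, not measured or published specs.
machine_a = {"compute_tflops": 100.0, "bandwidth_gb_s": 250.0}
machine_b = {"compute_tflops": 50.0, "bandwidth_gb_s": 500.0}

# Hypothetical workload: 30B active parameters, 20 GB of active weights,
# and an 8000-token prompt.
for name, m in (("A", machine_a), ("B", machine_b)):
    prefill = prefill_seconds(8000, 30, m["compute_tflops"])
    decode = decode_tokens_per_second(m["bandwidth_gb_s"], 20)
    print(f"machine {name}: prefill >= {prefill:.1f} s, decode <= {decode:.1f} tok/s")
```

Under these assumptions the machine with more compute finishes prefill sooner and the one with more bandwidth decodes faster. Real numbers will come in below these bounds, since the estimate ignores KV cache reads, batching, and kernel efficiency.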