coder543 52 days ago, on: GLM-4.7: Advancing the Coding Capability

The Spark has more compute, so it should be faster for prefill (prompt processing). The M4 Max has double the memory bandwidth, so it should be faster for decode (token generation).
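To make the reasoning concrete, here is a rough back-of-envelope sketch. It assumes prefill is limited by compute (about 2 FLOPs per active parameter per token) and decode is limited by memory bandwidth (every generated token streams the active weights once). The function names and all of the numbers are hypothetical placeholders, not measured specs for either machine; the only thing taken from the comment is the 2x bandwidth ratio.

```python
def prefill_tokens_per_s(compute_tflops: float, active_params_b: float) -> float:
    """Rough upper bound on prefill speed, assuming it is compute-bound.

    Assumes ~2 FLOPs per active parameter per prompt token.
    """
    flops_per_token = 2 * active_params_b * 1e9
    return compute_tflops * 1e12 / flops_per_token


def decode_tokens_per_s(bandwidth_gb_s: float, active_weights_gb: float) -> float:
    """Rough upper bound on decode speed, assuming it is bandwidth-bound.

    Assumes each generated token reads all active weights from memory once.
    """
    return bandwidth_gb_s / active_weights_gb


# Hypothetical numbers, chosen only to illustrate the shape of the argument.
slow_bw = decode_tokens_per_s(bandwidth_gb_s=200, active_weights_gb=20)
fast_bw = decode_tokens_per_s(bandwidth_gb_s=400, active_weights_gb=20)
print(f"decode: {slow_bw:.1f} vs {fast_bw:.1f} tok/s")

prefill = prefill_tokens_per_s(compute_tflops=100, active_params_b=10)
print(f"prefill: {prefill:.0f} tok/s")
```

Under these assumptions, doubling the bandwidth doubles the decode ceiling regardless of compute, while the prefill ceiling scales only with compute. Real throughput will land below both bounds.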