vLLM: The High-Throughput and Memory-Efficient Serving Engine for LLMs (vllm.ai)
		1 point by sorrow17 49 days ago