DeepSeek-R1 at 3,872 tokens / second on a single Nvidia HGX H200 (blogs.nvidia.com)
9 points by moondistance 4 hours ago
billconan 3 hours ago
https://news.ycombinator.com/item?id=42879864
This is Cerebras' 70B number: 1,600 tokens / sec. Not sure about the costs.