Comment by throwawaymaths
25 days ago
According to someone I talked to at groq event I was invited to (I did not sign an nda), They are putting ~8 racks of hardware per llm. Of course coordinating those racks to have exact timings between them to pull tokens through is definitely "part of the hard part".
No comments yet
Contribute on Hacker News ↗