
Cerebras, an artificial intelligence startup based in Sunnyvale, Calif., today launched Cerebras Inference, which the company says is the fastest AI inference solution in the world.
"Cerebras Inference delivers 1,800 tokens per second for Llama3.1 8B and 450 tokens per second