Cerebras launches Inference, which runs 20 times faster than Nvidia GPUs

Aug. 27, 2024 1:16 PM ET, , , , By: Brandon Evans, SA News Editor71 Comments
(3min)
AI logo place on abstract blocks

J Studios

Cerebras, an artificial intelligence startup based in Sunnyvale, Calif., launched Cerebras Inference today, which it said is the fastest AI inference solution in the world.

"Cerebras Inference delivers 1,800 tokens per second for Llama3.1 8B and 450 tokens per second

Recommended For You