Chinese lab DeepSeek released DeepSeek-V2 as a 236B mixture-of-experts model that activated only 21B parameters per token, achieving GPT-4-class performance at a fraction of the inference cost. Its open release shocked the industry and caused API price wars, with some providers cutting prices by 80% to compete.

Comments on "DeepSeek-V2"
Create a free account or sign in to join the discussion.
Sign in to join the conversation