🤗 HuggingFaceSignificantDeepSeek
DeepSeek-V4 Preview Launches: 1M Context MoE Models with Open Weights
DeepSeek released DeepSeek-V4 Preview with two model variants: DeepSeek-V4-Pro (1.6T total parameters, 49B active params with Mixture of Experts architecture) and DeepSeek-V4-Flash (284B total, 13B active params). Both s…