🤗 HuggingFaceSignificantGDP
DeepSeek V4: 10x KV Cache Reduction at 1M Context Transforms Inference Economics
This retweet summarizes DeepSeek V4's key technical breakthrough in inference efficiency: a 10x reduction in KV cache requirements at 1M context length relative to V3.2 (requiring only 10% of…
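To put a 10x KV cache reduction at 1M context in perspective, here is a minimal back-of-the-envelope sketch. The layer count, per-layer KV width, and fp16 storage are hypothetical illustration values, not figures from DeepSeek V4; only the 1M context length and the 10x factor come from the claim above.

```python
# Hypothetical parameters (not from the source) to illustrate how
# KV cache memory scales linearly with context length, and what a
# claimed 10x reduction would mean at 1M tokens.
bytes_per_elem = 2          # fp16/bf16 storage
n_layers = 60               # hypothetical layer count
kv_dim = 512                # hypothetical per-layer KV width (K+V combined)
context_len = 1_000_000     # 1M-token context from the claim

baseline_bytes = context_len * n_layers * kv_dim * bytes_per_elem
reduced_bytes = baseline_bytes // 10   # the claimed 10x reduction

gib = 1024 ** 3
print(f"baseline KV cache: {baseline_bytes / gib:.1f} GiB per sequence")
print(f"after 10x cut:     {reduced_bytes / gib:.1f} GiB per sequence")
```

Under these toy numbers the per-sequence cache drops from roughly 57 GiB to under 6 GiB, which is why a reduction of this size changes how many long-context requests fit on one accelerator.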