#llama.cpp

1 post tagged

May 13, 2026

1 update

🤗 HuggingFace · Significant · Lewis Tunstall

2:05 PM

Qwen3-35B-A3B: Running a 35B Parameter MoE Model on Local Hardware

Qwen3-35B-A3B is a new model in Alibaba's Qwen3 series. This tweet announces local inference using llama.cpp (the popular inference engine built around the GGUF format) combined with Unsloth's 4-bit quantization, enabling a…

#Qwen3 #llama.cpp #Quantization #LocalLlm #MoE
