If you want to have a chance at running a large model, it needs to be quantized. The unsloth account on Hugging Face maintains popular quantizations for many models, Qwen included, and I believe they developed dynamic GGUF quantization.
Take Qwen/Qwen3.5-35B-A3B for example: it's 72 GB, while unsloth/Qwen3.5-35B-A3B-GGUF has quantizations ranging from 9 to 38 GB.
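A back-of-the-envelope sketch of where those file sizes come from: size is roughly parameter count times bits per weight. The bits-per-weight figures below are nominal approximations for common GGUF quant types, not exact; real files also carry metadata and mixed-precision layers, so actual sizes differ somewhat.

```python
def approx_size_gb(params_billions: float, bits_per_weight: float) -> float:
    """Rough on-disk size in GB: parameters x bits per weight / 8."""
    return params_billions * 1e9 * bits_per_weight / 8 / 1e9

# 35B parameters at a few precisions (bits/weight are approximate):
for label, bits in [("BF16", 16), ("Q8_0", 8.5), ("Q4_K_M", 4.8), ("Q2_K", 2.6)]:
    print(f"{label}: ~{approx_size_gb(35, bits):.0f} GB")
```

This lines up with the numbers above: ~70 GB at BF16 and roughly 11-37 GB across the quantized variants.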
Unsloth is one of the best-known providers of model quantizations, if not the best known. The release post should of course reference the source, but most people probably run unsloth or bartowski quantized models anyway; those are my go-tos, so the link is both relevant and convenient.
Here's a chart[0] of how these compare to the Qwen3 235B-A22B, Next-80B-A3B-Thinking, 30B-A3B-Thinking, 4B, and 1.7B models.
These new ones are very much punching above their weights.
[0] https://www.reddit.com/r/LocalLLaMA/comments/1rivckt/visuali...
So 27B at Q3 or 9B at Q8?
This looks like somebody re-releasing QWEN models to promote their own company. https://news.ycombinator.com/item?id=47217305 is the link to QWEN's repo.