

QwQ-32B, from Alibaba Qwen team, is a new open-source 32B LLM achieving DeepSeek-R1 level reasoning via scaled Reinforcement Learning. It features a "thinking mode" for complex tasks and is part of the Qwen series, focusing on reasoning capabilities. Compared to instruction-tuned models, QwQ excels in downstream tasks, especially hard problems. It's built upon Qwen2.5 and requires the latest Hugging Face transformers library.
25 Mar 2025
Readmore