(논문 요약) Qwen3-Coder: Agentic Coding in the World (blog)
핵심 내용
- Architecture: MoE
- Total Params: 480B
- Active Params: 35B
- Contex length: 256K
학습
- 7.5T tokens (70% code ratio)
- Qwen2.5-Coder 로 noisy data 를 clean 및 rewrite 하여 data quality 높임
- Post training: execution-driven large-scale reinforcement learning
- real world coding tasks 의 test cases 를 점진적으로 복잡하게 발전시켜나감
- “Hard to Solve, Easy to Verify”