(Paper summary) Reasoning Models Can Be Effective Without Thinking


Key points

When DeepSeek-R1-Distill-Qwen runs inference with “NoThinking”:
- It outperforms “Thinking” in low-budget scenarios (e.g., 51.3 vs. 28.9 on AMC 23 with 700 tokens)
- It becomes more competitive on pass@k as k increases
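
For readers unfamiliar with pass@k, here is a minimal sketch of the unbiased estimator commonly used in the code-generation literature: with n samples per problem, of which c are correct, pass@k = 1 - C(n-c, k) / C(n, k). Whether the paper computes pass@k exactly this way is an assumption, and the function name `pass_at_k` is illustrative only.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate the probability that at least one of k samples is correct.

    n: total samples drawn for a problem
    c: number of those samples that are correct
    k: number of samples the metric is allowed to use (1 <= k <= n)
    """
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    # If fewer than k samples are wrong, every size-k subset contains a correct one.
    if n - c < k:
        return 1.0
    # 1 minus the fraction of size-k subsets made up of wrong samples only.
    return 1.0 - comb(n - c, k) / comb(n, k)


if __name__ == "__main__":
    # Hypothetical counts, not taken from the paper.
    for k in (1, 2, 4, 8):
        print(k, round(pass_at_k(n=16, c=4, k=k), 3))
```

Because the estimate can only grow with k, a method with lower pass@1 can close the gap at larger k if its samples are diverse enough.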

“NoThinking”

<|beginning of thinking|>
Okay, I think I have finished thinking.
<|end of thinking|>
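
The prefill above can be sketched as a small prompt-building helper. This is only an illustration under stated assumptions: the function name `build_nothinking_prompt` is hypothetical, the delimiter strings are copied from the snippet above and are placeholders (the real thinking delimiters depend on the model's chat template), and how the question is wrapped before the prefill is also an assumption.

```python
# Placeholder delimiters copied from the snippet above; the actual tokens
# depend on the model's chat template (assumption: swap them as needed).
BEGIN_THINKING = "<|beginning of thinking|>"
END_THINKING = "<|end of thinking|>"
DUMMY_THOUGHT = "Okay, I think I have finished thinking."


def build_nothinking_prompt(
    question: str,
    begin: str = BEGIN_THINKING,
    end: str = END_THINKING,
) -> str:
    """Return the question followed by a pre-filled, already-closed thinking block.

    The model then continues generating after the closing delimiter, so it
    goes straight to the final answer instead of producing a long reasoning
    trace.
    """
    return f"{question}\n{begin}\n{DUMMY_THOUGHT}\n{end}\n"


if __name__ == "__main__":
    print(build_nothinking_prompt("What is 12 * 13?"))
```

The resulting string would be passed to the model as the start of its response, with decoding continuing from the end of the string.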