(논문 요약) Kosmos; An AI Scientist for Autonomous Discovery

(논문 요약) Kosmos: An AI Scientist for Autonomous Discovery (Paper)

핵심 내용

Kosmos pursues the specified objective,
- over 200 agent rollouts
- collectively executing an average of 42,000 lines of code
- reading 1,500 papers per run
Kosmos cites all statements in its reports with code or primary literature, ensuring its reasoning is traceable.
Kosmos orchestrates “LLMs” powering a data-analysis agent and a literature-search agent (no specific model names or vendors provided).
Independent scientists found 79.4% of statements in Kosmos reports to be accurate.
Collaborators reported that a single 20-cycle Kosmos run performed the equivalent of 6 months of their own research time on average.