(논문 요약) Kosmos: An AI Scientist for Autonomous Discovery (Paper)

핵심 내용

  • Kosmos pursues the specified objective,
    • over 200 agent rollouts
    • collectively executing an average of 42,000 lines of code
    • reading 1,500 papers per run
  • Kosmos cites all statements in its reports with code or primary literature, ensuring its reasoning is traceable.

  • Kosmos orchestrates “LLMs” powering a data-analysis agent and a literature-search agent (no specific model names or vendors provided).

  • Independent scientists found 79.4% of statements in Kosmos reports to be accurate.

  • Collaborators reported that a single 20-cycle Kosmos run performed the equivalent of 6 months of their own research time on average.