(잡생각) statistical models

(잡생각) statistical models

Big World Hypothesis
- For many learning problems, the world is multiple orders of magnitude larger than the agent.
- The agent neither fully perceives the state of the world nor can it learn the correct value or optimal action for each state.
- It has to rely on approximate solutions to achieve its goals.
statistical models 의 input 공간, output 공간을 ‘state of the world’ 라고 생각할 수 있음.
- 이 경우, image, text 등, 데이터가 존재하는 공간이 real world 가 됨.
- Big World Hypothesis 에서 state of the world 를 근사한 것 처럼, state of images, state of texts 를 근사하게 됨.

(1) state 들을 근사.
- embedding 을 통해 구현 (word embedding, image embedding).
(2) 근사된 state 들의 변환.
- 근사된 state 들이 network 의 forward 과정에서 다른 공간으로 mapping 되며, 최종적으로 output space 로 mapping 됨.

사람의 표정을 classify 하도록 학습된 모델은 개 vs 고양이 구분 불가능.
vision foundation model, language foundation model 들은 모든 task 에서 잘하려는 것이 목표
- target object 와, task 의 범위를 좁히는 경우보다 문제가 어려움 (수학만 잘하기 vs. 전과목 다 잘하기).
“이 정도 task 에는 이 정도 모델 학습이면 충분함” 같은 방법이 있으면 유용할 듯.
- binary search 방식으로 upper layer 들 일부만 finetuning 하는 방식?

원하는 대로 동작하고 있는지 체크할수 있는 metric 필요.
metric 이 악화되는 경우, 이를 개선할수 있도록 align 필요 (추가 학습의 필요성).
여러 명이 같은 모델을 사용하면서, 동시에 align 해나가면 개인당 시간/노력을 줄일수 있을 것임 (e.g. community efforts).