AI-ML·Importance 6·2017. 08. 03.·OpenAI Blog

Gathering human feedback

── KO ──────────────────

RL-Teacher는 인간 피드백을 통해 AI를 훈련하는 오픈소스 구현이다.

RL-Teacher는 인간 피드백을 통해 AI를 훈련할 수 있는 오픈소스 인터페이스이다. 이 접근 방식은 안전한 AI 시스템을 위한 한 단계로 개발되었으며, 보상 명세가 어려운 강화 학습 문제에도 적용될 수 있다. 손으로 작성된 보상 함수 대신 더 유연한 방식으로 AI를 훈련할 수 있는 가능성을 제공한다.

── EN ──────────────────

RL-Teacher is an open-source implementation for training AIs with human feedback.

RL-Teacher is an open-source interface for training AIs using occasional human feedback instead of hand-crafted reward functions. The method was developed as a step towards safer AI systems, but it also applies to reinforcement learning problems where rewards are hard to specify. Replacing hand-written reward functions with human judgments could make AI training more flexible.
