AI-ML·중요도 6·2026. 06. 26.·Dev.to

Trace-to-Training: how agent runs become learning data

── KO ──────────────────

WasmAgent는 에이전트 실행 데이터를 교육 데이터로 변환하는 과정을 설명합니다.

WasmAgent는 에이전트 실행을 데이터 포인트로 변환하고 이를 준수 엔진을 통해 평가하여 SFT 또는 DPO 교육용 데이터로 활용합니다. 기존의 많은 프레임워크는 이 데이터를 버리지만, WasmAgent는 이를 'ComplianceEvalRecord'로 내보내어 활용합니다. 이 과정에서는 인간의 라벨링이 필요 없으며, 'full_pcl', 'prompt_retry', 'direct'와 같은 다양한 수정 모드가 포함됩니다.

── EN ──────────────────

WasmAgent transforms agent runs into learning data for training.

WasmAgent preserves agent runs as data points, evaluated by a compliance engine, and exports them as 'ComplianceEvalRecord' for SFT or DPO training. Most frameworks discard this data, but WasmAgent utilizes it without human labeling. The system includes various repair modes such as 'full_pcl', 'prompt_retry', and 'direct'.

원문 보기 →목록으로