SECURITY·Importance 8·2025. 12. 22.·OpenAI Blog

Continuously hardening ChatGPT Atlas against prompt injection

── KO ──────────────────

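OpenAI is strengthening ChatGPT Atlas's defenses against prompt injection.

OpenAI is hardening ChatGPT Atlas against prompt injection attacks with an automated red team. The red team is trained with reinforcement learning, which improves its ability to discover and patch new attacks early. As AI grows more autonomous, this proactive approach matters.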

── EN ──────────────────

OpenAI is enhancing ChatGPT Atlas's defenses against prompt injection attacks.

OpenAI is strengthening ChatGPT Atlas against prompt injection attacks using automated red teaming driven by reinforcement learning. This proactive discover-and-patch loop surfaces novel exploits early so they can be fixed. As AI becomes more agentic, this kind of proactive defense grows more important.
