agent defense Topic Archive

agent defense Topic Archive agent-defense.html 关键词 agent defense 的长期追踪 RSS，汇总历史命中文献。 zh-CN Sun, 28 Jun 2026 05:24:06 +0000 SafeMCP: Proactive Power Regulation for LLM Agent Defense via Environment-Grounded Look-Ahead Reasoning ../papers/arxiv-e85c8c6f3e5d.html https://arxiv.org/abs/2606.01991v1#2026-06-02#agent-defense Tue, 02 Jun 2026 13:56:35 +0800 As Large Language Model (LLM) agents increasingly leverage the Model Context Protocol (MCP) to operate in complex environments, the expansion of their action spaces offers agents unsafe capabilities and underscores the risk of power-seeking. While broad action space and greater environment influence are essential for task fulfillment, they create a fragile risk surface where minor errors or hallucinations are magnified into catastrophic failures. In response, we propose SafeMCP, a {server-side}…