<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0">
<channel>
<title>agent defense Topic Archive</title>
<link>agent-defense.html</link>
<description>关键词 agent defense 的长期追踪 RSS，汇总历史命中文献。</description>
<language>zh-CN</language>
<lastBuildDate>Sun, 28 Jun 2026 05:24:06 +0000</lastBuildDate>
<item>
<title>SafeMCP: Proactive Power Regulation for LLM Agent Defense via Environment-Grounded Look-Ahead Reasoning</title>
<link>../papers/arxiv-e85c8c6f3e5d.html</link>
<guid>https://arxiv.org/abs/2606.01991v1#2026-06-02#agent-defense</guid>
<pubDate>Tue, 02 Jun 2026 13:56:35 +0800</pubDate>
<description>As Large Language Model (LLM) agents increasingly leverage the Model Context Protocol (MCP) to operate in complex environments, the expansion of their action spaces offers agents unsafe capabilities and underscores the risk of power-seeking. While broad action space and greater environment influence are essential for task fulfillment, they create a fragile risk surface where minor errors or hallucinations are magnified into catastrophic failures. In response, we propose SafeMCP, a {server-side}…</description>
</item>
</channel>
</rss>
