Subscription View

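Trends and Subscriptions Overview

Long-term signals distilled from the archive: each feed's hits over the last 7 and 30 days, the long-term hit trajectory of each configured keyword, and papers that keep heating up by reappearing across multiple days.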
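As a rough illustration of how the 7-day / 30-day / all-time counts shown on this page could be derived from an archive, here is a minimal sketch. It assumes the archive can be flattened into `(keyword, date)` hit records; the function name `window_counts` and the record shape are hypothetical and not taken from the actual tool.

```python
from datetime import date


def window_counts(hits, today, windows=(7, 30)):
    """Count hits per keyword inside each trailing window, plus an all-time total.

    hits: iterable of (keyword, date) pairs (assumed record shape).
    today: the date the windows are measured back from.
    """
    counts = {}
    for keyword, day in hits:
        if keyword not in counts:
            row = {f"{w}d": 0 for w in windows}
            row["all"] = 0
            counts[keyword] = row
        row = counts[keyword]
        row["all"] += 1
        age = (today - day).days  # 0 means the hit happened today
        for w in windows:
            if 0 <= age < w:
                row[f"{w}d"] += 1
    return counts


if __name__ == "__main__":
    sample = [
        ("LLM", date(2024, 1, 10)),
        ("LLM", date(2024, 1, 1)),
        ("LLM", date(2023, 11, 1)),
        ("RAG", date(2024, 1, 9)),
    ]
    for keyword, row in window_counts(sample, date(2024, 1, 10)).items():
        print(keyword, row)
```

Under this sketch a window of 7 days covers today and the six preceding days; the real page may draw its window boundaries differently.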

Feed subscriptions

7 fixed entry points

82 digest days

Keyword tracking

65 keyword pages

4572 historical hits

Last 30 days

378 papers matched by feeds

1848 keyword hits

Papers heating up

2 candidates

2 cumulative active dates

Reading list

2 marked papers

1 starred

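Feed trend subscriptions

Permanent links are suited to tracking a research direction's hit volume and recent changes over the long term.

Long-term keyword tracking

These pages come from the feed keywords in your configuration and are suited to watching whether a topic keeps surfacing.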

Topic

LLM

Latest hit from LM: NuclearQAv2: A Structured Benchmark for Evaluating Domain-Science Competence in Large Language Models

64 / 7d, 286 / 30d, 671 / all

Topic

language model

Latest hit from LM: NuclearQAv2: A Structured Benchmark for Evaluating Domain-Science Competence in Large Language Models

56 / 7d, 240 / 30d, 619 / all

Topic

benchmark

Latest hit from LM: NuclearQAv2: A Structured Benchmark for Evaluating Domain-Science Competence in Large Language Models

50 / 7d, 217 / 30d, 574 / all

Topic

large language model

Latest hit from LM: NuclearQAv2: A Structured Benchmark for Evaluating Domain-Science Competence in Large Language Models

44 / 7d, 209 / 30d, 538 / all

Topic

agent

Latest hit from LM: Joint Learning of Experiential Rules and Policies for Large Language Model Agents

42 / 7d, 197 / 30d, 448 / all

Topic

evaluation

Latest hit from LM: NuclearQAv2: A Structured Benchmark for Evaluating Domain-Science Competence in Large Language Models

32 / 7d, 149 / 30d, 426 / all

Topic

reasoning

Latest hit from LM: NuclearQAv2: A Structured Benchmark for Evaluating Domain-Science Competence in Large Language Models

34 / 7d, 136 / 30d, 399 / all

Topic

RAG

Latest hit from LM: NuclearQAv2: A Structured Benchmark for Evaluating Domain-Science Competence in Large Language Models

38 / 7d, 146 / 30d, 373 / all

Topic

alignment

Latest hit from LM: In-Context Model Predictive Generation: Open-Vocabulary Motion Synthesis from Language Models to Physics

12 / 7d, 59 / 30d, 175 / all

Topic

coding agent

Latest hit from Terminal and SWE Agents: A Deterministic Control Plane for LLM Coding Agents

7 / 7d, 33 / 30d, 57 / all

Topic

jailbreak

Latest hit from Agent Runtime Security: Jailbreaking for the Average Jane: Choosing Optimal Jailbreaks via Bandit Algorithms for Automatically Enhanced Queries

7 / 7d, 26 / 30d, 42 / all

Topic

guardrail

Latest hit from Agent Runtime Security: Do Safety Guardrails Need to Reason? LeanGuard: A Fast and Light Approach for Robust Moderation

8 / 7d, 24 / 30d, 40 / all

Topic

prompt injection

Latest hit from LM: Prompt Injection in Automated Résumé Screening with Large Language Models: Single and Multi-Injection Settings

5 / 7d, 21 / 30d, 31 / all

Topic

SWE-bench

Latest hit from Terminal and SWE Agents: To Run or Not to Run: Analyzing the Cost-Effectiveness of Code Execution in LLM-Based Program Repair

3 / 7d, 16 / 30d, 29 / all

Topic

computer-use agent

Latest hit from LM: Uncertainty Quantification for Computer-Use Agents: A Benchmark across Vision-Language Models and GUI Grounding Datasets

3 / 7d, 9 / 30d, 18 / all

Topic

Terminal-Bench

Latest hit from Terminal and SWE Agents: LemonHarness Technical Report

2 / 7d, 7 / 30d, 12 / all

Topic

code agent

Latest hit from Terminal and SWE Agents: How Much Static Structure Do Code Agents Need? A Study of Deterministic Anchoring

1 / 7d, 10 / 30d, 11 / all

Topic

instruction tuning

Latest hit from LM: SARA: Unlocking Multilingual Knowledge in Mixture-of-Experts via Semantically Anchored Routing Alignment

2 / 7d, 8 / 30d, 11 / all

Topic

repository-level

Latest hit from Terminal and SWE Agents: Evaluating LLMs on Real-World Software Performance Optimization

2 / 7d, 7 / 30d, 10 / all

Topic

in-context learning

Latest hit from LM: MedGuards: Multi-Agent System for Reliable Medical Error Detection and Correction

2 / 7d, 7 / 30d, 9 / all

Topic

agent runtime

Latest hit from Agent Runtime Security: Getting Better at Working With You: Compiling User Corrections into Runtime Enforcement for Coding Agents

0 / 7d, 3 / 30d, 8 / all

Topic

indirect prompt injection

Latest hit from Agent Runtime Security: CodeSentinel: A Three-Layer Defense Against Indirect Prompt Injection in Code Contexts

0 / 7d, 3 / 30d, 6 / all

Topic

policy enforcement

Latest hit from LM: A Technical Taxonomy of LLM Agent Communication Protocols

0 / 7d, 2 / 30d, 5 / all

Topic

code editing

Latest hit from Agent Runtime Security: WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces

0 / 7d, 1 / 30d, 4 / all

Topic

code generation benchmark

Latest hit from Terminal and SWE Agents: VoidPadding: Let [VOID] Handle Padding in Masked Diffusion Language Models so that [EOS] Can Focus on Semantic Termination

0 / 7d, 3 / 30d, 4 / all

Topic

issue resolution

Latest hit from Terminal and SWE Agents: Unlocking Model Potentials Through Adaptive Multi-Agent Scaffolding for Efficient Issue Resolution

1 / 7d, 2 / 30d, 4 / all

Topic

program repair

Latest hit from Terminal and SWE Agents: Smaller Models, Unexpected Costs: Trade-offs in LLM Quantization for Automated Program Repair

2 / 7d, 3 / 30d, 4 / all

Topic

sandboxing

Latest hit from Agent Runtime Security: Burnyard: Future of Malware Analysis

1 / 7d, 3 / 30d, 4 / all

Topic

agent security

Latest hit from Agent Runtime Security: Toward Secure LLM Agents: Threat Surfaces, Attacks, Defenses, and Evaluation

0 / 7d, 3 / 30d, 3 / all

Topic

automated program repair

Latest hit from Terminal and SWE Agents: Smaller Models, Unexpected Costs: Trade-offs in LLM Quantization for Automated Program Repair

1 / 7d, 2 / 30d, 3 / all

Topic

data exfiltration

Latest hit from Agent Runtime Security: Securing LLM-Agent Long-Term Memory Against Poisoning: Non-Malleable, Origin-Bound Authority with Machine-Checked Guarantees

1 / 7d, 1 / 30d, 3 / all

Topic

secure agent

Latest hit from Agent Runtime Security: Provably Secure Agent Guardrail

0 / 7d, 0 / 30d, 3 / all

Topic

SWE bench

Latest hit from Terminal and SWE Agents: Exploration Structure in LLM Agents for Multi-File Change Localization

0 / 7d, 2 / 30d, 3 / all

Topic

terminal agent

Latest hit from Terminal and SWE Agents: Tmax: A simple recipe for terminal agents

1 / 7d, 2 / 30d, 3 / all

Topic

test generation

Latest hit from Terminal and SWE Agents: Knowledge Matters: Injecting Project and Testing Knowledge into LLM-based Unit Test Generation

0 / 7d, 2 / 30d, 3 / all

Topic

privilege escalation

Latest hit from Agent Runtime Security: Seeing Is Not Screening: Multimodal Hidden Instruction Attacks on Agent Skill Scanners

0 / 7d, 1 / 30d, 2 / all

Topic

retrieval augmented generation

Latest hit from LM: Probabilistic Agents in Deterministic Audits: Evaluating Multi-Agent Systems for Automated Audits Based on the German IT-Grundschutz

1 / 7d, 2 / 30d, 2 / all

Topic

agent attack

Latest hit from Agent Runtime Security: Secure UAV Swarms in Low-Altitude Wireless Networks: Challenges and Solutions

0 / 7d, 0 / 30d, 1 / all

Topic

agent defense

Latest hit from Agent Runtime Security: SafeMCP: Proactive Power Regulation for LLM Agent Defense via Environment-Grounded Look-Ahead Reasoning

0 / 7d, 1 / 30d, 1 / all

Topic

agent isolation

Latest hit from LLM: RPA-Check: A Multi-Stage Automated Framework for Evaluating Dynamic LLM-based Role-Playing Agents

0 / 7d, 0 / 30d, 1 / all

Topic

agent sandbox

Latest hit from Agent Runtime Security: DeltaBox: Scaling Stateful AI Agents with Millisecond-Level Sandbox Checkpoint/Rollback

0 / 7d, 0 / 30d, 1 / all

Topic

bug fixing

Latest hit from Terminal and SWE Agents: DeNovoSWE: Scaling Long-Horizon Environments for Generating Entire Repositories from Scratch

0 / 7d, 1 / 30d, 1 / all

Topic

code repair

Latest hit from Terminal and SWE Agents: SHERLOC: Structured Diagnostic Localization for Code Repair Agents

1 / 7d, 1 / 30d, 1 / all

Topic

LLM agent security

Latest hit from Agent Runtime Security: Toward Secure LLM Agents: Threat Surfaces, Attacks, Defenses, and Evaluation

0 / 7d, 1 / 30d, 1 / all

Topic

malicious tool

Latest hit from Agent Runtime Security: From Control Boundary to Insurance Claim: Reconstructing AI-Mediated Losses Through the CER Framework

0 / 7d, 1 / 30d, 1 / all

Topic

multimodal language model

Latest hit from LLM: Don't Show Pixels, Show Cues: Unlocking Visual Tool Reasoning in Language Models via Perception Programs

0 / 7d, 0 / 30d, 1 / all

Topic

patch generation

Latest hit from Agent Runtime Security: EviACT: An Evidence-to-Action Framework for Agentic Program Repair

0 / 7d, 0 / 30d, 1 / all

Topic

repository level

Latest hit from Terminal and SWE Agents: Dependency-Guided Repository-Level C-to-Rust Translation with Reinforcement Alignment

0 / 7d, 1 / 30d, 1 / all

Topic

runtime security

Latest hit from LLM: ClawGuard: A Runtime Security Framework for Tool-Augmented LLM Agents Against Indirect Prompt Injection

0 / 7d, 0 / 30d, 1 / all

Topic

shell agent

Latest hit from Agent Runtime Security: How Agentic AI Coding Assistants Become the Attacker's Shell

0 / 7d, 0 / 30d, 1 / all

Topic

software engineering agent

Latest hit from Terminal and SWE Agents: Same Signal, Different Semantics: A Cross-Framework Behavioral Analysis of Software Engineering Agents

0 / 7d, 0 / 30d, 1 / all

Topic

terminal bench

Latest hit from LM: What Makes a Good Terminal-Agent Benchmark Task: A Guideline for Adversarial, Difficult, and Legible Evaluation Design

0 / 7d, 0 / 30d, 1 / all

Topic

AI agent security

No hits yet; this page will keep tracking future archives.

0 / 7d, 0 / 30d, 0 / all

Topic

autonomous agent security

No hits yet; this page will keep tracking future archives.

0 / 7d, 0 / 30d, 0 / all

Topic

browser agent security

No hits yet; this page will keep tracking future archives.

0 / 7d, 0 / 30d, 0 / all

Topic

code agent security

No hits yet; this page will keep tracking future archives.

0 / 7d, 0 / 30d, 0 / all

Topic

command line agent

No hits yet; this page will keep tracking future archives.

0 / 7d, 0 / 30d, 0 / all

Topic

function calling security

No hits yet; this page will keep tracking future archives.

0 / 7d, 0 / 30d, 0 / all

Topic

MCP security

No hits yet; this page will keep tracking future archives.

0 / 7d, 0 / 30d, 0 / all

Topic

model context protocol security

No hits yet; this page will keep tracking future archives.

0 / 7d, 0 / 30d, 0 / all

Topic

software engineering benchmark

No hits yet; this page will keep tracking future archives.

0 / 7d, 0 / 30d, 0 / all

Topic

terminal benchmark

No hits yet; this page will keep tracking future archives.

0 / 7d, 0 / 30d, 0 / all

Topic

tool calling security

No hits yet; this page will keep tracking future archives.

0 / 7d, 0 / 30d, 0 / all

Topic

tool-use security

No hits yet; this page will keep tracking future archives.

0 / 7d, 0 / 30d, 0 / all

Topic

untrusted tool

No hits yet; this page will keep tracking future archives.

0 / 7d, 0 / 30d, 0 / all

Papers heating up

These papers reappear across multiple dates or multiple feeds, so they are better suited to a long-term watch list.
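One way the "reappears across multiple dates or multiple feeds" rule could be applied is sketched below. It assumes the archive yields `(title, feed, date)` records and that two distinct dates or two distinct feeds are enough to qualify; the name `persistent_papers` and both thresholds are illustrative guesses, not the page's actual criteria.

```python
from collections import defaultdict


def persistent_papers(records, min_dates=2, min_feeds=2):
    """Return titles seen on at least min_dates dates or in at least min_feeds feeds.

    records: iterable of (title, feed, date_string) tuples (assumed shape).
    """
    dates_by_title = defaultdict(set)
    feeds_by_title = defaultdict(set)
    for title, feed, day in records:
        dates_by_title[title].add(day)
        feeds_by_title[title].add(feed)
    return sorted(
        title
        for title in dates_by_title
        if len(dates_by_title[title]) >= min_dates
        or len(feeds_by_title[title]) >= min_feeds
    )


if __name__ == "__main__":
    sample = [
        ("Paper A", "LM", "2024-01-09"),
        ("Paper A", "LM", "2024-01-10"),
        ("Paper B", "LM", "2024-01-10"),
        ("Paper B", "Agent Runtime Security", "2024-01-10"),
        ("Paper C", "LM", "2024-01-10"),
    ]
    print(persistent_papers(sample))
```

Matching on exact titles is the simplest choice here; a real archive would more likely key on a stable paper identifier so that retitled versions are not counted as separate papers.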