Keyword Tracking

关键词追踪：code agent

这个页面会长期追踪你配置里关心的关键词，并把命中的论文按日期沉淀下来。

返回归档首页查看趋势总览最新 JSON 订阅 RSS

近期走势

最近一次命中来自 Terminal and SWE Agents：How Much Static Structure Do Code Agents Need? A Study of Deterministic Anchoring

2026-06-15

2026-06-16

2026-06-17

2026-06-18

2026-06-19

2026-06-20

2026-06-21

2026-06-22

2026-06-23

2026-06-24

2026-06-25

2026-06-26

2026-06-27

2026-06-28

命中明细

按日期回看匹配到这个关键词的论文标题，并保留来源 feed 信息。

2026-06-26

2026-06-26 13:16:53 (Asia/Shanghai)

Terminal and SWE Agents

How Much Static Structure Do Code Agents Need? A Study of Deterministic Anchoring

查看原始来源

LLM-based code agents navigate repositories through keyword search but miss the structural relationships, such as call graphs, inheritance hierarchies, and configuration dependenc…

2026-06-11

2026-06-11 13:59:12 (Asia/Shanghai)

Terminal and SWE Agents

Agents All the Way Down; A Methodology for Building Custom AI Agents from Substrate to Production

查看原始来源

Custom AI agents areagents that live inside their own application, talk to their own data and tools, enforce their own security boundaries, and carry their own brand and audit tra…

2026-06-10

2026-06-10 13:25:04 (Asia/Shanghai)

Terminal and SWE Agents

AutoPDE: Reliable Agentic PDE Solving via Explicitly Represented Solver Strategies

查看原始来源

Numerical solvers for partial differential equations (PDEs) are core computational tools in science and engineering. Building reliable PDE solvers requires not only executable cod…

Terminal and SWE Agents

DeNovoSWE: Scaling Long-Horizon Environments for Generating Entire Repositories from Scratch

查看原始来源

As the capabilities of LLM-based code agents continue to advance, their expected role is expanding beyond localized bug fixing in existing codebases toward architecting and implem…

2026-06-05

2026-06-05 13:25:00 (Asia/Shanghai)

Terminal and SWE Agents

Asuka-Bench: Benchmarking Code Agents on Underspecified User Intent and Multi-Round Refinement

查看原始来源

Existing code-generation benchmarks score a single mapping from a complete prompt to a one-shot output. However, real web development is different. Users seldom write a full spec…

Terminal and SWE Agents

SmellBench: Towards Fine-Grained Evaluation of Code Agents on Refactoring Tasks

查看原始来源

Code Agents have achieved remarkable advances in recent years, exhibiting strong capabilities across a wide range of software engineering tasks. However, their misuse often produc…

Terminal and SWE Agents

RAT: RunAnyThing via Fully Automated Environment Configuration

查看原始来源

Automating repository-level software engineering tasks is a foundational challenge for autonomous code agents, largely due to the difficulty of configuring executable environments…

2026-06-04

2026-06-04 14:02:06 (Asia/Shanghai)

Terminal and SWE Agents

The Meta-Agent Challenge: Are Current Agents Capable of Autonomous Agent Development?

查看原始来源

Current AI benchmarks evaluate agents on task execution within human-designed workflows. These evaluations fundamentally fail to measure a critical next-level capability: whether…

2026-06-03

2026-06-03 14:09:56 (Asia/Shanghai)

Terminal and SWE Agents

What Makes Interaction Trajectories Effective for Training Terminal Agents?

查看原始来源

Stronger code agents are commonly assumed to be superior teachers for post-training, yet this assumption remains poorly disentangled from task difficulty, harness design, and stud…

Terminal and SWE Agents

Cross-Lingual Token Arbitrage: Optimizing Code Agent Context Windows via Local LLM Preprocessing

查看原始来源

AI-assisted coding agents are bottlenecked by input-token cost. Two pathologies of raw human input drive much of this overhead: tokenization inefficiency for non-English text and…

2026-05-15

2026-05-15 14:57:29 (Asia/Shanghai)

Terminal and SWE Agents

CRANE: Constrained Reasoning Injection for Code Agents via Nullspace Editing

查看原始来源

Code agents must both reason over long-horizon repository state and obey strict tool-use protocols. In paired Instruct/Thinking checkpoints, these capabilities are complementary b…