TREX: Automating LLM Fine-tuning via Agent-Driven Tree-based Exploration

论文概览

While Large Language Models (LLMs) have empowered AI research agents to perform isolated scientific tasks, automating complex, real-world workflows, such as LLM training, remains a significant challenge. In this paper,…

规范主键

arxiv:2604.14116

合并来源

arXiv

作者

Zerun Ma，Guoqiang Wang，Xinchen Xie，Yicheng Chen，He Du，Bowen Li，Yanan Sun，Wenran Liu，Kai Chen，Yining Li

分类

cs.AI, cs.CL

标签

评测 / 应用 / 方法

主题词

Benchmark / Language Model

首次出现

2026-04-16 11:43:00 (UTC+08:00)

个人反馈

把你为什么标记这篇论文、接下来准备怎么处理，直接挂在规范化详情页上。

当前还没有个人反馈，可以先用本地 feedback CLI 补上。

反馈操作

复制规范主键或本地 CLI 命令，把这篇论文快速加入个人反馈状态文件。

行动提醒状态

这里记录这篇论文最近已经触发过哪些 action reason，便于解释为什么今天没有再次提醒。

当前还没有记录过 action 提醒。

来源与外链

优先展示这篇论文在各来源上的规范化入口，再补当前摘要页和 PDF。

arXiv PDF

历史命中

按归档时间回看它在哪些 feed 中出现过，并保留当日 digest 产物入口。

LLM

2026-04-16

2026-04-16 11:43:00 (Asia/Shanghai)

While Large Language Models (LLMs) have empowered AI research agents to perform isolated scientific tasks, automating complex, real-world workflows, such as LLM training, remains…

Score 107 · title matched "agent"；summary matched "benchmark"；summary matched "evaluation"

Markdown JSON 对应 Feed 页

TREX: Automating LLM Fine-tuning via Agent-Driven Tree-based Exploration

论文概览

个人反馈

反馈操作

行动提醒状态

来源与外链

历史命中

2026-04-16

相关推荐

Transforming oncology clinical trial matching through neuro-symbolic, multi-agent AI and an oncology-specific knowledge graph: a prospective evaluation in 3804 patients.

DryRUN: On the Role of Public Tests in LLM-Driven Code Generation

Beyond Function Calling: Benchmarking Tool-Using Agents under Tool-Environment Unreliability

Benchmarking System Dynamics AI Assistants: Cloud Versus Local LLMs on CLD Extraction and Discussion

Tool Attention Is All You Need: Dynamic Tool Gating and Lazy Schema Loading for Eliminating the MCP/Tools Tax in Scalable Agentic Workflows

Cooperative Profiles Predict Multi-Agent LLM Team Performance in AI for Science Workflows