ClinicRealm: Re-evaluating large language models with conventional machine learning for non-generative clinical prediction tasks.

论文概览

Large Language Models (LLMs) are increasingly deployed in medicine. However, their utility for non-generative clinical prediction is under-evaluated, and they are often assumed to be inferior to specialized models, crea…

规范主键

pubmed:41951858

合并来源

PubMed

作者

Yinghao Zhu，Junyi Gao，Zixiang Wang，Weibin Liao，Xiaochen Zheng，Lifang Liang，Miguel O Bernabeu，Yasha Wang，Lequan Yu，Chengwei Pan，Ewen M Harrison，Liantao Ma

分类

Journal Article

标签

评测 / 方法

主题词

Language Model / Benchmark

首次出现

2026-04-09 14:51:56 (UTC+08:00)

个人反馈

把你为什么标记这篇论文、接下来准备怎么处理，直接挂在规范化详情页上。

备注内容

Recheck whether ClinicRealm still beats classical clinical baselines under the same task framing.

下一步

recheck benchmark framing against classical baselines

最晚处理

2026-04-20

搁置到

未设置

复查周期

每 14 天

反馈操作

复制规范主键或本地 CLI 命令，把这篇论文快速加入个人反馈状态文件。

行动提醒状态

这里记录这篇论文最近已经触发过哪些 action reason，便于解释为什么今天没有再次提醒。

当前还没有记录过 action 提醒。

来源与外链

优先展示这篇论文在各来源上的规范化入口，再补当前摘要页和 PDF。

PubMed

历史命中

按归档时间回看它在哪些 feed 中出现过，并保留当日 digest 产物入口。

PubMed AI

2026-04-09

2026-04-09 14:51:56 (Asia/Shanghai)

Large Language Models (LLMs) are increasingly deployed in medicine. However, their utility for non-generative clinical prediction is under-evaluated, and they are often assumed to…

Score 0 · 无额外命中原因

Markdown JSON 对应 Feed 页

ClinicRealm: Re-evaluating large language models with conventional machine learning for non-generative clinical prediction tasks.

论文概览

个人反馈

反馈操作

行动提醒状态

来源与外链

历史命中

2026-04-09

相关推荐

V-tableR1: Process-Supervised Multimodal Table Reasoning with Critic-Guided Policy Optimization

Benchmarking System Dynamics AI Assistants: Cloud Versus Local LLMs on CLD Extraction and Discussion

Transforming oncology clinical trial matching through neuro-symbolic, multi-agent AI and an oncology-specific knowledge graph: a prospective evaluation in 3804 patients.

The Measurement Gap in the Automation of EU Law: Benchmarking Doctrinal Legal Reasoning under the EU AI Act

CineCap: Structured Reasoning with Spatio-Temporal Anchors for Cinematographic Video Captioning

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards