Canonical Paper

ClinicRealm: Re-evaluating large language models with conventional machine learning for non-generative clinical prediction tasks.

这是一篇规范化归档后的论文详情页,聚合了多来源命中、历史出现记录和相关推荐。

相关性

0

当前分数

1 个合并来源

历史跨度

1

个活跃日期

1 个 feed

归档记录

1

次归档出现

1 个合并来源

主题与标签

2

个主题词

2 个标签

论文概览

Large Language Models (LLMs) are increasingly deployed in medicine. However, their utility for non-generative clinical prediction is under-evaluated, and they are often assumed to be inferior to specialized models, crea…

规范主键

pubmed:41951858

合并来源

PubMed

作者

Yinghao Zhu,Junyi Gao,Zixiang Wang,Weibin Liao,Xiaochen Zheng,Lifang Liang,Miguel O Bernabeu,Yasha Wang,Lequan Yu,Chengwei Pan,Ewen M Harrison,Liantao Ma

分类

Journal Article

标签

评测 / 方法

主题词

Language Model / Benchmark

首次出现

2026-04-09 14:51:56 (UTC+08:00)

最近出现

2026-04-09 14:51:56 (UTC+08:00)

覆盖跨度

1 个活跃日期 / 1 个 feed / 1 次归档出现

反馈状态

待跟进

下一步

recheck benchmark framing against classical baselines

最晚处理

2026-04-20

搁置到

未设置

复查周期

每 14 天

个人备注

Recheck whether ClinicRealm still beats classical clinical baselines under the same task framing.

命中原因

未记录

最近行动提醒

未记录

个人反馈

把你为什么标记这篇论文、接下来准备怎么处理,直接挂在规范化详情页上。

备注内容

Recheck whether ClinicRealm still beats classical clinical baselines under the same task framing.

下一步

recheck benchmark framing against classical baselines

最晚处理

2026-04-20

搁置到

未设置

复查周期

每 14 天

反馈操作

复制规范主键或本地 CLI 命令,把这篇论文快速加入个人反馈状态文件。

行动提醒状态

这里记录这篇论文最近已经触发过哪些 action reason,便于解释为什么今天没有再次提醒。

当前还没有记录过 action 提醒。

来源与外链

优先展示这篇论文在各来源上的规范化入口,再补当前摘要页和 PDF。

历史命中

按归档时间回看它在哪些 feed 中出现过,并保留当日 digest 产物入口。

PubMed AI

2026-04-09

2026-04-09 14:51:56 (Asia/Shanghai)

Large Language Models (LLMs) are increasingly deployed in medicine. However, their utility for non-generative clinical prediction is under-evaluated, and they are often assumed to…

Score 0 · 无额外命中原因

相关推荐

基于共享主题、标签和配置关键词做的轻量规则推荐。

Related

Transforming oncology clinical trial matching through neuro-symbolic, multi-agent AI and an oncology-specific knowledge graph: a prospective evaluation in 3804 patients.

共享主题:Benchmark / Language Model;共享标签:评测 / 方法;共享关键词:reasoning / benchmark / evaluation

Score 101PubMed

Related

Benchmarking System Dynamics AI Assistants: Cloud Versus Local LLMs on CLD Extraction and Discussion

共享主题:Benchmark / Language Model;共享标签:评测 / 方法;共享关键词:reasoning / benchmark / evaluation

Score 108arXiv

Related

Taming Actor-Observer Asymmetry in Agents via Dialectical Alignment

共享主题:Benchmark / Language Model;共享标签:评测 / 方法;共享关键词:reasoning / benchmark / language model

Score 145arXiv

Related

Discovering a Shared Logical Subspace: Steering LLM Logical Reasoning via Alignment of Natural-Language and Symbolic Views

共享主题:Benchmark / Language Model;共享标签:评测 / 方法;共享关键词:reasoning / benchmark / language model

Score 130arXiv

Related

Classifying American Society of Anesthesiologists Physical Status With a Low-Rank-Adapted Large Language Model: Development and Validation Study.

共享主题:Benchmark / Language Model;共享标签:评测 / 方法;共享关键词:benchmark / language model / clinical

Score 111PubMed

Related

VLBiasBench: A Comprehensive Benchmark for Evaluating Bias in Large Vision-Language Model.

共享主题:Benchmark / Language Model;共享标签:评测 / 方法;共享关键词:benchmark / evaluation / language model

Score 111PubMed