Keyword Tracking

Keyword tracking: alignment

This page keeps long-term track of the keywords you care about in your configuration and archives the matching papers by date.


Recent trend

The most recent hit came from LM: In-Context Model Predictive Generation: Open-Vocabulary Motion Synthesis from Language Models to Physics

[Trend chart; date axis from 2026-06-15 to 2026-06-28]

Hit details

Review, by date, the titles of papers that matched this keyword, with the source feed information retained.

2026-06-26

2026-06-26 13:16:53 (Asia/Shanghai)

In-Context Model Predictive Generation: Open-Vocabulary Motion Synthesis from Language Models to Physics

View original source

Synthesizing human motion from textual descriptions is essential for immersive digital applications, yet existing methods face a persistent trade-off between semantic fidelity and…

Just how sure are you? Improving Verbalized Uncertainty Calibration in Medical VQA

View original source

Multimodal large language models (MLLMs) applied to Medical Visual Question Answering (VQA) tend to produce overconfident outputs regardless of actual correctness, and existing ve…

2026-06-25

2026-06-25 13:11:21 (Asia/Shanghai)

RAS: Measuring LLM Safety Through Refusal Alignment

View original source

Safety evaluation of large language models (LLMs) is commonly performed by querying models with unsafe or jailbreak prompts and judging whether their outputs violate a safety poli…

SpeechEQ: Benchmarking Emotional Intelligence Quotient in Socially Aware Voice Conversational Models

View original source

As multimodal conversational systems increasingly engage in spoken interaction, their ability to navigate paralinguistic social cues has become a critical bottleneck for natural h…

SARA: Unlocking Multilingual Knowledge in Mixture-of-Experts via Semantically Anchored Routing Alignment

View original source

Sparse Mixture-of-Experts (MoE) architectures have emerged as an increasingly influential paradigm as they offer a strategic balance between parameter scalability and computationa…

Agent Runtime Security

The Unfireable Safety Kernel: Execution-Time AI Alignment for AI Agents and Other Escapable AI Systems

View original source

AI agents are granted access to tools, APIs, and other infrastructure, making them active principals in those systems. The dominant approach places controls inside the agent's own…

2026-06-24

2026-06-24 13:06:49 (Asia/Shanghai)

EG-VQA: Benchmarking Verifiable Video Question Answering with Grounded Temporal Evidence

View original source

Recent advances in Video Large Language Models (Video-LLMs) have yielded promising performance on video question answering (VideoQA). Nevertheless, existing benchmarks are predomi…

Agent Runtime Security

PHANTOM: A Large-Scale Dataset of Multimodal Adversarial Attacks for Vision-Language Models

View original source

We introduce a large-scale, open-source dataset of pre-generated adversarial attacks for vision-language models (VLMs). The dataset is designed to be diverse, representative, and…

2026-06-23

2026-06-23 13:10:02 (Asia/Shanghai)

Distribution-Aware Diffusion-LLM for Robust Ultra-Long-Term Time Series Forecasting

View original source

Time series forecasting is a fundamental machine learning task. Recent work has explored Large Language Models (LLMs) for this purpose due to their strong generalization, pattern…

On the Limits of Prompt-Conditioned Language Models as General-Purpose Learners

View original source

Large Language Models (LLMs) are frequently portrayed as general-purpose solvers capable of solving arbitrary tasks. We argue that this view overlooks a fundamental constraint: la…

Measuring & Mitigating Over-Alignment for LLMs in Multilingual Criminal Law Courts

View original source

While the wider applicability of LLMs in the legal field is currently debated due to their reliability and the gravity of any errors, narrow uses with well-understood and mitigate…

Agent Runtime Security

Capable but Careless: Do Computer-Use Agents Follow Contextual Integrity?

View original source

Computer-use agents (CUAs) now act on a user's behalf across personal applications such as email, calendars, and to-do lists. This cross-application access is useful, but it also…

2026-06-19

2026-06-19 14:26:15 (Asia/Shanghai)

Your Mouse and Eyes Secretly Leak Your Preference: LLM Alignment using Implicit Feedback from Users

View original source

To align a Large Language Model (LLM), most existing methods collect explicit human feedback and train a reward model to predict the human preference based on the response text. T…

2026-06-18

2026-06-18 14:03:08 (Asia/Shanghai)

Quantifying and Auditing LLM Evaluation via Positive-Unlabeled Learning

View original source

Large Language Models (LLMs) are increasingly used as judges for scalable evaluation, yet such LLM-as-a-Judge systems exhibit systematic biases that are decoupled from semantic…

Beyond Tokenization: Direct Timestep Embedding and Contrastive Alignment for Time-Series Question Answering

View original source

Recent advances in large language models (LLMs) have given rise to time-series question answering (TSQA), which formulates time-series analysis as natural-language question answer…

G-IdiomAlign: A Gloss-Pivoted Benchmark for Cross-Lingual Idiom Alignment

View original source

Idioms are difficult to transfer across languages due to their non-compositionality and weak surface-form grounding, making literal mappings unreliable. We present G-IdiomAlign, a…

Beyond Safe Data: Pretraining-Stage Alignment with Regular Safety Reflection

View original source

To achieve deeper safety alignment for large language models (LLMs), recent efforts have studied how to push safety interventions earlier into the pretraining stage, primarily by…

2026-06-17

2026-06-17 14:22:19 (Asia/Shanghai)

LLM Consumer Behavior Theory: Foundations of a Novel Research Field

View original source

Large language models (LLMs) are increasingly deployed as autonomous agents that make consumption decisions on behalf of users. This shift raises fundamental questions for consume…

RubricsTree: Scalable and Evolving Open-Ended Evaluation of Personal Health Agents across Health Memory and Medical Skills

View original source

The LLM-empowered personal health agents with user health (sensor) metrics have offered a promising pathway to alleviate global disparities in healthcare access. However, large-sc…

ProvenanceGuard: Source-Aware Factuality Verification for MCP-Based LLM Agents

View original source

Tool-using LLM agents increasingly use the Model Context Protocol (MCP) to answer from heterogeneous evidence sources, including search, APIs, databases, clinical records, and for…

2026-06-16

2026-06-16 14:38:43 (Asia/Shanghai)

LabOSBench: Benchmarking Computer Use Agents for Scientific Instrument Control

View original source

Current computer-use benchmarks primarily focus on software operation tasks in virtualized systems, whereas scientific instrumentation scenarios require coordinated control over c…

Contrastive-Difference CKA Reveals Concept-Specific Structural Alignment Across Language Model Architectures

View original source

Do different LLM architectures encode high-level concepts in structurally compatible ways? We systematically characterize a geometric-functional universality dissociation: across…

Agent Runtime Security

DoubtProbe: Black-Box Jailbreak Defense via Structural Verification and Semantic Auditing

View original source

As large language models (LLMs) are increasingly deployed in user-facing systems, black-box jailbreak defense has become an important practical problem. Existing defenses often re…

Agent Runtime Security

Adaptive and Explicit safe: Triggering Latent Safety Awareness in Large Reasoning Models

View original source

While Large Reasoning Models (LRMs) excel at complex tasks, they remain highly vulnerable to sophisticated jailbreaks and direct harmful queries. To address this vulnerability, pr…

2026-06-12

2026-06-12 13:55:02 (Asia/Shanghai)

Leveraging Audio-LLMs to Filter Speech-to-Speech Training Data

View original source

Large-scale mined corpora provide abundant training data for end-to-end speech-to-speech translation (S2ST) but may contain noise, misalignment, and semantic errors. Filtering noi…

2026-06-11

2026-06-11 13:59:12 (Asia/Shanghai)

OpenMedReason: Scientific Reasoning Supervision for Medical Vision-Language Models

View original source

High-stakes clinical use of large vision-language models (LVLMs) requires reasoning that is grounded in visual evidence and clinical knowledge, not just correct final answers. We…

ALIGNBEAM: Inference-Time Alignment Transfer via Cross-Vocabulary Logit Mixing

View original source

Domain fine-tuning degrades the safety of large language models: fine-tuned specialists readily comply with harmful prompts framed in domain language. Existing inference-time defe…

Which Speech Representation Better Matches Text-Native Reasoning? A Study of Speech-Text Alignment on Frame Rate and Representation

View original source

Spoken dialogue models typically start from text LLM backbones, yet reasoning often degrades when conditioning on speech instead of text. We attribute part of this modality gap to…

Agent Runtime Security

Grammar-Constrained Decoding Can Jailbreak LLMs into Generating Malicious Code

View original source

Large Language Models (LLMs) are increasingly used for code generation, raising concerns that they may be misused to produce malicious code. Meanwhile, Grammar-Constrained Decodin…

2026-06-10

2026-06-10 13:25:04 (Asia/Shanghai)

Role-Agent: Bootstrapping LLM Agents via Dual-Role Evolution

View original source

Although Large Language Model (LLM) agents have demonstrated strong performance on complex tasks, their learning is often limited by inefficient interaction feedback and static tr…

The Shibboleth Effect: Auditing the Cross-Lingual Distributional Skew of Large Language Models

View original source

This study investigates cross-lingual distributional skew (the Shibboleth Effect) in frontier large language models (LLMs) subjected to sustained adversarial conditions. We develo…

Does Reasoning Preserve Alignment? On the Trustworthiness of Large Reasoning Models

View original source

Instruction-tuned LLMs are increasingly converted into reasoning models through post-training to improve multi-step task performance. This conversion is usually optimized for reas…

Null-Space Constrained Low-Rank Adaptation for Response-Specified Large Language Model Unlearning

View original source

Large language model unlearning aims to suppress designated undesirable knowledge while preserving benign capabilities. Many unlearning objectives focus on suppressing undesired a…

Agent Runtime Security

It Takes One to Bias Them All: Breaking Bad with One-Shot GRPO

View original source

Warning: This paper contains several toxic and offensive statements. Modern large language models (LLMs) are typically aligned through large-scale post-training to ensure fair and…

Agent Runtime Security

When the Chain of Thought Knows Better: Failure Modes in Multi-Turn Reasoning Models

View original source

Failures in multi-turn reasoning models are largely invisible to terminal-score evaluation. A model can lock onto an unsafe stance early in a long dialogue, yet its final-turn ref…

2026-06-09

2026-06-09 13:12:49 (Asia/Shanghai)

Gradient-Guided Reward Optimization for Inference-time Alignment

View original source

Ensuring the reliability of Large Language Models (LLMs) under distribution drift requires inference-time adaptation. While inference-time alignment methods such as Best-of-N an…

IS-CoT: Breaking the Long-form Generation Collapse via Interleaved Structural Thinking

View original source

Generating coherent and controllable long-form content remains a persistent challenge for Large Language Models (LLMs). While reasoning-enhanced models have demonstrated success i…

The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model

View original source

The ambition behind alignment training is to make large language models safe and useful. The primary mechanism, reinforcement learning from human feedback (RLHF), shapes the behav…

UXBench: Benchmarking User Experience in AI Assistants

View original source

As AI assistants serve millions of users daily, evaluating user experience (UX) beyond general model capability has become increasingly important. We present UXBench, the first us…

2026-06-05

2026-06-05 13:25:00 (Asia/Shanghai)

CollabSim: A CSCW-Grounded Methodology for Investigating Collaborative Competence of LLM Agents through Controlled Multi-Agent Experiments

View original source

Multi-agent systems (MAS) built on large language models have shown growing promise, with their effectiveness resting on agents' ability to coordinate through text-based channels…

Beyond tokens: a unified framework for latent communication in LLM-based multi-agent systems

View original source

Multi-agent systems built on large language models (LLMs) have become a prevailing paradigm for tackling complex reasoning, planning, and tool-use tasks. The dominant communicatio…

A Komi-Yazva–Russian Parallel Corpus and Evaluation Protocol for Zero- and Few-Shot LLM Translation

View original source

We present the first Komi-Yazva–Russian parallel corpus together with an explicit evaluation protocol for studying LLM translation in an endangered, extremely low-resource settin…

Agent Runtime Security

Safety Paradox: How Enhanced Safety Awareness Leaves LLMs Vulnerable to Posterior Attack

View original source

Large language models (LLMs) are rigorously aligned to refuse harmful requests, a process that inherently cultivates a latent capacity to evaluate and recognize unsafe content. In…

Agent Runtime Security

The Granularity Gap: A Multi-Dimensional Longitudinal Audit of Sycophancy in Gemini Models

View original source

Large language models are increasingly deployed as high-stakes advisors, yet standard alignment benchmarks treat sycophancy as a binary failure mode. We introduce the Granularity…

2026-06-04

2026-06-04 14:02:06 (Asia/Shanghai)

Large Language Models in K-12 Education: Alignment with State Curriculum Standards and Student Personas

View original source

As Large Language Models (LLMs) become increasingly popular in educational settings, they raise important questions about the ethical implications of their use. Publicly available…

GRAIL: Gradient-Reweighted Advantages for Reinforcement Learning with Verifiable Rewards

View original source

Reinforcement learning with verifiable rewards (e.g. GRPO) is now a common way to improve mathematical reasoning in Large Language Models (LLMs). However, current methods usually…

Terminal and SWE Agents

The Meta-Agent Challenge: Are Current Agents Capable of Autonomous Agent Development?

View original source

Current AI benchmarks evaluate agents on task execution within human-designed workflows. These evaluations fundamentally fail to measure a critical next-level capability: whether…

Terminal and SWE Agents

Trustworthy AI Software Engineers

View original source

With the rapid rise of AI coding agents, the fundamental premise of what it means to be a software engineer is in question. In this vision paper, we examine what it means for an A…

2026-06-03

2026-06-03 14:09:56 (Asia/Shanghai)

Exploring Adversarial Robustness and Safety Alignment in Multilingual Multi-Modal Large Language Models

View original source

Multimodal Large Language Models integrate visual perception into language reasoning, introducing a continuous attack surface susceptible to adversarial attacks. Prior work on MLL…

Can Factual Opinions Be Edited (Manipulated) in Large Language Models?

View original source

Large Language Models (LLMs) are increasingly integrated into various domains, making knowledge editing techniques crucial yet potentially hazardous. Current editing methods prima…

Fully Automated Identification of Lexical Alignment and Preference-Stage Shifts in Large Language Models

View original source

The language used by digital chat assistants such as ChatGPT can diverge from human expectations (misalignment). Research, mostly on Scientific English, has described both what di…

Selective Token-Level Cryptographic Redaction for Privacy-Preserving Clinical Deployment of Large Language Models

View original source

While large language models (LLMs) are increasingly used for clinical applications, many existing pipelines require sending raw sensitive health information to remote servers for…

Hallucinations as Orthogonal Noise: Inference-Time Manifold Alignment via Dynamic Contextual Orthogonalization

View original source

Hallucination in Large Language Models (LLMs), characterized by the generation of content inconsistent with contextual facts or logical constraints, remains a persistent challen…

Agent Runtime Security

Which Defense Closes Which Threat? Attributing OWASP-LLM-Top-10 Coverage and Its Brittleness Under Paraphrasing

View original source

Production LLM applications stack several defense families (refusal-phrase filters, token-budget controls, model allowlists, rate limits, tool-registry authentication), yet exi…

Terminal and SWE Agents

Dependency-Guided Repository-Level C-to-Rust Translation with Reinforcement Alignment

View original source

Automating C-to-Rust migration is critical for improving software security without sacrificing performance. Traditional rule-based methods struggle with diverse C idioms, often pr…

Terminal and SWE Agents

Automated Repair of Requirements for Cyber-Physical Systems in Simulink Requirements Tables

View original source

The development of complex software systems, e.g., cyber-physical systems (CPSs), involves continuous evolution of both system implementations and their requirements. These two ar…

2026-06-02

2026-06-02 13:56:35 (Asia/Shanghai)

Mitigating Perceptual Judgment Bias in Multimodal LLM-as-a-Judge via Perceptual Perturbation and Reward Modeling

View original source

Recent multimodal large language models have demonstrated strong reasoning ability, yet their reliability as automated evaluators remains limited by a critical weakness: when visu…

SafeSteer: Localized On-Policy Distillation for Efficient Safety Alignment

View original source

Aligning Large Language Models (LLMs) with human values often degrades their general capabilities, termed the alignment tax. Existing methods mitigate this by balancing dual objec…

Agent Runtime Security

Jailbreaking Multimodal Large Language Models using Multi-Clip Video

View original source

As multimodal large language models (MLLMs) have advanced to process video inputs, concerns have emerged about their potential for malicious misuse. Prior jailbreak studies have s…

2026-05-29

2026-05-29 13:18:32 (Asia/Shanghai)

CIRF: Tokenizing Chain-of-Thoughts into Reusable Functional Units for Efficient Latent Reasoning in Large Language Models

View original source

Implicit Chain-of-Thought (CoT) reduces the inference cost of large language models by internalizing the explicit rationales. However, existing approaches typically lack alignment…

Argument Quality Assessment with Large Language Models: A Pairwise Bradley-Terry Approach

View original source

Large Language Models (LLMs) have demonstrated remarkable capabilities in tasks related to reasoning and judgment. However, assessing the quality of arguments requires a rigorous…

Modeling Community Attitude through Reaction Tone: A Human-AI Collaborative Framework for Evaluating LLM Alignment with Linguistic Behaviors in Online Communities

View original source

Large language models (LLMs) are increasingly utilized as proxies for computational social analysis; yet, their ability to faithfully represent the "thick descriptions" (Geertz, 1…

Framing Matters: Addressing Framing Sensitivity in Decision-Making through Behaviorally-Grounded Value Alignment

View original source

Large Language Models (LLMs) are increasingly deployed in high-stakes decision-making settings such as legal reasoning, where consistency under factually equivalent inputs is crit…

HELEA: Hard-Negative Benchmark and LLM-based Reranking for Robust Entity Alignment

View original source

Entity Alignment (EA) is essential for knowledge graph (KG) fusion, but existing benchmarks often allow models to exploit name overlap rather than relational structure. This makes…

Agent Runtime Security

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

View original source

Modern open-world agents such as OpenClaw exhibit powerful cross-environment execution capabilities yet introduce broad new safety risk sources. Meanwhile, advanced frontier AI mo…

2026-05-28

2026-05-28 13:15:52 (Asia/Shanghai)

MemTrace: Tracing and Attributing Errors in Large Language Model Memory Systems

View original source

Memory is essential for enabling large language models to support long-horizon reasoning, yet existing memory systems remain unreliable and difficult to debug. Tracing memory's dy…

MUSE: Benchmarking Manufacturable, Functional, and Assemblable Text-to-CAD Generation

View original source

Large language models (LLMs) have recently advanced text-driven 3D generation, yet Text-to-CAD remains far from supporting industrial product design. Existing benchmarks focus pri…

VLMs May Not Globally Enhance Human Alignment over LLMs During Natural Reading

View original source

Large language models (LLMs) have become increasingly useful computational models of human language processing, but it remains unclear whether vision-language learning makes text…

Evaluating the Realism of LLM-powered Social Agents: A Case Study of Reactions to Spanish Online News

View original source

LLM-powered social agents are increasingly used to simulate online social behavior, yet their realism remains difficult to validate. Existing work has largely relied on general-pu…

2026-05-27

2026-05-27 13:23:19 (Asia/Shanghai)

MATCHA: Matching Text via Contrastive Semantic Alignment

View original source

Reliable evaluation is essential for understanding large language model (LLM) performance, yet today's go-to metrics, namely token-overlap scores (e.g., ROUGE) and embedding-based…

ENPMR-Bench: Benchmarking Proactive Memory Retrieval for Emotional Support Agents

View original source

Memory-augmented language agents are increasingly deployed in affective applications such as emotional support, where understanding and responding to users' latent emotional needs…

It's Not Always Sycophancy: Measuring LLM Conformity as a Function of Epistemic Uncertainty

View original source

Large language models (LLMs) are known to abandon their initial stance to conform to user pushback. While prior research largely attributes this behavior to sycophancy learned dur…

2026-05-26

2026-05-26 13:09:24 (Asia/Shanghai)

Causal methods for LLM development and evaluation

View original source

Large language model (LLM) development is currently driven by large-scale empirical iteration over data mixtures, reward models, routing strategies, and evaluation pipelines. Here…

PolyGnosis 2.0: Enhancing LLM Reasoning via Agentic Harness Engineering for Polymarket and OSINT Insight Extraction

View original source

This paper introduces PolyGnosis 2.0, a pioneering multi-agent architecture designed to extract predictive intelligence by synthesizing Polymarket anomaly signals with global Open…

MAGIC: Multimodal Alignment & Grounding-aware Instruction Coreset for Vision-Language Models

View original source

Instruction tuning of large vision-language models (LVLMs) increasingly depends on massive multimodal corpora, yet these datasets contain samples with substantial redundancy, low…

2026-05-22

2026-05-22 13:08:19 (Asia/Shanghai)

Agentic CLEAR: Automating Multi-Level Evaluation of LLM Agents

View original source

Agentic systems are becoming more capable: agents define strategies, take actions, and interact with different environments. This autonomy poses serious challenges for overseeing…

ChronoMedKG: A Temporally-Grounded Biomedical Knowledge Graph and Benchmark for Clinical Reasoning

View original source

Biomedical knowledge graphs (KGs) treat disease associations as static facts, but temporal information is crucial for clinical reasoning, e.g., a symptom diagnostic of one disease…

LANG: Reinforcement Learning for Multilingual Reasoning with Language-Adaptive Hint Guidance

View original source

Reinforcement learning has proven effective for enhancing multi-step reasoning in large language models (LLMs), yet its benefits have not fully translated to multilingual contexts…

From Parameters to Data: A Task-Parameter-Guided Fine-Tuning Pipeline for Efficient LLM Alignment

View original source

Adapting Large Language Models (LLMs) to specialized domains typically incurs high data and computational overhead. While prior efficiency efforts have largely treated data select…

Planning in the LLM Era: Building for Reliability and Efficiency

View original source

Growing attention to intelligent agents has put a spotlight on one of their central capabilities: planning. Early attempts to leverage large language models (LLMs) for planning re…

Cross-Lingual Consensus: Aligning Multilingual Cultural Knowledge via Multilingual Self-Consistency

View original source

Although Large Language Models (LLMs) demonstrate strong capabilities across various tasks, they exhibit significant performance discrepancies across languages. While prompting LL…

Polite on the Surface, Wrong in Practice: A Curated Dataset for Fixing Honorific Failures in Multilingual Bangla Generation

View original source

Recent advances in Multilingual Large Language Models (MLLMs) have significantly enhanced cross-lingual conversational capabilities, yet modeling culturally nuanced and context-de…

2026-05-21

2026-05-21 13:14:24 (Asia/Shanghai)

Federated LoRA Fine-Tuning for LLMs via Collaborative Alignment

View original source

Low-rank adaptation (LoRA) has emerged as a powerful tool for parameter-efficient fine-tuning of large language models (LLMs). This paper studies LoRA under a federated learning s…

2026-05-20

2026-05-20 13:10:58 (Asia/Shanghai)

SciCustom: A Framework for Custom Evaluation of Scientific Capabilities in Large Language Models

View original source

Large language models (LLMs) are increasingly applied to scientific research, yet existing evaluations often fail to reflect the fine-grained capabilities required in practice. Mo…

LambdaPO: A Lambda Style Policy Optimization for Reasoning Language Models

View original source

Group Relative Policy Optimization (GRPO) has become a cornerstone of modern reinforcement learning alignment, prized for its efficacy in foregoing an explicit value-critic by leve…

Agent Runtime Security

Robotics-Inspired Guardrails for Foundation Models in Socially Sensitive Domains

View original source

Foundation models are increasingly deployed in socially sensitive domains such as education, mental health, and caregiving, where failures are often cumulative and context-depende…

Agent Runtime Security

SimGym: A Framework for A/B Test Simulation in E-Commerce with Traffic-Grounded VLM Agents

View original source

A/B testing remains the gold standard for evaluating modifications to e-commerce storefronts, yet it diverts traffic, requires weeks to reach statistical significance, and risks d…

2026-05-19

2026-05-19 13:08:04 (Asia/Shanghai)

CrossView Suite: Harnessing Cross-view Spatial Intelligence of MLLMs with Dataset, Model and Benchmark

View original source

Spatial intelligence requires multimodal large language models (MLLMs) to move beyond single-view perception and reason consistently about objects, visibility, geometry, and inter…

Estimating Item Difficulty with Large Language Models as Experts

View original source

Accurate estimates of item difficulty are essential for valid assessment and effective adaptive learning. However, for newly created tasks, response data are typically unavailable…

Ancient Greek to Modern Greek Machine Translation: A Novel Benchmark and Fine-Tuning Experiments on LLMs and NMT Models

View original source

Machine Translation (MT) for Ancient Greek (AG) to Modern Greek (MG) is a low-resource task, constrained by the lack of large-scale, high-quality parallel data. We address this ga…

AMR-SD: Asymmetric Meta-Reflective Self-Distillation for Token-Level Credit Assignment

View original source

The alignment of Large Language Models (LLMs) for complex reasoning heavily relies on Reinforcement Learning with Verifiable Rewards (RLVR). However, standard algorithms like GRPO…

Query-Conditioned Knowledge Alignment for Reliable Cross-System Medical Reasoning

View original source

Cross-domain knowledge alignment is essential for integrating heterogeneous medical systems, yet existing approaches typically treat entity alignment as a static matching problem,…

Agent Runtime Security

Overeager Coding Agents: Measuring Out-of-Scope Actions on Benign Tasks

View original source

Coding agents now run autonomously with shell, file, and network privileges. When a user issues a benign request, the agent sometimes does more than asked: it deletes unrelated fi…

Agent Runtime Security

Acoustic Interference: A New Paradigm Weaponizing Acoustic Latent Semantic for Universal Jailbreak against Large Audio Language Models

View original source

The integration of audio modality into Large Audio Language Models (LALMs) significantly expands their attack surface. Existing jailbreak paradigms predominantly treat audio as a…

2026-05-15

2026-05-15 14:57:29 (Asia/Shanghai)

Talk is (Not) Cheap: A Taxonomy and Benchmark Coverage Audit for LLM Attacks

View original source

We introduce a reusable framework for auditing whether LLM attack benchmarks collectively cover the threat surface: a 4×6 Target × Technique matrix grounded in STRID…

Terminal and SWE Agents

Comparing Developer and LLM Biases in Code Evaluation

View original source

As LLMs are increasingly used as judges in code applications, they should be evaluated in realistic interactive settings that capture partial context and ambiguous intent. We pres…

2026-05-13

2026-05-13 12:54:34 (Asia/Shanghai)

ORCE: Order-Aware Alignment of Verbalized Confidence in Large Language Models

View original source

Large language models (LLMs) often produce answers with high certainty even when they are incorrect, making reliable confidence estimation essential for deployment in real-world s…

Fill the GAP: A Granular Alignment Paradigm for Visual Reasoning in Multimodal Large Language Models

View original source

Visual latent reasoning lets a multimodal large language model (MLLM) create intermediate visual evidence as continuous tokens, avoiding external tools or image generators. Howeve…

Question Difficulty Estimation for Large Language Models via Answer Plausibility Scoring

View original source

Estimating question difficulty is a critical component in evaluating and improving large language models (LLMs) for question answering (QA). Existing approaches often rely on read…

Pretraining Exposure Explains Popularity Judgments in Large Language Models

View original source

Large language models (LLMs) exhibit systematic preferences for well-known entities, a phenomenon often attributed to popularity bias. However, the extent to which these preferenc…

2026-05-12

2026-05-12 12:42:08 (Asia/Shanghai)

Conformity Generates Collective Misalignment in AI Agents Societies

View original source

Artificial intelligence safety research focuses on aligning individual language models with human values, yet deployed AI systems increasingly operate as interacting populations w…

DGPO: Beyond Pairwise Preferences with Directional Consistent Groupwise Optimization

View original source

Although Large Language Models (LLMs) have made remarkable progress, current preference optimization methods still struggle to align directional consistency while preserving reaso…

Agent Runtime Security

Intrinsic Guardrails: How Semantic Geometry of Personality Interacts with Emergent Misalignment in LLMs

View original source

Fine-tuning Large Language Models (LLMs) on benign narrow data can sometimes induce broad harmful behaviors, a vulnerability termed emergent misalignment (EM). While prior work li…

2026-05-08

2026-05-08 14:15:32 (Asia/Shanghai)

Measuring Evaluation-Context Divergence in Open-Weight LLMs: A Paired-Prompt Protocol with Pilot Evidence of Alignment-Pipeline-Specific Heterogeneity

View original source

Safety benchmarks are routinely treated as evidence about how a language model will behave once deployed, but this inference is fragile if behavior depends on whether a prompt loo…

Evaluation Awareness in Language Models Has Limited Effect on Behaviour

View original source

Large reasoning models (LRMs) sometimes note in their chain of thought (CoT) that they may be under evaluation. Researchers worry that this verbalised evaluation awareness (VEA) c…

2026-05-07

2026-05-07 12:38:06 (Asia/Shanghai)

Misaligned by Reward: Socially Undesirable Preferences in LLMs

View original source

Reward models are a key component of large language model alignment, serving as proxies for human preferences during training. However, existing evaluations focus primarily on bro…

Why Expert Alignment Is Hard: Evidence from Subjective Evaluation

View original source

Aligning large language models with expert judgment is especially difficult in subjective evaluation tasks, where experts may disagree, rely on tacit criteria, and change their ju…

2026-05-06

2026-05-06 12:37:23 (Asia/Shanghai)

MOSAIC-Bench: Measuring Compositional Vulnerability Induction in Coding Agents

View original source

Coding agents often pass per-prompt safety review yet ship exploitable code when their tasks are decomposed into routine engineering tickets. The challenge is structural: existing…

Nora: Normalized Orthogonal Row Alignment for Scalable Matrix Optimizer

View original source

Matrix-based optimizers have demonstrated immense potential in training Large Language Models (LLMs), however, designing an ideal optimizer remains a formidable challenge. A super…

2026-05-05

2026-05-05 12:20:54 (Asia/Shanghai)

MTA: Multi-Granular Trajectory Alignment for Large Language Model Distillation

View original source

Knowledge distillation is a key technique for compressing large language models (LLMs), but most existing methods align representations at fixed layers or token-level outputs, ign…

SRA: Span Representation Alignment for Large Language Model Distillation

View original source

Cross-Tokenizer Knowledge Distillation (CTKD) enables knowledge transfer between a large language model and a smaller student, even when they employ different tokenizers. While ex…

2026-05-01

2026-05-01 12:53:56 (Asia/Shanghai)

Exploration Hacking: Can LLMs Learn to Resist RL Training?

View original source

Reinforcement learning (RL) has become essential to the post-training of large language models (LLMs) for reasoning, agentic capabilities and alignment. Successful RL relies on su…

Agent-Agnostic Evaluation of SQL Accuracy in Production Text-to-SQL Systems

View original source

Text-to-SQL (T2SQL) evaluation in production environments poses fundamental challenges that existing benchmarks do not address. Current evaluation methodologies whether rule-based…

Design Structure Matrix Modularization with Large Language Models

View original source

Design Structure Matrix (DSM) modularization, the task of partitioning system elements into cohesive modules, is a fundamental combinatorial challenge in engineering design. Tradi…

2026-04-29

2026-04-29 12:26:28 (Asia/Shanghai)

LLM-ReSum: A Framework for LLM Reflective Summarization through Self-Evaluation

View original source

Reliable evaluation of large language model (LLM)-generated summaries remains an open challenge, particularly across heterogeneous domains and document lengths. We conduct a compr…

DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios

View original source

Real-world data visualization (DV) requires native environmental grounding, cross-platform evolution, and proactive intent alignment. Yet, existing benchmarks often suffer from co…

Think Before You Act -- A Neurocognitive Governance Model for Autonomous AI Agents

View original source

The rapid deployment of autonomous AI agents across enterprise, healthcare, and safety-critical environments has created a fundamental governance gap. Existing approaches, runtime…

Progressing beyond Art Masterpieces or Touristic Clichés: how to assess your LLMs for cultural alignment?

View original source

Although the cultural (mis)alignment of Large Language Models (LLMs) has attracted increasing attention -- often framed in terms of cultural bias -- until recently there has been…

Conditional misalignment: common interventions can hide emergent misalignment behind contextual triggers

View original source

Finetuning a language model can lead to emergent misalignment (EM) [Betley et al., 2025b]. Models trained on a narrow distribution of misaligned behavior generalize to more egregi…

2026-04-24

2026-04-24 11:46:20 (Asia/Shanghai)

LLM

Transient Turn Injection: Exposing Stateless Multi-Turn Vulnerabilities in Large Language Models

View original source

Large language models (LLMs) are increasingly integrated into sensitive workflows, raising the stakes for adversarial robustness and safety. This paper introduces Transient Turn I…

LLM

Inferring High-Level Events from Timestamped Data: Complexity and Medical Applications

View original source

In this paper, we develop a novel logic-based approach to detecting high-level temporally extended events from timestamped data and background knowledge. Our framework employs log…

Vision

KD-CVG: A Knowledge-Driven Approach for Creative Video Generation

View original source

Creative Generation (CG) leverages generative models to automatically produce advertising content that highlights product features, and it has been a significant focus of recent r…

PubMed AI

GATE: Graph and Text Exchange for Zero-Shot ECG Classification with LLM Prompts.

View original source

Electrocardiography (ECG) is a fundamental tool for diagnosing cardiovascular diseases, yet the scarcity of large-scale annotated data limits the applicability of supervised learn…

2026-04-23

2026-04-23 11:42:13 (Asia/Shanghai)

LLM

V-tableR1: Process-Supervised Multimodal Table Reasoning with Critic-Guided Policy Optimization

View original source

We introduce V-tableR1, a process-supervised reinforcement learning framework that elicits rigorous, verifiable reasoning from multimodal large language models (MLLMs). Current ML…

LLM

ONOTE: Benchmarking Omnimodal Notation Processing for Expert-level Music Intelligence

View original source

Omnimodal Notation Processing (ONP) represents a unique frontier for omnimodal AI due to the rigorous, multi-dimensional alignment required across auditory, visual, and symbolic d…

LLM

Where and What: Reasoning Dynamic and Implicit Preferences in Situated Conversational Recommendation

View original source

Situated conversational recommendation (SCR), which utilizes visual scenes grounded in specific environments and natural language dialogue to deliver contextually appropriate reco…

LLM

Relative Principals, Pluralistic Alignment, and the Structural Value Alignment Problem

View original source

The value alignment problem for artificial intelligence (AI) is often framed as a purely technical or normative challenge, sometimes focused on hypothetical future systems. I argu…

LLM

Can "AI" Be a Doctor? A Study of Empathy, Readability, and Alignment in Clinical LLMs

View original source

Large Language Models (LLMs) are increasingly deployed in healthcare, yet their communicative alignment with clinical standards remains insufficiently quantified. We conduct a mul…

Vision

Physics-Informed Conditional Diffusion for Motion-Robust Retinal Temporal Laser Speckle Contrast Imaging

View original source

Retinal laser speckle contrast imaging (LSCI) is a noninvasive optical modality for monitoring retinal blood flow dynamics. However, conventional temporal LSCI (tLSCI) reconstruct…

2026-04-22

2026-04-22 11:37:03 (Asia/Shanghai)

LLM

Four-Axis Decision Alignment for Long-Horizon Enterprise AI Agents

View original source

Long-horizon enterprise agents make high-stakes decisions (loan underwriting, claims adjudication, clinical review, prior authorization) under lossy memory, multi-step reasoning,…

LLM

Taming Actor-Observer Asymmetry in Agents via Dialectical Alignment

View original source

Large Language Model agents have rapidly evolved from static text generators into dynamic systems capable of executing complex autonomous workflows. To enhance reliability, multi-…

LLM

Discovering a Shared Logical Subspace: Steering LLM Logical Reasoning via Alignment of Natural-Language and Symbolic Views

View original source

Large Language Models (LLMs) still struggle with multi-step logical reasoning. Existing approaches either purely refine the reasoning chain in natural language form or attach a sy…

LLM

Beyond Rating: A Comprehensive Evaluation and Benchmark for AI Reviews

View original source

The rapid adoption of Large Language Models (LLMs) has spurred interest in automated peer review; however, progress is currently stifled by benchmarks that treat reviewing primari…

LLM

SafetyALFRED: Evaluating Safety-Conscious Planning of Multimodal Large Language Models

View original source

Multimodal Large Language Models are increasingly adopted as autonomous agents in interactive environments, yet their ability to proactively address safety hazards remains insuffi…

LLM

Lost in Translation: Do LVLM Judges Generalize Across Languages?

View original source

Automatic evaluators such as reward models play a central role in the alignment and evaluation of large vision-language models (LVLMs). Despite their growing importance, these eva…

Vision

Diff-SBSR: Learning Multimodal Feature-Enhanced Diffusion Models for Zero-Shot Sketch-Based 3D Shape Retrieval

View original source

This paper presents the first exploration of text-to-image diffusion models for zero-shot sketch-based 3D shape retrieval (ZS-SBSR). Existing sketch-based 3D shape retrieval metho…

Vision

MMControl: Unified Multi-Modal Control for Joint Audio-Video Generation

View original source

Recent advances in Diffusion Transformers (DiTs) have enabled high-quality joint audio-video generation, producing videos with synchronized audio within a single model. However, e…

2026-04-21

2026-04-21 11:40:46 (Asia/Shanghai)

LLM

StepPO: Step-Aligned Policy Optimization for Agentic Reinforcement Learning

View original source

General agents have given rise to phenomenal applications such as OpenClaw and Claude Code. As these agent systems (a.k.a. Harnesses) strive for bolder goals, they demand increasi…

LLM

IceBreaker for Conversational Agents: Breaking the First-Message Barrier with Personalized Starters

View original source

Conversational agents, such as ChatGPT and Doubao, have become essential daily assistants for billions of users. To further enhance engagement, these systems are evolving from pas…

Vision

Weakly-Supervised Referring Video Object Segmentation through Text Supervision

View original source

Referring video object segmentation (RVOS) aims to segment the target instance in a video, referred by a text expression. Conventional approaches are mostly supervised learning, r…

Vision

AnchorSeg: Language Grounded Query Banks for Reasoning Segmentation

View original source

Reasoning segmentation requires models to ground complex, implicit textual queries into precise pixel-level masks. Existing approaches rely on a single segmentation token $\texttt…

Vision

Revisiting Change VQA in Remote Sensing with Structured and Native Multimodal Qwen Models

View original source

Change visual question answering (Change VQA) addresses the problem of answering natural-language questions about semantic changes between bi-temporal remote sensing (RS) images.…

Vision

OmniHuman: A Large-scale Dataset and Benchmark for Human-Centric Video Generation

View original source

Recent advancements in audio-video joint generation models have demonstrated impressive capabilities in content creation. However, generating high-fidelity human-centric videos in…

Vision

Denoise and Align: Diffusion-Driven Foreground Knowledge Prompting for Open-Vocabulary Temporal Action Detection

View original source

Open-Vocabulary Temporal Action Detection (OV-TAD) aims to localize and classify action segments of unseen categories in untrimmed videos, where effective alignment between action…

2026-04-20

2026-04-20 11:48:52 (Asia/Shanghai)

PubMed AI

Medic Training at Military-Civilian Partnerships-A Narrative Review.

View original source

INTRODUCTION: Military-Civilian Partnerships (MCP) were developed to mitigate degradation of combat medical readiness during peacetime. Although these programs have historically f…

2026-04-18

2026-04-18 11:26:55 (Asia/Shanghai)

PubMed AI

MILU: a consensus ensemble benchmark for multimodal medical imaging lecture understanding.

View original source

PURPOSE: Vision-language models (VLMs) are increasingly used to interpret multimodal educational materials, yet their reliability on diagram-, equation-, and text-dense scientific…

PubMed AI

An explainable multi-head attention network for healthcare IoT threat detection based on the MedDefender-MHAN framework.

View original source

The rapid proliferation of Internet of Medical Things (IoMT) devices in healthcare environments has created critical cybersecurity vulnerabilities that demand both accurate and in…

2026-04-17

2026-04-17 11:39:21 (Asia/Shanghai)

LLM

QuantCode-Bench: A Benchmark for Evaluating the Ability of Large Language Models to Generate Executable Algorithmic Trading Strategies

View original source

Large language models have demonstrated strong performance on general-purpose programming tasks, yet their ability to generate executable algorithmic trading strategies remains un…

LLM

AI-Assisted Requirements Engineering: An Empirical Evaluation Relative to Expert Judgment

View original source

Artificial Intelligence is increasingly introduced into systems engineering activities, particularly within requirements engineering, where quality assessment and validation remai…

LLM

Meituan Merchant Business Diagnosis via Policy-Guided Dual-Process User Simulation

View original source

Simulating group-level user behavior enables scalable counterfactual evaluation of merchant strategies without costly online experiments. However, building a trustworthy simulator…

Vision

RaTA-Tool: Retrieval-based Tool Selection with Multimodal Large Language Models

View original source

Tool learning with foundation models aims to endow AI systems with the ability to invoke external resources -- such as APIs, computational utilities, and specialized models -- to…

Vision

From Boundaries to Semantics: Prompt-Guided Multi-Task Learning for Petrographic Thin-section Segmentation

View original source

Grain-edge segmentation (GES) and lithology semantic segmentation (LSS) are two pivotal tasks for quantifying rock fabric and composition. However, these two tasks are often treat…

2026-04-16

2026-04-16 11:43:00 (Asia/Shanghai)

LLM

GeoAgentBench: A Dynamic Execution Benchmark for Tool-Augmented Agents in Spatial Analysis

View original source

The integration of Large Language Models (LLMs) into Geographic Information Systems (GIS) marks a paradigm shift toward autonomous spatial analysis. However, evaluating these LLM-…

LLM

Character Beyond Speech: Leveraging Role-Playing Evaluation in Audio Large Language Models via Reinforcement Learning

View original source

The rapid evolution of multimodal large models has revolutionized the simulation of diverse characters in speech dialogue systems, enabling a novel interactive paradigm. Character…

LLM

MUSE: Multi-Domain Chinese User Simulation via Self-Evolving Profiles and Rubric-Guided Alignment

View original source

User simulators are essential for the scalable training and evaluation of interactive AI systems. However, existing approaches often rely on shallow user profiling, struggle to ma…

LLM

MAny: Merge Anything for Multimodal Continual Instruction Tuning

View original source

Multimodal Continual Instruction Tuning (MCIT) is essential for sequential task adaptation of Multimodal Large Language Models (MLLMs) but is severely restricted by catastrophic f…

2026-04-15

2026-04-15 11:35:50 (Asia/Shanghai)

LLM

EvoSpark: Endogenous Interactive Agent Societies for Unified Long-Horizon Narrative Evolution

View original source

Realizing endogenous narrative evolution in LLM-based multi-agent systems is hindered by the inherent stochasticity of generative emergence. In particular, long-horizon simulation…

Vision

Challenging Vision-Language Models with Physically Deployable Multimodal Semantic Lighting Attacks

View original source

Vision-Language Models (VLMs) have shown remarkable performance, yet their security remains insufficiently understood. Existing adversarial studies focus almost exclusively on the…

PubMed AI

Multimodal large language models in brain tumor imaging: clinical applications and future perspectives.

View original source

The use of multimodal data is essential for the precise diagnosis and treatment of brain tumors. In this context, multimodal data encompass multisequence magnetic resonance imagin…

PubMed AI

Bridging the Modality Gap in Medical Vision-Language Models: A Hybrid Contrastive-Optimal Transport Framework for Enhanced Cross-Modal Alignment.

View original source

Vision-language models in healthcare face a critical limitation, i.e., the modality gap, where image and text embeddings occupy distantly separated regions in shared representatio…

2026-04-14

2026-04-14 11:37:06 (Asia/Shanghai)

LLM

RPA-Check: A Multi-Stage Automated Framework for Evaluating Dynamic LLM-based Role-Playing Agents

View original source

The rapid adoption of Large Language Models (LLMs) in interactive systems has enabled the creation of dynamic, open-ended Role-Playing Agents (RPAs). However, evaluating these age…

LLM

Detecting Safety Violations Across Many Agent Traces

View original source

To identify safety violations, auditors often search over large sets of agent traces. This search is difficult because failures are often rare, complex, and sometimes even adversa…

LLM

ClawGuard: A Runtime Security Framework for Tool-Augmented LLM Agents Against Indirect Prompt Injection

View original source

Tool-augmented Large Language Model (LLM) agents have demonstrated impressive capabilities in automating complex, multi-step real-world tasks, yet remain vulnerable to indirect pr…

Vision

Anthropogenic Regional Adaptation in Multimodal Vision-Language Model

View original source

While the field of vision-language (VL) has achieved remarkable success in integrating visual and textual information across multiple languages and domains, there is still no dedi…

Vision

Budget-Aware Uncertainty for Radiotherapy Segmentation QA Using nnU-Net

View original source

Accurate delineation of the Clinical Target Volume (CTV) is essential for radiotherapy planning, yet remains time-consuming and difficult to assess, especially for complex treatme…

Vision

HDR Video Generation via Latent Alignment with Logarithmic Encoding

View original source

High dynamic range (HDR) imagery offers a rich and faithful representation of scene radiance, but remains challenging for generative models due to its mismatch with the bounded, p…

Vision

Efficient KernelSHAP Explanations for Patch-based 3D Medical Image Segmentation

View original source

Perturbation-based explainability methods such as KernelSHAP provide model-agnostic attributions but are typically impractical for patch-based 3D medical image segmentation due to…

2026-04-08

2026-04-08 17:10:24 (Asia/Shanghai)

LLM

Topological Characterization of Churn Flow and Unsupervised Correction to the Wu Flow-Regime Map in Small-Diameter Vertical Pipes

View original source

Churn flow, the chaotic, oscillatory regime in vertical two-phase flow, has lacked a quantitative mathematical definition for over 40 years. We introduce the first topology-based…

LLM

MMEmb-R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control

View original source

MLLMs have been successfully applied to multimodal embedding tasks, yet their generative reasoning capabilities remain underutilized. Directly incorporating chain-of-thought reaso…

LLM

Toward Consistent World Models with Multi-Token Prediction and Latent Semantic Enhancement

View original source

Whether Large Language Models (LLMs) develop coherent internal world models remains a core debate. While conventional Next-Token Prediction (NTP) focuses on one-step-ahead supervi…

LLM

Who Governs the Machine? A Machine Identity Governance Taxonomy (MIGT) for AI Systems Operating Across Enterprise and Geopolitical Boundaries

View original source

The governance of artificial intelligence has a blind spot: the machine identities that AI systems use to act. AI agents, service accounts, API tokens, and automated workflows now…

LLM

Lightweight Multimodal Adaptation of Vision Language Models for Species Recognition and Habitat Context Interpretation in Drone Thermal Imagery

View original source

This study proposes a lightweight multimodal adaptation framework to bridge the representation gap between RGB-pretrained VLMs and thermal infrared imagery, and demonstrates its p…

Vision

MMEmb-R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control

View original source

MLLMs have been successfully applied to multimodal embedding tasks, yet their generative reasoning capabilities remain underutilized. Directly incorporating chain-of-thought reaso…

Vision

Lightweight Multimodal Adaptation of Vision Language Models for Species Recognition and Habitat Context Interpretation in Drone Thermal Imagery

View original source

This study proposes a lightweight multimodal adaptation framework to bridge the representation gap between RGB-pretrained VLMs and thermal infrared imagery, and demonstrates its p…

Vision

Scientific Graphics Program Synthesis via Dual Self-Consistency Reinforcement Learning

View original source

Graphics Program Synthesis is pivotal for interpreting and editing visual data, effectively facilitating the reverse-engineering of static visuals into editable TikZ code. While T…