Feed Subscription

Vision 固定订阅页

适合长期跟踪单个研究方向。页面会汇总这个 feed 的最近 7 天 / 30 天表现，并保留每天命中的原始条目和 digest 链接。

返回归档首页查看趋势总览最新 Markdown 订阅 RSS

近期走势

Vision 今日没有新的命中文献。

2026-06-15

2026-06-16

2026-06-17

2026-06-18

2026-06-19

2026-06-20

2026-06-21

2026-06-22

2026-06-23

2026-06-24

2026-06-25

2026-06-26

2026-06-27

2026-06-28

历史命中

按天回看这个 feed 的命中文献，并保留当日 digest 的 Markdown / JSON 原始产物。

2026-04-24

命中 10 篇生成于 2026-04-24 11:46:20 (Asia/Shanghai)

Markdown JSON

Vision10 篇

《Pre-process for segmentation task with nonlinear diffusion filters》〔方法〕：This paper deals with the case of using nonlinear diffusion filters to obtain piecewise constant images as a previous process for segmentation tec…

Pre-process for segmentation task with nonlinear diffusion filters · Score 102
title matched "diffusion"；title matched "segmentation"；has PDF
原始来源
KD-CVG: A Knowledge-Driven Approach for Creative Video Generation · Score 79
title matched "video generation"；summary matched "multimodal"；has PDF
原始来源
Exploring the Role of Synthetic Data Augmentation in Controllable Human-Centric Video Generation · Score 77
title matched "video generation"；summary matched "diffusion"；has PDF
原始来源
Seeing Fast and Slow: Learning the Flow of Time in Videos · Score 68
summary matched "video generation"；summary matched "multimodal"；has PDF
原始来源
DCMorph: Face Morphing via Dual-Stream Cross-Attention Diffusion · Score 66
title matched "diffusion"；has PDF；has rich summary
原始来源

2026-04-23

命中 9 篇生成于 2026-04-23 11:42:13 (Asia/Shanghai)

Markdown JSON

Vision9 篇

《LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model》〔方法〕：We present LLaDA2.0-Uni, a unified discrete diffusion large language model (dLLM) that supports multimodal underst…

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model · Score 111
title matched "diffusion"；title matched "multimodal"；has PDF
原始来源
Hallucination Early Detection in Diffusion Models · Score 75
title matched "diffusion"；has DOI；has PDF
原始来源
ParetoSlider: Diffusion Models Post-Training for Continuous Reward Control · Score 72
title matched "diffusion"；has PDF；has rich summary
原始来源
Amodal SAM: A Unified Amodal Segmentation Framework with Generalization · Score 70
title matched "segmentation"；has PDF；has rich summary
原始来源
GeoRelight: Learning Joint Geometrical Relighting and Reconstruction with Flexible Multi-Modal Diffusion Transformers · Score 70
title matched "diffusion"；has PDF；has rich summary
原始来源

2026-04-22

命中 10 篇生成于 2026-04-22 11:37:03 (Asia/Shanghai)

Markdown JSON

Vision10 篇

《PanDA: Unsupervised Domain Adaptation for Multimodal 3D Panoptic Segmentation in Autonomous Driving》〔评测 / 方法〕：This paper presents the first study on Unsupervised Domain Adaptation (UDA) for multimodal 3D panoptic segme…

PanDA: Unsupervised Domain Adaptation for Multimodal 3D Panoptic Segmentation in Autonomous Driving · Score 106
title matched "multimodal"；title matched "segmentation"；has PDF
原始来源
Diff-SBSR: Learning Multimodal Feature-Enhanced Diffusion Models for Zero-Shot Sketch-Based 3D Shape Retrieval · Score 100
title matched "diffusion"；title matched "multimodal"；has PDF
原始来源
ReImagine: Rethinking Controllable High-Quality Human Video Generation via Image-First Synthesis · Score 90
title matched "video generation"；summary matched "diffusion"；has PDF
原始来源
MMControl: Unified Multi-Modal Control for Joint Audio-Video Generation · Score 89
title matched "video generation"；summary matched "diffusion"；has PDF
原始来源
MedFlowSeg: Flow Matching for Medical Image Segmentation with Frequency-Aware Attention · Score 89
title matched "segmentation"；summary matched "diffusion"；has PDF
原始来源

2026-04-21

命中 10 篇生成于 2026-04-21 11:40:46 (Asia/Shanghai)

Markdown JSON

Vision10 篇

《AdaCluster: Adaptive Query-Key Clustering for Sparse Attention in Video Generation》〔应用 / 方法〕：Video diffusion transformers (DiTs) suffer from prohibitive inference latency due to quadratic attention complexity. Existing…

AdaCluster: Adaptive Query-Key Clustering for Sparse Attention in Video Generation · Score 87
title matched "video generation"；summary matched "diffusion"；has PDF
原始来源
DiffuSAM: Diffusion Guided Zero-Shot Object Grounding for Remote Sensing Imagery · Score 85
title matched "diffusion"；summary matched "segmentation"；has PDF
原始来源
Weakly-Supervised Referring Video Object Segmentation through Text Supervision · Score 76
title matched "segmentation"；summary matched "multimodal"；has PDF
原始来源
AnchorSeg: Language Grounded Query Banks for Reasoning Segmentation · Score 72
title matched "segmentation"；has PDF；has rich summary
原始来源
UDM-GRPO: Stable and Efficient Group Relative Policy Optimization for Uniform Discrete Diffusion Models · Score 71
title matched "diffusion"；has PDF；has rich summary
原始来源

2026-04-17

命中 9 篇生成于 2026-04-17 11:39:21 (Asia/Shanghai)

Markdown JSON

Vision9 篇

《SegWithU: Uncertainty as Perturbation Energy for Single-Forward-Pass Risk-Aware Medical Image Segmentation》〔应用 / 方法〕：Reliable uncertainty estimation is critical for medical image segmentation, where automated contours…

SegWithU: Uncertainty as Perturbation Energy for Single-Forward-Pass Risk-Aware Medical Image Segmentation · Score 72
title matched "segmentation"；has PDF；has rich summary
原始来源
Unsupervised Skeleton-Based Action Segmentation via Hierarchical Spatiotemporal Vector Quantization · Score 70
title matched "segmentation"；has PDF；has rich summary
原始来源
Boundary-Centric Active Learning for Temporal Action Segmentation · Score 70
title matched "segmentation"；has PDF；has rich summary
原始来源
An Analysis of Regularization and Fokker-Planck Residuals in Diffusion Models for Image Generation · Score 70
title matched "diffusion"；has PDF；has rich summary
原始来源
RAD-2: Scaling Reinforcement Learning in a Generator-Discriminator Framework · Score 68
summary matched "diffusion"；summary matched "multimodal"；has PDF
原始来源

2026-04-16

命中 10 篇生成于 2026-04-16 11:43:00 (Asia/Shanghai)

Markdown JSON

Vision10 篇

《ROSE: Retrieval-Oriented Segmentation Enhancement》〔评测 / 方法〕：Existing segmentation models based on multimodal large language models (MLLMs), such as LISA, often struggle with novel or emerging entities due to their inab…

ROSE: Retrieval-Oriented Segmentation Enhancement · Score 90
title matched "segmentation"；summary matched "multimodal"；has PDF
原始来源
Decoding the Delta: Unifying Remote Sensing Change Detection and Understanding with Multimodal Large Language Models · Score 88
title matched "multimodal"；summary matched "segmentation"；has PDF
原始来源
Free Lunch for Unified Multimodal Models: Enhancing Generation via Reflective Rectification with Inherent Understanding · Score 78
title matched "multimodal"；summary matched "diffusion"；has PDF
原始来源
DiT as Real-Time Rerenderer: Streaming Video Stylization with Autoregressive Diffusion Transformer · Score 78
title matched "diffusion"；summary matched "video generation"；has PDF
原始来源
Seedance 2.0: Advancing Video Generation for World Complexity · Score 72
title matched "video generation"；has PDF；has rich summary
原始来源

2026-04-15

命中 9 篇生成于 2026-04-15 11:35:50 (Asia/Shanghai)

Markdown JSON

Vision9 篇

《RSGMamba: Reliability-Aware Self-Gated State Space Model for Multimodal Semantic Segmentation》〔评测 / 方法〕：Multimodal semantic segmentation has emerged as a powerful paradigm for enhancing scene understanding by leveragin…

RSGMamba: Reliability-Aware Self-Gated State Space Model for Multimodal Semantic Segmentation · Score 100
title matched "multimodal"；title matched "segmentation"；has PDF
原始来源
All in One: A Unified Synthetic Data Pipeline for Multimodal Video Understanding · Score 78
title matched "multimodal"；summary matched "segmentation"；has PDF
原始来源
Probabilistic Feature Imputation and Uncertainty-Aware Multimodal Federated Aggregation · Score 71
title matched "multimodal"；has PDF；has rich summary
原始来源
AbdomenGen: Sequential Volume-Conditioned Diffusion Framework for Abdominal Anatomy Generation · Score 71
title matched "diffusion"；has PDF；has rich summary
原始来源
Radar-Camera BEV Multi-Task Learning with Cross-Task Attention Bridge for Joint 3D Detection and Segmentation · Score 70
title matched "segmentation"；has PDF；has rich summary
原始来源

2026-04-14

命中 10 篇生成于 2026-04-14 11:37:06 (Asia/Shanghai)

Markdown JSON

Vision10 篇

《OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation》〔评测 / 数据 / 应用 / 方法〕：In this work, we study Human-Object Interaction Video Generation (HOIVG), which aims to synthesize high-quality…

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation · Score 112
title matched "video generation"；title matched "multimodal"；has PDF
原始来源
LMMs Meet Object-Centric Vision: Understanding, Segmentation, Editing and Generation · Score 90
title matched "segmentation"；summary matched "multimodal"；has PDF
原始来源
GeomPrompt: Geometric Prompt Learning for RGB-D Semantic Segmentation Under Missing and Degraded Depth · Score 87
title matched "segmentation"；summary matched "multimodal"；has PDF
原始来源
Anthropogenic Regional Adaptation in Multimodal Vision-Language Model · Score 86
title matched "multimodal"；summary matched "diffusion"；has PDF
原始来源
GazeVaLM: A Multi-Observer Eye-Tracking Benchmark for Evaluating Clinical Realism in AI-Generated X-Rays · Score 78
summary matched "diffusion"；summary matched "multimodal"；has DOI
原始来源

2026-04-08

命中 10 篇生成于 2026-04-08 17:10:24 (Asia/Shanghai)

Markdown JSON

Vision10 篇

收录 10 篇，重点包括《Action Images: End-to-End Policy Learning via Multiview Video Generation》、《DiffHDR: Re-Exposing LDR Videos with Video Diffusion Models》。

Vision 固定订阅页

近期走势

相关关键词页

历史命中