Feed Subscription

Vision 固定订阅页

适合长期跟踪单个研究方向。页面会汇总这个 feed 的最近 7 天 / 30 天表现,并保留每天命中的原始条目和 digest 链接。

最近 7 天

39

篇论文

4 个活跃 digest

最近 30 天

68

篇论文

7 个活跃 digest

全部历史

68

篇论文

7 个活跃 digest

近期走势

《PanDA: Unsupervised Domain Adaptation for Multimodal 3D Panoptic Segmentation in Autonomous Driving》〔评测 / 方法〕:This paper presents the first study on Unsupervised Domain Adaptation (UDA) for multimodal 3D panoptic segme…

2026-04-09
0
2026-04-10
0
2026-04-11
0
2026-04-12
0
2026-04-13
0
2026-04-14
10
2026-04-15
9
2026-04-16
10
2026-04-17
9
2026-04-18
0
2026-04-19
0
2026-04-20
0
2026-04-21
10
2026-04-22
10

相关关键词页

如果这个 feed 同时命中了你配置里的关键词,这里会给出长期追踪入口。

历史命中

按天回看这个 feed 的命中文献,并保留当日 digest 的 Markdown / JSON 原始产物。

2026-04-22

命中 10 篇生成于 2026-04-22 11:37:03 (Asia/Shanghai)
Vision10 篇

《PanDA: Unsupervised Domain Adaptation for Multimodal 3D Panoptic Segmentation in Autonomous Driving》〔评测 / 方法〕:This paper presents the first study on Unsupervised Domain Adaptation (UDA) for multimodal 3D panoptic segme…

  1. PanDA: Unsupervised Domain Adaptation for Multimodal 3D Panoptic Segmentation in Autonomous Driving · Score 106
    title matched "multimodal";title matched "segmentation";has PDF
  2. Diff-SBSR: Learning Multimodal Feature-Enhanced Diffusion Models for Zero-Shot Sketch-Based 3D Shape Retrieval · Score 100
    title matched "diffusion";title matched "multimodal";has PDF
  3. ReImagine: Rethinking Controllable High-Quality Human Video Generation via Image-First Synthesis · Score 90
    title matched "video generation";summary matched "diffusion";has PDF
  4. MMControl: Unified Multi-Modal Control for Joint Audio-Video Generation · Score 89
    title matched "video generation";summary matched "diffusion";has PDF
  5. MedFlowSeg: Flow Matching for Medical Image Segmentation with Frequency-Aware Attention · Score 89
    title matched "segmentation";summary matched "diffusion";has PDF

2026-04-21

命中 10 篇生成于 2026-04-21 11:40:46 (Asia/Shanghai)
Vision10 篇

《AdaCluster: Adaptive Query-Key Clustering for Sparse Attention in Video Generation》〔应用 / 方法〕:Video diffusion transformers (DiTs) suffer from prohibitive inference latency due to quadratic attention complexity. Existing…

  1. AdaCluster: Adaptive Query-Key Clustering for Sparse Attention in Video Generation · Score 87
    title matched "video generation";summary matched "diffusion";has PDF
  2. DiffuSAM: Diffusion Guided Zero-Shot Object Grounding for Remote Sensing Imagery · Score 85
    title matched "diffusion";summary matched "segmentation";has PDF
  3. Weakly-Supervised Referring Video Object Segmentation through Text Supervision · Score 76
    title matched "segmentation";summary matched "multimodal";has PDF
  4. AnchorSeg: Language Grounded Query Banks for Reasoning Segmentation · Score 72
    title matched "segmentation";has PDF;has rich summary
  5. UDM-GRPO: Stable and Efficient Group Relative Policy Optimization for Uniform Discrete Diffusion Models · Score 71
    title matched "diffusion";has PDF;has rich summary

2026-04-17

命中 9 篇生成于 2026-04-17 11:39:21 (Asia/Shanghai)
Vision9 篇

《SegWithU: Uncertainty as Perturbation Energy for Single-Forward-Pass Risk-Aware Medical Image Segmentation》〔应用 / 方法〕:Reliable uncertainty estimation is critical for medical image segmentation, where automated contours…

  1. SegWithU: Uncertainty as Perturbation Energy for Single-Forward-Pass Risk-Aware Medical Image Segmentation · Score 72
    title matched "segmentation";has PDF;has rich summary
  2. Unsupervised Skeleton-Based Action Segmentation via Hierarchical Spatiotemporal Vector Quantization · Score 70
    title matched "segmentation";has PDF;has rich summary
  3. Boundary-Centric Active Learning for Temporal Action Segmentation · Score 70
    title matched "segmentation";has PDF;has rich summary
  4. An Analysis of Regularization and Fokker-Planck Residuals in Diffusion Models for Image Generation · Score 70
    title matched "diffusion";has PDF;has rich summary
  5. RAD-2: Scaling Reinforcement Learning in a Generator-Discriminator Framework · Score 68
    summary matched "diffusion";summary matched "multimodal";has PDF

2026-04-16

命中 10 篇生成于 2026-04-16 11:43:00 (Asia/Shanghai)
Vision10 篇

《ROSE: Retrieval-Oriented Segmentation Enhancement》〔评测 / 方法〕:Existing segmentation models based on multimodal large language models (MLLMs), such as LISA, often struggle with novel or emerging entities due to their inab…

  1. ROSE: Retrieval-Oriented Segmentation Enhancement · Score 90
    title matched "segmentation";summary matched "multimodal";has PDF
  2. Decoding the Delta: Unifying Remote Sensing Change Detection and Understanding with Multimodal Large Language Models · Score 88
    title matched "multimodal";summary matched "segmentation";has PDF
  3. Free Lunch for Unified Multimodal Models: Enhancing Generation via Reflective Rectification with Inherent Understanding · Score 78
    title matched "multimodal";summary matched "diffusion";has PDF
  4. DiT as Real-Time Rerenderer: Streaming Video Stylization with Autoregressive Diffusion Transformer · Score 78
    title matched "diffusion";summary matched "video generation";has PDF
  5. Seedance 2.0: Advancing Video Generation for World Complexity · Score 72
    title matched "video generation";has PDF;has rich summary

2026-04-15

命中 9 篇生成于 2026-04-15 11:35:50 (Asia/Shanghai)
Vision9 篇

《RSGMamba: Reliability-Aware Self-Gated State Space Model for Multimodal Semantic Segmentation》〔评测 / 方法〕:Multimodal semantic segmentation has emerged as a powerful paradigm for enhancing scene understanding by leveragin…

  1. RSGMamba: Reliability-Aware Self-Gated State Space Model for Multimodal Semantic Segmentation · Score 100
    title matched "multimodal";title matched "segmentation";has PDF
  2. All in One: A Unified Synthetic Data Pipeline for Multimodal Video Understanding · Score 78
    title matched "multimodal";summary matched "segmentation";has PDF
  3. Probabilistic Feature Imputation and Uncertainty-Aware Multimodal Federated Aggregation · Score 71
    title matched "multimodal";has PDF;has rich summary
  4. AbdomenGen: Sequential Volume-Conditioned Diffusion Framework for Abdominal Anatomy Generation · Score 71
    title matched "diffusion";has PDF;has rich summary
  5. Radar-Camera BEV Multi-Task Learning with Cross-Task Attention Bridge for Joint 3D Detection and Segmentation · Score 70
    title matched "segmentation";has PDF;has rich summary

2026-04-14

命中 10 篇生成于 2026-04-14 11:37:06 (Asia/Shanghai)
Vision10 篇

《OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation》〔评测 / 数据 / 应用 / 方法〕:In this work, we study Human-Object Interaction Video Generation (HOIVG), which aims to synthesize high-quality…

  1. OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation · Score 112
    title matched "video generation";title matched "multimodal";has PDF
  2. LMMs Meet Object-Centric Vision: Understanding, Segmentation, Editing and Generation · Score 90
    title matched "segmentation";summary matched "multimodal";has PDF
  3. GeomPrompt: Geometric Prompt Learning for RGB-D Semantic Segmentation Under Missing and Degraded Depth · Score 87
    title matched "segmentation";summary matched "multimodal";has PDF
  4. Anthropogenic Regional Adaptation in Multimodal Vision-Language Model · Score 86
    title matched "multimodal";summary matched "diffusion";has PDF
  5. GazeVaLM: A Multi-Observer Eye-Tracking Benchmark for Evaluating Clinical Realism in AI-Generated X-Rays · Score 78
    summary matched "diffusion";summary matched "multimodal";has DOI

2026-04-08

命中 10 篇生成于 2026-04-08 17:10:24 (Asia/Shanghai)