Keyword Tracking

关键词追踪：segmentation

这个页面会长期追踪你配置里关心的关键词，并把命中的论文按日期沉淀下来。

返回归档首页查看趋势总览最新 JSON 订阅 RSS

近期走势

最近一次命中来自 Vision：PanDA: Unsupervised Domain Adaptation for Multimodal 3D Panoptic Segmentation in Autonomous Driving

2026-04-09

2026-04-10

2026-04-11

2026-04-12

2026-04-13

2026-04-14

2026-04-15

2026-04-16

2026-04-17

2026-04-18

2026-04-19

2026-04-20

2026-04-21

2026-04-22

命中明细

按日期回看匹配到这个关键词的论文标题，并保留来源 feed 信息。

2026-04-22

2026-04-22 11:37:03 (Asia/Shanghai)

Vision

PanDA: Unsupervised Domain Adaptation for Multimodal 3D Panoptic Segmentation in Autonomous Driving

查看原始来源

This paper presents the first study on Unsupervised Domain Adaptation (UDA) for multimodal 3D panoptic segmentation (mm-3DPS), aiming to improve generalization under domain shifts…

Vision

MedFlowSeg: Flow Matching for Medical Image Segmentation with Frequency-Aware Attention

查看原始来源

Flow matching has recently emerged as a principled framework for learning continuous-time transport maps, enabling efficient deterministic generation without relying on stochastic…

Vision

RF-HiT: Rectified Flow Hierarchical Transformer for General Medical Image Segmentation

查看原始来源

Accurate medical image segmentation requires both long-range contextual reasoning and precise boundary delineation, a task where existing transformer- and diffusion-based paradigm…

2026-04-21

2026-04-21 11:40:46 (Asia/Shanghai)

Vision

DiffuSAM: Diffusion Guided Zero-Shot Object Grounding for Remote Sensing Imagery

查看原始来源

Diffusion models have emerged as powerful tools for a wide range of vision tasks, including text-guided image generation and editing. In this work, we explore their potential for…

Vision

Weakly-Supervised Referring Video Object Segmentation through Text Supervision

查看原始来源

Referring video object segmentation (RVOS) aims to segment the target instance in a video, referred by a text expression. Conventional approaches are mostly supervised learning, r…

Vision

AnchorSeg: Language Grounded Query Banks for Reasoning Segmentation

查看原始来源

Reasoning segmentation requires models to ground complex, implicit textual queries into precise pixel-level masks. Existing approaches rely on a single segmentation token $\texttt…

Vision

DSA-CycleGAN: A Domain Shift Aware CycleGAN for Robust Multi-Stain Glomeruli Segmentation

查看原始来源

A key challenge in segmentation in digital histopathology is inter- and intra-stain variations as it reduces model performance. Labelling each stain is expensive and time-consumin…

2026-04-17

2026-04-17 11:39:21 (Asia/Shanghai)

Vision

SegWithU: Uncertainty as Perturbation Energy for Single-Forward-Pass Risk-Aware Medical Image Segmentation

查看原始来源

Reliable uncertainty estimation is critical for medical image segmentation, where automated contours feed downstream quantification and clinical decision support. Many strong unce…

Vision

Unsupervised Skeleton-Based Action Segmentation via Hierarchical Spatiotemporal Vector Quantization

查看原始来源

We propose a novel hierarchical spatiotemporal vector quantization framework for unsupervised skeleton-based temporal action segmentation. We first introduce a hierarchical approa…

Vision

Boundary-Centric Active Learning for Temporal Action Segmentation

查看原始来源

Temporal action segmentation (TAS) demands dense temporal supervision, yet most of the annotation cost in untrimmed videos is spent identifying and refining action transitions, wh…

Vision

Efficient Search of Implantable Adaptive Cells for Medical Image Segmentation

查看原始来源

Purpose: Adaptive skip modules can improve medical image segmentation, but searching for them is computationally costly. Implantable Adaptive Cells (IACs) are compact NAS modules…

Vision

From Boundaries to Semantics: Prompt-Guided Multi-Task Learning for Petrographic Thin-section Segmentation

查看原始来源

Grain-edge segmentation (GES) and lithology semantic segmentation (LSS) are two pivotal tasks for quantifying rock fabric and composition. However, these two tasks are often treat…

PubMed AI

From Image to Pixels: towards Fine-Grained Medical Vision-Language Models.

查看原始来源

Multimodal large language models (MLLMs) offer immense potential for biomedical AI, yet current applications remain limited to coarse-grained image understanding and basic textual…

2026-04-16

2026-04-16 11:43:00 (Asia/Shanghai)

Vision

ROSE: Retrieval-Oriented Segmentation Enhancement

查看原始来源

Existing segmentation models based on multimodal large language models (MLLMs), such as LISA, often struggle with novel or emerging entities due to their inability to incorporate…

Vision

Decoding the Delta: Unifying Remote Sensing Change Detection and Understanding with Multimodal Large Language Models

查看原始来源

While Multimodal Large Language Models (MLLMs) excel in general vision-language tasks, their application to remote sensing change understanding is hindered by a fundamental "tempo…

Vision

PBE-UNet: A light weight Progressive Boundary-Enhanced U-Net with Scale-Aware Aggregation for Ultrasound Image Segmentation

查看原始来源

Accurate lesion segmentation in ultrasound images is essential for preventive screening and clinical diagnosis, yet remains challenging due to low contrast, blurry boundaries, and…

Vision

Design and Behavior of Sparse Mixture-of-Experts Layers in CNN-based Semantic Segmentation

查看原始来源

Sparse mixture-of-experts (MoE) layers have been shown to substantially increase model capacity without a proportional increase in computational cost and are widely used in transf…

2026-04-15

2026-04-15 11:35:50 (Asia/Shanghai)

Vision

RSGMamba: Reliability-Aware Self-Gated State Space Model for Multimodal Semantic Segmentation

查看原始来源

Multimodal semantic segmentation has emerged as a powerful paradigm for enhancing scene understanding by leveraging complementary information from multiple sensing modalities (e.g…

Vision

All in One: A Unified Synthetic Data Pipeline for Multimodal Video Understanding

查看原始来源

Training multimodal large language models (MLLMs) for video understanding requires large-scale annotated data spanning diverse tasks such as object counting, question answering, a…

Vision

Radar-Camera BEV Multi-Task Learning with Cross-Task Attention Bridge for Joint 3D Detection and Segmentation

查看原始来源

Bird's-eye-view (BEV) representations are the dominant paradigm for 3D perception in autonomous driving, providing a unified spatial canvas where detection and segmentation featur…

Vision

Detecting and refurbishing ground truth errors during training of deep learning-based echocardiography segmentation models

查看原始来源

Deep learning-based medical image segmentation typically relies on ground truth (GT) labels obtained through manual annotation, but these can be prone to random errors or systemat…

2026-04-14

2026-04-14 11:37:06 (Asia/Shanghai)

Vision

LMMs Meet Object-Centric Vision: Understanding, Segmentation, Editing and Generation

查看原始来源

Large Multimodal Models (LMMs) have achieved remarkable progress in general-purpose vision--language understanding, yet they remain limited in tasks requiring precise object-level…

Vision

GeomPrompt: Geometric Prompt Learning for RGB-D Semantic Segmentation Under Missing and Degraded Depth

查看原始来源

Multimodal perception systems for robotics and embodied AI often assume reliable RGB-D sensing, but in practice, depth is frequently missing, noisy, or corrupted. We thus present…

Vision

Budget-Aware Uncertainty for Radiotherapy Segmentation QA Using nnU-Net

查看原始来源

Accurate delineation of the Clinical Target Volume (CTV) is essential for radiotherapy planning, yet remains time-consuming and difficult to assess, especially for complex treatme…

Vision

Efficient KernelSHAP Explanations for Patch-based 3D Medical Image Segmentation

查看原始来源

Perturbation-based explainability methods such as KernelSHAP provide model-agnostic attributions but are typically impractical for patch-based 3D medical image segmentation due to…

Vision

Seeing Through the Tool: A Controlled Benchmark for Occlusion Robustness in Foundation Segmentation Models

查看原始来源

Occlusion, where target structures are partially hidden by surgical instruments or overlapping tissues, remains a critical yet underexplored challenge for foundation segmentation…

PubMed AI

Text4Seg++: Advancing Image Segmentation via Generative Language Modeling.

查看原始来源

Multimodal Large Language Models (MLLMs) have shown exceptional capabilities in vision-language tasks. However, effectively integrating image segmentation into these models remain…

2026-04-08

2026-04-08 17:10:24 (Asia/Shanghai)

Vision

Multi-Modal Landslide Detection from Sentinel-1 SAR and Sentinel-2 Optical Imagery Using Multi-Encoder Vision Transformers and Ensemble Learning

查看原始来源

Landslides represent a major geohazard with severe impacts on human life, infrastructure, and ecosystems, underscoring the need for accurate and timely detection approaches to sup…