
One paper has been accepted by IEEE TNNLS.

One paper entitled “Unbiased Semantic Decoding with Vision Foundation Models for Few-shot Segmentation” has been accepted by IEEE Transactions on Neural Networks and Learning Systems (TNNLS).

 

Title: Unbiased Semantic Decoding with Vision Foundation Models for Few-shot Segmentation

Authors: Jin Wang, Bingfeng Zhang*, Jian Pang, Weifeng Liu*, Baodi Liu, and Honglong Chen

Few-shot segmentation has garnered significant attention. Many recent approaches attempt to introduce the Segment Anything Model (SAM) to handle this task. With the strong generalization ability and rich object-specific extraction ability of SAM, such a solution shows great potential in few-shot segmentation. However, the decoding process of SAM relies heavily on accurate and explicit prompts, so previous approaches mainly focus on extracting prompts from the support set. This is insufficient to activate the generalization ability of SAM, and the design easily results in a biased decoding process when adapting to unknown classes. In this work, we propose an Unbiased Semantic Decoding (USD) strategy integrated with SAM, which extracts target information from both the support and query sets simultaneously to perform consistent predictions guided by the semantics of the Contrastive Language-Image Pre-training (CLIP) model. Specifically, to enhance the unbiased semantic discrimination of SAM, we design two feature enhancement strategies that leverage the semantic alignment capability of CLIP to enrich the original SAM features: a global supplement at the image level, which provides a generalized category indication from the support image, and a local guidance at the pixel level, which provides a useful target location from the query image. In addition, to generate target-focused prompt embeddings, a learnable visual-text target prompt generator is proposed, which lets target text embeddings interact with CLIP visual features. Without requiring re-training of the vision foundation models, the semantically discriminative features draw attention to the target region under the guidance of prompts with rich target information. Experiments on both PASCAL-5i and COCO-20i show that our proposed method outperforms existing approaches by a clear margin and achieves new state-of-the-art performance. The code is available at https://github.com/vangjin/USD.
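
As a rough illustration of the two feature enhancement ideas named in the abstract, the NumPy sketch below uses plain arrays in place of real SAM and CLIP features. The function names (`global_supplement`, `local_guidance`), the tensor shapes, and the additive and multiplicative fusion rules are all assumptions made for illustration; they are not taken from the paper or from the released code, which may fuse the features quite differently.

```python
import numpy as np

def l2_normalize(x, axis=-1, eps=1e-8):
    """Scale vectors along `axis` to unit length."""
    return x / (np.linalg.norm(x, axis=axis, keepdims=True) + eps)

def global_supplement(sam_feat, clip_image_emb):
    """Hypothetical image-level enhancement.

    Broadcasts one image-level semantic vector onto every spatial
    location of a feature map (assumed additive fusion).
    sam_feat: (H, W, C), clip_image_emb: (C,)
    """
    return sam_feat + clip_image_emb[None, None, :]

def local_guidance(sam_feat, clip_pixel_feat, text_emb):
    """Hypothetical pixel-level enhancement.

    Builds a per-pixel prior from the cosine similarity between dense
    visual features and a class text embedding, then uses it to
    reweight the feature map (assumed multiplicative fusion).
    sam_feat: (H, W, C), clip_pixel_feat: (H, W, D), text_emb: (D,)
    """
    sim = l2_normalize(clip_pixel_feat) @ l2_normalize(text_emb)  # (H, W)
    # Rescale similarities to [0, 1] so they act as a soft location prior.
    prior = (sim - sim.min()) / (sim.max() - sim.min() + 1e-8)
    return sam_feat * (1.0 + prior[..., None]), prior

# Random stand-ins for real model outputs.
rng = np.random.default_rng(0)
support_feat = rng.normal(size=(4, 4, 8))
query_feat = rng.normal(size=(4, 4, 8))
support_enhanced = global_supplement(support_feat, rng.normal(size=8))
query_enhanced, prior = local_guidance(
    query_feat, rng.normal(size=(4, 4, 16)), rng.normal(size=16)
)
```

In this sketch the support features receive the image-level vector and the query features receive the pixel-level prior, mirroring the support/query split described above; both enhanced maps keep the shape of the original feature map, so they could be passed to a frozen decoder unchanged.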

 

