TypeFormer: Multiscale Transformer With Type Controller for Remote Sensing Image Caption

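Keywords: Closed captioning; Computer science; Transformer; Sentence; Artificial intelligence; Computer vision; Image (mathematics); Engineering; Electrical engineering; Voltage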

Authors

Zihang Chen, Junjue Wang, Ailong Ma, Yanfei Zhong

Source

Journal: IEEE Geoscience and Remote Sensing Letters [Institute of Electrical and Electronics Engineers]
Date: 2022-01-01 Volume/issue: 19: 1-5 Citations: 24

Identifier

DOI: 10.1109/lgrs.2022.3192062

Abstract

Image captioning in remote sensing can help us understand the inner attributes of objects and the outer relations between different objects. However, existing image captioning algorithms lack the ability of global representation and cannot capture object relations over long distances. In addition, these algorithms generate captions randomly, without considering specific demands. To this end, we propose a pure transformer architecture with a caption type controller for remote sensing image captioning. Specifically, a multiscale vision transformer is adopted for image representation, where the global and detailed content can be captured with multi-head self-attention layers. A transformer decoder is then introduced to successively translate the image features into comprehensive sentences. An optional block called the caption type controller is designed to account for the types of captions through caption type matrix sets according to the demands, embedding the learnable sentence feature with the required type. Comparison and ablation experiments conducted on the Remote Sensing Image Captioning Dataset (RSICD) demonstrate that the proposed framework outperforms the current state-of-the-art image captioning methods. Experiments conducted on the FloodNet caption dataset further illustrate that the proposed method can effectively generate specific types of captions.
