PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition

计算机科学
作者
Chenhongyi Yang,Zehui Chen,Miguel Espinosa,Linus Ericsson,Zhenyu Wang,Jiaming Liu,Elliot J. Crowley
出处
期刊:Cornell University - arXiv
标识
DOI:10.48550/arxiv.2403.17695
摘要

We present PlainMamba: a simple non-hierarchical state space model (SSM) designed for general visual recognition. The recent Mamba model has shown how SSMs can be highly competitive with other architectures on sequential data and initial attempts have been made to apply it to images. In this paper, we further adapt the selective scanning process of Mamba to the visual domain, enhancing its ability to learn features from two-dimensional images by (i) a continuous 2D scanning process that improves spatial continuity by ensuring adjacency of tokens in the scanning sequence, and (ii) direction-aware updating which enables the model to discern the spatial relations of tokens by encoding directional information. Our architecture is designed to be easy to use and easy to scale, formed by stacking identical PlainMamba blocks, resulting in a model with constant width throughout all layers. The architecture is further simplified by removing the need for special tokens. We evaluate PlainMamba on a variety of visual recognition tasks including image classification, semantic segmentation, object detection, and instance segmentation. Our method achieves performance gains over previous non-hierarchical models and is competitive with hierarchical alternatives. For tasks requiring high-resolution inputs, in particular, PlainMamba requires much less computing while maintaining high performance. Code and models are available at https://github.com/ChenhongyiYang/PlainMamba

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
1秒前
1秒前
jane完成签到,获得积分10
2秒前
2秒前
ava425完成签到,获得积分10
2秒前
量子星尘发布了新的文献求助10
2秒前
负责吃饭完成签到,获得积分10
2秒前
weiminghao完成签到,获得积分10
3秒前
牧长一完成签到 ,获得积分0
3秒前
殊量完成签到,获得积分10
4秒前
今后应助tangpc采纳,获得10
4秒前
4秒前
pcr163应助Cindy采纳,获得50
4秒前
大大大反派完成签到 ,获得积分10
6秒前
静一发布了新的文献求助10
6秒前
TKTK完成签到,获得积分20
6秒前
打打应助张涛采纳,获得10
6秒前
7秒前
打打应助ava425采纳,获得10
7秒前
葵小葵完成签到,获得积分10
7秒前
执着的日记本完成签到 ,获得积分10
7秒前
饭饭完成签到,获得积分10
8秒前
wyx完成签到 ,获得积分10
9秒前
我是老大应助小绵羊采纳,获得10
9秒前
量子星尘发布了新的文献求助10
9秒前
9秒前
英俊的铭应助大白采纳,获得10
10秒前
Neil完成签到,获得积分10
10秒前
慕青应助subtle5114采纳,获得10
10秒前
丘比特应助kaka采纳,获得10
11秒前
马里奥好难完成签到 ,获得积分10
11秒前
TKTK发布了新的文献求助20
11秒前
科研通AI5应助薄饼哥丶采纳,获得10
11秒前
xiaocaiya完成签到,获得积分20
12秒前
12秒前
风中沛珊完成签到 ,获得积分10
12秒前
科研人完成签到,获得积分10
12秒前
乐桉蓝完成签到,获得积分10
13秒前
顾矜应助qiyian采纳,获得30
13秒前
高分求助中
Production Logging: Theoretical and Interpretive Elements 2700
Neuromuscular and Electrodiagnostic Medicine Board Review 1000
Statistical Methods for the Social Sciences, Global Edition, 6th edition 600
こんなに痛いのにどうして「なんでもない」と医者にいわれてしまうのでしょうか 510
The Insulin Resistance Epidemic: Uncovering the Root Cause of Chronic Disease  500
Walter Gilbert: Selected Works 500
An Annotated Checklist of Dinosaur Species by Continent 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3662898
求助须知:如何正确求助?哪些是违规求助? 3223698
关于积分的说明 9752620
捐赠科研通 2933587
什么是DOI,文献DOI怎么找? 1606194
邀请新用户注册赠送积分活动 758307
科研通“疑难数据库(出版商)”最低求助积分说明 734775