计算机科学
杠杆(统计)
单变量
机器学习
人工智能
深度学习
可扩展性
水准点(测量)
特征工程
感知器
时间序列
建筑
系列(地层学)
数据挖掘
人工神经网络
多元统计
艺术
古生物学
大地测量学
数据库
视觉艺术
生物
地理
作者
Sian Chen,Chun‐Liang Li,Nathanael C. Yoder,Sercan Ö. Arık,Tomas Pfister
出处
期刊:Cornell University - arXiv
日期:2023-01-01
被引量:29
标识
DOI:10.48550/arxiv.2303.06053
摘要
Real-world time-series datasets are often multivariate with complex dynamics. To capture this complexity, high capacity architectures like recurrent- or attention-based sequential deep learning models have become popular. However, recent work demonstrates that simple univariate linear models can outperform such deep learning models on several commonly used academic benchmarks. Extending them, in this paper, we investigate the capabilities of linear models for time-series forecasting and present Time-Series Mixer (TSMixer), a novel architecture designed by stacking multi-layer perceptrons (MLPs). TSMixer is based on mixing operations along both the time and feature dimensions to extract information efficiently. On popular academic benchmarks, the simple-to-implement TSMixer is comparable to specialized state-of-the-art models that leverage the inductive biases of specific benchmarks. On the challenging and large scale M5 benchmark, a real-world retail dataset, TSMixer demonstrates superior performance compared to the state-of-the-art alternatives. Our results underline the importance of efficiently utilizing cross-variate and auxiliary information for improving the performance of time series forecasting. We present various analyses to shed light into the capabilities of TSMixer. The design paradigms utilized in TSMixer are expected to open new horizons for deep learning-based time series forecasting. The implementation is available at https://github.com/google-research/google-research/tree/master/tsmixer
科研通智能强力驱动
Strongly Powered by AbleSci AI