自回归模型
星型
噪音(视频)
计算机科学
非线性自回归外生模型
计量经济学
数学
统计
自回归积分移动平均
时间序列
人工智能
图像(数学)
作者
Marco Pasini,Javier Nistal,Stefan Lattner,George Fazekas
出处
期刊:Cornell University - arXiv
日期:2024-11-27
标识
DOI:10.48550/arxiv.2411.18447
摘要
Autoregressive models are typically applied to sequences of discrete tokens, but recent research indicates that generating sequences of continuous embeddings in an autoregressive manner is also feasible. However, such Continuous Autoregressive Models (CAMs) can suffer from a decline in generation quality over extended sequences due to error accumulation during inference. We introduce a novel method to address this issue by injecting random noise into the input embeddings during training. This procedure makes the model robust against varying error levels at inference. We further reduce error accumulation through an inference procedure that introduces low-level noise. Experiments on musical audio generation show that CAM substantially outperforms existing autoregressive and non-autoregressive approaches while preserving audio quality over extended sequences. This work paves the way for generating continuous embeddings in a purely autoregressive setting, opening new possibilities for real-time and interactive generative applications.
科研通智能强力驱动
Strongly Powered by AbleSci AI