无损压缩
数据压缩
计算机科学
有损压缩
算法
压缩比
系列(地层学)
压缩(物理)
点(几何)
数据压缩比
时间序列
数据挖掘
图像压缩
人工智能
数学
机器学习
材料科学
汽车工程
内燃机
几何学
复合材料
古生物学
工程类
图像(数学)
生物
图像处理
作者
Haoyuan Chen,Liang Liu,Jingwen Meng,Wanying Lu
标识
DOI:10.1016/j.ins.2023.119847
摘要
The time series database is a specialized type of database specifically designed for storing and analyzing time series data. Compression of time series data is crucial for its performance. However, efficiently compressing time series data, particularly floating-point data, remains challenging. Existing compression algorithms are efficient for only a limited range of data patterns, indicating a lack of self-adaptation. In this paper, we propose an effective and Adaptive lossless Floating-point Compression algorithm AFC for time series databases. We devise four unique compression strategies, and based on the data patterns, AFC dynamically selects the appropriate strategy. These strategies handle data compression for diverse data patterns, enhancing the compression ratio and efficiency. The most suitable strategy is employed to achieve an optimal compression ratio. We compared our AFC algorithm with four state-of-the-art compression algorithms, namely Gorilla, FPC, TSXor, and Chimp, as well as various general-purpose compression algorithms such as LZ4 and Snappy. Experimental results demonstrate that our algorithm achieves an improvement of at least 20% in compression ratio and even up to 100% on certain datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI