Efficient Time Series Processing for Transformers and State-Space Models through Token Merging

Götz, Leon; Kollovieh, Marcel; Günnemann, Stephan; Schwinn, Leo

Computer Science > Machine Learning

arXiv:2405.17951 (cs)

[Submitted on 28 May 2024]

Title:Efficient Time Series Processing for Transformers and State-Space Models through Token Merging

Authors:Leon Götz, Marcel Kollovieh, Stephan Günnemann, Leo Schwinn

View PDF HTML (experimental)

Abstract:Transformer architectures have shown promising results in time series processing. However, despite recent advances in subquadratic attention mechanisms or state-space models, processing very long sequences still imposes significant computational requirements. Token merging, which involves replacing multiple tokens with a single one calculated as their linear combination, has shown to considerably improve the throughput of vision transformer architectures while maintaining accuracy. In this work, we go beyond computer vision and perform the first investigations of token merging in time series analysis on both time series transformers and state-space models. To effectively scale token merging to long sequences, we introduce local merging, a domain-specific token merging algorithm that selectively combines tokens within a local neighborhood, adjusting the computational complexity from linear to quadratic based on the neighborhood size. Our comprehensive empirical evaluation demonstrates that token merging offers substantial computational benefits with minimal impact on accuracy across various models and datasets. On the recently proposed Chronos foundation model, we achieve accelerations up to 5400% with only minor accuracy degradations.

Comments:	19 pages in total, 14 figures
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2405.17951 [cs.LG]
	(or arXiv:2405.17951v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.17951

Submission history

From: Leon Götz [view email]
[v1] Tue, 28 May 2024 08:28:18 UTC (196 KB)

Computer Science > Machine Learning

Title:Efficient Time Series Processing for Transformers and State-Space Models through Token Merging

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Efficient Time Series Processing for Transformers and State-Space Models through Token Merging

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators