On Sequence-to-Sequence Models for Automated Log Parsing
A new research paper on arXiv evaluates four sequence modeling architectures for automated log parsing, finding that Transformers achieve the best performance with a mean relative edit distance of 0.111. Mamba models offer a competitive alternative with significantly lower computational costs. The study also indicates that character-level tokenization generally improves accuracy, while sequence length has minimal impact on Transformer performance. AI
IMPACT Provides practical guidance on selecting sequence modeling architectures for automated log parsing, highlighting Transformer and Mamba's effectiveness.