Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 

M2S2L: Mamba-based multi-scale spatial-temporal learning for video anomaly detection

Liu, Yang, Chen, Boan, Zhu, Xiaoguang, Liu, Jing, Sun, Peng and Zhou, Wei 2025. M2S2L: Mamba-based multi-scale spatial-temporal learning for video anomaly detection. Presented at: 2025 International Conference on Visual Communications and Image Processing (VCIP), Klagenfurt, Austria, 1-4 December 2025. 2025 International Conference on Visual Communications and Image Processing (VCIP). IEEE, 10.1109/vcip67698.2025.11396919

Full text not available from this repository.

Abstract

Video anomaly detection (VAD) is an essential task in the image processing community with applications in video surveillance, but it faces a fundamental challenge in balancing detection accuracy with computational efficiency. As video content becomes increasingly complex, with diverse behavioral patterns and contextual scenarios, traditional VAD approaches struggle to provide robust assessment for modern surveillance systems. Existing methods either lack comprehensive spatial-temporal modeling or require too many computational resources for real-time applications. To address this, we present a Mamba-based multi-scale spatial-temporal learning (M2S2L) framework. The proposed method employs hierarchical spatial encoders operating at multiple granularities and multi-temporal encoders capturing motion dynamics across different time scales. We also introduce a feature decomposition mechanism that enables task-specific optimization for appearance and motion reconstruction, facilitating more nuanced behavioral modeling and quality-aware anomaly assessment. Experiments on three benchmark datasets demonstrate that the M2S2L framework achieves frame-level AUCs of 98.5%, 92.1%, and 77.9% on UCSD Ped2, CUHK Avenue, and ShanghaiTech, respectively, while maintaining efficiency at 20.1G FLOPs and an inference speed of 45 FPS, making it suitable for practical surveillance deployment.
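The abstract reports frame-level AUCs for a model trained on appearance and motion reconstruction. As an illustration of how such a metric is commonly computed, the following minimal Python sketch fuses per-frame appearance and motion reconstruction errors into one anomaly score and evaluates it against frame labels. This record does not describe the authors' scoring procedure, so everything here is an assumption made for illustration: the function names, the min-max normalization, the equal fusion weight, and the toy error values are all hypothetical and are not taken from the paper.

```python
def min_max_normalize(scores):
    """Rescale a list of per-frame scores to [0, 1] (assumed normalization)."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]


def fuse_scores(appearance_err, motion_err, weight=0.5):
    """Weighted sum of normalized appearance and motion errors.

    The weight of 0.5 is a hypothetical choice, not a value from the paper.
    """
    app = min_max_normalize(appearance_err)
    mot = min_max_normalize(motion_err)
    return [weight * a + (1.0 - weight) * m for a, m in zip(app, mot)]


def frame_level_auc(scores, labels):
    """Frame-level AUC: the probability that a randomly chosen anomalous
    frame scores higher than a randomly chosen normal frame (ties count 0.5).
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one anomalous and one normal frame")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy example: six frames, two of them labeled anomalous (1).
appearance_err = [0.10, 0.12, 0.50, 0.11, 0.45, 0.09]
motion_err = [0.20, 0.22, 0.70, 0.60, 0.21, 0.19]
labels = [0, 0, 1, 0, 1, 0]

fused = fuse_scores(appearance_err, motion_err)
auc_fused = frame_level_auc(fused, labels)
auc_motion_only = frame_level_auc(motion_err, labels)
print(f"fused AUC: {auc_fused:.3f}, motion-only AUC: {auc_motion_only:.3f}")
```

In this toy data the motion error alone misranks one normal frame above an anomalous one, while the fused score separates the two classes, which is the usual motivation for combining the two error streams. The pairwise loop is quadratic in the number of frames; a rank-based formula would be preferable on full benchmark sequences.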

Item Type: Conference or Workshop Item - published (Paper)
Date Type: Publication
Status: Published
Schools: Schools > Computer Science & Informatics
Publisher: IEEE
ISBN: 979-8-3315-6868-9
ISSN: 2642-9357
Last Modified: 13 Mar 2026 11:00
URI: https://orca.cardiff.ac.uk/id/eprint/185725
