Amirpour, Hadi, Zhu, Jingwen, Zhou, Wei, Callet, Patrick Le and Timmerer, Christian
2025.
VQM4HAS: A real-time quality metric for HEVC videos in HTTP Adaptive Streaming.
IEEE Transactions on Multimedia
, pp. 1-12.
10.1109/tmm.2025.3613110
![]() |
![]() |
PDF
- Published Version
Available under License Creative Commons Attribution. Download (8MB) |
Abstract
In HTTP Adaptive Streaming (HAS), a video is encoded at various bitrate-resolution pairs, collectively known as the bitrate ladder, allowing users to select the most suitable representation based on their network conditions. Optimizing this set of pairs to enhance the Quality of Experience (QoE) requires accurately measuring the quality of these representations. VMAF and ITU-T's P.1204.3 are highly reliable metrics for assessing the quality of representations in HAS. However, in practice, using these metrics for optimization is often impractical for live streaming applications due to their high computational costs and the large number of bitrate-resolution pairs in the bitrate ladder that need to be evaluated. To address their high complexity, our paper introduces a new method called VQM4HAS, which extracts low-complexity features, including (i) video complexity features, (ii) frame-level encoding statistics logged during the encoding process, and (iii) lightweight video quality metrics. These extracted features are then fed into a regression model to predict VMAF or P.1204.3. The VQM4HAS model is designed to operate on a per bitrate-resolution pair, per-resolution, and cross-representation basis, optimizing quality predictions across different scenarios. Our experimental results demonstrate that VQM4HAS achieves a high correlation with VMAF and P.1204.3, with Pearson correlation coefficients (PCC) ranging from 0.95 to 0.96 for VMAF and 0.97 to 0.99 for P.1204.3, depending on the resolution. Despite achieving a high correlation with VMAF and P.1204.3, VQM4HAS exhibits significantly less complexity than both metrics, with 98% and 99% less complexity for VMAF and P.1204.3, respectively, making it suitable for live streaming scenarios. We also conduct a feature importance analysis to further reduce the complexity of the proposed method. Furthermore, we evaluate the effectiveness of our method by using it to predict subjective quality scores. The results show that VQM4HAS achieves a higher correlation with subjective scores at various resolutions despite its minimal complexity. The source code is available at https://github.com/cd-athena/VQM4HAS.
Item Type: | Article |
---|---|
Date Type: | Published Online |
Status: | In Press |
Schools: | Schools > Computer Science & Informatics |
Additional Information: | License information from Publisher: LICENSE 1: URL: https://creativecommons.org/licenses/by/4.0/legalcode, Start Date: 2025-01-01 |
Publisher: | Institute of Electrical and Electronics Engineers |
ISSN: | 1520-9210 |
Date of First Compliant Deposit: | 8 October 2025 |
Last Modified: | 08 Oct 2025 09:15 |
URI: | https://orca.cardiff.ac.uk/id/eprint/181549 |
Actions (repository staff only)
![]() |
Edit Item |