Gholami, Afshin, Baldoni, Sara, Battisti, Federica, Zhou, Wei, Timmerer, Christian and Amirpour, Hadi
2025.
Perceptual quality assessment of spatial videos on Apple Vision Pro.
Presented at: MM '25:The 33rd ACM International Conference on Multimedia,
Dublin, Ireland,
31 October 2025.
IXR '25: Proceedings of the 3rd International Workshop on Interactive eXtended Reality.
Proceedings of the 3rd International Workshop on Interactive eXtended Reality.
Dublin:
ACM,
pp. 20-28.
10.1145/3746269.3760422
|
|
PDF
- Published Version
Available under License Creative Commons Attribution. Download (34MB) |
Abstract
Immersive stereoscopic (3D) video experiences have entered a new era with the advent of smartphones capable of capturing stereoscopic videos, advanced video codecs optimized for multiview content, and Head Mounted Displays (HMDs) that natively support stereoscopic video playback. In particular, Apple's recent introduction of spatial video capture on the recent iPhone Pro series and immersive playback on the Apple Vision Pro (AVP) has accelerated the mainstream adoption of stereoscopic content. In this work, we evaluate the quality of spatial videos encoded using optimized x265 software implementations of Multiview HEVC (MV-HEVC) on the AVP and compare them with their corresponding 2D versions through a subjective test. To support this study, we introduce SV-QoE, a novel dataset comprising video clips rendered with a twin-camera setup that replicates the human inter-pupillary distance. Our analysis reveals that spatial videos consistently deliver a superior Quality of Experience (QoE) when encoded at similar bitrates, with the benefits becoming more pronounced at higher bitrates. Additionally, renderings at closer distances exhibit significantly enhanced video quality and depth perception, highlighting the impact of spatial proximity on immersive viewing experiences. We further analyze the impact of disparity on depth perception and examine the correlation between Mean Opinion Score (MOS) and established objective quality metrics such as PSNR, SSIM, MS-SSIM, VMAF, and AVQT. Additionally, we explore how video quality and depth perception together influence overall quality judgments. The complete dataset, including videos and subjective scores, is publicly available at https://github.com/cd-athena/SV-QoE.
| Item Type: | Conference or Workshop Item (Paper) |
|---|---|
| Date Type: | Publication |
| Status: | Published |
| Schools: | Schools > Computer Science & Informatics |
| Publisher: | ACM |
| ISBN: | 979-8-4007-2051-2 |
| Date of First Compliant Deposit: | 6 November 2025 |
| Last Modified: | 06 Nov 2025 10:30 |
| URI: | https://orca.cardiff.ac.uk/id/eprint/182175 |
Actions (repository staff only)
![]() |
Edit Item |




Dimensions
Dimensions