Dynamic Volumetric Video Coding with Tensor Decomposition

Ju Yeon Shin, Yeoneui Kim, Je Won Kang, Gun Bang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Recently, volumetric video coding based on neural radiance fields has gained significant attention for storing and transmitting three-dimensional (3D) scenes captured from multi-view video. Because the neural networks are trained to produce novel view synthesis of surrounding 3D scenes, compressing the model and then rendering the colors and geometry through the decompressed model can be utilized as a 3D video coding system. However, although this approach provides superior performance compared to conventional 3D video coding standards using depth video, challenges remain in reducing overall model sizes to improve coding efficiency. In this paper, we propose a novel dynamic volumetric video coding technique that employs a Group of Volume (GoV) to divide multi-view video sequences into smaller chunks, addressing complex temporal dynamics. Our method uses volumetric video features represented with 3D spatial and temporal tensor matrices and vectors and encodes them with the GoVs. The tensors are compressed by existing 2D video codec, allowing for fast rendering and easing deployment. Experimental results validate that our method not only reduces memory footprint but also maintains high-quality rendering as compared to state-of-the-art studies.

Original languageEnglish
Title of host publication2024 IEEE International Conference on Visual Communications and Image Processing, VCIP 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331529543
DOIs
StatePublished - 2024
Event2024 IEEE International Conference on Visual Communications and Image Processing, VCIP 2024 - Tokyo, Japan
Duration: 8 Dec 202411 Dec 2024

Publication series

Name2024 IEEE International Conference on Visual Communications and Image Processing, VCIP 2024

Conference

Conference2024 IEEE International Conference on Visual Communications and Image Processing, VCIP 2024
Country/TerritoryJapan
CityTokyo
Period8/12/2411/12/24

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

Keywords

  • and 3D Video Coding Standards
  • Neural Radiance Fields
  • Volumetric Video Coding
  • Voxel-grid Representation

Fingerprint

Dive into the research topics of 'Dynamic Volumetric Video Coding with Tensor Decomposition'. Together they form a unique fingerprint.

Cite this