Abstract
In this paper, we present a reinforcement learning (RL)-based coding method to recursively divide video frames into several groups displaying similar temporal characteristics and improve rate-distortion (R-D) performance. Although the previous works have attempted to challenge the problem with analytical models, it was difficult to address complicated de-pendencies of video frames. In the proposed method, we cast the recursive problem as a sequence of a state-action for an agent to conduct an RL, by partitioning the current group to the half. The optimal solution is obtained by maximizing a reward function of the RL policy. Experimental results demonstrate that the proposed method can adapt to a video sequence whereas a fixed coding scheme cannot efficiently achieve optimal coding performance in dynamic video sequences.
Original language | English |
---|---|
Title of host publication | 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 - Proceedings |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 1598-1601 |
Number of pages | 4 |
ISBN (Electronic) | 9789881476890 |
State | Published - 2021 |
Event | 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 - Tokyo, Japan Duration: 14 Dec 2021 → 17 Dec 2021 |
Publication series
Name | 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 - Proceedings |
---|
Conference
Conference | 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 |
---|---|
Country/Territory | Japan |
City | Tokyo |
Period | 14/12/21 → 17/12/21 |
Bibliographical note
Publisher Copyright:© 2021 APSIPA.