Abstract
In this paper, we present a reinforcement learning (RL)-based coding method to recursively divide video frames into several groups displaying similar temporal characteristics and improve rate-distortion (R-D) performance. Although the previous works have attempted to challenge the problem with analytical models, it was difficult to address complicated de-pendencies of video frames. In the proposed method, we cast the recursive problem as a sequence of a state-action for an agent to conduct an RL, by partitioning the current group to the half. The optimal solution is obtained by maximizing a reward function of the RL policy. Experimental results demonstrate that the proposed method can adapt to a video sequence whereas a fixed coding scheme cannot efficiently achieve optimal coding performance in dynamic video sequences.
| Original language | English |
|---|---|
| Title of host publication | 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 - Proceedings |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 1598-1601 |
| Number of pages | 4 |
| ISBN (Electronic) | 9789881476890 |
| State | Published - 2021 |
| Event | 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 - Tokyo, Japan Duration: 14 Dec 2021 → 17 Dec 2021 |
Publication series
| Name | 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 - Proceedings |
|---|
Conference
| Conference | 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 |
|---|---|
| Country/Territory | Japan |
| City | Tokyo |
| Period | 14/12/21 → 17/12/21 |
Bibliographical note
Publisher Copyright:© 2021 APSIPA.