Sparse/DCT (S/DCT) two-layered representation of prediction residuals for video coding

Je Won Kang, Moncef Gabbouj, C. C. Jay Kuo

Research output: Contribution to journal › Article › peer-review

29 Scopus citations


In this paper, we propose a cascaded sparse/DCT (S/DCT) two-layer representation of prediction residuals and implement it on top of the state-of-the-art High Efficiency Video Coding (HEVC) standard. First, a dictionary is adaptively trained to contain characteristic patterns of residual signals, so that a large portion of the energy in a structured residual can be coded efficiently via sparse coding. The sparse representation alone is observed to be less effective in rate-distortion (R-D) performance at higher bit rates because of the side-information overhead. To overcome this problem, a DCT representation is cascaded as the second stage and applied to the remaining signal to improve coding efficiency. The two representations successfully complement each other. Experimental results demonstrate that the proposed algorithm outperforms the HEVC reference codec HM5.0 under the Common Test Conditions.
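The cascade described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation or anything from HM5.0. It assumes a fixed, already-trained dictionary (the paper trains it adaptively), uses plain orthogonal matching pursuit as a stand-in sparse coder, leaves the sparse coefficients unquantized, and applies uniform rounding to the DCT layer. The names `dct_matrix`, `omp`, `s_dct_encode`, `s_dct_decode`, `max_atoms` and `qstep` are all hypothetical.

```python
import numpy as np


def dct_matrix(n):
    """Orthonormal DCT-II basis as an n x n matrix (rows are basis vectors)."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    c = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    c[0, :] = np.sqrt(1.0 / n)
    return c


def omp(dictionary, signal, max_atoms):
    """Orthogonal matching pursuit (assumed sparse coder, not from the paper)."""
    coeffs = np.zeros(dictionary.shape[1])
    support, sol = [], None
    remainder = signal.copy()
    for _ in range(max_atoms):
        # Pick the atom most correlated with what is still unexplained.
        idx = int(np.argmax(np.abs(dictionary.T @ remainder)))
        if idx in support:
            break
        support.append(idx)
        # Re-fit all selected atoms jointly by least squares.
        sol, *_ = np.linalg.lstsq(dictionary[:, support], signal, rcond=None)
        remainder = signal - dictionary[:, support] @ sol
    if support:
        coeffs[support] = sol
    return coeffs


def s_dct_encode(block, dictionary, max_atoms, qstep):
    """Layer 1: sparse code of the residual block. Layer 2: quantized DCT
    of whatever the sparse layer leaves behind."""
    n = block.shape[0]
    sparse = omp(dictionary, block.ravel(), max_atoms)
    layer1 = (dictionary @ sparse).reshape(n, n)
    c = dct_matrix(n)
    levels = np.round(c @ (block - layer1) @ c.T / qstep)
    return sparse, levels


def s_dct_decode(sparse, levels, dictionary, qstep):
    """Sum the sparse-layer and DCT-layer reconstructions."""
    n = levels.shape[0]
    c = dct_matrix(n)
    layer1 = (dictionary @ sparse).reshape(n, n)
    layer2 = c.T @ (levels * qstep) @ c
    return layer1 + layer2


# Toy usage: a random unit-norm dictionary stands in for a trained one.
rng = np.random.default_rng(0)
atoms = rng.standard_normal((64, 128))
atoms /= np.linalg.norm(atoms, axis=0)
residual = rng.standard_normal((8, 8))
code, levels = s_dct_encode(residual, atoms, max_atoms=4, qstep=0.5)
recon = s_dct_decode(code, levels, atoms, qstep=0.5)
```

In this sketch the sparse layer captures whatever a few atoms can explain, and the DCT layer bounds the final error through its quantization step alone, which is one way to read the claim that the two representations complement each other.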

Original language: English
Article number: 6494295
Pages (from-to): 2711-2722
Number of pages: 12
Journal: IEEE Transactions on Image Processing
Issue number: 7
State: Published - 2013


  • discrete cosine transform (DCT)
  • high efficiency video coding (HEVC)
  • multilayered coding
  • overcomplete dictionary based video coding
  • residual coding
  • sparse representation
  • ρ domain rate model

