Algorithm performance at different thresholds.

Source publication
Article
The newest video coding standard, the versatile video coding standard (VVC/H.266), came into effect in November 2020. Different from the previous generation standard—high-efficiency video coding (HEVC/H.265)—VVC adopts a more flexible block division structure, the quad-tree with nested multi-type tree (QTMT) structure, which improves its coding per...

Contexts in source publication

Context 1
... order to select the most suitable threshold, we ran the overall algorithm with different threshold formulas on the VVC standard test sequence ParkRunning3 at QP = 22, 27, 32, and 37. The algorithm's performance under the different thresholds is listed in Table 2. Performance was measured using ∆T and the Bjøntegaard Delta Bit Rate (BDBR). ...
Context 2
... T_base(QP_i) and T_prop(QP_i) denote the encoding time spent by the original algorithm and by the algorithm proposed in this paper at QP_i = 22, 27, 32, and 37, respectively. According to the results shown in Table 2, we finally set (a, b) = (0.8, 0.1), with which the proposed algorithm achieves the best performance. The final threshold formula is: ...
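The time-saving metric ∆T referenced above is conventionally computed as the average relative reduction in encoding time over the four tested QP values. A minimal sketch, assuming that standard definition (the per-QP timings below are made up for illustration, not taken from the paper):

```python
# Average encoding-time saving (Delta-T) over QP = 22, 27, 32, 37.
# T_base and T_prop hold hypothetical per-QP timings in seconds;
# the averaging formula is the conventional definition of Delta-T.

def delta_t(t_base, t_prop):
    """Mean percentage time saving across matched QP runs."""
    assert len(t_base) == len(t_prop)
    savings = [(b - p) / b for b, p in zip(t_base, t_prop)]
    return 100.0 * sum(savings) / len(savings)

t_base = {22: 410.2, 27: 362.8, 32: 318.5, 37: 280.1}  # original encoder
t_prop = {22: 231.9, 27: 205.4, 32: 182.6, 37: 160.3}  # proposed algorithm

qps = sorted(t_base)
print(f"Delta-T = {delta_t([t_base[q] for q in qps], [t_prop[q] for q in qps]):.2f}%")
```

A positive ∆T means the proposed algorithm is faster than the anchor; BDBR is then reported alongside it to show the rate-distortion cost of that speed-up.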

Similar publications

Conference Paper
The paper presents the results of subjective tests evaluating the overall quality of video signals encoded with H.264/AVC (Advanced Video Coding), H.265/HEVC (High-Efficiency Video Coding), and Google's VP09. The assessment was made at home, that is, under the conditions in which young people typically watch movies on their laptops. The...

Citations

... In the literature [33], a state-of-the-art CNN model is used to implement CU partitioning by double-checking the correlated texture and limiting the depth; this algorithm achieves very high performance in video coding, with an average coding-time reduction of 42.34% and a BDBR increase of only 0.71%. In the literature [34], although the time saving (TS) is 47.82%, our model is lighter than theirs and its prediction size is larger. More importantly, our BDBR loss is significantly lower than theirs. ...
Article
Compared with the previous generation of High Efficiency Video Coding (HEVC), Versatile Video Coding (VVC) introduces a quadtree with nested multi-type tree (QTMT) partition structure so that the coding unit (CU) partition can better match the video texture features. This partition structure significantly improves the compression efficiency of VVC, but the computational complexity also increases significantly, resulting in longer encoding time. Therefore, we propose a fast CU partition decision algorithm based on a DenseNet network and a decision tree (DT) classifier to reduce the coding complexity of VVC and save more coding time. We extract spatial feature vectors based on the DenseNet network model. Spatial feature vectors are constructed by predicting the boundary probabilities of 4 × 4 blocks in 64 × 64 coding units. Then, using the spatial features as the input of the DT classifier, the top N division modes with the highest prediction probabilities are selected through the classification function of the DT classifier model, and the other division modes are skipped to reduce the computational complexity. Finally, the optimal partition mode is selected by comparing the RD costs. Our proposed algorithm achieves 47.6% encoding time savings on VTM10.0, while BDBR only increases by 0.91%.
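The top-N pruning this abstract describes can be illustrated with a small sketch: given per-mode probabilities from a classifier, keep only the N most likely split modes for RD-cost checking and skip the rest. The six mode labels follow VVC's QTMT options; the probabilities here are made-up classifier outputs, not values from the paper.

```python
# Keep the top-N split modes by predicted probability; the encoder would
# then evaluate RD cost only for these candidates and skip all others.

def top_n_modes(probs, n):
    """Return the n split modes with the highest predicted probability."""
    return [m for m, _ in sorted(probs.items(), key=lambda kv: -kv[1])[:n]]

probs = {
    "NO_SPLIT": 0.42, "QT": 0.25, "BT_H": 0.15,
    "BT_V": 0.10, "TT_H": 0.05, "TT_V": 0.03,
}
candidates = top_n_modes(probs, 3)
print(candidates)  # ['NO_SPLIT', 'QT', 'BT_H']
```

The trade-off is the usual one: a smaller N skips more RD checks (more time saved) but risks pruning the truly optimal mode (higher BDBR).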
... Yoon et al. [15] proposed an activity-based fast block partitioning decision method using the information of the current CU, minimizing the dependence on the QP and utilizing the gradient calculation used in the adaptive loop filter (ALF). Wang et al. [16] designed a multistage early termination CNN (MET-CNN) model to predict the partition information of 32 × 32-sized CUs. In addition, they proposed the concept of stage grid maps by dividing the entire partition into four stages to represent the structured output and consequently predict all partition information of the 32 × 32-sized CUs and their sub-CUs as the model outputs. ...
Article
Versatile Video Coding (VVC), the state-of-the-art video coding standard, was developed by the Joint Video Experts Team (JVET) of ISO/IEC Moving Picture Experts Group (MPEG) and ITU-T Video Coding Experts Group (VCEG) in 2020. Although VVC can provide powerful coding performance, it requires tremendous computational complexity to determine the optimal mode decision during the encoding process. In particular, VVC adopted the bi-prediction with CU-level weight (BCW) as one of the new tools, which enhanced the coding efficiency of conventional bi-prediction by assigning different weights to the two prediction blocks in the process of inter prediction. In this study, we investigate the statistical characteristics of input features that exhibit a correlation with the BCW and define four useful types of categories to facilitate the inter prediction of VVC. With the investigated input features, a lightweight neural network with multilayer perceptron (MLP) architecture is designed to provide high accuracy and low complexity. We propose a fast BCW mode decision method with a lightweight MLP to reduce the computational complexity of the weighted multiple bi-prediction in the VVC encoder. The experimental results show that the proposed method significantly reduced the BCW encoding complexity by up to 33% with unnoticeable coding loss, compared to the VVC test model (VTM) under the random-access (RA) configuration.
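A lightweight MLP of the kind this abstract describes reduces to a short forward pass: a few input features correlated with BCW, one hidden layer, and a softmax over candidate BCW weights. The layer sizes, random weights, and feature values below are illustrative only; the paper's actual architecture and features are not reproduced here.

```python
import numpy as np

# Toy forward pass of a small MLP classifying among hypothetical BCW
# weight candidates. Weights are random; in practice they would be trained.
rng = np.random.default_rng(1)
W1, b1 = rng.standard_normal((4, 8)), np.zeros(8)   # 4 input features -> 8 hidden
W2, b2 = rng.standard_normal((8, 5)), np.zeros(5)   # 8 hidden -> 5 BCW candidates

def predict_bcw(features):
    """Return softmax probabilities over the 5 BCW weight candidates."""
    h = np.maximum(features @ W1 + b1, 0.0)          # ReLU hidden layer
    z = h @ W2 + b2
    e = np.exp(z - z.max())                          # numerically stable softmax
    return e / e.sum()

probs = predict_bcw(np.array([0.3, 1.2, -0.5, 0.8]))  # made-up feature vector
print(probs.argmax(), probs.sum())
```

In a fast mode decision, the encoder would evaluate only the highest-probability BCW candidates (or terminate early when one probability dominates), which is where the reported complexity reduction comes from.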
... The fast algorithms of H.266/VVC intra coding have been explored to solve the problem of the high computational requirement of H.266/VVC. The methods can be roughly categorized into probability-based [10][11][12], learning-based [13][14][15][16][17][18][19][20], probability- and learning-based [21,22], texture-based [23][24][25][26], gradient-based [27,28], and texture- and gradient-based [29][30][31] techniques. The related work on the fast intra coding of H.266/VVC is discussed below. ...
... Wu et al. [16] trained two support vector machine classifiers to predict the split or non-split, and horizontal or vertical split for CUs of different sizes. Wang et al. [17] devised a multi-stage early termination convolutional neural network model that can predict all the partition information of a 32 × 32 CU and its sub-CUs. Zouidi et al. [18] proposed an intra mode decision to skip unlikely intra prediction modes by using a multi-task learning convolutional neural network. ...
Article
The latest international video coding standard, H.266/Versatile Video Coding (VVC), supports high-definition videos, with resolutions from 4K to 8K or even larger. It offers a higher compression ratio than its predecessor, H.265/High Efficiency Video Coding (HEVC). In addition to the quadtree partition structure of H.265/HEVC, the nested multi-type tree (MTT) structure of H.266/VVC provides more diverse splits through binary and ternary trees. It also includes many new coding tools, which tremendously increases the encoding complexity. This paper proposes a fast intra coding algorithm for H.266/VVC based on visual perception analysis. The algorithm applies the factor of average background luminance for just-noticeable-distortion to identify the visually distinguishable (VD) pixels within a coding unit (CU). We propose calculating the variances of the numbers of VD pixels in various MTT splits of a CU. Intra sub-partitions and matrix weighted intra prediction are turned off conditionally based on the variance of the four variances for MTT splits and a thresholding criterion. The fast horizontal/vertical splitting decisions for binary and ternary trees are proposed by utilizing random forest classifiers of machine learning techniques, which use the information of VD pixels and the quantization parameter. Experimental results show that the proposed algorithm achieves around 47.26% encoding time reduction with a Bjøntegaard Delta Bitrate (BDBR) of 1.535% on average under the All Intra configuration. Overall, this algorithm can significantly speed up H.266/VVC intra coding and outperform previous studies.
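The variance-of-variances idea above can be sketched as follows: count VD pixels in the sub-blocks of each MTT split of a CU, take the variance of those counts per split, then the variance of the four per-split variances. The sketch below uses VD-pixel density (count normalized by sub-block area) so that the unequal ternary sub-blocks are comparable; that normalization, the random VD map, and the exact split set are assumptions, not the paper's precise criterion.

```python
import numpy as np

# For each MTT split (BT-H, BT-V, TT-H, TT-V) of a CU, compute the
# variance of the VD-pixel density over its sub-blocks, then take the
# variance of those four variances as the decision statistic.

def split_variances(vd_map):
    """Per-split variance of VD-pixel density for the four MTT splits."""
    h, w = vd_map.shape
    splits = {
        "BT_H": [vd_map[:h//2], vd_map[h//2:]],
        "BT_V": [vd_map[:, :w//2], vd_map[:, w//2:]],
        "TT_H": [vd_map[:h//4], vd_map[h//4:3*h//4], vd_map[3*h//4:]],
        "TT_V": [vd_map[:, :w//4], vd_map[:, w//4:3*w//4], vd_map[:, 3*w//4:]],
    }
    return {name: np.var([b.mean() for b in blocks])
            for name, blocks in splits.items()}

rng = np.random.default_rng(0)
vd = (rng.random((32, 32)) > 0.7).astype(float)   # toy binary VD map
per_split = split_variances(vd)
vov = np.var(list(per_split.values()))            # variance of the four variances
print(per_split, vov)
```

A near-zero variance of variances indicates the VD pixels are spread evenly regardless of split direction, which is the situation where the paper conditionally disables the extra intra tools.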
... Experiments showed that this method is more effective than deep learning. Wang et al. [18] proposed a CNN model with multi-stage early termination that can predict all partition patterns of 32 × 32 CUs. Ni et al. [19] devised a partition strategy for binary and ternary trees by calculating the gradient and applying regression analysis. ...
Article
The Joint Video Exploration Team (JVET) has created the Versatile Video Coding Standard (VVC/H.266), the most up-to-date video coding standard, offering a broad selection of coding tools. The maturity of commercial VVC codecs can significantly reduce costs and improve coding efficiency. However, the latest video coding standard introduces binary and ternary tree partitioning methods, which give coding units (CUs) various shapes and increase the complexity of coding. This article proposes a technique to simplify VVC intra prediction through gradient analysis and a multi-feature fusion CNN. The gradient of a CU is computed with the Sobel operator, and the result is used for pre-decision making. For coding units whose split decision cannot be settled by the gradient alone, the CNN makes a further decision. We calculate the standard deviation (SD) and the initial depth as the input features of the CNN. To implement this method, the initial depth is determined by constructing a segmented depth prediction dictionary; the initial segmentation depth of a coding unit, regardless of its shape, can be determined by consulting this dictionary. The algorithm can decide whether to split CUs of varying sizes, decreasing the complexity of the CU division process and making VVC more practical. Experimental results demonstrate that the proposed algorithm reduces encoding time by 36.56% with a minimal 1.06% increase in Bjøntegaard delta bit rate (BD-BR) compared to the original algorithm.
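The Sobel pre-decision step described above can be sketched as computing the mean gradient magnitude of a CU's luma block: a low value suggests a smooth block whose split decision can be made immediately, while a high or ambiguous value would be deferred (to the CNN, in the paper). The threshold logic and block values here are illustrative, not the paper's.

```python
import numpy as np

# Mean Sobel gradient magnitude of a CU's luma block, as a pre-decision
# feature. A naive sliding-window convolution keeps the sketch dependency-free.
SOBEL_X = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]])
SOBEL_Y = SOBEL_X.T

def mean_gradient(block):
    """Mean Sobel gradient magnitude over the block interior."""
    h, w = block.shape
    gx = np.zeros((h - 2, w - 2))
    gy = np.zeros((h - 2, w - 2))
    for i in range(h - 2):
        for j in range(w - 2):
            win = block[i:i+3, j:j+3]
            gx[i, j] = (win * SOBEL_X).sum()
            gy[i, j] = (win * SOBEL_Y).sum()
    return np.hypot(gx, gy).mean()

flat = np.full((16, 16), 128.0)   # uniform luma block
print(mean_gradient(flat))        # 0.0 -> smooth, safe to skip splitting
```

A production encoder would compute this with vectorized filtering rather than explicit loops; the statistic is the same either way.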
Article
This paper presents a genetic approach to optimizing intra coding in H.266/VVC. The proposed algorithm efficiently selects coding tools and Multi-Type Tree (MTT) partitions to balance encoding time and video quality. A fitness evaluation function combining perceptual metrics and coding efficiency metrics is used to assess the quality of each candidate solution, and the results demonstrate a significant reduction in encoding time without compromising video quality. The algorithm selects coding tools from the set available in H.266/VVC, including intra prediction modes, transform units, quantization parameters, and entropy coding modes. The MTT partitioning scheme includes four partition types: quadtree, binary tree, ternary tree, and quad-binary tree. Perceptual metrics evaluate the visual quality of the encoded video, while coding efficiency metrics evaluate its coding efficiency; the fitness function combines both to score each candidate solution.
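A fitness function of the kind this abstract describes might take the following shape: reward a perceptual quality score while penalizing bitrate and encoding time. The metric names, weights, and the linear combination are assumptions for illustration; the paper does not give its formula here.

```python
# Hypothetical GA fitness for a candidate tool/partition configuration:
# higher is better. The weights trade quality against rate and time and
# would be tuned in practice; these values are illustrative.

def fitness(perceptual_score, bitrate_kbps, time_s,
            w_quality=1.0, w_rate=0.01, w_time=0.05):
    """Combine a perceptual metric with coding-efficiency penalties."""
    return w_quality * perceptual_score - w_rate * bitrate_kbps - w_time * time_s

# Compare two made-up candidate configurations from a GA population.
cand_a = fitness(perceptual_score=92.0, bitrate_kbps=1500.0, time_s=120.0)
cand_b = fitness(perceptual_score=90.5, bitrate_kbps=1400.0, time_s=80.0)
print(cand_a, cand_b, "A" if cand_a > cand_b else "B")
```

In a genetic algorithm, this score drives selection: candidates with higher fitness are more likely to survive and recombine, steering the search toward configurations that cut encoding time without visible quality loss.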