視訊點雲的壓縮(Video-based Point Cloud Compression)現有標準利用分割方式GRS (Grid Refine Segmentation),將三維空間的點雲資訊投影到不同的二維平面,再透過現有二維影像壓縮技術標準HEVC(High Efficiency Video Coding)進行二維影像壓縮。本論文根據適用於三維點雲場景語意分割之深度學習網路為基礎,改善V-PCC將點雲資訊投影到多個不同二維平面的投影分類,並在幾何BD-PSNR平均提升0.02dB。另外,在GRS處理過程至少有90%以上的執行程序並未改變投影面結果,因此本論文針對這點進行演算法的加速優化,預測該點後續投影平面是否有變化的可能,減少不必要的計算步驟。BD-Rate增加幅度不大的情況下,在GRS階段可降低15%的執行時間。
Video-based Point Cloud Compression (Video-based Point Cloud Compression) is the existing standard that uses the segmentation method GRS (Grid Refine Segmentation) to project the 3D point cloud information into different 2D planes, and then use the existing mature 2D image compression technology standard HEVC (High Efficiency Video Coding) performing 2D image compression. Based on a deep learning network suitable for semantic segmentation of 3D point cloud scenes, this paper improves the projection classification of V-PCC onto multiple different 2D planes, and improves Geomotry BD-PSNR by an average value of 0.02dB. Also, in the GRS process, at least 90% of the cases did not change the projection surface results during the refinement. Based on this point, we aim to accelerate the optimization of the algorithm, and predict whether the subsequent projection plane may change, and reduce unnecessary computations. The results show that the increase in BD-Rate is not large, and the execution time can be further reduced by 15%.