Content area

Abstract

The exponential growth of visual data, including 2D images and 3D modalities such as videos and multi-view images, has placed a premium on efficient representation and processing techniques that extend beyond traditional data compression. This thesis explores compression-driven approaches as innovative solutions for 2D and 3D vision, leveraging compression not merely for data reduction but as a transformative tool to enhance efficiency, versatility, and quality across various vision tasks. By leveraging the principles of compression, this thesis addresses a range of challenges, including reversible image transformations, learned lossy compression, joint task optimization with compression, as well as efficient 3D editing with compact yet expressive representations. To this end, we first develop frameworks that utilize invertible neural networks to encode and recover visual information with high fidelity, enabling tasks such as reversible image conversion and lossy image compression. By exploring the inherent invertibility of neural networks, we demonstrate that compression can serve as a reversible conduit for hiding and reconstructing multiple images within a single embedding, as well as improving the quality and efficiency of learned image compression codecs. We then extend image compression methods beyond data reduction by investigating their synergy with other tasks. Specifically, we propose a joint learning framework for image compression and denoising, leveraging the innate denoising capabilities of compression models. This approach achieves superior results in both domains while revealing the latent connections between data reduction and noise suppression. Finally, we expand the scope of compression to 3D vision, introducing a novel editing-friendly framework that encapsulates the appearance of 3D scenes into compact 2D canonical images. By treating the canonical image as a compressive representation of the 3D scene, this approach enables efficient 3D editing through standard 2D tools, eliminating the need for costly re-optimization while maintaining fidelity to the original scene.

Details

1010268
Business indexing term
Classification
Title
Advancing Compression-Driven Approaches in 2D and 3D Vision
Number of pages
116
Publication year
2025
Degree date
2025
School code
1223
Source
DAI-B 87/1(E), Dissertation Abstracts International
ISBN
9798288810046
Advisor
University/institution
Hong Kong University of Science and Technology (Hong Kong)
University location
Hong Kong
Degree
Ph.D.
Source type
Dissertation or Thesis
Language
English
Document type
Dissertation/Thesis
Dissertation/thesis number
32190692
ProQuest document ID
3232655002
Document URL
https://www.proquest.com/dissertations-theses/advancing-compression-driven-approaches-2d-3d/docview/3232655002/se-2?accountid=208611
Copyright
Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.
Database
ProQuest One Academic