HoSZp: An Efficient Homomorphic Error-bounded Lossy Compressor for Scientific Data

T Agarwal, S Di, J Huang, Y Huang… - arXiv preprint arXiv …, 2024 - arxiv.org
arXiv preprint arXiv:2408.11971, 2024arxiv.org
Error-bounded lossy compression has been a critical technique to significantly reduce the
sheer amounts of simulation datasets for high-performance computing (HPC) scientific
applications while effectively controlling the data distortion based on user-specified error
bound. In many real-world use cases, users must perform computational operations on the
compressed data (aka homomorphic compression). However, none of the existing error-
bounded lossy compressors support the homomorphism, inevitably resulting in undesired …
Error-bounded lossy compression has been a critical technique to significantly reduce the sheer amounts of simulation datasets for high-performance computing (HPC) scientific applications while effectively controlling the data distortion based on user-specified error bound. In many real-world use cases, users must perform computational operations on the compressed data (a.k.a. homomorphic compression). However, none of the existing error-bounded lossy compressors support the homomorphism, inevitably resulting in undesired decompression costs. In this paper, we propose a novel homomorphic error-bounded lossy compressor (called HoSZp), which supports not only error-bounding features but efficient computations (including negation, addition, multiplication, mean, variance, etc.) on the compressed data without the complete decompression step, which is the first attempt to the best of our knowledge. We develop several optimization strategies to maximize the overall compression ratio and execution performance. We evaluate HoSZp compared to other state-of-the-art lossy compressors based on multiple real-world scientific application datasets.
arxiv.org
Showing the best result for this search. See all results