Implement blob chunking algorithm negotiation

author: Sascha Roloff <sascha.roloff@huawei.com> 2024-02-23 15:46:48 +0100
committer: Sascha Roloff <sascha.roloff@huawei.com> 2024-02-26 17:16:21 +0100
commit: 25ef9672988f008e61193228756dcfed069bda57 (patch)
tree: 946dfd5472d4228832c019204304d004a097d936 /src/buildtool/execution_api/execution_service/cas_server.hpp
parent: 1debca0855d2e4ae8cf08498148831124b65bd9e (diff)
download: justbuild-25ef9672988f008e61193228756dcfed069bda57.tar.gz
1 files changed, 36 insertions, 9 deletions
diff --git a/src/buildtool/execution_api/execution_service/cas_server.hpp b/src/buildtool/execution_api/execution_service/cas_server.hpp
index 306c2833..fd77a03e 100644
--- a/src/buildtool/execution_api/execution_service/cas_server.hpp
+++ b/src/buildtool/execution_api/execution_service/cas_server.hpp
@@ -120,17 +120,44 @@ class CASServiceImpl final
         -> ::grpc::Status override;
     // Split a blob into chunks.
     //
+    // This splitting API aims to reduce download traffic between client and
+    // server, e.g., if a client needs to fetch a large blob that just has been
+    // modified slightly since the last built. In this case, there is no need to
+    // fetch the entire blob data, but just the binary differences between the
+    // two blob versions, which are typically determined by deduplication
+    // techniques such as content-defined chunking.
+    //
     // Clients can use this API before downloading a blob to determine which
     // parts of the blob are already present locally and do not need to be
-    // downloaded again.
-    //
-    // The blob is split into chunks which are individually stored in the CAS. A
-    // list of the chunk digests is returned in the order in which the chunks
-    // have to be concatenated to assemble the requested blob.
-    //
-    // Using this API is optional but it allows clients to download only the
-    // missing parts of a blob instead of the entire blob data, which in turn
-    // can considerably reduce network traffic.
+    // downloaded again. The server splits the blob into chunks according to a
+    // specified content-defined chunking algorithm and returns a list of the
+    // chunk digests in the order in which the chunks have to be concatenated to
+    // assemble the requested blob.
+    //
+    // A client can expect the following guarantees from the server if a split
+    // request is answered successfully:
+    //  1. The blob chunks are stored in CAS.
+    //  2. Concatenating the blob chunks in the order of the digest list
+    //     returned by the server results in the original blob.
+    //
+    // The usage of this API is optional for clients but it allows them to
+    // download only the missing parts of a large blob instead of the entire
+    // blob data, which in turn can considerably reduce download network
+    // traffic.
+    //
+    // Since the generated chunks are stored as blobs, they underlie the same
+    // lifetimes as other blobs. However, their lifetime is extended if they are
+    // part of the result of a split blob request.
+    //
+    // For the client, it is recommended to verify whether the digest of the
+    // blob assembled by the fetched chunks results in the requested blob
+    // digest.
+    //
+    // If several clients use blob splitting, it is recommended that they
+    // request the same splitting algorithm to benefit from each others chunking
+    // data. In combination with blob splicing, an agreement about the chunking
+    // algorithm is recommended since both client as well as server side can
+    // benefit from each others chunking data.
     //
     // Errors:
     //
author	Sascha Roloff <sascha.roloff@huawei.com>	2024-02-23 15:46:48 +0100
committer	Sascha Roloff <sascha.roloff@huawei.com>	2024-02-26 17:16:21 +0100
commit	25ef9672988f008e61193228756dcfed069bda57 (patch)
tree	946dfd5472d4228832c019204304d004a097d936 /src/buildtool/execution_api/execution_service/cas_server.hpp
parent	1debca0855d2e4ae8cf08498148831124b65bd9e (diff)
download	justbuild-25ef9672988f008e61193228756dcfed069bda57.tar.gz