1 files changed, 204 insertions, 0 deletions
diff --git a/doc/tutorial/rebuild.org b/doc/tutorial/rebuild.org
new file mode 100644
index 00000000..3fa7cece
--- /dev/null
+++ b/doc/tutorial/rebuild.org
@@ -0,0 +1,204 @@
+* Ensuring reproducibility of the build
+
+Software builds should be [[https://reproducible-builds.org/][reproducible]].
+The ~just~ tool, supports this goal in local builds by isolating
+individual actions, setting permissions and file time stamps to
+canonical values, etc; most remote execution systems take even further
+measures to ensure the environment always looks the same to every
+action. Nevertheless, it is always possible to break reproducibility
+by bad actions, both coming from rules not carefully written, as
+well as from ad-hoc actions added by the ~generic~ target.
+
+#+BEGIN_SRC js
+...
+, "version.h":
+  { "type": "generic"
+  , "cmds":
+    ["echo '#define VERSION \"0.0.0.'`date +%Y%m%d%H%M%S`'\"' > version.h"]
+  , "outs": ["version.h"]
+  }
+...
+#+END_SRC
+
+Besides time stamps there are many other sources of nondeterminism,
+like properties of the build machine (name, number of CPUs available,
+etc), but also subtle ones like ~readdir~ order. Often, those
+non-reproducible parts get buried deeply in a final artifact (like
+the version string embedded in a binary contained in a compressed
+installation archive); and, as long as the non-reproducible action
+stays in cache, it does not even result in bad incrementality.
+Still, others won't be able to reproduce the exact artifact.
+
+There are tools like [[https://diffoscope.org/][diffoscope]] to deeply
+compare archives and other container formats. Nevertheless, it is
+desirable to find the root causes, i.e., the first (in topological
+order) actions that yield a different output.
+
+** Rebuilding
+
+To search for the root cause of non-reproducibility, ~just~ has
+a subcommand ~rebuild~. It builds the specified target again, requesting
+that every action be executed again (but target-level cache is still
+active); then the result of every action is compared to the one in the
+action cache, if present with the same inputs. So, you typically would
+first ~build~ and then ~rebuild~. Note that a repeated ~build~ simply
+takes the action result from cache.
+
+#+BEGIN_SRC sh
+$ cat TARGETS
+{ "":
+  { "type": "install"
+  , "files":
+    { "bin/hello": "hello"
+    , "share/hello/version.txt": "version.txt"
+    , "share/hello/OUT.txt": "OUT.txt"
+    }
+  }
+, "hello":
+  { "type": ["@", "rules", "CC", "binary"]
+  , "name": ["hello"]
+  , "srcs": ["hello.cpp"]
+  , "private-hdrs": ["version.h"]
+  }
+, "version.h":
+  { "type": "generic"
+  , "cmds":
+    ["echo '#define VERSION \"0.0.0.'`date +%Y%m%d%H%M%S`'\"' > version.h"]
+  , "outs": ["version.h"]
+  }
+, "version.txt":
+  { "type": "generic"
+  , "outs": ["version.txt"]
+  , "cmds": ["./hello -v > version.txt"]
+  , "deps": ["hello"]
+  }
+, "out.txt":
+  { "type": "generic"
+  , "outs": ["out.txt"]
+  , "cmds": ["./hello > out.txt"]
+  , "deps": ["hello"]
+  }
+, "OUT.txt":
+  { "type": "generic"
+  , "outs": ["OUT.txt"]
+  , "cmds": ["tr a-z A-Z > OUT.txt < out.txt"]
+  , "deps": ["out.txt"]
+  }
+}
+$ just build -C $CONF
+INFO: Requested target is [["@","tutorial","",""],{}]
+INFO: Analysed target [["@","tutorial","",""],{}]
+INFO: Export targets found: 0 cached, 0 uncached, 0 not eligible for caching
+INFO: Discovered 6 actions, 1 trees, 0 blobs
+INFO: Building [["@","tutorial","",""],{}].
+INFO: Processed 6 actions, 0 cache hits.
+INFO: Artifacts built, logical paths are:
+        bin/hello [59f7af154b3b7beac4a6cab40499cb3b388220c4:16608:x]
+        share/hello/OUT.txt [428b97b82b6c59cad7488b24e6b618ebbcd819bc:13:f]
+        share/hello/version.txt [088ae5a8a57f62016392bdf124a9b8dfc0288763:39:f]
+$ just build -C $CONF
+INFO: Requested target is [["@","tutorial","",""],{}]
+INFO: Analysed target [["@","tutorial","",""],{}]
+INFO: Export targets found: 0 cached, 0 uncached, 0 not eligible for caching
+INFO: Discovered 6 actions, 1 trees, 0 blobs
+INFO: Building [["@","tutorial","",""],{}].
+INFO: Processed 6 actions, 6 cache hits.
+INFO: Artifacts built, logical paths are:
+        bin/hello [59f7af154b3b7beac4a6cab40499cb3b388220c4:16608:x]
+        share/hello/OUT.txt [428b97b82b6c59cad7488b24e6b618ebbcd819bc:13:f]
+        share/hello/version.txt [088ae5a8a57f62016392bdf124a9b8dfc0288763:39:f]
+$ just rebuild -C $CONF
+INFO: Requested target is [["@","tutorial","",""],{}]
+INFO: Analysed target [["@","tutorial","",""],{}]
+INFO: Export targets found: 0 cached, 0 uncached, 0 not eligible for caching
+INFO: Discovered 6 actions, 1 trees, 0 blobs
+INFO: Rebuilding [["@","tutorial","",""],{}].
+WARN: Found flaky action:
+       - id: c854a382ea26628e1a5b8d4af00d6d0cef433436
+       - cmd: ["sh","-c","echo '#define VERSION \"0.0.0.'`date +%Y%m%d%H%M%S`'\"' > version.h\n"]
+       - output 'version.h' differs:
+         - [6aac3477e22cd57e8c98ded78562d3c017e5d611:39:f] (rebuilt)
+         - [789a29f39b6aa966f91776bfe092e247614e6acd:39:f] (cached)
+INFO: 2 actions compared with cache, 1 flaky actions found (0 of which tainted), no cache entry found for 4 actions.
+INFO: Artifacts built, logical paths are:
+        bin/hello [73994ff43ec1161aba96708f277e8c88feab0386:16608:x]
+        share/hello/OUT.txt [428b97b82b6c59cad7488b24e6b618ebbcd819bc:13:f]
+        share/hello/version.txt [8dd65747395c0feab30891eab9e11d4a9dd0c715:39:f]
+$
+#+END_SRC
+
+In the example, the second action compared to cache is the upper
+casing of the output. Even though the generation of ~out.txt~ depends
+on the non-reproducible ~hello~, the file itself is reproducible.
+Therefore, the follow-up actions are checked as well.
+
+For this simple example, reading the console output is enough to understand
+what's going on. However, checking for reproducibility usually is part
+of a larger, quality-assurance process. To support the automation of such
+processes, the findings can also be reported in machine-readable form.
+
+#+BEGIN_SRC sh
+$ just rebuild -C $CONF --dump-flaky flakes.json --dump-graph actions.json
+[...]
+$ cat flakes.json
+{
+  "cache misses": [
+    "ca45feffd2bab3bdbf752db7c89c451ce99d4803",
+    "eff0117276a0ad65a331eb29a2393a9b6e106e4b",
+    "1eef99333e887f2aa78d2eaee0a7c172db66009c",
+    "72a337628fa6187513818d5d8d00d36fbdb6923e"
+  ],
+  "flaky actions": {
+    "c854a382ea26628e1a5b8d4af00d6d0cef433436": {
+      "version.h": {
+        "cached": {
+          "file_type": "f",
+          "id": "789a29f39b6aa966f91776bfe092e247614e6acd",
+          "size": 39
+        },
+        "rebuilt": {
+          "file_type": "f",
+          "id": "4b5e64f324745d628c893f9d7e29ce7febdfdd0c",
+          "size": 39
+        }
+      }
+    }
+  }
+}$
+#+END_SRC
+
+The file reports the flaky actions together with the non-reproducible
+artifacts they generated, reporting both, the cached and the newly
+generated output. The files themselves can be obtained via ~just
+install-cas~ as usual, allowing deeper comparison of the outputs.
+The full definitions of the actions can be found in the action graph,
+in the example dumped as well as ~actions.json~; this definition
+also includes the origins for each action, i.e., the configured
+targets that requested the respective action.
+
+
+** Comparing build environments
+
+Simply rebuilding on the same machine is good way to detect embedded
+time stamps of sufficiently small granularity; for other sources of
+non-reproducibility, however, more modifications of the environment
+are necessary.
+
+A simple, but effective, way for modifying the build environment
+is the option ~-L~ to the set the local launcher, a list of
+strings the argument vector is prefixed with before the action is
+executed. The default ~["env", "--"]~ simply resolves the program
+to be executed in the current value of ~PATH~, but a different
+value for the launcher can obviously be used to set environment
+variables like ~LD_PRELOAD~. Relevant libraries and tools
+include [[https://github.com/wolfcw/libfaketime][libfaketime]],
+[[https://github.com/dtcooper/fakehostname][fakehostname]],
+and [[https://salsa.debian.org/reproducible-builds/disorderfs][disorderfs]].
+
+More variation can be achieved by comparing remote execution builds,
+either for two different remote-execution end points or comparing
+one remote-execution end point to the local build. The latter is
+also a good way to find out where a build that "works on my machine"
+differs. The endpoint on which the rebuild is executed can be set,
+in the same way as for build with the ~-r~ option; the cache end
+point to compare against can be set via the ~--vs~ option.