From 000b8dd95459118d3a9f0eab6c0e2e25cd1a9465 Mon Sep 17 00:00:00 2001
From: Simon Sapin <simon.sapin@exyr.org>
Date: Fri, 21 Sep 2018 18:43:55 +0200
Subject: [PATCH] Taskcluster README

---
 etc/ci/taskcluster/README.md | 191 +++++++++++++++++++++++++++++++++++
 1 file changed, 191 insertions(+)
 create mode 100644 etc/ci/taskcluster/README.md

diff --git a/etc/ci/taskcluster/README.md b/etc/ci/taskcluster/README.md
new file mode 100644
index 00000000000..6ed651fbf1f
--- /dev/null
+++ b/etc/ci/taskcluster/README.md
@@ -0,0 +1,191 @@
+# Testing Servo on Taskcluster
+
+## Homu
+
+When a pull request is reviewed and the appropriate command is given,
+[Homu] creates a merge commit of `master` and the PR’s branch, and pushes it to the `auto` branch.
+One or more CI system (through their own means) get notified of this push by GitHub,
+start testing the merge commit, and use the [GitHub Status API] to report results.
+
+Through a [Webhook], Homu gets notified of changes to these statues.
+If all of the required statuses are reported successful,
+Homu pushes its merge commit to the `master` branch
+and goes on to testing the next pull request in its queue.
+
+[Homu]: https://github.com/servo/servo/wiki/Homu
+[GitHub Status API]: https://developer.github.com/v3/repos/statuses/
+[Webhook]: https://developer.github.com/webhooks/
+
+
+## Taskcluster − GitHub integration
+
+Taskcluster is very flexible and not necessarily tied to GitHub,
+but it does have an optional [GitHub integration service] that you can enable
+on a repository [as a GitHub App].
+When enabled, this service gets notified for every push, pull request, or GitHub release.
+It then schedules some tasks based on reading [`.taskcluster.yml`] in the corresponding commit.
+
+This file contains templates for creating one or more tasks,
+but the logic it can support is fairly limited.
+So a common pattern is to have it only run a single initial task called a *decision task*
+that can have complex logic based on code and data in the repository
+to build an arbitrary [task graph].
+
+[GitHub integration service]: https://docs.taskcluster.net/docs/manual/using/github
+[as a GitHub App]: https://github.com/apps/taskcluster
+[`.taskcluster.yml`]: https://docs.taskcluster.net/docs/reference/integrations/taskcluster-github/docs/taskcluster-yml-v1
+[task graph]: https://docs.taskcluster.net/docs/manual/using/task-graph
+
+
+## Servo’s decision task
+
+This repository’s [`.taskcluster.yml`][tc.yml] schedules a single task
+that runs the Python 3 script [`etc/ci/taskcluster/decision-task.py`](decision-task.py).
+It is called a *decision task* as it is responsible for deciding what other tasks to schedule.
+
+The Docker image that runs the decision task
+is hosted on Docker Hub at [`servobrowser/taskcluster-bootstrap`][hub].
+It is built by [Docker Hub automated builds] based on a `Dockerfile`
+in the [`taskcluster-bootstrap-docker-images`] GitHub repository.
+Hopefully, this image does not need to be modified often
+as it only needs to clone the repository and run Python.
+
+[tc.yml]: ../../../.taskcluster.yml
+[hub]: https://hub.docker.com/r/servobrowser/taskcluster-bootstrap/
+[Docker Hub automated builds]: https://docs.docker.com/docker-hub/builds/
+[`taskcluster-bootstrap-docker-images`]: https://github.com/servo/taskcluster-bootstrap-docker-images/
+
+
+## In-tree Docker images
+
+[Similar to Firefox][firefox], Servo’s decision task supports running other tasks
+in Docker images built on-demand, based on `Dockerfile`s in the main repository.
+Modifying a `Dockerfile` and relying on those new changes
+can be done in the same pull request or commit.
+
+To avoid rebuilding images on every pull request,
+they are cached based on a hash of the source `Dockerfile`.
+For now, to support this hashing, we make `Dockerfile`s be self-contained (with one exception).
+Images are built without a [context],
+so instructions like [`COPY`] cannot be used because there is nothing to copy from.
+The exception is that the decision task adds support for a non-standard include directive:
+when a `Dockerfile` first line is `% include` followed by a filename,
+that line is replaced with the content of that file.
+
+For example,
+[`etc/ci/taskcluster/docker/build.dockerfile`](docker/build.dockerfile) starts like so:
+
+```Dockerfile
+% include base.dockerfile
+
+RUN \
+    apt-get install -qy --no-install-recommends \
+# […]
+```
+
+[firefox]: https://firefox-source-docs.mozilla.org/taskcluster/taskcluster/docker-images.html
+[context]: https://docs.docker.com/engine/reference/commandline/build/#extended-description
+[`COPY`]: https://docs.docker.com/engine/reference/builder/#copy
+
+
+## Build artifacts
+
+[web-platform-tests] (WPT) is large enough that running all of a it takes a long time.
+So it supports *chunking*,
+such as multiple chunks of the test suite can be run in parallel on different machines.
+As of this writing,
+Servo’s current Buildbot setup for this has each machine start by compiling its own copy of Servo.
+On Taskcluster with a decision task,
+we can have a single build task save its resulting binary executable as an [artifact],
+together with multiple testing tasks that each depend on the build task
+(wait until it successfully finishes before they can start)
+and start by downloading the artifact that was saved earlier.
+
+The logic for all this is in [`decision-task.py`](decision-task.py)
+and can be modified in any pull request.
+
+[web-platform-tests]: https://github.com/web-platform-tests/wpt
+[artifact]: https://docs.taskcluster.net/docs/manual/using/artifacts
+
+
+## Log artifacts
+
+Taskcluster automatically save the `stdio` output of a task as an artifact,
+and as special support for seeing and streaming that output while the task is still running.
+
+Servo’s decision task additionally looks for `*.log` arguments to its tasks’s commands,
+assumes they instruct a program to create a log file with that name,
+and saves those log files as individual artifacts.
+
+For example, WPT tasks have a `filtered-wpt-errorsummary.log` artifact
+that is typically the most relevant output when such a task fails.
+
+
+## Scopes and roles
+
+[Scopes] are what Taskcluster calls permissions.
+They control access to everything.
+
+Anyone logged in in the [web UI] has (access to) a set of scopes,
+which is visible on the [credentials] page
+(reachable from clicking on one’s own name on the top-right of any page).
+
+A running task has a set of scopes allowing it access to various functionality and APIs.
+It can grant those scopes (and at most only thoses) to sub-tasks that it schedules
+(if it has the scope allowing it to schedule new tasks in the first place).
+
+[Roles] represent each a set of scopes.
+They can be granted to… things,
+and then configured separately to modify what scopes they expand to.
+
+For example, when Taskcluster-GitHub schedules tasks based on the `.taskcluster.yml` file
+in a push to the `auto` branch of this repository,
+those tasks are granted the scope `assume:repo:github.com/servo/servo:branch:auto`.
+Scopes that start with `assume:` are special,
+they expand to the scopes defined in the matching roles.
+In this case, the [`repo:github.com/servo/servo:branch:*`][branches] role matches.
+
+Servo admins have scope `auth:update-role:repo:github.com/servo/*` which allows them
+to edit that role in the web UI and grant more scopes to these tasks
+(if that person has the new scope themselves).
+
+[Scopes]: https://docs.taskcluster.net/docs/manual/design/apis/hawk/scopes
+[web UI]: https://tools.taskcluster.net/
+[credentials]: https://tools.taskcluster.net/credentials
+[Roles]: https://docs.taskcluster.net/docs/manual/design/apis/hawk/roles
+[branches]: https://tools.taskcluster.net/auth/roles/repo%3Agithub.com%2Fservo%2Fservo%3Abranch%3A*
+
+
+## AWS EC2 workers
+
+As of this writing, Servo on Taskcluster can only use the `servo-docker-worker` worker type.
+Tasks scheduled with this worker type run in a Linux environment,
+in a Docker container, on an AWS EC2 virtual machine.
+
+These machines are short-lived “spot instances”.
+They are started automatically as needed by the [AWS provisioner]
+when the existing capacity is insufficient to execute queued tasks.
+They terminate themselves after being idle without work for a while,
+or unconditionally after a few days.
+Because these workers are short-lived,
+we don’t need to worry about evicting old entries from Cargo’s or rustup’s download cache,
+for example.
+
+Servo admins can view and edit the [worker type definition] which configures the provisioner,
+in particular with the types of EC2 instances to be used.
+
+[AWS provisioner]: https://docs.taskcluster.net/docs/reference/integrations/aws-provisioner/references/api
+[worker type definition]: https://tools.taskcluster.net/aws-provisioner/servo-docker-worker/view
+
+
+## Self-service, Bugzilla, and IRC
+
+Taskcluster is designed to be “self-service” as much as possible,
+with features like in-tree `.taskcluster.yml`
+or the web UI for modifying the worker type definitions.
+However some changes like adding a new worker type still require Taskcluster admin access.
+For those, file requests on Bugzilla under [Taskcluster :: Service Request][req].
+
+For asking for help less formally, try the `#servo` or `#taskcluster` channels on Mozilla IRC.
+
+[req]: https://bugzilla.mozilla.org/enter_bug.cgi?product=Taskcluster&component=Service%20Request