1. 19 Feb, 2019 3 commits
  2. 12 Feb, 2019 3 commits
  3. 11 Feb, 2019 3 commits
    • CVE-2019-5736 (runC): rexec callers as memfd · f45ab793
      Christian Brauner authored
      Adam Iwaniuk and Borys Popławski discovered that an attacker can compromise the
      runC host binary from inside a privileged runC container. As a result, this
      could be exploited to gain root access on the host. runC is used as the default
      runtime for containers with Docker, containerd, Podman, and CRI-O.
      
      The attack can be made when attaching to a running container or when starting a
      container running a specially crafted image.  For example, when runC attaches
      to a container the attacker can trick it into executing itself. This could be
      done by replacing the target binary inside the container with a custom binary
      pointing back at the runC binary itself. As an example, if the target binary
      was /bin/bash, this could be replaced with an executable script specifying the
      interpreter path #!/proc/self/exe (/proc/self/exec is a symbolic link created
      by the kernel for every process which points to the binary that was executed
      for that process). As such when /bin/bash is executed inside the container,
      instead the target of /proc/self/exe will be executed - which will point to the
      runc binary on the host. The attacker can then proceed to write to the target
      of /proc/self/exe to try and overwrite the runC binary on the host. However in
      general, this will not succeed as the kernel will not permit it to be
      overwritten whilst runC is executing. To overcome this, the attacker can
      instead open a file descriptor to /proc/self/exe using the O_PATH flag and then
      proceed to reopen the binary as O_WRONLY through /proc/self/fd/<nr> and try to
      write to it in a busy loop from a separate process. Ultimately it will succeed
      when the runC binary exits. After this the runC binary is compromised and can
      be used to attack other containers or the host itself.
      
      This attack is only possible with privileged containers since it requires root
      privilege on the host to overwrite the runC binary. Unprivileged containers
      with a non-identity ID mapping do not have the permission to write to the host
      binary and therefore are unaffected by this attack.
      
      LXC is also impacted in a similar manner by this vulnerability, however as the
      LXC project considers privileged containers to be unsafe no CVE has been
      assigned for this issue for LXC. Quoting from the
      https://linuxcontainers.org/lxc/security/ project's Security information page:
      
      "As privileged containers are considered unsafe, we typically will not consider
      new container escape exploits to be security issues worthy of a CVE and quick
      fix. We will however try to mitigate those issues so that accidental damage to
      the host is prevented."
      
      To prevent this attack, LXC has been patched to create a temporary copy of the
      calling binary itself when it starts or attaches to containers. To do this LXC
      creates an anonymous, in-memory file using the memfd_create() system call and
      copies itself into the temporary in-memory file, which is then sealed to
      prevent further modifications. LXC then executes this sealed, in-memory file
      instead of the original on-disk binary. Any compromising write operations from
      a privileged container to the host LXC binary will then write to the temporary
      in-memory binary and not to the host binary on-disk, preserving the integrity
      of the host LXC binary. Also as the temporary, in-memory LXC binary is sealed,
      writes to this will also fail.
      
      Note: memfd_create() was added to the Linux kernel in the 3.17 release.
      Signed-off-by: 's avatarChristian Brauner <christian.brauner@ubuntu.com>
      Co-Developed-by: 's avatarAleksa Sarai <asarai@suse.de>
      Acked-by: 's avatarSerge Hallyn <serge@hallyn.com>
  4. 09 Feb, 2019 1 commit
  5. 08 Feb, 2019 1 commit
  6. 31 Jan, 2019 2 commits
  7. 11 Dec, 2018 2 commits
  8. 10 Dec, 2018 25 commits