Confidence Level TBD: This article has not been reviewed for accuracy, timeliness, or completeness. Check that this information is valid before acting on it.
Jira tickets: https://hysds-core.atlassian.net/browse/HC-137, https://hysds-core.atlassian.net/browse/HC-138, https://hysds-core.atlassian.net/browse/HC-139
Introduction
This guide describes how to enable and visualize Docker execution stats for HySDS jobs.

By default, when a verdi worker executes a job (i.e. runs the docker command for the job/PGE), it tracks the wall clock run time of the docker command (taking a timestamp before and after to calculate the duration). In addition to this metric, verdi also tracks the number and size of input files that were localized and the number and size of output files that were published. Aside from these, no other resource utilization metrics are recorded.

However, there are cases where end users are interested in tracking the resource utilization of a job (docker container execution), in particular the docker container's total CPU utilization and maximum memory usage. The docker stats command is a feature of Docker that allows end users to track the live resource utilization of any and all containers: https://docs.docker.com/engine/reference/commandline/stats/. For example:
Code Block:
$ docker stats --all --format "table {{.Container}}\t{{.CPUPerc}}\t{{.MemUsage}}" fervent_panini 5acfcb1b4fd1 drunk_visvesvaraya big_heisenberg
CONTAINER            CPU %   MEM USAGE / LIMIT
fervent_panini       0.00%   56KiB / 15.57GiB
5acfcb1b4fd1         0.07%   32.86MiB / 15.57GiB
drunk_visvesvaraya   0.00%   0B / 0B
big_heisenberg       0.00%   0B / 0B
The problem with utilizing docker stats is that the statistics for a docker container are destroyed upon completion of the docker command. To track the docker stats, the verdi worker would need to (a minimal sketch follows this list):

1. Create a separate thread to run docker stats (or similar functionality using the ephemeral cgroups files) concurrently with the execution of the docker command.
2. Record the stream of metrics returned by #1.
3. Aggregate and dump the metrics after the docker container execution is completed and destroyed.
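A minimal sketch of this approach, assuming a container started with a known name (the container name, image, and sampling interval here are illustrative; this is not verdi's actual implementation):

Code Block:
# Hypothetical sketch of steps 1-3 above; not verdi's actual code.
# The container name "stats_demo" is chosen purely for illustration.
docker run -d --name stats_demo hysds/pge-base:latest sleep 30

# (1) sample docker stats concurrently while the container runs,
# (2) recording each sample to a log file
while [ -n "$(docker ps -q --filter name=stats_demo)" ]; do
  docker stats --no-stream --format "{{.CPUPerc}} {{.MemUsage}}" stats_demo >> stats.log
  sleep 1
done

# (3) aggregate after the container exits, e.g. report the peak CPU % observed
awk '{gsub("%", "", $1); if ($1 + 0 > max) max = $1 + 0} END {print "peak CPU %:", max + 0}' stats.log
docker rm stats_demo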
Per https://www.datadoghq.com/blog/how-to-collect-docker-metrics/, an alternative to extracting the docker statistics of a container's execution is to retrieve them from the pseudo-files available under /sys/fs/cgroup within the container. With this method, the docker container can itself dump out the metrics collected from the pseudo-files prior to exiting. To enable this, a shim must be installed in the docker container and invoked by the entrypoint that executes the docker command.
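For illustration only, a heavily simplified sketch of what such an entrypoint shim could look like; this is not the actual /entrypoint-pge-with-stats.sh, and it assumes cgroups v1 pseudo-files mounted at /sys/fs/cgroup inside the container:

Code Block:
#!/bin/bash
# Simplified illustrative shim; the real entrypoint script differs.
# Run the wrapped command, then dump cgroup metrics before the container exits.
"$@"
status=$?
cat > _docker_stats.json <<EOF
{
  "cpu_total_usage": $(cat /sys/fs/cgroup/cpuacct/cpuacct.usage 2>/dev/null || echo 0),
  "memory_max_usage": $(cat /sys/fs/cgroup/memory/memory.max_usage_in_bytes 2>/dev/null || echo 0)
}
EOF
exit $status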
As of HySDS framework v3.0.0-rc.6, the shim and its enabling docker entrypoint script are available within the hysds/pge-base and hysds/cuda-pge-base docker images. The following example shows how to use the shim, along with an example of the _docker_stats.json file that is dumped and contains the docker stats:
Code Block:
$ mkdir /tmp/test
$ cd /tmp/test
$ docker pull hysds/pge-base:latest
latest: Pulling from hysds/pge-base
Digest: sha256:b972b059185d1f2517754eaa87ecc70f6307ca3dec288c5625806d2b8953e87c
Status: Image is up to date for hysds/pge-base:latest
$ ls -al
total 4
drwxrwxr-x   2 hysdsops hysdsops    6 Dec  9 23:50 .
drwxrwxrwt. 12 root     root     4096 Dec  9 23:50 ..
$ docker run --rm -ti -u $ID:$(id -g) -v $(pwd):/home/ops/test -w /home/ops/test hysds/pge-base:latest sleep 5
$ ls -al
total 4
drwxrwxr-x   2 hysdsops hysdsops    6 Dec  9 23:50 .
drwxrwxrwt. 12 root     root     4096 Dec  9 23:50 ..
$ docker run --rm -ti -u $ID:$(id -g) -v $(pwd):/home/ops/test \
    -w /home/ops/test --entrypoint "/entrypoint-pge-with-stats.sh" \
    hysds/pge-base:latest sleep 5
$ ls -al
total 8
drwxrwxr-x   2 hysdsops hysdsops   32 Dec  9 23:50 .
drwxrwxrwt. 12 root     root     4096 Dec  9 23:50 ..
-rw-r--r--   1 root     hysdsops 3006 Dec  9 23:50 _docker_stats.json
$ cat _docker_stats.json
{
  "wall_time": 5001177284,
  "user_cpu_time": 846000,
  "sys_cpu_time": 0,
  "cgroups": {
    "cpu_stats": {
      "cpu_usage": {
        "total_usage": 162841171,
        "percpu_usage": [
          61628201,
          31858986,
          29914333,
          39453546
        ],
        "usage_in_kernelmode": 70000000,
        "usage_in_usermode": 60000000
      },
      "throttling_data": {}
    },
    "memory_stats": {
      "cache": 335872,
      "usage": {
        "usage": 1433600,
        "max_usage": 5177344,
        "failcnt": 0,
        "limit": 9223372036854771712
      },
      "swap_usage": {
        "usage": 1433600,
        "max_usage": 5177344,
        "failcnt": 0,
        "limit": 9223372036854771712
      },
      "kernel_usage": {
        "failcnt": 0,
        "limit": 9223372036854771712
      },
      "kernel_tcp_usage": {
        "failcnt": 0,
        "limit": 9223372036854771712
      },
      "stats": {
        "active_anon": 1019904,
        "active_file": 28672,
        "cache": 335872,
        "hierarchical_memory_limit": 9223372036854771712,
        "hierarchical_memsw_limit": 9223372036854771712,
        "inactive_anon": 0,
        "inactive_file": 307200,
        "mapped_file": 0,
        "pgfault": 19885,
        "pgmajfault": 0,
        "pgpgin": 5280,
        "pgpgout": 4944,
        "rss": 1040384,
        "rss_huge": 0,
        "swap": 0,
        "total_active_anon": 1019904,
        "total_active_file": 28672,
        "total_cache": 335872,
        "total_inactive_anon": 0,
        "total_inactive_file": 307200,
        "total_mapped_file": 0,
        "total_pgfault": 19885,
        "total_pgmajfault": 0,
        "total_pgpgin": 5280,
        "total_pgpgout": 4944,
        "total_rss": 1040384,
        "total_rss_huge": 0,
        "total_swap": 0,
        "total_unevictable": 0,
        "unevictable": 0
      }
    },
    "pids_stats": {
      "current": 7
    },
    "blkio_stats": {
      "io_service_bytes_recursive": [
        { "major": 202, "op": "Read" },
        { "major": 202, "op": "Write", "value": 461312 },
        { "major": 202, "op": "Sync", "value": 461312 },
        { "major": 202, "op": "Async" },
        { "major": 202, "op": "Total", "value": 461312 }
      ],
      "io_serviced_recursive": [
        { "major": 202, "op": "Read" },
        { "major": 202, "op": "Write", "value": 41 },
        { "major": 202, "op": "Sync", "value": 41 },
        { "major": 202, "op": "Async" },
        { "major": 202, "op": "Total", "value": 41 }
      ]
    },
    "hugetlb_stats": {
      "1GB": { "failcnt": 0 },
      "2MB": { "failcnt": 0 }
    }
  }
}
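Once the _docker_stats.json file has been dumped to the work directory, individual metrics can be pulled out with jq (assuming jq is installed); the paths below follow the JSON structure shown above:

Code Block:
# total CPU usage of the container, in nanoseconds
$ jq '.cgroups.cpu_stats.cpu_usage.total_usage' _docker_stats.json
162841171

# maximum memory usage of the container, in bytes
$ jq '.cgroups.memory_stats.usage.max_usage' _docker_stats.json
5177344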
Warning:
NOTE: the output of the …
Requirements
Configured HySDS Cluster (HySDS framework v3.0.0-rc.6 or greater)
Setup
To enable the dumping of the _docker_stats.json file for a set of HySDS jobs under a repo, edit the docker/Dockerfile so that:

1. the FROM image (base image) is either hysds/pge-base:latest or hysds/cuda-pge-base:latest
2. the ENTRYPOINT is set to /entrypoint-pge-with-stats.sh

The following example shows the docker/Dockerfile for the HySDS core repo lightweight-jobs:

Code Block:
FROM hysds/pge-base:latest

MAINTAINER malarout "Namrata.Malarout@jpl.nasa.gov"
LABEL description="Lightweight System Jobs"

# provision lightweight-jobs PGE
USER ops
COPY . /home/ops/verdi/ops/lightweight-jobs

# set entrypoint
ENTRYPOINT ["/entrypoint-pge-with-stats.sh"]

WORKDIR /home/ops
CMD ["/bin/bash", "--login"]
Rebuild and redeploy the docker container using your CI instance.
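If you want to sanity-check the image before pushing the change through CI, a manual build and push might look like the following sketch (the registry name and tag are placeholders; your CI instance normally performs these steps for you):

Code Block:
# hypothetical manual build/push from the repo root
$ docker build -f docker/Dockerfile -t <your-registry>/lightweight-jobs:latest .
$ docker push <your-registry>/lightweight-jobs:latest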
You're done. Because your HySDS cluster was installed using v3.0.0-rc.6 or greater:

- the verdi job worker will detect the existence of _docker_stats.json files in the job work directory and will publish the stats with the job status on mozart and with the job metrics info on metrics
- your Kibana job metrics dashboard will show visualizations of total CPU usage and max memory usage for each job type
Source: https://www.datadoghq.com/blog/how-to-collect-docker-metrics/
Docker exposes metrics via three mechanisms: pseudo-files in sysfs, the stats command, and the API. Metrics coverage across these three mechanisms is uneven.
Pseudo-files
Docker metrics reported via pseudo-files in sysfs by default do not require privileged (root) access. They are also the fastest and most lightweight way to read metrics; if you are monitoring many containers per host, speed may become a requirement. However, you cannot collect all metrics from pseudo-files; in particular, there may be limitations on I/O and network metrics.
Pseudo-file location
This article assumes your metrics pseudo-files are located in /sys/fs/cgroup in the host OS. On some systems, they may be in /cgroup instead.
Your pseudo-file access path includes the long ID of your container. For illustration purposes, this article assumes that you have set an environment variable CONTAINER_ID to the long ID of the container you are monitoring. If you'd like to copy-paste run commands from this article, you can set CONTAINER_ID like this: CONTAINER_ID=$(docker run [OPTIONS] IMAGE [COMMAND] [ARG...]). Alternatively, you can save it after launching: run docker ps --no-trunc, then copy-paste and save the long ID as an env variable like CONTAINER_ID=<long ID>.
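For example, one way to capture the long ID without copy-pasting is to filter docker ps by image; the image name used in this filter is just an illustration:

Code Block:
# grab the full (long) container ID of the most recent container
# started from a given image; adjust the filter to your own image
$ CONTAINER_ID=$(docker ps --no-trunc -q --filter ancestor=hysds/pge-base:latest | head -1)
$ echo $CONTAINER_ID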
CPU pseudo-files
CPU metrics are reported under the cpu and cpuacct (CPU accounting) cgroups.
OS-specific metric paths
In the commands below, we use the metric directory for standard Linux systems (/sys/fs/cgroup/cpuacct/docker/$CONTAINER_ID/).
Usage
Code Block:
$ cat /sys/fs/cgroup/cpuacct/docker/$CONTAINER_ID/cpuacct.stat
> user 2451    # time spent running processes since boot
> system 966   # time spent executing system calls since boot
If you're using an x86 system, the times above are expressed in 10-millisecond increments, so the recently booted container above has spent 24.51s running user processes and 9.66s on system calls. (Technically, the times are expressed in user jiffies.)
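For example, those jiffy counts can be converted to seconds directly, assuming USER_HZ=100 (100 jiffies per second, as on x86):

Code Block:
# divide each jiffy count by 100 to get seconds
$ awk '{printf "%s %.2fs\n", $1, $2 / 100}' /sys/fs/cgroup/cpuacct/docker/$CONTAINER_ID/cpuacct.stat
user 24.51s
system 9.66s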
CPU Usage per core
Per-CPU usage can help you identify core imbalances, which can be caused by bad configuration.
Code Block:
$ cat /sys/fs/cgroup/cpuacct/docker/$CONTAINER_ID/cpuacct.usage_percpu
> 45094018900  # nanoseconds CPU has been in use since boot (45.09s)
If your container is using multiple CPU cores and you want a convenient total usage number, you can run:
If your container is using multiple CPU cores and you want a convenient total usage number, you can run:

Code Block:
$ cat /sys/fs/cgroup/cpuacct/docker/$CONTAINER_ID/cpuacct.usage
> 45094018900  # total nanoseconds CPUs have been in use (45.09s)
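Since cpuacct.usage is a cumulative counter, the container's current CPU utilization can be estimated from two samples taken a known interval apart; a minimal sketch:

Code Block:
# sample the cumulative nanosecond counter twice, 1 second apart;
# the delta divided by 1e9 is the average number of CPU cores in use
$ T1=$(cat /sys/fs/cgroup/cpuacct/docker/$CONTAINER_ID/cpuacct.usage)
$ sleep 1
$ T2=$(cat /sys/fs/cgroup/cpuacct/docker/$CONTAINER_ID/cpuacct.usage)
$ echo "$T1 $T2" | awk '{printf "avg CPU cores in use: %.2f\n", ($2 - $1) / 1e9}'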
Throttled CPU
If you set a limit on the CPU time available to a container with a CPU quota constraint, your container will be throttled when it attempts to exceed the limit.
Code Block:
$ cat /sys/fs/cgroup/cpu/docker/$CONTAINER_ID/cpu.stat
> nr_periods 565             # Number of enforcement intervals that have elapsed
> nr_throttled 559           # Number of times the group has been throttled
> throttled_time 12119585961 # Total time that members of the group were throttled, in nanoseconds (12.12 seconds)
Memory pseudo-files
The following command prints a lot of information about memory usage, probably more than you need. Note that the measures in the first half of the list have no prefix and exclude sub-cgroups, while the measures in the second half are all prefixed with "total_" and include sub-cgroups.
Code Block:
$ cat /sys/fs/cgroup/memory/docker/$CONTAINER_ID/memory.stat
cache 532480
rss 10649600
mapped_file 1576960
writeback 0
swap 0
pgpgin 302242
pgpgout 296556
pgfault 1142200
pgmajfault 125
inactive_anon 16384
active_anon 577536
inactive_file 11386880
active_file 11309056
unevictable 0
hierarchical_memory_limit 18446744073709551615
hierarchical_memsw_limit 18446744073709551615
total_cache 22798336
total_rss 491520
total_rss_huge 0
total_mapped_file 1576960
total_writeback 0
total_swap 0
total_pgpgin 302242
total_pgpgout 296556
total_pgfault 1142200
total_pgmajfault 125
total_inactive_anon 16384
total_active_anon 577536
total_inactive_file 11386880
total_active_file 11309056
total_unevictable 0
You can get the most interesting memory metrics directly by reading specific pseudo-files in the /sys/fs/cgroup/memory/docker/$CONTAINER_ID/ directory:
Code Block:
# Total memory used: cached + rss
$ cat /sys/fs/cgroup/memory/docker/$CONTAINER_ID/memory.usage_in_bytes

# Total memory used + swap in use
$ cat /sys/fs/cgroup/memory/docker/$CONTAINER_ID/memory.memsw.usage_in_bytes

# Number of times memory usage hit limits
$ cat /sys/fs/cgroup/memory/docker/$CONTAINER_ID/memory.failcnt

# Memory limit of the cgroup in bytes
$ cat /sys/fs/cgroup/memory/docker/$CONTAINER_ID/memory.limit_in_bytes
Note that if the final command returns a long garbage number like 18446744073709551615, you did not set a limit when you launched the container. To set a 500MB limit, for example:
Code Block:
$ docker run -m 500M IMAGE [COMMAND] [ARG...]
Further information about the memory metrics can be found in the official documentation.
I/O pseudo-files
The path to the I/O stats pseudo-files for most operating systems is /sys/fs/cgroup/blkio/docker/$CONTAINER_ID/.
Depending on your system, you may have many metrics available from these pseudo-files: blkio.io_queued_recursive, blkio.io_service_time_recursive, blkio.io_wait_time_recursive, and more.
On many systems, however, many of these pseudo-files only return zero values. In this case there are usually still two pseudo-files that work: blkio.throttle.io_service_bytes and blkio.throttle.io_serviced, which report total I/O bytes and operations, respectively. Contrary to their names, these numbers do not report throttled I/O but actual I/O bytes and ops.
The first two numbers reported by these pseudo-files are the major:minor device IDs, which uniquely identify a device. Example output from blkio.throttle.io_service_bytes:
Code Block:
253:0 Read 13750272
253:0 Write 180224
253:0 Sync 180224
253:0 Async 13750272
253:0 Total 13930496
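To pull out just the per-device totals from this pseudo-file, a simple awk filter works:

Code Block:
# print "major:minor total_bytes" for each device
$ awk '$2 == "Total" {print $1, $3}' /sys/fs/cgroup/blkio/docker/$CONTAINER_ID/blkio.throttle.io_service_bytes
253:0 13930496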
Network pseudo-files
Docker version 1.6.1 and greater
In release 1.6.1, Docker fixed read/write /proc paths.
Code Block:
$ CONTAINER_PID=`docker inspect -f '{{ .State.Pid }}' $CONTAINER_ID`
$ cat /proc/$CONTAINER_PID/net/dev
Inter-|   Receive                                                |  Transmit
 face |bytes    packets errs drop fifo frame compressed multicast|bytes    packets errs drop fifo colls carrier compressed
  eth0:    1296      16    0    0    0     0          0         0      816      10    0    0    0     0       0          0
    lo:       0       0    0    0    0     0          0         0        0       0    0    0    0     0       0          0
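From that output, the container's received and transmitted byte counts for eth0 can be extracted with awk (in this layout, column 2 is RX bytes and column 10 is TX bytes):

Code Block:
$ cat /proc/$CONTAINER_PID/net/dev | awk '/eth0/ {print "rx_bytes:", $2, "tx_bytes:", $10}'
rx_bytes: 1296 tx_bytes: 816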
Older versions of Docker
You can get network metrics from ip netns, with some symlinking:
Code Block:
$ CONTAINER_PID=`docker inspect -f '{{ .State.Pid }}' $CONTAINER_ID`
$ mkdir -p /var/run/netns
$ ln -sf /proc/$CONTAINER_PID/ns/net /var/run/netns/$CONTAINER_ID
$ ip netns exec $CONTAINER_ID netstat -i
Stats command
The docker stats command will continuously report a live stream of basic CPU, memory, and network metrics. As of version 1.9.0, docker stats also includes disk I/O metrics.
Code Block:
# Usage: docker stats CONTAINER [CONTAINER...]
$ docker stats $CONTAINER_ID
CONTAINER     CPU %   MEM USAGE/LIMIT     MEM %    NET I/O             BLOCK I/O
ecb37227ac84  0.12%   71.53 MiB/490 MiB   14.60%   900.2 MB/275.5 MB   266.8 MB/872.7 MB
CPU stats
CPU is reported as a percentage of total host capacity. So if you have two containers, each using as much CPU as they can and each allocated the same CPU shares by Docker, the stats command would register 50% utilization for each, even though in practice their CPU resources would be fully utilized.
Memory stats
If you do not explicitly set the memory limits for the container, then the memory usage limit will be the memory limit of the host machine. If the host is using memory for other processes, your container will run out of memory before it hits the limit reported by the stats command.
I/O stats
As of Docker version 1.9.0, docker stats displays total bytes read and written.
Network stats
Displays total bytes received (RX) and transmitted (TX).
Requirements
Docker version 1.5.0 (released February 2015) or higher
Exec driver ‘libcontainer’, which has been the default since Docker 0.9.
API
Like the docker stats command, the API will continuously report a live stream of CPU, memory, I/O, and network metrics. The difference is that the API provides far more detail than the stats command.

The daemon listens on unix:///var/run/docker.sock, allowing only local connections by the root user. When you launch Docker, however, you can bind it to another port or socket; instructions and strong warnings are in the official Docker documentation. This article describes how to access the API on the default socket.
You can send commands to the socket with nc. All API calls take this general form:
Code Block:
echo "" | nc -U /var/run/docker.sock
To collect all metrics in a continuously updated live stream of JSON, run:
Code Block:
$ echo -ne "GET /containers/$CONTAINER_ID/stats HTTP/1.1\r\n\r\n" | sudo nc -U /var/run/docker.sock
The response will be long, live-streaming chunks of JSON with metrics about the container. Rather than print an entire example JSON object here, its parts are discussed individually below.
CPU
Code Block:
"cpu_stats": {
  "cpu_usage": {
    "total_usage": 44651120376,
    "percpu_usage": [
      44651120376
    ],
    "usage_in_kernelmode": 9660000000,
    "usage_in_usermode": 24510000000
  },
  "system_cpu_usage": 269321720000000,
  "throttling_data": {
    "periods": 0,
    "throttled_periods": 0,
    "throttled_time": 0
  }
}
system_cpu_usage represents the host's cumulative CPU usage in nanoseconds; this includes user, system, idle, etc. (the sum of the /proc/stat CPU line).
All other CPU metrics can also be accessed through pseudo-files, as described above, with a few differences:

- usage_in_kernelmode is the same as the system CPU usage reported by pseudo-files, although the API expresses this value in nanoseconds rather than 10-millisecond increments. In the example readings in this article, both methods report the same number: 9.66s.
- usage_in_usermode is the same as the user CPU usage reported by pseudo-files. As above, this number is reported in nanoseconds.
Memory
Most of the memory stats available through the API are also available through the pseudo-files, as described in that section above:

- usage is memory.usage_in_bytes
- max_usage is memory.max_usage_in_bytes
- stats is the memory.stat pseudo-file
- limit is the memory limit set on the container (memory.limit_in_bytes) if one is set; otherwise, limit is the host memory limit from /proc/meminfo (MemTotal)
Code Block:
"memory_stats": {
  "usage": 2699264,
  "max_usage": 20971520,
  "stats": {
    "active_anon": 577536,
    "active_file": 0,
    "cache": 2207744,
    "hierarchical_memory_limit": 20971520,
    "hierarchical_memsw_limit": 1.844674407371e+19,
    "inactive_anon": 16384,
    "inactive_file": 2105344,
    "mapped_file": 479232,
    "pgfault": 1821069,
    "pgmajfault": 2398,
    "pgpgin": 507907,
    "pgpgout": 507248,
    "rss": 491520,
    "rss_huge": 0,
    "swap": 0,
    "total_active_anon": 577536,
    "total_active_file": 0,
    "total_cache": 2207744,
    "total_inactive_anon": 16384,
    "total_inactive_file": 2105344,
    "total_mapped_file": 479232,
    "total_pgfault": 1821069,
    "total_pgmajfault": 2398,
    "total_pgpgin": 507907,
    "total_pgpgout": 507248,
    "total_rss": 491520,
    "total_rss_huge": 0,
    "total_swap": 0,
    "total_unevictable": 0,
    "total_writeback": 0,
    "unevictable": 0,
    "writeback": 0
  },
  "failcnt": 24422,
  "limit": 513851392
}
I/O
The API currently reports a count of read, write, sync, and async operations, plus a total count of operations, in blkio_stats.io_serviced_recursive. The total bytes corresponding to those operations are reported in blkio_stats.io_service_bytes_recursive. Depending on your system, other I/O stats may also be reported, or may be disabled (empty). Major and minor IDs uniquely identify a device.
Code Block:
"blkio_stats": {
  "io_service_bytes_recursive": [
    {
      "major": 253,
      "minor": 0,
      "op": "Read",
      "value": 13750272
    },
    {
      "major": 253,
      "minor": 0,
      "op": "Write",
      "value": 12288
    },
    ...
Network
The API is the easiest way to get network metrics for your container. (RX represents “received”, and TX represents “transmitted”.)
Code Block:
"network": {
  "rx_bytes": 197942,
  "rx_packets": 51,
  "rx_errors": 0,
  "rx_dropped": 0,
  "tx_bytes": 3549,
  "tx_packets": 50,
  "tx_errors": 0,
  "tx_dropped": 0
}
Selecting specific Docker metrics
By sending output from the API to grep (to throw out non-JSON rows) and then to jq for JSON parsing, we can create a stream of selected metrics. Some examples are below.
CPU stats
Code Block:
$ echo -ne "GET /containers/$CONTAINER_ID/stats HTTP/1.1\r\n\r\n" | nc -U /var/run/docker.sock | grep "^{" | jq '.cpu_stats'
IO bytes written
Code Block:
$ echo -ne "GET /containers/$CONTAINER_ID/stats HTTP/1.1\r\n\r\n" | nc -U /var/run/docker.sock | grep "^{" | jq '.blkio_stats.io_service_bytes_recursive | .[1].value'
Network bytes received
Code Block:
$ echo -ne "GET /containers/$CONTAINER_ID/stats HTTP/1.1\r\n\r\n" | nc -U /var/run/docker.sock | grep "^{" | jq '.network.rx_bytes'
API requirements
Same as the stats command, above.
Additional API calls
Other useful Docker API calls are documented in the official Docker API reference. You can call them using nc as described above.