File: http_services.rst

package info (click to toggle)
dask.distributed 2022.12.1%2Bds.1-3
  • links: PTS, VCS
  • area: main
  • in suites: bookworm
  • size: 10,164 kB
  • sloc: python: 81,938; javascript: 1,549; makefile: 228; sh: 100
file content (108 lines) | stat: -rw-r--r-- 4,016 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
HTTP endpoints
==============

A subset of the following pages will be available from the scheduler or
workers of a running cluster. The list of currently available endpoints can
be found by examining ``/sitemap.json``.


Main dashboard links
--------------------


Dynamic bokeh pages. The root redirects to /status, and each page links to the
others via a header navbar.

- ``/status``: entry point to the dashboards, shows cluster-wide memory and tasks
- ``/workers``: currently connected workers and their CPU/memory usage
- ``/tasks``: task block view with longer look-back than on /status
- ``/system``: global stats for the cluster, CPU, memory, bandwidth, file descriptors
- ``/profile``: flamegraph statistical profiling across the cluster
- ``/graph``: currently processing graphs in a dependency tree view
- ``/groups``: graph layout for task groups (dependencies, memory, output type, progress, tasks status)
- ``/info``: redirect to ``/info/main/workers.html``
- ``/hardware``: gathers bandwidth information on memory, disk, and network

Scheduler HTTP
--------------

Pages and JSON endpoints served by the scheduler

- ``/health``: check server is alive
- ``/info/main/workers.html`` basic info about workers and links to their dashboards and logs
- ``info/worker/(worker).html``: more detail about given worker, keyed by TCP address; links to tasks
- ``info/task/(task).html``: details about a task on the cluster, by dask key; links to worker,
  related tasks, and client
- ``/info/call-stacks/(worker).html``: tasks currently handled by given worker
- ``/info/call-stack/(task).html``: state of task (where it is running)
- ``/info/main/logs.html``: scheduler logs
- ``/info/logs/(worker).html``: logs of given worker
- ``/individual-plots.json``: map of path to description for available individual
  (i.e,. one-pane, non-dashboard) plots
- ``/eventstream``: scheduler events on a websocket
- ``/proxy/(port)/(address)/(path)``: proxy to worker HTTP locations (if you have jupyter-server-proxy)
- ``/metrics``: prometheus endpoint
- ``/json/counts.json``: cluster count stats
- ``/json/identity.json``: scheduler information
- ``/json/index.html``: link to the above two endpoints
- ``/sitemap.json``: list of available endpoints
- ``/statics/()``: static file content (CSS, etc)
- ``/stealing``: worker occupancy metrics, to evaluate task stealing

Scheduler API
-------------

Scheduler methods exposed by the API with an example of the request body they take

- ``/api/v1/retire_workers`` : retire certain workers on the scheduler

.. code-block:: json

    {
        "workers":["tcp://127.0.0.1:53741", "tcp://127.0.0.1:53669"]
    }

- ``/api/v1/get_workers`` : get all workers on the scheduler
- ``/api/v1/adaptive_target`` : get the target number of workers based on the scheduler's load 

Individual bokeh plots
----------------------

- ``/individual-task-stream``
- ``/individual-progress``
- ``/individual-graph``
- ``/individual-groups``
- ``/individual-profile``
- ``/individual-profile-server``
- ``/individual-workers-memory``
- ``/individual-cluster-memory``
- ``/individual-cpu``
- ``/individual-nprocessing``
- ``/individual-occupancy``
- ``/individual-workers``
- ``/individual-exceptions``
- ``/individual-bandwidth-types``
- ``/individual-bandwidth-workers``
- ``/individual-workers-network``
- ``/individual-workers-disk``
- ``/individual-workers-network-timeseries``
- ``/individual-workers-cpu-timeseries``
- ``/individual-workers-memory-timeseries``
- ``/individual-workers-disk-timeseries``
- ``/individual-memory-by-key``
- ``/individual-compute-time-per-key``
- ``/individual-aggregate-time-per-action``
- ``/individual-scheduler-system``
- ``/individual-gpu-memory`` (GPU only)
- ``/individual-gpu-utilization`` (GPU only)

Worker HTTP
-----------

- ``/status``:
- ``/counters``:
- ``/sitemap.json``: list of available endpoints
- ``/system``:
- ``/health``: check server is alive
- ``/metrics``: prometheus endpoint
- ``/statics/()``: static file content (CSS, etc)