docs: improve documentation
[poolifier.git] / docs / worker-choice-strategies.md
1 # Worker choice strategies
2
3 ## Table of contents
4
5 - [Strategies](#strategies)
6 - [Fair share](#fair-share)
7 - [Weighted round robin](#weighted-round-robin)
8 - [Interleaved weighted round robin](#interleaved-weighted-round-robin)
9 - [Statistics](#statistics)
10 - [Median](#median)
11
12 ## Strategies
13
14 All duration or timestamp are expressed in milliseconds.
15
16 ### Fair share
17
18 Its goal is to distribute the load evenly across all workers. To achieve this, the strategy keeps track of the average task execution time for each worker and assigns the next task to the worker with the lowest task end prediction time: `task_end_prediction = max(current_time, task_end_prediction) + average_task_execution_time`.
19 By default, the strategy uses the average task execution time for each worker but it can be configured to use the average task event loop utilization (ELU) active time instead.
20
21 ### Weighted round robin
22
23 The worker weights are maximum tasks execution time, once the worker has reached its maximum tasks execution time, the next task is assigned to the next worker. The worker default weights are the same for all workers and is computed given the CPU cores speed and theirs numbers.
24
25 ### Interleaved weighted round robin
26
27 The worker weights are maximum tasks execution time. The rounds are the deduplicated worker weights.
28 During a round, if worker weight is inferior to the current round, the next task is assigned to the next worker. Once all workers have been assigned a task, the next round starts.
29 The worker default weights are the same for all workers and is computed given the CPU cores speed and theirs numbers. So the default rounds consists of a unique worker weight.
30
31 ## Statistics
32
33 Worker choice strategies enable only the statistics that are needed to choose the next worker to avoid unnecessary overhead.
34
35 ### Median
36
37 Strategies using the average task execution time for each worker can use the median instead. Median is more robust to outliers and can be used to avoid assigning tasks to workers that are currently overloaded. Median usage introduces a small overhead: measurement history must be kept for each worker and the median must be recomputed each time a task has finished.