<div align="center">
<img src="./images/logo.png" width="340px" height="266px"/>
</div>

<h2 align="center">Node Thread Pool and Cluster Pool :arrow_double_up: :on:</h2>

<p align="center">
<a href="https://github.com/poolifier/poolifier/graphs/commit-activity">
<img alt="GitHub commit activity (master)" src="https://img.shields.io/github/commit-activity/m/poolifier/poolifier/master"></a>
<a href="https://www.npmjs.com/package/poolifier">
<img alt="Weekly Downloads" src="https://img.shields.io/npm/dw/poolifier"></a>
<a href="https://github.com/poolifier/poolifier/actions/workflows/ci.yml">
<img alt="Actions Status" src="https://github.com/poolifier/poolifier/actions/workflows/ci.yml/badge.svg"></a>
<a href="https://sonarcloud.io/dashboard?id=pioardi_poolifier">
<img alt="Code Coverage" src="https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=coverage"></a>
<a href="https://sonarcloud.io/dashboard?id=pioardi_poolifier">
<img alt="Quality Gate Status" src="https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=alert_status"></a>
<a href="https://standardjs.com">
<img alt="Javascript Standard Style Guide" src="https://img.shields.io/badge/code_style-standard-brightgreen.svg"></a>
<a href="https://gitter.im/poolifier/community?utm_source=badge&utm_medium=badge&utm_campaign=pr-badge">
<img alt="Gitter chat" src="https://badges.gitter.im/poolifier/community.svg"></a>
<a href="https://opencollective.com/poolifier">
<img alt="Open Collective" src="https://opencollective.com/poolifier/tiers/badge.svg"></a>
<a href="http://makeapullrequest.com">
<img alt="PR Welcome" src="https://img.shields.io/badge/PRs-welcome-brightgreen.svg?style=flat-square"></a>
<a href="https://img.shields.io/static/v1?label=dependencies&message=no%20dependencies&color=brightgreen">
<img alt="No dependencies" src="https://img.shields.io/static/v1?label=dependencies&message=no%20dependencies&color=brightgreen"></a>
</p>

## Why Poolifier?

Poolifier is used to perform CPU and/or I/O intensive tasks on Node.js servers. It implements worker pools using the [worker_threads](https://nodejs.org/api/worker_threads.html) and [cluster](https://nodejs.org/api/cluster.html) Node.js modules.
With poolifier you can improve your **performance** and resolve problems related to the event loop.
Moreover you can execute your tasks using an API designed to improve the **developer experience**.
Please consult our [general guidelines](#general-guidance).

- Easy to use :white_check_mark:
- Performance [benchmarks](./benchmarks/README.md) :white_check_mark:
- Fixed and dynamic pool size :white_check_mark:
- Easy switch from one pool type to another :white_check_mark:
- No runtime dependencies :white_check_mark:
- Proper integration with node [async_hooks](https://nodejs.org/api/async_hooks.html) :white_check_mark:
- Support CommonJS, ESM, and TypeScript :white_check_mark:
- Support for [worker_threads](https://nodejs.org/api/worker_threads.html) and [cluster](https://nodejs.org/api/cluster.html) Node.js modules :white_check_mark:
- Support multiple task functions :white_check_mark:
- Support sync and async task functions :white_check_mark:
- Tasks distribution strategies :white_check_mark:
- General guidance on pool choice :white_check_mark:
- Error handling out of the box :white_check_mark:
- Widely tested :white_check_mark:
- Active community :white_check_mark:
- Code quality [![Bugs](https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=bugs)](https://sonarcloud.io/dashboard?id=pioardi_poolifier)
  [![Code Smells](https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=code_smells)](https://sonarcloud.io/dashboard?id=pioardi_poolifier)
  [![Duplicated Lines (%)](https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=duplicated_lines_density)](https://sonarcloud.io/dashboard?id=pioardi_poolifier)
  [![Maintainability Rating](https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=sqale_rating)](https://sonarcloud.io/dashboard?id=pioardi_poolifier)
  [![Reliability Rating](https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=reliability_rating)](https://sonarcloud.io/dashboard?id=pioardi_poolifier)
  [![Technical Debt](https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=sqale_index)](https://sonarcloud.io/dashboard?id=pioardi_poolifier)
- Code security [![Security Rating](https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=security_rating)](https://sonarcloud.io/dashboard?id=pioardi_poolifier) [![Vulnerabilities](https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=vulnerabilities)](https://sonarcloud.io/dashboard?id=pioardi_poolifier)

## Contents

<h3 align="center">
<a href="#overview">Overview</a>
<span> · </span>
<a href="#installation">Installation</a>
<span> · </span>
<a href="#usage">Usage</a>
<span> · </span>
<a href="#node-versions">Node versions</a>
<span> · </span>
<a href="#api">API</a>
<span> · </span>
<a href="#general-guidance">General guidance</a>
<span> · </span>
<a href="#contribute">Contribute</a>
<span> · </span>
<a href="#team">Team</a>
<span> · </span>
<a href="#license">License</a>
</h3>

## Overview

Poolifier contains two [worker_threads](https://nodejs.org/api/worker_threads.html#class-worker)/[cluster](https://nodejs.org/api/cluster.html#cluster_class_worker) worker pool implementations, so you don't have to deal with [worker_threads](https://nodejs.org/api/worker_threads.html)/[cluster](https://nodejs.org/api/cluster.html) complexity.
The first implementation is a fixed worker pool, with a defined number of workers that are started at creation time and reused.
The second implementation is a dynamic worker pool: a number of workers are started at creation time (these workers are always active and reused), and additional workers are created when the load increases, up to an upper limit (these workers are reused while active). The newly created workers are stopped after a configurable period of inactivity.
You have to implement your worker by extending the _ThreadWorker_ or _ClusterWorker_ class.
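
To make the dynamic pool behaviour above more concrete, here is a minimal sketch of the sizing decisions it describes. It is not poolifier's implementation: `needsNewWorker`, `shouldStopWorker` and their parameter names are hypothetical and only mirror the wording of this section.

```js
'use strict'

// Hypothetical helper, not part of the poolifier API: should the pool start
// a new worker before handling the next task?
function needsNewWorker({ min, max, workers, busy }) {
  // the minimum number of workers is always kept alive
  if (workers < min) return true
  // grow only when every worker is busy and the upper limit is not reached
  return busy >= workers && workers < max
}

// Hypothetical helper: a worker created on top of `min` is stopped once it
// has been idle for longer than the configured inactivity period.
function shouldStopWorker({ min, workers, idleMs, maxInactiveTime }) {
  return workers > min && idleMs > maxInactiveTime
}

console.info(needsNewWorker({ min: 2, max: 4, workers: 2, busy: 2 })) // true
console.info(needsNewWorker({ min: 2, max: 4, workers: 4, busy: 4 })) // false
console.info(shouldStopWorker({ min: 2, workers: 3, idleMs: 61000, maxInactiveTime: 60000 })) // true
```

A fixed pool corresponds to the case where `min` equals `max`, so no worker is ever added or stopped.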

## Installation

```shell
npm install poolifier --save
```

## Usage

You can implement a [worker_threads](https://nodejs.org/api/worker_threads.html#class-worker) worker in a simple way by extending the class _ThreadWorker_:

```js
'use strict'
const { ThreadWorker } = require('poolifier')

function yourFunction(data) {
  // this will be executed in the worker thread,
  // the data will be received by using the execute method
  return { ok: 1 }
}

module.exports = new ThreadWorker(yourFunction, {
  maxInactiveTime: 60000
})
```

Instantiate your pool based on your needs:

```js
'use strict'
const { DynamicThreadPool, FixedThreadPool, PoolEvents, availableParallelism } = require('poolifier')

// a fixed worker_threads pool
const fixedPool = new FixedThreadPool(availableParallelism(), './yourWorker.js', {
  errorHandler: e => console.error(e),
  onlineHandler: () => console.info('worker is online')
})

fixedPool.emitter.on(PoolEvents.ready, () => console.info('Pool is ready'))
fixedPool.emitter.on(PoolEvents.busy, () => console.info('Pool is busy'))

// or a dynamic worker_threads pool
const dynamicPool = new DynamicThreadPool(Math.floor(availableParallelism() / 2), availableParallelism(), './yourWorker.js', {
  errorHandler: e => console.error(e),
  onlineHandler: () => console.info('worker is online')
})

dynamicPool.emitter.on(PoolEvents.full, () => console.info('Pool is full'))
dynamicPool.emitter.on(PoolEvents.ready, () => console.info('Pool is ready'))
dynamicPool.emitter.on(PoolEvents.busy, () => console.info('Pool is busy'))

// the execute method signature is the same for both implementations,
// so you can easily switch from one to another
const pool = fixedPool // or dynamicPool
pool
  .execute()
  .then(res => {
    console.info(res)
  })
  .catch(err => {
    console.error(err)
  })
```

You can do the same with the classes _ClusterWorker_, _FixedClusterPool_ and _DynamicClusterPool_.

**See [examples](./examples/) folder for more details (in particular if you want to use a pool with [multiple task functions](./examples/multiFunctionExample.js))**.

Remember that workers can only send and receive structured-cloneable data.
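
One way to check whether a value is structured-cloneable before submitting it is sketched below. `isStructuredCloneable` is a hypothetical helper, not part of poolifier, and it relies on the `structuredClone` global, which is only available from Node.js 17 onwards.

```js
'use strict'

// Hypothetical helper (not part of poolifier): returns true when `value`
// survives the structured clone algorithm, false otherwise.
// Requires the `structuredClone` global (Node.js >= 17).
function isStructuredCloneable(value) {
  try {
    structuredClone(value)
    return true
  } catch {
    return false
  }
}

// plain objects, dates and sets can be cloned
console.info(isStructuredCloneable({ id: 1, when: new Date(), tags: new Set(['a']) })) // true
// functions cannot be cloned
console.info(isStructuredCloneable({ callback: () => {} })) // false
```

Note that cloning copies the whole value, so this check is best kept for debugging rather than for every task submission.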

## Node versions

Node versions >= 16.14.x are supported.

## [API](https://poolifier.github.io/poolifier/)

### `pool = new FixedThreadPool/FixedClusterPool(numberOfThreads/numberOfWorkers, filePath, opts)`

`numberOfThreads/numberOfWorkers` (mandatory) Number of workers for this pool  
`filePath` (mandatory) Path to a file with a worker implementation  
`opts` (optional) An object with the pool options properties described below

### `pool = new DynamicThreadPool/DynamicClusterPool(min, max, filePath, opts)`

`min` (mandatory) Same as _FixedThreadPool_/_FixedClusterPool_ numberOfThreads/numberOfWorkers, this number of workers will be always active  
`max` (mandatory) Max number of workers that this pool can contain, the newly created workers will die after a threshold (default is 1 minute, you can override it in your worker implementation).  
`filePath` (mandatory) Path to a file with a worker implementation  
`opts` (optional) An object with the pool options properties described below

### `pool.execute(data, name)`

`data` (optional) An object that you want to pass to your worker implementation  
`name` (optional) A string with the task function name that you want to execute on the worker. Default: `'default'`

This method is available on both pool implementations and returns a promise with the task function execution response.
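
The role of the `name` argument can be illustrated without a pool. The sketch below is a local stand-in, not poolifier code: `taskFunctions` and `runTask` are hypothetical, and the lookup only mimics the documented behaviour (resolve the task function by name, fall back to `'default'`, return a promise).

```js
'use strict'

// Hypothetical task functions object, in the
// `{ name_1: fn_1, ..., name_n: fn_n }` shape.
const taskFunctions = {
  default: data => ({ echoed: data }),
  double: data => ({ value: data.value * 2 })
}

// Hypothetical stand-in for the worker-side lookup: resolves the task
// function by name, falls back to 'default' when no name is given, and
// always returns a promise so sync and async functions behave the same.
function runTask(data, name = 'default') {
  if (!Object.hasOwn(taskFunctions, name)) {
    return Promise.reject(new Error(`Task function '${name}' not found`))
  }
  return Promise.resolve().then(() => taskFunctions[name](data))
}

runTask({ value: 21 }, 'double').then(res => console.info(res)) // { value: 42 }
runTask({ value: 21 }).then(res => console.info(res)) // { echoed: { value: 21 } }
```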

### `pool.destroy()`

This method is available on both pool implementations and will call the terminate method on each worker.

### `PoolOptions`

An object with these properties:

- `messageHandler` (optional) - A function that will listen for message event on each worker
- `errorHandler` (optional) - A function that will listen for error event on each worker
- `onlineHandler` (optional) - A function that will listen for online event on each worker
- `exitHandler` (optional) - A function that will listen for exit event on each worker
- `workerChoiceStrategy` (optional) - The worker choice strategy to use in this pool:

  - `WorkerChoiceStrategies.ROUND_ROBIN`: Submit tasks to worker in a round robin fashion
  - `WorkerChoiceStrategies.LEAST_USED`: Submit tasks to the worker with the minimum number of executed, executing and queued tasks
  - `WorkerChoiceStrategies.LEAST_BUSY`: Submit tasks to the worker with the minimum tasks total execution and wait time
  - `WorkerChoiceStrategies.LEAST_ELU`: Submit tasks to the worker with the minimum event loop utilization (ELU) (experimental)
  - `WorkerChoiceStrategies.WEIGHTED_ROUND_ROBIN`: Submit tasks to worker by using a [weighted round robin scheduling algorithm](./src/pools/selection-strategies/README.md#weighted-round-robin) based on tasks execution time
  - `WorkerChoiceStrategies.INTERLEAVED_WEIGHTED_ROUND_ROBIN`: Submit tasks to worker by using an [interleaved weighted round robin scheduling algorithm](./src/pools/selection-strategies/README.md#interleaved-weighted-round-robin) based on tasks execution time (experimental)
  - `WorkerChoiceStrategies.FAIR_SHARE`: Submit tasks to worker by using a [fair share scheduling algorithm](./src/pools/selection-strategies/README.md#fair-share) based on tasks execution time (the default) or ELU active time

  `WorkerChoiceStrategies.WEIGHTED_ROUND_ROBIN`, `WorkerChoiceStrategies.INTERLEAVED_WEIGHTED_ROUND_ROBIN` and `WorkerChoiceStrategies.FAIR_SHARE` strategies are targeted to heavy and long tasks.  
  Default: `WorkerChoiceStrategies.ROUND_ROBIN`

- `workerChoiceStrategyOptions` (optional) - The worker choice strategy options object to use in this pool.  
  Properties:

  - `measurement` (optional) - The measurement to use in worker choice strategies: `runTime`, `waitTime` or `elu`.
  - `runTime` (optional) - Use the tasks [median](./src/pools/selection-strategies/README.md#median) runtime instead of the tasks average runtime in worker choice strategies.
  - `waitTime` (optional) - Use the tasks [median](./src/pools/selection-strategies/README.md#median) wait time instead of the tasks average wait time in worker choice strategies.
  - `elu` (optional) - Use the tasks [median](./src/pools/selection-strategies/README.md#median) ELU instead of the tasks average ELU in worker choice strategies.
  - `weights` (optional) - The worker weights to use in weighted round robin worker choice strategies: `{ 0: 200, 1: 300, ..., n: 100 }`.

  Default: `{ runTime: { median: false }, waitTime: { median: false }, elu: { median: false } }`

- `restartWorkerOnError` (optional) - Restart worker on uncaught error in this pool.  
  Default: `true`
- `enableEvents` (optional) - Events emission enablement in this pool.  
  Default: `true`
- `enableTasksQueue` (optional) - Tasks queue per worker enablement in this pool.  
  Default: `false`

- `tasksQueueOptions` (optional) - The worker tasks queue options object to use in this pool.  
  Properties:

  - `concurrency` (optional) - The maximum number of tasks that can be executed concurrently on a worker.

  Default: `{ concurrency: 1 }`
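
As an illustration of the first two worker choice strategies listed above, here is a minimal sketch of round robin and least used selection. It is not poolifier's implementation: the usage record shape and the names `createRoundRobin` and `chooseLeastUsed` are assumptions made for the example.

```js
'use strict'

// Hypothetical per-worker usage records; the field names are assumptions.
const workers = [
  { id: 0, executed: 10, executing: 2, queued: 1 },
  { id: 1, executed: 4, executing: 0, queued: 0 },
  { id: 2, executed: 7, executing: 1, queued: 3 }
]

// Round robin: cycle through the workers in order.
function createRoundRobin(workerCount) {
  let next = 0
  return () => {
    const chosen = next
    next = (next + 1) % workerCount
    return chosen
  }
}

// Least used: pick the worker with the minimum number of executed,
// executing and queued tasks.
function chooseLeastUsed(workerUsages) {
  const total = usage => usage.executed + usage.executing + usage.queued
  let chosen = workerUsages[0]
  for (const usage of workerUsages) {
    if (total(usage) < total(chosen)) chosen = usage
  }
  return chosen.id
}

const roundRobin = createRoundRobin(workers.length)
console.info([roundRobin(), roundRobin(), roundRobin(), roundRobin()]) // [ 0, 1, 2, 0 ]
console.info(chooseLeastUsed(workers)) // 1
```

Round robin needs no bookkeeping beyond a counter, while least used depends on per-worker task statistics being kept up to date.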

#### `ThreadPoolOptions extends PoolOptions`

- `workerOptions` (optional) - An object with the worker options to pass to worker. See [worker_threads](https://nodejs.org/api/worker_threads.html#worker_threads_new_worker_filename_options) for more details.

#### `ClusterPoolOptions extends PoolOptions`

- `env` (optional) - An object with the environment variables to pass to worker. See [cluster](https://nodejs.org/api/cluster.html#cluster_cluster_fork_env) for more details.

- `settings` (optional) - An object with the cluster settings. See [cluster](https://nodejs.org/api/cluster.html#cluster_cluster_settings) for more details.

### `class YourWorker extends ThreadWorker/ClusterWorker`

`taskFunctions` (mandatory) The task function or task functions object `{ name_1: fn_1, ..., name_n: fn_n }` that you want to execute on the worker  
`opts` (optional) An object with these properties:

- `maxInactiveTime` (optional) - Maximum waiting time in milliseconds for tasks on newly created workers. After this time newly created workers will die.  
  The last active time of your worker will be updated when it finishes a task.  
  If `killBehavior` is set to `KillBehaviors.HARD` this value also represents the timeout for the tasks that you submit to the pool: when this timeout expires your task is interrupted before completion and removed. The worker is killed if it is not part of the minimum size of the pool.  
  If `killBehavior` is set to `KillBehaviors.SOFT` your tasks have no timeout and your workers will not be terminated until your task is completed.  
  Default: `60000`

- `killBehavior` (optional) - Dictates if your worker will be deleted in case a task is active on it.  
  **KillBehaviors.SOFT**: If `currentTime - lastActiveTime` is greater than `maxInactiveTime` but a task is still executing or queued, then the worker **won't** be deleted.  
  **KillBehaviors.HARD**: If `currentTime - lastActiveTime` is greater than `maxInactiveTime` but a task is still executing or queued, then the worker will be deleted.  
  This option only applies to the newly created workers.  
  Default: `KillBehaviors.SOFT`
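
The interaction between `maxInactiveTime` and `killBehavior` described above can be sketched as a single decision function. This is an illustration only: the local `KillBehaviors` object (including its string values), `shouldDeleteWorker` and the `activeTasks` parameter are assumptions, not poolifier internals.

```js
'use strict'

// Local stand-in for the KillBehaviors values; the string values are assumptions.
const KillBehaviors = Object.freeze({ SOFT: 'SOFT', HARD: 'HARD' })

// Hypothetical helper: decides whether a newly created (dynamic) worker
// should be deleted, following the SOFT/HARD rules described above.
function shouldDeleteWorker({
  currentTime,
  lastActiveTime,
  activeTasks, // number of tasks still executing or queued on the worker
  maxInactiveTime = 60000,
  killBehavior = KillBehaviors.SOFT
}) {
  const inactiveTooLong = currentTime - lastActiveTime > maxInactiveTime
  if (!inactiveTooLong) return false
  if (activeTasks === 0) return true
  // a task is still executing or queued: only HARD deletes the worker
  return killBehavior === KillBehaviors.HARD
}

const state = { currentTime: 120000, lastActiveTime: 0, activeTasks: 1 }
console.info(shouldDeleteWorker(state)) // false (SOFT is the default)
console.info(shouldDeleteWorker({ ...state, killBehavior: KillBehaviors.HARD })) // true
```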

#### `YourWorker.hasTaskFunction(name)`

`name` (mandatory) The task function name

This method is available on both worker implementations and returns a boolean.

#### `YourWorker.addTaskFunction(name, fn)`

`name` (mandatory) The task function name  
`fn` (mandatory) The task function

This method is available on both worker implementations and returns a boolean.

#### `YourWorker.removeTaskFunction(name)`

`name` (mandatory) The task function name

This method is available on both worker implementations and returns a boolean.

#### `YourWorker.listTaskFunctions()`

This method is available on both worker implementations and returns an array of the task function names.

#### `YourWorker.setDefaultTaskFunction(name)`

`name` (mandatory) The task function name

This method is available on both worker implementations and returns a boolean.

## General guidance

Performance is one of the main targets of these worker pool implementations, and the poolifier team wants to keep a strong focus on it.
Poolifier already has a [benchmarks](./benchmarks/) folder where you can find some comparisons.

### Internal Node.js thread pool

Before jumping into each poolifier pool type, let us highlight that **Node.js comes with a thread pool already**, the libuv thread pool, where some particular tasks already run by default.
Please take a look at [which tasks run on the libuv thread pool](https://nodejs.org/en/docs/guides/dont-block-the-event-loop/#what-code-runs-on-the-worker-pool).

**If your task runs on libuv thread pool**, you can try to:

- Tune the libuv thread pool size by setting the [UV_THREADPOOL_SIZE](https://nodejs.org/api/cli.html#cli_uv_threadpool_size_size) environment variable.

and/or

- Use poolifier cluster pools that are spawning child processes, they will also increase the number of libuv threads since any new child process comes with a separate libuv thread pool. **More threads do not mean more speed, so please tune your application**.
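
A rough way to see the effect of `UV_THREADPOOL_SIZE` is to time a batch of tasks that run on the libuv thread pool, such as the asynchronous `crypto.pbkdf2()`. The sketch below uses only Node.js built-ins; `hashOnce` and `timeHashes` are hypothetical helper names, and the iteration count is an arbitrary choice for the example.

```js
'use strict'
const { pbkdf2 } = require('node:crypto')
const { performance } = require('node:perf_hooks')

// libuv defaults to 4 threads when UV_THREADPOOL_SIZE is not set.
const poolSize = Number(process.env.UV_THREADPOOL_SIZE ?? 4)

// The asynchronous crypto.pbkdf2() runs on the libuv thread pool.
function hashOnce() {
  return new Promise((resolve, reject) => {
    pbkdf2('secret', 'salt', 100000, 32, 'sha256', (err, key) => {
      if (err) reject(err)
      else resolve(key)
    })
  })
}

// Hypothetical benchmark helper: runs `count` hashes concurrently and times them.
async function timeHashes(count) {
  const start = performance.now()
  const keys = await Promise.all(Array.from({ length: count }, () => hashOnce()))
  return { count: keys.length, elapsedMs: performance.now() - start }
}

timeHashes(poolSize * 2).then(result => console.info({ poolSize, ...result }))
```

Running the same script with different values, for example `UV_THREADPOOL_SIZE=2` and then `UV_THREADPOOL_SIZE=8`, gives a first indication of whether a larger libuv thread pool helps on a given machine.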

### Cluster vs Threads worker pools

**If your task does not run on the libuv thread pool** and is CPU intensive, then poolifier **thread pools** (_FixedThreadPool_ and _DynamicThreadPool_) are suggested. You can still run I/O intensive tasks in thread pools, but the performance enhancement is expected to be minimal.
Thread pools are built on top of Node.js [worker_threads](https://nodejs.org/api/worker_threads.html) module.

**If your task does not run on the libuv thread pool** and is I/O intensive, then poolifier **cluster pools** (_FixedClusterPool_ and _DynamicClusterPool_) are suggested. Again, you can still run CPU intensive tasks in cluster pools, but the performance enhancement is expected to be minimal.
Consider that by default Node.js already has great performance for I/O tasks (asynchronous I/O).
Cluster pools are built on top of Node.js [cluster](https://nodejs.org/api/cluster.html) module.

If your task contains code that runs on libuv plus code that is CPU intensive or I/O intensive, you can either split it or combine several strategies (e.g. tune the number of libuv threads and use cluster/thread pools).
But in general, **always profile your application**.

### Fixed vs Dynamic pools

To choose your pool, consider first that with a _FixedThreadPool_/_FixedClusterPool_ or a _DynamicThreadPool_/_DynamicClusterPool_ your application memory footprint will increase.
By doing so, your application will be ready to execute more tasks in parallel, but during idle time it will consume more memory.
One good choice from the poolifier team's point of view is to profile your application using a fixed or dynamic worker pool, and analyze your application metrics when you increase/decrease the number of workers.
For example you could keep the memory footprint low by choosing a _DynamicThreadPool_/_DynamicClusterPool_ with a minimum of 5 workers, and allowing it to create new workers until a maximum of 50 workers if needed. This is the advantage of using a _DynamicThreadPool_/_DynamicClusterPool_.
But in general, **always profile your application**.
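
As a starting point for that kind of profiling, the sketch below measures how much the resident set size (RSS) grows when a number of idle worker threads are started. It uses the Node.js `worker_threads` module directly rather than a poolifier pool, so it only approximates the footprint of a real pool; `measureIdleWorkersRss` is a hypothetical helper name.

```js
'use strict'
const { Worker } = require('node:worker_threads')

// Hypothetical helper: starts `workerCount` idle worker threads, reports how
// much the resident set size (RSS) grew, then terminates them.
async function measureIdleWorkersRss(workerCount) {
  const rssBefore = process.memoryUsage().rss
  const workers = Array.from(
    { length: workerCount },
    // each worker just stays alive, doing nothing
    () => new Worker('setInterval(() => {}, 1000)', { eval: true })
  )
  // wait until every worker thread has started executing
  await Promise.all(
    workers.map(
      worker =>
        new Promise((resolve, reject) => {
          worker.once('online', resolve)
          worker.once('error', reject)
        })
    )
  )
  const rssAfter = process.memoryUsage().rss
  await Promise.all(workers.map(worker => worker.terminate()))
  return { workerCount, rssDeltaBytes: rssAfter - rssBefore }
}

measureIdleWorkersRss(4).then(result => console.info(result))
```

RSS readings are noisy, so it is worth repeating the measurement with several worker counts and comparing the trend rather than a single value.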

## Contribute

Choose your task here [2.6.x](https://github.com/orgs/poolifier/projects/1), propose an idea, a fix, an improvement.

See [CONTRIBUTING](CONTRIBUTING.md) guidelines.

## Team

**Creator/Owner:**

- [**Alessandro Pio Ardizio**](https://github.com/pioardi)

**_Contributors_**

- [**Shinigami92**](https://github.com/Shinigami92)
- [**Jérôme Benoit**](https://github.com/jerome-benoit)

## License

[MIT](./LICENSE)