<div align="center">
  <img src="./images/logo.png" width="340px" height="266px"/>
</div>

<h2 align="center">Node Thread Pool and Cluster Pool :arrow_double_up: :on:</h2>

<p align="center">
  <a href="https://www.npmjs.com/package/poolifier">
    <img alt="Weekly Downloads" src="https://img.shields.io/npm/dw/poolifier"></a>
  <a href="https://github.com/poolifier/poolifier/actions/workflows/ci.yml">
    <img alt="Actions Status" src="https://github.com/poolifier/poolifier/actions/workflows/ci.yml/badge.svg"></a>
  <a href="https://sonarcloud.io/dashboard?id=pioardi_poolifier">
    <img alt="Quality Gate Status" src="https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=alert_status"></a>
  <a href="https://sonarcloud.io/dashboard?id=pioardi_poolifier">
    <img alt="Code Coverage" src="https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=coverage"></a>
  <a href="https://standardjs.com">
    <img alt="Javascript Standard Style Guide" src="https://img.shields.io/badge/code_style-standard-brightgreen.svg"></a>
  <a href="https://gitter.im/poolifier/community?utm_source=badge&utm_medium=badge&utm_campaign=pr-badge">
    <img alt="Gitter chat" src="https://badges.gitter.im/poolifier/community.svg"></a>
  <a href="https://opencollective.com/poolifier">
    <img alt="Open Collective" src="https://opencollective.com/poolifier/tiers/badge.svg"></a>
  <a href="https://badgen.net/badge/Dependabot/enabled/green?icon=dependabot">
    <img alt="Dependabot" src="https://badgen.net/badge/Dependabot/enabled/green?icon=dependabot"></a>
  <a href="http://makeapullrequest.com">
    <img alt="PR Welcome" src="https://img.shields.io/badge/PRs-welcome-brightgreen.svg?style=flat-square"></a>
  <a href="https://img.shields.io/static/v1?label=dependencies&message=no%20dependencies&color=brightgreen">
    <img alt="No dependencies" src="https://img.shields.io/static/v1?label=dependencies&message=no%20dependencies&color=brightgreen"></a>
</p>

## Why Poolifier?

Poolifier is used to perform CPU intensive and I/O intensive tasks on Node.js servers. It implements worker pools using the [worker-threads](https://nodejs.org/api/worker_threads.html#worker_threads_worker_threads) module and cluster pools using the [Node.js cluster](https://nodejs.org/api/cluster.html) module.
With poolifier you can improve your **performance** and resolve problems related to the event loop.
Moreover, you can execute your tasks using an API designed to improve the **developer experience**.
Please consult our [general guidelines](#general-guidance).

- Performance :racehorse: [benchmarks](./benchmarks/README.md)
- Security :bank: :cop: [![Security Rating](https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=security_rating)](https://sonarcloud.io/dashboard?id=pioardi_poolifier) [![Vulnerabilities](https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=vulnerabilities)](https://sonarcloud.io/dashboard?id=pioardi_poolifier)
- Easy to use :couple:
- Dynamic pool size :white_check_mark:
- Easy switch from one pool to another :white_check_mark:
- No runtime dependencies :white_check_mark:
- Proper async integration with node async hooks :white_check_mark:
- Support for worker threads and cluster node modules :white_check_mark:
- Support for sync and async tasks :white_check_mark:
- General guidance on pools to use :white_check_mark:
- Widely tested :white_check_mark:
- Error handling out of the box :white_check_mark:
- Active community :white_check_mark:
- Code quality :octocat: [![Bugs](https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=bugs)](https://sonarcloud.io/dashboard?id=pioardi_poolifier)
  [![Code Smells](https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=code_smells)](https://sonarcloud.io/dashboard?id=pioardi_poolifier)
  [![Duplicated Lines (%)](https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=duplicated_lines_density)](https://sonarcloud.io/dashboard?id=pioardi_poolifier)
  [![Maintainability Rating](https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=sqale_rating)](https://sonarcloud.io/dashboard?id=pioardi_poolifier)
  [![Reliability Rating](https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=reliability_rating)](https://sonarcloud.io/dashboard?id=pioardi_poolifier)
  [![Technical Debt](https://sonarcloud.io/api/project_badges/measure?project=pioardi_poolifier&metric=sqale_index)](https://sonarcloud.io/dashboard?id=pioardi_poolifier)

## Contents

<h3 align="center">
  <a href="#overview">Overview</a>
  <span> · </span>
  <a href="#installation">Installation</a>
  <span> · </span>
  <a href="#usage">Usage</a>
  <span> · </span>
  <a href="#node-versions">Node versions</a>
  <span> · </span>
  <a href="#api">API</a>
  <span> · </span>
  <a href="#general-guidance">General guidance</a>
  <span> · </span>
  <a href="#contribute">Contribute</a>
  <span> · </span>
  <a href="#team">Team</a>
  <span> · </span>
  <a href="#license">License</a>
</h3>

## Overview

Poolifier contains two [worker-threads](https://nodejs.org/api/worker_threads.html#worker_threads_worker_threads)/[cluster worker](https://nodejs.org/api/cluster.html#cluster_class_worker) pool implementations, so you don't have to deal with the complexity of worker-threads or cluster workers.
The first implementation is a static worker pool, with a defined number of workers that are started at creation time and reused.
The second implementation is a dynamic worker pool. A number of workers are started at creation time (these workers are always active and reused), and additional workers are created when the load increases, up to an upper limit (these workers are reused while active). The newly created workers are stopped after a configurable period of inactivity.
You have to implement your worker by extending the ThreadWorker or ClusterWorker class.

## Installation

```shell
npm install poolifier --save
```

## Usage

You can implement a worker-threads worker in a simple way by extending the class ThreadWorker:

```js
'use strict'
const { ThreadWorker } = require('poolifier')

function yourFunction(data) {
  // this will be executed in the worker thread,
  // the data will be received by using the execute method
  return { ok: 1 }
}

module.exports = new ThreadWorker(yourFunction, {
  maxInactiveTime: 60000
})
```

Instantiate your pool based on your needs:

```js
'use strict'
const { DynamicThreadPool, FixedThreadPool, PoolEvents } = require('poolifier')

// a fixed worker-threads pool
const fixedPool = new FixedThreadPool(15, './yourWorker.js', {
  errorHandler: e => console.error(e),
  onlineHandler: () => console.log('worker is online')
})

fixedPool.emitter.on(PoolEvents.busy, () => console.log('Pool is busy'))

// or a dynamic worker-threads pool
const dynamicPool = new DynamicThreadPool(10, 100, './yourWorker.js', {
  errorHandler: e => console.error(e),
  onlineHandler: () => console.log('worker is online')
})

dynamicPool.emitter.on(PoolEvents.full, () => console.log('Pool is full'))
dynamicPool.emitter.on(PoolEvents.busy, () => console.log('Pool is busy'))

// the execute method signature is the same for both implementations,
// so you can easily switch from one to another
const pool = fixedPool // or dynamicPool
pool
  .execute({})
  .then(res => {
    console.log(res)
  })
  .catch(err => {
    console.error(err)
  })
```

You can do the same with the classes ClusterWorker, FixedClusterPool and DynamicClusterPool.

**See the examples folder for more details (in particular if you want to use a pool with [multiple worker functions](./examples/multiFunctionExample.js))**.

Remember that workers can only send and receive serializable data.
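
As a rough way to check whether a value can cross the worker-threads boundary, the sketch below tries to post it through a `MessageChannel` from the Node.js `worker_threads` module, which serializes messages the same way as worker thread messaging. `canBePosted` is a hypothetical helper name and is not part of poolifier. Cluster workers may serialize messages differently depending on their settings, so treat this only as an approximation for thread pools.

```js
'use strict'
const { MessageChannel } = require('worker_threads')

// Hypothetical helper (not part of poolifier): returns true when `value`
// can be serialized for a worker thread message, false otherwise.
function canBePosted(value) {
  const { port1, port2 } = new MessageChannel()
  try {
    // postMessage throws synchronously when the value cannot be cloned
    port1.postMessage(value)
    return true
  } catch (error) {
    return false
  } finally {
    // close the ports so they do not keep the process alive
    port1.close()
    port2.close()
  }
}

console.log(canBePosted({ id: 1, tags: ['a', 'b'] })) // plain data
console.log(canBePosted({ callback: () => {} })) // functions cannot be cloned
```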

## Node versions

Node versions >= 16.x are supported.

## [API](https://poolifier.github.io/poolifier/)

### `pool = new FixedThreadPool/FixedClusterPool(numberOfThreads/numberOfWorkers, filePath, opts)`

`numberOfThreads/numberOfWorkers` (mandatory) Number of workers for this pool
`filePath` (mandatory) Path to a file with a worker implementation
`opts` (optional) An object with these properties:

- `messageHandler` (optional) - A function that will listen for message event on each worker
- `errorHandler` (optional) - A function that will listen for error event on each worker
- `onlineHandler` (optional) - A function that will listen for online event on each worker
- `exitHandler` (optional) - A function that will listen for exit event on each worker
- `workerChoiceStrategy` (optional) - The worker choice strategy to use in this pool:

  - `WorkerChoiceStrategies.ROUND_ROBIN`: Submit tasks to workers in a round robin fashion
  - `WorkerChoiceStrategies.LESS_USED`: Submit tasks to the least used worker
  - `WorkerChoiceStrategies.LESS_BUSY`: Submit tasks to the least busy worker
  - `WorkerChoiceStrategies.WEIGHTED_ROUND_ROBIN`: Submit tasks to workers using a weighted round robin scheduling algorithm based on tasks execution time
  - `WorkerChoiceStrategies.FAIR_SHARE`: Submit tasks to workers using a fair share tasks scheduling algorithm based on tasks execution time

  `WorkerChoiceStrategies.WEIGHTED_ROUND_ROBIN` and `WorkerChoiceStrategies.FAIR_SHARE` strategies are targeted to heavy and long tasks.
  Default: `WorkerChoiceStrategies.ROUND_ROBIN`

- `workerChoiceStrategyOptions` (optional) - The worker choice strategy options object to use in this pool.
  Properties:

  - `medRunTime` (optional) - Use the tasks median run time instead of the tasks average run time in worker choice strategies.
  - `weights` (optional) - The worker weights to use in the weighted round robin worker choice strategy: `{ 0: 200, 1: 300, ..., n: 100 }`

  Default: `{ medRunTime: false }`

- `enableEvents` (optional) - Events emission enablement in this pool.
  Default: true
- `enableTasksQueue` (optional) - Tasks queue per worker enablement in this pool.
  Default: false

- `tasksQueueOptions` (optional) - The worker tasks queue options object to use in this pool.
  Properties:

  - `concurrency` (optional) - The maximum number of tasks that can be executed concurrently on a worker.

  Default: `{ concurrency: 1 }`
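
To make the first two strategies more concrete, here is a minimal sketch of what round robin and least used selection can look like. It is not poolifier's implementation: `createRoundRobin`, `chooseLeastUsed` and the `tasksRun` counter are hypothetical names, and "least used" is assumed here to mean the worker that has run the fewest tasks.

```js
'use strict'

// Hypothetical sketch, not poolifier's code: cycle through the workers in order.
function createRoundRobin(workers) {
  let nextIndex = 0
  return function chooseWorker() {
    const worker = workers[nextIndex]
    nextIndex = (nextIndex + 1) % workers.length
    return worker
  }
}

// Hypothetical sketch: pick the worker with the fewest tasks run so far
// (assumption: this is what "least used" means).
function chooseLeastUsed(workers) {
  return workers.reduce((least, worker) =>
    worker.tasksRun < least.tasksRun ? worker : least
  )
}

const workers = [
  { id: 0, tasksRun: 4 },
  { id: 1, tasksRun: 1 },
  { id: 2, tasksRun: 3 }
]

const chooseWorker = createRoundRobin(workers)
const roundRobinIds = [1, 2, 3, 4].map(() => chooseWorker().id)
console.log(roundRobinIds) // [ 0, 1, 2, 0 ]
console.log(chooseLeastUsed(workers).id) // 1
```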

### `pool = new DynamicThreadPool/DynamicClusterPool(min, max, filePath, opts)`

`min` (mandatory) Same as FixedThreadPool/FixedClusterPool numberOfThreads/numberOfWorkers; this number of workers will always be active
`max` (mandatory) Maximum number of workers that this pool can contain. Newly created workers are terminated after a period of inactivity (1 minute by default; you can override it in your worker implementation).
`filePath` (mandatory) Same as FixedThreadPool/FixedClusterPool
`opts` (optional) Same as FixedThreadPool/FixedClusterPool
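
The sizing rule described by `min` and `max` can be sketched as a small decision function. This is an illustration under assumptions, not poolifier's internals: `canCreateWorker` and its parameters are hypothetical, and it assumes a new worker is only created when every existing worker is busy.

```js
'use strict'

// Hypothetical sketch: decide whether a dynamic pool may create a new worker.
// Assumption: a worker is added only when all current workers are busy
// and the pool has not reached `max` yet.
function canCreateWorker({ max, workerCount, busyCount }) {
  const allBusy = busyCount === workerCount
  return allBusy && workerCount < max
}

console.log(canCreateWorker({ max: 100, workerCount: 10, busyCount: 10 })) // true
console.log(canCreateWorker({ max: 100, workerCount: 10, busyCount: 7 })) // false
console.log(canCreateWorker({ max: 100, workerCount: 100, busyCount: 100 })) // false
```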

### `pool.execute(data)`

`data` (optional) An object that you want to pass to your worker implementation
This method is available on both pool implementations and returns a promise.
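
Because `execute` returns a promise, several tasks can be submitted at once and awaited together. The sketch below relies only on that contract: `executeAll` is a hypothetical helper, and `fakePool` is a stand-in object used so that the snippet runs without a worker file. With poolifier you would pass a real pool instead.

```js
'use strict'

// Hypothetical helper: submit every input to the pool and wait for all results.
// It relies only on `pool.execute(data)` returning a promise.
function executeAll(pool, inputs) {
  return Promise.all(inputs.map(input => pool.execute(input)))
}

// Stand-in for a real pool (assumption for illustration only).
const fakePool = {
  execute: data => Promise.resolve({ doubled: data.value * 2 })
}

const results = executeAll(fakePool, [{ value: 1 }, { value: 2 }])
results
  .then(res => console.log(res)) // [ { doubled: 2 }, { doubled: 4 } ]
  .catch(err => console.error(err))
```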

### `pool.destroy()`

The destroy method is available on both pool implementations.
This method will call the terminate method on each worker.

### `class YourWorker extends ThreadWorker/ClusterWorker`

`taskFunctions` (mandatory) The task function(s) that you want to execute on the worker
`opts` (optional) An object with these properties:

- `maxInactiveTime` (optional) - Maximum time in milliseconds to wait for tasks to work on; after this period the newly created worker will die.
  The last active time of your worker unit is updated when a task is submitted to a worker or when a worker finishes a task.
  If `killBehavior` is set to `KillBehaviors.HARD`, this value also represents the timeout for the tasks that you submit to the pool: when this timeout expires, your task is interrupted and the worker is killed if it is not part of the minimum size of the pool.
  If `killBehavior` is set to `KillBehaviors.SOFT`, your tasks have no timeout and your workers will not be terminated until your task is completed.
  Default: 60000

- `async` (optional) - true/false. Set to true if your function contains async code pieces, else false.
  Default: false
- `killBehavior` (optional) - Dictates whether your async unit (worker/process) will be deleted when a task is still active on it.
  **KillBehaviors.SOFT**: If `currentTime - lastActiveTime` is greater than `maxInactiveTime` but a task is still running, then the worker **won't** be deleted.
  **KillBehaviors.HARD**: If `currentTime - lastActiveTime` is greater than `maxInactiveTime` but a task is still running, then the worker will be deleted.
  This option only applies to the newly created workers.
  Default: `KillBehaviors.SOFT`
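
The interaction between `maxInactiveTime` and `killBehavior` can be summarised in a small decision function. This is a sketch of the rule as described above, not the library's code: `shouldTerminate`, its parameter names and the `'SOFT'`/`'HARD'` string values are assumptions made for illustration.

```js
'use strict'

// Hypothetical sketch of the termination rule for a dynamically created worker.
function shouldTerminate({ now, lastActiveTime, maxInactiveTime, taskRunning, killBehavior }) {
  const expired = now - lastActiveTime > maxInactiveTime
  if (!expired) return false
  // SOFT: never terminate while a task is still running
  if (taskRunning && killBehavior === 'SOFT') return false
  // HARD, or no running task: the worker can be terminated
  return true
}

const base = { now: 70000, lastActiveTime: 0, maxInactiveTime: 60000 }
console.log(shouldTerminate({ ...base, taskRunning: true, killBehavior: 'SOFT' })) // false
console.log(shouldTerminate({ ...base, taskRunning: true, killBehavior: 'HARD' })) // true
console.log(shouldTerminate({ ...base, taskRunning: false, killBehavior: 'SOFT' })) // true
```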

## General guidance

Performance is one of the main targets of these worker pool implementations; we want to keep a strong focus on it.
We already have a benchmarks folder where you can find some comparisons.

### Internal Node.js thread pool

Before jumping into each poolifier pool type, let's highlight that **Node.js comes with a thread pool already**: the libuv thread pool, where some particular tasks already run by default.
Please take a look at [which tasks run on the libuv thread pool](https://nodejs.org/en/docs/guides/dont-block-the-event-loop/#what-code-runs-on-the-worker-pool).

**If your task runs on the libuv thread pool**, you can try to:

- Tune the libuv thread pool size by setting [UV_THREADPOOL_SIZE](https://nodejs.org/api/cli.html#cli_uv_threadpool_size_size).

and/or

- Use a poolifier cluster pool: spawning child processes also increases the number of libuv threads, since each new child process comes with a separate libuv thread pool. **More threads does not mean faster, so please tune your application**.
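
`UV_THREADPOOL_SIZE` is read from the environment of the Node.js process, so one straightforward way to apply it is to set it when the process is launched. The sketch below starts a child Node.js process with the variable set and prints what the child sees. The value `8` is an arbitrary example, not a recommendation.

```js
'use strict'
const { spawnSync } = require('child_process')

// Launch a child Node.js process with a custom libuv thread pool size.
// The child only echoes the variable back to show that it was received.
const child = spawnSync(
  process.execPath,
  ['-e', 'console.log(process.env.UV_THREADPOOL_SIZE)'],
  {
    env: { ...process.env, UV_THREADPOOL_SIZE: '8' },
    encoding: 'utf8'
  }
)

const sizeSeenByChild = child.stdout.trim()
console.log(sizeSeenByChild) // 8
```

From a shell, the equivalent is to prefix the command that starts your application with the variable assignment.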

### Cluster vs Threads worker pools

**If your task does not run on the libuv thread pool** and is CPU intensive, then poolifier **thread pools** (FixedThreadPool and DynamicThreadPool) are suggested. You can still run I/O intensive tasks in thread pools, but the performance enhancement is expected to be minimal.
Thread pools are built on top of the Node.js [worker-threads](https://nodejs.org/api/worker_threads.html#worker_threads_worker_threads) module.

**If your task does not run on the libuv thread pool** and is I/O intensive, then poolifier **cluster pools** (FixedClusterPool and DynamicClusterPool) are suggested. Again, you can still run CPU intensive tasks in cluster pools, but the performance enhancement is expected to be minimal.
Consider that by default Node.js already has great performance for I/O tasks (asynchronous I/O).
Cluster pools are built on top of the Node.js [cluster](https://nodejs.org/api/cluster.html) module.

If your task contains code that runs on libuv plus code that is CPU intensive or I/O intensive, you can either split it or combine several strategies (i.e. tune the number of libuv threads and use cluster/thread pools).
But in general, **always profile your application**.

### Fixed vs Dynamic pools

To choose your pool, consider that with a FixedThreadPool/FixedClusterPool or a DynamicThreadPool/DynamicClusterPool (in this case the min parameter passed to the constructor is important) your application memory footprint will increase.
By increasing the memory footprint, your application will be ready to accept more tasks, but during idle time it will consume more memory.
One good choice, from my point of view, is to profile your application using a Fixed/Dynamic worker pool and to watch your application metrics when you increase/decrease the number of workers.
For example, you could keep the memory footprint low by choosing a DynamicThreadPool/DynamicClusterPool with 5 workers, and allow the creation of new workers up to 50/100 when needed; this is the advantage of using the DynamicThreadPool/DynamicClusterPool.
But in general, **always profile your application**.
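
As a starting point for profiling the memory cost of a pool, the resident set size can be sampled with `process.memoryUsage()` before and after creating it. `measureRssDelta` is a hypothetical helper, and the array allocation below is only a placeholder for the code that would create your pool.

```js
'use strict'

// Hypothetical helper: run `setup` and report how much the resident set size
// (RSS) changed, in megabytes. Replace `setup` with your pool creation code.
function measureRssDelta(setup) {
  const before = process.memoryUsage().rss
  const created = setup()
  const after = process.memoryUsage().rss
  return { created, deltaMb: (after - before) / (1024 * 1024) }
}

// Placeholder workload standing in for the creation of a pool.
const { deltaMb } = measureRssDelta(() => new Array(1e6).fill(0))
console.log(`RSS changed by ${deltaMb.toFixed(1)} MB`)
```

Since workers start asynchronously, a single sample taken right after construction is likely to underestimate the footprint; sampling periodically while varying the number of workers gives a better picture. For cluster pools the workers are separate processes, so their memory has to be measured per process.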

## Contribute

Choose your task here [2.4.x](https://github.com/orgs/poolifier/projects/1), propose an idea, a fix, an improvement.

See [CONTRIBUTING](CONTRIBUTING.md) guidelines.

## Team

<!-- ALL-CONTRIBUTORS-LIST:START - Do not remove or modify this section -->

**Creator/Owner:**

- [**Alessandro Pio Ardizio**](https://github.com/pioardi)

**_Contributors_**

- [**Shinigami92**](https://github.com/Shinigami92)
- [**Jérôme Benoit**](https://github.com/jerome-benoit)

## License

[MIT](./LICENSE)