This article is reproduced from the WeChat public account "JavaGuide". Author: Guide Brother. Please contact the JavaGuide public account for reprinting this article.
Hello everyone, I am Guide, a technical person with a more positive outlook than the protagonist. Today, let's continue to talk about thread pools~ This article is about 5,000 words long and is absolutely informative. The title is a bit exaggerated, but it is actually a summary of some of the more important points I personally feel when using thread pools. Thread pool knowledge review Before starting this article, I will briefly introduce the thread pool. The previous article "Thread Pool Learning Summary that Newbies Can Understand" introduced it in detail. Why use thread pool? "Pooling technology is already common. Thread pools, database connection pools, Http connection pools, etc. are all applications of this idea. The idea of pooling technology is mainly to reduce the consumption of each resource acquisition and improve the utilization of resources. A thread pool provides a way to limit and manage resources (including executing a task). Each thread pool also maintains some basic statistics, such as the number of completed tasks. Here I would like to borrow the "Art of Java Concurrency Programming" to talk about the benefits of using thread pools:
Thread pool usage scenarios in actual projects Thread pools are generally used to execute multiple unrelated time-consuming tasks. Without multithreading, tasks are executed sequentially. Using thread pools allows multiple unrelated tasks to be executed simultaneously. Suppose we want to perform three unrelated time-consuming tasks. Guide draws a picture to show you the difference before and after using the thread pool. Note: The following three tasks may do the same thing or they may be different things. Comparison before and after using thread pool How to use thread pool? Generally, a thread pool is created through the ThreadPoolExecutor constructor, and then tasks are submitted to the thread pool for execution. The ThreadPoolExecutor constructor is as follows:
This is a brief demonstration of how to use the thread pool. For a more detailed introduction, please see: "A summary of thread pool learning that even novices can understand."
Console output:
Thread Pool Best Practices Let me briefly summarize what I know about the things that should be paid attention to when using thread pools. There seems to be no article specifically written on this topic on the Internet. Because the Guide is still relatively new, if there are additions, improvements, or errors, you can let me know in the comment section or communicate with me on WeChat. 1. Declare the thread pool using the ThreadPoolExecutor constructor The thread pool must be manually declared through the ThreadPoolExecutor constructor. Avoid using newFixedThreadPool and newCachedThreadPool of the Executors class because there may be a risk of OOM. The drawbacks of Executors returning thread pool objects are as follows:
To put it simply: use bounded queues to control the number of threads created. In addition to avoiding OOM, the reasons why the two quick thread pools provided by Executors are not recommended are:
2. Monitor the thread pool running status You can use some means to detect the running status of the thread pool, such as the Actuator component in SpringBoot. In addition, we can also use the relevant API of ThreadPoolExecutor to do a simple monitoring. As can be seen from the figure below, ThreadPoolExecutor provides the ability to obtain the current number of threads and active threads in the thread pool, the number of completed tasks, the number of tasks in queue, and so on. Below is a simple demo. printThreadPoolStatus() will print out the number of threads in the thread pool, the number of active threads, the number of completed tasks, and the number of tasks in the queue every second.
3. It is recommended to use different thread pools for different types of business Many people have similar questions in actual projects: In my project, multiple businesses need to use thread pools. Should I define one thread pool for each or a common thread pool? It is generally recommended that different businesses use different thread pools. When configuring the thread pool, configure the current thread pool according to the current business situation. Because the concurrency and resource usage of different businesses are different, focus on optimizing businesses related to system performance bottlenecks. Let's look at a real accident case! (This case comes from: "An online accident caused by improper use of thread pool" @https://club.perfma.com/article/646639, a very exciting case) Example Code Overview The above code may cause deadlock. Why? Let me draw a picture to explain it to you. Consider this extreme situation: Suppose the number of core threads in our thread pool is n, the number of parent tasks (charge-collection tasks) is n, and there are two child tasks (child tasks under charge-collection tasks) under the parent task, one of which has been completed, and the other has been placed in the task queue. Since the parent task has used up the core thread resources of the thread pool, the child task cannot be executed normally because it cannot obtain thread resources and has been blocked in the queue. The parent task waits for the child task to complete execution, and the child task waits for the parent task to release thread pool resources, which causes a "deadlock". The solution is also very simple, which is to add a new thread pool for executing subtasks to serve it specifically. 4. Don’t forget to name the thread pool When initializing the thread pool, you need to display the name (set the thread pool name prefix) to help locate the problem. By default, the thread name created is similar to pool-1-thread-n, which has no business meaning and is not conducive to locating problems. There are usually two ways to name threads in the thread pool: 1). Using guava's ThreadFactoryBuilder
2). Implement ThreadFactor yourself.
5. Correctly configure thread pool parameters When it comes to how to configure parameters for the thread pool, Meituan’s operation is still unforgettable to me (I will mention it later)! Let's first take a look at the generally recommended ways to configure thread pool parameters in various books and blogs, which can be used as a reference! General Operation Many people may even think that it is better to configure the thread pool too large! I think this is obviously problematic. Take a very common example in our lives: more people does not mean that things can be done well, which increases the cost of communication. You only need 3 people to do a thing, but you force 6 people to do it. Will it improve the efficiency? I don't think so. The impact of too many threads is the same as how many people we assign to do things. For the multi-threaded scenario, it mainly increases the cost of context switching. If you don't know what context switching is, you can see my introduction below. "Context switch: In multithreaded programming, the number of threads is generally greater than the number of CPU cores, and a CPU core can only be used by one thread at any time. In order to allow all these threads to be executed effectively, the CPU adopts a strategy of allocating time slices to each thread and rotating them. When a thread's time slice is used up, it will be ready again for other threads to use. This process is a context switch. In summary, the current task will save its own state before switching to another task after executing the CPU time slice, so that the state of this task can be loaded again when it switches back to this task next time. The process from saving to reloading a task is a context switch. Context switching is usually computationally intensive. That is, it requires considerable processor time. In the dozens or hundreds of switches per second, each switch takes nanoseconds. Therefore, context switching means a lot of CPU time consumption for the system. In fact, it may be the most time-consuming operation in the operating system. Linux has many advantages over other operating systems (including other Unix-like systems), one of which is that its context switching and mode switching consume very little time. Analogous to the realization that humans in the world do something through cooperation, one thing we can be sure of is that setting the thread pool size too large or too small will cause problems, and the appropriate size is the best. If the number of thread pools we set is too small, if there are a large number of tasks/requests to be processed at the same time, it may cause a large number of requests/tasks to queue up in the task queue waiting for execution, or even the situation where the task queue is full and the tasks/requests cannot be processed, or a large number of tasks are piled up in the task queue, causing OOM. This is obviously problematic! The CPU is not fully utilized at all. However, if we set the number of threads to be too large, a large number of threads may compete for CPU resources at the same time, which will cause a large number of context switches, thereby increasing the execution time of the threads and affecting the overall execution efficiency. There is a simple and widely applicable formula:
How to determine whether it is a CPU-bound task or an IO-bound task? CPU intensive tasks are tasks that use CPU computing power, such as sorting a large amount of data in memory. Anything involving network reading or file reading is IO intensive. The characteristic of such tasks is that the CPU computing time is very small compared to the time waiting for the IO operation to complete, and most of the time is spent waiting for the IO operation to complete. Meituan’s tricks The Meituan technical team introduced the ideas and methods for implementing customizable configuration of thread pool parameters in the article "Java Thread Pool Implementation Principles and Its Practice in Meituan Business". The idea of the Meituan technical team is to make the core parameters of the thread pool customizable. These three core parameters are:
Why these three parameters? I have said in this article "Thread Pool Learning Summary That Newbies Can Understand" that these three parameters are the most important parameters of ThreadPoolExecutor, and they basically determine the thread pool's task processing strategy. How to support dynamic configuration of parameters? Let's look at the following methods provided by ThreadPoolExecutor. Special attention should be paid to corePoolSize. When the program is running, if we call the setCorePoolSize() method, the thread pool will first determine whether the current number of working threads is greater than corePoolSize. If so, the working threads will be recycled. In addition, you can see that there is no method to dynamically specify the queue length above. Meituan’s method is to customize a queue called ResizableCapacityLinkedBlockIngQueue (mainly by removing the final keyword modification of the capacity field of LinkedBlockingQueue to make it variable). The final effect of dynamically modifying thread pool parameters is as follows. 👏👏👏 Final effect of dynamically configuring thread pool parameters |
<<: 5G development is gradually spreading like wildfire, and China is expected to take the lead
>>: What do you think the sequence number of a TCP reset message is?
The Internet of Things (IoT) is a technological r...
On February 15 , Qualcomm's official website ...
New Year, New Atmosphere. Pesyun Standard Interco...
Both IP addresses and MAC addresses identify devi...
In the 5G era, real-time communication is still a...
On April 25, China Mobile General Manager Dong Xi...
10 days ago, pureLifi raised $18 million in Serie...
As science and technology develops at an increasi...
Networks are increasingly reliant on software and...
Megalayer launched the "April Carnival, Cont...
[[320457]] This article is reproduced from Leipho...
UUUVPS (Sanyou Cloud) launched the promotion duri...
[[407105]] On June 23, according to the "Eco...
A local area network (LAN) is a computer network ...
From April 17 to 19, the 4th "National Civil...