Linux系统之Wait CPU time解析

123人浏览 2024-05-28 22:23:23

：在Linux操作系统中，通常采用8个不同的指标来研究Linux / Unix操作系统中的CPU消耗：用户CPU时间（us）、系统CPU时间（sy）、良好的CPU时间（ni）、空闲CPU时间（id）、等待CPU时间（wa）、硬件中断CPU时间（hi），软件中断CPU时间（si），被盗CPU时间（st）。在本文中，我们主要针对“等待CPU时间”进行解析。

什么是“等待” CPU时间？

等待CPU时间表示CPU等待磁盘I / O或网络I / O操作完成所花费的时间。等待时间过长表示由于该设备上的I / O操作，CPU被“绞死”了。为了获得最佳性能，应该以使I / O等待CPU时间尽可能短为目标。如果等待时间> 10％，则需要对其进行问题排查。

我们可以通过以下场景来形象化描述I / O等待时间：大家应该经历过或者已经在堵车中，有数百辆汽车在繁忙的道路上等待交通信号灯从“红色”切换为“绿色”。但是由于技术上的故障（此刻，交警也不在现场），交通信号灯从“红色”转换为“绿色”经历很长时间，迟迟没有发生更替。结果呢？数百辆汽车就傻傻地在原地等待，等待第三方介入处理。如果没有及时处理，这将导致多种不良后果：乘客将无法及时到达目的地，驾驶员可能会感到沮丧并开始鸣喇叭（噪音污染），并且燃油将被浪费（空气污染），更有甚者直接无视交规，酿成大祸。

如何查找“等待” CPU时间？可从以下来源找到等待的CPU时间：

1、可以使用基于网络的根本原因分析工具来报告“等待”的CPU时间。如果“等待” CPU时间超出阈值，该工具便能够生成警报。

2、Linux/Unix命令行工具“ wa”字段中的“ top”中也能够打印“等待” CPU时间，如下图所示：

[administrator@JavaLangOutOfMemory nacos-docker ]% top
top - 20:50:49 up 20:39,  2 users,  load average: 1.13, 0.86, 1.05
Tasks: 123 total,   1 running, 122 sleeping,   0 stopped,   0 zombie
%Cpu(s):  9.1 us,  6.2 sy,  0.0 ni, 83.9 id,  0.0 wa,  0.0 hi,  0.8 si,  0.0 st
KiB Mem :  3880584 total,  1006448 free,   859684 used,  2014452 buff/cache
KiB Swap:        0 total,        0 free,        0 used.  2583448 avail Mem 


  PID USER      PR  NI    VIRT    RES    SHR S  %CPU %MEM     TIME+ COMMAND                                                                                                                  
 2824 root      20   0  621140 357616  18272 S  10.9  9.2 118:20.09 kube-apiserver                                                                                                           
 2805 root      20   0  222392  49924  11240 S   7.6  1.3  81:42.59 kube-controller                                                                                                          
11442 root      20   0 1512420  69476  33528 S   6.9  1.8   0:03.56 kubelet                                                                                                                  
 2783 root      20   0   10.1g  58052   8152 S   5.3  1.5  48:55.54 etcd                                                                                                                     
 1108 root      20   0  626628  85044  10092 S   4.3  2.2  45:54.18 dockerd                                                                                                                  
  974 root      20   0 1078420  41340   4352 S   1.0  1.1   8:44.73 containerd                                                                                                               
11794 root      20   0  162136   2248   1556 R   0.7  0.1   0:00.09 top                                                                                                                      
    6 root      20   0       0      0      0 S   0.3  0.0   1:05.12 ksoftirqd/0                                                                                                              
    9 root      20   0       0      0      0 S   0.3  0.0   2:42.23 rcu_sched                                                                                                                
  395 root      20   0       0      0      0 S   0.3  0.0   2:26.41 xfsaild/dm-0                                                                                                             
  641 root      20   0   21540   1204    976 S   0.3  0.0   0:07.82 irqbalance                                                                                                               
 2858 root      20   0  147076  20348   6116 S   0.3  0.5   3:58.21 kube-scheduler                                                                                                           
 3884 root      20   0  142856  15952   4376 S   0.3  0.4   1:26.73 kube-proxy                                                                                                               
11268 root      20   0       0      0      0 S   0.3  0.0   0:00.01 kworker/u4:0                                                                                                             
    1 root      20   0  125616   3692   2152 S   0.0  0.1   0:42.74 systemd                                                                                                                  
    2 root      20   0       0      0      0 S   0.0  0.0   0:00.02 kthreadd                                                                                                                 
    4 root       0 -20       0      0      0 S   0.0  0.0   0:00.00 kworker/0:0H                                                                                                             
    7 root      rt   0       0      0      0 S   0.0  0.0   0:03.03 migration/0                                                                                                              
    8 root      20   0       0      0      0 S   0.0  0.0   0:00.00 rcu_bh                                                                                                                   
   10 root       0 -20       0      0      0 S   0.0  0.0   0:00.00 lru-add-drain                                                                                                            
   11 root      rt   0       0      0      0 S   0.0  0.0   0:05.44 watchdog/0

如何模拟较高的“等待” CPU时间？

为了模拟高“等待” CPU报告，同样原理，与之前的“用户” CPU时间场景类似，我们写个简单的Demon。将其打成jar包，使其运行以模拟各种性能问题。当我们启动此应用jar包时，它将导致主机上的“等待” CPU消耗激增。具体如下：

[administrator@JavaLangOutOfMemory cpu ]% java -jar devopsDemo.jar PROBLEM_IO
Application started!
Starting to write to iofile-01.log
Starting to write to iofile-02.log
Starting to write to iofile-03.log
Starting to write to iofile-04.log
Starting to write to iofile-05.log
Starting to write to iofile-06.log
Read & write 1000 times to iofile-05.log
Read & write 1000 times to iofile-02.log
Read & write 1000 times to iofile-01.log
Read & write 1000 times to iofile-04.log
Read & write 1000 times to iofile-03.log
Read & write 1000 times to iofile-06.log
Read & write 1000 times to iofile-04.log
Read & write 1000 times to iofile-02.log
... ...

针对此应用jar包，我们看下其部分代码如下：

public class IODemo { 


    public void start() {              


             for (int counter =1; counter <= 6; ++counter) { 


            // Launch 6 threads.             
            
            new IOThread ("iofile-" + counter + ".log").start();         


         }     
         
     } 


}


public class IOThread extends Thread {     


    public String fileName; 


    public static final String CONTENT = 


"Hello CPU World! We are building a world where cpu resources are wasted and memmory resource are leaking. \n" + 


"Hello CPU World! We are building a world where cpu resources are wasted and memmory resource are leaking. \n" + 


"Hello CPU World! We are building a world where cpu resources are wasted and memmory resource are leaking. \n" + 


"Hello CPU World! We are building a world where cpu resources are wasted and memmory resource are leaking. \n" 


"Hello CPU World! We are building a world where cpu resources are wasted and memmory resource are leaking. \n" + 


"Hello CPU World! We are building a world where cpu resources are wasted and memmory resource are leaking. \n" + 


"Hello CPU World! We are building a world where cpu resources are wasted and memmory resource are leaking. \n" + 


"Hello CPU World! We are building a world where cpu resources are wasted and memmory resource are leaking. \n" 


    public IOThread(String fileName) {         


           this.fileName = fileName;     


}     


    public void run() {     


        int counter = 0; 


        // Loop infinitely trying to read and close the file.         


       while (true) { 


            // Write the contents to the file.             


            FileUtil.write(fileName, CONTENT);                          




            // Read the contents from the file.             


             FileUtil.read(fileName);         


         }        


     } 


}