For whatever reason, at around 8:50am EST today my disk latency, disk utilization, RAM, and CPU usage skyrocketed. I usually have a CPU load of around 0.5, and it's currently 77!
I had made a few minor Exim changes through the WHM Basic Configuration editor, so I thought that might have been the culprit. I restored, and even turned off Exim entirely, with no change, so I guess that wasn't it.
You can see exactly what I mean here:
Code:
# sar -q
Linux 2.6.32-431.5.1.el6.x86_64 (server1.example.com) 04/02/2014 _x86_64_(2 CPU)
12:00:01 AM runq-sz plist-sz ldavg-1 ldavg-5 ldavg-15
12:10:01 AM 1 200 0.06 0.17 0.17
12:20:01 AM 2 195 0.12 0.21 0.18
12:30:01 AM 4 195 0.06 0.17 0.17
12:40:01 AM 2 198 0.27 0.19 0.18
12:50:01 AM 2 206 0.21 0.19 0.18
01:00:01 AM 4 196 0.11 0.12 0.15
01:10:01 AM 2 194 0.01 0.13 0.16
01:20:01 AM 2 190 0.06 0.15 0.16
01:30:01 AM 3 189 0.06 0.14 0.16
01:40:01 AM 2 199 0.02 0.08 0.13
01:50:01 AM 2 197 0.25 0.13 0.10
02:00:01 AM 2 196 0.03 0.06 0.08
02:10:01 AM 3 202 0.15 0.12 0.09
02:20:01 AM 2 185 0.09 0.05 0.06
02:30:01 AM 4 191 0.18 0.08 0.04
02:40:01 AM 2 190 0.00 0.01 0.01
02:50:01 AM 2 191 0.03 0.03 0.00
03:00:01 AM 2 184 0.07 0.11 0.03
03:10:01 AM 2 187 0.19 0.16 0.10
03:20:01 AM 2 187 0.06 0.10 0.09
03:30:01 AM 3 183 0.02 0.08 0.08
03:40:02 AM 2 200 0.04 0.06 0.07
03:40:02 AM runq-sz plist-sz ldavg-1 ldavg-5 ldavg-15
03:50:01 AM 2 195 0.08 0.09 0.08
04:00:01 AM 3 193 0.07 0.20 0.17
04:10:01 AM 2 190 0.05 0.09 0.12
04:20:01 AM 2 200 0.42 0.32 0.20
04:30:01 AM 3 205 0.18 0.20 0.18
04:40:01 AM 2 192 0.16 0.21 0.18
04:50:01 AM 2 198 0.22 0.18 0.16
05:00:01 AM 2 192 0.16 0.16 0.15
05:10:01 AM 2 184 0.17 0.11 0.10
05:20:01 AM 2 186 0.18 0.11 0.09
05:30:01 AM 3 195 0.07 0.06 0.06
05:40:01 AM 2 194 0.12 0.14 0.11
05:50:01 AM 2 206 0.19 0.13 0.10
06:00:01 AM 3 201 0.04 0.12 0.10
06:10:01 AM 2 201 0.11 0.08 0.08
06:20:01 AM 4 213 0.06 0.12 0.10
06:30:01 AM 3 210 0.13 0.19 0.13
06:40:01 AM 2 206 0.24 0.22 0.17
06:50:01 AM 2 205 0.05 0.14 0.16
07:00:01 AM 2 210 0.17 0.19 0.18
07:10:01 AM 3 207 0.10 0.17 0.17
07:20:01 AM 2 228 0.41 0.27 0.20
07:20:01 AM runq-sz plist-sz ldavg-1 ldavg-5 ldavg-15
07:30:01 AM 2 212 0.23 0.18 0.17
07:40:01 AM 4 213 0.03 0.18 0.18
07:50:01 AM 2 231 0.18 0.24 0.20
08:00:01 AM 4 246 0.09 0.27 0.25
08:10:01 AM 2 231 0.48 0.28 0.22
08:20:01 AM 1 299 0.54 0.68 0.50
08:30:01 AM 5 241 0.80 1.16 0.90
08:40:01 AM 2 286 0.52 0.95 0.96
08:50:01 AM 2 255 10.68 7.60 3.85
09:00:01 AM 1 347 15.27 8.13 5.00
09:10:01 AM 2 224 5.09 5.79 5.44
09:20:01 AM 2 313 33.26 15.00 8.44
09:30:01 AM 2 289 13.55 25.06 19.79
09:40:01 AM 2 295 25.29 31.82 27.66
09:50:01 AM 1 312 37.19 28.17 26.51
10:00:01 AM 2 232 4.27 11.82 19.89
10:10:01 AM 2 267 11.29 8.02 13.50
10:20:01 AM 2 288 28.31 35.00 25.58
10:30:02 AM 3 222 2.95 9.96 17.47
10:40:02 AM 3 235 4.73 4.73 11.03
10:50:01 AM 3 215 1.26 4.04 8.30
11:00:01 AM 0 220 5.29 7.98 8.47
11:00:01 AM runq-sz plist-sz ldavg-1 ldavg-5 ldavg-15
11:10:01 AM 2 231 4.63 4.48 6.49
11:20:01 AM 2 309 73.26 54.97 27.27
11:30:01 AM 2 271 1.68 11.20 17.52
11:40:01 AM 3 244 5.08 5.15 11.27
11:50:01 AM 2 313 53.20 29.30 18.71
12:00:01 PM 2 223 1.43 10.97 16.00
12:10:01 PM 3 254 3.21 3.42 9.33
12:20:01 PM 2 232 3.04 2.55 6.05
12:30:01 PM 4 214 4.32 3.39 4.64
12:40:13 PM 5 313 27.70 24.75 14.27
12:50:01 PM 2 288 10.56 15.85 16.26
01:00:01 PM 3 216 1.05 3.70 9.50
01:10:01 PM 1 241 1.92 5.92 8.35
01:20:01 PM 2 316 54.19 34.36 18.99
01:30:01 PM 2 213 3.33 10.49 14.00
01:40:01 PM 3 234 4.86 4.99 9.14
01:50:01 PM 2 243 9.73 11.92 11.20
02:00:01 PM 2 220 5.78 8.41 9.21
02:10:01 PM 2 231 3.87 4.82 6.95
02:20:01 PM 2 224 4.76 4.61 5.62
02:30:01 PM 3 228 1.41 3.70 5.37
02:40:01 PM 3 252 3.10 4.01 4.60
02:40:01 PM runq-sz plist-sz ldavg-1 ldavg-5 ldavg-15
02:50:01 PM 2 231 3.07 3.79 4.53
03:00:03 PM 2 319 27.38 14.55 8.95
03:10:01 PM 2 226 4.17 8.28 9.53
03:20:01 PM 2 358 16.61 16.15 12.25
03:30:01 PM 2 293 30.77 39.20 26.62
03:40:01 PM 2 238 3.64 16.06 21.69
03:50:01 PM 2 249 11.61 24.09 23.27
04:00:03 PM 1 349 40.80 33.00 25.51
04:10:01 PM 2 289 24.49 26.66 24.34
Average: 2 231 6.72 6.78 6.36
Iostat doesn't look much different than when I first posted, though:
Code:
# iostat
Linux 2.6.32-431.5.1.el6.x86_64 (server1.example.com) 04/02/2014 _x86_64_(2 CPU)
avg-cpu: %user %nice %system %iowait %steal %idle
5.89 0.63 3.92 2.66 0.07 86.84
Device: tps Blk_read/s Blk_wrtn/s Blk_read Blk_wrtn
xvdb 0.27 1.24 1.02 976420 805592
xvda 48.77 169.81 557.79 134086558 440438298
In Munin, all of the graphs are super high. VMStat shows "I/O Sleep" at 9.52, where it's usually at 0. CPU Usage - iowait is at 149.2, and it's usually at 0.5.
IO Service Time also had a major shift on the chart, but I don't know how to read or explain it; in Munin, the red line is usually stable halfway between 1e03 and 1e04, but now it is erratic and halfway between 1e06 and 1e07.
Disk Utilization in Munin is usually at around 500m, but it's currently at 95.54 (not 95m, but 95; so, 200 times higher than usual).
Disk Latency is usually around 180m, but is currently at 2.49.
If it matters, I'm using WHM 11.42.0 (build 23).
Any suggestions? Any changes to cPanel at 8:50am that could cause this?