For the past 3 days, every morning around 7am BRT one of my servers is facing high server load and 503 service unavailable errors.
I have 2600 cpanel accounts in it, with around 2100 ~ "active" and 500 suspended accounts.
Something is occurring and is generating massive log files on /var/log that quickly take up all free disk space (around 400gb)
The error seem in the logs is:
2022-12-11 13:25:33.999958 [ERROR] [2632281] [T0] SocketListener::handleEvents(): [*:80] can't accept:Too many open files in system!
2022-12-11 13:25:33.999967 [ERROR] [2632281] [T0] SocketListener::handleEvents(): [*:443] can't accept:Too many open files in system!
2022-12-11 13:25:33.999974 [ERROR] [2632281] [T0] SocketListener::handleEvents(): [*:80] can't accept:Too many open files in system!
2022-12-11 13:25:33.999981 [ERROR] [2632281] [T0] SocketListener::handleEvents(): [*:443] can't accept:Too many open files in system!
2022-12-11 13:25:33.999987 [ERROR] [2632281] [T0] SocketListener::handleEvents(): [*:80] can't accept:Too many open files in system!
2022-12-11 13:25:33.999996 [ERROR] [2632281] [T0] SocketListener::handleEvents(): [*:443] can't accept:Too many open files in system!
2022-12-11 13:25:34.000004 [ERROR] [2632281] [T0] SocketListener::handleEvents(): [*:80] can't accept:Too many open files in system!
Also one weird thing is when I terminate suspended accounts it's taking quite a lot of time (20-30 seconds) while on my other server it takes 3-5.
And while the accs are being terminated, load average goes up a lot from 1.5 to around 4-5.
My server is with Hivelocity, they can't find the cause. I hired Bobcares, they also can't find the cause...
I'm freaking out. We had 3 hours of downtime today. "Luckily" this only happens in the morning while not many users are online.
Without knowing the cause, only fix we find is to wait a bit, then reboot server and delete all the massive logs.
I have 2600 cpanel accounts in it, with around 2100 ~ "active" and 500 suspended accounts.
Something is occurring and is generating massive log files on /var/log that quickly take up all free disk space (around 400gb)
The error seem in the logs is:
2022-12-11 13:25:33.999958 [ERROR] [2632281] [T0] SocketListener::handleEvents(): [*:80] can't accept:Too many open files in system!
2022-12-11 13:25:33.999967 [ERROR] [2632281] [T0] SocketListener::handleEvents(): [*:443] can't accept:Too many open files in system!
2022-12-11 13:25:33.999974 [ERROR] [2632281] [T0] SocketListener::handleEvents(): [*:80] can't accept:Too many open files in system!
2022-12-11 13:25:33.999981 [ERROR] [2632281] [T0] SocketListener::handleEvents(): [*:443] can't accept:Too many open files in system!
2022-12-11 13:25:33.999987 [ERROR] [2632281] [T0] SocketListener::handleEvents(): [*:80] can't accept:Too many open files in system!
2022-12-11 13:25:33.999996 [ERROR] [2632281] [T0] SocketListener::handleEvents(): [*:443] can't accept:Too many open files in system!
2022-12-11 13:25:34.000004 [ERROR] [2632281] [T0] SocketListener::handleEvents(): [*:80] can't accept:Too many open files in system!
Also one weird thing is when I terminate suspended accounts it's taking quite a lot of time (20-30 seconds) while on my other server it takes 3-5.
And while the accs are being terminated, load average goes up a lot from 1.5 to around 4-5.
My server is with Hivelocity, they can't find the cause. I hired Bobcares, they also can't find the cause...
I'm freaking out. We had 3 hours of downtime today. "Luckily" this only happens in the morning while not many users are online.
Without knowing the cause, only fix we find is to wait a bit, then reboot server and delete all the massive logs.