Summary:
On RHEL 8.6 kernel versions 4.18.0-372.y.z.el8.x86_64 below 4.18.0-372.16.1.el8_6, a communication disruption between cluster nodes may cause the database to freeze or become unavailable.
Symptoms:
- Database processes hang after a communication disruption with another node in the cluster.
- scsi_eh processes and various database processes are reported as blocked in /var/log/messages, with messages similar to the following:
kernel: INFO: task scsi_eh_6:2498 blocked for more than 122 seconds.
kernel: INFO: task ora_ckpt_orcl2:13745 blocked for more than 122 seconds.
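To check a log for the blocked-task messages shown above, a short script can help. The following is a minimal sketch in Python, not a FlashGrid-provided tool: the function names find_blocked_tasks and scan_file are hypothetical, and the pattern assumes the message layout matches the two sample lines.

```python
import re

# Matches kernel hung-task messages such as:
#   kernel: INFO: task scsi_eh_6:2498 blocked for more than 122 seconds.
# The layout is assumed from the sample lines in this article.
_BLOCKED_RE = re.compile(
    r"INFO: task (.+):(\d+) blocked for more than (\d+) seconds"
)


def find_blocked_tasks(lines):
    """Return (task_name, pid, seconds) for every blocked-task message found."""
    results = []
    for line in lines:
        match = _BLOCKED_RE.search(line)
        if match:
            results.append(
                (match.group(1), int(match.group(2)), int(match.group(3)))
            )
    return results


def scan_file(path="/var/log/messages"):
    """Scan a log file; reading /var/log/messages usually requires root."""
    with open(path, errors="replace") as log:
        return find_blocked_tasks(log)
```

Running scan_file() as root on a node returns one tuple per blocked-task message. Entries whose task name starts with scsi_eh, together with blocked database processes, correspond to the symptom described above.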
Affected products:
FlashGrid Cluster or Storage Fabric cluster running RHEL 8, or Oracle Linux 8 with the RHCK kernel
Affected versions:
RHEL/RHCK kernel versions 4.18.0-372.y.z.el8.x86_64 below 4.18.0-372.16.1.el8_6
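The version range above can be checked against the running kernel. The following is a minimal sketch in Python under stated assumptions: the function name is_affected is hypothetical, and the check assumes the release string follows the 4.18.0-372.y.z.el8 pattern given in this article. Any other kernel, including UEK, is reported as not affected by this specific issue.

```python
import platform
import re

# First fixed build in the 4.18.0-372 series, per this article:
# 4.18.0-372.16.1.el8_6
FIXED_BUILD = (16, 1)

# Captures y and z from release strings of the form 4.18.0-372.y.z.el8...
_KERNEL_RE = re.compile(r"^4\.18\.0-372\.(\d+)\.(\d+)\.el8")


def is_affected(release: str) -> bool:
    """Return True if the kernel release is a 4.18.0-372 build older than the fix."""
    match = _KERNEL_RE.match(release)
    if match is None:
        # Not a 4.18.0-372.y.z.el8 kernel (for example UEK): outside the affected range.
        return False
    build = (int(match.group(1)), int(match.group(2)))
    return build < FIXED_BUILD


if __name__ == "__main__":
    # platform.release() returns the same string as `uname -r`.
    release = platform.release()
    status = "affected" if is_affected(release) else "not affected"
    print(f"{release}: {status}")
```

The comparison is done on the numeric (y, z) pair rather than on the raw string, so that, for example, 372.9.1 sorts below 372.16.1.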
Root cause:
A kernel driver issue may cause I/O operations to freeze when communication between cluster nodes is disrupted.
Resolution:
RHEL 8: Update to 8.6 kernel version 4.18.0-372.16.1.el8_6 or newer.
Oracle Linux 8: Update to 8.6 kernel version 4.18.0-372.16.1.el8_6 or newer, or switch to UEKR6 kernel.