Signal 11 is a Segmentation fault, so definitely looks like a bug.
To analyse what happened I would need the data node log and
the trace files. Those should be found in
/srv/hops/mysql-cluster I think.
I think the segfault is the triggerr for all the log messages you
displayed above. Also good to know which version you are
running to see if you hit a bug which is already fixed in a newer
21.04 version.
First look at it indicates that you hit the bugs HOPSWORKS-2652 or HOPSWORKS-2651. They were fixed in RonDB 21.04.1 and caused a bit random failures in the ordered index queries.
An update to 21.04.8 is what I would suggest, this version has been very stable and no serious issues
have been found in it so far. I will look one more time before I update issue in GitHub.
Here is a reference to the bugs in the release notes of 21.04.1:
I read the documentation and would update to 21.0.4.6, but RonDB is used by Hospworks 2.3; there is any documentation that explain how this update can be maded ?
I’ll look into this tomorrow and point you to the docs about upgrades and will see if I can ensure
that 21.04.8 DAL is uploaded. The 21.04.6 DAL can also be used against 21.04.8 in the data
nodes. There are no bug fixes in the DAL parts in 21.04.7 and 21.04.8.
Took a bit more time than expected, working on a new section in the documentation
about upgrades and downgrades of RonDB. Will be released in conjunction with new
RonDB releases here in October.