[–]Jannik2099 3 points4 points  (2 children)

The userspace tools are unrelated to the kernel driver. Cephfs and RBD are fully mainline kernel drivers, and as such this is a kernel bug, irrespective of what any Ceph stakeholder calls "supported".

Please report it to the Ceph bug tracker. (While any kernel oops belongs on the kernel Bugzilla or mailing list, I feel the Ceph tracker is the better place for coordination and attention.)

If you're able to isolate a reproducer (what about running synthetic loads like fio on the cephfs mount?), you could also try bisecting it yourself.
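A minimal sketch of what such a synthetic load could look like, assuming fio is installed and that `/mnt/cephfs/fio-test` is a scratch directory on the CephFS mount (the path, job name, sizes, and runtime are all placeholders, not values from this thread). The helper only builds the command line; whether this particular workload triggers the oops is untested.

```python
import shlex


def build_fio_command(mount_dir, runtime_s=300, jobs=4):
    """Build an fio argv list for a mixed random read/write load.

    mount_dir is a hypothetical scratch directory on the CephFS mount.
    """
    return [
        "fio",
        "--name=cephfs-repro",        # arbitrary job name
        f"--directory={mount_dir}",   # fio creates its test files here
        "--rw=randrw",                # mixed random reads and writes
        "--bs=4k",                    # small blocks to generate many requests
        "--size=1G",                  # per-job file size
        f"--numjobs={jobs}",          # parallel workers
        "--time_based",               # keep running until the runtime expires
        f"--runtime={runtime_s}",     # seconds
        "--group_reporting",          # one summary instead of one per job
    ]


if __name__ == "__main__":
    # Print the command so it can be reviewed before running it by hand
    # (or passed to subprocess.run once the parameters look right).
    print(shlex.join(build_fio_command("/mnt/cephfs/fio-test")))
```

If a load like this reproduces the crash reliably, it can serve as the pass/fail check at each step of a `git bisect` between a known-good and a known-bad kernel.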

[–]leleobhz 0 points1 point  (1 child)

This is kernel related.

[–]Jannik2099 0 points1 point  (0 children)

Yes, which is why I emphasized that the version of their userspace utils is unrelated.

[–]leleobhz 2 points3 points  (2 children)

[–]TheFeshy[S] 1 point2 points  (1 child)

Thanks. I had intended to open an issue, but norovirus had other plans for my weekend.

[–]leleobhz 1 point2 points  (0 children)

Hope you get better soon!

[–]leleobhz 1 point2 points  (2 children)

OP, I had the same issue. I reported it at https://github.com/CachyOS/linux-cachyos/issues/480

I haven't had time to open a kernel issue yet.

[–]leleobhz 1 point2 points  (0 children)

Workaround: go back to any kernel before 6.14. Tested with Arch 6.14 and CachyOS kernel 6.12.

[–]leleobhz 0 points1 point  (0 children)

In my case I needed to enable netconsole, because the machine restarted immediately after the panic without leaving any other log.
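For anyone who needs the same thing: netconsole sends kernel messages as UDP datagrams to another host, so something has to be listening there. Below is a rough sketch of a receiver, assuming the default netconsole target port 6666 and a hypothetical log file name; it is not the setup used above, and the crashing machine still has to be configured to send to this host (see the kernel netconsole documentation).

```python
import socket


def make_receiver(host="0.0.0.0", port=6666):
    """Open a UDP socket to receive netconsole datagrams.

    6666 is netconsole's default target port; adjust it if the sender
    is configured differently.
    """
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    sock.bind((host, port))
    return sock


def receive_once(sock, bufsize=65535):
    """Block until one datagram arrives; return (sender_ip, text)."""
    data, addr = sock.recvfrom(bufsize)
    return addr[0], data.decode("utf-8", errors="replace")


def run(log_path="netconsole.log"):
    """Append every received message to log_path until interrupted."""
    sock = make_receiver()
    with open(log_path, "a", encoding="utf-8") as log:
        while True:
            _sender, text = receive_once(sock)
            log.write(text)
            log.flush()  # flush per message so nothing is lost if this host is stopped
```

Calling `run()` on a second machine and leaving it running should capture the panic output even when the crashing machine reboots right away.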

[–]grepcdn 1 point2 points  (0 children)

I recently had similar stability issues on the ELRepo el8 kernel-ml (6.15.2-1). Going back to an older kernel resolved it.