Bug 9488 - 使用rbd-nbd挂载rbd镜像会出现内核死锁的现象
Summary: 使用rbd-nbd挂载rbd镜像会出现内核死锁的现象
Status: NEW
Alias: None
Product: Anolis OS 8
Classification: Anolis OS
Component: kernel - anck-4.19 (show other bugs) kernel - anck-4.19
Version: 8.6
Hardware: x86_64 Linux
: P3-Medium S1-blocker
Target Milestone: ---
Assignee: maqiao_mq
QA Contact: shuming
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2024-07-09 12:40 UTC by outman111
Modified: 2024-07-09 12:40 UTC (History)
0 users

See Also:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description outman111 2024-07-09 12:40:55 UTC
Description of problem:
使用rbd-nbd工具挂载rbd镜像,当挂载两个rbd镜像时,内核出现了死锁的现象,内核报错如下
[Thu Jul  4 15:15:29 2024] INFO: task mount.ntfs-3g:647458 blocked for more than 240 seconds.
[Thu Jul  4 15:15:29 2024]       Tainted: G            E     4.19.91-26.an8.x86_64 #1
[Thu Jul  4 15:15:29 2024] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Thu Jul  4 15:15:29 2024] mount.ntfs-3g   D    0 647458      1 0x00000000
[Thu Jul  4 15:15:29 2024] Call Trace:
[Thu Jul  4 15:15:29 2024]  ? __schedule+0x294/0x6b0
[Thu Jul  4 15:15:29 2024]  schedule+0x29/0xc0
[Thu Jul  4 15:15:29 2024]  blk_queue_enter+0x10b/0x1b0
[Thu Jul  4 15:15:29 2024]  ? wait_woken+0x80/0x80
[Thu Jul  4 15:15:29 2024]  generic_make_request+0x27d/0x370
[Thu Jul  4 15:15:29 2024]  ? page_cache_tree_insert+0x90/0xb0
[Thu Jul  4 15:15:29 2024]  submit_bio+0x34/0x180
[Thu Jul  4 15:15:29 2024]  ? __disk_get_part+0x5/0x30
[Thu Jul  4 15:15:29 2024]  ? guard_bio_eod+0x32/0x110
[Thu Jul  4 15:15:29 2024]  mpage_readpages+0x154/0x180
[Thu Jul  4 15:15:29 2024]  ? check_disk_change+0x60/0x60
[Thu Jul  4 15:15:29 2024]  read_pages+0x5b/0x170
[Thu Jul  4 15:15:29 2024]  __do_page_cache_readahead+0x19e/0x1b0
[Thu Jul  4 15:15:29 2024]  ondemand_readahead+0x205/0x2d0
[Thu Jul  4 15:15:29 2024]  generic_file_buffered_read+0x5ce/0x910
[Thu Jul  4 15:15:29 2024]  new_sync_read+0xf1/0x140
[Thu Jul  4 15:15:29 2024]  vfs_read+0x91/0x130
[Thu Jul  4 15:15:29 2024]  ksys_pread64+0x61/0xa0
[Thu Jul  4 15:15:29 2024]  ? ksys_read+0x43/0xb0
[Thu Jul  4 15:15:29 2024]  do_syscall_64+0x5f/0x1b0
[Thu Jul  4 15:15:29 2024]  ? prepare_exit_to_usermode+0x4c/0xb0
[Thu Jul  4 15:15:29 2024]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[Thu Jul  4 15:15:29 2024] RIP: 0033:0x7fca82612388
[Thu Jul  4 15:15:29 2024] Code: Bad RIP value.
[Thu Jul  4 15:15:29 2024] RSP: 002b:00007ffe6df63178 EFLAGS: 00000246 ORIG_RAX: 0000000000000011
[Thu Jul  4 15:15:29 2024] RAX: ffffffffffffffda RBX: 0000000000001000 RCX: 00007fca82612388
[Thu Jul  4 15:15:29 2024] RDX: 0000000000001000 RSI: 00000000008577f0 RDI: 0000000000000003
[Thu Jul  4 15:15:29 2024] RBP: 0000000000000000 R08: 0000000000846016 R09: 0000000000858860
[Thu Jul  4 15:15:29 2024] R10: 0000000000050000 R11: 0000000000000246 R12: 0000000000050000
[Thu Jul  4 15:15:29 2024] R13: 00000000008577f0 R14: 00007fca8244cc20 R15: 000000000084b210

ps -A -ostat,cmd,pid,ppid,wchan | grep ^[Dd]
Ds   mount.ntfs-3g /dev/nbd0p2 /  647458       1 -
D    ls --color=auto              647507  611780 -
D    mount -l                     647674  647623 -

Version-Release number of selected component (if applicable):
lsb_release -a
LSB Version:    :core-4.1-amd64:core-4.1-noarch
Distributor ID: AnolisOS
Description:    Anolis OS release 8.6
Release:        8.6
Codename:       n/a
uname -r
4.19.91-26.an8.x86_64

存储服务端
ceph集群版本
ceph-15.2.17
内核和操作系统
NAME="openEuler"
VERSION="22.03 LTS"
ID="openEuler"
VERSION_ID="22.03"
PRETTY_NAME="openEuler 22.03 LTS"
ANSI_COLOR="0;31"

5.10.0-60.18.0.50.oe2203.x86_64

How reproducible:
使用rbd-nbd工具挂载两个rbd镜像

Steps to Reproduce:
1. 在服务端使用ceph-15.2.17创建一个存储集群
2. 在服务端创建rbd存储池,创建两个rbd镜像,使用win10快速格式化这两个rbd镜像
3. 龙蜥平台安装rbd-nbd工具和ntfs-3g工具,加载nbd内核
4、rbd-nbd挂载rbd镜像,使用ntfs-3g挂载rbd镜像中的ntfs文件系统
5、重复操作第4步

Actual results:
内核出现锁死

Expected results:
rbd镜像挂载正常,ntfs文件系统挂载正常,可以正常读写

Additional info: