Bug 19215 - [ANCK6.6.71-3 rc1][aarch64][倚天710机器]xfstests xfs文件系统,generic/630测试过程中触发coredump
Summary: [ANCK6.6.71-3 rc1][aarch64][倚天710机器]xfstests xfs文件系统,generic/630测试过程中触发coredump
Status: CLOSED FIXED
Alias: None
Product: Antest
Classification: Infrastructures
Component: 测试用例 (show other bugs) 测试用例
Version: unspecified
Hardware: aarch64 Linux
: P3-Medium S3-normal
Target Milestone: ---
Assignee: chenzhuo
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2025-03-05 14:17 UTC by zhixin01
Modified: 2025-03-12 17:42 UTC (History)
6 users (show)

See Also:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description zhixin01 alibaba_cloud_group 2025-03-05 14:17:31 UTC
[缺陷描述]:
xfstests xfs文件系统,generic/630测试过程中触发coredump

测试日志如下:
generic/630       - output mismatch (see /var/tmp/tone/run/xfstests/results//generic/630.out.bad)
    --- tests/generic/630.out   2025-03-04 13:43:34.325778348 +0800
    +++ /var/tmp/tone/run/xfstests/results//generic/630.out.bad 2025-03-05 13:50:26.181860302 +0800
    @@ -1,2 +1,6 @@
     QA output created by 630
    +ASSERT: offset 37 should be 0x58, got 0x0!
    +/var/tmp/tone/run/xfstests/tests/generic/630: line 29: 3631860 Aborted                 (core dumped) $here/src/deduperace -c $SCRATCH_MNT -n $nr_ops
    +ASSERT: offset 37 should be 0x58, got 0x0!
    +/var/tmp/tone/run/xfstests/tests/generic/630: line 32: 3631868 Aborted                 (core dumped) $here/src/deduperace -c $SCRATCH_MNT -n $nr_ops -w
     Silence is golden.
    ...
    (Run 'diff -u /var/tmp/tone/run/xfstests/tests/generic/630.out /var/tmp/tone/run/xfstests/results//generic/630.out.bad'  to see the entire diff)
Ran: generic/630
Failures: generic/630
Failed 1 of 1 tests

[重现概率]:
必现

[重现环境]:
环境信息:倚天710机器
100.82.243.208

#uname -r
6.6.71-3_rc1.al8.aarch64

#cat /etc/os-release
NAME="Alibaba Cloud Linux"
VERSION="3 (Soaring Falcon)"
ID="alinux"
ID_LIKE="rhel fedora centos anolis"
VERSION_ID="3"
UPDATE_ID="10"
PLATFORM_ID="platform:al8"
PRETTY_NAME="Alibaba Cloud Linux 3 (Soaring Falcon)"
ANSI_COLOR="0;31"
HOME_URL="https://www.aliyun.com/"

#lscpu
Architecture:        aarch64
Byte Order:          Little Endian
CPU(s):              124
On-line CPU(s) list: 0-123
Thread(s) per core:  1
Core(s) per socket:  124
Socket(s):           1
NUMA node(s):        2
Vendor ID:           ARM
BIOS Vendor ID:      T-HEAD
Model:               0
Model name:          Neoverse-N2
BIOS Model name:     Yitian710-124
Stepping:            r0p0
CPU MHz:             2750.002
BogoMIPS:            100.00
Hypervisor vendor:   Alibaba
Virtualization type: full
L1d cache:           64K
L1i cache:           64K
L2 cache:            1024K
L3 cache:            65536K
NUMA node0 CPU(s):   0-61
NUMA node1 CPU(s):   62-123
Flags:               fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh

#free -h
              total        used        free      shared  buff/cache   available
Mem:          251Gi       5.7Gi       243Gi       9.0Mi       4.0Gi       245Gi
Swap:         2.0Gi       116Mi       1.9Gi

#cat /proc/cmdline
BOOT_IMAGE=(hd0,gpt2)/boot/vmlinuz-6.6.71-3_rc1.al8.aarch64 root=UUID=5d4c9cac-5324-464b-8971-09deff261ae7 ro biosdevname=0 rd.driver.pre=ahci iommu.passthrough=1 iommu.strict=0 nospectre_bhb ssbd=force-off systemd.unified_cgroup_hierarchy=0 cgroup.memory=nokmem console=ttyS0,115200 fsck.repair=yes crashkernel=0M-2G:0M,2G-256G:256M,256G-1024G:320M,1024G-:384M

[重现步骤]:
# 下载并编译测试套
git clone http://gitlab-sp.alibaba-inc.com/AKTF/xfstests-dev.git -b kernel_6.6
cd xfstests-dev
make

# 测试准备
mkfs.xfs /dev/nvme0n1p1
mkfs.xfs /dev/nvme0n1p2
export FSTYP=xfs
export TEST_DEV=/dev/nvme0n1p1
export SCRATCH_DEV=/dev/nvme0n1p2
export TEST_DIR=/fs/nvme0n1p1
export SCRATCH_MNT=/fs/nvme0n1p2

# 执行测试
./check generic/630

[期望结果]:
用例执行PASS

[实际结果]:
用例执行fail,执行过程中触发coredump

[分析]
以下命令测试过程中触发coredump
xfstests/src/deduperace -c /fs/nvme0n1p2 -n 10000
xfstests/src/deduperace -c /fs/nvme0n1p2 -n 10000 -w

coredump信息:
#gdb ./deduperace /var/lib/systemd/coredump/core.deduperace.0.1178be52e3e94de6af82ee7aedea844f.2609521.1741080653000000
GNU gdb (GDB) Alibaba Cloud Linux 9.2-7.2.0.5.al8
Copyright (C) 2020 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.
Type "show copying" and "show warranty" for details.
This GDB was configured as "aarch64-redhat-linux-gnu".
Type "show configuration" for configuration details.
For bug reporting instructions, please see:
<http://www.gnu.org/software/gdb/bugs/>.
Find the GDB manual and other documentation resources online at:
    <http://www.gnu.org/software/gdb/documentation/>.

For help, type "help".
Type "apropos word" to search for commands related to "word"...
Reading symbols from ./deduperace...
[New LWP 2609522]
[New LWP 2609521]
[New LWP 2609523]
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib64/libthread_db.so.1".
Core was generated by `/var/tmp/tone/run/xfstests/src/deduperace -c /fs/nvme0n1p2 -n 10000'.
Program terminated with signal SIGABRT, Aborted.
#0  0x0000ffffb30ecdd8 in raise () from /lib64/libc.so.6
[Current thread is 1 (Thread 0xffffb3000100 (LWP 2609522))]
(gdb) bt
#0  0x0000ffffb30ecdd8 in raise () from /lib64/libc.so.6
#1  0x0000ffffb30d454c in abort () from /lib64/libc.so.6
#2  0x000000000040139c in complain (bad=<optimized out>, offset=<optimized out>) at deduperace.c:116
#3  mreader (data=<optimized out>) at deduperace.c:134
#4  0x0000ffffb3307f74 in start_thread () from /lib64/libpthread.so.0
#5  0x0000ffffb30d8f9c in thread_start () from /lib64/libc.so.6
(gdb)
Comment 1 chenzhuo alibaba_cloud_group 2025-03-05 14:51:47 UTC
ali6000上的已知问题,相关修复patch已合入,rc2验证即可
Comment 2 zhixin01 alibaba_cloud_group 2025-03-12 17:42:04 UTC
在6.6.71-3_rc2.al8.aarch64,xfstests xfs文件系统,generic/630用例执行成功,问题已解决,bug单关闭。

#./check generic/630
FSTYP         -- xfs (non-debug)
PLATFORM      -- Linux/aarch64 v43c07451 6.6.71-3_rc2.al8.aarch64 #1 SMP PREEMPT_DYNAMIC Fri Mar  7 12:41:15 CST 2025
MKFS_OPTIONS  -- -f /dev/nvme0n1p2
MOUNT_OPTIONS -- /dev/nvme0n1p2 /fs/nvme0n1p2

generic/630 3s ...  3s
Ran: generic/630
Passed all 1 tests

#uname -r
6.6.71-3_rc2.al8.aarch64