Bug 5972 - [ANCK-5.10][aarch64]xfstests->ext4-2测试用例generic/299失败,fio: job 'direct_aio' (state=5) hasn't exited in 300 seconds, it appears to be stuck. Doing forceful exit of this job,fio: file hash not empty on exit
Summary: [ANCK-5.10][aarch64]xfstests->ext4-2测试用例generic/299失败,fio: job 'direct_aio' (...
Status: NEW
Alias: None
Product: Antest
Classification: Infrastructures
Component: 测试用例 (show other bugs) 测试用例
Version: unspecified
Hardware: All Linux
: P3-Medium S3-normal
Target Milestone: ---
Assignee: yunmeng365524
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2023-07-25 14:42 UTC by yunhe123
Modified: 2023-10-18 14:01 UTC (History)
6 users (show)

See Also:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description yunhe123 alibaba_cloud_group 2023-07-25 14:42:21 UTC
Description of problem:
[anolis8][5.10][nightly][aarch64]xfstests->ext4-2测试用例generic/299失败,测试日志如下:
generic/299       - output mismatch (see /tmp/tone/run/xfstests/results//generic/299.out.bad)
    --- tests/generic/299.out   2023-07-25 11:42:54.894801410 +0800
    +++ /tmp/tone/run/xfstests/results//generic/299.out.bad     2023-07-25 14:11:34.262801410 +0800
    @@ -3,3 +3,5 @@
     Run fio with random aio-dio pattern

     Start fallocate/truncate loop
    +fio: job 'direct_aio' (state=5) hasn't exited in 300 seconds, it appears to be stuck. Doing forceful exit of this job.
    +fio: file hash not empty on exit
    ...
    (Run 'diff -u /tmp/tone/run/xfstests/tests/generic/299.out /tmp/tone/run/xfstests/results//generic/299.out.bad'  to see the entire diff)

# diff -u /tmp/tone/run/xfstests/tests/generic/299.out /tmp/tone/run/xfstests/results//generic/299.out.bad
--- /tmp/tone/run/xfstests/tests/generic/299.out        2023-07-25 11:42:54.894801410 +0800
+++ /tmp/tone/run/xfstests/results//generic/299.out.bad 2023-07-25 14:11:34.262801410 +0800
@@ -3,3 +3,5 @@
 Run fio with random aio-dio pattern

 Start fallocate/truncate loop
+fio: job 'direct_aio' (state=5) hasn't exited in 300 seconds, it appears to be stuck. Doing forceful exit of this job.
+fio: file hash not empty on exit



Version-Release number of selected component (if applicable):
#fio --version
fio-3.27


Steps to Reproduce:
1. 准备环境
wipefs -a --force /dev/vdb1  
wipefs -a --force /dev/vdb2
wipefs -a --force /dev/vdb3 
wipefs -a --force /dev/vdb4

mkfs -t ext4 -L OStest -q -f /dev/vdb1
mkfs -t ext4 -L OStest -q -f /dev/vdb2
mkfs -t ext4 -L OStest -q -f /dev/vdb3
mkfs -t ext4 -L OStest -q -f /dev/vdb4

export FSTYP=ext4
export TEST_DIR=/fs/vdb1
export TEST_DEV=/dev/vdb1
export SCRATCH_MNT=/fs/vdb2
export SCRATCH_DEV=/dev/vdb2
mkdir -p /fs/vdb1 /fs/vdb2

2. 下载xfstests测试代码

3. 编译测试源码
git clone --branch anck-5.10 https://gitee.com/anolis/xfstests.git
cd xfstests
export CFLAGS="-fcommon"
make
make install

Actual results:
用例执行fail


Expected results:
用例执行pass


Additional info:
[root@nu4f13165 ~]# uname -r
5.10.134-735.git.dd0596c29.an8.aarch64
[root@nu4f13165 ~]# cat /etc/os-release
NAME="Anolis OS"
VERSION="8.8"
ID="anolis"
ID_LIKE="rhel fedora centos"
VERSION_ID="8.8"
PLATFORM_ID="platform:an8"
PRETTY_NAME="Anolis OS 8.8"
ANSI_COLOR="0;31"
HOME_URL="https://openanolis.cn/"

[root@nu4f13165 ~]#
[root@nu4f13165 ~]# free -h
              total        used        free      shared  buff/cache   available
Mem:          753Gi       3.5Gi       744Gi        10Mi       5.3Gi       746Gi
Swap:         2.0Gi          0B       2.0Gi
[root@nu4f13165 ~]# lscpu
Architecture:        aarch64
Byte Order:          Little Endian
CPU(s):              96
On-line CPU(s) list: 0-95
Thread(s) per core:  1
Core(s) per socket:  48
Socket(s):           2
NUMA node(s):        1
Vendor ID:           HiSilicon
BIOS Vendor ID:      HiSilicon
Model:               0
Model name:          Kunpeng-920
BIOS Model name:     HUAWEI Kunpeng 920 5250
Stepping:            0x1
CPU MHz:             2600.000
CPU max MHz:         2600.0000
CPU min MHz:         200.0000
BogoMIPS:            200.00
L1d cache:           64K
L1i cache:           64K
L2 cache:            512K
L3 cache:            24576K
NUMA node0 CPU(s):   0-95
Flags:               fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma dcpop asimddp asimdfhm
Comment 1 zhixin01 alibaba_cloud_group 2023-07-27 17:45:34 UTC
[anolis8][4.19][x86_64][nightly]xfstests->ext4-2-bigalloc测试用例generic/299也有同样的失败

generic/299 415s ... - output mismatch (see /tmp/tone/run/xfstests/results//generic/299.out.bad)
    --- tests/generic/299.out   2023-07-27 15:34:04.075991784 +0800
    +++ /tmp/tone/run/xfstests/results//generic/299.out.bad     2023-07-27 17:37:00.509966953 +0800
    @@ -3,3 +3,5 @@
     Run fio with random aio-dio pattern

     Start fallocate/truncate loop
    +fio: job 'direct_aio' (state=5) hasn't exited in 300 seconds, it appears to be stuck. Doing forceful exit of this job.
    +fio: file hash not empty on exit
    ...
    (Run 'diff -u /tmp/tone/run/xfstests/tests/generic/299.out /tmp/tone/run/xfstests/results//generic/299.out.bad'  to see the entire diff)
Ran: generic/299
Failures: generic/299
Failed 1 of 1 tests

# fio --version
fio-3.27

# uname -r
4.19.91-796.git.8a0e92818.an8.x86_64
Comment 2 yunhe123 alibaba_cloud_group 2023-10-13 17:18:48 UTC
alinux3-5.10.134-16_rc1.al8内核依旧存在此问题;
失败日志如下:
generic/299 164s ... - output mismatch (see /var/tmp/tone/run/xfstests/results//generic/299.out.bad)
    --- tests/generic/299.out   2023-10-11 18:02:42.753864780 +0800
    +++ /var/tmp/tone/run/xfstests/results//generic/299.out.bad 2023-10-13 17:11:12.319214402 +0800
    @@ -3,3 +3,5 @@
     Run fio with random aio-dio pattern

     Start fallocate/truncate loop
    +fio: job 'buffered-aio-verifier' (state=5) hasn't exited in 300 seconds, it appears to be stuck. Doing forceful exit of this job.
    +fio: file hash not empty on exit
    ...
    (Run 'diff -u /var/tmp/tone/run/xfstests/tests/generic/299.out /var/tmp/tone/run/xfstests/results//generic/299.out.bad'  to see the entire diff)
Ran: generic/299
Failures: generic/299
Failed 1 of 1 tests

[tone]Error: The return code of run() in run.sh is not 0
generic/299: Failed