Bug 2228 - [Anolis8 4.19][debug-kernel][nightly][x86_64] rcutorture的defaults测试集下2个case fail
Summary: [Anolis8 4.19][debug-kernel][nightly][x86_64] rcutorture的defaults测试集下2个case fail
Status: NEW
Alias: None
Product: Anolis OS 8
Classification: Anolis OS
Component: Others (show other bugs) Others
Version: 8.6
Hardware: x86_64 Linux
: P3-Medium S3-normal
Target Milestone: ---
Assignee: Jacob
QA Contact: shuming
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2022-09-20 18:05 UTC by zhixin01
Modified: 2022-11-24 16:10 UTC (History)
6 users (show)

See Also:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description zhixin01 alibaba_cloud_group 2022-09-20 18:05:44 UTC
[缺陷描述]:
Anolis8 ck-4.19 x86_64环境(4.19.91-465.git.f167f3665.an8.x86_64+debug)nightly测试,defaults测试集下busted和busted_srcud用例执行失败,报错日志如下

buste用例执行过程dmesg部分失败日志如下:
[ 2082.268730] busted-torture: rtc:           (null) ver: 72517 tfle: 0 rta: 72517 rtaf: 0 rtf: 72508 rtmbe: 4952 rtbe: 0 rtbke: 0 rtbre: 0 rtbf: 0 rtb: 0 nt: 9588867 onoff: 0/0:0/0 -1,0:-1,0 0:0 (HZ=1000) barrier: 0/0:0 cbflood: 0
[ 2082.291071] busted-torture: !!! Reader Pipe:  6548941342 128049 17551 519 104 34 14 10 15 12 5166
[ 2082.301160] busted-torture: Reader Batch:  6549092816 0 0 0 0 0 0 0 0 0 0
[ 2082.309156] busted-torture: Free-Block Circulation:  72516 72516 72515 72514 72513 72512 72511 72510 72509 72508 0
[ 2082.320823] busted-torture:--- End of test: FAILURE: nreaders=95 nfakewriters=4 stat_interval=60 verbose=1 test_no_idle_hz=1 shuffle_interval=3 stutter=5 irqreader=1 fqs_duration=0 fqs_holdoff=0 fqs_stutter=3 test_boost=1/0 test_boost_interval=7 test_boost_duration=4 shutdown_secs=0 stall_cpu=0 stall_cpu_holdoff=10 stall_cpu_irqsoff=0 n_barrier_cbs=0 onoff_interval=0 onoff_holdoff=0

busted_srcud执行过程dmesg部分报错日志如下:
[ 3445.339477] busted_srcud-torture: rtc:           (null) ver: 31027 tfle: 0 rta: 31027 rtaf: 0 rtf: 31018 rtmbe: 0 rtbe: 0 rtbke: 0 rtbre: 0 rtbf: 0 rtb: 0 nt: 9362273 onoff: 0/0:0/0 -1,0:-1,0 0:0 (HZ=1000) barrier: 0/0:0 cbflood: 5056
[ 3445.364811] busted_srcud-torture: !!! Reader Pipe:  3319007977 1762334 34 29 44 80 109 93 33 37 89
[ 3445.376194] busted_srcud-torture: Reader Batch:  3313642567 7124522 2697 445 67 16 6 10 5 3 521
[ 3445.387285] busted_srcud-torture: Free-Block Circulation:  31026 31026 31025 31024 31023 31022 31021 31020 31019 31018 0
[ 3445.400559] rcu: busted_srcud-torture: Tree SRCU g0 per-CPU(idx=0): 0(0,0 (null)) 1(0,0 (null)) 2(0,0 (null)) 3(0,0 (null)) 4(0,0 (null)) 5(0,0 (null)) 6(0,0 (null)) 7(0,0 (null)) 8(0,0 (null)) 9(0,0 (null)) 10(0,0 (null)) 11(0,0 (null)) 12(0,0 (null)) 13(0,0 (null)) 14(0,0 (null)) 15(0,0 (null)) 16(0,0 (null)) 17(0,0 (null)) 18(0,0 (null)) 19(0,0 (null)) 20(0,0 (null)) 21(0,0 (null)) 22(0,0 (null)) 23(0,0 (null)) 24(0,0 (null)) 25(0,0 (null)) 26(0,0 (null)) 27(0,0 (null)) 28(0,0 (null)) 29(0,0 (null)) 30(0,0 (null)) 31(0,0 (null)) 32(0,0 (null)) 33(0,0 (null)) 34(0,0 (null)) 35(0,0 (null)) 36(0,0 (null)) 37(0,0 (null)) 38(0,0 (null)) 39(0,0 (null)) 40(0,0 (null)) 41(0,0 (null)) 42(0,0 (null)) 43(0,0 (null)) 44(0,0 (null)) 45(0,0 (null)) 46(0,0 (null)) 47(0,0 (null)) 48(0,0 (null)) 49(0,0 (null))
[ 3445.564958] busted_srcud-torture:--- End of test: FAILURE: nreaders=95 nfakewriters=4 stat_interval=60 verbose=1 test_no_idle_hz=1 shuffle_interval=3 stutter=5 irqreader=1 fqs_duration=0 fqs_holdoff=0 fqs_stutter=3 test_boost=1/0 test_boost_interval=7 test_boost_duration=4 shutdown_secs=0 stall_cpu=0 stall_cpu_holdoff=10 stall_cpu_irqsoff=0 n_barrier_cbs=0 onoff_interval=0 onoff_holdoff=0


[重现步骤]:
1)安装kernel-debug等内核:
rpm -ivh http://ci-storage.alibaba.net/kernel/anolis8/ck-4.19/x86_64/20220919_465/kernel-debug-core-4.19.91-465.git.f167f3665.an8.x86_64.rpm
http://ci-storage.alibaba.net/kernel/anolis8/ck-4.19/x86_64/20220919_465/kernel-debug-modules-4.19.91-465.git.f167f3665.an8.x86_64.rpm
rpm -ivh http://ci-storage.alibaba.net/kernel/anolis8/ck-4.19/x86_64/20220919_465/kernel-debug-4.19.91-465.git.f167f3665.an8.x86_64.rpm
rpm -ivh http://ci-storage.alibaba.net/kernel/anolis8/ck-4.19/x86_64/20220919_465/kernel-debug-devel-4.19.91-465.git.f167f3665.an8.x86_64.rpm
rpm -ivh http://ci-storage.alibaba.net/kernel/anolis8/ck-4.19/x86_64/20220919_465/kernel-headers-4.19.91-465.git.f167f3665.an8.x86_64.rpm
2)reboot环境使得新内核生效
3)modprobe rcutorture torture_type=busted
4)sleep 450
5)rmmod rcutorture
6)modprobe rcutorture torture_type=busted_srcud
7)sleep 450
8)rmmod rcutorture


[期望结果]:
rcutorture测试pass
[实际结果]:
defaults测试集下busted和busted_srcud用例执行失败

[测试环境]: 
# uname -r
4.19.91-465.git.f167f3665.an8.x86_64+debug

# cat /etc/os-release
NAME="Anolis OS"
VERSION="8.6"
ID="anolis"
ID_LIKE="rhel fedora centos"
VERSION_ID="8.6"
PLATFORM_ID="platform:an8"
PRETTY_NAME="Anolis OS 8.6"
ANSI_COLOR="0;31"
HOME_URL="https://openanolis.cn/"

# cat /proc/cmdline
BOOT_IMAGE=(hd0,gpt2)/vmlinuz-4.19.91-465.git.f167f3665.an8.x86_64+debug root=UUID=e13209eb-5fab-4ec0-8159-6f0599a34b49 ro console=tty0 console=ttyS0,115200 rd.driver.pre=ahci crashkernel=0M-2G:0M,2G-8G:192M,8G-:256M

# lscpu
Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              96
On-line CPU(s) list: 0-95
Thread(s) per core:  2
Core(s) per socket:  24
Socket(s):           2
NUMA node(s):        1
Vendor ID:           GenuineIntel
BIOS Vendor ID:      Intel(R) Corporation
CPU family:          6
Model:               85
Model name:          Intel(R) Xeon(R) Platinum 8163 CPU @ 2.50GHz
BIOS Model name:     Intel(R) Xeon(R) Platinum 8163 CPU @ 2.50GHz
Stepping:            4
CPU MHz:             1814.043
CPU max MHz:         3100.0000
CPU min MHz:         1000.0000
BogoMIPS:            5000.00
Virtualization:      VT-x
L1d cache:           32K
L1i cache:           32K
L2 cache:            1024K
L3 cache:            33792K
NUMA node0 CPU(s):   0-95
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 invpcid_single pti intel_ppin ssbd mba ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm mpx rdt_a avx512f avx512dq rdseed adx smap clflushopt clwb intel_pt avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local dtherm ida arat pln pts pku ospke md_clear flush_l1d arch_capabilities

# dmidecode -t 0
# dmidecode 3.3
Getting SMBIOS data from sysfs.
SMBIOS 3.0.0 present.

Handle 0x0000, DMI type 0, 24 bytes
BIOS Information
        Vendor: Inspur
        Version: 3.20.34
        Release Date: 03/09/2019
        Address: 0xF0000
        Runtime Size: 64 kB
        ROM Size: 16 MB
        Characteristics:
                PCI is supported
                BIOS is upgradeable
                BIOS shadowing is allowed
                Boot from CD is supported
                Selectable boot is supported
                BIOS ROM is socketed
                EDD is supported
                5.25"/1.2 MB floppy services are supported (int 13h)
                3.5"/720 kB floppy services are supported (int 13h)
                3.5"/2.88 MB floppy services are supported (int 13h)
                Print screen service is supported (int 5h)
                Serial services are supported (int 14h)
                Printer services are supported (int 17h)
                ACPI is supported
                USB legacy is supported
                BIOS boot specification is supported
                Targeted content distribution is supported
                UEFI is supported

[出现频率]
必现
Comment 1 yunhe123 alibaba_cloud_group 2022-10-10 11:01:35 UTC
anolis8-5.10-debug-arm nightly这两个用例存在同样的报错,日志同上。
Comment 2 anolislw alibaba_cloud_group 2022-11-24 16:10:11 UTC
anolis8.6 ck-4.19 aarch64 kernel-debug nightly测试中这两case也存在一样的问题