Bug 3994 - [Anolis8][Anck 5.10][nightly] The tpci case hangs the ECS machine during LTP testing
Summary: [Anolis8][Anck 5.10][nightly] The tpci case hangs the ECS machine during LTP testing
Status: CONFIRMED
Alias: None
Product: Antest
Classification: Infrastructures
Component: Test cases
Version: unspecified
Hardware: All Linux
Importance: P3-Medium S3-normal
Target Milestone: ---
Assignee: Jacob
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2023-02-08 15:32 UTC by shanxifanshi
Modified: 2023-03-16 16:02 UTC (History)
7 users (show)

See Also:


Attachments

Description shanxifanshi alibaba_cloud_group 2023-02-08 15:32:28 UTC
[Defect description]:
During LTP testing, the tpci case causes the ECS machine to hang.
Test code path: testcases/kernel/device-drivers/pci/tpci_user/tpci.c


[Reproduction rate]:
High, about 1 in 2 runs

[Steps to reproduce]:
git clone https://github.com/linux-test-project/ltp.git -b 20200930
cd ltp
make autotools
./configure
make
make install
/opt/ltp/runltp -f kernel_misc -s tpci

Kernel information:
# uname -r
5.10.134-297.git.13be088f0e49.an8.x86_64

OS information:
# cat /etc/os-release
NAME="Anolis OS"
VERSION="8.6"
ID="anolis"
ID_LIKE="rhel fedora centos"
VERSION_ID="8.6"
PLATFORM_ID="platform:an8"
PRETTY_NAME="Anolis OS 8.6"
ANSI_COLOR="0;31"
HOME_URL="https://openanolis.cn/"

CPU information:
# lscpu
Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              32
On-line CPU(s) list: 0-31
Thread(s) per core:  2
Core(s) per socket:  16
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
BIOS Vendor ID:      Intel
CPU family:          6
Model:               79
Model name:          Intel(R) Xeon(R) CPU E5-2682 v4 @ 2.50GHz
BIOS Model name:     Intel(R) Xeon(R) CPU E5-2682 v4 @ 2.50GHz
Stepping:            1
CPU MHz:             2493.215
CPU max MHz:         2500.0000
CPU min MHz:         1200.0000
BogoMIPS:            4988.52
Virtualization:      VT-x
L1d cache:           32K
L1i cache:           32K
L2 cache:            256K
L3 cache:            40960K
NUMA node0 CPU(s):   0-31
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 invpcid_single pti intel_ppin tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm rdt_a rdseed adx smap intel_pt xsaveopt cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local dtherm arat pln pts

Memory information:
# free -h
              total        used        free      shared  buff/cache   available
Mem:          125Gi       1.2Gi       120Gi        65Mi       3.4Gi       123Gi
Swap:         2.0Gi          0B       2.0Gi


[Expected result]:
The case passes.

[Actual result]:
The ECS machine hangs while the case is running.

[Test analysis]:
Comment 1 shanxifanshi alibaba_cloud_group 2023-02-08 15:34:15 UTC
Known issue: while the tpci case runs, it takes PCI devices offline, and the ECS machine hangs.
Comment 2 shanxifanshi alibaba_cloud_group 2023-02-08 15:37:30 UTC
(In reply to shanxifanshi from comment #1)
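> Known issue: while the tpci case runs, it takes PCI devices offline, and the ECS machine hangs.

---A check using systemd-detect-virt --vm has been added to the case; if the current machine is a VM, the case is skipped.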
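
The skip check described above could look roughly like the sketch below. This is not the actual change made to the case: the function name should_skip_tpci and the DETECT_VIRT override variable are hypothetical, introduced only so the logic can be tried on a machine without systemd. The one real fact it relies on is that systemd-detect-virt --vm exits with 0 when VM virtualization is detected and non-zero otherwise.

```shell
#!/bin/sh
# Sketch only: decide whether the tpci case should be skipped.
# should_skip_tpci and DETECT_VIRT are hypothetical names, not part of LTP.

should_skip_tpci() {
    # Allow the detection command to be overridden for dry runs (assumption).
    detect="${DETECT_VIRT:-systemd-detect-virt}"

    # systemd-detect-virt --vm exits 0 when a VM hypervisor is detected,
    # and non-zero on bare metal or when the command is unavailable.
    if "$detect" --vm >/dev/null 2>&1; then
        return 0    # inside a VM: skip the case
    fi
    return 1        # not a VM (or detection unavailable): do not skip
}

if should_skip_tpci; then
    echo "tpci: skipped (running inside a VM)"
else
    echo "tpci: would run"
fi
```

If systemd-detect-virt is missing, this sketch treats the machine as not being a VM and lets the case run; a stricter wrapper could skip in that situation as well.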