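Bug 5705 - [Anolis8][aarch64][debug kernel][internal nightly] LTP mm test suite: oom01 hangs the environment; it cannot restart on its own, and a vmcore is generated after a forced reboot
Summary: [Anolis8][aarch64][debug kernel][internal nightly] LTP mm test suite: oom01 hangs the environment; it cannot restart on its own, forced reboot...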
Status: NEW
Alias: None
Product: Antest
Classification: Infrastructures
Component: Test cases
Version: unspecified
Hardware: All Linux
Importance: P3-Medium S3-normal
Target Milestone: ---
Assignee: shuming
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2023-06-30 17:39 UTC by yunhe123
Modified: 2023-06-30 17:40 UTC (History)

See Also:


Attachments
vmcore-dmesg.txt (828.93 KB, text/plain)
2023-06-30 17:39 UTC, yunhe123

Description yunhe123 alibaba_cloud_group 2023-06-30 17:39:09 UTC
Created attachment 819 [details]
vmcore-dmesg.txt

Description of problem:
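On an Anolis8 aarch64 physical machine running the nightly debug kernel, the LTP test case oom01 hangs the environment. The machine cannot restart normally, and a vmcore is generated after a forced reboot. The problem appeared in the nightly run on the evening of the 29th.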

The crash analysis follows (vmcore-dmesg.txt is attached):
      KERNEL: /usr/lib/debug/lib/modules/5.10.134-706.git.2854f78f3.an8.aarch64+debug/vmlinux  [TAINTED]
    DUMPFILE: vmcore  [PARTIAL DUMP]
        CPUS: 96 [OFFLINE: 51]
        DATE: Fri Jun 30 14:22:46 CST 2023
      UPTIME: 02:52:05
LOAD AVERAGE: 21.80, 17.38, 11.44
       TASKS: 1016
    NODENAME: localhost.localdomain
     RELEASE: 5.10.134-706.git.2854f78f3.an8.aarch64+debug
     VERSION: #1 SMP Wed Jun 28 11:50:57 UTC 2023
     MACHINE: aarch64  (unknown Mhz)
      MEMORY: 768 GB
       PANIC: "Kernel panic - not syncing: System is deadlocked on memory"
         PID: 10140
     COMMAND: "kworker/u193:1"
        TASK: ffff561e46b40000  [THREAD_INFO: ffff561e46b40000]
         CPU: 22
       STATE: TASK_RUNNING (PANIC)

crash> bt
PID: 10140  TASK: ffff561e46b40000  CPU: 22  COMMAND: "kworker/u193:1"
 #0 [ffff561da33d6c30] machine_kexec at ffffa0001025c7cc
 #1 [ffff561da33d6cc0] __crash_kexec at ffffa000105cce5c
 #2 [ffff561da33d6ea0] panic at ffffa000120c68dc
 #3 [ffff561da33d6fe0] out_of_memory at ffffa000108ce9c4
 #4 [ffff561da33d70e0] __alloc_pages_may_oom at ffffa000109d4c7c
 #5 [ffff561da33d71e0] __alloc_pages_slowpath.constprop.106 at ffffa000109da200
 #6 [ffff561da33d7470] __alloc_pages_nodemask at ffffa000109db7d0
 #7 [ffff561da33d7560] alloc_pages_current at ffffa00010a2a0b0
 #8 [ffff561da33d75b0] alloc_slab_page at ffffa00010a3d624
 #9 [ffff561da33d75e0] allocate_slab at ffffa00010a40a74
#10 [ffff561da33d7640] ___slab_alloc at ffffa00010a4446c
#11 [ffff561da33d7730] __slab_alloc at ffffa00010a44798
#12 [ffff561da33d7770] kmem_cache_alloc_node at ffffa00010a45a8c
#13 [ffff561da33d7800] copy_process at ffffa0001032c8f8
#14 [ffff561da33d7a00] kernel_clone at ffffa00010330bf4
#15 [ffff561da33d7b10] kernel_thread at ffffa00010332234
#16 [ffff561da33d7c00] call_usermodehelper_exec_work at ffffa0001038e7b4
#17 [ffff561da33d7c20] process_one_work at ffffa0001039c040
#18 [ffff561da33d7d90] worker_thread at ffffa0001039d468
#19 [ffff561da33d7e50] kthread at ffffa000103b00e4
crash>
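
The backtrace above lists the innermost frame first: a kworker was forking a usermode helper when the slab allocation in copy_process fell into the OOM path and panicked. As a triage aid, here is a minimal Python sketch (not part of the original report) that extracts the frames from pasted `bt` output and prints them in call order. The helper names `parse_bt` and `call_chain` are hypothetical, and the sketch assumes every frame line follows the `#N [stack address] symbol at address` layout shown above.

```python
import re

# Matches crash `bt` frame lines such as:
#   " #3 [ffff561da33d6fe0] out_of_memory at ffffa000108ce9c4"
FRAME_RE = re.compile(
    r"^\s*#(?P<index>\d+)\s+\[(?P<sp>[0-9a-f]+)\]\s+"
    r"(?P<symbol>\S+)\s+at\s+(?P<pc>[0-9a-f]+)"
)


def parse_bt(text):
    """Return a list of (frame index, symbol) tuples found in `bt` output."""
    frames = []
    for line in text.splitlines():
        match = FRAME_RE.match(line)
        if match:
            frames.append((int(match.group("index")), match.group("symbol")))
    return frames


def call_chain(frames):
    """Join symbols outermost-first, i.e. in the order the calls happened."""
    return " -> ".join(symbol for _, symbol in reversed(frames))


# A few frames copied from the backtrace in this report.
SAMPLE = """\
 #2 [ffff561da33d6ea0] panic at ffffa000120c68dc
 #3 [ffff561da33d6fe0] out_of_memory at ffffa000108ce9c4
 #4 [ffff561da33d70e0] __alloc_pages_may_oom at ffffa000109d4c7c
#13 [ffff561da33d7800] copy_process at ffffa0001032c8f8
#16 [ffff561da33d7c00] call_usermodehelper_exec_work at ffffa0001038e7b4
"""

if __name__ == "__main__":
    print(call_chain(parse_bt(SAMPLE)))
```

Non-frame lines such as the `PID:` header and the `crash>` prompt do not match the pattern and are skipped, so the full `bt` output can be pasted in unchanged.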


How reproducible:
Always reproducible (in addition to the one occurrence in the nightly job, two manual runs showed the same behavior)

Steps to Reproduce:
1. git clone https://gitee.com/anolis/ltp.git --branch 20200930
   cd ltp
2. export CFLAGS+=" -fcommon"
   make autotools
   ./configure
   make && make install
3. cd /opt/ltp; ./runltp -f mm -s oom01

Actual results:
The test environment hangs and cannot restart on its own.

Expected results:
The test case passes.

Additional info:
[root@localhost home]# uname -r
5.10.134-706.git.2854f78f3.an8.aarch64+debug
[root@localhost home]#
[root@localhost home]#
[root@localhost home]# cat /etc/os-release
NAME="Anolis OS"
VERSION="8.8"
ID="anolis"
ID_LIKE="rhel fedora centos"
VERSION_ID="8.8"
PLATFORM_ID="platform:an8"
PRETTY_NAME="Anolis OS 8.8"
ANSI_COLOR="0;31"
HOME_URL="https://openanolis.cn/"

[root@localhost home]# free -g
              total        used        free      shared  buff/cache   available
Mem:            657          10         643           0           2         643
Swap:             0           0           0
[root@localhost home]# lscpu
Architecture:        aarch64
Byte Order:          Little Endian
CPU(s):              96
On-line CPU(s) list: 0-95
Thread(s) per core:  1
Core(s) per socket:  48
Socket(s):           2
NUMA node(s):        4
Vendor ID:           HiSilicon
BIOS Vendor ID:      HiSilicon
Model:               0
Model name:          Kunpeng-920
BIOS Model name:     HUAWEI Kunpeng 920 5250
Stepping:            0x1
CPU max MHz:         2600.0000
CPU min MHz:         200.0000
BogoMIPS:            200.00
L1d cache:           64K
L1i cache:           64K
L2 cache:            512K
L3 cache:            24576K
NUMA node0 CPU(s):   0-23
NUMA node1 CPU(s):   24-47
NUMA node2 CPU(s):   48-71
NUMA node3 CPU(s):   72-95
Flags:               fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma dcpop asimddp asimdfhm