Created attachment 819 [details] vmcore-dmesg.txt Description of problem: Anolis8 aarch64 debug内核nightly物理机环境下, LTP测试case:oom01测试会导致环境hung住,无法正常启动,强制重启后有vmcore生成,29号晚上的nightly出现的该问题。 crash解析如下(vmcore-dmesg.txt文件见附件): KERNEL: /usr/lib/debug/lib/modules/5.10.134-706.git.2854f78f3.an8.aarch64+debug/vmlinux [TAINTED] DUMPFILE: vmcore [PARTIAL DUMP] CPUS: 96 [OFFLINE: 51] DATE: Fri Jun 30 14:22:46 CST 2023 UPTIME: 02:52:05 LOAD AVERAGE: 21.80, 17.38, 11.44 TASKS: 1016 NODENAME: localhost.localdomain RELEASE: 5.10.134-706.git.2854f78f3.an8.aarch64+debug VERSION: #1 SMP Wed Jun 28 11:50:57 UTC 2023 MACHINE: aarch64 (unknown Mhz) MEMORY: 768 GB PANIC: "Kernel panic - not syncing: System is deadlocked on memory" PID: 10140 COMMAND: "kworker/u193:1" TASK: ffff561e46b40000 [THREAD_INFO: ffff561e46b40000] CPU: 22 STATE: TASK_RUNNING (PANIC) crash> bt PID: 10140 TASK: ffff561e46b40000 CPU: 22 COMMAND: "kworker/u193:1" #0 [ffff561da33d6c30] machine_kexec at ffffa0001025c7cc #1 [ffff561da33d6cc0] __crash_kexec at ffffa000105cce5c #2 [ffff561da33d6ea0] panic at ffffa000120c68dc #3 [ffff561da33d6fe0] out_of_memory at ffffa000108ce9c4 #4 [ffff561da33d70e0] __alloc_pages_may_oom at ffffa000109d4c7c #5 [ffff561da33d71e0] __alloc_pages_slowpath.constprop.106 at ffffa000109da200 #6 [ffff561da33d7470] __alloc_pages_nodemask at ffffa000109db7d0 #7 [ffff561da33d7560] alloc_pages_current at ffffa00010a2a0b0 #8 [ffff561da33d75b0] alloc_slab_page at ffffa00010a3d624 #9 [ffff561da33d75e0] allocate_slab at ffffa00010a40a74 #10 [ffff561da33d7640] ___slab_alloc at ffffa00010a4446c #11 [ffff561da33d7730] __slab_alloc at ffffa00010a44798 #12 [ffff561da33d7770] kmem_cache_alloc_node at ffffa00010a45a8c #13 [ffff561da33d7800] copy_process at ffffa0001032c8f8 #14 [ffff561da33d7a00] kernel_clone at ffffa00010330bf4 #15 [ffff561da33d7b10] kernel_thread at ffffa00010332234 #16 [ffff561da33d7c00] call_usermodehelper_exec_work at ffffa0001038e7b4 #17 [ffff561da33d7c20] process_one_work at ffffa0001039c040 #18 [ffff561da33d7d90] worker_thread at ffffa0001039d468 #19 [ffff561da33d7e50] kthread at ffffa000103b00e4 crash> How reproducible: 必现(除了任务中出现一次,手动执行两次均有同样的现象) Steps to Reproduce: 1. git clone https://gitee.com/anolis/ltp.git --branch 20200930 2. export CFLAGS+="-fcommon" make autotools ./configure make && make install 3. cd /opt/ltp;./runltp -f mm -s oom01 Actual results: 测试环境bung住,且无法自主启动 Expected results: 用例执行pass Additional info: [root@localhost home]# uname -r 5.10.134-706.git.2854f78f3.an8.aarch64+debug [root@localhost home]# [root@localhost home]# [root@localhost home]# cat /etc/os-release NAME="Anolis OS" VERSION="8.8" ID="anolis" ID_LIKE="rhel fedora centos" VERSION_ID="8.8" PLATFORM_ID="platform:an8" PRETTY_NAME="Anolis OS 8.8" ANSI_COLOR="0;31" HOME_URL="https://openanolis.cn/" [root@localhost home]# free -g total used free shared buff/cache available Mem: 657 10 643 0 2 643 Swap: 0 0 0 [root@localhost home]# lscpu Architecture: aarch64 Byte Order: Little Endian CPU(s): 96 On-line CPU(s) list: 0-95 Thread(s) per core: 1 Core(s) per socket: 48 Socket(s): 2 NUMA node(s): 4 Vendor ID: HiSilicon BIOS Vendor ID: HiSilicon Model: 0 Model name: Kunpeng-920 BIOS Model name: HUAWEI Kunpeng 920 5250 Stepping: 0x1 CPU max MHz: 2600.0000 CPU min MHz: 200.0000 BogoMIPS: 200.00 L1d cache: 64K L1i cache: 64K L2 cache: 512K L3 cache: 24576K NUMA node0 CPU(s): 0-23 NUMA node1 CPU(s): 24-47 NUMA node2 CPU(s): 48-71 NUMA node3 CPU(s): 72-95 Flags: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma dcpop asimddp asimdfhm