Bug 19044 - [ANCK6.6.71-3 rc1][aarch64][倚天710机器]alitests 中 perf_check 用例,/sys/kernel/mm/transparent_hugepage/enabled的值,预期always,环境上为never,不符合预期
Summary: [ANCK6.6.71-3 rc1][aarch64][倚天710机器]alitests 中 perf_check 用例,/sys/kernel/mm/t...
Status: RESOLVED BYDESIGN
Alias: None
Product: Antest
Classification: Infrastructures
Component: 测试用例 (show other bugs) 测试用例
Version: unspecified
Hardware: aarch64 Linux
: P3-Medium S3-normal
Target Milestone: ---
Assignee: chenzhuo
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2025-02-24 14:28 UTC by zhixin01
Modified: 2025-02-28 11:08 UTC (History)
5 users (show)

See Also:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description zhixin01 alibaba_cloud_group 2025-02-24 14:28:53 UTC
[缺陷描述]:
alitests测试套,perf_check 用例失败,检查系统 transparent_hugepage 开关默认值应为 always,实际是 never

ali5000和ali6000有同样问题,应该是镜像相关配置问题
https://project.aone.alibaba-inc.com/v2/project/797125/bug/60040293
https://project.aone.alibaba-inc.com/v2/project/797125/bug/60367792

测试日志:
<<<test_start>>>
tag=perf_check stime=1740376798
cmdline="perf_check.sh"
contacts=""
analysis=exit
<<<test_output>>>
incrementing stop
/var/tmp/tone/run/alitests/testcases/bin/perf_check.py
perf_check 1 TINFO: run perf_check.py
[mm.transparent_hugepage.enabled ] = never           Warning: Expected=always
(Refer to KB: https://help.aliyun.com/knowledge_detail/161963.html)
[net.ipv4.tcp_congestion_control ] = cubic
(Refer to KB: https://yq.aliyun.com/articles/712853)
[CPU BUGs]
PASS
[Mount Flags]
PASS
perf_check 1 TWARN: find unexpected system config

Summary:
passed   0
failed   0
skipped  0
warnings 1
<<<execution_status>>>
initiation_status="ok"
duration=0 termination_type=exited termination_id=4 corefile=no
cutime=2 cstime=1
<<<test_end>>>

[重现概率]:
必现

[重现环境]:
环境信息:倚天710机器
11.163.178.238

#uname -r
6.6.71-3_rc1.al8.aarch64

#cat /etc/os-release
NAME="Alibaba Cloud Linux"
VERSION="3 (Soaring Falcon)"
ID="alinux"
ID_LIKE="rhel fedora centos anolis"
VERSION_ID="3"
UPDATE_ID="10"
PLATFORM_ID="platform:al8"
PRETTY_NAME="Alibaba Cloud Linux 3 (Soaring Falcon)"
ANSI_COLOR="0;31"
HOME_URL="https://www.aliyun.com/"

#lscpu
Architecture:        aarch64
Byte Order:          Little Endian
CPU(s):              128
On-line CPU(s) list: 0-127
Thread(s) per core:  1
Core(s) per socket:  128
Socket(s):           1
NUMA node(s):        2
Vendor ID:           ARM
BIOS Vendor ID:      T-HEAD
Model:               0
Model name:          Neoverse-N2
BIOS Model name:     Yitian710-128
Stepping:            r0p0
CPU MHz:             2750.000
BogoMIPS:            100.00
Hypervisor vendor:   Alibaba
Virtualization type: full
L1d cache:           64K
L1i cache:           64K
L2 cache:            1024K
L3 cache:            65536K
NUMA node0 CPU(s):   0-63
NUMA node1 CPU(s):   64-127
Flags:               fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh

#free -h
              total        used        free      shared  buff/cache   available
Mem:          125Gi       3.6Gi       121Gi        12Mi       1.0Gi       122Gi
Swap:         2.0Gi          0B       2.0Gi

#cat /proc/cmdline
BOOT_IMAGE=(hd0,gpt2)/boot/vmlinuz-6.6.71-3_rc1.al8.aarch64 root=UUID=d0af582f-7147-41de-85e3-deb2e14cde99 ro biosdevname=0 rd.driver.pre=ahci iommu.passthrough=1 iommu.strict=0 nospectre_bhb ssbd=force-off systemd.unified_cgroup_hierarchy=0 cgroup.memory=nokmem console=ttyS0,115200 fsck.repair=yes crashkernel=0M-2G:0M,2G-256G:256M,256G-1024G:320M,1024G-:384M

[重现步骤]:
# 下载并编译用例
git clone http://gitlab-sp.alibaba-inc.com/os-quality/alitests.git
export CFLAGS="-fcommon"               #  gcc 10 需要添加这个
cd alitests
make autotools
./configure
make
make install

# 执行测试
/opt/ltp/runltp -f alitests -s perf_check 

[期望结果]:
用例执行PASS

[实际结果]:
用例执行Fail

[原因定位]:
相关配置如下:
#grep -i CONFIG_TRANSPARENT_HUGEPAGE_ALWAYS /boot/config-6.6.71-3_rc1.al8.aarch64
CONFIG_TRANSPARENT_HUGEPAGE_ALWAYS=y

#cat /sys/kernel/mm/transparent_hugepage/enabled
always madvise [never]
Comment 1 chenzhuo alibaba_cloud_group 2025-02-28 10:48:14 UTC
正如关联aone所述,transparent_hugepage的值内核设置为always,z_alinux-apsara-setup服务设置为了never,因此就这个问题来说是内核无关的。
至于z_alinux-apsara-setup设置为never是否符合预期,后续找baseos同学确认
Comment 2 chenzhuo alibaba_cloud_group 2025-02-28 11:08:34 UTC
(In reply to chenzhuo from comment #1)
> 正如关联aone所述,transparent_hugepage的值内核设置为always,z_alinux-apsara-
> setup服务设置为了never,因此就这个问题来说是内核无关的。
> 至于z_alinux-apsara-setup设置为never是否符合预期,后续找baseos同学确认

basesos同学确认符合预期