Bug 146 - [anolis79][anck][iso][x86] kdump.service loaded failed
Summary: [anolis79][anck][iso][x86] kdump.service loaded failed
Status: CONFIRMED
Alias: None
Product: Anolis OS 7
Classification: Anolis OS
Component: Images&Installations (show other bugs) Images&Installations
Version: 7.9
Hardware: Other Windows
: P1-Urgent S1-blocker
Target Milestone: ---
Assignee: geliwei-ali
QA Contact:
URL:
Whiteboard:
Keywords:
: 156 (view as bug list)
Depends on:
Blocks:
 
Reported: 2021-12-20 14:31 UTC by anolislw
Modified: 2022-01-07 17:28 UTC (History)
2 users (show)

See Also:


Attachments
error log (344.18 KB, image/png)
2022-01-07 17:27 UTC, 葛立伟
Details

Note You need to log in before you can comment on or make changes to this bug.
Description anolislw alibaba_cloud_group 2021-12-20 14:31:44 UTC
[问题描述]
使用兆芯anolis79 iso装机,安装菜单选择
语言:chese 和 english
Server for GUI (附件全部勾选)
装机后systemctl --failed显示kdump.service启动异常

[报错详情]
----------------   中文 模式 --------------------------------------------
[anolis79@anck-x86-chinese-gui ~]$ ll
总用量 0
drwxr-xr-x 2 anolis79 anolis79 6 12月 17 16:36 公共
drwxr-xr-x 2 anolis79 anolis79 6 12月 17 16:36 模板
drwxr-xr-x 2 anolis79 anolis79 6 12月 17 16:36 视频
drwxr-xr-x 2 anolis79 anolis79 6 12月 17 16:36 图片
drwxr-xr-x 2 anolis79 anolis79 6 12月 17 16:36 文档
drwxr-xr-x 2 anolis79 anolis79 6 12月 17 16:36 下载
drwxr-xr-x 2 anolis79 anolis79 6 12月 17 16:36 音乐
drwxr-xr-x 2 anolis79 anolis79 6 12月 17 16:36 桌面
[anolis79@anck-x86-chinese-gui ~]$
[anolis79@anck-x86-chinese-gui ~]$  systemctl --failed
  UNIT          LOAD   ACTIVE SUB    DESCRIPTION
● kdump.service loaded failed failed Crash recovery kernel arming

LOAD   = Reflects whether the unit definition was properly loaded.
ACTIVE = The high-level unit activation state, i.e. generalization of SUB.
SUB    = The low-level unit activation state, values depend on unit type.

1 loaded units listed. Pass --all to see loaded but inactive units, too.
To show all installed unit files use 'systemctl list-unit-files'.


----------------   英文 模式 --------------------------------------------
[roo@anck-x86-english-gui ~]$ ls
Desktop  Documents  Downloads  Music  Pictures  Public  Templates  Videos
[roo@anck-x86-english-gui ~]$ systemctl --failed
  UNIT          LOAD   ACTIVE SUB    DESCRIPTION
● kdump.service loaded failed failed Crash recovery kernel arming

LOAD   = Reflects whether the unit definition was properly loaded.
ACTIVE = The high-level unit activation state, i.e. generalization of SUB.
SUB    = The low-level unit activation state, values depend on unit type.

1 loaded units listed. Pass --all to see loaded but inactive units, too.
To show all installed unit files use 'systemctl list-unit-files'.


[机器详情]
----------------   中文 模式 --------------------------------------------
[anolis79@anck-x86-chinese-gui ~]$ uname -a
Linux anck-x86-chinese-gui 4.19.91-25.an7.x86_64 #1 SMP Thu Nov 25 14:22:42 CST 2021 x86_64 x86_64 x86_64 GNU/Linux
[anolis79@anck-x86-chinese-gui ~]$ uname -r
4.19.91-25.an7.x86_64
[anolis79@anck-x86-chinese-gui ~]$ cat /etc/os-release
NAME="Anolis OS"
VERSION="7.9"
ID="anolis"
ID_LIKE="rhel fedora centos"
VERSION_ID="7.9"
PRETTY_NAME="Anolis OS 7.9"
ANSI_COLOR="0;31"
HOME_URL="https://openanolis.cn/"
BUG_REPORT_URL="https://bugs.openanolis.cn/"

CENTOS_MANTISBT_PROJECT="CentOS-7"
CENTOS_MANTISBT_PROJECT_VERSION="7"
REDHAT_SUPPORT_PRODUCT="centos"
REDHAT_SUPPORT_PRODUCT_VERSION="7"

[anolis79@anck-x86-chinese-gui ~]$ df -h
文件系统             容量  已用  可用 已用% 挂载点
devtmpfs              63G     0   63G    0% /dev
tmpfs                 63G     0   63G    0% /dev/shm
tmpfs                 63G  9.9M   63G    1% /run
tmpfs                 63G     0   63G    0% /sys/fs/cgroup
/dev/mapper/ao-root   50G  5.5G   45G   11% /
/dev/sdb6           1014M  162M  853M   16% /boot
/dev/sdb1            599M   12M  588M    2% /boot/efi
/dev/mapper/ao-home   45G   37M   45G    1% /home
tmpfs                 13G   44K   13G    1% /run/user/1000
/dev/sr0             8.0G  8.0G     0  100% /run/media/anolis79/anolis-dvd-x86_64-7
tmpfs                 13G     0   13G    0% /run/user/0
[anolis79@anck-x86-chinese-gui ~]$ free -g
              total        used        free      shared  buff/cache   available
Mem:            125           0         124           0           0         124
Swap:             3           0           3
[anolis79@anck-x86-chinese-gui ~]$ lscpu
Architecture:          x86_64
CPU op-mode(s):        32-bit, 64-bit
Byte Order:            Little Endian
CPU(s):                16
On-line CPU(s) list:   0-15
Thread(s) per core:    1
Core(s) per socket:    8
座:                 2
NUMA 节点:         2
厂商 ID:           CentaurHauls
CPU 系列:          7
型号:              59
型号名称:        ZHAOXIN KaiSheng KH-37800D@2.7GHz
步进:              14
CPU MHz:             1200.000
CPU max MHz:           2700.0000
CPU min MHz:           1200.0000
BogoMIPS:            5389.28
虚拟化:           VT-x
L1d 缓存:          32K
L1i 缓存:          32K
L2 缓存:           4096K
NUMA 节点0 CPU:    0-7
NUMA 节点1 CPU:    8-15
Flags:                 fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid pni pclmulqdq monitor vmx smx est tm2 ssse3 cx16 xtpr pcid sse4_1 sse4_2 movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand rng rng_en sm3 sm4 sm3_en sm4_en ace ace_en ace2 phe phe_en pmm pmm_en parallax rng2 rng2_en phe2 phe2_en rsa rsa_en lahf_lm abm 3dnowprefetch invpcid_single ibrs ibpb tpr_shadow vnmi ept vpid fsgsbase tsc_adjust bmi1 smep bmi2 invpcid rdseed adx sha_ni xsaveopt dtherm ida umip arch_capabilities


----------------   英文模式 --------------------------------------------
[roo@anck-x86-english-gui ~]$ uname -a
Linux anck-x86-english-gui 4.19.91-25.an7.x86_64 #1 SMP Thu Nov 25 14:22:42 CST 2021 x86_64 x86_64 x86_64 GNU/Linux
[roo@anck-x86-english-gui ~]$ uname -r
4.19.91-25.an7.x86_64
[roo@anck-x86-english-gui ~]$ cat /etc/os-release
NAME="Anolis OS"
VERSION="7.9"
ID="anolis"
ID_LIKE="rhel fedora centos"
VERSION_ID="7.9"
PRETTY_NAME="Anolis OS 7.9"
ANSI_COLOR="0;31"
HOME_URL="https://openanolis.cn/"
BUG_REPORT_URL="https://bugs.openanolis.cn/"

CENTOS_MANTISBT_PROJECT="CentOS-7"
CENTOS_MANTISBT_PROJECT_VERSION="7"
REDHAT_SUPPORT_PRODUCT="centos"
REDHAT_SUPPORT_PRODUCT_VERSION="7"

[roo@anck-x86-english-gui ~]$ df -h
Filesystem           Size  Used Avail Use% Mounted on
devtmpfs              63G     0   63G   0% /dev
tmpfs                 63G     0   63G   0% /dev/shm
tmpfs                 63G  9.6M   63G   1% /run
tmpfs                 63G     0   63G   0% /sys/fs/cgroup
/dev/mapper/ao-root   50G   14G   37G  28% /
/dev/sdc2           1014M  162M  853M  16% /boot
/dev/sdc1            200M   12M  189M   6% /boot/efi
/dev/mapper/ao-home  3.5T   72M  3.5T   1% /home
tmpfs                 13G   12K   13G   1% /run/user/42
tmpfs                 13G     0   13G   0% /run/user/0
[roo@anck-x86-english-gui ~]$ free -h
              total        used        free      shared  buff/cache   available
Mem:           125G        712M        124G         12M        517M        124G
Swap:          4.0G          0B        4.0G
[roo@anck-x86-english-gui ~]$ lscpu
Architecture:          x86_64
CPU op-mode(s):        32-bit, 64-bit
Byte Order:            Little Endian
CPU(s):                16
On-line CPU(s) list:   0-15
Thread(s) per core:    1
Core(s) per socket:    8
Socket(s):             2
NUMA node(s):          2
Vendor ID:             CentaurHauls
CPU family:            7
Model:                 59
Model name:            ZHAOXIN KaiSheng KH-37800D@2.7GHz
Stepping:              14
CPU MHz:               1200.000
CPU max MHz:           2700.0000
CPU min MHz:           1200.0000
BogoMIPS:              5389.21
Virtualization:        VT-x
L1d cache:             32K
L1i cache:             32K
L2 cache:              4096K
NUMA node0 CPU(s):     0-7
NUMA node1 CPU(s):     8-15
Flags:                 fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid pni pclmulqdq monitor vmx smx est tm2 ssse3 cx16 xtpr pcid sse4_1 sse4_2 movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand rng rng_en sm3 sm4 sm3_en sm4_en ace ace_en ace2 phe phe_en pmm pmm_en parallax parallax_en rng2 rng2_en phe2 phe2_en rsa rsa_en lahf_lm abm 3dnowprefetch invpcid_single ibrs ibpb tpr_shadow vnmi ept vpid fsgsbase tsc_adjust bmi1 smep bmi2 invpcid rdseed adx sha_ni xsaveopt dtherm ida umip arch_capabilities

[出现频率]
装机4次都有发生
Comment 1 shanxifanshi alibaba_cloud_group 2021-12-20 15:02:05 UTC
经验证ISO安装后内核参数预留内存方式为auto,kdump服务启动失败,restart也会失败;手动改为512M后,restart kdump服务会启动成功。

如果不支持auto,建议在iso中将crashkernel值设置为可变范围,而非auto

修改crashkernel值为512M
[root@anck-x86-english-gui pkg_smoke_test]# cat /proc/cmdline
BOOT_IMAGE=/vmlinuz-4.19.91-25.an7.x86_64 root=/dev/mapper/ao-root ro crashkernel=512M rd.lvm.lv=ao/root rd.lvm.lv=ao/swap rhgb quiet

# systemctl status kdump
● kdump.service - Crash recovery kernel arming
   Loaded: loaded (/usr/lib/systemd/system/kdump.service; enabled; vendor preset: enabled)
   Active: active (exited) since Mon 2021-12-20 14:31:05 CST; 23min ago
  Process: 1567 ExecStart=/usr/bin/kdumpctl start (code=exited, status=0/SUCCESS)
 Main PID: 1567 (code=exited, status=0/SUCCESS)

Dec 20 14:30:49 anck-x86-english-gui dracut[2417]: *** Constructing AuthenticAMD.bin ****
Dec 20 14:30:49 anck-x86-english-gui dracut[2417]: *** Constructing GenuineIntel.bin ****
Dec 20 14:30:49 anck-x86-english-gui dracut[2417]: *** No early-microcode cpio image needed ***
Dec 20 14:30:49 anck-x86-english-gui dracut[2417]: *** Store current command line parameters ***
Dec 20 14:30:49 anck-x86-english-gui dracut[2417]: *** Creating image file ***
Dec 20 14:31:03 anck-x86-english-gui dracut[2417]: *** Creating image file done ***
Dec 20 14:31:03 anck-x86-english-gui dracut[2417]: *** Creating initramfs image file '/boot/initramfs-4.19.91-25.an7.x86_64...ne ***
Dec 20 14:31:05 anck-x86-english-gui kdumpctl[1567]: kexec: loaded kdump kernel
Dec 20 14:31:05 anck-x86-english-gui kdumpctl[1567]: Starting kdump: [OK]
Dec 20 14:31:05 anck-x86-english-gui systemd[1]: Started Crash recovery kernel arming.
Hint: Some lines were ellipsized, use -l to show in full.

[root@anck-x86-english-gui pkg_smoke_test]# free -h
              total        used        free      shared  buff/cache   available
Mem:           125G        679M        123G         12M        1.0G        123G
Swap:          4.0G          0B        4.0G
Comment 2 yunqi-zwt alibaba_cloud_group 2021-12-29 11:25:52 UTC
请提供下 kexec-tools 的版本
Comment 3 yunqi-zwt alibaba_cloud_group 2022-01-04 15:46:16 UTC
将会调整grub对应配置,将crashkernel值从 auto 改为预分配。具体参考值如下:

crashkernel=0M-2G:0M,2G-8G:192M,8G-:256M

数据来源, 阿里云alinux2 grub配置。
Comment 4 yunqi-zwt alibaba_cloud_group 2022-01-04 15:47:49 UTC
*** Bug 156 has been marked as a duplicate of this bug. ***
Comment 6 anolislw alibaba_cloud_group 2022-01-07 15:40:43 UTC
AnolisOS-7.9-GA-x86_64-dvd.iso还存在
Comment 7 葛立伟 alibaba_cloud_group 2022-01-07 17:27:34 UTC
Created attachment 94 [details]
error log
Comment 8 葛立伟 alibaba_cloud_group 2022-01-07 17:28:36 UTC
ISO安装的时候找错了grub,需要进一步调查