Bug 3726 - [Anolis 8.8][loongarch64] kdump service fails to start on the RC2 ISO
Summary: [Anolis 8.8][loongarch64] kdump service fails to start on the RC2 ISO
Status: IN_PROGRESS
Alias: None
Product: Anolis OS 8
Classification: Anolis OS
Component: Others
Version: 8.8
Hardware: loongarch Linux
Importance: P3-Medium S3-normal
Target Milestone: ---
Assignee: HFD
QA Contact: shuming
URL:
Whiteboard:
Keywords:
Duplicates: 850
Depends on:
Blocks:
 
Reported: 2023-01-12 16:39 UTC by liqianwen
Modified: 2023-02-08 14:06 UTC (History)

See Also:


Description liqianwen loongson_group 2023-01-12 16:39:26 UTC
Description of problem:
The kdump service fails to start on the RC2 ISO.


Version-Release number of selected component (if applicable):
ISO download:
https://mirrors.openanolis.cn/anolis/8.8/isos/RC2/loongarch64/anolis-8.8-loongarch64-dvd.iso


How reproducible: 100%


Steps to Reproduce:
1. Install and boot the RC2 ISO.
2. Check the status of the kdump service: systemctl status kdump

Actual results:
systemctl status kdump
● kdump.service - Crash recovery kernel arming
   Loaded: loaded (/usr/lib/systemd/system/kdump.service; enabled; vendor preset: enabled)
   Active: failed (Result: exit-code) since Fri 2022-11-18 11:21:36 CST; 1 months 24 days ago
 Main PID: 640812 (code=exited, status=1/FAILURE)

11月 18 11:21:36 bogon systemd[1]: Starting Crash recovery kernel arming...
11月 18 11:21:36 bogon kdumpctl[640812]: kdump: Error: No find /boot/vmlinux-4.19.190-6.5.lns8.loongarch64+kdump kernel file
11月 18 11:21:36 bogon kdumpctl[640812]: kdump: Please specify DEFAULT_KDUMP_KERNEL configuration in /etc/sysconfig/kdump file
11月 18 11:21:36 bogon kdumpctl[640812]: kdump: failed to prepare for kdump bootinfo.
11月 18 11:21:36 bogon kdumpctl[640812]: kdump: Starting kdump: [FAILED]
11月 18 11:21:36 bogon systemd[1]: kdump.service: Main process exited, code=exited, status=1/FAILURE
11月 18 11:21:36 bogon systemd[1]: kdump.service: Failed with result 'exit-code'.
11月 18 11:21:36 bogon systemd[1]: Failed to start Crash recovery kernel arming.

Expected results:
The kdump service runs normally.
Comment 1 liqianwen loongson_group 2023-01-16 09:46:04 UTC
The kdump service status has now changed to the following:
systemctl status kdump
● kdump.service - Crash recovery kernel arming
   Loaded: loaded (/usr/lib/systemd/system/kdump.service; enabled; vendor preset: enabled)
   Active: failed (Result: exit-code) since Mon 2023-01-16 09:45:05 CST; 4s ago
  Process: 24195 ExecStart=/usr/bin/kdumpctl start (code=exited, status=1/FAILURE)
 Main PID: 24195 (code=exited, status=1/FAILURE)

1月 16 09:45:05 bogon systemd[1]: Starting Crash recovery kernel arming...
1月 16 09:45:05 bogon kdumpctl[24197]: kdump: No memory reserved for crash kernel
1月 16 09:45:05 bogon kdumpctl[24197]: kdump: Starting kdump: [FAILED]
1月 16 09:45:05 bogon systemd[1]: kdump.service: Main process exited, code=exited, status=1/FAILURE
1月 16 09:45:05 bogon systemd[1]: kdump.service: Failed with result 'exit-code'.
1月 16 09:45:05 bogon systemd[1]: Failed to start Crash recovery kernel arming
Comment 2 葛立伟 alibaba_cloud_group 2023-01-16 16:55:40 UTC
(In reply to liqianwen from comment #1)
> The kdump service status has now changed to the following:
> systemctl status kdump
> ● kdump.service - Crash recovery kernel arming
>    Loaded: loaded (/usr/lib/systemd/system/kdump.service; enabled; vendor
> preset: enabled)
>    Active: failed (Result: exit-code) since Mon 2023-01-16 09:45:05 CST; 4s
> ago
>   Process: 24195 ExecStart=/usr/bin/kdumpctl start (code=exited,
> status=1/FAILURE)
>  Main PID: 24195 (code=exited, status=1/FAILURE)
> 
> 1月 16 09:45:05 bogon systemd[1]: Starting Crash recovery kernel arming...
> 1月 16 09:45:05 bogon kdumpctl[24197]: kdump: No memory reserved for crash
> kernel
> 1月 16 09:45:05 bogon kdumpctl[24197]: kdump: Starting kdump: [FAILED]
> 1月 16 09:45:05 bogon systemd[1]: kdump.service: Main process exited,
> code=exited, status=1/FAILURE
> 1月 16 09:45:05 bogon systemd[1]: kdump.service: Failed with result
> 'exit-code'.
> 1月 16 09:45:05 bogon systemd[1]: Failed to start Crash recovery kernel arming

What does /proc/cmdline contain?
Comment 3 tangyouling loongson_group 2023-01-17 14:48:42 UTC
Regarding the original problem:
>11月 18 11:21:36 bogon kdumpctl[640812]: kdump: Error: No find 
>/boot/vmlinux-4.19.190-6.5.lns8.loongarch64+kdump kernel file
>11月 18 11:21:36 bogon kdumpctl[640812]: kdump: Please specify
>DEFAULT_KDUMP_KERNEL configuration in /etc/sysconfig/kdump file
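The failure above occurs because the kdump kernel was not found, so the kdump service fails to start.
The cause is that the loongarch implementation differs from x86/arm64: kdump there requires building two different kernels, a production kernel and a capture kernel, unlike arm64 and similar architectures, where a single kernel is enough.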
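
To illustrate the check that fails here, the following is a minimal sketch, not the actual kdumpctl logic. The `vmlinux-<release>+kdump` naming is inferred from the error message quoted above, and the function names `kdump_kernel_path` and `check_kdump_kernel` are hypothetical.

```shell
#!/bin/sh
# Hypothetical helpers; the file naming is inferred from the error message
# in this report, not taken from the kdumpctl source.

# Print the path where a separate capture kernel would be expected.
# $1 = boot directory, $2 = kernel release string
kdump_kernel_path() {
    printf '%s/vmlinux-%s+kdump\n' "$1" "$2"
}

# Report whether that capture kernel file exists; returns 1 if it is missing.
check_kdump_kernel() {
    path=$(kdump_kernel_path "$1" "$2")
    if [ -f "$path" ]; then
        echo "found: $path"
    else
        echo "missing: $path"
        return 1
    fi
}

# Usage on a live system:
#   check_kdump_kernel /boot "$(uname -r)"
```

If the file is reported missing, the log above suggests setting DEFAULT_KDUMP_KERNEL in /etc/sysconfig/kdump to point at a capture kernel, provided one has been built.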

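Planned changes:
The kdump implementation is being rewritten in our internal product, with the goal of a single-binary scheme. After that, the production kernel and the capture kernel can be the same one, as on arm64. The change is fairly large, however, and must pass internal review before the Anolis kernel is modified.

Once the kernel change is complete, the kdump service will be modified, and kexec-tools and the crash tool will be updated in step.
Completing all of this work will take some time.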


> 1月 16 09:45:05 bogon kdumpctl[24197]: kdump: No memory reserved for crash
> kernel
The problem shown above is most likely caused by an incorrect crashkernel parameter, which makes the crashkernel memory reservation fail.
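
One way to look into this is to inspect the kernel command line and the size of the reserved region. The sketch below assumes a POSIX shell; `crashkernel_arg` is a hypothetical helper name, and printing the last occurrence when the parameter is repeated is an assumption about which one matters.

```shell
#!/bin/sh
# Hypothetical helper: print the value of crashkernel= from a kernel
# command line string. Prints nothing if the parameter is absent.
# If the parameter is repeated, the last occurrence is printed (assumption).
# $1 = contents of a kernel command line
crashkernel_arg() {
    printf '%s\n' "$1" | tr ' ' '\n' | sed -n 's/^crashkernel=//p' | tail -n 1
}

# Usage on a live system:
#   crashkernel_arg "$(cat /proc/cmdline)"
#   cat /sys/kernel/kexec_crash_size   # 0 means no memory was reserved
```

An empty result from the first command means no crashkernel= parameter was passed at boot. A non-empty result combined with a reserved size of 0 would point to a value the kernel could not honor.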
Comment 4 葛立伟 alibaba_cloud_group 2023-01-17 15:07:23 UTC
*** Bug 850 has been marked as a duplicate of this bug. ***
Comment 5 Jacob alibaba_cloud_group 2023-02-08 14:04:41 UTC
Will not be fixed for now; to be noted in the Release Notes at GA release.
Comment 6 葛立伟 alibaba_cloud_group 2023-02-08 14:06:31 UTC
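Currently the Loongson kdump service requires a separate dump kernel. Loongson is developing toward the common approach, and support is not expected in 8.8.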