Description of problem: sysom主机管理页面,添加主机后,主机的系统版本和内核版本信息无法获取到。 Version-Release number of selected component (if applicable): 使用基于下面这个commit点的定制代码:https://gitee.com/anolis/sysom/commit/0e2c489be1fdec87e8067c68fcff30e8c7c0cc62 后面合入了两个更新: https://gitee.com/anolis/sysom/commit/7fd2c54e451e0f878694af76d0dd00f391cd6dd2, https://gitee.com/anolis/sysom/commit/546fb1753cb70fc041320820fc49dbf6831f5765 How reproducible: 概率性出现 Steps to Reproduce: 1. 在sysom 网页新建集群 2. 在sysom 网页主机管理添加主机到新建集群 3. 主机添加成功 Actual results: 主机列表中,新添主机的“主机发行版本” 和 “主机内核版本” 字段一直为空,没有显示。 Expected results: 主机的“主机发行版本” 和 “主机内核版本” 信息被准确采集,并显示在网页端。 Additional info: 查看sysom 日志,发现添加主机后,该新添主机的心跳检测没有触发执行。
如下sysom-api.log中,10.15.2.204 添加成功后,没有触发心跳检测过程。 16913 172.20.239.163:0 - - [02/Aug/2023:15:36:39] "GET /api/v1/host/?current=1&pageSize=20" 200 67 16914 2023-08-02 15:37:20 | INFO | ./apps/accounts/authentication.py:37 | admin 身份通过 16915 2023-08-02 15:37:20 | INFO | /usr/local/sysom/server/target/sysom_server/sdk/cec_redis/redis_producer.py:155 | <cec_redis.redis_producer.RedisProduc er object at 0x7fad54cb2d68> produce one message '1690961840156-0'=>{'channel': 'ssh', 'type': 'init', 'params': {'instance': '10.15.2.204', 'userna me': 'root', 'port': 22, 'password': 'Oqyq0+p1aekO', '__channel_params_timeout': 600000, '__channel_params_auto_retry': True, '__channel_params_retu rn_as_stream': False}, 'echo': {'__job_id': '51eab23f-13c9-4567-aff3-3bae3f55436a'}, 'bind_result_topic': 'SYSOM_HOST_LISTEN_TOPIC'} successfully. 16916 2023-08-02 15:37:20 | SUCCESS | /usr/local/sysom/server/target/sysom_server/sdk/cec_redis/redis_producer.py:197 | <cec_redis.redis_producer.RedisPro ducer object at 0x7fad1410df98> connect to 'redis://localhost:6379?' successfully. 16917 2023-08-02 15:37:20 | SUCCESS | /usr/local/sysom/server/target/sysom_server/sdk/cec_base/producer.py:145 | Cec-base-Producer dispatch one producer i nstance success. proto=redis, url=redis://localhost:6379?cec_default_max_len=1000&cec_auto_mk_topic=true 16918 2023-08-02 15:37:20 | INFO | /usr/local/sysom/server/target/sysom_server/sdk/cec_redis/redis_producer.py:155 | <cec_redis.redis_producer.RedisProduc er object at 0x7fad1410df98> produce one message '1690961840258-0'=>{'type': 'init', 'params': {'channel': 'ssh', 'instance': '10.15.2.204', 'userna me': 'root', 'port': 22, 'token': 'eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9.eyJpZCI6MSwidXNlcm5hbWUiOiJhZG1pbiIsImV4cCI6MTY5MDk3MTQ1MS41NzQzMDA1fQ.QFIV3 slPrWvQ0J7ElGzaG8vcIeR1o6RDJdWSRK31UGk'}} successfully. 16919 2023-08-02 15:37:20 | INFO | ./apps/host/views.py:206 | node init task create success 16920 2023-08-02 15:37:20 | INFO | /usr/local/sysom/server/target/sysom_server/sdk/cec_redis/redis_producer.py:155 | <cec_redis.redis_producer.RedisProduc er object at 0x7fad148d0898> produce one message '1690961840285-0'=>{'sub': 'admin', 'message': {'id': 10, 'collected_time': '2023-08-02 15:37:20', 'level': 'success', 'noticelcon_type': 'notification', 'sub': 'admin', 'host': None, 'receiver': None, 'created_at': '2023-08-02 15:37:20', 'deleted _at': None, 'message': 'IP: 10.15.2.204 add success!', 'duration_time': None, 'item': 'host', 'is_read': False}} successfully. 16921 172.20.239.163:0 - - [02/Aug/2023:15:37:20] "POST /api/v1/host/" 200 57 16922 127.0.0.1:35740 - - [02/Aug/2023:15:37:28] "GET /api/v1/cluster/" 200 200 16923 127.0.0.1:35804 - - [02/Aug/2023:15:37:38] "GET /api/v1/host/?cluster=1" 200 278 16924 127.0.0.1:35850 - - [02/Aug/2023:15:37:53] "GET /api/v1/host/?cluster=1" 200 278 16925 127.0.0.1:35870 - - [02/Aug/2023:15:37:58] "GET /api/v1/host/?cluster=1" 200 278 16926 127.0.0.1:0 - - [02/Aug/2023:15:38:01] "GET /api/v1/host" 301 - 16927 127.0.0.1:0 - - [02/Aug/2023:15:38:01] "GET /api/v1/host/" 200 278 16928 127.0.0.1:35910 - - [02/Aug/2023:15:38:08] "GET /api/v1/host/?cluster=1" 200 278 16929 127.0.0.1:35944 - - [02/Aug/2023:15:38:18] "GET /api/v1/host/?cluster=1" 200 278 16930 2023-08-02 15:38:43 | SUCCESS | /usr/local/sysom/server/target/sysom_server/sdk/cec_redis/redis_producer.py:232 | <cec_redis.redis_producer.RedisPro ducer object at 0x7fad1410df98> disconnect from 'redis://localhost:6379?' successfully. 16931 172.20.239.163:0 - - [02/Aug/2023:15:36:39] "DELETE /api/v1/host/3/" 200 68 16932 172.20.239.163:0 - - [02/Aug/2023:15:37:20] "GET /api/v1/host/?current=1&pageSize=20" 200 278 16933 127.0.0.1:35746 - - [02/Aug/2023:15:37:28] "GET /api/v1/host/?cluster=1" 200 278
可以看一下重启 sysom-api 是否恢复,或者将 sysom-api.ini 的 numprocs 改成1 试试。 即将发布的新版(在dev分支)已经不采用 daphne 启动 sysom-api 了,而是统一采用 gunicorn + uvicorn,可以选择将社区最新的特性合入,或者参考这个提交进行修改 https://gitee.com/anolis/sysom/commit/e2ad41b6f770a4931f0a2492b2328b333ec74cf6