Bug 4818 - [Anolis23][x86_64][社区nightly] LTP测试套net.tcp_cmds/ipneigh01_ip run fail:TFAIL: cannot ping 10.0.0.1
Summary: [Anolis23][x86_64][社区nightly] LTP测试套net.tcp_cmds/ipneigh01_ip run fail:TFAIL:...
Status: RESOLVED INVALID
Alias: None
Product: Anolis OS 23
Classification: Anolis OS
Component: Others (show other bugs) Others
Version: 23.0
Hardware: All Linux
: P3-Medium S3-normal
Target Milestone: ---
Assignee: yunmeng365524
QA Contact:
URL:
Whiteboard:
Keywords:
: 4819 4820 (view as bug list)
Depends on:
Blocks:
 
Reported: 2023-04-24 09:56 UTC by anolislw
Modified: 2023-06-07 11:10 UTC (History)
3 users (show)

See Also:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description anolislw alibaba_cloud_group 2023-04-24 09:56:51 UTC
Description of problem:
anolis23 x86_64 ECS环境下,ltp测试套net.tcp_cmds/ipneigh01_ip run fail:TFAIL: cannot ping 10.0.0.1

Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1. git clone https://gitee.com/anolis/ltp.git --branch 20230315
2. export CFLAGS+="-fcommon"
   make autotools
   ./configure 
   make && make install
3. cd /opt/ltp;./runltp  -f net.tcp_cmds -s ipneigh01_ip

Actual results:
tag=ipneigh01_ip stime=1682274846
cmdline="ipneigh01.sh -c ip"
contacts=""
analysis=exit
<<<test_output>>>
ipneigh01 1 TINFO: initialize 'lhost' 'ltp_ns_veth2' interface
ipneigh01 1 TINFO: add local addr 10.0.0.2/24
ipneigh01 1 TINFO: add local addr fd00:1:1:1::2/64
ipneigh01 1 TINFO: initialize 'rhost' 'ltp_ns_veth1' interface
ipneigh01 1 TINFO: add remote addr 10.0.0.1/24
ipneigh01 1 TINFO: add remote addr fd00:1:1:1::1/64
ipneigh01 1 TINFO: Network config (local -- remote):
ipneigh01 1 TINFO: ltp_ns_veth2 -- ltp_ns_veth1
ipneigh01 1 TINFO: 10.0.0.2/24 -- 10.0.0.1/24
ipneigh01 1 TINFO: fd00:1:1:1::2/64 -- fd00:1:1:1::1/64
ipneigh01 1 TINFO: timeout per run is 0h 5m 0s
ipneigh01 1 TINFO: stress auto-creation ARP cache entry deleted with 'ip' 50 times
ipneigh01 1 TFAIL: cannot ping 10.0.0.1

Summary:
passed   0
failed   1
broken   0
skipped  0
warnings 0

Expected results:
case pass

Additional info:
[root@qibo-anolis23-nightly-func-x86-1 ~]# uname -r
5.10.134-88.git.df00332440f4.an23.x86_64
[root@qibo-anolis23-nightly-func-x86-1 ~]# cat /etc/anolis-release
Anolis OS release 23
[root@qibo-anolis23-nightly-func-x86-1 ~]# cat /proc/cmdline
BOOT_IMAGE=(hd0,msdos1)/boot/vmlinuz-5.10.134-88.git.df00332440f4.an23.x86_64 root=UUID=ece72b7f-465b-433d-8b3b-e5fa53a04642 ro rhgb cryptomgr.notests rcupdate.rcu_cpu_stall_timeout=300 quiet biosdevname=0 net.ifnames=0 console=tty0 console=ttyS0,115200n8 noibrs nvme_core.io_timeout=4294967295 nvme_core.admin_timeout=4294967295 cgroup.memory=nokmem crashkernel=0M-2G:0M,2G-8G:192M,8G-:256M
[root@qibo-anolis23-nightly-func-x86-1 ~]# df -h
Filesystem      Size  Used Avail Use% Mounted on
devtmpfs        4.0M     0  4.0M   0% /dev
tmpfs           7.6G     0  7.6G   0% /dev/shm
tmpfs           3.1G  716K  3.1G   1% /run
/dev/vda1        40G   14G   27G  34% /
/dev/vdc1        98G  3.3G   90G   4% /tmp
tmpfs           1.6G     0  1.6G   0% /run/user/0
[root@qibo-anolis23-nightly-func-x86-1 ~]# free -g
               total        used        free      shared  buff/cache   available
Mem:              15           0          12           0           1          14
Swap:              0           0           0
[root@qibo-anolis23-nightly-func-x86-1 ~]# lscpu
Architecture:            x86_64
  CPU op-mode(s):        32-bit, 64-bit
  Address sizes:         46 bits physical, 57 bits virtual
  Byte Order:            Little Endian
CPU(s):                  4
  On-line CPU(s) list:   0-3
Vendor ID:               GenuineIntel
  BIOS Vendor ID:        Alibaba Cloud
  Model name:            Intel(R) Xeon(R) Platinum 8369B CPU @ 2.70GHz
    BIOS Model name:     pc-i440fx-2.1  CPU @ 0.0GHz
    BIOS CPU family:     1
    CPU family:          6
    Model:               106
    Thread(s) per core:  2
    Core(s) per socket:  2
    Socket(s):           1
    Stepping:            6
    BogoMIPS:            5399.99
    Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht sys
                         call nx pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc cpuid tsc_known_freq pni pclmulqdq monitor
                          ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dn
                         owprefetch cpuid_fault invpcid_single ibrs_enhanced fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid av
                         x512f avx512dq rdseed adx smap avx512ifma clflushopt clwb avx512cd sha_ni avx512bw avx512vl xsaveopt xsavec
                          xgetbv1 xsaves wbnoinvd arat avx512vbmi pku ospke avx512_vbmi2 gfni vaes vpclmulqdq avx512_vnni avx512_bit
                         alg avx512_vpopcntdq rdpid fsrm arch_capabilities
Virtualization features:
  Hypervisor vendor:     KVM
  Virtualization type:   full
Caches (sum of all):
  L1d:                   96 KiB (2 instances)
  L1i:                   64 KiB (2 instances)
  L2:                    2.5 MiB (2 instances)
  L3:                    48 MiB (1 instance)
NUMA:
  NUMA node(s):          1
  NUMA node0 CPU(s):     0-3
Vulnerabilities:
  Itlb multihit:         Not affected
  L1tf:                  Not affected
  Mds:                   Not affected
  Meltdown:              Not affected
  Mmio stale data:       Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown
  Retbleed:              Not affected
  Spec store bypass:     Vulnerable
  Spectre v1:            Mitigation; usercopy/swapgs barriers and __user pointer sanitization
  Spectre v2:            Mitigation; Enhanced IBRS, RSB filling, PBRSB-eIBRS SW sequence
  Srbds:                 Not affected
  Tsx async abort:       Not affected
Comment 1 yunmeng365524 2023-05-29 16:27:21 UTC
定位结论:测试环境导致,应该是前面某个用例有残留导致。
分析测试用例:
do_test()
{
	local entry_name="ARP"
	[ "$TST_IPV6" ] && entry_name="NDISC"

	tst_res TINFO "stress auto-creation $entry_name cache entry deleted with '$CMD' $NUMLOOPS times"

	for i in $(seq 1 $NUMLOOPS); do

		ping$TST_IPV6 -q -c1 $(tst_ipaddr rhost) -I $(tst_iface) > /dev/null || \
			tst_brk TFAIL "cannot ping $(tst_ipaddr rhost)"

		local k
		local ret=1
		for k in $(seq 1 30); do
			$SHOW_CMD | grep -q $(tst_ipaddr rhost)
			if [ $? -eq 0 ]; then
				ret=0
				break;
			fi
			tst_sleep 100ms
		done

		[ "$ret" -ne 0 ] && \
			tst_brk TFAIL "$entry_name entry '$(tst_ipaddr rhost)' not listed"

		$DEL_CMD

		$SHOW_CMD | grep -q "$(tst_ipaddr rhost).*$(tst_hwaddr rhost)" && \
			tst_brk TFAIL "'$DEL_CMD' failed, entry has " \
				"$(tst_hwaddr rhost)' $i/$NUMLOOPS"
	done

	tst_res TPASS "verified adding/removing $entry_name cache entry"
}

可以看出失败在ping$TST_IPV6 -q -c1 $(tst_ipaddr rhost) -I $(tst_iface) 这里。
看日志是ping失败了。查看环境上的ip信息:
[root@qibo-anolis23-nightly-func-arm-1 ltp]# ip add
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    inet 127.0.0.1/8 scope host lo
       valid_lft forever preferred_lft forever
    inet 10.0.0.1/32 scope global lo
       valid_lft forever preferred_lft forever
    inet6 ::1/128 scope host
       valid_lft forever preferred_lft forever
2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq state UP group default qlen 1000
    link/ether 00:16:3e:34:47:5e brd ff:ff:ff:ff:ff:ff
    altname enp5s0
    inet 172.16.0.38/24 brd 172.16.0.255 scope global dynamic noprefixroute eth0
       valid_lft 315293108sec preferred_lft 315293108sec
    inet6 fe80::dc7d:8781:b38d:5b5c/64 scope link noprefixroute
       valid_lft forever preferred_lft forever
4: gre0@NONE: <NOARP> mtu 1476 qdisc noop state DOWN group default qlen 1000
    link/gre 0.0.0.0 brd 0.0.0.0
5: gretap0@NONE: <BROADCAST,MULTICAST> mtu 1462 qdisc noop state DOWN group default qlen 1000
    link/ether 00:00:00:00:00:00 brd ff:ff:ff:ff:ff:ff
6: erspan0@NONE: <BROADCAST,MULTICAST> mtu 1450 qdisc noop state DOWN group default qlen 1000
    link/ether 00:00:00:00:00:00 brd ff:ff:ff:ff:ff:ff
8: ip6tnl0@NONE: <NOARP> mtu 1452 qdisc noop state DOWN group default qlen 1000
    link/tunnel6 :: brd :: permaddr 2e08:8864:4a59::
9: ip6gre0@NONE: <NOARP> mtu 1448 qdisc noop state DOWN group default qlen 1000
    link/gre6 :: brd :: permaddr 66d:96ce:d010::
20: veth-outside@if19: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000
    link/ether b6:a8:60:97:7f:06 brd ff:ff:ff:ff:ff:ff link-netns test-ns
    inet6 fe80::b4a8:60ff:fe97:7f06/64 scope link
       valid_lft forever preferred_lft forever
31: tunl0@NONE: <NOARP> mtu 1480 qdisc noop state DOWN group default qlen 1000
    link/ipip 0.0.0.0 brd 0.0.0.0
32: sit0@NONE: <NOARP> mtu 1480 qdisc noop state DOWN group default qlen 1000
    link/sit 0.0.0.0 brd 0.0.0.0
33: ip6_vti0@NONE: <NOARP> mtu 1428 qdisc noop state DOWN group default qlen 1000
    link/tunnel6 :: brd :: permaddr 2ed8:9818:d58d::
34: ip_vti0@NONE: <NOARP> mtu 1480 qdisc noop state DOWN group default qlen 1000
    link/ipip 0.0.0.0 brd 0.0.0.0
53: ifb0: <BROADCAST,NOARP> mtu 1500 qdisc noop state DOWN group default qlen 32
    link/ether 2e:5b:2a:a0:f5:b5 brd ff:ff:ff:ff:ff:ff
54: ifb1: <BROADCAST,NOARP> mtu 1500 qdisc noop state DOWN group default qlen 32
    link/ether 0e:a1:c5:ab:74:47 brd ff:ff:ff:ff:ff:ff
1394: ltp_ns_veth2@if1395: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000
    link/ether fe:3d:85:46:fc:49 brd ff:ff:ff:ff:ff:ff link-netns ltp_ns
    inet 10.0.0.2/24 scope global ltp_ns_veth2
       valid_lft forever preferred_lft forever
    inet6 fd00:1:1:1::2/64 scope global nodad
       valid_lft forever preferred_lft forever
    inet6 fe80::fc3d:85ff:fe46:fc49/64 scope link
       valid_lft forever preferred_lft forever

可以看到lo接口上多了一个10.0.0.1的ip,按照case的设置,这个ip是remote addr。
删除这个ip:
ip add del 10.0.0.1/32 dev lo
再次执行通过:

<<<test_start>>>
tag=ipneigh01_ip stime=1685348456
cmdline="ipneigh01.sh -c ip"
contacts=""
analysis=exit
<<<test_output>>>
incrementing stop
ipneigh01 1 TINFO: initialize 'lhost' 'ltp_ns_veth2' interface
ipneigh01 1 TINFO: add local addr 10.0.0.2/24
ipneigh01 1 TINFO: add local addr fd00:1:1:1::2/64
ipneigh01 1 TINFO: initialize 'rhost' 'ltp_ns_veth1' interface
ipneigh01 1 TINFO: add remote addr 10.0.0.1/24
ipneigh01 1 TINFO: add remote addr fd00:1:1:1::1/64
ipneigh01 1 TINFO: Network config (local -- remote):
ipneigh01 1 TINFO: ltp_ns_veth2 -- ltp_ns_veth1
ipneigh01 1 TINFO: 10.0.0.2/24 -- 10.0.0.1/24
ipneigh01 1 TINFO: fd00:1:1:1::2/64 -- fd00:1:1:1::1/64
ipneigh01 1 TINFO: timeout per run is 0h 5m 0s
ipneigh01 1 TINFO: stress auto-creation ARP cache entry deleted with 'ip' 50 times
ipneigh01 1 TPASS: verified adding/removing ARP cache entry

Summary:
passed   1
failed   0
broken   0
skipped  0
warnings 0
<<<execution_status>>>
initiation_status="ok"
duration=1 termination_type=exited termination_id=0 corefile=no
cutime=73 cstime=25
<<<test_end>>>
INFO: ltp-pan reported all tests PASS
LTP Version: 20220527-692-g147bb6499

       ###############################################################

            Done executing testcases.
            LTP Version:  20220527-692-g147bb6499
       ###############################################################

[root@qibo-anolis23-nightly-func-arm-1 ltp]#
Comment 2 yunmeng365524 2023-06-07 11:08:02 UTC
*** Bug 4819 has been marked as a duplicate of this bug. ***
Comment 3 yunmeng365524 2023-06-07 11:10:25 UTC
*** Bug 4820 has been marked as a duplicate of this bug. ***