Description of problem: 在AMD和Intel的服务器上测试CXL内存扩展性能,测试结果发现在Intel架构的服务器上CXL性能并不理想,Intel架构上是否需要进行特殊配置? 具体情况如下: 一、AMD NF5280A7(AMD EPYC 9654) 1)测试本地内存带宽 numactl -N 1 -m 1 ./mlc --max_bandwidth ALL Reads : 200019.12 3:1 Reads-Writes : 188374.01 2:1 Reads-Writes : 184458.78 1:1 Reads-Writes : 179893.15 Stream-triad like: 184115.71 2)测试CXL内存带宽 numactl -N 1 -m 3 ./mlc --max_bandwidth ALL Reads : 39822.43 3:1 Reads-Writes : 42220.97 2:1 Reads-Writes : 42495.56 1:1 Reads-Writes : 44322.83 Stream-triad like: 41507.47 2)测试本地内存+CXL内存带宽 设置本地内存和CXL内存交织分配比例为6:1 echo 6 > /sys/kernel/mm/mempolicy/weighted_interleave/node1 echo 1 > /sys/kernel/mm/mempolicy/weighted_interleave/node3 numactl -N 1 -w 1,3 ./mlc --max_bandwidth ALL Reads : 228274.52 - 相比只用本地内存带宽提升28G 3:1 Reads-Writes : 218346.08 - 相比只用本地内存带宽提升30G 2:1 Reads-Writes : 214417.09 - 相比只用本地内存带宽提升30G 1:1 Reads-Writes : 209612.49 - 相比只用本地内存带宽提升30G Stream-triad like: 214027.14 - 相比只用本地内存带宽提升30G 测试结果总结:在AMD处理器下,本地内存和CXL内存加权交织之后可以有效提升混合内存的读写性能 二、Intel NF5280G8(Intel(R) Xeon(R) 6766E) 1)测试本地内存带宽 numactl -N 1 -m 1 ./mlc --max_bandwidth ALL Reads : 252742.36 3:1 Reads-Writes : 230513.30 2:1 Reads-Writes : 225615.53 1:1 Reads-Writes : 215417.56 Stream-triad like: 217166.30 2)测试CXL内存带宽 numactl -N 1 -m 3 ./mlc --max_bandwidth ALL Reads : 41463.36 3:1 Reads-Writes : 46854.17 2:1 Reads-Writes : 51032.39 1:1 Reads-Writes : 50032.76 Stream-triad like: 46104.24 2)测试本地内存+CXL内存带宽 设置本地内存和CXL内存交织分配比例为6:1 echo 6 > /sys/kernel/mm/mempolicy/weighted_interleave/node1 echo 1 > /sys/kernel/mm/mempolicy/weighted_interleave/node3 numactl -N 1 -w 1,3 ./mlc --max_bandwidth ALL Reads : 231472.66 3:1 Reads-Writes : 226916.89 2:1 Reads-Writes : 229203.39 1:1 Reads-Writes : 229336.36 Stream-triad like: 202989.51 测试结果总结:在Intel处理器下本地内存和CXL内存加权交织之后并不能提升混合内存的读性能