X-Git-Url: https://gerrit.fd.io/r/gitweb?p=csit.git;a=blobdiff_plain;f=docs%2Freport%2Fvpp_performance_tests%2Fcsit_release_notes.rst;h=17003bc85ac6aaddca84136062a505b94f2c8b77;hp=54fa73dfee3b12633dee6c853398baa11ff5bd8d;hb=cddac498bafc7a6092dade5e183e5c7a95cff64d;hpb=da2aa127ee071e37bc4d6060dd39476179e65365 diff --git a/docs/report/vpp_performance_tests/csit_release_notes.rst b/docs/report/vpp_performance_tests/csit_release_notes.rst index 54fa73dfee..17003bc85a 100644 --- a/docs/report/vpp_performance_tests/csit_release_notes.rst +++ b/docs/report/vpp_performance_tests/csit_release_notes.rst @@ -4,225 +4,122 @@ CSIT Release Notes Changes in CSIT |release| ------------------------- -#. Naming change for all VPP performance test suites and test cases. -#. VPP performance test environment changes - - - upgrade to Ubuntu 16.04 - - VM and vhost-user test environment optimizations - - addition of HW cryptodev devices in LFD FD.io physical testbed - -#. Added tests - - - NICs - - - Intel x710 (L2 Bridge Domain switched-forwarding, vhost-user) - - Cisco VIC1385 (L2 Bridge Domain switched-forwarding) - - Cisco VIC1227 (L2 Bridge Domain switched-forwarding) - - - vhost-user tests with VM - - - L2 Bridge Domain switched-forwarding with Intel x710 NIC - - L2 Bridge Domain switched-forwarding with VxLAN and Intel x520 NIC - - L2 Bridge Domain switched-forwarding with Intel xl710 NIC - - - Tests with VxLAN - - - L2 Bridge Domain switched-forwarding with Intel x520 NIC - - L2 Bridge Domain switched-forwarding with vhost-user, VM and Intel x520 - NIC - -Performance Improvements ------------------------- - -Substantial improvements in measured packet throughput, with relative increase -of double-digit percentage points, have been observed in a number of CSIT -|release| tests listed below. Relative improvements are calculated against the -test results in CSIT rls1609 report. - -NDR Throughput -~~~~~~~~~~~~~~ - -Non-Drop Rate Throughput discovery tests: - -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| VPP Functionality | Test Name | VPP-16.09 | VPP-17.01 | Relative Improvement | -+===================+=================================================================+===========+=================+======================+ -| L2XC | 10ge2p1x520: 64B-1t1c-eth-l2xcbase-ndrdisc | 9.4 Mpps | 12.7 Mpps | 35% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| L2XC | 10ge2p1xl710: 64B-1t1c-eth-l2xcbase-ndrdisc | 9.5 Mpps | 12.2..12.4 Mpps | 28..30% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| L2XC dot1ad | 10ge2p1x520: 64B-1t1c-dot1ad-l2xcbase-ndrdisc | 7.4 Mpps | 8.8..9.0 Mpps | 19..23% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| L2XC dot1q | 10ge2p1x520: 64B-1t1c-dot1q-l2xcbase-ndrdisc | 7.5 Mpps | 8.8..9.0 Mpps | 17..20% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| L2XC VxLAN | 10ge2p1x520: 64B-1t1c-ethip4vxlan-l2xcbase-ndrdisc | 5.4 Mpps | 6.5 Mpps | 20% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| L2XC-vhost-VM | 10ge2p1x520: 64B-1t1c-eth-l2xcbase-eth-2vhost-1vm-ndrdisc | 0.5 Mpps | 2.8..3.2 Mpps | 460..540% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| L2BD | 10ge2p1x520: 64B-1t1c-eth-l2bdbasemaclrn-ndrdisc | 7.8 Mpps | 10.4..10.6 Mpps | 33..36% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| L2BD-vhost-VM | 10ge2p1x520: 64B-1t1c-eth-l2bdbasemaclrn-eth-2vhost-1vm-ndrdisc | 0.4 Mpps | 2.7..2.8 Mpps | 575..600% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv4 | 10ge2p1x520: 64B-1t1c-ethip4-ip4base-ndrdisc | 8.7 Mpps | 9.7 Mpps | 12% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv4 COP | 10ge2p1x520: 64B-1t1c-ethip4-ip4base-copwhtlistbase-ndrdisc | 7.1 Mpps | 8.3..8.5 Mpps | 17..20% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv4 iAcl | 10ge2p1x520: 64B-1t1c-ethip4-ip4base-iacldstbase-ndrdisc | 6.9 Mpps | 7.6..7.8 Mpps | 10..13% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv4 vhost | 10ge2p1x520: 64B-1t1c-ethip4-ip4base-eth-2vhost-1vm-ndrdisc | 0.3 Mpps | 2.6 Mpps | 767% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv6 | 10ge2p1x520: 78B-1t1c-ethip6-ip6base-ndrdisc | 3.0 Mpps | 7.3..7.7 Mpps | 143..157% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ - -PDR Throughput -~~~~~~~~~~~~~~ - -Partial Drop Rate thoughput discovery tests with packet Loss Tolerance of 0.5%: - -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| VPP Functionality | Test Name | VPP-16.09 | VPP-17.01 | Relative Improvement | -+===================+=================================================================+===========+=================+======================+ -| L2XC | 10ge2p1x520: 64B-1t1c-eth-l2xcbase-pdrdisc | 9.4 Mpps | 12.7..12.9 Mpps | 35..37% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| L2XC dot1ad | 10ge2p1x520: 64B-1t1c-dot1ad-l2xcbase-pdrdisc | 7.4 Mpps | 8.8..9.1 Mpps | 19..23% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| L2XC dot1q | 10ge2p1x520: 64B-1t1c-dot1q-l2xcbase-pdrdisc | 7.5 Mpps | 8.8..9.0 Mpps | 17..20% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| L2XC VxLAN | 10ge2p1x520: 64B-1t1c-ethip4vxlan-l2xcbase-pdrdisc | 5.4 Mpps | 6.5 Mpps | 20% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| L2XC-vhost-VM | 10ge2p1x520: 64B-1t1c-eth-l2xcbase-eth-2vhost-1vm-pdrdisc | 2.6 Mpps | 3.2..3.3 Mpps | 23..26% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| L2BD | 10ge2p1x520: 64B-1t1c-eth-l2bdbasemaclrn-pdrdisc | 7.8 Mpps | 10.6 Mpps | 36% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| L2BD-vhost-VM | 10ge2p1x520: 64B-1t1c-eth-l2bdbasemaclrn-eth-2vhost-1vm-pdrdisc | 2.1 Mpps | 2.9 Mpps | 38% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv4 | 10ge2p1x520: 64B-1t1c-ethip4-ip4base-pdrdisc | 8.7 Mpps | 9.7 Mpps | 11% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv4 COP | 10ge2p1x520: 64B-1t1c-ethip4-ip4base-copwhtlistbase-pdrdisc | 7.1 Mpps | 8.3..8.5 Mpps | 17..20% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv4 vhost | 10ge2p1x520: 64B-1t1c-ethip4-ip4base-eth-2vhost-1vm-pdrdisc | 2.0 Mpps | 2.7 Mpps | 35% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ +#. Added VPP performance tests -Measured improvements are in line with VPP code optimizations listed in -`VPP-17.01 release notes -`_. + - **Container Service Chain Topologies Orchestrated by K8s with VPP Memif** -Additionally, vhost-VM performance improvements are due to both VPP code -optimizations as well as due to the FD.io CSIT Linux KVM test environment -optimizations for vhost-VM tests - see section "2.1.7. Methodology: KVM VM -vhost". + - Added tests with VPP vswitch in container connecting a number of VPP- + in-container service chain topologies with L2 Cross-Connect and L2 + Bridge-Domain configurations, orchestrated by Kubernetes. Added + following forwarding topologies: i) "Parallel" with packets flowing from + NIC via VPP to container and back to VPP and NIC; ii) "Chained" (a.k.a. + "Snake") with packets flowing via VPP to container, back to VPP, to next + container, back to VPP and so on until the last container in a chain, + then back to VPP and NIC; iii) "Horizontal" with packets flowing via VPP + to container, then via "horizontal" memif to next container, and so on + until the last container, then back to VPP and NIC; + - **VPP TCP/IP stack** -Other Performance Changes -------------------------- + - Added tests for VPP TCP/IP stack using VPP built-in HTTP server. + WRK traffic generator is used as a client-side; + + - **SRv6** + + - Initial SRv6 (Segment Routing IPv6) tests verifying performance of + IPv6 and SRH (Segment Routing Header) encapsulation, decapsulation, + lookups and rewrites based on configured End and End.DX6 SRv6 egress + functions; + + - **IPSecSW** + + - SW computed IPSec encryption with AES-GCM, CBC-SHA1 ciphers, in + combination with IPv4 routed-forwarding; + +#. Presentation and Analytics Layer + + - Added throughput speedup analysis for multi-core and multi-thread + VPP tests into Presentation and Analytics Layer (PAL) for automated + CSIT test results analysis; + +#. Other changes + + - **Framework optimizations** + + - Ability to run CSIT framework on ARM architecture; + + - Overall stability improvements; + + - **NDR and PDR throughput binary search change** -Other changes in measured packet throughput, with either minor relative -increase or decrease, have been observed in a number of CSIT |release| tests -listed below. Relative changes are calculated against the test results in CSIT -rls1609 report. - -NDR Throughput -~~~~~~~~~~~~~~ - -Non-Drop Rate Throughput discovery tests: - -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| VPP Functionality | Test Name | VPP-16.09 | VPP-17.01 | Relative Change | -+===================+=================================================================+===========+=================+======================+ -| IPv4 FIB 200k | 10ge2p1x520: 64B-1t1c-ethip4-ip4scale200k-ndrdisc | 8.5 Mpps | 9.0 Mpps | 6% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv4 FIB 20k | 10ge2p1x520: 64B-1t1c-ethip4-ip4scale20k-ndrdisc | 8.5 Mpps | 9.0..9.2 Mpps | 6..8% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv4 FIB 2M | 10ge2p1x520: 64B-1t1c-ethip4-ip4scale2m-ndrdisc | 8.5 Mpps | 7.8..8.1 Mpps | -8..-5% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv4 Policer | 10ge2p1x520: 64B-1t1c-ethip4-ip4base-ipolicemarkbase-ndrdisc | 6.9 Mpps | 7.4..7.6 Mpps | 7..10% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv4 LISP | 10ge2p1x520: 64B-1t1c-ethip4lispip4-ip4base-ndrdisc | 4.4 Mpps | 4.8 Mpps | 9% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv6 COP | 10ge2p1x520: 78B-1t1c-ethip6-ip6base-copwhtlistbase-ndrdisc | 6.1 Mpps | 6.1..6.5 Mpps | 0..7% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv6 FIB 200k | 10ge2p1x520: 78B-1t1c-ethip6-ip6scale200k-ndrdisc | 6.5 Mpps | 5.3..5.7 Mpps | -18..-12% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv6 FIB 20k | 10ge2p1x520: 78B-1t1c-ethip6-ip6scale20k-ndrdisc | 6.9 Mpps | 6.5 Mpps | -6% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv6 FIB 2M | 10ge2p1x520: 78B-1t1c-ethip6-ip6scale2m-ndrdisc | 5.3 Mpps | 4.2 Mpps | -21% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv6 iAcl | 10ge2p1x520: 78B-1t1c-ethip6-ip6base-iacldstbase-ndrdisc | 6.5 Mpps | 6.1..6.5 Mpps | -6..0% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ - -PDR Throughput -~~~~~~~~~~~~~~ - -Partial Drop Rate thoughput discovery tests with packet Loss Tolerance of 0.5%: - -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| VPP Functionality | Test Name | VPP-16.09 | VPP-17.01 | Relative Change | -+===================+=================================================================+===========+=================+======================+ -| IPv4 FIB 200k | 10ge2p1x520: 64B-1t1c-ethip4-ip4scale200k-pdrdisc | 8.5 Mpps | 9.0 Mpps | 6% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv4 FIB 20k | 10ge2p1x520: 64B-1t1c-ethip4-ip4scale20k-pdrdisc | 8.5 Mpps | 9.0..9.2 Mpps | 6..8% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv4 FIB 2M | 10ge2p1x520: 64B-1t1c-ethip4-ip4scale2m-pdrdisc | 8.3 Mpps | 8.1 Mpps | -2% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv4 iAcl | 10ge2p1x520: 64B-1t1c-ethip4-ip4base-iacldstbase-pdrdisc | 7.1 Mpps | 7.6..7.8 Mpps | 7..10% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv4 Policer | 10ge2p1x520: 64B-1t1c-ethip4-ip4base-ipolicemarkbase-pdrdisc | 7.1 Mpps | 7.4..7.6 Mpps | 4..7% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv4 LISP | 10ge2p1x520: 64B-1t1c-ethip4lispip4-ip4base-pdrdisc | 4.6 Mpps | 4.8 Mpps | 9% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv6 | 10ge2p1x520: 78B-1t1c-ethip6-ip6base-pdrdisc | 7.7 Mpps | 7.3..7.7 Mpps | -5..0% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv6 COP | 10ge2p1x520: 78B-1t1c-ethip6-ip6base-copwhtlistbase-pdrdisc | 6.1 Mpps | 6.1..6.5 Mpps | 0..7% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv6 FIB 200k | 10ge2p1x520: 78B-1t1c-ethip6-ip6scale200k-pdrdisc | 6.9 Mpps | 5.3..5.7 Mpps | -23..-17% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv6 FIB 20k | 10ge2p1x520: 78B-1t1c-ethip6-ip6scale20k-pdrdisc | 6.9 Mpps | 6.5 Mpps | -6% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv6 FIB 2M | 10ge2p1x520: 78B-1t1c-ethip6-ip6scale2m-pdrdisc | 5.3 Mpps | 4.2 Mpps | -21% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ -| IPv6 iAcl | 10ge2p1x520: 78B-1t1c-ethip6-ip6base-iacldstbase-pdrdisc | 6.5 Mpps | 6.1..6.5 Mpps | -6..0% | -+-------------------+-----------------------------------------------------------------+-----------+-----------------+----------------------+ + - Increased binary search resolution by reducing final step from + 100kpps to 50kpps; + + - **VPP plugin loaded as needed by tests** + + - From this release only plugins required by tests are loaded at + VPP initialization time. Previously all plugins were loaded for + all tests; + +Performance Changes +------------------- + +Relative performance changes in measured packet throughput in CSIT +|release| are calculated against the results from CSIT |release-1| +report. Listed mean and standard deviation values are computed based on +a series of the same tests executed against respective VPP releases to +verify test results repeatibility, with percentage change calculated for +mean values. Note that the standard deviation is quite high for a small +number of packet throughput tests, what indicates poor test results +repeatability and makes the relative change of mean throughput value not +fully representative for these tests. The root causes behind poor +results repeatibility vary between the test cases. + +NDR Throughput Changes +~~~~~~~~~~~~~~~~~~~~~~ + +NDR small packet throughput changes between releases are available in a CSV and +pretty ASCII formats: + + - `csv format for 1t1c <../_static/vpp/performance-changes-ndr-1t1c-full.csv>`_, + - `csv format for 2t2c <../_static/vpp/performance-changes-ndr-2t2c-full.csv>`_, + - `pretty ASCII format for 1t1c <../_static/vpp/performance-changes-ndr-1t1c-full.txt>`_, + - `pretty ASCII format for 2t2c <../_static/vpp/performance-changes-ndr-2t2c-full.txt>`_. + +PDR Throughput Changes +~~~~~~~~~~~~~~~~~~~~~~ + +NDR small packet throughput changes between releases are available in a CSV and +pretty ASCII formats: + + - `csv format for 1t1c <../_static/vpp/performance-changes-pdr-1t1c-full.csv>`_, + - `csv format for 2t2c <../_static/vpp/performance-changes-pdr-2t2c-full.csv>`_, + - `pretty ASCII format for 1t1c <../_static/vpp/performance-changes-pdr-1t1c-full.txt>`_, + - `pretty ASCII format for 2t2c <../_static/vpp/performance-changes-pdr-2t2c-full.txt>`_. + +Measured improvements are in line with VPP code optimizations listed in +`VPP-18.01 release notes +`_. Known Issues ------------ Here is the list of known issues in CSIT |release| for VPP performance tests: -+---+-------------------------------------------------+-----------------------------------------------------------------+ -| # | Issue | Description | -+---+-------------------------------------------------+-----------------------------------------------------------------+ -| 1 | Sporadic IPv4 routed-forwarding NDR discovery | Suspected issue with DPDK IPv4 checksum calculation. | -| | test failures for 1518B frame size | Observed frequency: sporadic, ca. 20% to 30% of test runs. | -+---+-------------------------------------------------+-----------------------------------------------------------------+ -| 2 | Vic1385 and Vic1227 NICs jumbo frames test | Suspected issue with VIC DPDK drivers that do not support jumbo | -| | failures (9000B) | frames (dropped rx-miss). Observed frequency: 100%. | -+---+-------------------------------------------------+-----------------------------------------------------------------+ -| 3 | Vic1385 and Vic1227 performance | Low performance of NDR results. Big difference between NDR and | -| | | PDR. . | -+---+-------------------------------------------------+-----------------------------------------------------------------+ -| 4 | Sporadic NDR discovery test failures on x520 | Suspected issue with HW settings (BIOS, FW) in LF | -| | | infrastructure. Issue can't be replicated outside LF. | -+---+-------------------------------------------------+-----------------------------------------------------------------+ -| 5 | Non-repeatible zig-zagging NDR throughput | Suspected NIC driver issue in multi-thread multi-core operation | -| | results for multi-thread multi-core tests | affectin NDR throughput. Applies to XL710 and X710 NICs tested. | -| | - 2t2c, 4t4c - for some tested NICs. | | -+---+-------------------------------------------------+-----------------------------------------------------------------+ - -Tests to be Added - Cryptodev ------------------------------ - -DPDK Cryptodev functionality support for both SW and HW crypto devices has -been introduced in VPP-17.01 release. CSIT functional and performance tests -have been also developed and merged. However due to the factors beyond CSIT -project control execution of those tests within the LF FD.io test environment -still need to complete. Once the results become available, they will be -published as an addendum to the current version of CSIT |release| report. - -Tests to be Added - SNAT44 --------------------------- - -VPP SNAT44 functionality has been introduced in VPP-17.01 release. CSIT -performance tests are still in development and integration into LFD FD.io test -environment. Once the tests are fully integrated and results become available, -they will be published as an addendum to the current version of CSIT |release| -report. ++---+-------------------------------------------------+------------+-----------------------------------------------------------------+ +| # | Issue | Jira ID | Description | ++---+-------------------------------------------------+------------+-----------------------------------------------------------------+ +| 1 | Vic1385 and Vic1227 low performance. | VPP-664 | Low NDR performance. | +| | | | | ++---+-------------------------------------------------+------------+-----------------------------------------------------------------+ +| 2 | Sporadic (1 in 200) NDR discovery test failures | CSIT-570 | DPDK reporting rx-errors, indicating L1 issue. Suspected issue | +| | on x520. | | with HW combination of X710-X520 in LF testbeds. Not observed | +| | | | outside of LF testbeds. | ++---+-------------------------------------------------+------------+-----------------------------------------------------------------+ +| 3 | Lower than expected NDR throughput with | CSIT-571 | Suspected NIC firmware or DPDK driver issue affecting NDR and | +| | xl710 and x710 NICs, compared to x520 NICs. | | PDR throughput. Applies to XL710 and X710 NICs. | ++---+-------------------------------------------------+------------+-----------------------------------------------------------------+ +| 4 | QAT IPSec scale with 1000 tunnels (interfaces) | VPP-1121 | VPP crashes during configuration of 1000 IPsec tunnels. | +| | in 2t2c config, all tests are failing. | | 1t1c tests are not affected | ++---+-------------------------------------------------+------------+-----------------------------------------------------------------+ +