X-Git-Url: https://gerrit.fd.io/r/gitweb?p=csit.git;a=blobdiff_plain;f=docs%2Freport%2Fvpp_performance_tests%2Fcsit_release_notes.rst;h=2e4377ff1dc5eb9fff18d43da7d84d0db745008c;hp=9c6514326688ec56046686442884ebe8160b56f9;hb=0f680760424b4935983b009bead3e31bb553ad74;hpb=966a5b95c5c24bf0bbbeca7e837743cb92d1144e diff --git a/docs/report/vpp_performance_tests/csit_release_notes.rst b/docs/report/vpp_performance_tests/csit_release_notes.rst index 9c65143266..2e4377ff1d 100644 --- a/docs/report/vpp_performance_tests/csit_release_notes.rst +++ b/docs/report/vpp_performance_tests/csit_release_notes.rst @@ -4,233 +4,133 @@ Release Notes Changes in |csit-release| ------------------------- -#. **VPP Performance Tests** - - - **MRR Throughput**: MRR (Maximum Receive Rate) test code has now - configurable trial duration and number of consecutive executions. - Coverage of MRR tests has been extended across more test - scenarios. MRR tests are used for continuous performance trending - and for comparison between VPP releases. - - - **MLRsearch Throughput**: MLRsearch algorithm has been introduced - for all NDR and PDR throughput tests. All tests that previously - used binary search got converted to MLRsearch. Coverage of NDR/PDR - tests has been extended across more test scenarios. - - - **L2patch Tests**: Tests measure performance of VPP L2patch, the - fastest L2 forwarding path implemented in VPP, that cross-links - RX and TX of two physical interfaces. - - - **2-Node Tests**: A new baseline set of 2-node tests covering base - ip4, ip6, l2patch, l2bd, l2xc, running on new Xeon Skylake - testbeds. - - - **Generated tests**: Simplified and unified test structure, semi- - autogenerated by generator script. Test generator is currently - able to create test combinations with various frame size and - cores combinations. All existing test cases were converted to new - format. 
- - - **Simultaneous Multi-Threading**: SMT-aware detection of server - processor operation mode (HyperThreading enabled/disabled) with - associated compute resource configuration including thread - affinity, number of Rx queues and DPDK I/O mbufs. Tests are - automatically tagged during execution to indicate executed thread - configuration. - - - **Intel Xeon Skylake Support**: Support for 2-Node and 3-Node - physical testbed topologies based on the new Supermicro servers - each with two Intel Xeon Skylake Platinum processors. Full - Ansible playbooks refactor for quick server (re)installation and - reference pointers of configuration. - -#. **Presentation and Analytics Layer** - - - **Performance trending**: Further improved continuous performance - trending with anomaly detection and analysis. - -#. **Test Framework Optimizations** +#. VPP PERFORMANCE TESTS + + - **Intel Xeon 2n-skx, 3n-skx and 2n-clx testbeds**: VPP performance + test data is not included in this report version. This is due to + the lower performance and behaviour inconsistency of these + systems following the upgrade of processor microcode packages + (skx ucode 0x2000064, clx ucode 0x500002c), done as part of + updating Ubuntu 18.04 LTS kernel version. Tested VPP and DPDK + applications (L3fwd) are affected. Skx and Clx test data will be + added in subsequent maintenance report version(s) once the issue + is resolved. See :ref:`vpp_known_issues`. + + - **Service density 2n-skx tests**: Added new NF density tests with + IPsec encryption between DUTs. + + - **AVF tests**: Full test coverage based on code changes in CSIT + core layer (driver/interface awareness) and generated by suite + generator (Intel Fortville NICs only). + + - **Hoststack tests**: Major refactor of VPP Hoststack TCP/IP + performance tests using WRK generator talking to the VPP HTTP + static server plugin measuring connections per second and + requests per second. 
Added new iperf3 with LDPreload tests, + iperf3/LDPreload tests with packet loss induced via the VPP NSIM + (Network Simulator) plugin, and QUIC/UDP/IP transport tests. + All of the new tests measure goodput through the VPP Hoststack + from client to server. + + - **Latency HDRHistogram**: Added High Dynamic Range Histogram + latency measurements based on the new capability in TRex traffic + generator. HDRH latency data is presented in latency packet + percentile graphs and in detailed results tables. + + - **Mellanox MCX556A-EDAT tests**: Added tests with Mellanox + ConnectX5-2p100GE NICs in 2n-clx testbeds using VPP native rdma + driver. + + - **IPsec reconfiguration tests**: Added tests measuring the impact + of IPsec tunnel creations and removals. + + - **Load Balancer tests**: Added VPP performance tests for Maglev, + L3DSR (Direct Server Return), Layer 4 Load Balancing NAT Mode. + +#. TEST FRAMEWORK + + - **CSIT Python3 support**: Full migration of CSIT from Python2.7 to + Python3.6. This change includes library migration, PIP dependency + upgrade, CSIT container images, infrastructure packages + upgrade/installation. + + - **CSIT PAPI support**: Finished conversion of CSIT VAT L1 keywords + to PAPI L1 KWs in CSIT using VPP Python bindings (VPP PAPI). + Redesign of key components of PAPI Socket Executor and PAPI + history. Due to issues with PAPI performance, VAT is still used + in CSIT for all VPP scale tests. See known issues below. + + - **Test Suite Generator**: Added capability to generate suites for + different drivers per NIC model including DPDK, AVF, RDMA. + Extended coverage for all tests. - **General Code Housekeeping**: Ongoing RF keywords optimizations, - removal of redundant RF keywords. 
- -Performance Changes -------------------- - -Relative performance changes in measured NDR, PDR and MRR packet -throughput in |csit-release| are calculated against the test results -from |csit-release-1| report, for tests running on 3-Node Intel Xeon -Haswell testbeds (3n-hsw) in 1-core, 2-core and 4-core (MRR only) -configurations. - -Listed mean and standard deviation values are computed based on a series -of the same tests executed against respective VPP releases to verify -test results repeatability, with percentage change calculated for mean -values. Note that the standard deviation is quite high for a small -number of packet throughput tests, what indicates poor test results -repeatability and makes the relative change of mean throughput value not -fully representative for these tests. The root causes behind poor -results repeatability vary between the test cases. - -NDR Changes -~~~~~~~~~~~ - -NDR throughput changes between releases are available in CSV and pretty -ASCII formats: - - - `CSV 1t1c NDR changes <../_static/vpp/performance-changes-1t1c-ndr.csv>`_, - - `CSV 2t2c NDR changes <../_static/vpp/performance-changes-2t2c-ndr.csv>`_, - - `ASCII 1t1c NDR changes <../_static/vpp/performance-changes-1t1c-ndr.txt>`_, - - `ASCII 2t2c NDR changes <../_static/vpp/performance-changes-2t2c-ndr.txt>`_. - -.. note:: - - Test results have been generated by - `FD.io test executor vpp performance job 3n-hsw`_, - with RF result - files csit-vpp-perf-|srelease|-\*.zip - `archived here <../_static/archive/>`_. - -PDR Changes -~~~~~~~~~~~ - -PDR throughput changes between releases are available in CSV and pretty -ASCII formats: - - - `CSV 1t1c PDR changes <../_static/vpp/performance-changes-1t1c-pdr.csv>`_, - - `CSV 2t2c PDR changes <../_static/vpp/performance-changes-2t2c-pdr.csv>`_, - - `ASCII 1t1c PDR changes <../_static/vpp/performance-changes-1t1c-pdr.txt>`_, - - `ASCII 2t2c PDR changes <../_static/vpp/performance-changes-2t2c-pdr.txt>`_. - -.. 
note:: + removal of redundant RF keywords and aligning of suite/test + setup/teardowns. - Test results have been generated by - `FD.io test executor vpp performance job 3n-hsw`_, - with RF result - files csit-vpp-perf-|srelease|-\*.zip - `archived here <../_static/archive/>`_. -MRR Changes -~~~~~~~~~~~ +#. PRESENTATION AND ANALYTICS LAYER -MRR throughput changes between releases are available in CSV and pretty -ASCII formats: + - **Graphs layout improvements**: Improved performance graphs layout + for better readability and maintenance: test grouping, axis + labels, descriptions, other informative decoration. - - `CSV 1t1c MRR changes <../_static/vpp/performance-changes-1t1c-mrr.csv>`_, - - `CSV 2t2c MRR changes <../_static/vpp/performance-changes-2t2c-mrr.csv>`_, - - `CSV 4t4c MRR changes <../_static/vpp/performance-changes-4t4c-mrr.csv>`_, - - `ASCII 1t1c MRR changes <../_static/vpp/performance-changes-1t1c-mrr.txt>`_, - - `ASCII 2t2c MRR changes <../_static/vpp/performance-changes-2t2c-mrr.txt>`_, - - `ASCII 4t4c MRR changes <../_static/vpp/performance-changes-4t4c-mrr.txt>`_. + - **Latency graphs**: Min/Avg/Max group bar latency graphs are + replaced with packet latency percentile distribution at different + background packet loads based on TRex latency hdrhistogram + measurements. -.. note:: +.. + // Alternative Note for 1st Bullet when bad microcode Skx, Clx results are published + - **Intel Xeon 2n-skx, 3n-skx and 2n-clx testbeds**: VPP performance + test data is included in this report version, but it shows lower + performance and behaviour inconsistency of these systems + following the upgrade of processor microcode packages (skx ucode + 0x2000064, clx ucode 0x500002c) as part of updating Ubuntu 18.04 + LTS kernel version. Tested VPP and DPDK applications (L3fwd) are + affected. Skx and Clx test data will be corrected in subsequent + maintenance report version(s) once the issue is resolved. See + :ref:`vpp_known_issues`. 
- Test results have been generated by - `FD.io test executor vpp performance job 3n-hsw`_, - with RF result - files csit-vpp-perf-|srelease|-\*.zip - `archived here <../_static/archive/>`_. +.. raw:: latex -Skx vs. Hsw Comparison ----------------------- + \clearpage -Relative performance comparison in measured NDR, PDR and MRR packet -throughput is calculated for tests executed on 3-Node Skylake (3n-skx) -and 3-Node Haswell (3n-hsw) physical testbed types in 1-core -configurations. - -NDR Comparison -~~~~~~~~~~~~~~ - -NDR comparison between testbed types is available in CSV and pretty -ASCII formats: - - - `CSV 1c NDR comparison <../_static/vpp/performance-compare-testbeds-3n-hsw-3n-skx-ndr.csv>`_, - - `ASCII 1c NDR comparison <../_static/vpp/performance-compare-testbeds-3n-hsw-3n-skx-ndr.txt>`_. - -.. note:: - - Test results have been generated by - `FD.io test executor vpp performance job 3n-hsw`_ and - `FD.io test executor vpp performance job 3n-skx`_ - with RF result - files csit-vpp-perf-|srelease|-\*.zip - `archived here <../_static/archive/>`_. - -PDR Comparison -~~~~~~~~~~~~~~ - -PDR comparison between testbed types is available in CSV and pretty -ASCII formats: - - - `CSV 1c PDR comparison <../_static/vpp/performance-compare-testbeds-3n-hsw-3n-skx-pdr.csv>`_, - - `ASCII 1c PDR comparison <../_static/vpp/performance-compare-testbeds-3n-hsw-3n-skx-pdr.txt>`_. - -.. note:: - - Test results have been generated by - `FD.io test executor vpp performance job 3n-hsw`_ and - `FD.io test executor vpp performance job 3n-skx`_ - with RF result - files csit-vpp-perf-|srelease|-\*.zip - `archived here <../_static/archive/>`_. - -MRR Comparison -~~~~~~~~~~~~~~ - -MRR comparison between testbed types is available in CSV and pretty -ASCII formats: - - - `CSV 1c MRR comparison <../_static/vpp/performance-compare-testbeds-3n-hsw-3n-skx-mrr.csv>`_, - - `ASCII 1c MRR comparison <../_static/vpp/performance-compare-testbeds-3n-hsw-3n-skx-mrr.txt>`_. - -.. 
note:: - - Test results have been generated by - `FD.io test executor vpp performance job 3n-hsw`_ and - `FD.io test executor vpp performance job 3n-skx`_ - with RF result - files csit-vpp-perf-|srelease|-\*.zip - `archived here <../_static/archive/>`_. - -Throughput Trending -------------------- - -In addition to reporting throughput changes between VPP releases, CSIT -provides continuous performance trending for VPP master branch: - -#. `VPP Performance Dashboard `_ - - per VPP test case throughput trend, trend compliance and summary of - detected anomalies. - -#. `Trending Methodology `_ - - throughput test metrics, trend calculations and anomaly - classification (progression, regression, outlier). - -#. `Trendline Graphs `_ - - per VPP build MRR throughput measurements against the trendline - with anomaly highlights, with associated CSIT test jobs. +.. _vpp_known_issues: Known Issues ------------ List of known issues in |csit-release| for VPP performance tests: -+---+-------------------------------------------------+------------+-----------------------------------------------------------------+ -| # | Issue | Jira ID | Description | -+===+=================================================+============+=================================================================+ -| 1 | Sporadic (1 in 200) NDR discovery test failures | CSIT-570 | DPDK reporting rx-errors, indicating L1 issue. Suspected issue | -| | on x520. | | with HW combination of X710-X520 in LF testbeds. Not observed | -| | | | outside of LF testbeds. | -+---+-------------------------------------------------+------------+-----------------------------------------------------------------+ -| 2 | High failure rate of api call | VPP-1361 | Failure rate: 30-40% of tests failing due to interfaces not | -| | sw_interface_set_flags [admin-up|link-up] | | in link-up state after API call sw_interface_set_flags. 
| -+---+-------------------------------------------------+------------+-----------------------------------------------------------------+ -| 3 | Scale IPSecHW Interface mode throughput | CSIT-1234 | IPSec throughput regression - 1core deltas: NDR -32%, PDR -33%, | -| | regression. | | MRR -38%. Affects IPSec HW Scale 1000tnl tests with Int mode. | -+---+-------------------------------------------------+------------+-----------------------------------------------------------------+ -| 4 | Lower than expected 64B NDR and PDR | CSIT-1242 | NDR and PDR regressions: ip4base -29%. | -| | throughput in VPP ip4base tests | | | -| | with xl710 NIC in 3n-hsw testbeds. | | | -+---+-------------------------------------------------+------------+-----------------------------------------------------------------+ - ++----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+ +| # | JiraID | Issue Description | ++====+=========================================+===========================================================================================================+ +| 1 | `CSIT-570 | Sporadic (1 in 200) NDR discovery test failures on x520. DPDK reporting rx-errors, indicating L1 issue. | +| | `_ | Suspected issue with HW combination of X710-X520 in LF testbeds. Not observed outside of LF testbeds. | ++----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+ +| 2 | `VPP-662 | 9000B packets not supported by NICs VIC1227 and VIC1387. | +| | `_ | | ++----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+ +| 3 | `CSIT-1498 | Memif tests are sporadically failing on initialization of memif connection. 
| +| | `_ | | ++----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+ +| 4 | `VPP-1677 | 9000B ip4 nat44: VPP crash + coredump. | +| | `_ | VPP crashes very often in case that NAT44 is configured and it has to process IP4 jumbo frames (9000B). | ++----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+ +| 5 | `CSIT-1591 | All CSIT scale tests can not use PAPI due to much slower performance compared to VAT/CLI (it takes much | +| | `_ | longer to program VPP). This needs to be addressed on the PAPI side. | +| +-----------------------------------------+ | +| | `VPP-1763 | | +| | `_ | | ++----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+ +| 6 | `VPP-1675 | IPv4 IPSEC 9000B packet tests are failing as no packet is forwarded. | +| | `_ | Reason: chained buffers are not supported. | ++----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+ +| 7 | `CSIT-1593 | IPv4 AVF 9000B packet tests are failing on 3n-skx while passing on 2n-skx. | +| | `_ | | ++----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+ +| 8 | `CSIT-1675 | Intel Xeon 2n-skx, 3n-skx and 2n-clx testbeds behaviour and performance became inconsistent following | +| | `_ | the upgrade to the latest Ubuntu 18.04 LTS kernel version (4.15.0-72-generic) and associated microcode | +| | | packages (skx ucode 0x2000064, clx ucode 0x500002c). VPP as well as DPDK L3fwd tests are affected. 
| ++----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+