.. _vpp_performance_tests_release_notes:

Release Notes
=============

Changes in |csit-release|
-------------------------

#. VPP PERFORMANCE TESTS

   - **Regressions with DPDK drivers**: The change from DPDK v21.08 to
     DPDK v21.11 introduced a regression across all tests using DPDK
     drivers (with dpdk_plugin loaded). Compared to the previous VPP
     release, performance changed by -15% to -6%, depending on the
     test. The regression is related to updated MTU checks within the
     DPDK code and the associated VPP code changes. See
     `VPP v22.02 release notes <https://s3-docs.fd.io/vpp/22.02/aboutvpp/releasenotes/v22.02.html>`_
     and :ref:`vpp_known_issues`.

   - **A number of CSIT 9000B frame tests are failing**: Tests with
     higher encapsulation overhead fail because they exceed the default
     Ethernet Maximum Frame Size value, which has been reduced by
     MTU-related VPP code changes. See
     `VPP v22.02 release notes <https://s3-docs.fd.io/vpp/22.02/aboutvpp/releasenotes/v22.02.html>`_
     and :ref:`vpp_known_issues`.

   - **Intel Xeon Ice Lake**: Performance test data for these platforms
     is now provided by testbeds newly installed in FD.io CSIT labs.
     For details about the physical setup, see
     :ref:`physical_testbeds_2n_icx` and
     :ref:`physical_testbeds_3n_icx`.

   - **Reduction of tests**: Certain test variations executed
     iteratively for the report (as well as in daily and weekly
     trending) have been removed because the physical testbeds were
     overloaded.

#. TEST FRAMEWORK

   - **CSIT test environment**: The version has been updated to
     version 9; see :ref:`test_environment_versioning`.

   - **CSIT PAPI support**: Due to issues with PAPI performance and
     the deprecation of VAT, the VPP CLI is used in CSIT for many VPP
     scale tests. See :ref:`vpp_known_issues`.

   - **General Code Housekeeping**: Ongoing code optimizations and bug
     fixes.

#. PRESENTATION AND ANALYTICS LAYER

   - **Graph improvements**: Updated Packet Latency graphs;
     see :ref:`latency_methodology`.

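The percentage range quoted for the DPDK regression is a relative change
against the previous release. A minimal sketch of that calculation,
assuming one throughput value per test is available for each release. The
function name and the sample numbers are hypothetical; they are neither
CSIT code nor measured data.

.. code-block:: python

    def relative_change_percent(previous, current):
        """Return the change from previous to current, in percent."""
        if previous <= 0:
            raise ValueError("previous must be positive")
        return (current - previous) * 100.0 / previous

    # Made-up throughput values in Mpps, for illustration only.
    previous_release = 20.0
    current_release = 17.0
    print(f"{relative_change_percent(previous_release, current_release):.1f}%")

A negative result means the current release is slower than the previous
one for that test.
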
.. raw:: latex

    \clearpage

.. _vpp_known_issues:

Known Issues
------------

New
___

+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
|  # | JiraID                                  | Issue Description                                                                                         |
+====+=========================================+===========================================================================================================+
|  1 | `CSIT-1799                              | All NAT44-ED 16M scale tests fail while setting NAT44 address range.                                      |
|    | <https://jira.fd.io/browse/CSIT-1799>`_ |                                                                                                           |
+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
|  2 | `CSIT-1800                              | All Geneve L3 mode scale tests (1024 tunnels) are failing.                                                |
|    | <https://jira.fd.io/browse/CSIT-1800>`_ |                                                                                                           |
+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
|  3 | `CSIT-1801                              | 9000B payload frames not forwarded over tunnels due to violating supported Max Frame Size (VxLAN, LISP,   |
|    | <https://jira.fd.io/browse/CSIT-1801>`_ | SRv6).                                                                                                    |
+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
|  4 | `CSIT-1808                              | All tests with 9000B payload frames not forwarded over memif interfaces.                                  |
|    | <https://jira.fd.io/browse/CSIT-1808>`_ |                                                                                                           |
+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
|  5 | `CSIT-1809                              | All tests with 9000B payload frames not forwarded over vhostuser interfaces.                              |
|    | <https://jira.fd.io/browse/CSIT-1809>`_ |                                                                                                           |
+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
|  6 | `CSIT-1802                              | AF-XDP - NDR tests failing from time to time.                                                             |
|    | <https://jira.fd.io/browse/CSIT-1802>`_ |                                                                                                           |
+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
|  7 | `CSIT-1803                              | 3n-icx testbeds (Icelake): all IMIX aes128cbc-hmac512sha tests are failing due to excessive packet loss.  |
|    | <https://jira.fd.io/browse/CSIT-1803>`_ |                                                                                                           |
+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
|  8 | `CSIT-1804                              | 3n-tsh testbed (Taishan): all tests are failing with small packet drops at NDR with medium frequency.     |
|    | <https://jira.fd.io/browse/CSIT-1804>`_ |                                                                                                           |
+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
|  9 | `CSIT-1805                              | 3n-skx testbeds (Skylake): all hoststack vppecho-Bps scale tests are failing.                             |
|    | <https://jira.fd.io/browse/CSIT-1805>`_ |                                                                                                           |
+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
| 10 | `CSIT-1806                              | 3n-skx testbeds (Skylake): all LISP ip6-ip6 tests are failing due to address overflow drops.              |
|    | <https://jira.fd.io/browse/CSIT-1806>`_ |                                                                                                           |
+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
| 11 | `CSIT-1810                              | DPDK - performance regression with DPDK driver when Max Frame Size is set to less than 2023.              |
|    | <https://jira.fd.io/browse/CSIT-1810>`_ |                                                                                                           |
|    +-----------------------------------------+                                                                                                           |
|    | `VPP-1876                               |                                                                                                           |
|    | <https://jira.fd.io/browse/VPP-1876>`_  |                                                                                                           |
+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+


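Issue 3 in the table above is a case of the frame size after encapsulation
exceeding the supported Max Frame Size. A minimal sketch of that check,
assuming the frame size and the limit are counted the same way. The
function name and the limit value are hypothetical and are not taken from
VPP or CSIT; only the VXLAN-over-IPv4 header sizes are standard values.

.. code-block:: python

    # Standard header sizes in bytes for VXLAN over IPv4 (no VLAN tag).
    OUTER_ETHERNET = 14
    OUTER_IPV4 = 20
    OUTER_UDP = 8
    VXLAN_HEADER = 8
    VXLAN_IPV4_OVERHEAD = OUTER_ETHERNET + OUTER_IPV4 + OUTER_UDP + VXLAN_HEADER

    def fits_max_frame_size(frame_size, encap_overhead, max_frame_size):
        """Return True if the encapsulated frame stays within the limit."""
        return frame_size + encap_overhead <= max_frame_size

    # Assumed limit, for illustration only; the real value depends on the
    # VPP and interface configuration.
    ASSUMED_MAX_FRAME_SIZE = 9000

    for name, overhead in (("no tunnel", 0), ("VXLAN/IPv4", VXLAN_IPV4_OVERHEAD)):
        ok = fits_max_frame_size(9000, overhead, ASSUMED_MAX_FRAME_SIZE)
        print(f"{name}: overhead={overhead}B fits={ok}")

With these assumed numbers the plain 9000B frame fits, while the same
frame carried over a VXLAN tunnel does not.
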
Previous
________

Issues reported in previous releases which still affect the current results.

+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
|  # | JiraID                                  | Issue Description                                                                                         |
+====+=========================================+===========================================================================================================+
|  3 | `CSIT-1782                              | Multicore AVF tests are failing when trying to create interface.                                          |
|    | <https://jira.fd.io/browse/CSIT-1782>`_ | Frequency is reduced by CSIT workaround, but occasional failures do still happen.                         |
+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
|  4 | `CSIT-1671                              | All CSIT scale tests cannot use PAPI due to much slower performance compared to VAT/CLI (it takes much    |
|    | <https://jira.fd.io/browse/CSIT-1671>`_ | longer to program VPP). This needs to be addressed on the PAPI side.                                      |
|    +-----------------------------------------+ Currently, the time critical code uses VAT running large files with exec statements and CLI commands.     |
|    | `VPP-1763                               | Still, we needed to reduce the number of scale tests run to keep overall duration reasonable.             |
|    | <https://jira.fd.io/browse/VPP-1763>`_  | More improvements needed to achieve sufficient configuration speed.                                       |
+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
|  5 | `CSIT-1789                              | IPSEC SW async scheduler tests show bad behavior.                                                         |
|    | <https://jira.fd.io/browse/CSIT-1789>`_ | VPP code is not behaving correctly when crypto workers are the bottleneck.                                |
|    +-----------------------------------------+                                                                                                           |
|    | `VPP-1998                               |                                                                                                           |
|    | <https://jira.fd.io/browse/VPP-1998>`_  |                                                                                                           |
+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
|  6 | `CSIT-1785                              | NAT44ED tests failing to establish all TCP sessions.                                                      |
|    | <https://jira.fd.io/browse/CSIT-1785>`_ | At least for max scale, in allotted time (limited by session 500s timeout) due to worse                   |
|    +-----------------------------------------+ slow path performance than previously measured and calibrated for.                                        |
|    | `VPP-1972                               | CSIT removed the max scale NAT tests to avoid this issue.                                                 |
|    | <https://jira.fd.io/browse/VPP-1972>`_  |                                                                                                           |
+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
|  7 | `CSIT-1791                              | Performance regression in RDMA tests, due to CSIT environment changes.                                    |
|    | <https://jira.fd.io/browse/CSIT-1791>`_ | Two symptoms: 1. 10-20% regression across most tests. 2. DUT performance cap just below 38 Mpps.          |
+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+

Fixed
_____

Issues reported in previous releases which were fixed in this release:

+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
|  # | JiraID                                  | Issue Description                                                                                         |
+====+=========================================+===========================================================================================================+
|  1 | `CSIT-1789                              | AVF driver does not perform RSS in a deterministic way.                                                   |
|    | <https://jira.fd.io/browse/CSIT-1789>`_ | VPP now uses the same RSS key with AVF driver as with DPDK driver.                                        |
+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
|  2 | `CSIT-1786                              | IP4 and IP6 scale tests failing with no traffic forwarded.                                                |
|    | <https://jira.fd.io/browse/CSIT-1786>`_ | CSIT replaced the old single VAT command by a file full of "exec" CLI commands executed by VAT.           |
+----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+

Root Cause Analysis for Performance Changes
-------------------------------------------

List of RCAs in |csit-release| for VPP performance changes:

+----+-----------------------------------------+------------------------------------------------------------+
|  # | JiraID                                  | Issue Description                                          |
+====+=========================================+============================================================+
|  0 |                                         | To be updated when descriptions in Jira tickets are ready. |
+----+-----------------------------------------+------------------------------------------------------------+