b8b3419de76989fca18998dd5c2611c906d09214
[csit.git] / docs / report / vpp_performance_tests / csit_release_notes.rst
1 .. _vpp_performance_tests_release_notes:
2
3 Release Notes
4 =============
5
6 Changes in |csit-release|
7 -------------------------
8
9 #. VPP PERFORMANCE TESTS
10
11    - **Regressions with DPDK drivers**: Change from DPDK v21.08 to DPDK
12      v21.11 introduced regression across all tests using dpdk
13      drivers (with dpdk_plugin loaded). Compared to previous VPP
14      release performance drop varies in the range of -15% to -6%,
15      depending on test. It is related to updated MTU checks within
16      DPDK code and associated VPP code changes. See
17      `VPP v2202 release notes <https://s3-docs.fd.io/vpp/22.02/aboutvpp/releasenotes/v22.02.html>`_
18      and :ref:`vpp_known_issues`.
19
20    - **Number of CSIT 9000B frame tests failing**: tests with higher
21      encapsulation overhead are failing due to exceeding default
22      Ethernet Maximum Frame Size value that has been reduced by MTU
23      related VPP code changes. See
24      `VPP v2202 release notes <https://s3-docs.fd.io/vpp/22.02/aboutvpp/releasenotes/v22.02.html>`_
25      and :ref:`vpp_known_issues`.
26
27    - **Intel Xeon Ice Lake**: Performance test data for these platforms
28      is now provided by testbeds newly installed in FD.io CSIT labs.
29      For details about the physical setup see
30      :ref:`physical_testbeds_2n_icx` and
31      :ref:`physical_testbeds_3n_icx`.
32
33    - **Arm Ampere Altra**: Performance test data for these platforms
34      is now provided by testbeds newly installed in FD.io CSIT labs.
35      For details about the physical setup see
36      :ref:`physical_testbeds_3n_alt`.
37
38    - **Reduction of tests**: Removed certain test variations executed
39      iteratively for the report (as well as in daily and weekly
40      trending) due to physical testbeds overload.
41
42 #. TEST FRAMEWORK
43
44    - **CSIT test environment** version has been updated to ver. 10, see
45      :ref:`test_environment_versioning`.
46
47    - **CSIT PAPI support**: Due to issues with PAPI performance, and
48      deprecation of VAT, VPP CLI is used in CSIT for many VPP scale
49      tests. See :ref:`vpp_known_issues`.
50
51    - **General Code Housekeeping**: Ongoing code optimizations and bug
52      fixes.
53
54 #. PRESENTATION AND ANALYTICS LAYER
55
56    - **Graphs improvements**: Updated Packet Latency graphs,
57      see :ref:`latency_methodology`.
58
59 .. raw:: latex
60
61     \clearpage
62
63 .. _vpp_known_issues:
64
65 Known Issues
66 ------------
67
68 New
69 ___
70
71 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
72 |  # | JiraID                                  | Issue Description                                                                                         |
73 +====+=========================================+===========================================================================================================+
74 |  1 | `CSIT-1827                              | 3n-icx, 3n-skx: all AVF crypto tests sporadically fail. 1518B with no traffic, IMIX with excessive        |
75 |    | <https://jira.fd.io/browse/CSIT-1827>`_ | packet loss.                                                                                              |
76 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
77 |  2 | `CSIT-1830                              | All testbeds: All TCP tput (and pps) tests are failing for small packets.                                 |
78 |    | <https://jira.fd.io/browse/CSIT-1830>`_ |                                                                                                           |
79 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
80 |  3 | `CSIT-1832                              | 3n-alt: NDR 1 packet lost on random tests.                                                                |
81 |    | <https://jira.fd.io/browse/CSIT-1832>`_ |                                                                                                           |
82 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
83 |  4 | `CSIT-1834                              | 2n-icx, 2n-skx: sporadic AVF soak tests failing to find critical load with PLRsearch.                     |
84 |    | <https://jira.fd.io/browse/CSIT-1834>`_ |                                                                                                           |
85 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
86 |  5 | `CSIT-1846                              | 2n-skx, 2n-clx, 2n-icx: ALL 1518B TCP tput tests failing with big packet loss.                            |
87 |    | <https://jira.fd.io/browse/CSIT-1846>`_ |                                                                                                           |
88 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
89 |  6 | `CSIT-1847                              | 2n-skx: all 10vm-1t test failed with half of packets dropped.                                             |
90 |    | <https://jira.fd.io/browse/CSIT-1847>`_ |                                                                                                           |
91 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
92 |  7 | `CSIT-1849                              | 2n-skx: UDP 16m tput tests fail to create all sessions.                                                   |
93 |    | <https://jira.fd.io/browse/CSIT-1849>`_ |                                                                                                           |
94 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
95
96 Previous
97 ________
98
99 Issues reported in previous releases which still affect the current results.
100
101 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
102 |  # | JiraID                                  | Issue Description                                                                                         |
103 +====+=========================================+===========================================================================================================+
104 |  1 | `CSIT-1799                              | All NAT44-ED 16M scale tests fail while setting NAT44 address range.                                      |
105 |    | <https://jira.fd.io/browse/CSIT-1799>`_ |                                                                                                           |
106 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
107 |  2 | `CSIT-1800                              | All Geneve L3 mode scale tests (1024 tunnels) are failing.                                                |
108 |    | <https://jira.fd.io/browse/CSIT-1800>`_ |                                                                                                           |
109 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
110 |  3 | `CSIT-1802                              | AF-XDP - NDR tests failing from time to time.                                                             |
111 |    | <https://jira.fd.io/browse/CSIT-1802>`_ |                                                                                                           |
112 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
113 |  4 | `CSIT-1803                              | 3n-icx testbeds (Icelake): all IMIX aes128cbc-hmac512sha tests are failing due to excessive packet loss.  |
114 |    | <https://jira.fd.io/browse/CSIT-1803>`_ |                                                                                                           |
115 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
116 |  5 | `CSIT-1804                              | 3n-tsh, 3n-alt testbed (Taishan, Altra): NDR tests failing from time to time.                             |
117 |    | <https://jira.fd.io/browse/CSIT-1804>`_ |                                                                                                           |
118 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
119 |  6 | `CSIT-1812                              | All IMIX NAT44DET 4m 16m scale tests fail due to not creating required session count.                     |
120 |    | <https://jira.fd.io/browse/CSIT-1812>`_ |                                                                                                           |
121 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
122 |  7 | `CSIT-1782                              | Multicore AVF tests are failing when trying to create interface.                                          |
123 |    | <https://jira.fd.io/browse/CSIT-1782>`_ | Frequency is reduced by CSIT workaround, but occasional failures do still happen.                         |
124 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
125 |  8 | `CSIT-1671                              | All CSIT scale tests can not use PAPI due to much slower performance compared to VAT/CLI (it takes much   |
126 |    | <https://jira.fd.io/browse/CSIT-1671>`_ | longer to program VPP). This needs to be addressed on the PAPI side.                                      |
127 |    +-----------------------------------------+ Currently, the time critical code uses VAT running large files with exec statements and CLI commands.     |
128 |    | `VPP-1763                               | Still, we needed to reduce the number of scale tests run to keep overall duration reasonable.             |
129 |    | <https://jira.fd.io/browse/VPP-1763>`_  | More improvements needed to achieve sufficient configuration speed.                                       |
130 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
131 |  9 | `CSIT-1785                              | NAT44ED tests failing to establish all TCP sessions.                                                      |
132 |    | <https://jira.fd.io/browse/CSIT-1785>`_ | At least for max scale, in allotted time (limited by session 500s timeout) due to worse                   |
133 |    +-----------------------------------------+ slow path performance than previously measured and calibrated for.                                        |
134 |    | `VPP-1972                               | CSIT removed the max scale NAT tests to avoid this issue.                                                 |
135 |    | <https://jira.fd.io/browse/VPP-1972>`_  |                                                                                                           |
136 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
137 | 10 | `CSIT-1801                              | 9000B payload frames not forwarded over tunnels due to violating supported Max Frame Size (VxLAN, LISP,   |
138 |    | <https://jira.fd.io/browse/CSIT-1801>`_ | SRv6).                                                                                                    |
139 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
140 | 11 | `CSIT-1808                              | All tests with 9000B payload frames not forwarded over memif interfaces.                                  |
141 |    | <https://jira.fd.io/browse/CSIT-1808>`_ |                                                                                                           |
142 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
143 | 12 | `CSIT-1809                              | All tests with 9000B payload frames not forwarded over vhostuser interfaces.                              |
144 |    | <https://jira.fd.io/browse/CSIT-1809>`_ |                                                                                                           |
145 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
146
147 Fixed
148 _____
149
150 Issues reported in previous releases which were fixed in this release:
151
152 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
153 |  # | JiraID                                  | Issue Description                                                                                         |
154 +====+=========================================+===========================================================================================================+
155 |  1 | `CSIT-1810                              | DPDK - performance regression with DPDK driver when Max Frame Size is set to less than 2023.              |
156 |    | <https://jira.fd.io/browse/CSIT-1810>`_ |                                                                                                           |
157 |    +-----------------------------------------+                                                                                                           |
158 |    | `VPP-1876                               | Worse performance with DPDK driver when MTU is set to 2022 or less.                                       |
159 |    | <https://jira.fd.io/browse/VPP-1876>`_  |                                                                                                           |
160 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
161 |  2 | `CSIT-1811                              | All 9000B NAT44DET 64k 1m scale tests fail due to bps rate set to high on TRex.                           |
162 |    | <https://jira.fd.io/browse/CSIT-1811>`_ |                                                                                                           |
163 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
164 |  3 | `CSIT-1791                              | Performance regression in RDMA tests, due to CSIT environment changes.                                    |
165 |    | <https://jira.fd.io/browse/CSIT-1791>`_ | Two symptoms: 1. 10-20% regression across most tests. 2. DUT performance cap just below 38 Mpps.          |
166 +----+-----------------------------------------+-----------------------------------------------------------------------------------------------------------+
167
168 Root Cause Analysis for Performance Changes
169 -------------------------------------------
170
171 List of RCAs in |csit-release| for VPP performance changes:
172
173 +----+-----------------------------------------+--------------------------------------------------------------+
174 |  # | JiraID                                  | Issue Description                                            |
175 +====+=========================================+==============================================================+