Reduce default queue size to twice the vlib frame size. This throttles
the PP from transferring more packets than are necessary for the next
round of graph execution on a core. When combined with DMA'ing into L3
cache, Mpps for 64B packets increases 35%.
Change-Id: I0afe426d23560f262fdc56accb6302b099bd0076
Signed-off-by: Brian Brooks <brian.brooks@arm.com>
n_outqs = tm->n_vlib_mains;
/* defaults */
- args->tx_q_sz = args->tx_q_sz ? args->tx_q_sz : 2048;
- args->rx_q_sz = args->rx_q_sz ? args->rx_q_sz : 2048;
+ args->tx_q_sz = args->tx_q_sz ? args->tx_q_sz : 2 * VLIB_FRAME_SIZE;
+ args->rx_q_sz = args->rx_q_sz ? args->rx_q_sz : 2 * VLIB_FRAME_SIZE;
if (vec_len (ppm->per_thread_data) == 0)
{