77361825bb01 - platform/external/linux-kselftest

commit	77361825bb01ecadf3ac8622e2e4dbc28806e858	[log] [tgz]
author	Jesper Dangaard Brouer <brouer@redhat.com>	Fri Apr 12 17:07:32 2019 +0200
committer	Alexei Starovoitov <ast@kernel.org>	Wed Apr 17 19:09:24 2019 -0700
tree	c521a9c061e9cc31281cc613cb9ed7b11e517286
parent	00967e84f742f87603e769529628e32076ade188 [diff]

bpf: cpumap use ptr_ring_consume_batched

Move ptr_ring dequeue outside loop, that allocate SKBs and calls network
stack, as these operations that can take some time. The ptr_ring is a
communication channel between CPUs, where we want to reduce/limit any
cacheline bouncing.

Do a concentrated bulk dequeue via ptr_ring_consume_batched, to shorten the
period and times the remote cacheline in ptr_ring is read

Batch size 8 is both to (1) limit BH-disable period, and (2) consume one
cacheline on 64-bit archs. After reducing the BH-disable section further
then we can consider changing this, while still thinking about L1 cacheline
size being active.

Signed-off-by: Jesper Dangaard Brouer <brouer@redhat.com>
Acked-by: Song Liu <songliubraving@fb.com>
Signed-off-by: Alexei Starovoitov <ast@kernel.org>

kernel/bpf/cpumap.c[diff]

1 file changed

tree: c521a9c061e9cc31281cc613cb9ed7b11e517286