arch/sparc/net/bpf_jit_comp_64.c · 99d363976ae4ed1002367be50a928467f806458f · openeuler / raspberrypi-kernel

[ Upstream commit c44768a33da81b4a0986e79bbf0588f1a0651dec ]

On T4 and later sparc64 CPUs we can use the fused compare and branch instruction.

However, it can only be used if the branch destination is in the range of a signed 10-bit immediate offset. This amounts to a window of 1024 instructions around the branch (512 backwards, 511 forwards).

After the commit referenced in the Fixes: tag, the largest possible program size seen by the JIT explodes by a significant factor.

As a result, convergence takes many more passes, since the expanded "BPF_LDX | BPF_MSH | BPF_B" code sequence, for example, contains several embedded branch on condition instructions.

On each pass, as new fused compare and branch instances suddenly become valid, the image shrinks, and this brings thousands more branches into range for the next pass. And so on and so forth.

This is best exemplified by "BPF_MAXINSNS: exec all MSH", which takes 35 passes to converge and shrinks the image by about 64K.

To decrease the cost of this number of convergence passes, do the convergence passes before we have the program image allocated, just like other JITs (such as x86) do.

Fixes: ("bpf: implement ld_abs/ld_ind in native bpf")
Signed-off-by: David S. Miller <davem@davemloft.net>
Signed-off-by: Alexei Starovoitov <ast@kernel.org>
Signed-off-by: Sasha Levin <sashal@kernel.org>
Signed-off-by: Yang Yingliang <yangyingliang@huawei.com>
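The idea of converging on a size before allocating the image can be illustrated with a small user-space sketch. This is not the kernel code: `struct toy_insn`, `toy_size_pass()` and `toy_converge()` are hypothetical names, and the model simply assumes that a branch costs one native word when its displacement fits a signed 10-bit field and two words otherwise. Each pass re-sizes the program from the offsets left by the previous pass, and memory for the image is requested only once the length stops changing.

```c
#include <stdbool.h>
#include <stdlib.h>

/* Hypothetical toy instruction: 'target' is the index of the branch
 * destination, or -1 for a non-branch that expands to 'size' words. */
struct toy_insn {
	int target;
	int size;
};

/* True if 'off' fits a signed 10-bit field (-512..511). */
static bool fits_simm10(int off)
{
	return off >= -512 && off <= 511;
}

/* One sizing pass. offset[i] is where insn i starts and offset[n] is
 * the image length. Offsets are updated in place, so forward targets
 * are read from the previous pass and backward targets from this one.
 * Returns the new image length in words. */
static int toy_size_pass(const struct toy_insn *prog, int n, int *offset)
{
	int pos = 0;

	for (int i = 0; i < n; i++) {
		int len = prog[i].size;

		if (prog[i].target >= 0) {
			int disp = offset[prog[i].target] - offset[i];

			/* Assumed cost model: fused form 1 word, else 2. */
			len = fits_simm10(disp) ? 1 : 2;
		}
		offset[i] = pos;
		pos += len;
	}
	offset[n] = pos;
	return pos;
}

/* Run sizing passes until the length stops changing, and only then
 * allocate the image. Returns the final length, or -1 on failure. */
static int toy_converge(const struct toy_insn *prog, int n, int max_passes,
			int *passes_out, unsigned int **image_out)
{
	int *offset = calloc(n + 1, sizeof(*offset));
	int prev, len = 0, pass;

	if (!offset)
		return -1;

	/* Pessimistic start: every branch takes the two-word form. */
	for (int i = 0; i < n; i++) {
		offset[i] = len;
		len += prog[i].target >= 0 ? 2 : prog[i].size;
	}
	offset[n] = len;

	for (pass = 1; pass <= max_passes; pass++) {
		prev = len;
		len = toy_size_pass(prog, n, offset);
		if (len == prev)
			break;	/* converged */
	}
	free(offset);
	if (pass > max_passes)
		return -1;

	/* Single allocation, sized from the converged length. */
	*image_out = calloc(len, sizeof(**image_out));
	if (!*image_out)
		return -1;
	*passes_out = pass;
	return len;
}
```

For a four-instruction toy program `{ {3, 0}, {2, 0}, {-1, 508}, {-1, 1} }`, the inner branch shrinks on the first pass, which pulls the outer branch into range on the second, and the third pass confirms that nothing changed. That cascade is the effect the commit message describes, on a much smaller scale.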

bpf_jit_comp_64.c 37.0 KB
