提交 · 168a2cca81438aef819e43feb161614488dee97b · openeuler / Kernel

21 5月, 2020 9 次提交

ipv6: do compat setsockopt for MCAST_MSFILTER directly · 168a2cca

由 Al Viro 提交于 3月 30, 2020

similar to the ipv4 counterpart of that patch - the same
trick used to align the tail array properly.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

168a2cca

A
ip6_mc_msfilter(): pass the address list separately · d59eb177
由 Al Viro 提交于 3月 30, 2020
```
that way we'll be able to reuse it for compat case
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
d59eb177

ipv4: do compat setsockopt for MCAST_MSFILTER directly · 2e041728

由 Al Viro 提交于 3月 30, 2020

Parallel to what the native setsockopt() does, except that unlike
the native setsockopt() we do not use memdup_user() - we want
the sockaddr_storage fields properly aligned, so we allocate
4 bytes more and copy compat_group_filter at the offset 4,
which yields the proper alignments.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

2e041728

A
set_mcast_msfilter(): take the guts of setsockopt(MCAST_MSFILTER) into a helper · e986d4da
由 Al Viro 提交于 3月 29, 2020
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
e986d4da

get rid of compat_mc_getsockopt() · 0dfe6581

由 Al Viro 提交于 3月 29, 2020

now we can do MCAST_MSFILTER in compat ->getsockopt() without
playing silly buggers with copying things back and forth.
We can form a native struct group_filter (sans the variable-length
tail) on stack, pass that + pointer to the tail of original request
to the helper doing the bulk of the work, then do the rest of
copyout - same as the native getsockopt() does.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

0dfe6581

ip*_mc_gsfget(): lift copyout of struct group_filter into callers · 931ca7ab

由 Al Viro 提交于 3月 29, 2020

pass the userland pointer to the array in its tail, so that part
gets copied out by our functions; copyout of everything else is
done in the callers.  Rationale: reuse for compat; the array
is the same in native and compat, the layout of parts before it
is different for compat.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

931ca7ab

compat_ip{,v6}_setsockopt(): enumerate MCAST_... options explicitly · e9c375fb

由 Al Viro 提交于 5月 09, 2020

We want to check if optname is among the MCAST_... ones; do that as
an explicit switch.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

e9c375fb

lift compat definitions of mcast [sg]etsockopt requests into net/compat.h · 63287de6

由 Al Viro 提交于 5月 09, 2020

We want to get rid of compat_mc_[sg]etsockopt() and to have that stuff
handled without compat_alloc_user_space(), extra copying through
userland, etc.  To do that we'll need ipv4 and ipv6 instances of
->compat_[sg]etsockopt() to manipulate the 32bit variants of mcast
requests, so we need to move the definitions of those out of net/compat.c
and into a public header.

This patch just does a mechanical move to include/net/compat.h
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

63287de6

rds: fix crash in rds_info_getsockopt() · f78cdbd7

由 John Hubbard 提交于 5月 20, 2020

The conversion to pin_user_pages() had a bug: it overlooked
the case of allocation of pages failing. Fix that by restoring
an equivalent check.

Reported-by: syzbot+118ac0af4ac7f785a45b@syzkaller.appspotmail.com
Fixes: dbfe7d74 ("rds: convert get_user_pages() --> pin_user_pages()")

Cc: David S. Miller <davem@davemloft.net>
Cc: Jakub Kicinski <kuba@kernel.org>
Cc: netdev@vger.kernel.org
Cc: linux-rdma@vger.kernel.org
Cc: rds-devel@oss.oracle.com
Signed-off-by: NJohn Hubbard <jhubbard@nvidia.com>
Acked-by: NSantosh Shilimkar <santosh.shilimkar@oracle.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

f78cdbd7

20 5月, 2020 15 次提交

net: unexport skb_gro_receive() · 4f65e2f4

由 Eric Dumazet 提交于 5月 19, 2020

skb_gro_receive() used to be used by SCTP, it is no longer the case.

skb_gro_receive_list() is in the same category : never used from modules.
Signed-off-by: NEric Dumazet <edumazet@google.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

4f65e2f4

ipv6: use ->ndo_tunnel_ctl in addrconf_set_dstaddr · 8e3db0bb

由 Christoph Hellwig 提交于 5月 19, 2020

Use the new ->ndo_tunnel_ctl instead of overriding the address limit
and using ->ndo_do_ioctl just to do a pointless user copy.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

8e3db0bb

ipv6: streamline addrconf_set_dstaddr · 68ad6886

由 Christoph Hellwig 提交于 5月 19, 2020

Factor out a addrconf_set_sit_dstaddr helper for the actual work if we
found a SIT device, and only hold the rtnl lock around the device lookup
and that new helper, as there is no point in holding it over a
copy_from_user call.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

68ad6886

ipv6: stub out even more of addrconf_set_dstaddr if SIT is disabled · f0988460

由 Christoph Hellwig 提交于 5月 19, 2020

There is no point in copying the structure from userspace or looking up
a device if SIT support is not disabled and we'll eventually return
-ENODEV anyway.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

f0988460

sit: impement ->ndo_tunnel_ctl · f60fe2df

由 Christoph Hellwig 提交于 5月 19, 2020

Implement the ->ndo_tunnel_ctl method, and use ip_tunnel_ioctl to
handle userspace requests for the SIOCGETTUNNEL, SIOCADDTUNNEL,
SIOCCHGTUNNEL and SIOCDELTUNNEL ioctls.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

f60fe2df

sit: refactor ipip6_tunnel_ioctl · fd5d687b

由 Christoph Hellwig 提交于 5月 19, 2020

Split the ioctl handler into one function per command instead of having
a all the logic sit in one giant switch statement.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

fd5d687b

impr: use ->ndo_tunnel_ctl in ipmr_new_tunnel · c7e36705

由 Christoph Hellwig 提交于 5月 19, 2020

Use the new ->ndo_tunnel_ctl instead of overriding the address limit
and using ->ndo_do_ioctl just to do a pointless user copy.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

c7e36705

net: add a new ndo_tunnel_ioctl method · 607259a6

由 Christoph Hellwig 提交于 5月 19, 2020

This method is used to properly allow kernel callers of the IPv4 route
management ioctls.  The exsting ip_tunnel_ioctl helper is renamed to
ip_tunnel_ctl to better reflect that it doesn't directly implement ioctls
touching user memory, and is used for the guts of ndo_tunnel_ctl
implementations. A new ip_tunnel_ioctl helper is added that can be wired
up directly to the ndo_do_ioctl method and takes care of the copy to and
from userspace.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

607259a6

ipv4: consolidate the VIFF_TUNNEL handling in ipmr_new_tunnel · c1fd1182

由 Christoph Hellwig 提交于 5月 19, 2020

Also move the dev_set_allmulti call and the error handling into the
ioctl helper. This allows reusing already looked up tunnel_dev pointer
and the set up argument structure for the deletion in the error handler.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

c1fd1182

ipv4: streamline ipmr_new_tunnel · c384b8a7

由 Christoph Hellwig 提交于 5月 19, 2020

Reduce a few level of indentation to simplify the function.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

c384b8a7

net/af_iucv: clean up function prototypes · e9a36ca5

由 Julian Wiedmann 提交于 5月 19, 2020

Remove a bunch of forward declarations (trivially shifting code around
where needed), and make a few functions static.
Signed-off-by: NJulian Wiedmann <jwi@linux.ibm.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

e9a36ca5

net/af_iucv: remove a redundant zero initialization · dca1262f

由 Julian Wiedmann 提交于 5月 19, 2020

txmsg is declared as {0}, no need to clear individual fields later on.
Signed-off-by: NJulian Wiedmann <jwi@linux.ibm.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

dca1262f

net/af_iucv: replace open-coded U16_MAX · 0d1c7664

由 Julian Wiedmann 提交于 5月 19, 2020

Improve the readability of a range check.
Signed-off-by: NJulian Wiedmann <jwi@linux.ibm.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

0d1c7664

net/af_iucv: remove pm support · 585bc220

由 Julian Wiedmann 提交于 5月 19, 2020

commit 39421627 ("s390: remove broken hibernate / power management support")
removed support for ARCH_HIBERNATION_POSSIBLE from s390.

So drop the unused pm ops from the s390-only af_iucv socket code.
Signed-off-by: NJulian Wiedmann <jwi@linux.ibm.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

585bc220

net/iucv: remove pm support · 4b32f86b

由 Julian Wiedmann 提交于 5月 19, 2020

commit 39421627 ("s390: remove broken hibernate / power management support")
removed support for ARCH_HIBERNATION_POSSIBLE from s390.

So drop the unused pm ops from the s390-only iucv bus driver.

CC: Hendrik Brueckner <brueckner@linux.ibm.com>
Signed-off-by: NJulian Wiedmann <jwi@linux.ibm.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

4b32f86b

19 5月, 2020 4 次提交

ipv4,appletalk: move SIOCADDRT and SIOCDELRT handling into ->compat_ioctl · dc13c876

由 Christoph Hellwig 提交于 5月 18, 2020

To prepare removing the global routing_ioctl hack start lifting the code
into the ipv4 and appletalk ->compat_ioctl handlers.  Unlike the existing
handler we don't bother copying in the name - there are no compat issues for
char arrays.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

dc13c876

appletalk: factor out a atrtr_ioctl_addrt helper · a5004923

由 Christoph Hellwig 提交于 5月 18, 2020

Add a helper than can be shared with the upcoming compat ioctl handler.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

a5004923

ipv6: move SIOCADDRT and SIOCDELRT handling into ->compat_ioctl · 3986912f

由 Christoph Hellwig 提交于 5月 18, 2020

To prepare removing the global routing_ioctl hack start lifting the code
into a newly added ipv6 ->compat_ioctl handler.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

3986912f

ipv6: lift copy_from_user out of ipv6_route_ioctl · 7c1552da

由 Christoph Hellwig 提交于 5月 18, 2020

Prepare for better compat ioctl handling by moving the user copy out
of ipv6_route_ioctl.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

7c1552da

18 5月, 2020 8 次提交

rds: convert get_user_pages() --> pin_user_pages() · dbfe7d74

由 John Hubbard 提交于 5月 16, 2020

This code was using get_user_pages_fast(), in a "Case 2" scenario
(DMA/RDMA), using the categorization from [1]. That means that it's
time to convert the get_user_pages_fast() + put_page() calls to
pin_user_pages_fast() + unpin_user_pages() calls.

There is some helpful background in [2]: basically, this is a small
part of fixing a long-standing disconnect between pinning pages, and
file systems' use of those pages.

[1] Documentation/core-api/pin_user_pages.rst

[2] "Explicit pinning of user-space pages":
    https://lwn.net/Articles/807108/

Cc: David S. Miller <davem@davemloft.net>
Cc: Jakub Kicinski <kuba@kernel.org>
Cc: netdev@vger.kernel.org
Cc: linux-rdma@vger.kernel.org
Cc: rds-devel@oss.oracle.com
Signed-off-by: NJohn Hubbard <jhubbard@nvidia.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

dbfe7d74

net: allow __skb_ext_alloc to sleep · 4930f483

由 Florian Westphal 提交于 5月 16, 2020

mptcp calls this from the transmit side, from process context.
Allow a sleeping allocation instead of unconditional GFP_ATOMIC.
Acked-by: NPaolo Abeni <pabeni@redhat.com>
Signed-off-by: NFlorian Westphal <fw@strlen.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

4930f483

mptcp: remove inner wait loop from mptcp_sendmsg_frag · 5c826443

由 Florian Westphal 提交于 5月 16, 2020

previous patches made sure we only call into this function
when these prerequisites are met, so no need to wait on the
subflow socket anymore.

Closes: https://github.com/multipath-tcp/mptcp_net-next/issues/7Acked-by: NPaolo Abeni <pabeni@redhat.com>
Signed-off-by: NFlorian Westphal <fw@strlen.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

5c826443

mptcp: fill skb page frag cache outside of mptcp_sendmsg_frag · 17091708

由 Florian Westphal 提交于 5月 16, 2020

The mptcp_sendmsg_frag helper contains a loop that will wait on the
subflow sk.

It seems preferrable to only wait in mptcp_sendmsg() when blocking io is
requested.  mptcp_sendmsg already has such a wait loop that is used when
no subflow socket is available for transmission.

This is another preparation patch that makes sure we call
mptcp_sendmsg_frag only if the page frag cache has been refilled.

Followup patch will remove the wait loop from mptcp_sendmsg_frag().

The retransmit worker doesn't need to do this refill as it won't
transmit new mptcp-level data.
Acked-by: NPaolo Abeni <pabeni@redhat.com>
Signed-off-by: NFlorian Westphal <fw@strlen.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

17091708

mptcp: fill skb extension cache outside of mptcp_sendmsg_frag · 149f7c71

由 Florian Westphal 提交于 5月 16, 2020

The mptcp_sendmsg_frag helper contains a loop that will wait on the
subflow sk.

It seems preferrable to only wait in mptcp_sendmsg() when blocking io is
requested.  mptcp_sendmsg already has such a wait loop that is used when
no subflow socket is available for transmission.

This is a preparation patch that makes sure we call
mptcp_sendmsg_frag only if a skb extension has been allocated.

Moreover, such allocation currently uses GFP_ATOMIC while it
could use sleeping allocation instead.

Followup patches will remove the wait loop from mptcp_sendmsg_frag()
and will allow to do a sleeping allocation for the extension.
Acked-by: NPaolo Abeni <pabeni@redhat.com>
Signed-off-by: NFlorian Westphal <fw@strlen.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

149f7c71

mptcp: avoid blocking in tcp_sendpages · 72511aab

由 Florian Westphal 提交于 5月 16, 2020

The transmit loop continues to xmit new data until an error is returned
or all data was transmitted.

For the blocking i/o case, this means that tcp_sendpages() may block on
the subflow until more space becomes available, i.e. we end up sleeping
with the mptcp socket lock held.

Instead we should check if a different subflow is ready to be used.

This restarts the subflow sk lookup when the tx operation succeeded
and the tcp subflow can't accept more data or if tcp_sendpages
indicates -EAGAIN on a blocking mptcp socket.

In that case we also need to set the NOSPACE bit to make sure we get
notified once memory becomes available.

In case all subflows are busy, the existing logic will wait until a
subflow is ready, releasing the mptcp socket lock while doing so.

The mptcp worker already sets DONTWAIT, so no need to make changes there.

v2:
 * set NOSPACE bit
 * add a comment to clarify that mptcp-sk sndbuf limits need to
   be checked as well.
Signed-off-by: NFlorian Westphal <fw@strlen.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

72511aab

mptcp: break and restart in case mptcp sndbuf is full · fb529e62

由 Florian Westphal 提交于 5月 16, 2020

Its not enough to check for available tcp send space.

We also hold on to transmitted data for mptcp-level retransmits.
Right now we will send more and more data if the peer can ack data
at the tcp level fast enough, since that frees up tcp send buffer space.

But we also need to check that data was acked and reclaimed at the mptcp
level.

Therefore add needed check in mptcp_sendmsg, flush tcp data and
wait until more mptcp snd space becomes available if we are over the
limit.  Before we wait for more data, also make sure we start the
retransmit timer if we ran out of sndbuf space.

Otherwise there is a very small chance that we wait forever:

 * receiver is waiting for data
 * sender is blocked because mptcp socket buffer is full
 * at tcp level, all data was acked
 * mptcp-level snd_una was not updated, because last ack
   that acknowledged the last data packet carried an older
   MPTCP-ack.

Restarting the retransmit timer avoids this problem: if TCP
subflow is idle, data is retransmitted from the RTX queue.

New data will make the peer send a new, updated MPTCP-Ack.
Signed-off-by: NFlorian Westphal <fw@strlen.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

fb529e62

mptcp: move common nospace-pattern to a helper · a0e17064

由 Florian Westphal 提交于 5月 16, 2020

Paolo noticed that ssk_check_wmem() has same pattern, so add/use
common helper for both places.
Suggested-by: NPaolo Abeni <pabeni@redhat.com>
Signed-off-by: NFlorian Westphal <fw@strlen.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

a0e17064

17 5月, 2020 4 次提交

ethtool: don't call set_channels in drivers if config didn't change · 75c36dbb

由 Jakub Kicinski 提交于 5月 15, 2020

Don't call drivers if nothing changed. Netlink code already
contains this logic.
Signed-off-by: NJakub Kicinski <kuba@kernel.org>
Reviewed-by: NMichal Kubecek <mkubecek@suse.cz>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

75c36dbb

ethtool: check if there is at least one channel for TX/RX in the core · 7be92514

由 Jakub Kicinski 提交于 5月 15, 2020

Having a channel config with no ability to RX or TX traffic is
clearly wrong. Check for this in the core so the drivers don't
have to.
Signed-off-by: NJakub Kicinski <kuba@kernel.org>
Reviewed-by: NMichal Kubecek <mkubecek@suse.cz>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

7be92514

mptcp: Use 32-bit DATA_ACK when possible · a0c1d0ea

由 Christoph Paasch 提交于 5月 14, 2020

RFC8684 allows to send 32-bit DATA_ACKs as long as the peer is not
sending 64-bit data-sequence numbers. The 64-bit DSN is only there for
extreme scenarios when a very high throughput subflow is combined with a
long-RTT subflow such that the high-throughput subflow wraps around the
32-bit sequence number space within an RTT of the high-RTT subflow.

It is thus a rare scenario and we should try to use the 32-bit DATA_ACK
instead as long as possible. It allows to reduce the TCP-option overhead
by 4 bytes, thus makes space for an additional SACK-block. It also makes
tcpdumps much easier to read when the DSN and DATA_ACK are both either
32 or 64-bit.
Signed-off-by: NChristoph Paasch <cpaasch@apple.com>
Reviewed-by: NMatthieu Baerts <matthieu.baerts@tessares.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

a0c1d0ea

netns: enable to inherit devconf from current netns · 9efd6a3c

由 Nicolas Dichtel 提交于 5月 13, 2020

The goal is to be able to inherit the initial devconf parameters from the
current netns, ie the netns where this new netns has been created.

This is useful in a containers environment where /proc/sys is read only.
For example, if a pod is created with specifics devconf parameters and has
the capability to create netns, the user expects to get the same parameters
than his 'init_net', which is not the real init_net in this case.
Signed-off-by: NNicolas Dichtel <nicolas.dichtel@6wind.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

9efd6a3c

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功