1. 10 10月, 2019 8 次提交
  2. 09 10月, 2019 15 次提交
    • E
      Revert "tun: call dev_get_valid_name() before register_netdevice()" · bacb7e18
      Eric Dumazet 提交于
      This reverts commit 0ad646c8.
      
      As noticed by Jakub, this is no longer needed after
      commit 11fc7d5a ("tun: fix memory leak in error path")
      
      This no longer exports dev_get_valid_name() for the exclusive
      use of tun driver.
      Suggested-by: NJakub Kicinski <jakub.kicinski@netronome.com>
      Signed-off-by: NEric Dumazet <edumazet@google.com>
      Signed-off-by: NJakub Kicinski <jakub.kicinski@netronome.com>
      bacb7e18
    • J
      net: tipc: prepare attrs in __tipc_nl_compat_dumpit() · 6ea67769
      Jiri Pirko 提交于
      __tipc_nl_compat_dumpit() calls tipc_nl_publ_dump() which expects
      the attrs to be available by genl_dumpit_info(cb)->attrs. Add info
      struct and attr parsing in compat dumpit function.
      
      Reported-by: syzbot+8d37c50ffb0f52941a5e@syzkaller.appspotmail.com
      Fixes: 057af707 ("net: tipc: have genetlink code to parse the attrs during dumpit")
      Signed-off-by: NJiri Pirko <jiri@mellanox.com>
      Acked-by: NJon Maloy <jon.maloy@ericsson.com>
      Signed-off-by: NJakub Kicinski <jakub.kicinski@netronome.com>
      6ea67769
    • J
      net: genetlink: always allocate separate attrs for dumpit ops · ab5b526d
      Jiri Pirko 提交于
      Individual dumpit ops (start, dumpit, done) are locked by genl_lock
      if !family->parallel_ops. However, multiple
      genl_family_rcv_msg_dumpit() calls may in in flight in parallel.
      Each has a separate struct genl_dumpit_info allocated
      but they share the same family->attrbuf. Fix this by allocating separate
      memory for attrs for dumpit ops, for non-parallel_ops (for parallel_ops
      it is done already).
      
      Reported-by: syzbot+495688b736534bb6c6ad@syzkaller.appspotmail.com
      Reported-by: syzbot+ff59dc711f2cff879a05@syzkaller.appspotmail.com
      Reported-by: syzbot+dbe02e13bcce52bcf182@syzkaller.appspotmail.com
      Reported-by: syzbot+9cb7edb2906ea1e83006@syzkaller.appspotmail.com
      Fixes: bf813b0a ("net: genetlink: parse attrs and store in contect info struct during dumpit")
      Signed-off-by: NJiri Pirko <jiri@mellanox.com>
      Signed-off-by: NJakub Kicinski <jakub.kicinski@netronome.com>
      ab5b526d
    • J
      Merge branch 'hns3-next' into net-next · 48423dd7
      Jakub Kicinski 提交于
      Huazhong Tan says:
      
      ====================
      This patch-set includes some new features for the HNS3 ethernet
      controller driver.
      
      [patch 01/06] adds support for configuring VF link status on the host.
      
      [patch 02/06] adds support for configuring VF spoof check.
      
      [patch 03/06] adds support for configuring VF trust.
      
      [patch 04/06] adds support for configuring VF bandwidth on the host.
      
      [patch 05/06] adds support for configuring VF MAC on the host.
      
      [patch 06/06] adds support for tx-scatter-gather-fraglist.
      ====================
      Signed-off-by: NJakub Kicinski <jakub.kicinski@netronome.com>
      48423dd7
    • Y
      net: hns3: support tx-scatter-gather-fraglist feature · 8ae10cfb
      Yunsheng Lin 提交于
      The hardware supports up to 8 TX BD for non-tso skb and up to
      63 TX BD for TSO skb. Currently, the hns3 driver supports RX skb
      with fraglist when HW GRO is enabled, when the stack forwards a
      RX skb with fraglist, the stack need to linearize the skb before
      sending to other interface without TX fraglist support.
      
      This patch adds support for TX fraglist. The performance increases
      from 1 GByte to 1.5 GByte for one iperf TCP stream during
      forwarding test after this patch. BTW, the minimum BD number of
      ring should be updated to 72 for supporting TX fraglist.
      
      This patch also changes the error handling of some function that
      called by hns3_fill_desc, which returns BD num when there is no
      error, change some macro to more meaningful name.
      Signed-off-by: NYunsheng Lin <linyunsheng@huawei.com>
      Signed-off-by: NHuazhong Tan <tanhuazhong@huawei.com>
      Signed-off-by: NJakub Kicinski <jakub.kicinski@netronome.com>
      8ae10cfb
    • H
      net: hns3: add support for configuring VF MAC from the host · 8e6de441
      Huazhong Tan 提交于
      This patch adds support of configuring VF MAC from the host
      for the HNS3 driver.
      
      BTW, the parameter init in the hns3_init_mac_addr is
      unnecessary now, since the MAC address will not read from
      NCL_CONFIG when doing reset, so it should be removed,
      otherwise it will affect VF's MAC address initialization.
      Signed-off-by: NHuazhong Tan <tanhuazhong@huawei.com>
      Signed-off-by: NJakub Kicinski <jakub.kicinski@netronome.com>
      8e6de441
    • Y
      net: hns3: add support for configuring bandwidth of VF on the host · ee9e4424
      Yonglong Liu 提交于
      This patch adds support for configuring bandwidth of VF on the host
      for HNS3 drivers.
      Signed-off-by: NYonglong Liu <liuyonglong@huawei.com>
      Signed-off-by: NHuazhong Tan <tanhuazhong@huawei.com>
      Signed-off-by: NJakub Kicinski <jakub.kicinski@netronome.com>
      ee9e4424
    • J
      net: hns3: add support for setting VF trust · e196ec75
      Jian Shen 提交于
      This patch adds supports for setting VF trust by host. If specified
      VF is trusted, then it can enable promisc(include allmulti mode).
      If a trusted VF enabled promisc, and being untrusted, host will
      disable promisc mode for this VF.
      
      For VF will update its promisc mode from set_rx_mode now, so it's
      unnecessary to set broadcst promisc mode when initialization or
      reset.
      Signed-off-by: NJian Shen <shenjian15@huawei.com>
      Signed-off-by: NHuazhong Tan <tanhuazhong@huawei.com>
      Signed-off-by: NJakub Kicinski <jakub.kicinski@netronome.com>
      e196ec75
    • J
      net: hns3: add support for spoof check setting · 22044f95
      Jian Shen 提交于
      This patch adds support for spoof check configuration for VFs.
      When it is enabled, "spoof checking" is done for both mac address
      and VLAN. For each VF, the HW ensures that the source MAC address
      (or VLAN) of every outgoing packet exists in the MAC-list (or
      VLAN-list) configured for RX filtering for that VF. If not,
      the packet is dropped.
      Signed-off-by: NJian Shen <shenjian15@huawei.com>
      Signed-off-by: NHuazhong Tan <tanhuazhong@huawei.com>
      Signed-off-by: NJakub Kicinski <jakub.kicinski@netronome.com>
      22044f95
    • Y
      net: hns3: add support for setting VF link status on the host · 6430f744
      Yufeng Mo 提交于
      This patch adds support to configure VF link properties.
      The options are auto, enable, and disable. Even if the PF
      is down, the communication between VFs will be normal
      if the VFs are set to enable. The commands are as follows:
      
      'ip link set <pf> vf <vf_id> state <auto|enable|disable>'
      change the VF status
      
      'ip link show'
      show the setting status
      Signed-off-by: NYufeng Mo <moyufeng@huawei.com>
      Signed-off-by: NHuazhong Tan <tanhuazhong@huawei.com>
      Signed-off-by: NJakub Kicinski <jakub.kicinski@netronome.com>
      6430f744
    • E
      tun: fix memory leak in error path · 11fc7d5a
      Eric Dumazet 提交于
      syzbot reported a warning [1] that triggered after recent Jiri patch.
      
      This exposes a bug that we hit already in the past (see commit
      ff244c6b ("tun: handle register_netdevice() failures properly")
      for details)
      
      tun uses priv->destructor without an ndo_init() method.
      
      register_netdevice() can return an error, but will
      not call priv->destructor() in some cases. Jiri recent
      patch added one more.
      
      A long term fix would be to transfer the initialization
      of what we destroy in ->destructor() in the ndo_init()
      
      This looks a bit risky given the complexity of tun driver.
      
      A simpler fix is to detect after the failed register_netdevice()
      if the tun_free_netdev() function was called already.
      
      [1]
      ODEBUG: free active (active state 0) object type: timer_list hint: tun_flow_cleanup+0x0/0x280 drivers/net/tun.c:457
      WARNING: CPU: 0 PID: 8653 at lib/debugobjects.c:481 debug_print_object+0x168/0x250 lib/debugobjects.c:481
      Kernel panic - not syncing: panic_on_warn set ...
      CPU: 0 PID: 8653 Comm: syz-executor976 Not tainted 5.4.0-rc1-next-20191004 #0
      Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/01/2011
      Call Trace:
       __dump_stack lib/dump_stack.c:77 [inline]
       dump_stack+0x172/0x1f0 lib/dump_stack.c:113
       panic+0x2dc/0x755 kernel/panic.c:220
       __warn.cold+0x2f/0x3c kernel/panic.c:581
       report_bug+0x289/0x300 lib/bug.c:195
       fixup_bug arch/x86/kernel/traps.c:174 [inline]
       fixup_bug arch/x86/kernel/traps.c:169 [inline]
       do_error_trap+0x11b/0x200 arch/x86/kernel/traps.c:267
       do_invalid_op+0x37/0x50 arch/x86/kernel/traps.c:286
       invalid_op+0x23/0x30 arch/x86/entry/entry_64.S:1028
      RIP: 0010:debug_print_object+0x168/0x250 lib/debugobjects.c:481
      Code: dd 80 b9 e6 87 48 89 fa 48 c1 ea 03 80 3c 02 00 0f 85 b5 00 00 00 48 8b 14 dd 80 b9 e6 87 48 c7 c7 e0 ae e6 87 e8 80 84 ff fd <0f> 0b 83 05 e3 ee 80 06 01 48 83 c4 20 5b 41 5c 41 5d 41 5e 5d c3
      RSP: 0018:ffff888095997a28 EFLAGS: 00010082
      RAX: 0000000000000000 RBX: 0000000000000003 RCX: 0000000000000000
      RDX: 0000000000000000 RSI: ffffffff815cb526 RDI: ffffed1012b32f37
      RBP: ffff888095997a68 R08: ffff8880a92ac580 R09: ffffed1015d04101
      R10: ffffed1015d04100 R11: ffff8880ae820807 R12: 0000000000000001
      R13: ffffffff88fb5340 R14: ffffffff81627110 R15: ffff8880aa41eab8
       __debug_check_no_obj_freed lib/debugobjects.c:963 [inline]
       debug_check_no_obj_freed+0x2d4/0x43f lib/debugobjects.c:994
       kfree+0xf8/0x2c0 mm/slab.c:3755
       kvfree+0x61/0x70 mm/util.c:593
       netdev_freemem net/core/dev.c:9384 [inline]
       free_netdev+0x39d/0x450 net/core/dev.c:9533
       tun_set_iff drivers/net/tun.c:2871 [inline]
       __tun_chr_ioctl+0x317b/0x3f30 drivers/net/tun.c:3075
       tun_chr_ioctl+0x2b/0x40 drivers/net/tun.c:3355
       vfs_ioctl fs/ioctl.c:47 [inline]
       file_ioctl fs/ioctl.c:539 [inline]
       do_vfs_ioctl+0xdb6/0x13e0 fs/ioctl.c:726
       ksys_ioctl+0xab/0xd0 fs/ioctl.c:743
       __do_sys_ioctl fs/ioctl.c:750 [inline]
       __se_sys_ioctl fs/ioctl.c:748 [inline]
       __x64_sys_ioctl+0x73/0xb0 fs/ioctl.c:748
       do_syscall_64+0xfa/0x760 arch/x86/entry/common.c:290
       entry_SYSCALL_64_after_hwframe+0x49/0xbe
      RIP: 0033:0x441439
      Code: e8 9c ae 02 00 48 83 c4 18 c3 0f 1f 80 00 00 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 0f 83 3b 0a fc ff c3 66 2e 0f 1f 84 00 00 00 00
      RSP: 002b:00007fff61c37438 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
      RAX: ffffffffffffffda RBX: 0000000000000003 RCX: 0000000000441439
      RDX: 0000000020000400 RSI: 00000000400454ca RDI: 0000000000000004
      RBP: 00007fff61c37470 R08: 0000000000000001 R09: 0000000100000000
      R10: 0000000000000000 R11: 0000000000000246 R12: ffffffffffffffff
      R13: 0000000000000005 R14: 0000000000000000 R15: 0000000000000000
      Kernel Offset: disabled
      Rebooting in 86400 seconds..
      
      Fixes: ff927412 ("net: introduce name_node struct to be used in hashlist")
      Signed-off-by: NEric Dumazet <edumazet@google.com>
      Cc: Jiri Pirko <jiri@mellanox.com>
      Reported-by: Nsyzbot <syzkaller@googlegroups.com>
      Signed-off-by: NJakub Kicinski <jakub.kicinski@netronome.com>
      11fc7d5a
    • C
      netdevsim: fix spelling mistake "forbidded" -> "forbid" · f9867b51
      Colin Ian King 提交于
      There is a spelling mistake in a NL_SET_ERR_MSG_MOD message. Fix it.
      Signed-off-by: NColin Ian King <colin.king@canonical.com>
      Signed-off-by: NJakub Kicinski <jakub.kicinski@netronome.com>
      f9867b51
    • C
      net: phy: mscc: make arrays static, makes object smaller · c4256794
      Colin Ian King 提交于
      Don't populate const arrays on the stack but instead make them
      static. Makes the object code smaller by 1058 bytes.
      
      Before:
         text	   data	    bss	    dec	    hex	filename
        29879	   6144	      0	  36023	   8cb7	drivers/net/phy/mscc.o
      
      After:
         text	   data	    bss	    dec	    hex	filename
        28437	   6528	      0	  34965	   8895	drivers/net/phy/mscc.o
      
      (gcc version 9.2.1, amd64)
      Signed-off-by: NColin Ian King <colin.king@canonical.com>
      Signed-off-by: NJakub Kicinski <jakub.kicinski@netronome.com>
      c4256794
    • C
      nfp: bpf: make array exp_mask static, makes object smaller · 155283c3
      Colin Ian King 提交于
      Don't populate the array exp_mask on the stack but instead make it
      static. Makes the object code smaller by 224 bytes.
      
      Before:
         text	   data	    bss	    dec	    hex	filename
        77832	   2290	      0	  80122	  138fa	ethernet/netronome/nfp/bpf/jit.o
      
      After:
         text	   data	    bss	    dec	    hex	filename
        77544	   2354	      0	  79898	  1381a	ethernet/netronome/nfp/bpf/jit.o
      
      (gcc version 9.2.1, amd64)
      Signed-off-by: NColin Ian King <colin.king@canonical.com>
      Acked-by: NJakub Kicinski <jakub.kicinski@netronome.com>
      Signed-off-by: NJakub Kicinski <jakub.kicinski@netronome.com>
      155283c3
    • V
      spi: Add a PTP system timestamp to the transfer structure · 79591b7d
      Vladimir Oltean 提交于
      SPI is one of the interfaces used to access devices which have a POSIX
      clock driver (real time clocks, 1588 timers etc). The fact that the SPI
      bus is slow is not what the main problem is, but rather the fact that
      drivers don't take a constant amount of time in transferring data over
      SPI. When there is a high delay in the readout of time, there will be
      uncertainty in the value that has been read out of the peripheral.
      When that delay is constant, the uncertainty can at least be
      approximated with a certain accuracy which is fine more often than not.
      
      Timing jitter occurs all over in the kernel code, and is mainly caused
      by having to let go of the CPU for various reasons such as preemption,
      servicing interrupts, going to sleep, etc. Another major reason is CPU
      dynamic frequency scaling.
      
      It turns out that the problem of retrieving time from a SPI peripheral
      with high accuracy can be solved by the use of "PTP system
      timestamping" - a mechanism to correlate the time when the device has
      snapshotted its internal time counter with the Linux system time at that
      same moment. This is sufficient for having a precise time measurement -
      it is not necessary for the whole SPI transfer to be transmitted "as
      fast as possible", or "as low-jitter as possible". The system has to be
      low-jitter for a very short amount of time to be effective.
      
      This patch introduces a PTP system timestamping mechanism in struct
      spi_transfer. This is to be used by SPI device drivers when they need to
      know the exact time at which the underlying device's time was
      snapshotted. More often than not, SPI peripherals have a very exact
      timing for when their SPI-to-interconnect bridge issues a transaction
      for snapshotting and reading the time register, and that will be
      dependent on when the SPI-to-interconnect bridge figures out that this
      is what it should do, aka as soon as it sees byte N of the SPI transfer.
      Since spi_device drivers are the ones who'd know best how the peripheral
      behaves in this regard, expose a mechanism in spi_transfer which allows
      them to specify which word (or word range) from the transfer should be
      timestamped.
      
      Add a default implementation of the PTP system timestamping in the SPI
      core. This is not going to be satisfactory performance-wise, but should
      at least increase the likelihood that SPI device drivers will use PTP
      system timestamping in the future.
      There are 3 entry points from the core towards the SPI controller
      drivers:
      
      - transfer_one: The driver is passed individual spi_transfers to
        execute. This is the easiest to timestamp.
      
      - transfer_one_message: The core passes the driver an entire spi_message
        (a potential batch of spi_transfers). The core puts the same pre and
        post timestamp to all transfers within a message. This is not ideal,
        but nothing better can be done by default anyway, since the core has
        no insight into how the driver batches the transfers.
      
      - transfer: Like transfer_one_message, but for unqueued drivers (i.e.
        the driver implements its own queue scheduling).
      Signed-off-by: NVladimir Oltean <olteanv@gmail.com>
      Link: https://lore.kernel.org/r/20190905010114.26718-3-olteanv@gmail.comSigned-off-by: NMark Brown <broonie@kernel.org>
      79591b7d
  3. 07 10月, 2019 17 次提交