Skip to content
  1. Jan 04, 2024
    • Andrii Staikov's avatar
      i40e: Restore VF MSI-X state during PCI reset · 371e576f
      Andrii Staikov authored
      During a PCI FLR the MSI-X Enable flag in the VF PCI MSI-X capability
      register will be cleared. This can lead to issues when a VF is
      assigned to a VM because in these cases the VF driver receives no
      indication of the PF PCI error/reset and additionally it is incapable
      of restoring the cleared flag in the hypervisor configuration space
      without fully reinitializing the driver interrupt functionality.
      
      Since the VF driver is unable to easily resolve this condition on its own,
      restore the VF MSI-X flag during the PF PCI reset handling.
      
      Fixes: 19b7960b
      
       ("i40e: implement split PCI error reset handler")
      Co-developed-by: default avatarKaren Ostrowska <karen.ostrowska@intel.com>
      Signed-off-by: default avatarKaren Ostrowska <karen.ostrowska@intel.com>
      Co-developed-by: default avatarMateusz Palczewski <mateusz.palczewski@intel.com>
      Signed-off-by: default avatarMateusz Palczewski <mateusz.palczewski@intel.com>
      Reviewed-by: default avatarWojciech Drewek <wojciech.drewek@intel.com>
      Reviewed-by: default avatarPrzemek Kitszel <przemyslaw.kitszel@intel.com>
      Signed-off-by: default avatarAndrii Staikov <andrii.staikov@intel.com>
      Tested-by: default avatarRafal Romanowski <rafal.romanowski@intel.com>
      Signed-off-by: default avatarTony Nguyen <anthony.l.nguyen@intel.com>
      371e576f
    • Ke Xiao's avatar
      i40e: fix use-after-free in i40e_aqc_add_filters() · 6a15584e
      Ke Xiao authored
      Commit 3116f59c ("i40e: fix use-after-free in
      i40e_sync_filters_subtask()") avoided use-after-free issues,
      by increasing refcount during update the VSI filter list to
      the HW. However, it missed the unicast situation.
      
      When deleting an unicast FDB entry, the i40e driver will release
      the mac_filter, and i40e_service_task will concurrently request
      firmware to add the mac_filter, which will lead to the following
      use-after-free issue.
      
      Fix again for both netdev->uc and netdev->mc.
      
      BUG: KASAN: use-after-free in i40e_aqc_add_filters+0x55c/0x5b0 [i40e]
      Read of size 2 at addr ffff888eb3452d60 by task kworker/8:7/6379
      
      CPU: 8 PID: 6379 Comm: kworker/8:7 Kdump: loaded Tainted: G
      Workqueue: i40e i40e_service_task [i40e]
      Call Trace:
       dump_stack+0x71/0xab
       print_address_description+0x6b/0x290
       kasan_report+0x14a/0x2b0
       i40e_aqc_add_filters+0x55c/0x5b0 [i40e]
       i40e_sync_vsi_filters+0x1676/0x39c0 [i40e]
       i40e_service_task+0x1397/0x2bb0 [i40e]
       process_one_work+0x56a/0x11f0
       worker_thread+0x8f/0xf40
       kthread+0x2a0/0x390
       ret_from_fork+0x1f/0x40
      
      Allocated by task 21948:
       kasan_kmalloc+0xa6/0xd0
       kmem_cache_alloc_trace+0xdb/0x1c0
       i40e_add_filter+0x11e/0x520 [i40e]
       i40e_addr_sync+0x37/0x60 [i40e]
       __hw_addr_sync_dev+0x1f5/0x2f0
       i40e_set_rx_mode+0x61/0x1e0 [i40e]
       dev_uc_add_excl+0x137/0x190
       i40e_ndo_fdb_add+0x161/0x260 [i40e]
       rtnl_fdb_add+0x567/0x950
       rtnetlink_rcv_msg+0x5db/0x880
       netlink_rcv_skb+0x254/0x380
       netlink_unicast+0x454/0x610
       netlink_sendmsg+0x747/0xb00
       sock_sendmsg+0xe2/0x120
       __sys_sendto+0x1ae/0x290
       __x64_sys_sendto+0xdd/0x1b0
       do_syscall_64+0xa0/0x370
       entry_SYSCALL_64_after_hwframe+0x65/0xca
      
      Freed by task 21948:
       __kasan_slab_free+0x137/0x190
       kfree+0x8b/0x1b0
       __i40e_del_filter+0x116/0x1e0 [i40e]
       i40e_del_mac_filter+0x16c/0x300 [i40e]
       i40e_addr_unsync+0x134/0x1b0 [i40e]
       __hw_addr_sync_dev+0xff/0x2f0
       i40e_set_rx_mode+0x61/0x1e0 [i40e]
       dev_uc_del+0x77/0x90
       rtnl_fdb_del+0x6a5/0x860
       rtnetlink_rcv_msg+0x5db/0x880
       netlink_rcv_skb+0x254/0x380
       netlink_unicast+0x454/0x610
       netlink_sendmsg+0x747/0xb00
       sock_sendmsg+0xe2/0x120
       __sys_sendto+0x1ae/0x290
       __x64_sys_sendto+0xdd/0x1b0
       do_syscall_64+0xa0/0x370
       entry_SYSCALL_64_after_hwframe+0x65/0xca
      
      Fixes: 3116f59c ("i40e: fix use-after-free in i40e_sync_filters_subtask()")
      Fixes: 41c445ff
      
       ("i40e: main driver core")
      Signed-off-by: default avatarKe Xiao <xiaoke@sangfor.com.cn>
      Signed-off-by: default avatarDing Hui <dinghui@sangfor.com.cn>
      Cc: Di Zhu <zhudi2@huawei.com>
      Reviewed-by: default avatarJan Sokolowski <jan.sokolowski@intel.com>
      Reviewed-by: default avatarSimon Horman <horms@kernel.org>
      Reviewed-by: default avatarJacob Keller <jacob.e.keller@intel.com>
      Tested-by: Pucha Himasekhar Reddy <himasekharx.reddy.pucha@intel.com> (A Contingent worker at Intel)
      Signed-off-by: default avatarTony Nguyen <anthony.l.nguyen@intel.com>
      6a15584e
  2. Jan 03, 2024
  3. Jan 02, 2024
  4. Jan 01, 2024
  5. Dec 30, 2023
    • David Thompson's avatar
      mlxbf_gige: fix receive packet race condition · dcea1bd4
      David Thompson authored
      Under heavy traffic, the BlueField Gigabit interface can
      become unresponsive. This is due to a possible race condition
      in the mlxbf_gige_rx_packet function, where the function exits
      with producer and consumer indices equal but there are remaining
      packet(s) to be processed. In order to prevent this situation,
      read receive consumer index *before* the HW replenish so that
      the mlxbf_gige_rx_packet function returns an accurate return
      value even if a packet is received into just-replenished buffer
      prior to exiting this routine. If the just-replenished buffer
      is received and occupies the last RX ring entry, the interface
      would not recover and instead would encounter RX packet drops
      related to internal buffer shortages since the driver RX logic
      is not being triggered to drain the RX ring. This patch will
      address and prevent this "ring full" condition.
      
      Fixes: f92e1869
      
       ("Add Mellanox BlueField Gigabit Ethernet driver")
      Reviewed-by: default avatarAsmaa Mnebhi <asmaa@nvidia.com>
      Signed-off-by: default avatarDavid Thompson <davthompson@nvidia.com>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      dcea1bd4
  6. Dec 29, 2023
  7. Dec 27, 2023
  8. Dec 25, 2023
    • David S. Miller's avatar
      Merge branch 'nfc-refcounting' · dff90e4a
      David S. Miller authored
      
      
      @ 2023-12-19 17:49 Siddh Raman Pant
        2023-12-19 17:49 ` [PATCH net-next v7 1/2] nfc: llcp_core: Hold a ref to llcp_local->dev when holding a ref to llcp_local Siddh Raman Pant
        2023-12-19 17:49 ` [PATCH net-next v7 2/2] nfc: Do not send datagram if socket state isn't LLCP_BOUND Siddh Raman Pant
        0 siblings, 2 replies; 4+ messages in thread
      Siddh Raman Pant says:
      
      ====================
      [PATCH net-next v7 0/2] nfc: Fix UAF during datagram sending caused by missing refcounting
      
      Changes in v7:
      - Stupidly reverted ordering in recv() too, fix that.
      - Remove redundant call to nfc_llcp_sock_free().
      
      Changes in v6:
      - Revert label introduction from v4, and thus also v5 entirely.
      
      Changes in v5:
      - Move reason = LLCP_DM_REJ under the fail_put_sock label.
      - Checkpatch now warns about == NULL check for new_sk, so fix that,
        and also at other similar places in the same function.
      
      Changes in v4:
      - Fix put ordering and comments.
      - Separate freeing in recv() into end labels.
      - Remove obvious comment and add reasoning.
      - Picked up r-bs by Suman.
      
      Changes in v3:
      - Fix missing freeing statements.
      
      Changes in v2:
      - Add net-next in patch subject.
      - Removed unnecessary extra lock and hold nfc_dev ref when holding llcp_sock.
      - Remove last formatting patch.
      - Picked up r-b from Krzysztof for LLCP_BOUND patch.
      
      ---
      
      For connectionless transmission, llcp_sock_sendmsg() codepath will
      eventually call nfc_alloc_send_skb() which takes in an nfc_dev as
      an argument for calculating the total size for skb allocation.
      
      virtual_ncidev_close() codepath eventually releases socket by calling
      nfc_llcp_socket_release() (which sets the sk->sk_state to LLCP_CLOSED)
      and afterwards the nfc_dev will be eventually freed.
      
      When an ndev gets freed, llcp_sock_sendmsg() will result in an
      use-after-free as it
      
      (1) doesn't have any checks in place for avoiding the datagram sending.
      
      (2) calls nfc_llcp_send_ui_frame(), which also has a do-while loop
          which can race with freeing. This loop contains the call to
          nfc_alloc_send_skb() where we dereference the nfc_dev pointer.
      
      nfc_dev is being freed because we do not hold a reference to it when
      we hold a reference to llcp_local. Thus, virtual_ncidev_close()
      eventually calls nfc_release() due to refcount going to 0.
      
      Since state has to be LLCP_BOUND for datagram sending, we can bail out
      early in llcp_sock_sendmsg().
      
      Please review and let me know if any errors are there, and hopefully
      this gets accepted.
      ====================
      
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      dff90e4a
    • Siddh Raman Pant's avatar
      nfc: Do not send datagram if socket state isn't LLCP_BOUND · 6ec0d752
      Siddh Raman Pant authored
      
      
      As we know we cannot send the datagram (state can be set to LLCP_CLOSED
      by nfc_llcp_socket_release()), there is no need to proceed further.
      
      Thus, bail out early from llcp_sock_sendmsg().
      
      Signed-off-by: default avatarSiddh Raman Pant <code@siddh.me>
      Reviewed-by: default avatarKrzysztof Kozlowski <krzysztof.kozlowski@linaro.org>
      Reviewed-by: default avatarSuman Ghosh <sumang@marvell.com>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      6ec0d752
    • Siddh Raman Pant's avatar
      nfc: llcp_core: Hold a ref to llcp_local->dev when holding a ref to llcp_local · c95f9195
      Siddh Raman Pant authored
      
      
      llcp_sock_sendmsg() calls nfc_llcp_send_ui_frame() which in turn calls
      nfc_alloc_send_skb(), which accesses the nfc_dev from the llcp_sock for
      getting the headroom and tailroom needed for skb allocation.
      
      Parallelly the nfc_dev can be freed, as the refcount is decreased via
      nfc_free_device(), leading to a UAF reported by Syzkaller, which can
      be summarized as follows:
      
      (1) llcp_sock_sendmsg() -> nfc_llcp_send_ui_frame()
      	-> nfc_alloc_send_skb() -> Dereference *nfc_dev
      (2) virtual_ncidev_close() -> nci_free_device() -> nfc_free_device()
      	-> put_device() -> nfc_release() -> Free *nfc_dev
      
      When a reference to llcp_local is acquired, we do not acquire the same
      for the nfc_dev. This leads to freeing even when the llcp_local is in
      use, and this is the case with the UAF described above too.
      
      Thus, when we acquire a reference to llcp_local, we should acquire a
      reference to nfc_dev, and release the references appropriately later.
      
      References for llcp_local is initialized in nfc_llcp_register_device()
      (which is called by nfc_register_device()). Thus, we should acquire a
      reference to nfc_dev there.
      
      nfc_unregister_device() calls nfc_llcp_unregister_device() which in
      turn calls nfc_llcp_local_put(). Thus, the reference to nfc_dev is
      appropriately released later.
      
      Reported-and-tested-by: default avatar <syzbot+bbe84a4010eeea00982d@syzkaller.appspotmail.com>
      Closes: https://syzkaller.appspot.com/bug?extid=bbe84a4010eeea00982d
      Fixes: c7aa1225
      
       ("NFC: Take a reference on the LLCP local pointer when creating a socket")
      Reviewed-by: default avatarSuman Ghosh <sumang@marvell.com>
      Signed-off-by: default avatarSiddh Raman Pant <code@siddh.me>
      Reviewed-by: default avatarKrzysztof Kozlowski <krzysztof.kozlowski@linaro.org>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      c95f9195
  9. Dec 22, 2023
    • Linus Torvalds's avatar
      Merge tag 'net-6.7-rc7' of git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net · 7c5e046b
      Linus Torvalds authored
      Pull networking fixes from Paolo Abeni:
       "Including fixes from WiFi and bpf.
      
        Current release - regressions:
      
         - bpf: syzkaller found null ptr deref in unix_bpf proto add
      
         - eth: i40e: fix ST code value for clause 45
      
        Previous releases - regressions:
      
         - core: return error from sk_stream_wait_connect() if sk_wait_event()
           fails
      
         - ipv6: revert remove expired routes with a separated list of routes
      
         - wifi rfkill:
             - set GPIO direction
             - fix crash with WED rx support enabled
      
         - bluetooth:
             - fix deadlock in vhci_send_frame
             - fix use-after-free in bt_sock_recvmsg
      
         - eth: mlx5e: fix a race in command alloc flow
      
         - eth: ice: fix PF with enabled XDP going no-carrier after reset
      
         - eth: bnxt_en: do not map packet buffers twice
      
        Previous releases - always broken:
      
         - core:
             - check vlan filter feature in vlan_vids_add_by_dev() and
               vlan_vids_del_by_dev()
             - check dev->gso_max_size in gso_features_check()
      
         - mptcp: fix inconsistent state on fastopen race
      
         - phy: skip LED triggers on PHYs on SFP modules
      
         - eth: mlx5e:
             - fix double free of encap_header
             - fix slab-out-of-bounds in mlx5_query_nic_vport_mac_list()"
      
      * tag 'net-6.7-rc7' of git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net: (69 commits)
        net: check dev->gso_max_size in gso_features_check()
        kselftest: rtnetlink.sh: use grep_fail when expecting the cmd fail
        net/ipv6: Revert remove expired routes with a separated list of routes
        net: avoid build bug in skb extension length calculation
        net: ethernet: mtk_wed: fix possible NULL pointer dereference in mtk_wed_wo_queue_tx_clean()
        net: stmmac: fix incorrect flag check in timestamp interrupt
        selftests: add vlan hw filter tests
        net: check vlan filter feature in vlan_vids_add_by_dev() and vlan_vids_del_by_dev()
        net: hns3: add new maintainer for the HNS3 ethernet driver
        net: mana: select PAGE_POOL
        net: ks8851: Fix TX stall caused by TX buffer overrun
        ice: Fix PF with enabled XDP going no-carrier after reset
        ice: alter feature support check for SRIOV and LAG
        ice: stop trashing VF VSI aggregator node ID information
        mailmap: add entries for Geliang Tang
        mptcp: fill in missing MODULE_DESCRIPTION()
        mptcp: fix inconsistent state on fastopen race
        selftests: mptcp: join: fix subflow_send_ack lookup
        net: phy: skip LED triggers on PHYs on SFP modules
        bpf: Add missing BPF_LINK_TYPE invocations
        ...
      7c5e046b
  10. Dec 21, 2023