Skip to content
  1. Dec 24, 2020
  2. Dec 22, 2020
  3. Dec 18, 2020
    • Sebastian Andrzej Siewior's avatar
      blk-mq: Don't complete on a remote CPU in force threaded mode · 71425189
      Sebastian Andrzej Siewior authored
      
      
      With force threaded interrupts enabled, raising softirq from an SMP
      function call will always result in waking the ksoftirqd thread. This is
      not optimal given that the thread runs at SCHED_OTHER priority.
      
      Completing the request in hard IRQ-context on PREEMPT_RT (which enforces
      the force threaded mode) is bad because the completion handler may
      acquire sleeping locks which violate the locking context.
      
      Disable request completing on a remote CPU in force threaded mode.
      
      Signed-off-by: default avatarSebastian Andrzej Siewior <bigeasy@linutronix.de>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Reviewed-by: default avatarDaniel Wagner <dwagner@suse.de>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      71425189
    • Stefan Haberland's avatar
      s390/dasd: fix list corruption of lcu list · 53a7f655
      Stefan Haberland authored
      In dasd_alias_disconnect_device_from_lcu the device is removed from any
      list on the LCU. Afterwards the LCU is removed from the lcu list if it
      does not contain devices any longer.
      
      The lcu->lock protects the lcu from parallel updates. But to cancel all
      workers and wait for completion the lcu->lock has to be unlocked.
      
      If two devices are removed in parallel and both are removed from the LCU
      the first device that takes the lcu->lock again will delete the LCU because
      it is already empty but the second device also tries to free the LCU which
      leads to a list corruption of the lcu list.
      
      Fix by removing the device right before the lcu is checked without
      unlocking the lcu->lock in between.
      
      Fixes: 8e09f215
      
       ("[S390] dasd: add hyper PAV support to DASD device driver, part 1")
      Cc: stable@vger.kernel.org
      Signed-off-by: default avatarStefan Haberland <sth@linux.ibm.com>
      Reviewed-by: default avatarJan Hoeppner <hoeppner@linux.ibm.com>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      53a7f655
    • Stefan Haberland's avatar
      s390/dasd: fix list corruption of pavgroup group list · 0ede91f8
      Stefan Haberland authored
      dasd_alias_add_device() moves devices to the active_devices list in case
      of a scheduled LCU update regardless if they have previously been in a
      pavgroup or not.
      
      Example: device A and B are in the same pavgroup.
      
      Device A has already been in a pavgroup and the private->pavgroup pointer
      is set and points to a valid pavgroup. While going through dasd_add_device
      it is moved from the pavgroup to the active_devices list.
      
      In parallel device B might be removed from the same pavgroup in
      remove_device_from_lcu() which in turn checks if the group is empty
      and deletes it accordingly because device A has already been removed from
      there.
      
      When now device A enters remove_device_from_lcu() it is tried to remove it
      from the pavgroup again because the pavgroup pointer is still set and again
      the empty group will be cleaned up which leads to a list corruption.
      
      Fix by setting private->pavgroup to NULL in dasd_add_device.
      
      If the device has been the last device on the pavgroup an empty pavgroup
      remains but this will be cleaned up by the scheduled lcu_update which
      iterates over all existing pavgroups.
      
      Fixes: 8e09f215
      
       ("[S390] dasd: add hyper PAV support to DASD device driver, part 1")
      Cc: stable@vger.kernel.org
      Signed-off-by: default avatarStefan Haberland <sth@linux.ibm.com>
      Reviewed-by: default avatarJan Hoeppner <hoeppner@linux.ibm.com>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      0ede91f8
    • Stefan Haberland's avatar
      s390/dasd: prevent inconsistent LCU device data · a29ea016
      Stefan Haberland authored
      Prevent _lcu_update from adding a device to a pavgroup if the LCU still
      requires an update. The data is not reliable any longer and in parallel
      devices might have been moved on the lists already.
      This might lead to list corruptions or invalid PAV grouping.
      Only add devices to a pavgroup if the LCU is up to date. Additional steps
      are taken by the scheduled lcu update.
      
      Fixes: 8e09f215
      
       ("[S390] dasd: add hyper PAV support to DASD device driver, part 1")
      Cc: stable@vger.kernel.org
      Signed-off-by: default avatarStefan Haberland <sth@linux.ibm.com>
      Reviewed-by: default avatarJan Hoeppner <hoeppner@linux.ibm.com>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      a29ea016
    • Stefan Haberland's avatar
      s390/dasd: fix hanging device offline processing · 658a337a
      Stefan Haberland authored
      For an LCU update a read unit address configuration IO is required.
      This is started using sleep_on(), which has early exit paths in case the
      device is not usable for IO. For example when it is in offline processing.
      
      In those cases the LCU update should fail and not be retried.
      Therefore lcu_update_work checks if EOPNOTSUPP is returned or not.
      
      Commit 41995342 ("s390/dasd: fix endless loop after read unit address configuration")
      accidentally removed the EOPNOTSUPP return code from
      read_unit_address_configuration(), which in turn might lead to an endless
      loop of the LCU update in offline processing.
      
      Fix by returning EOPNOTSUPP again if the device is not able to perform the
      request.
      
      Fixes: 41995342
      
       ("s390/dasd: fix endless loop after read unit address configuration")
      Cc: stable@vger.kernel.org #5.3
      Signed-off-by: default avatarStefan Haberland <sth@linux.ibm.com>
      Reviewed-by: default avatarJan Hoeppner <hoeppner@linux.ibm.com>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      658a337a
  4. Dec 17, 2020
    • Baolin Wang's avatar
      blk-iocost: Add iocg idle state tracepoint · 76efc1c7
      Baolin Wang authored
      
      
      It will be helpful to trace the iocg's whole state, including active and
      idle state. And we can easily expand the original iocost_iocg_activate
      trace event to support a state trace class, including active and idle
      state tracing.
      
      Acked-by: default avatarTejun Heo <tj@kernel.org>
      Signed-off-by: default avatarBaolin Wang <baolin.wang@linux.alibaba.com>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      76efc1c7
    • Josh Triplett's avatar
      nbd: Respect max_part for all partition scans · 1aba169e
      Josh Triplett authored
      
      
      The creation path of the NBD device respects max_part and only scans for
      partitions if max_part is not 0. However, some other code paths ignore
      max_part, and unconditionally scan for partitions. Add a check for
      max_part on each partition scan.
      
      Signed-off-by: default avatarJosh Triplett <josh@joshtriplett.org>
      Reviewed-by: default avatarJosef Bacik <josef@toxicpanda.com>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      1aba169e
    • Gioh Kim's avatar
      block/rnbd-clt: Does not request pdu to rtrs-clt · 9aaf9a2a
      Gioh Kim authored
      
      
      Previously the rnbd client requested the rtrs to allocate rnbd_iu
      just after the rtrs_iu. So the rnbd client passes the size of
      rnbd_iu for rtrs_clt_open() and rtrs creates an array of
      rnbd_iu and rtrs_iu.
      
      For IO handling, rnbd_iu exists after the request because we pass
      the size of rnbd_iu when setting the tag-set. Therefore we do not
      use the rnbd_iu allocated by rtrs for IO handling.
      We only use the rnbd_iu allocated by rtrs when doing session
      initialization. Almost all rnbd_iu allocated by rtrs are wasted.
      
      By this patch the rnbd client does not request rnbd_iu allocation
      to rtrs but allocate it for itself when doing session initialization.
      
      Also remove unused rtrs_permit_to_pdu from rtrs.
      
      Signed-off-by: default avatarGioh Kim <gi-oh.kim@cloud.ionos.com>
      Signed-off-by: default avatarJack Wang <jinpu.wang@cloud.ionos.com>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      9aaf9a2a
    • Gioh Kim's avatar
      block/rnbd-clt: Dynamically allocate sglist for rnbd_iu · 5a1328d0
      Gioh Kim authored
      
      
      The BMAX_SEGMENT static array for scatterlist is embedded in
      rnbd_iu structure to avoid memory allocation in hot IO path.
      In many cases, we do need only several sg entries because many IOs
      have only several segments.
      
      This patch change rnbd_iu to check the number of segments in the request
      and allocate sglist dynamically.
      
      For io path, use sg_alloc_table_chained to allocate sg list faster.
      
      First it makes two sg entries after pdu of request.
      The sg_alloc_table_chained uses the pre-allocated sg entries
      if the number of segments of the request is less than two.
      So it reduces the number of memory allocation.
      
      Signed-off-by: default avatarGioh Kim <gi-oh.kim@cloud.ionos.com>
      Signed-off-by: default avatarJack Wang <jinpu.wang@cloud.ionos.com>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      5a1328d0
    • Gioh Kim's avatar
      block/rnbd: Set write-back cache and fua same to the target device · 512c781f
      Gioh Kim authored
      
      
      The rnbd-client always sets the write-back cache and fua attributes
      of the rnbd device queue regardless of the target device on the server.
      That generates IO hang issue when the target device does not
      support both of write-back cacne and fua.
      
      This patch adds more fields for the cache policy and fua into the
      device opening message. The rnbd-server sends the information
      if the target device supports the write-back cache and fua
      and rnbd-client recevives it and set the device queue accordingly.
      
      Signed-off-by: default avatarGioh Kim <gi-oh.kim@cloud.ionos.com>
      [jwang: some minor change, rename a few varables, remove unrelated comments.]
      Signed-off-by: default avatarJack Wang <jinpu.wang@cloud.ionos.com>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      512c781f
    • Jack Wang's avatar
      block/rnbd: Fix typos · 3877ece0
      Jack Wang authored
      
      
      Signed-off-by: default avatarJack Wang <jinpu.wang@cloud.ionos.com>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      3877ece0
    • Md Haris Iqbal's avatar
      block/rnbd-srv: Protect dev session sysfs removal · 87019e7d
      Md Haris Iqbal authored
      Since the removal of the session sysfs can also be called from the
      function destroy_sess, there is a need to protect the call from the
      function rnbd_srv_sess_dev_force_close
      
      Fixes: 78699805
      
       ("block/rnbd-srv: close a mapped device from server side.")
      Signed-off-by: default avatarMd Haris Iqbal <haris.iqbal@cloud.ionos.com>
      Reviewed-by: default avatarGuoqing Jiang <guoqing.jiang@cloud.ionos.com>
      Signed-off-by: default avatarJack Wang <jinpu.wang@cloud.ionos.com>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      87019e7d
    • Jack Wang's avatar
      block/rnbd-clt: Fix possible memleak · 46067844
      Jack Wang authored
      In error case, we do not free the memory for blk_symlink_name.
      
      Do it by free the memory in error case, and set to NULL
      afterwards.
      
      Also fix the condition in rnbd_clt_remove_dev_symlink.
      
      Fixes: 64e8a6ec
      
       ("block/rnbd-clt: Dynamically alloc buffer for pathname & blk_symlink_name")
      Signed-off-by: default avatarJack Wang <jinpu.wang@cloud.ionos.com>
      Reviewed-by: default avatarMd Haris Iqbal <haris.iqbal@cloud.ionos.com>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      46067844
    • Md Haris Iqbal's avatar
      block/rnbd-clt: Get rid of warning regarding size argument in strlcpy · e7508d48
      Md Haris Iqbal authored
      The kernel test robot triggerred the following warning,
      
      >> drivers/block/rnbd/rnbd-clt.c:1397:42: warning: size argument in
      'strlcpy' call appears to be size of the source; expected the size of the
      destination [-Wstrlcpy-strlcat-size]
      	strlcpy(dev->pathname, pathname, strlen(pathname) + 1);
      					      ~~~~~~~^~~~~~~~~~~~~
      
      To get rid of the above warning, use a kstrdup as Bart suggested.
      
      Fixes: 64e8a6ec
      
       ("block/rnbd-clt: Dynamically alloc buffer for pathname & blk_symlink_name")
      Reported-by: default avatarkernel test robot <lkp@intel.com>
      Signed-off-by: default avatarMd Haris Iqbal <haris.iqbal@cloud.ionos.com>
      Signed-off-by: default avatarJack Wang <jinpu.wang@cloud.ionos.com>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      e7508d48
    • Daniel Wagner's avatar
      blk-mq: Remove 'running from the wrong CPU' warning · e6582cb5
      Daniel Wagner authored
      
      
      It's guaranteed that no request is in flight when a hctx is going
      offline. This warning is only triggered when the wq's CPU is hot
      plugged and the blk-mq is not synced up yet.
      
      As this state is temporary and the request is still processed
      correctly, better remove the warning as this is the fast path.
      
      Suggested-by: default avatarMing Lei <ming.lei@redhat.com>
      Signed-off-by: default avatarDaniel Wagner <dwagner@suse.de>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Reviewed-by: default avatarMing Lei <ming.lei@redhat.com>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      e6582cb5
    • Linus Torvalds's avatar
      Merge tag 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/rdma/rdma · 009bd55d
      Linus Torvalds authored
      Pull rdma updates from Jason Gunthorpe:
       "A smaller set of patches, nothing stands out as being particularly
        major this cycle. The biggest item would be the new HIP09 HW support
        from HNS, otherwise it was pretty quiet for new work here:
      
         - Driver bug fixes and updates: bnxt_re, cxgb4, rxe, hns, i40iw,
           cxgb4, mlx4 and mlx5
      
         - Bug fixes and polishing for the new rts ULP
      
         - Cleanup of uverbs checking for allowed driver operations
      
         - Use sysfs_emit all over the place
      
         - Lots of bug fixes and clarity improvements for hns
      
         - hip09 support for hns
      
         - NDR and 50/100Gb signaling rates
      
         - Remove dma_virt_ops and go back to using the IB DMA wrappers
      
         - mlx5 optimizations for contiguous DMA regions"
      
      * tag 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/rdma/rdma: (147 commits)
        RDMA/cma: Don't overwrite sgid_attr after device is released
        RDMA/mlx5: Fix MR cache memory leak
        RDMA/rxe: Use acquire/release for memory ordering
        RDMA/hns: Simplify AEQE process for different types of queue
        RDMA/hns: Fix inaccurate prints
        RDMA/hns: Fix incorrect symbol types
        RDMA/hns: Clear redundant variable initialization
        RDMA/hns: Fix coding style issues
        RDMA/hns: Remove unnecessary access right set during INIT2INIT
        RDMA/hns: WARN_ON if get a reserved sl from users
        RDMA/hns: Avoid filling sl in high 3 bits of vlan_id
        RDMA/hns: Do shift on traffic class when using RoCEv2
        RDMA/hns: Normalization the judgment of some features
        RDMA/hns: Limit the length of data copied between kernel and userspace
        RDMA/mlx4: Remove bogus dev_base_lock usage
        RDMA/uverbs: Fix incorrect variable type
        RDMA/core: Do not indicate device ready when device enablement fails
        RDMA/core: Clean up cq pool mechanism
        RDMA/core: Update kernel documentation for ib_create_named_qp()
        MAINTAINERS: SOFT-ROCE: Change Zhu Yanjun's email address
        ...
      009bd55d
    • Linus Torvalds's avatar
      Merge tag 'scsi-misc' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi · 60f7c503
      Linus Torvalds authored
      Pull SCSI updates from James Bottomley:
       "This consists of the usual driver updates (ufs, qla2xxx, smartpqi,
        target, zfcp, fnic, mpt3sas, ibmvfc) plus a load of cleanups, a major
        power management rework and a load of assorted minor updates.
      
        There are a few core updates (formatting fixes being the big one) but
        nothing major this cycle"
      
      * tag 'scsi-misc' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi: (279 commits)
        scsi: mpt3sas: Update driver version to 36.100.00.00
        scsi: mpt3sas: Handle trigger page after firmware update
        scsi: mpt3sas: Add persistent MPI trigger page
        scsi: mpt3sas: Add persistent SCSI sense trigger page
        scsi: mpt3sas: Add persistent Event trigger page
        scsi: mpt3sas: Add persistent Master trigger page
        scsi: mpt3sas: Add persistent trigger pages support
        scsi: mpt3sas: Sync time periodically between driver and firmware
        scsi: qla2xxx: Update version to 10.02.00.104-k
        scsi: qla2xxx: Fix device loss on 4G and older HBAs
        scsi: qla2xxx: If fcport is undergoing deletion complete I/O with retry
        scsi: qla2xxx: Fix the call trace for flush workqueue
        scsi: qla2xxx: Fix flash update in 28XX adapters on big endian machines
        scsi: qla2xxx: Handle aborts correctly for port undergoing deletion
        scsi: qla2xxx: Fix N2N and NVMe connect retry failure
        scsi: qla2xxx: Fix FW initialization error on big endian machines
        scsi: qla2xxx: Fix crash during driver load on big endian machines
        scsi: qla2xxx: Fix compilation issue in PPC systems
        scsi: qla2xxx: Don't check for fw_started while posting NVMe command
        scsi: qla2xxx: Tear down session if FW say it is down
        ...
      60f7c503
    • Linus Torvalds's avatar
      Merge tag 'for-5.11/drivers-2020-12-14' of git://git.kernel.dk/linux-block · 69f637c3
      Linus Torvalds authored
      Pull block driver updates from Jens Axboe:
       "Nothing major in here:
      
         - NVMe pull request from Christoph:
              - nvmet passthrough improvements (Chaitanya Kulkarni)
              - fcloop error injection support (James Smart)
              - read-only support for zoned namespaces without Zone Append
                (Javier González)
              - improve some error message (Minwoo Im)
              - reject I/O to offline fabrics namespaces (Victor Gladkov)
              - PCI queue allocation cleanups (Niklas Schnelle)
              - remove an unused allocation in nvmet (Amit Engel)
              - a Kconfig spelling fix (Colin Ian King)
              - nvme_req_qid simplication (Baolin Wang)
      
         - MD pull request from Song:
              - Fix race condition in md_ioctl() (Dae R. Jeong)
              - Initialize read_slot properly for raid10 (Kevin Vigor)
              - Code cleanup (Pankaj Gupta)
              - md-cluster resync/reshape fix (Zhao Heming)
      
         - Move null_blk into its own directory (Damien Le Moal)
      
         - null_blk zone and discard improvements (Damien Le Moal)
      
         - bcache race fix (Dongsheng Yang)
      
         - Set of rnbd fixes/improvements (Gioh Kim, Guoqing Jiang, Jack Wang,
           Lutz Pogrell, Md Haris Iqbal)
      
         - lightnvm NULL pointer deref fix (tangzhenhao)
      
         - sr in_interrupt() removal (Sebastian Andrzej Siewior)
      
         - FC endpoint security support for s390/dasd (Jan Höppner, Sebastian
           Ott, Vineeth Vijayan). From the s390 arch guys, arch bits included
           as it made it easier for them to funnel the feature through the
           block driver tree.
      
         - Follow up fixes (Colin Ian King)"
      
      * tag 'for-5.11/drivers-2020-12-14' of git://git.kernel.dk/linux-block: (64 commits)
        block: drop dead assignments in loop_init()
        sr: Remove in_interrupt() usage in sr_init_command().
        sr: Switch the sector size back to 2048 if sr_read_sector() changed it.
        cdrom: Reset sector_size back it is not 2048.
        drivers/lightnvm: fix a null-ptr-deref bug in pblk-core.c
        null_blk: Move driver into its own directory
        null_blk: Allow controlling max_hw_sectors limit
        null_blk: discard zones on reset
        null_blk: cleanup discard handling
        null_blk: Improve implicit zone close
        null_blk: improve zone locking
        block: Align max_hw_sectors to logical blocksize
        null_blk: Fail zone append to conventional zones
        null_blk: Fix zone size initialization
        bcache: fix race between setting bdev state to none and new write request direct to backing
        block/rnbd: fix a null pointer dereference on dev->blk_symlink_name
        block/rnbd-clt: Dynamically alloc buffer for pathname & blk_symlink_name
        block/rnbd: call kobject_put in the failure path
        Documentation/ABI/rnbd-srv: add document for force_close
        block/rnbd-srv: close a mapped device from server side.
        ...
      69f637c3
    • Linus Torvalds's avatar
      Merge tag 'for-5.11/block-2020-12-14' of git://git.kernel.dk/linux-block · ac7ac461
      Linus Torvalds authored
      Pull block updates from Jens Axboe:
       "Another series of killing more code than what is being added, again
        thanks to Christoph's relentless cleanups and tech debt tackling.
      
        This contains:
      
         - blk-iocost improvements (Baolin Wang)
      
         - part0 iostat fix (Jeffle Xu)
      
         - Disable iopoll for split bios (Jeffle Xu)
      
         - block tracepoint cleanups (Christoph Hellwig)
      
         - Merging of struct block_device and hd_struct (Christoph Hellwig)
      
         - Rework/cleanup of how block device sizes are updated (Christoph
           Hellwig)
      
         - Simplification of gendisk lookup and removal of block device
           aliasing (Christoph Hellwig)
      
         - Block device ioctl cleanups (Christoph Hellwig)
      
         - Removal of bdget()/blkdev_get() as exported API (Christoph Hellwig)
      
         - Disk change rework, avoid ->revalidate_disk() (Christoph Hellwig)
      
         - sbitmap improvements (Pavel Begunkov)
      
         - Hybrid polling fix (Pavel Begunkov)
      
         - bvec iteration improvements (Pavel Begunkov)
      
         - Zone revalidation fixes (Damien Le Moal)
      
         - blk-throttle limit fix (Yu Kuai)
      
         - Various little fixes"
      
      * tag 'for-5.11/block-2020-12-14' of git://git.kernel.dk/linux-block: (126 commits)
        blk-mq: fix msec comment from micro to milli seconds
        blk-mq: update arg in comment of blk_mq_map_queue
        blk-mq: add helper allocating tagset->tags
        Revert "block: Fix a lockdep complaint triggered by request queue flushing"
        nvme-loop: use blk_mq_hctx_set_fq_lock_class to set loop's lock class
        blk-mq: add new API of blk_mq_hctx_set_fq_lock_class
        block: disable iopoll for split bio
        block: Improve blk_revalidate_disk_zones() checks
        sbitmap: simplify wrap check
        sbitmap: replace CAS with atomic and
        sbitmap: remove swap_lock
        sbitmap: optimise sbitmap_deferred_clear()
        blk-mq: skip hybrid polling if iopoll doesn't spin
        blk-iocost: Factor out the base vrate change into a separate function
        blk-iocost: Factor out the active iocgs' state check into a separate function
        blk-iocost: Move the usage ratio calculation to the correct place
        blk-iocost: Remove unnecessary advance declaration
        blk-iocost: Fix some typos in comments
        blktrace: fix up a kerneldoc comment
        block: remove the request_queue to argument request based tracepoints
        ...
      ac7ac461
    • Linus Torvalds's avatar
      Merge tag 'for-5.11/io_uring-2020-12-14' of git://git.kernel.dk/linux-block · 48aba79b
      Linus Torvalds authored
      Pull io_uring updates from Jens Axboe:
       "Fairly light set of changes this time around, and mostly some bits
        that were pushed out to 5.11 instead of 5.10, fixes/cleanups, and a
        few features. In particular:
      
         - Cleanups around iovec import (David Laight, Pavel)
      
         - Add timeout support for io_uring_enter(2), which enables us to
           clean up liburing and avoid a timeout sqe submission in the
           completion path.
      
           The big win here is that it allows setups that split SQ and CQ
           handling into separate threads to avoid locking, as the CQ side
           will no longer submit when timeouts are needed when waiting for
           events (Hao Xu)
      
         - Add support for socket shutdown, and renameat/unlinkat.
      
         - SQPOLL cleanups and improvements (Xiaoguang Wang)
      
         - Allow SQPOLL setups for CAP_SYS_NICE, and enable regular
           (non-fixed) files to be used.
      
         - Cancelation improvements (Pavel)
      
         - Fixed file reference improvements (Pavel)
      
         - IOPOLL related race fixes (Pavel)
      
         - Lots of other little fixes and cleanups (mostly Pavel)"
      
      * tag 'for-5.11/io_uring-2020-12-14' of git://git.kernel.dk/linux-block: (43 commits)
        io_uring: fix io_cqring_events()'s noflush
        io_uring: fix racy IOPOLL flush overflow
        io_uring: fix racy IOPOLL completions
        io_uring: always let io_iopoll_complete() complete polled io
        io_uring: add timeout update
        io_uring: restructure io_timeout_cancel()
        io_uring: fix files cancellation
        io_uring: use bottom half safe lock for fixed file data
        io_uring: fix miscounting ios_left
        io_uring: change submit file state invariant
        io_uring: check kthread stopped flag when sq thread is unparked
        io_uring: share fixed_file_refs b/w multiple rsrcs
        io_uring: replace inflight_wait with tctx->wait
        io_uring: don't take fs for recvmsg/sendmsg
        io_uring: only wake up sq thread while current task is in io worker context
        io_uring: don't acquire uring_lock twice
        io_uring: initialize 'timeout' properly in io_sq_thread()
        io_uring: refactor io_sq_thread() handling
        io_uring: always batch cancel in *cancel_files()
        io_uring: pass files into kill timeouts/poll
        ...
      48aba79b
    • Linus Torvalds's avatar
      Merge tag 'tif-task_work.arch-2020-12-14' of git://git.kernel.dk/linux-block · 005b2a9d
      Linus Torvalds authored
      Pull TIF_NOTIFY_SIGNAL updates from Jens Axboe:
       "This sits on top of of the core entry/exit and x86 entry branch from
        the tip tree, which contains the generic and x86 parts of this work.
      
        Here we convert the rest of the archs to support TIF_NOTIFY_SIGNAL.
      
        With that done, we can get rid of JOBCTL_TASK_WORK from task_work and
        signal.c, and also remove a deadlock work-around in io_uring around
        knowing that signal based task_work waking is invoked with the sighand
        wait queue head lock.
      
        The motivation for this work is to decouple signal notify based
        task_work, of which io_uring is a heavy user of, from sighand. The
        sighand lock becomes a huge contention point, particularly for
        threaded workloads where it's shared between threads. Even outside of
        threaded applications it's slower than it needs to be.
      
        Roman Gershman <romger@amazon.com> reported that his networked
        workload dropped from 1.6M QPS at 80% CPU to 1.0M QPS at 100% CPU
        after io_uring was changed to use TIF_NOTIFY_SIGNAL. The time was all
        spent hammering on the sighand lock, showing 57% of the CPU time there
        [1].
      
        There are further cleanups possible on top of this. One example is
        TIF_PATCH_PENDING, where a patch already exists to use
        TIF_NOTIFY_SIGNAL instead. Hopefully this will also lead to more
        consolidation, but the work stands on its own as well"
      
      [1] https://github.com/axboe/liburing/issues/215
      
      * tag 'tif-task_work.arch-2020-12-14' of git://git.kernel.dk/linux-block: (28 commits)
        io_uring: remove 'twa_signal_ok' deadlock work-around
        kernel: remove checking for TIF_NOTIFY_SIGNAL
        signal: kill JOBCTL_TASK_WORK
        io_uring: JOBCTL_TASK_WORK is no longer used by task_work
        task_work: remove legacy TWA_SIGNAL path
        sparc: add support for TIF_NOTIFY_SIGNAL
        riscv: add support for TIF_NOTIFY_SIGNAL
        nds32: add support for TIF_NOTIFY_SIGNAL
        ia64: add support for TIF_NOTIFY_SIGNAL
        h8300: add support for TIF_NOTIFY_SIGNAL
        c6x: add support for TIF_NOTIFY_SIGNAL
        alpha: add support for TIF_NOTIFY_SIGNAL
        xtensa: add support for TIF_NOTIFY_SIGNAL
        arm: add support for TIF_NOTIFY_SIGNAL
        microblaze: add support for TIF_NOTIFY_SIGNAL
        hexagon: add support for TIF_NOTIFY_SIGNAL
        csky: add support for TIF_NOTIFY_SIGNAL
        openrisc: add support for TIF_NOTIFY_SIGNAL
        sh: add support for TIF_NOTIFY_SIGNAL
        um: add support for TIF_NOTIFY_SIGNAL
        ...
      005b2a9d
    • Linus Torvalds's avatar
      Merge branch 'parisc-5.11-1' of git://git.kernel.org/pub/scm/linux/kernel/git/deller/parisc-linux · 5ee863be
      Linus Torvalds authored
      Pull parisc updates from Helge Deller:
       "A change to increase the default maximum stack size on parisc to 100MB
        and the ability to further increase the stack hard limit size at
        runtime with ulimit for newly started processes.
      
        The other patches fix compile warnings, utilize the Kbuild logic and
        cleanups the parisc arch code"
      
      * 'parisc-5.11-1' of git://git.kernel.org/pub/scm/linux/kernel/git/deller/parisc-linux:
        parisc: pci-dma: fix warning unused-function
        parisc/uapi: Use Kbuild logic to provide <asm/types.h>
        parisc: Make user stack size configurable
        parisc: Use _TIF_USER_WORK_MASK in entry.S
        parisc: Drop loops_per_jiffy from per_cpu struct
      5ee863be
    • Linus Torvalds's avatar
      Merge tag 'mips_5.11' of git://git.kernel.org/pub/scm/linux/kernel/git/mips/linux · 8312f41f
      Linus Torvalds authored
      Pull MIPS updates from Thomas Bogendoerfer:
      
       - enable GCOV
      
       - rework setup of protection map
      
       - add support for more MSCC platforms
      
       - add sysfs boardinfo for Loongson64
      
       - enable KASLR for Loogson64
      
       - add reset controller for BCM63xx
      
       - cleanups and fixes
      
      * tag 'mips_5.11' of git://git.kernel.org/pub/scm/linux/kernel/git/mips/linux: (70 commits)
        mips: fix Section mismatch in reference
        MAINTAINERS: Add linux-mips mailing list to JZ47xx entries
        MAINTAINERS: Remove JZ4780 DMA driver entry
        MAINTAINERS: chenhc@lemote.com -> chenhuacai@kernel.org
        MIPS: Octeon: irq: Alloc desc before configuring IRQ
        MIPS: mm: Add back define for PAGE_SHARED
        MIPS: Select ARCH_KEEP_MEMBLOCK if DEBUG_KERNEL to enable sysfs memblock debug
        mips: lib: uncached: fix non-standard usage of variable 'sp'
        MIPS: DTS: img: Fix schema warnings for pwm-leds
        MIPS: KASLR: Avoid endless loop in sync_icache if synci_step is zero
        MIPS: Move memblock_dump_all() to the end of setup_arch()
        MIPS: SMP-CPS: Add support for irq migration when CPU offline
        MIPS: OCTEON: Don't add kernel sections into memblock allocator
        MIPS: Don't round up kernel sections size for memblock_add()
        MIPS: Enable GCOV
        MIPS: configs: drop unused BACKLIGHT_GENERIC option
        MIPS: Loongson64: Fix up reserving kernel memory range
        MIPS: mm: Remove unused is_aligned_hugepage_range
        MIPS: No need to check CPU 0 in {loongson3,bmips,octeon}_cpu_disable()
        mips: cdmm: fix use-after-free in mips_cdmm_bus_discover
        ...
      8312f41f
    • Linus Torvalds's avatar
      Merge tag 'microblaze-v5.11' of git://git.monstr.eu/linux-2.6-microblaze · 66fc6a62
      Linus Torvalds authored
      Pull microblaze updates from Michal Simek:
       "The biggest change is to remove support for noMMU configuration.
      
        FPGAs are bigger so people use Microblaze with MMU for a lot of years
        and there is likely no user of this code anymore. No one is updating
        libraries for this configuration either.
      
         - Remove noMMU support
      
         - Add support for TIF_NOTIFY_SIGNAL
      
         - Small header fix"
      
      * tag 'microblaze-v5.11' of git://git.monstr.eu/linux-2.6-microblaze:
        microblaze: Remove noMMU code
        microblaze: add support for TIF_NOTIFY_SIGNAL
        microblaze: Replace <linux/clk-provider.h> by <linux/of_clk.h>
      66fc6a62
    • Linus Torvalds's avatar
      Merge tag 'for-linus-5.11-rc1-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/xen/tip · 7acfd427
      Linus Torvalds authored
      Pull xen updates from Juergen Gross:
       "Fixes for security issues just having been disclosed:
      
         - a five patch series for fixing of XSA-349 (DoS via resource
           depletion in Xen dom0)
      
         - a patch fixing XSA-350 (access of stale pointer in a Xen dom0)"
      
      * tag 'for-linus-5.11-rc1-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/xen/tip:
        xen-blkback: set ring->xenblkd to NULL after kthread_stop()
        xenbus/xenbus_backend: Disallow pending watch messages
        xen/xenbus: Count pending messages for each watch
        xen/xenbus/xen_bus_type: Support will_handle watch callback
        xen/xenbus: Add 'will_handle' callback support in xenbus_watch_path()
        xen/xenbus: Allow watches discard events before queueing
      7acfd427
    • Linus Torvalds's avatar
      Merge tag 'hyperv-next-signed-20201214' of... · 571b12dd
      Linus Torvalds authored
      Merge tag 'hyperv-next-signed-20201214' of git://git.kernel.org/pub/scm/linux/kernel/git/hyperv/linux
      
      Pull Hyper-V updates from Wei Liu:
      
       - harden VMBus (Andres Beltran)
      
       - clean up VMBus driver (Matheus Castello)
      
       - fix hv_balloon reporting (Vitaly Kuznetsov)
      
       - fix a potential OOB issue (Andrea Parri)
      
       - remove an obsolete TODO item (Stefan Eschenbacher)
      
      * tag 'hyperv-next-signed-20201214' of git://git.kernel.org/pub/scm/linux/kernel/git/hyperv/linux:
        hv_balloon: do adjust_managed_page_count() when ballooning/un-ballooning
        hv_balloon: simplify math in alloc_balloon_pages()
        drivers/hv: remove obsolete TODO and fix misleading typo in comment
        drivers: hv: vmbus: Fix checkpatch SPLIT_STRING
        hv_netvsc: Validate number of allocated sub-channels
        drivers: hv: vmbus: Fix call msleep using < 20ms
        drivers: hv: vmbus: Fix checkpatch LINE_SPACING
        drivers: hv: vmbus: Replace symbolic permissions by octal permissions
        drivers: hv: Fix hyperv_record_panic_msg path on comment
        hv_netvsc: Use vmbus_requestor to generate transaction IDs for VMBus hardening
        scsi: storvsc: Use vmbus_requestor to generate transaction IDs for VMBus hardening
        Drivers: hv: vmbus: Add vmbus_requestor data structure for VMBus hardening
      571b12dd
    • Linus Torvalds's avatar
      Merge tag 'seccomp-v5.11-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/kees/linux · e994cc24
      Linus Torvalds authored
      Pull seccomp updates from Kees Cook:
       "The major change here is finally gaining seccomp constant-action
        bitmaps, which internally reduces the seccomp overhead for many
        real-world syscall filters to O(1), as discussed at Plumbers this
        year.
      
         - Improve seccomp performance via constant-action bitmaps (YiFei Zhu
           & Kees Cook)
      
         - Fix bogus __user annotations (Jann Horn)
      
         - Add missed CONFIG for improved selftest coverage (Mickaël Salaün)"
      
      * tag 'seccomp-v5.11-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/kees/linux:
        selftests/seccomp: Update kernel config
        seccomp: Remove bogus __user annotations
        seccomp/cache: Report cache data through /proc/pid/seccomp_cache
        xtensa: Enable seccomp architecture tracking
        sh: Enable seccomp architecture tracking
        s390: Enable seccomp architecture tracking
        riscv: Enable seccomp architecture tracking
        powerpc: Enable seccomp architecture tracking
        parisc: Enable seccomp architecture tracking
        csky: Enable seccomp architecture tracking
        arm: Enable seccomp architecture tracking
        arm64: Enable seccomp architecture tracking
        selftests/seccomp: Compare bitmap vs filter overhead
        x86: Enable seccomp architecture tracking
        seccomp/cache: Add "emulator" to check if filter is constant allow
        seccomp/cache: Lookup syscall allowlist bitmap for fast path
      e994cc24
    • Linus Torvalds's avatar
      Merge tag 'pstore-v5.11-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/kees/linux · ba1d41a5
      Linus Torvalds authored
      Pull pstore updates from Kees Cook:
      
       - Clean up unused but exposed API (Christoph Hellwig)
      
       - Provide KCONFIG for default size of kmsg buffer (Vasile-Laurentiu
         Stanimir)
      
      * tag 'pstore-v5.11-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/kees/linux:
        pstore: Move kmsg_bytes default into Kconfig
        pstore/blk: remove {un,}register_pstore_blk
        pstore/blk: update the command line example
        pstore/zone: cap the maximum device size
      ba1d41a5
    • Linus Torvalds's avatar
      Merge tag 'gcc-plugins-v5.11-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/kees/linux · 6df0f285
      Linus Torvalds authored
      Pull gcc-plugins updates from Kees Cook:
      
       - Clean up gcc plugin builds now that GCC must be 4.9+ (Masahiro
         Yamada)
      
       - Update MAINTAINERS (Kees Cook)
      
      * tag 'gcc-plugins-v5.11-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/kees/linux:
        MAINTAINERS: Drop inactive gcc-plugins maintainer
        gcc-plugins: simplify GCC plugin-dev capability test
        gcc-plugins: remove code for GCC versions older than 4.9
      6df0f285
    • Linus Torvalds's avatar
      Merge tag 'Smack-for-5.11' of git://github.com/cschaufler/smack-next · 8bda68d6
      Linus Torvalds authored
      Pull smack updates from Casey Schaufler:
       "There are no functional changes. Just one minor code clean-up and a
        set of corrections in function header comments"
      
      * tag 'Smack-for-5.11' of git://github.com/cschaufler/smack-next:
        security/smack: remove unused varible 'rc'
        Smack: fix kernel-doc interface on functions
      8bda68d6
    • Linus Torvalds's avatar
      Merge tag 'integrity-v5.11' of git://git.kernel.org/pub/scm/linux/kernel/git/zohar/linux-integrity · e20a9b92
      Linus Torvalds authored
      Pull integrity subsystem updates from Mimi Zohar:
       "Just three patches here. Other integrity changes are being upstreamed
        via EFI (defines a common EFI secure and trusted boot IMA policy) and
        BPF LSM (exporting the IMA file cache hash info based on inode).
      
        The three patches included here:
      
         - bug fix: fail calculating the file hash, when a file not opened for
           read and the attempt to re-open it for read fails.
      
         - defer processing the "ima_appraise" boot command line option to
           avoid enabling different modes (e.g. fix, log) to when the secure
           boot flag is available on arm.
      
         - defines "ima-buf" as the default IMA buffer measurement template in
           preparation for the builtin integrity "critical data" policy"
      
      * tag 'integrity-v5.11' of git://git.kernel.org/pub/scm/linux/kernel/git/zohar/linux-integrity:
        ima: Don't modify file descriptor mode on the fly
        ima: select ima-buf template for buffer measurement
        ima: defer arch_ima_get_secureboot() call to IMA init time
      e20a9b92
    • Linus Torvalds's avatar
      Merge tag 'selinux-pr-20201214' of git://git.kernel.org/pub/scm/linux/kernel/git/pcmoore/selinux · ca5b877b
      Linus Torvalds authored
      Pull selinux updates from Paul Moore:
       "While we have a small number of SELinux patches for v5.11, there are a
        few changes worth highlighting:
      
         - Change the LSM network hooks to pass flowi_common structs instead
           of the parent flowi struct as the LSMs do not currently need the
           full flowi struct and they do not have enough information to use it
           safely (missing information on the address family).
      
           This patch was discussed both with Herbert Xu (representing team
           netdev) and James Morris (representing team
           LSMs-other-than-SELinux).
      
         - Fix how we handle errors in inode_doinit_with_dentry() so that we
           attempt to properly label the inode on following lookups instead of
           continuing to treat it as unlabeled.
      
         - Tweak the kernel logic around allowx, auditallowx, and dontauditx
           SELinux policy statements such that the auditx/dontauditx are
           effective even without the allowx statement.
      
        Everything passes our test suite"
      
      * tag 'selinux-pr-20201214' of git://git.kernel.org/pub/scm/linux/kernel/git/pcmoore/selinux:
        lsm,selinux: pass flowi_common instead of flowi to the LSM hooks
        selinux: Fix fall-through warnings for Clang
        selinux: drop super_block backpointer from superblock_security_struct
        selinux: fix inode_doinit_with_dentry() LABEL_INVALID error handling
        selinux: allow dontauditx and auditallowx rules to take effect without allowx
        selinux: fix error initialization in inode_doinit_with_dentry()
      ca5b877b
    • Linus Torvalds's avatar
      Merge tag 'audit-pr-20201214' of git://git.kernel.org/pub/scm/linux/kernel/git/pcmoore/audit · 3d5de2dd
      Linus Torvalds authored
      Pull audit updates from Paul Moore:
       "A small set of audit patches for v5.11 with four patches in total and
        only one of any real significance.
      
        Richard's patch to trigger accompanying records causes the kernel to
        emit additional related records when an audit event occurs; helping
        provide some much needed context to events in the audit log. It is
        also worth mentioning that this is a revised patch based on an earlier
        attempt that had to be reverted in the v5.8 time frame.
      
        Everything passes our test suite, and with no problems reported please
        merge this for v5.11"
      
      * tag 'audit-pr-20201214' of git://git.kernel.org/pub/scm/linux/kernel/git/pcmoore/audit:
        audit: replace atomic_add_return()
        audit: fix macros warnings
        audit: trigger accompanying records when no rules present
        audit: fix a kernel-doc markup
      3d5de2dd
    • Linus Torvalds's avatar
      Merge tag 'printk-for-5.11' of git://git.kernel.org/pub/scm/linux/kernel/git/printk/linux · d3eb5211
      Linus Torvalds authored
      Pull printk updates from Petr Mladek:
      
       - Finally allow parallel writes and reads into/from the lockless
         ringbuffer. But it is not a complete solution. Readers are still
         serialized against each other. And nested writes are still prevented
         by printk_safe per-CPU buffers.
      
       - Use ttynull as the ultimate fallback for /dev/console.
      
       - Officially allow disabling console output by using console="" or
         console=null
      
       - A few code cleanups
      
      * tag 'printk-for-5.11' of git://git.kernel.org/pub/scm/linux/kernel/git/printk/linux:
        printk: remove logbuf_lock writer-protection of ringbuffer
        printk: inline log_output(),log_store() in vprintk_store()
        printk: remove obsolete dead assignment
        printk/console: Allow to disable console output by using console="" or console=null
        init/console: Use ttynull as a fallback when there is no console
        printk: ringbuffer: Reference text_data_ring directly in callees.
      d3eb5211
  5. Dec 16, 2020