  1. May 08, 2019
    • gfs2: Fix iomap write page reclaim deadlock · d0a22a4b
      Andreas Gruenbacher authored
      Since commit 64bc06bb ("gfs2: iomap buffered write support"), gfs2 is doing
      buffered writes by starting a transaction in iomap_begin, writing a range of
      pages, and ending that transaction in iomap_end.  This approach suffers from
      two problems:
      
        (1) Any allocations necessary for the write are done in iomap_begin, so when
        the data aren't journaled, there is no need for keeping the transaction open
        until iomap_end.
      
        (2) Transactions keep the gfs2 log flush lock held.  When
        iomap_file_buffered_write calls balance_dirty_pages, this can end up calling
        gfs2_write_inode, which will try to flush the log.  This requires taking the
        log flush lock which is already held, resulting in a deadlock.
      
      Fix both of these issues by not keeping transactions open from iomap_begin to
      iomap_end.  Instead, start a small transaction in page_prepare and end it in
      page_done when necessary.
      
      Reported-by: Edwin Török <edvin.torok@citrix.com>
      Fixes: 64bc06bb ("gfs2: iomap buffered write support")
      Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com>
      Signed-off-by: Bob Peterson <rpeterso@redhat.com>
    • gfs2: fix race between gfs2_freeze_func and unmount · 8f918219
      Abhi Das authored
      As part of the freeze operation, gfs2_freeze_func() is left blocking
      on a request to hold the sd_freeze_gl in SH. This glock is held in EX
      by the gfs2_freeze() code.
      
      A subsequent call to gfs2_unfreeze() releases the EXclusively held
      sd_freeze_gl, which allows gfs2_freeze_func() to acquire it in SH and
      resume its operation.
      
      gfs2_unfreeze(), however, doesn't wait for gfs2_freeze_func() to complete.
      If a umount is issued right after unfreeze, it could result in an
      inconsistent filesystem because some journal data (statfs update) isn't
      written out.
      
      Refer to commit 24972557 for a more detailed explanation of how
      freeze/unfreeze work.
      
      This patch causes gfs2_unfreeze() to wait for gfs2_freeze_func() to
      complete before returning to the user.
      
      Signed-off-by: Abhi Das <adas@redhat.com>
      Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com>
    • gfs2: Rename gfs2_trans_{add_unrevoke => remove_revoke} · fbb27873
      Andreas Gruenbacher authored
      
      
      Rename gfs2_trans_add_unrevoke to gfs2_trans_remove_revoke: there is no
      such thing as an "unrevoke" object; all this function does is remove
      existing revoke objects plus some bookkeeping.
      
      Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com>
    • gfs2: Rename sd_log_le_{revoke,ordered} · a5b1d3fc
      Andreas Gruenbacher authored
      
      
      Rename sd_log_le_revoke to sd_log_revokes and sd_log_le_ordered to
      sd_log_ordered: not sure what le stands for here, but it doesn't add
      clarity, and if it stands for list entry, it's actually confusing as
      those are both list heads but not list entries.
      
      Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com>
    • gfs2: Remove unnecessary extern declarations · 32ac43f6
      Andreas Gruenbacher authored
      
      
      Make log operations static; they are only used locally.
      
      Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com>
    • gfs2: Replace gl_revokes with a GLF flag · 73118ca8
      Bob Peterson authored
      
      
      The gl_revokes value determines how many outstanding revokes a glock has
      on the superblock revokes list; this is used to avoid unnecessary log
      flushes.  However, gl_revokes is only ever tested for being zero, and it's
      only decremented in revoke_lo_after_commit, which removes all revokes
      from the list, so we know that the gl_revokes values of all the glocks on
      the list will reach zero.  Therefore, we can replace gl_revokes with a
      bit flag. This saves an atomic counter in struct gfs2_glock.
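The argument above (a counter that is only ever tested for zero can become a flag) can be sketched in userspace C. This is an illustration only, not the gfs2 code: `struct flag_glock`, `SKETCH_HAS_REVOKES`, and the helper names are hypothetical, and plain bit operations stand in for the atomic bit operations kernel code would use.

```c
#include <stdbool.h>

/* Hypothetical stand-in for a glock; not the kernel's struct gfs2_glock. */
struct flag_glock {
    unsigned long flags;
};

/* Assumed flag bit, chosen for illustration only. */
#define SKETCH_HAS_REVOKES (1UL << 0)

/* Queueing a revoke just sets the flag; repeating it changes nothing. */
static void flag_add_revoke(struct flag_glock *gl)
{
    gl->flags |= SKETCH_HAS_REVOKES;
}

/* After commit, every revoke is gone from the list, so clear the flag. */
static void flag_revokes_committed(struct flag_glock *gl)
{
    gl->flags &= ~SKETCH_HAS_REVOKES;
}

/* The only question callers ever ask: are any revokes outstanding? */
static bool flag_has_revokes(const struct flag_glock *gl)
{
    return (gl->flags & SKETCH_HAS_REVOKES) != 0;
}
```

Because the commit step empties the whole list at once, there is never a moment where "some but not all" revokes of a glock remain, which is why a single bit carries the same information as the counter did.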
      
      Signed-off-by: Bob Peterson <rpeterso@redhat.com>
      Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com>
    • gfs2: Fix occasional glock use-after-free · 9287c645
      Andreas Gruenbacher authored
      
      
      This patch has to do with the life cycle of glocks and buffers.  When
      gfs2 metadata or journaled data is queued to be written, a gfs2_bufdata
      object is assigned to track the buffer, and that is queued to various
      lists, including the glock's gl_ail_list to indicate it's on the active
      items list.  Once the page associated with the buffer has been written,
      it is removed from the ail list, but its life isn't over until a revoke
      has been successfully written.
      
      So after the block is written, its bufdata object is moved from the
      glock's gl_ail_list to a file-system-wide list of pending revokes,
      sd_log_le_revoke.  At that point the glock still needs to track how many
      revokes it contributed to that list (in gl_revokes) so that things like
      glock go_sync can ensure all the metadata has been not only written, but
      also revoked before the glock is granted to a different node.  This is
      to guarantee journal replay doesn't replay the block once the glock has
      been granted to another node.
      
      Ross Lagerwall recently discovered a race in which an inode could be
      evicted, and its glock freed after its ail list had been synced, but
      while it still had unwritten revokes on the sd_log_le_revoke list.  The
      evict decremented the glock reference count to zero, which allowed the
      glock to be freed.  After the revoke was written, function
      revoke_lo_after_commit tried to adjust the glock's gl_revokes counter
      and clear its GLF_LFLUSH flag, at which time it referenced the freed
      glock.
      
      This patch fixes the problem by incrementing the glock reference count
      in gfs2_add_revoke when the glock's first bufdata object is moved from
      the glock to the global revokes list. Later, when the glock's last such
      bufdata object is freed, the reference count is decremented. This
      guarantees that whichever process finishes last (the revoke writing or
      the evict) will properly free the glock, and neither will reference the
      glock after it has been freed.
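The reference-counting rule described above can be sketched as follows. This is a simplified userspace model under stated assumptions, not the patch itself: `struct ref_glock` and the `ref_*` helpers are hypothetical names, plain integers replace the kernel's atomic counters, and "freeing" is modelled by setting a boolean.

```c
#include <stdbool.h>

/* Hypothetical model of a glock's lifetime; not struct gfs2_glock. */
struct ref_glock {
    int refcount;         /* references held on the glock */
    int pending_revokes;  /* bufdata objects on the global revokes list */
    bool freed;           /* set when the last reference is dropped */
};

/* Drop one reference; the glock is "freed" when the count hits zero. */
static void ref_put(struct ref_glock *gl)
{
    if (--gl->refcount == 0)
        gl->freed = true;
}

/* Moving the first bufdata object to the revokes list pins the glock. */
static void ref_add_revoke(struct ref_glock *gl)
{
    if (gl->pending_revokes++ == 0)
        gl->refcount++;
}

/* When the last pending revoke is done, the pin is released. */
static void ref_revoke_written(struct ref_glock *gl)
{
    if (--gl->pending_revokes == 0)
        ref_put(gl);
}
```

In this model an evict that calls `ref_put()` while revokes are still pending leaves the glock alive; the final `ref_revoke_written()` is then the call that frees it, so neither path touches the object after it is gone.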
      
      Reported-by: Ross Lagerwall <ross.lagerwall@citrix.com>
      Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com>
      Signed-off-by: Bob Peterson <rpeterso@redhat.com>
    • gfs2: clean_journal improperly set sd_log_flush_head · 7c70b896
      Bob Peterson authored
      This patch fixes regressions in 588bff95.
      Due to that patch, function clean_journal was setting the value of
      sd_log_flush_head, but that's only valid if it is replaying the node's
      own journal. If it's replaying another node's journal, that's completely
      wrong and will lead to multiple problems. This patch tries to clean up
      the mess by passing the value of the logical journal block number into
      gfs2_write_log_header so the function can treat non-owned journals
      generically. For the local journal, the journal extent map is used for
      best performance. For other nodes from other journals, new function
      gfs2_lblk_to_dblk is called to figure it out using gfs2_iomap_get.
      
      This patch also tries to establish more consistency when passing journal
      block parameters by changing several unsigned int types to a consistent
      u32.
      
      Fixes: 588bff95 ("GFS2: Reduce code redundancy writing log headers")
      Signed-off-by: Bob Peterson <rpeterso@redhat.com>
      Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com>
    • gfs2: Fix lru_count going negative · 7881ef3f
      Ross Lagerwall authored
      
      
      Under certain conditions, lru_count may drop below zero resulting in
      a large amount of log spam like this:
      
      vmscan: shrink_slab: gfs2_dump_glock+0x3b0/0x630 [gfs2] \
          negative objects to delete nr=-1
      
      This happens as follows:
      1) A glock is moved from lru_list to the dispose list and lru_count is
         decremented.
      2) The dispose function calls cond_resched() and drops the lru lock.
      3) Another thread takes the lru lock and tries to add the same glock to
         lru_list, checking if the glock is on an lru list.
      4) It is on a list (actually the dispose list) and so it avoids
         incrementing lru_count.
      5) The glock is moved to lru_list.
      6) The original thread doesn't dispose it because it has been re-added
         to the lru list but the lru_count has still decreased by one.
      
      Fix by checking if the LRU flag is set on the glock rather than checking
      if the glock is on some list and rearrange the code so that the LRU flag
      is added/removed precisely when the glock is added/removed from lru_list.
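The accounting error in the steps above can be reproduced in a single-threaded userspace model. Everything here is a simplified assumption made for illustration: `struct lru_glock`, the `enum lru_where` states, and the `lru_*` helpers are hypothetical, locking is omitted, and the two "add" functions contrast the old list-membership check with the flag-based check the fix describes.

```c
#include <stdbool.h>

/* Which list the (hypothetical) glock is currently linked on. */
enum lru_where { LRU_NOWHERE, LRU_ON_LRU, LRU_ON_DISPOSE };

struct lru_glock {
    enum lru_where list;
    bool lru_flag;  /* in the fixed scheme: set exactly while on the LRU list */
};

static int lru_count;

/* Step 1: the shrinker moves a glock to its private dispose list. */
static void lru_move_to_dispose(struct lru_glock *gl)
{
    gl->list = LRU_ON_DISPOSE;
    gl->lru_flag = false;
    lru_count--;
}

/* Old check: "is it on any list?" is also true for the dispose list,
 * so the counter is not incremented even though the glock rejoins the LRU. */
static void lru_add_old(struct lru_glock *gl)
{
    if (gl->list == LRU_NOWHERE)
        lru_count++;
    gl->list = LRU_ON_LRU;
    gl->lru_flag = true;
}

/* Fixed check: count the glock whenever the LRU flag is not yet set. */
static void lru_add_fixed(struct lru_glock *gl)
{
    if (!gl->lru_flag) {
        lru_count++;
        gl->lru_flag = true;
    }
    gl->list = LRU_ON_LRU;
}
```

Running the sequence "move to dispose, then re-add" with `lru_add_old()` leaves one glock on the LRU list with the counter one too low; the same sequence with `lru_add_fixed()` keeps the counter equal to the list length.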
      
      Signed-off-by: Ross Lagerwall <ross.lagerwall@citrix.com>
      Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com>
    • gfs2: Fix loop in gfs2_rbm_find (v2) · 71921ef8
      Andreas Gruenbacher authored
      Fix the resource group wrap-around logic in gfs2_rbm_find that commit
      e579ed4f broke.  The bug can lead to unnecessary repeated scanning of the
      same bitmaps; there is a risk that future changes will turn this into an
      endless loop.
      
      This is an updated version of commit 2d29f6b9 ("gfs2: Fix loop in
      gfs2_rbm_find") which ended up being reverted because it introduced a
      performance regression in iozone (see commit e74c98ca).  Changes since v1:
      
       - Simplify the wrap-around logic.
      
       - Handle the case where each resource group only has a single bitmap block
         (small filesystem).
      
       - Update rd_extfail_pt whenever we scan the entire bitmap, even when we don't
         start the scan at the very beginning of the bitmap.
      
      Fixes: e579ed4f ("GFS2: Introduce rbm field bii")
      Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com>
    • Merge tag 'Wimplicit-fallthrough-5.2-rc1' of... · b4b52b88
      Linus Torvalds authored
      Merge tag 'Wimplicit-fallthrough-5.2-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/gustavoars/linux
      
      Pull Wimplicit-fallthrough updates from Gustavo A. R. Silva:
       "Mark switch cases where we are expecting to fall through.
      
        This is part of the ongoing efforts to enable -Wimplicit-fallthrough.
      
        Most of them have been baking in linux-next for a whole development
        cycle. And with Stephen Rothwell's help, we've had linux-next
        nag-emails going out for newly introduced code that triggers
        -Wimplicit-fallthrough to avoid gaining more of these cases while we
        work to remove the ones that are already present.
      
        We are getting close to completing this work. Currently, there are
        only 32 of 2311 of these cases left to be addressed in linux-next. I'm
        auditing every case; I take a look into the code and analyze it in
        order to determine if I'm dealing with an actual bug or a false
        positive, as explained here:
      
            https://lore.kernel.org/lkml/c2fad584-1705-a5f2-d63c-824e9b96cf50@embeddedor.com/
      
        While working on this, I've found and fixed several missing
        break/return bugs, some of them introduced more than 5 years ago.
      
        Once this work is finished, we'll be able to universally enable
        "-Wimplicit-fallthrough" to avoid any of these kinds of bugs from
        entering the kernel again"
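As an illustration of what "marking" an expected fall-through looks like, here is a minimal sketch. The function `sketch_levels` and its semantics are invented for this example; the `/* fall through */` comment form is one that GCC's -Wimplicit-fallthrough accepts at its default level, and other compilers may require a different annotation.

```c
/* Hypothetical example: each level also includes everything below it,
 * so the cases deliberately fall through into one another. */
static int sketch_levels(int level)
{
    int n = 0;

    switch (level) {
    case 2:
        n++;
        /* fall through */
    case 1:
        n++;
        /* fall through */
    case 0:
        n++;
        break;
    default:
        /* Unknown level: report an error instead of falling anywhere. */
        n = -1;
        break;
    }
    return n;
}
```

Without the comments, a reader (or the compiler) cannot tell whether the missing `break` after `case 2` is intended or is one of the missing break/return bugs the pull request mentions.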
      
      * tag 'Wimplicit-fallthrough-5.2-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/gustavoars/linux: (27 commits)
        memstick: mark expected switch fall-throughs
        drm/nouveau/nvkm: mark expected switch fall-throughs
        NFC: st21nfca: Fix fall-through warnings
        NFC: pn533: mark expected switch fall-throughs
        block: Mark expected switch fall-throughs
        ASN.1: mark expected switch fall-through
        lib/cmdline.c: mark expected switch fall-throughs
        lib: zstd: Mark expected switch fall-throughs
        scsi: sym53c8xx_2: sym_nvram: Mark expected switch fall-through
        scsi: sym53c8xx_2: sym_hipd: mark expected switch fall-throughs
        scsi: ppa: mark expected switch fall-through
        scsi: osst: mark expected switch fall-throughs
        scsi: lpfc: lpfc_scsi: Mark expected switch fall-throughs
        scsi: lpfc: lpfc_nvme: Mark expected switch fall-through
        scsi: lpfc: lpfc_nportdisc: Mark expected switch fall-through
        scsi: lpfc: lpfc_hbadisc: Mark expected switch fall-throughs
        scsi: lpfc: lpfc_els: Mark expected switch fall-throughs
        scsi: lpfc: lpfc_ct: Mark expected switch fall-throughs
        scsi: imm: mark expected switch fall-throughs
        scsi: csiostor: csio_wr: mark expected switch fall-through
        ...
    • Merge tag 'meminit-v5.2-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/kees/linux · 2d60d96b
      Linus Torvalds authored
      Pull compiler-based variable initialization updates from Kees Cook:
       "This is effectively part of my gcc-plugins tree, but as this adds some
        Clang support, it felt weird to still call it "gcc-plugins". :)
      
        This consolidates Kconfig for the existing stack variable
        initialization (via structleak and stackleak gcc plugins) and adds
        Alexander Potapenko's support for Clang's new similar functionality.
      
        Summary:
      
         - Consolidate memory initialization Kconfigs (Kees)
      
         - Implement support for Clang's stack variable auto-init (Alexander)"
      
      * tag 'meminit-v5.2-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/kees/linux:
        security: Implement Clang's stack initialization
        security: Move stackleak config to Kconfig.hardening
        security: Create "kernel hardening" config area
    • Merge tag 'pidfd-v5.2-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/brauner/linux · eac7078a
      Linus Torvalds authored
      Pull pidfd updates from Christian Brauner:
       "This patchset makes it possible to retrieve pidfds at process creation
        time by introducing the new flag CLONE_PIDFD to the clone() system
        call. Linus originally suggested to implement this as a new flag to
        clone() instead of making it a separate system call.
      
        After a thorough review from Oleg, CLONE_PIDFD returns pidfds in the
        parent_tidptr argument. This means we can give back the associated pid
        and the pidfd at the same time. Access to process metadata information
        thus becomes rather trivial.
      
        As has been agreed, CLONE_PIDFD creates file descriptors based on
        anonymous inodes similar to the new mount api. They are made
        unconditional by this patchset as they are now needed by core kernel
        code (vfs, pidfd) even more than they already were before (timerfd,
        signalfd, io_uring, epoll etc.). The core patchset is rather small.
        The bulky looking changelist is caused by David's very simple changes
        to Kconfig to make anon inodes unconditional.
      
        A pidfd comes with additional information in fdinfo if the kernel
        supports procfs. The fdinfo file contains the pid of the process in
        the caller's pid namespace in the same format as the procfs status
        file, i.e. "Pid:\t%d".
      
        To remove worries about missing metadata access this patchset comes
        with a sample/test program that illustrates how a combination of
        CLONE_PIDFD and pidfd_send_signal() can be used to gain race-free
        access to process metadata through /proc/<pid>.
      
        Further work based on this patchset has been done by Joel. His work
        makes pidfds pollable. It finished too late for this merge window. I
        would prefer to have it sitting in linux-next for a while and send it
        for inclusion during the 5.3 merge window"
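The fdinfo format mentioned above ("Pid:\t%d") is simple enough to parse with `sscanf`. The helper below is a hypothetical sketch, not part of the patchset or its sample program: it takes one line of text that a caller has already read from a pidfd's fdinfo file and extracts the pid, returning -1 for any line that does not match.

```c
#include <stdio.h>

/* Parse one fdinfo line of the form "Pid:\t<number>".
 * Returns the parsed pid, or -1 if the line is not a Pid line. */
static int sketch_parse_pid_line(const char *line)
{
    int pid;

    /* The whitespace in the format matches the tab after "Pid:". */
    if (sscanf(line, "Pid:\t%d", &pid) == 1)
        return pid;
    return -1;
}
```

A caller would typically read the fdinfo file line by line and pass each line to this helper until it returns a non-negative value; how the file is opened and read is left out of the sketch.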
      
      * tag 'pidfd-v5.2-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/brauner/linux:
        samples: show race-free pidfd metadata access
        signal: support CLONE_PIDFD with pidfd_send_signal
        clone: add CLONE_PIDFD
        Make anon_inodes unconditional
    • Merge tag 'stream_open-5.2' of https://lab.nexedi.com/kirr/linux · 41bc10ca
      Linus Torvalds authored
      Pull stream_open conversion from Kirill Smelkov:
      
       - remove unnecessary double nonseekable_open from drivers/char/dtlk.c
         as noticed by Pavel Machek while reviewing nonseekable_open ->
         stream_open mass conversion.
      
       - the mass conversion patch promised in commit 10dce8af ("fs:
         stream_open - opener for stream-like files so that read and write can
         run simultaneously without deadlock"), which is automatically
         generated by running
      
              $ make coccicheck MODE=patch COCCI=scripts/coccinelle/api/stream_open.cocci
      
         I've verified each generated change manually - that it is correct to
         convert - and each other nonseekable_open instance left - that it is
         either not correct to convert there, or that it is not converted due
         to current stream_open.cocci limitations. More details on this in the
         patch.
      
       - finally, change VFS to pass ppos=NULL into .read/.write for files
         that declare themselves streams. It was suggested by Rasmus Villemoes
         and makes sure that if ppos starts to be erroneously used in a stream
         file, such bug won't go unnoticed and will produce an oops instead of
         creating illusion of position change being taken into account.
      
         Note: this patch does not conflict with "fuse: Add FOPEN_STREAM to
         use stream_open()" that will be hopefully coming via FUSE tree,
         because fs/fuse/ uses new-style .read_iter/.write_iter, and for these
         accessors position is still passed as non-pointer kiocb.ki_pos .
      
      * tag 'stream_open-5.2' of https://lab.nexedi.com/kirr/linux:
        vfs: pass ppos=NULL to .read()/.write() of FMODE_STREAM files
        *: convert stream-like files from nonseekable_open -> stream_open
        dtlk: remove double call to nonseekable_open
    • Merge tag 'xfs-5.2-merge-4' of git://git.kernel.org/pub/scm/fs/xfs/xfs-linux · aa26690f
      Linus Torvalds authored
      Pull xfs updates from Darrick Wong:
       "Here's a big pile of new stuff for XFS for 5.2. XFS has grown the
        ability to report metadata health status to userspace after online
        fsck checks the filesystem. The online metadata checking code is (I
        really hope) feature complete with the addition of checks for the
        global fs counters, though it'll remain EXPERIMENTAL for now.
      
        There are also fixes for thundering herds of writeback completions and
        some other deadlocks, fixes for theoretical integer overflow attacks
        on space accounting, and removal of the long-defunct 'mntpt' option
        which was deprecated in the mid-2000s and (it turns out) totally
        broken since 2011 (and nobody complained...).
      
        Summary:
      
         - Fix some more buffer deadlocks when performing an unmount after a
           hard shutdown.
      
         - Fix some minor space accounting issues.
      
         - Fix some use after free problems.
      
         - Make the (undocumented) FITRIM behavior consistent with other
           filesystems.
      
         - Embiggen the xfs geometry ioctl's data structure.
      
         - Introduce a new AG geometry ioctl.
      
         - Introduce a new online health reporting infrastructure and ioctl
           for userspace to query a filesystem's health status.
      
         - Enhance online scrub and repair to update the health reports.
      
         - Reduce thundering herd problems when writeback io completes.
      
         - Fix some transaction reservation type errors.
      
         - Fix integer overflow problems with delayed alloc reservation
           counters.
      
         - Fix some problems where we would exit to userspace without
           unlocking.
      
         - Fix inconsistent behavior when finishing deferred ops fails.
      
         - Strengthen scrub to check incore data against ondisk metadata.
      
         - Remove long-broken mntpt mount option.
      
         - Add an online scrub function for the filesystem summary counters,
           which should make online metadata scrub more or less feature
           complete for now.
      
         - Various cleanups"
      
      * tag 'xfs-5.2-merge-4' of git://git.kernel.org/pub/scm/fs/xfs/xfs-linux: (38 commits)
        xfs: change some error-less functions to void types
        xfs: add online scrub for superblock counters
        xfs: don't parse the mtpt mount option
        xfs: always rejoin held resources during defer roll
        xfs: add missing error check in xfs_prepare_shift()
        xfs: scrub should check incore counters against ondisk headers
        xfs: allow scrubbers to pause background reclaim
        xfs: rename the speculative block allocation reclaim toggle functions
        xfs: track delayed allocation reservations across the filesystem
        xfs: fix broken bhold behavior in xrep_roll_ag_trans
        xfs: unlock inode when xfs_ioctl_setattr_get_trans can't get transaction
        xfs: kill the xfs_dqtrx_t typedef
        xfs: widen inode delalloc block counter to 64-bits
        xfs: widen quota block counters to 64-bit integers
        xfs: abort unaligned nowait directio early
        xfs: assert that we don't enter agfl freeing with a non-permanent transaction
        xfs: make tr_growdata a permanent transaction
        xfs: merge adjacent io completions of the same type
        xfs: remove unused m_data_workqueue
        xfs: implement per-inode writeback completion queues
        ...
    • Merge tag 'iomap-5.2-merge-2' of git://git.kernel.org/pub/scm/fs/xfs/xfs-linux · d8456eaf
      Linus Torvalds authored
      Pull iomap updates from Darrick Wong:
       "Nothing particularly exciting here, just adding some callouts for gfs2
        and cleaning a few things.
      
        Summary:
      
         - Add some extra hooks to the iomap buffered write path to enable
           gfs2 journalled writes
      
         - SPDX conversion
      
         - Various refactoring"
      
      * tag 'iomap-5.2-merge-2' of git://git.kernel.org/pub/scm/fs/xfs/xfs-linux:
        iomap: move iomap_read_inline_data around
        iomap: Add a page_prepare callback
        iomap: Fix use-after-free error in page_done callback
        fs: Turn __generic_write_end into a void function
        iomap: Clean up __generic_write_end calling
        iomap: convert to SPDX identifier
    • Merge tag 'jfs-5.2' of git://github.com/kleikamp/linux-shaggy · b8cac3cd
      Linus Torvalds authored
      Pull jfs updates from Dave Kleikamp:
       "Several minor jfs fixes"
      
      * tag 'jfs-5.2' of git://github.com/kleikamp/linux-shaggy:
        jfs: fix bogus variable self-initialization
        fs/jfs: Switch to use new generic UUID API
        jfs: compare old and new mode before setting update_mode flag
        jfs: remove incorrect comment in jfs_superblock
        jfs: fix spelling mistake, EACCESS -> EACCES
    • Merge tag 'for-5.2-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/kdave/linux · 9f2e3a53
      Linus Torvalds authored
      Pull btrfs updates from David Sterba:
       "This time the majority of changes are cleanups, though there's still a
        number of changes of user interest.
      
        User visible changes:
      
         - better read time and write checks to catch errors early and before
           writing data to disk (to catch potential memory corruption on data
           that get checksummed)
      
         - qgroups + metadata relocation: last speed up patch in the series
           to address the slowness, there should be no overhead comparing
           balance with and without qgroups
      
         - FIEMAP ioctl does not start a transaction unnecessarily, this can
           result in a speed up and less blocking due to IO
      
         - LOGICAL_INO (v1, v2) does not start transaction unnecessarily, this
           can speed up the mentioned ioctl and scrub as well
      
         - fsync on files with many (but not too many) hardlinks is faster,
           finer decision if the links should be fsynced individually or
           completely
      
         - send tries harder to find ranges to clone
      
         - trim/discard will skip unallocated chunks that haven't been touched
           since the last mount
      
        Fixes:
      
         - send flushes delayed allocation before start, otherwise it could
           miss some changes in case of a very recent rw->ro switch of a
           subvolume
      
         - fix fallocate with qgroups that could lead to space accounting
           underflow, reported as a warning
      
         - trim/discard ioctl honours the requested range
      
         - starting send and dedupe on a subvolume at the same time will let
           only one of them succeed, this is to prevent changes that send
           could miss due to dedupe; both operations are restartable
      
        Core changes:
      
         - more tree-checker validations, errors reported by fuzzing tools:
            - device item
            - inode item
            - block group profiles
      
         - tracepoints for extent buffer locking
      
         - async cow preallocates memory to avoid errors happening too deep in
           the call chain
      
         - metadata reservations for delalloc reworked to better adapt in
           many-writers/low-space scenarios
      
         - improved space flushing logic for intense DIO vs buffered workloads
      
         - lots of cleanups
            - removed unused struct members
            - redundant argument removal
            - properties and xattrs
            - extent buffer locking
            - selftests
            - use common file type conversions
            - many-argument functions reduction"
      
      * tag 'for-5.2-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/kdave/linux: (227 commits)
        btrfs: Use kvmalloc for allocating compressed path context
        btrfs: Factor out common extent locking code in submit_compressed_extents
        btrfs: Set io_tree only once in submit_compressed_extents
        btrfs: Replace clear_extent_bit with unlock_extent
        btrfs: Make compress_file_range take only struct async_chunk
        btrfs: Remove fs_info from struct async_chunk
        btrfs: Rename async_cow to async_chunk
        btrfs: Preallocate chunks in cow_file_range_async
        btrfs: reserve delalloc metadata differently
        btrfs: track DIO bytes in flight
        btrfs: merge calls of btrfs_setxattr and btrfs_setxattr_trans in btrfs_set_prop
        btrfs: delete unused function btrfs_set_prop_trans
        btrfs: start transaction in xattr_handler_set_prop
        btrfs: drop local copy of inode i_mode
        btrfs: drop old_fsflags in btrfs_ioctl_setflags
        btrfs: modify local copy of btrfs_inode flags
        btrfs: drop useless inode i_flags copy and restore
        btrfs: start transaction in btrfs_ioctl_setflags()
        btrfs: export btrfs_set_prop
        btrfs: refactor btrfs_set_props to validate externally
        ...
    • Merge branch 'stable-fodder' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs · 78438ce1
      Linus Torvalds authored
      Pull vfs stable fodder fixes from Al Viro:
      
       - acct_on() fix for deadlock caught by overlayfs folks
      
       - autofs RCU use-after-free SNAFU (->d_manage() can be called
         locklessly, so we need to RCU-delay freeing the objects it looks at)
      
       - (hopefully) the end of "do we need freeing this dentry RCU-delayed"
         whack-a-mole.
      
      * 'stable-fodder' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs:
        autofs: fix use-after-free in lockless ->d_manage()
        dcache: sort the freeing-without-RCU-delay mess for good.
        acct_on(): don't mess with freeze protection
    • Merge branch 'work.icache' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs · 168e153d
      Linus Torvalds authored
      Pull vfs inode freeing updates from Al Viro:
       "Introduction of separate method for RCU-delayed part of
        ->destroy_inode() (if any).
      
        Pretty much as posted, except that destroy_inode() stashes
        ->free_inode into the victim (anon-unioned with ->i_fops) before
        scheduling i_callback() and the last two patches (sockfs conversion
        and folding struct socket_wq into struct socket) are excluded - that
        pair should go through netdev once davem reopens his tree"
      
      * 'work.icache' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs: (58 commits)
        orangefs: make use of ->free_inode()
        shmem: make use of ->free_inode()
        hugetlb: make use of ->free_inode()
        overlayfs: make use of ->free_inode()
        jfs: switch to ->free_inode()
        fuse: switch to ->free_inode()
        ext4: make use of ->free_inode()
        ecryptfs: make use of ->free_inode()
        ceph: use ->free_inode()
        btrfs: use ->free_inode()
        afs: switch to use of ->free_inode()
        dax: make use of ->free_inode()
        ntfs: switch to ->free_inode()
        securityfs: switch to ->free_inode()
        apparmor: switch to ->free_inode()
        rpcpipe: switch to ->free_inode()
        bpf: switch to ->free_inode()
        mqueue: switch to ->free_inode()
        ufs: switch to ->free_inode()
        coda: switch to ->free_inode()
        ...
    • Merge branch 'x86-fpu-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 8ff468c2
      Linus Torvalds authored
      Pull x86 FPU state handling updates from Borislav Petkov:
       "This contains work started by Rik van Riel and brought to fruition by
        Sebastian Andrzej Siewior with the main goal to optimize when to load
        FPU registers: only when returning to userspace and not on every
        context switch (while the task remains in the kernel).
      
        In addition, this optimization makes kernel_fpu_begin() cheaper by
        requiring register saving only on the first invocation and skipping
        that in following ones.
      
        What is more, this series cleans up and streamlines many aspects of
        the already complex FPU code, hopefully making it more palatable for
        future improvements and simplifications.
      
        Finally, there's a __user annotations fix from Jann Horn"
      
      * 'x86-fpu-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (29 commits)
        x86/fpu: Fault-in user stack if copy_fpstate_to_sigframe() fails
        x86/pkeys: Add PKRU value to init_fpstate
        x86/fpu: Restore regs in copy_fpstate_to_sigframe() in order to use the fastpath
        x86/fpu: Add a fastpath to copy_fpstate_to_sigframe()
        x86/fpu: Add a fastpath to __fpu__restore_sig()
        x86/fpu: Defer FPU state load until return to userspace
        x86/fpu: Merge the two code paths in __fpu__restore_sig()
        x86/fpu: Restore from kernel memory on the 64-bit path too
        x86/fpu: Inline copy_user_to_fpregs_zeroing()
        x86/fpu: Update xstate's PKRU value on write_pkru()
        x86/fpu: Prepare copy_fpstate_to_sigframe() for TIF_NEED_FPU_LOAD
        x86/fpu: Always store the registers in copy_fpstate_to_sigframe()
        x86/entry: Add TIF_NEED_FPU_LOAD
        x86/fpu: Eager switch PKRU state
        x86/pkeys: Don't check if PKRU is zero before writing it
        x86/fpu: Only write PKRU if it is different from current
        x86/pkeys: Provide *pkru() helpers
        x86/fpu: Use a feature number instead of mask in two more helpers
        x86/fpu: Make __raw_xsave_addr() use a feature number instead of mask
        x86/fpu: Add an __fpregs_load_activate() internal helper
        ...
      8ff468c2
    • Linus Torvalds's avatar
      Merge tag 'ktest-v5.1' of git://git.kernel.org/pub/scm/linux/kernel/git/rostedt/linux-ktest · 68253e71
      Linus Torvalds authored
      Pull ktest updates from Steven Rostedt:
       "Minor updates to ktest.pl
      
         - Handle meta characters in grub menu
      
         - Use configurable reboot return code for handling ssh reboots
      
         - Display names and iteration number on error message"
      
      * tag 'ktest-v5.1' of git://git.kernel.org/pub/scm/linux/kernel/git/rostedt/linux-ktest:
        ktest: introduce REBOOT_RETURN_CODE to confirm the result of REBOOT
        ktest: Add support for meta characters in GRUB_MENU
        ktest: Show name and iteration on errors
      68253e71
    • Linus Torvalds's avatar
      Merge tag 'printk-for-5.2' of git://git.kernel.org/pub/scm/linux/kernel/git/pmladek/printk · 09686219
      Linus Torvalds authored
      Pull printk updates from Petr Mladek:
      
       - Allow state reset of printk_once() calls.
      
       - Prevent crashes when dereferencing invalid pointers in vsprintf().
         Only the first byte is checked for simplicity.
      
       - Make vsprintf warnings consistent and inlined.
      
       - Treewide conversion of obsolete %pf, %pF to %ps, %pS printf
         modifiers.
      
       - Some clean up of vsprintf and test_printf code.
      
      * tag 'printk-for-5.2' of git://git.kernel.org/pub/scm/linux/kernel/git/pmladek/printk:
        lib/vsprintf: Make function pointer_string static
        vsprintf: Limit the length of inlined error messages
        vsprintf: Avoid confusion between invalid address and value
        vsprintf: Prevent crash when dereferencing invalid pointers
        vsprintf: Consolidate handling of unknown pointer specifiers
        vsprintf: Factor out %pO handler as kobject_string()
        vsprintf: Factor out %pV handler as va_format()
        vsprintf: Factor out %p[iI] handler as ip_addr_string()
        vsprintf: Do not check address of well-known strings
        vsprintf: Consistent %pK handling for kptr_restrict == 0
        vsprintf: Shuffle restricted_pointer()
        printk: Tie printk_once / printk_deferred_once into .data.once for reset
        treewide: Switch printk users from %pf and %pF to %ps and %pS, respectively
        lib/test_printf: Switch to bitmap_zalloc()
      09686219
  2. May 07, 2019
    • Linus Torvalds's avatar
      Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/livepatching/livepatching · 573de2a6
      Linus Torvalds authored
      Pull livepatching updates from Jiri Kosina:
      
       - livepatching kselftests improvements from Joe Lawrence and Miroslav
         Benes
      
       - making use of gcc's -flive-patching option when available, from
         Miroslav Benes
      
       - kobject handling cleanups, from Petr Mladek
      
      * 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/livepatching/livepatching:
        livepatch: Remove duplicated code for early initialization
        livepatch: Remove custom kobject state handling
        livepatch: Convert error about unsupported reliable stacktrace into a warning
        selftests/livepatch: Add functions.sh to TEST_PROGS_EXTENDED
        kbuild: use -flive-patching when CONFIG_LIVEPATCH is enabled
        selftests/livepatch: use TEST_PROGS for test scripts
      573de2a6
    • Linus Torvalds's avatar
      Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/hid/hid · b4dd05de
      Linus Torvalds authored
      Pull HID updates from Jiri Kosina:
      
       - support for U2F Zero device, from Andrej Shadura
      
       - logitech-dj has historically been treating devices behind
         non-unifying receivers as generic devices, using the HID emulation in
         the receiver. That had several shortcomings (special keys handling,
         battery level monitoring, etc). The driver has been reworked to
         enumerate (and directly communicate with) the devices behind the
         receiver, to avoid the (too) generic HID implementation in the
         receiver itself. All the work done by Benjamin Tissoires and Hans de
         Goede.
      
       - restructuring of intel-ish driver in order to allow for multiple
         clients of the ISH implementation, from Srinivas Pandruvada
      
       - several other smaller fixes and assorted device ID additions
      
      * 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/hid/hid: (68 commits)
        HID: logitech-dj: fix spelling in printk
        HID: input: fix assignment of .value
        HID: input: make sure the wheel high resolution multiplier is set
        HID: logitech-dj: add usbhid dependency in Kconfig
        HID: logitech-hidpp: add support for HID++ 1.0 consumer keys reports
        HID: logitech-hidpp: add support for HID++ 1.0 extra mouse buttons reports
        HID: logitech-hidpp: add support for HID++ 1.0 wheel reports
        HID: logitech-hidpp: make hidpp10_set_register_bit a bit more generic
        HID: logitech-hidpp: add input_device ptr to struct hidpp_device
        HID: logitech-hidpp: do not hardcode very long report length
        HID: logitech-hidpp: handle devices attached to 27MHz wireless receivers
        HID: logitech-hidpp: use RAP instead of FAP to get the protocol version
        HID: logitech-hidpp: remove unused origin_is_hid_core function parameter
        HID: logitech-hidpp: remove double assignment from __hidpp_send_report
        HID: logitech-hidpp: do not make failure to get the name fatal
        HID: logitech-hidpp: ignore very-short or empty names
        HID: logitech-hidpp: make .probe usbhid capable
        HID: logitech-hidpp: allow non HID++ devices to be handled by this module
        HID: logitech-dj: add support for Logitech Bluetooth Mini-Receiver
        HID: logitech-dj: make appending of the HID++ descriptors conditional
        ...
      b4dd05de
    • Linus Torvalds's avatar
      Merge tag 'i3c/for-5.2' of git://git.kernel.org/pub/scm/linux/kernel/git/i3c/linux · 80104bb0
      Linus Torvalds authored
      Pull i3c update from Boris Brezillon:
      
       - Fix a shift wrap bug in the core
      
       - Remove dead code in the DW driver
      
      * tag 'i3c/for-5.2' of git://git.kernel.org/pub/scm/linux/kernel/git/i3c/linux:
        i3c: Fix a shift wrap bug in i3c_bus_set_addr_slot_status()
        i3c: master: dw: remove dead code from dw_i3c_master_*_xfers()
      80104bb0
    • Linus Torvalds's avatar
      Merge branch 'next-general' of git://git.kernel.org/pub/scm/linux/kernel/git/jmorris/linux-security · 78ee8b1b
      Linus Torvalds authored
      Pull security subsystem updates from James Morris:
       "Just a few bugfixes and documentation updates"
      
      * 'next-general' of git://git.kernel.org/pub/scm/linux/kernel/git/jmorris/linux-security:
        seccomp: fix up grammar in comment
        Revert "security: inode: fix a missing check for securityfs_create_file"
        Yama: mark function as static
        security: inode: fix a missing check for securityfs_create_file
        keys: safe concurrent user->{session,uid}_keyring access
        security: don't use RCU accessors for cred->session_keyring
        Yama: mark local symbols as static
        LSM: lsm_hooks.h: fix documentation format
        LSM: fix documentation for the shm_* hooks
        LSM: fix documentation for the sem_* hooks
        LSM: fix documentation for the msg_queue_* hooks
        LSM: fix documentation for the audit_* hooks
        LSM: fix documentation for the path_chmod hook
        LSM: fix documentation for the socket_getpeersec_dgram hook
        LSM: fix documentation for the task_setscheduler hook
        LSM: fix documentation for the socket_post_create hook
        LSM: fix documentation for the syslog hook
        LSM: fix documentation for sb_copy_data hook
      78ee8b1b
    • Linus Torvalds's avatar
      Merge tag 'spi-v5.2' of git://git.kernel.org/pub/scm/linux/kernel/git/broonie/spi · 9bff9dfc
      Linus Torvalds authored
      Pull spi updates from Mark Brown:
       "One small feature was added this release but the bulk of the diffstat
        and the changelog comes from the fact that several older drivers got
        some fairly hefty reworks and a couple of new drivers were added:
      
         - Support for detailed control of timing around chip selects from
           Sowjanya Komatineni.
      
         - A big set of fixes and improvements for the Tegra114 driver from
           Sowjanya Komatineni.
      
         - A big simplification of the GPIO driver from Andrey Smirnov.
      
         - DMA support and fixes for the Freescale LPSPI driver from Clark
           Wang.
      
         - Fixes and optimizations for the bcm2835aux from Martin Sparl.
      
         - New drivers for Mediatek MT7621 (graduated from staging) and Zynq
           QSPI"
      
      [ This is a so-called "evil merge" that additionally removes a warning
        due to an unused variable 'i' introduced by commit 1dfbf334 ("spi:
        ep93xx: Convert to use CS GPIO descriptors") - Linus ]
      
      * tag 'spi-v5.2' of git://git.kernel.org/pub/scm/linux/kernel/git/broonie/spi: (127 commits)
        spi: rspi: Fix handling of QSPI code when transmit and receive
        spi: atmel-quadspi: fix crash while suspending
        spi: stm32: return the get_irq error
        spi: tegra114: fix PIO transfer
        spi: pxa2xx: fix SCR (divisor) calculation
        spi: Clear SPI_CS_HIGH flag from bad_bits for GPIO chip-select
        spi: ep93xx: Convert to use CS GPIO descriptors
        spi: AD ASoC: declare missing of table
        spi: spi-mem: zynq-qspi: Fix build error on architectures missing readsl/writesl
        spi: stm32-qspi: manage the get_irq error case
        spi/spi-bcm2835: Split transfers that exceed DLEN
        spi: expand mode support
        dt-bindings: spi: spi-mt65xx: add support for MT8516
        spi: pxa2xx: Add support for Intel Comet Lake
        spi/trace: Cap buffer contents at 64 bytes
        spi: Release spi_res after finalizing message
        spi: Remove warning in spi_split_transfers_maxsize()
        spi: Remove one needless transfer speed fall back case
        spi: sh-msiof: Document r8a77470 bindings
        spi: pxa2xx: use a module softdep for dw_dmac
        ...
      9bff9dfc
    • Linus Torvalds's avatar
      Merge tag 'regulator-v5.2' of git://git.kernel.org/pub/scm/linux/kernel/git/broonie/regulator · 61be53f9
      Linus Torvalds authored
      Pull regulator updates from Mark Brown:
       "In terms of big picture changes this has been an extremely quiet
        release however there's a lot of changes and a fairly big diffstat
        thanks to a bunch of small fixes, mainly coming from Axel Lin. Thanks
        to his work this release removes code overall even though we've added
        a new (albeit fairly small) driver.
      
        Notable things:
      
         - A fix for a long standing issue with locking on error interrupts
           from Steve Twiss.
      
         - A new driver for ST Microelectronics STM32 PWR"
      
      * tag 'regulator-v5.2' of git://git.kernel.org/pub/scm/linux/kernel/git/broonie/regulator: (134 commits)
        regulator: core: simplify return value on suported_voltage
        regulator: da9xxx: Switch to SPDX identifier
        regulator: stm32-pwr: Remove unneeded .min_uV and .list_volage
        regulator: stm32-pwr: Remove unneeded *desc from struct stm32_pwr_reg
        regulator: ab3100: Set fixed_uV instead of min_uV for fixed regulators
        regulator: ab3100: Constify regulator_ops and ab3100_regulator_desc
        regulator: pv880x0: Switch to SPDX identifier
        regulator: hi6xxx: Switch to SPDX identifier
        regulator: vexpress: Switch to SPDX identifier
        regulator: vexpress: Get rid of struct vexpress_regulator
        regulator: sky81452: Switch to SPDX identifier
        regulator: sky81452: Constify sky81452_reg_ops
        regulator: sy8106a: Get rid of struct sy8106a
        regulator: core: do not report EPROBE_DEFER as error but as debug
        regulator: mt63xx: Switch to SPDX identifier
        regulator: fan53555: Switch to SPDX identifier
        regulator: fan53555: Clean up unneeded fields from struct fan53555_device_info
        regulator: ltc3589: Switch to SPDX identifier
        regulator: ltc3589: Get rid of struct ltc3589_regulator
        regulator: ltc3589: Convert to use simplified DT parsing
        ...
      61be53f9
    • Linus Torvalds's avatar
      Merge tag 'regmap-v5.2' of git://git.kernel.org/pub/scm/linux/kernel/git/broonie/regmap · 962d5ecc
      Linus Torvalds authored
      Pull regmap updates from Mark Brown:
       "A larger than usual set of changes, though mainly small:
      
         - An optimization to the debugfs code to greatly improve performance
           when dumping extremely sparse register maps from Lucas Tanure.
      
         - Stricter enforcement of writability checks from Han Nandor.
      
         - A fix for default interrupt mode configuration from Srinivas
           Kandagatla.
      
         - SPDX header conversion from Greg Kroah-Hartman"
      
      * tag 'regmap-v5.2' of git://git.kernel.org/pub/scm/linux/kernel/git/broonie/regmap:
        regmap: add proper SPDX identifiers on files that did not have them.
        regmap: verify if register is writeable before writing operations
        regmap: regmap-irq: fix getting type default values
        regmap: debugfs: Jump to the next readable register
        regmap: debugfs: Replace code by already existing function
      962d5ecc
    • Christian Brauner's avatar
      samples: show race-free pidfd metadata access · 43c6afee
      Christian Brauner authored
      
      
      This is a sample program showing userspace how to get race-free access
      to process metadata from a pidfd.  It is rather easy to do and userspace
      can actually simply reuse code that currently parses a process's status
      file in procfs.
      The program can easily be extended into a generic helper suitable for
      inclusion in a libc to make it even easier for userspace to gain metadata
      access.
      
      Since this came up in a discussion because this API is going to be used
      in various service managers: A lot of programs will have a whitelist
      seccomp filter that returns <some-errno> for all new syscalls.  This
      means that programs might get confused if CLONE_PIDFD works but the
      later pidfd_send_signal() syscall doesn't.  Hence, here's an
      ahead-of-time check that pidfd_send_signal() is supported:
      
      bool pidfd_send_signal_supported()
      {
              int procfd = open("/proc/self", O_DIRECTORY | O_RDONLY | O_CLOEXEC);
              if (procfd < 0)
                      return false;
      
              /*
               * A process is always allowed to signal itself so
               * pidfd_send_signal() should never fail this test. If it does
               * it must mean it is not available, blocked by an LSM, seccomp,
               * or other.
               */
              return pidfd_send_signal(procfd, 0, NULL, 0) == 0;
      }
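
The check above calls pidfd_send_signal() directly, which presumes a wrapper
is available. A minimal sketch of one follows, assuming the syscall number
424 as a fallback when the installed headers do not define it. The names
sys_pidfd_send_signal() and pidfd_send_signal_supported_sketch() are
hypothetical, and unlike the snippet above this variant closes the /proc/self
descriptor before returning.

```c
#ifndef _GNU_SOURCE
#define _GNU_SOURCE
#endif
#include <assert.h>
#include <fcntl.h>
#include <signal.h>
#include <stdbool.h>
#include <stddef.h>
#include <sys/syscall.h>
#include <unistd.h>

#ifndef __NR_pidfd_send_signal
#define __NR_pidfd_send_signal 424 /* assumed fallback for older headers */
#endif

/* Hypothetical thin wrapper around the raw syscall. */
static int sys_pidfd_send_signal(int pidfd, int sig, siginfo_t *info,
                                 unsigned int flags)
{
        return (int)syscall(__NR_pidfd_send_signal, pidfd, sig, info, flags);
}

/* Same idea as the check above, but the descriptor is closed again. */
static bool pidfd_send_signal_supported_sketch(void)
{
        int procfd = open("/proc/self", O_DIRECTORY | O_RDONLY | O_CLOEXEC);
        bool ok;

        if (procfd < 0)
                return false;

        /* Signal 0 delivers nothing; it only checks that sending is allowed. */
        ok = sys_pidfd_send_signal(procfd, 0, NULL, 0) == 0;
        assert(close(procfd) == 0);
        return ok;
}
```

A false result does not say why the call failed; errno would have to be
inspected to tell ENOSYS apart from a seccomp or LSM denial.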
      
      Signed-off-by: Christian Brauner <christian@brauner.io>
      Co-developed-by: Jann Horn <jannh@google.com>
      Signed-off-by: Jann Horn <jannh@google.com>
      Reviewed-by: Oleg Nesterov <oleg@redhat.com>
      Cc: Arnd Bergmann <arnd@arndb.de>
      Cc: "Eric W. Biederman" <ebiederm@xmission.com>
      Cc: Kees Cook <keescook@chromium.org>
      Cc: Thomas Gleixner <tglx@linutronix.de>
      Cc: David Howells <dhowells@redhat.com>
      Cc: "Michael Kerrisk (man-pages)" <mtk.manpages@gmail.com>
      Cc: Andy Lutomirsky <luto@kernel.org>
      Cc: Andrew Morton <akpm@linux-foundation.org>
      Cc: Aleksa Sarai <cyphar@cyphar.com>
      Cc: Linus Torvalds <torvalds@linux-foundation.org>
      Cc: Al Viro <viro@zeniv.linux.org.uk>
      43c6afee
    • Christian Brauner's avatar
      signal: support CLONE_PIDFD with pidfd_send_signal · 2151ad1b
      Christian Brauner authored
      
      
      Let pidfd_send_signal() use pidfds retrieved via CLONE_PIDFD.  With this
      patch pidfd_send_signal() becomes independent of procfs.  This fulfills
      the request made when we merged the pidfd_send_signal() patchset.  The
      pidfd_send_signal() syscall is now always available allowing for it to
      be used by users without procfs mounted or even users without procfs
      support compiled into the kernel.
      
      Signed-off-by: Christian Brauner <christian@brauner.io>
      Co-developed-by: Jann Horn <jannh@google.com>
      Signed-off-by: Jann Horn <jannh@google.com>
      Acked-by: Oleg Nesterov <oleg@redhat.com>
      Cc: Arnd Bergmann <arnd@arndb.de>
      Cc: "Eric W. Biederman" <ebiederm@xmission.com>
      Cc: Kees Cook <keescook@chromium.org>
      Cc: Thomas Gleixner <tglx@linutronix.de>
      Cc: David Howells <dhowells@redhat.com>
      Cc: "Michael Kerrisk (man-pages)" <mtk.manpages@gmail.com>
      Cc: Andy Lutomirsky <luto@kernel.org>
      Cc: Andrew Morton <akpm@linux-foundation.org>
      Cc: Aleksa Sarai <cyphar@cyphar.com>
      Cc: Linus...
      2151ad1b
    • Christian Brauner's avatar
      clone: add CLONE_PIDFD · b3e58382
      Christian Brauner authored
      
      
      This patchset makes it possible to retrieve pid file descriptors at
      process creation time by introducing the new flag CLONE_PIDFD to the
      clone() system call.  Linus originally suggested to implement this as a
      new flag to clone() instead of making it a separate system call.  As
      spotted by Linus, there is exactly one bit for clone() left.
      
      CLONE_PIDFD creates file descriptors based on the anonymous inode
      implementation in the kernel that will also be used to implement the new
      mount api.  They serve as a simple opaque handle on pids.  Logically,
      this makes it possible to interpret a pidfd differently, narrowing or
      widening the scope of various operations (e.g. signal sending).  Thus, a
      pidfd can refer not only to a tgid but also to a tid, or in theory - given
      appropriate flag arguments in relevant syscalls - a process group or
      session. A pidfd does not represent a privilege.  This does not imply it
      cannot ever be that way but for now this is not the case.
      
      A pidfd comes with additional information in fdinfo if the kernel supports
      procfs.  The fdinfo file contains the pid of the process in the caller's
      pid namespace in the same format as the procfs status file, i.e. "Pid:\t%d".
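
As an illustration of the "Pid:\t%d" format, here is a minimal sketch of a
parser that pulls the pid out of fdinfo text already read into memory. The
function name pid_from_fdinfo_text() is hypothetical, and reading
/proc/self/fdinfo/<pidfd> into a buffer is left to the caller.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/*
 * Scan fdinfo-style text line by line and return the value of the first
 * "Pid:\t%d" line, or -1 if no such line is present.
 */
static int pid_from_fdinfo_text(const char *text)
{
        const char *line = text;

        assert(text != NULL);
        while (line != NULL && *line != '\0') {
                int pid;

                if (sscanf(line, "Pid:\t%d", &pid) == 1)
                        return pid;

                /* Advance to the start of the next line, if there is one. */
                line = strchr(line, '\n');
                if (line != NULL)
                        line++;
        }
        return -1;
}
```

Because the format matches the procfs status file, the same loop could be
pointed at status-file contents as well.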
      
      As suggested by Oleg, with CLONE_PIDFD the pidfd is returned in the
      parent_tidptr argument of clone.  This has the advantage that we can
      give back the associated pid and the pidfd at the same time.
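
A minimal sketch of that calling convention, under stated assumptions: it
uses the raw clone syscall with the x86-64/arm64 argument order (where
parent_tidptr is the third argument), defines CLONE_PIDFD itself when the
headers lack it, and zeroes the slot before the call on the assumption that
early CLONE_PIDFD kernels expected that. fork_with_pidfd() and reap_child()
are hypothetical helper names.

```c
#ifndef _GNU_SOURCE
#define _GNU_SOURCE
#endif
#include <assert.h>
#include <signal.h>
#include <stddef.h>
#include <sys/syscall.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

#ifndef CLONE_PIDFD
#define CLONE_PIDFD 0x00001000 /* flag value from the kernel uapi headers */
#endif

/*
 * Fork-like clone that also requests a pidfd.  In the parent, returns the
 * child's pid (or -1 on error) and stores the pidfd in *pidfd.
 */
static pid_t fork_with_pidfd(int *pidfd)
{
        long pid;

        assert(pidfd != NULL);
        *pidfd = 0; /* assumption: zeroed slot, see the note above */

        pid = syscall(SYS_clone, (unsigned long)(CLONE_PIDFD | SIGCHLD),
                      NULL, pidfd, NULL, 0UL);
        if (pid == 0)
                _exit(0); /* child: nothing to do in this sketch */
        return (pid_t)pid;
}

/* Collect the child's exit status so it does not linger as a zombie. */
static int reap_child(pid_t pid)
{
        return waitpid(pid, NULL, 0) == pid ? 0 : -1;
}
```

On a kernel that does not know CLONE_PIDFD the slot may be left untouched, so
a caller would still need a separate check before trusting the descriptor.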
      
      To remove worries about missing metadata access this patchset comes with
      a sample program that illustrates how a combination of CLONE_PIDFD and
      pidfd_send_signal() can be used to gain race-free access to process
      metadata through /proc/<pid>.  The sample program can easily be
      translated into a helper that would be suitable for inclusion in libc so
      that users don't have to worry about writing it themselves.
      
      Suggested-by: Linus Torvalds <torvalds@linux-foundation.org>
      Signed-off-by: Christian Brauner <christian@brauner.io>
      Co-developed-by: Jann Horn <jannh@google.com>
      Signed-off-by: Jann Horn <jannh@google.com>
      Reviewed-by: Oleg Nesterov <oleg@redhat.com>
      Cc: Arnd Bergmann <arnd@arndb.de>
      Cc: "Eric W. Biederman" <ebiederm@xmission.com>
      Cc: Kees Cook <keescook@chromium.org>
      Cc: Thomas Gleixner <tglx@linutronix.de>
      Cc: David Howells <dhowells@redhat.com>
      Cc: "Michael Kerrisk (man-pages)" <mtk.manpages@gmail.com>
      Cc: Andy Lutomirsky <luto@kernel.org>
      Cc: Andrew Morton <akpm@linux-foundation.org>
      Cc: Aleksa Sarai <cyphar@cyphar.com>
      Cc: Linus Torvalds <torvalds@linux-foundation.org>
      Cc: Al Viro <viro@zeniv.linux.org.uk>
      b3e58382
    • Linus Torvalds's avatar
      Merge tag 'linux-kselftest-5.2-rc1' of... · 71ae5fc8
      Linus Torvalds authored
      Merge tag 'linux-kselftest-5.2-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/shuah/linux-kselftest
      
      Pull Kselftest updates from Shuah Khan:
      
       - fixes to seccomp test, and kselftest framework
      
       - cleanups to remove duplicate header defines
      
       - fixes to efivarfs "make clean" target
      
       - cgroup cleanup path
      
       - Moving the IMA kexec_load selftest to selftests/kexec work from Mimi
         Johar and Petr Vorel
      
       - Addition of a kselftest framework for writing kernel test modules,
         from Tobin C. Harding
      
      * tag 'linux-kselftest-5.2-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/shuah/linux-kselftest: (29 commits)
        selftests: build and run gpio when output directory is the src dir
        selftests/ipc: Fix msgque compiler warnings
        selftests/efivarfs: clean up test files from test_create*()
        selftests: fix headers_install circular dependency
        selftests/kexec: update get_secureboot_mode
        selftests/kexec: make kexec_load test independent of IMA being enabled
        selftests/kexec: check kexec_load and kexec_file_load are enabled
        selftests/kexec: Add missing '=y' to config options
        selftests/kexec: kexec_file_load syscall test
        selftests/kexec: define "require_root_privileges"
        selftests/kexec: define common logging functions
        selftests/kexec: define a set of common functions
        selftests/kexec: cleanup the kexec selftest
        selftests/kexec: move the IMA kexec_load selftest to selftests/kexec
        selftests/harness: Add 30 second timeout per test
        selftests/seccomp: Handle namespace failures gracefully
        selftests: cgroup: fix cleanup path in test_memcg_subtree_control()
        selftests: efivarfs: remove the test_create_read file if it was exist
        rseq/selftests: Adapt number of threads to the number of detected cpus
        lib: Add test module for strscpy_pad
        ...
      71ae5fc8
    • Linus Torvalds's avatar
      Merge branch 'linus' of git://git.kernel.org/pub/scm/linux/kernel/git/herbert/crypto-2.6 · 81ff5d2c
      Linus Torvalds authored
      Pull crypto update from Herbert Xu:
       "API:
         - Add support for AEAD in simd
         - Add fuzz testing to testmgr
         - Add panic_on_fail module parameter to testmgr
         - Use per-CPU struct instead of multiple variables in scompress
         - Change verify API for akcipher
      
        Algorithms:
         - Convert x86 AEAD algorithms over to simd
         - Forbid 2-key 3DES in FIPS mode
         - Add EC-RDSA (GOST 34.10) algorithm
      
        Drivers:
         - Set output IV with ctr-aes in crypto4xx
         - Set output IV in rockchip
         - Fix potential length overflow with hashing in sun4i-ss
         - Fix computation error with ctr in vmx
         - Add SM4 protected keys support in ccree
         - Remove long-broken mxc-scc driver
         - Add rfc4106(gcm(aes)) cipher support in cavium/nitrox"
      
      * 'linus' of git://git.kernel.org/pub/scm/linux/kernel/git/herbert/crypto-2.6: (179 commits)
        crypto: ccree - use a proper le32 type for le32 val
        crypto: ccree - remove set but not used variable 'du_size'
        crypto: ccree - Make cc_sec_disable static
        crypto: ccree - fix spelling mistake "protedcted" -> "protected"
        crypto: caam/qi2 - generate hash keys in-place
        crypto: caam/qi2 - fix DMA mapping of stack memory
        crypto: caam/qi2 - fix zero-length buffer DMA mapping
        crypto: stm32/cryp - update to return iv_out
        crypto: stm32/cryp - remove request mutex protection
        crypto: stm32/cryp - add weak key check for DES
        crypto: atmel - remove set but not used variable 'alg_name'
        crypto: picoxcell - Use dev_get_drvdata()
        crypto: crypto4xx - get rid of redundant using_sd variable
        crypto: crypto4xx - use sync skcipher for fallback
        crypto: crypto4xx - fix cfb and ofb "overran dst buffer" issues
        crypto: crypto4xx - fix ctr-aes missing output IV
        crypto: ecrdsa - select ASN1 and OID_REGISTRY for EC-RDSA
        crypto: ux500 - use ccflags-y instead of CFLAGS_<basename>.o
        crypto: ccree - handle tee fips error during power management resume
        crypto: ccree - add function to handle cryptocell tee fips error
        ...
      81ff5d2c
    • Linus Torvalds's avatar
      Merge tag 'hwmon-for-v5.2' of git://git.kernel.org/pub/scm/linux/kernel/git/groeck/linux-staging · 7aefd944
      Linus Torvalds authored
      Pull hwmon updates from Guenter Roeck:
      
       - Add driver for Intersil ISL68137 PWM Controller
      
       - Add driver for Lochnagar 2
      
       - Add driver for Infineon IR38064 Voltage Regulator
      
       - Add support for TMP75B to lm75 driver
      
       - Convert documentation to ReST format
      
       - Use request_muxed_region for Super-IO accesses in several drivers
      
       - Add 'samples' attribute to ABI, and start using it
      
       - Add support for custom sysfs attributes to pmbus drivers (used in
         ISL68137 driver)
      
       - Introduce HWMON_CHANNEL_INFO macro
      
       - Automated changes:
            - Use permission specific [SENSOR_][DEVICE_]ATTR variants
            - Fix build warnings due to unused of_device_id structures
            - Use HWMON_CHANNEL_INFO macro
      
       - Various minor improvements and fixes
      
      * tag 'hwmon-for-v5.2' of git://git.kernel.org/pub/scm/linux/kernel/git/groeck/linux-staging: (125 commits)
        hwmon: (lm75) Add support for TMP75B
        dt-bindings: hwmon: Add tmp75b to lm75.txt
        hwmon: (s3c) Use dev_get_drvdata()
        hwmon: (max6650) Drop call to thermal_cdev_update
        docs: hwmon: remove the extension from .rst files
        docs: hwmon: convert three docs to ReST format
        hwmon: (max6650) add thermal cooling device capability
        hwmon: (ina3221) Add voltage conversion time settings
        hwmon: (ina3221) Do not read-back to cache reg_config
        docs: hwmon: Add an index file and rename docs to *.rst
        docs: hwmon: convert remaining files to ReST format
        docs: hwmon: misc files: convert to ReST format
        docs: hwmon: pmbus files: convert to ReST format
        docs: hwmon: k8temp, w83793: convert to ReST format
        docs: hwmon: da9052, da9055: convert to ReST format
        docs: hwmon: wm831x, wm8350: convert to ReST format
        docs: hwmon: dme1737, vt1211: convert to ReST format
        docs: hwmon: ads1015: convert to ReST format
        docs: hwmon: asc7621: convert to ReST format
        docs: hwmon: ibmpowernv: convert to ReST format
        ...
      7aefd944
    • Linus Torvalds's avatar
      Merge branch 'ras-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · ffa6f55e
      Linus Torvalds authored
      Pull RAS updates from Borislav Petkov:
      
       - Support for varying MCA bank numbers per CPU: this is in preparation
         for future CPU enablement (Yazen Ghannam)
      
       - MCA banks read race fix (Tony Luck)
      
       - Facility to filter MCEs which should not be logged (Yazen Ghannam)
      
       - The usual round of cleanups and fixes
      
      * 'ras-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/MCE/AMD: Don't report L1 BTB MCA errors on some family 17h models
        x86/MCE: Add an MCE-record filtering function
        RAS/CEC: Increment cec_entered under the mutex lock
        x86/mce: Fix debugfs_simple_attr.cocci warnings
        x86/mce: Remove mce_report_event()
        x86/mce: Handle varying MCA bank counts
        x86/mce: Fix machine_check_poll() tests for error types
        MAINTAINERS: Fix file pattern for X86 MCE INFRASTRUCTURE
        x86/MCE: Group AMD function prototypes in <asm/mce.h>
      ffa6f55e
    • Linus Torvalds's avatar
      Merge tag 'edac_for_5.2' of git://git.kernel.org/pub/scm/linux/kernel/git/bp/bp · 275b103a
      Linus Torvalds authored
      Pull EDAC updates from Borislav Petkov:
      
       - amd64_edac: Family 0x17, models 0x30-.. enablement (Yazen Ghannam)
      
       - skx_*: Librarize it so that it can be shared between drivers (Qiuxu Zhuo)
      
       - altera: Stratix10 improvements (Thor Thayer)
      
       - The usual round of fixes, fixlets and cleanups
      
      * tag 'edac_for_5.2' of git://git.kernel.org/pub/scm/linux/kernel/git/bp/bp:
        Revert "EDAC/amd64: Support more than two controllers for chip select handling"
        arm64: dts: stratix10: Use new Stratix10 EDAC bindings
        Documentation: dt: edac: Add Stratix10 Peripheral bindings
        Documentation: dt: edac: Fix Stratix10 IRQ bindings
        EDAC/altera, firmware/intel: Add Stratix10 ECC DBE SMC call
        EDAC/altera: Initialize peripheral FIFOs in probe()
        EDAC/altera: Do less intrusive error injection
        EDAC/amd64: Adjust printed chip select sizes when interleaved
        EDAC/amd64: Support more than two controllers for chip select handling
        EDAC/amd64: Recognize x16 symbol size
        EDAC/amd64: Set maximum channel layer size depending on family
        EDAC/amd64: Support more than two Unified Memory Controllers
        EDAC/amd64: Use a macro for iterating over Unified Memory Controllers
        EDAC/amd64: Add Family 17h Model 30h PCI IDs
        MAINTAINERS: Add entry for EDAC-I10NM
        MAINTAINERS: Update entry for EDAC-SKYLAKE
        EDAC, altera: Fix S10 Double Bit Error Notification
        EDAC, skx, i10nm: Make skx_common.c a pure library
      275b103a
    • Linus Torvalds's avatar
      Merge tag 'devprop-5.2-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm · 4dd2ab9a
      Linus Torvalds authored
      Pull device properties framework updates from Rafael Wysocki:
       "These fix the handling of data nodes in the ACPI properties support
        code, add a new helper for endpoint lookup in property graphs and
        restore a comment inadvertently removed by one of the previous changes.
      
        Specifics:
      
         - Fix the handling of data nodes in the ACPI properties support code
           for devices with child devices and hierarchical _DSD properties
           (Pierre-Louis Bossart).
      
         - Add fwnode_graph_get_endpoint_by_id() helper for endpoint lookup in
           device property graphs (Sakari Ailus).
      
         - Restore the _DSD data subnodes GUID comment inadvertently removed
           by one of the previous changes (Shunyong Yang)"
      
      * tag 'devprop-5.2-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm:
        ACPI / property: fix handling of data_nodes in acpi_get_next_subnode()
        device property: Add fwnode_graph_get_endpoint_by_id()
        ACPI: property: restore _DSD data subnodes GUID comment
      4dd2ab9a