  1. Jun 26, 2015
  2. Jun 25, 2015
    • perf bench numa: Fix to show proper convergence stats · 2b42b09b
      Srikar Dronamraju authored
      
      
      Since commit e1e455f4 ("perf tools: Work around lack of sched_getcpu
      in glibc < 2.6"), 'perf bench numa mem' with the -c or -m option cannot
      correctly calculate convergence.
      
      With the above commit, sched_getcpu always seems to return -1. The
      intention of commit e1e455f4 was to add a sched_getcpu in glibc < 2.6.
      Hence keep the sched_getcpu definition under an ifdef.
      
      This regression occurred between v4.0 and v4.1.
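
The guard described in this message could look roughly like the following userspace sketch. `current_cpu()` and `NEED_GETCPU_STUB` are hypothetical names introduced for illustration, and the version check via glibc's `__GLIBC_PREREQ` macro is an assumption about how such an ifdef might be written, not a copy of the actual patch.

```c
#define _GNU_SOURCE
#include <errno.h>
#include <sched.h>

/*
 * Request the stub only when the C library is known to predate
 * sched_getcpu(). __GLIBC_PREREQ is provided by glibc's <features.h>;
 * if it is missing we assume the libc has a working sched_getcpu().
 */
#if defined(__GLIBC_PREREQ)
# if !__GLIBC_PREREQ(2, 6)
#  define NEED_GETCPU_STUB 1	/* hypothetical helper macro */
# endif
#endif

#ifdef NEED_GETCPU_STUB
/* Fallback for old glibc: report "not supported". */
static int current_cpu(void)
{
	errno = ENOSYS;
	return -1;
}
#else
/* Normal case: defer to the real libc implementation. */
static int current_cpu(void)
{
	return sched_getcpu();
}
#endif
```

Without a guard of this kind, an unconditional stub can shadow the real function, which would match the "always returns -1" symptom described above.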
      
      Signed-off-by: Srikar Dronamraju <srikar@linux.vnet.ibm.com>
      Acked-by: Ingo Molnar <mingo@kernel.org>
      Cc: Jiri Olsa <jolsa@kernel.org>
      Cc: Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com>
      Cc: Namhyung Kim <namhyung@kernel.org>
      Cc: Vinson Lee <vlee@twitter.com>
      Fixes:  e1e455f4 ("perf tools: Work around lack of sched_getcpu in glibc < 2.6")
      Link: http://lkml.kernel.org/r/20150624111004.GA5220@linux.vnet.ibm.com
      
      
      Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
    • Merge tag 'perf-core-for-mingo' of... · 27451700
      Ingo Molnar authored
      Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core
      
      Pull perf/core improvements and fixes from Arnaldo Carvalho de Melo:
      
      User visible changes:
      
        - Move toggling event logic from 'perf top' into the hists browser, allowing
          freeze/unfreeze with event lists with more than one entry (Namhyung Kim)
      
        - Add missing newlines when dumping PERF_RECORD_FINISHED_ROUND and
          showing the Aggregated stats in 'perf report -D' (Adrian Hunter)
      
      Infrastructure changes:
      
        - Allow auxtrace data alignment (Adrian Hunter)
      
        - Allow events with dot (Andi Kleen)
      
        - Fix failure to 'perf probe' events on arm (He Kuang)
      
        - Add testing for Makefile.perf (Jiri Olsa)
      
        - Add test for make install with prefix (Jiri Olsa)
      
        - Fix single target build dependency check (Jiri Olsa)
      
        - Access thread_map entries via accessors, prep patch to hold more info per
          entry, for ongoing 'perf stat --per-thread' work (Jiri Olsa)
      
        - Use __weak definition from compiler.h (Sukadev Bhattiprolu)
      
        - Split perf_pmu__new_alias() (Sukadev Bhattiprolu)
      
      Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
      Signed-off-by: Ingo Molnar <mingo@kernel.org>
  3. Jun 24, 2015
  4. Jun 23, 2015
    • Merge branch 'irq-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 407a2c72
      Linus Torvalds authored
      Pull irq updates from Thomas Gleixner:
       "The irq department delivers:
      
         - plug a potential race related to chained interrupt handlers
      
         - core updates which address the needs of the x86 irqdomain conversion
      
         - new irqchip callback to support affinity settings for VCPUs
      
         - the usual pile of updates to interrupt chip drivers
      
         - a few helper functions to allow further cleanups and
           simplifications
      
        I have a largish pile of coccinelle scripted/verified cleanups and
        simplifications pending on top of that, but I prefer to send that
        towards the end of the merge window when the arch/driver changes have
        hit your tree to avoid API change wreckage as far as possible"
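
The chained-handler race mentioned in the list above can be pictured with a small userspace model. This is a sketch under assumptions, not kernel code: `struct irq_slot`, `slot_set_handler_and_data()`, `slot_dispatch()` and `count_hit()` are hypothetical names, and a pthread mutex stands in for the descriptor lock. The idea shown is that the handler and its data are published in one critical section, so a dispatch can never observe a handler whose data pointer has not been set yet.

```c
#include <pthread.h>
#include <stddef.h>

typedef void (*flow_handler_t)(void *data);

/* Hypothetical stand-in for an interrupt descriptor. */
struct irq_slot {
	pthread_mutex_t lock;
	flow_handler_t handler;
	void *data;
};

/* Install handler and data together, under one lock. */
static void slot_set_handler_and_data(struct irq_slot *s,
				      flow_handler_t handler, void *data)
{
	pthread_mutex_lock(&s->lock);
	s->data = data;
	s->handler = handler;
	pthread_mutex_unlock(&s->lock);
}

/* Take a consistent snapshot of the pair, then call outside the lock. */
static void slot_dispatch(struct irq_slot *s)
{
	flow_handler_t handler;
	void *data;

	pthread_mutex_lock(&s->lock);
	handler = s->handler;
	data = s->data;
	pthread_mutex_unlock(&s->lock);

	if (handler)
		handler(data);
}

/* Example handler: counts invocations through its data pointer. */
static void count_hit(void *data)
{
	++*(int *)data;
}
```

With two separate setter calls, a dispatch landing between them could run the handler with a NULL data pointer; the combined setter closes that window in this model.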
      
      * 'irq-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (34 commits)
        genirq: Remove bogus restriction in irq_move_mask_irq()
        irqchip: atmel-aic5: Add sama5d2 support
        irq: spear-shirq: Fix race in installing chained IRQ handler
        irq: irq-keystone: Fix race in installing chained IRQ handler
        gpio: gpio-tegra: Fix race in installing chained IRQ handler
        gpio: gpio-mxs: Fix race in installing chained IRQ handler
        gpio: gpio-mxc: Fix race in installing chained IRQ handler
        ARM: gemini: Fix race in installing GPIO chained IRQ handler
        GPU: ipu: Fix race in installing IPU chained IRQ handler
        ARM: sa1100: convert SA11x0 related code to use new chained handler helper
        irq: Add irq_set_chained_handler_and_data()
        irqchip: exynos-combiner: Save IRQ enable set on suspend
        genirq: Introduce helper function irq_data_get_affinity_mask()
        genirq: Introduce helper function irq_data_get_node()
        genirq: Introduce struct irq_common_data to host shared irq data
        genirq: Prevent crash in irq_move_irq()
        genirq: Enhance irq_data_to_desc() to support hierarchy irqdomain
        irqchip: gic: Simplify gic_configure_irq by using IRQCHIP_SET_TYPE_MASKED
        irqchip: renesas: intc-irqpin: Improve binding documentation
        genirq: Set IRQCHIP_SKIP_SET_WAKE for no_irq_chip
        ...
    • Merge branch 'timers-nohz-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 3a95398f
      Linus Torvalds authored
      Pull NOHZ updates from Thomas Gleixner:
       "A few updates to the nohz infrastructure:
      
         - recursion protection for context tracking
      
         - make the TIF_NOHZ inheritance smarter
      
         - isolate cpus which belong to the NOHZ full set"
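
Recursion protection of the kind listed above is commonly built from a per-context depth counter. The following is a minimal userspace sketch, assuming a thread-local counter in place of a per-CPU one; `tracking_depth`, `tracking_enter()` and `tracking_exit()` are hypothetical names, not the kernel's.

```c
#include <stdbool.h>

/* One depth counter per thread (the kernel would use a per-CPU variable). */
static _Thread_local int tracking_depth;

/*
 * Returns true only for the outermost entry. A nested entry undoes its
 * increment and is refused, so the caller must skip the tracked work.
 */
static bool tracking_enter(void)
{
	if (++tracking_depth == 1)
		return true;

	--tracking_depth;
	return false;
}

/* Must be called once for every tracking_enter() that returned true. */
static void tracking_exit(void)
{
	--tracking_depth;
}
```

A caller would wrap the tracked section as `if (tracking_enter()) { ...; tracking_exit(); }`, so that code re-entering the same path from inside the section becomes a no-op instead of recursing.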
      
      * 'timers-nohz-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        nohz: Set isolcpus when nohz_full is set
        nohz: Add tick_nohz_full_add_cpus_to() API
        context_tracking: Inherit TIF_NOHZ through forks instead of context switches
        context_tracking: Protect against recursion
    • Merge branch 'timers-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 43224b96
      Linus Torvalds authored
      Pull timer updates from Thomas Gleixner:
       "A rather largish update for everything time and timer related:
      
         - Cache footprint optimizations for both hrtimers and timer wheel
      
         - Lower the NOHZ impact on systems which have NOHZ or timer migration
           disabled at runtime.
      
         - Optimize run time overhead of hrtimer interrupt by making the clock
           offset updates smarter
      
         - hrtimer cleanups and removal of restrictions to tackle some
           problems in sched/perf
      
         - Some more leap second tweaks
      
         - Another round of changes addressing the 2038 problem
      
         - First step to change the internals of clock event devices by
           introducing the necessary infrastructure
      
         - Allow constant folding for usecs/msecs_to_jiffies()
      
         - The usual pile of clockevent/clocksource driver updates
      
        The hrtimer changes contain updates to sched, perf and x86 as they
        depend on them plus changes all over the tree to cleanup API changes
        and redundant code, which got copied all over the place.  The y2038
        changes touch s390 to remove the last non 2038 safe code related to
        boot/persistent clock"
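
The "constant folding for usecs/msecs_to_jiffies()" item above refers to a general technique that can be sketched in userspace as follows. Everything here is an assumption for illustration: a tick rate of 250 per second (`SKETCH_HZ`), the names `ms_to_ticks()`, `ms_to_ticks_inline()` and `ms_to_ticks_slow()`, and the use of the GCC/Clang builtin `__builtin_constant_p()` to choose between an inlinable path and an out-of-line one.

```c
#define SKETCH_HZ		250UL	/* assumed tick rate */
#define SKETCH_MSEC_PER_SEC	1000UL
#define SKETCH_MSEC_PER_TICK	(SKETCH_MSEC_PER_SEC / SKETCH_HZ)

/* Round up to whole ticks; simple enough for the compiler to fold. */
static inline unsigned long ms_to_ticks_inline(unsigned int ms)
{
	return ((unsigned long)ms + SKETCH_MSEC_PER_TICK - 1) /
	       SKETCH_MSEC_PER_TICK;
}

/* Out-of-line version for values only known at run time. */
static __attribute__((noinline)) unsigned long ms_to_ticks_slow(unsigned int ms)
{
	return ms_to_ticks_inline(ms);
}

/*
 * With optimization enabled and a literal argument, the first branch is
 * taken and the call collapses to a compile-time constant. Without
 * optimization the builtin evaluates to 0 and the slow path runs; the
 * result is the same either way.
 */
static inline unsigned long ms_to_ticks(unsigned int ms)
{
	if (__builtin_constant_p(ms))
		return ms_to_ticks_inline(ms);

	return ms_to_ticks_slow(ms);
}
```

Under these assumptions `ms_to_ticks(10)` yields 3 ticks, since 10 ms rounded up to 4 ms ticks is 3.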
      
      * 'timers-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (114 commits)
        clocksource: Increase dependencies of timer-stm32 to limit build wreckage
        timer: Minimize nohz off overhead
        timer: Reduce timer migration overhead if disabled
        timer: Stats: Simplify the flags handling
        timer: Replace timer base by a cpu index
        timer: Use hlist for the timer wheel hash buckets
        timer: Remove FIFO "guarantee"
        timers: Sanitize catchup_timer_jiffies() usage
        hrtimer: Allow hrtimer::function() to free the timer
        seqcount: Introduce raw_write_seqcount_barrier()
        seqcount: Rename write_seqcount_barrier()
        hrtimer: Fix hrtimer_is_queued() hole
        hrtimer: Remove HRTIMER_STATE_MIGRATE
        selftest: Timers: Avoid signal deadlock in leap-a-day
        timekeeping: Copy the shadow-timekeeper over the real timekeeper last
        clockevents: Check state instead of mode in suspend/resume path
        selftests: timers: Add leap-second timer edge testing to leap-a-day.c
        ntp: Do leapsecond adjustment in adjtimex read path
        time: Prevent early expiry of hrtimers[CLOCK_REALTIME] at the leap second edge
        ntp: Introduce and use SECS_PER_DAY macro instead of 86400
        ...
    • Merge branch 'x86-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · d70b3ef5
      Linus Torvalds authored
      Pull x86 core updates from Ingo Molnar:
       "There were so many changes in the x86/asm, x86/apic and x86/mm topics
        in this cycle that the topical separation of -tip broke down somewhat -
        so the result is a more traditional architecture pull request,
        collected into the 'x86/core' topic.
      
        The topics were still maintained separately as far as possible, so
        bisectability and conceptual separation should still be pretty good -
        but there were a handful of merge points to avoid excessive
        dependencies (and conflicts) that would have been poorly tested in the
        end.
      
        The next cycle will hopefully be much more quiet (or at least will
        have fewer dependencies).
      
        The main changes in this cycle were:
      
         * x86/apic changes, with related IRQ core changes: (Jiang Liu, Thomas
           Gleixner)
      
           - This is the second and most intrusive part of changes to the x86
             interrupt handling - full conversion to hierarchical interrupt
             domains:
      
                [IOAPIC domain]   -----
                                       |
                [MSI domain]      --------[Remapping domain] ----- [ Vector domain ]
                                       |   (optional)          |
                [HPET MSI domain] -----                        |
                                                               |
                [DMAR domain]     -----------------------------
                                                               |
                [Legacy domain]   -----------------------------
      
             This now reflects the actual hardware and allowed us to disentangle
             the domain specific code from the underlying parent domain, which
             can be optional in the case of interrupt remapping.  It's a clear
             separation of functionality and removes quite some duct tape
             constructs which plugged the remap code between ioapic/msi/hpet
             and the vector management.
      
           - Intel IOMMU IRQ remapping enhancements, to allow direct interrupt
             injection into guests (Feng Wu)
      
         * x86/asm changes:
      
           - Tons of cleanups and small speedups, micro-optimizations.  This
             is in preparation to move a good chunk of the low level entry
             code from assembly to C code (Denys Vlasenko, Andy Lutomirski,
             Brian Gerst)
      
           - Moved all system entry related code to a new home under
             arch/x86/entry/ (Ingo Molnar)
      
           - Removal of the fragile and ugly CFI dwarf debuginfo annotations.
             Conversion to C will reintroduce many of them - but meanwhile
             they are only getting in the way, and the upstream kernel does
             not rely on them (Ingo Molnar)
      
           - NOP handling refinements. (Borislav Petkov)
      
         * x86/mm changes:
      
           - Big PAT and MTRR rework: making the code more robust and
             preparing to phase out exposing direct MTRR interfaces to drivers -
             in favor of using PAT driven interfaces (Toshi Kani, Luis R
             Rodriguez, Borislav Petkov)
      
           - New ioremap_wt()/set_memory_wt() interfaces to support
             Write-Through cached memory mappings.  This is especially
             important for good performance on NVDIMM hardware (Toshi Kani)
      
         * x86/ras changes:
      
           - Add support for deferred errors on AMD (Aravind Gopalakrishnan)
      
             This is an important RAS feature which adds hardware support for
             poisoned data.  That means roughly that the hardware marks data
             which it has detected as corrupted but wasn't able to correct, as
             poisoned data and raises an APIC interrupt to signal that in the
             form of a deferred error.  It is the OS's responsibility then to
             take proper recovery action and thus prolong system lifetime as
             far as possible.
      
           - Add support for Intel "Local MCE"s: upcoming CPUs will support
             CPU-local MCE interrupts, as opposed to the traditional
             system-wide broadcast MCE interrupts (Ashok Raj)
      
           - Misc cleanups (Borislav Petkov)
      
         * x86/platform changes:
      
           - Intel Atom SoC updates
      
        ... and lots of other cleanups, fixlets and other changes - see the
        shortlog and the Git log for details"
      
      * 'x86-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (222 commits)
        x86/hpet: Use proper hpet device number for MSI allocation
        x86/hpet: Check for irq==0 when allocating hpet MSI interrupts
        x86/mm/pat, drivers/infiniband/ipath: Use arch_phys_wc_add() and require PAT disabled
        x86/mm/pat, drivers/media/ivtv: Use arch_phys_wc_add() and require PAT disabled
        x86/platform/intel/baytrail: Add comments about why we disabled HPET on Baytrail
        genirq: Prevent crash in irq_move_irq()
        genirq: Enhance irq_data_to_desc() to support hierarchy irqdomain
        iommu, x86: Properly handle posted interrupts for IOMMU hotplug
        iommu, x86: Provide irq_remapping_cap() interface
        iommu, x86: Setup Posted-Interrupts capability for Intel iommu
        iommu, x86: Add cap_pi_support() to detect VT-d PI capability
        iommu, x86: Avoid migrating VT-d posted interrupts
        iommu, x86: Save the mode (posted or remapped) of an IRTE
        iommu, x86: Implement irq_set_vcpu_affinity for intel_ir_chip
        iommu: dmar: Provide helper to copy shared irte fields
        iommu: dmar: Extend struct irte for VT-d Posted-Interrupts
        iommu: Add new member capability to struct irq_remap_ops
        x86/asm/entry/64: Disentangle error_entry/exit gsbase/ebx/usermode code
        x86/asm/entry/32: Shorten __audit_syscall_entry() args preparation
        x86/asm/entry/32: Explain reloading of registers after __audit_syscall_entry()
        ...
    • Merge branch 'x86-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 650ec5a6
      Linus Torvalds authored
      Pull x86 warning fixlet from Ingo Molnar:
       "A build fix for certain (rare) variants of binutils that did not make
        it into v4.1"
      
      * 'x86-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/boot: Fix overflow warning with 32-bit binutils
    • Merge branch 'x86-microcode-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 35ffccdb
      Linus Torvalds authored
      Pull x86 microcode updates from Ingo Molnar:
       "x86 microcode loader updates from Borislav Petkov:
      
         - early parsing of the built-in microcode
      
         - cleanups
      
         - misc smaller fixes"
      
      * 'x86-microcode-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/microcode: Correct CPU family related variable types
        x86/microcode: Disable builtin microcode loading on 32-bit for now
        x86/microcode/intel: Rename get_matching_sig()
        x86/microcode/intel: Simplify get_matching_sig()
        x86/microcode/intel: Simplify update_match_cpu()
        x86/microcode/intel: Rename get_matching_microcode
        x86/cpu/microcode: Zap changelog
        x86/microcode: Parse built-in microcode early
        x86/microcode/intel: Remove unused @rev arg of get_matching_sig()
        x86/microcode/intel: Get rid of revision_is_newer()
    • Merge branch 'x86-kdump-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · e2172d8f
      Linus Torvalds authored
      Pull x86 kdump updates from Ingo Molnar:
       "Three kdump robustness related improvements (Joerg Roedel)"
      
      * 'x86-kdump-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/crash: Allocate enough low memory when crashkernel=high
        x86/swiotlb: Try coherent allocations with __GFP_NOWARN
        swiotlb: Warn on allocation failure in swiotlb_alloc_coherent()
    • Merge branch 'x86-fpu-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · e75c73ad
      Linus Torvalds authored
      Pull x86 FPU updates from Ingo Molnar:
       "This tree contains two main changes:
      
         - The big FPU code rewrite: wide reaching cleanups and reorganization
           that pulls all the FPU code together into a clean base in
           arch/x86/fpu/.
      
           The resulting code is leaner and faster, and much easier to
           understand.  This enables future work to further simplify the FPU
           code (such as removing lazy FPU restores).
      
           By its nature these changes have a substantial regression risk: FPU
           code related bugs are long lived, because races are often subtle
           and bugs mask as user-space failures that are difficult to track
           back to kernel-side bugs.  I'm aware of no unfixed (or even
           suspected) FPU related regression so far.
      
         - MPX support rework/fixes.  As this is still not a released CPU
           feature, there were some buglets in the code - should be much more
           robust now (Dave Hansen)"
      
      * 'x86-fpu-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (250 commits)
        x86/fpu: Fix double-increment in setup_xstate_features()
        x86/mpx: Allow 32-bit binaries on 64-bit kernels again
        x86/mpx: Do not count MPX VMAs as neighbors when unmapping
        x86/mpx: Rewrite the unmap code
        x86/mpx: Support 32-bit binaries on 64-bit kernels
        x86/mpx: Use 32-bit-only cmpxchg() for 32-bit apps
        x86/mpx: Introduce new 'directory entry' to 'addr' helper function
        x86/mpx: Add temporary variable to reduce masking
        x86: Make is_64bit_mm() widely available
        x86/mpx: Trace allocation of new bounds tables
        x86/mpx: Trace the attempts to find bounds tables
        x86/mpx: Trace entry to bounds exception paths
        x86/mpx: Trace #BR exceptions
        x86/mpx: Introduce a boot-time disable flag
        x86/mpx: Restrict the mmap() size check to bounds tables
        x86/mpx: Remove redundant MPX_BNDCFG_ADDR_MASK
        x86/mpx: Clean up the code by not passing a task pointer around when unnecessary
        x86/mpx: Use the new get_xsave_field_ptr() API
        x86/fpu/xstate: Wrap get_xsave_addr() to make it safer
        x86/fpu/xstate: Fix up bad get_xsave_addr() assumptions
        ...