1. 13 10月, 2012 26 次提交
    • L
      Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net · 98260daa
      Linus Torvalds 提交于
      Pull networking updates from David Miller:
      
       1) Alexey Kuznetsov noticed we routed TCP resets improperly in the
          assymetric routing case, fix this by reverting a change that made us
          use the incoming interface in the outgoing route key when we didn't
          have a socket context to work with.
      
       2) TCP sysctl kernel memory leakage to userspace fix from Alan Cox.
      
       3) Move UAPI bits from David Howells, WIMAX and CAN this time.
      
       4) Fix TX stalls in e1000e wrt.  Byte Queue Limits, from Hiroaki
          SHIMODA, Denys Fedoryshchenko, and Jesse Brandeburg.
      
       5) Fix IPV6 crashes in packet generator module, from Amerigo Wang.
      
       6) Tidies and fixes in the new VXLAN driver from Stephen Hemminger.
      
       7) Bridge IP options parse doesn't check first if SKB header has at
          least an IP header's worth of content present.  Fix from Sarveshwar
          Bandi.
      
       8) The kernel now generates compound pages on transmit and the Xen
          netback drivers needs some adjustments in order to handle this.  Fix
          from Ian Campbell.
      
       9) Turn off ASPM in JME driver, from Kevin Bardon and Matthew Garrett.
      
      * git://git.kernel.org/pub/scm/linux/kernel/git/davem/net: (43 commits)
        mcs7830: Fix link state detection
        net: add doc for in4_pton()
        net: add doc for in6_pton()
        vti: fix sparse bit endian warnings
        tcp: resets are misrouted
        usbnet: Support devices reporting idleness
        Add CDC-ACM support for the CX93010-2x UCMxx USB Modem
        net/ethernet/jme: disable ASPM
        tcp: sysctl interface leaks 16 bytes of kernel memory
        kaweth: print correct debug ptr
        e1000e: Change wthresh to 1 to avoid possible Tx stalls
        ipv4: fix route mark sparse warning
        xen: netback: handle compound page fragments on transmit.
        bridge: Pull ip header into skb->data before looking into ip header.
        isdn: fix a wrapping bug in isdn_ppp_ioctl()
        vxlan: fix oops when give unknown ifindex
        vxlan: fix receive checksum handling
        vxlan: add additional headroom
        vxlan: allow configuring port range
        vxlan: associate with tunnel socket on transmit
        ...
      98260daa
    • L
      Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/sparc · e225ca27
      Linus Torvalds 提交于
      Pull Sparc updates from David Miller:
      
       1) Updated syscall tracing fix from Al Viro.
      
       2) SUN4V error reporting was deficient in several areas.
      
      * git://git.kernel.org/pub/scm/linux/kernel/git/davem/sparc:
        sparc64: fix ptrace interaction with force_successful_syscall_return()
        sparc64: Fix deficiencies in sun4v error reporting.
      e225ca27
    • L
      Merge git://git.kernel.org/pub/scm/linux/kernel/git/cmetcalf/linux-tile · 54f7fc25
      Linus Torvalds 提交于
      Pull tile arch update from Chris Metcalf:
       "The bulk of this change is the tile uapi disintegration.  There is
        also a one-line change in here to enable interrupts in
        do_work_pending() to avoid a WARN_ON in _local_bh_enable_ip()."
      
      * git://git.kernel.org/pub/scm/linux/kernel/git/cmetcalf/linux-tile:
        arch/tile: enable interrupts in do_work_pending()
        UAPI: (Scripted) Disintegrate arch/tile/include/asm
        UAPI: (Scripted) Disintegrate arch/tile/include/arch
      54f7fc25
    • L
      Merge tag 'please-pull-dhowells-uapi' of git://git.kernel.org/pub/scm/linux/kernel/git/aegl/linux · 0c4a479b
      Linus Torvalds 提交于
      Pull ia64 UAPI changes from Tony Luck:
       "ia64 pieces of David Howells great UAPI include file move"
      
      * tag 'please-pull-dhowells-uapi' of git://git.kernel.org/pub/scm/linux/kernel/git/aegl/linux:
        UAPI: (Scripted) Disintegrate arch/ia64/include/asm
      0c4a479b
    • L
      Merge branch 'perf-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · ade0899b
      Linus Torvalds 提交于
      Pull perf updates from Ingo Molnar:
       "This tree includes some late late perf items that missed the first
        round:
      
        tools:
      
         - Bash auto completion improvements, now we can auto complete the
           tools long options, tracepoint event names, etc, from Namhyung Kim.
      
         - Look up thread using tid instead of pid in 'perf sched'.
      
         - Move global variables into a perf_kvm struct, from David Ahern.
      
         - Hists refactorings, preparatory for improved 'diff' command, from
           Jiri Olsa.
      
         - Hists refactorings, preparatory for event group viewieng work, from
           Namhyung Kim.
      
         - Remove double negation on optional feature macro definitions, from
           Namhyung Kim.
      
         - Remove several cases of needless global variables, on most
           builtins.
      
         - misc fixes
      
        kernel:
      
         - sysfs support for IBS on AMD CPUs, from Robert Richter.
      
         - Support for an upcoming Intel CPU, the Xeon-Phi / Knights Corner
           HPC blade PMU, from Vince Weaver.
      
         - misc fixes"
      
      * 'perf-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (46 commits)
        perf: Fix perf_cgroup_switch for sw-events
        perf: Clarify perf_cpu_context::active_pmu usage by renaming it to ::unique_pmu
        perf/AMD/IBS: Add sysfs support
        perf hists: Add more helpers for hist entry stat
        perf hists: Move he->stat.nr_events initialization to a template
        perf hists: Introduce struct he_stat
        perf diff: Removing the total_period argument from output code
        perf tool: Add hpp interface to enable/disable hpp column
        perf tools: Removing hists pair argument from output path
        perf hists: Separate overhead and baseline columns
        perf diff: Refactor diff displacement possition info
        perf hists: Add struct hists pointer to struct hist_entry
        perf tools: Complete tracepoint event names
        perf/x86: Add support for Intel Xeon-Phi Knights Corner PMU
        perf evlist: Remove some unused methods
        perf evlist: Introduce add_newtp method
        perf kvm: Move global variables into a perf_kvm struct
        perf tools: Convert to BACKTRACE_SUPPORT
        perf tools: Long option completion support for each subcommands
        perf tools: Complete long option names of perf command
        ...
      ade0899b
    • M
      perf: Handle new rbtree implementation · 871a0596
      Markus Trippelsdorf 提交于
      Perf build fails with the new rbtree implementation:
      
        ../../lib/rbtree.c:24:36: fatal error: linux/rbtree_augmented.h: No such file or directory compilation terminated.
      
      Fix by updating the Makefile and adding a btree_augmented.h
      wrapper.
      Reported-and-tested-by: NBorislav Petkov <borislav.petkov@amd.com>
      Signed-off-by: NMarkus Trippelsdorf <markus@trippelsdorf.de>
      Cc: Arnaldo Carvalho de Melo <acme@infradead.org>
      Cc: Borislav Petkov <bp@amd64.org>
      Link: http://lkml.kernel.org/r/20121009180156.GA245@x4Signed-off-by: NIngo Molnar <mingo@kernel.org>
      Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      871a0596
    • L
      Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/signal · 4e21fc13
      Linus Torvalds 提交于
      Pull third pile of kernel_execve() patches from Al Viro:
       "The last bits of infrastructure for kernel_thread() et.al., with
        alpha/arm/x86 use of those.  Plus sanitizing the asm glue and
        do_notify_resume() on alpha, fixing the "disabled irq while running
        task_work stuff" breakage there.
      
        At that point the rest of kernel_thread/kernel_execve/sys_execve work
        can be done independently for different architectures.  The only
        pending bits that do depend on having all architectures converted are
        restrictred to fs/* and kernel/* - that'll obviously have to wait for
        the next cycle.
      
        I thought we'd have to wait for all of them done before we start
        eliminating the longjump-style insanity in kernel_execve(), but it
        turned out there's a very simple way to do that without flagday-style
        changes."
      
      * 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/signal:
        alpha: switch to saner kernel_execve() semantics
        arm: switch to saner kernel_execve() semantics
        x86, um: convert to saner kernel_execve() semantics
        infrastructure for saner ret_from_kernel_thread semantics
        make sure that kernel_thread() callbacks call do_exit() themselves
        make sure that we always have a return path from kernel_execve()
        ppc: eeh_event should just use kthread_run()
        don't bother with kernel_thread/kernel_execve for launching linuxrc
        alpha: get rid of switch_stack argument of do_work_pending()
        alpha: don't bother passing switch_stack separately from regs
        alpha: take SIGPENDING/NOTIFY_RESUME loop into signal.c
        alpha: simplify TIF_NEED_RESCHED handling
      4e21fc13
    • L
      Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs · 8418263e
      Linus Torvalds 提交于
      Pull third pile of VFS updates from Al Viro:
       "Stuff from Jeff Layton, mostly.  Sanitizing interplay between audit
        and namei, removing a lot of insanity from audit_inode() mess and
        getting things ready for his ESTALE patchset."
      
      * 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs:
        procfs: don't need a PATH_MAX allocation to hold a string representation of an int
        vfs: embed struct filename inside of names_cache allocation if possible
        audit: make audit_inode take struct filename
        vfs: make path_openat take a struct filename pointer
        vfs: turn do_path_lookup into wrapper around struct filename variant
        audit: allow audit code to satisfy getname requests from its names_list
        vfs: define struct filename and have getname() return it
        vfs: unexport getname and putname symbols
        acct: constify the name arg to acct_on
        vfs: allocate page instead of names_cache buffer in mount_block_root
        audit: overhaul __audit_inode_child to accomodate retrying
        audit: optimize audit_compare_dname_path
        audit: make audit_compare_dname_path use parent_len helper
        audit: remove dirlen argument to audit_compare_dname_path
        audit: set the name_len in audit_inode for parent lookups
        audit: add a new "type" field to audit_names struct
        audit: reverse arguments to audit_inode_child
        audit: no need to walk list in audit_inode if name is NULL
        audit: pass in dentry to audit_copy_inode wherever possible
        audit: remove unnecessary NULL ptr checks from do_path_lookup
      8418263e
    • J
    • J
      vfs: embed struct filename inside of names_cache allocation if possible · 7950e385
      Jeff Layton 提交于
      In the common case where a name is much smaller than PATH_MAX, an extra
      allocation for struct filename is unnecessary. Before allocating a
      separate one, try to embed the struct filename inside the buffer first. If
      it turns out that that's not long enough, then fall back to allocating a
      separate struct filename and redoing the copy.
      Signed-off-by: NJeff Layton <jlayton@redhat.com>
      Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
      7950e385
    • J
      audit: make audit_inode take struct filename · adb5c247
      Jeff Layton 提交于
      Keep a pointer to the audit_names "slot" in struct filename.
      
      Have all of the audit_inode callers pass a struct filename ponter to
      audit_inode instead of a string pointer. If the aname field is already
      populated, then we can skip walking the list altogether and just use it
      directly.
      Signed-off-by: NJeff Layton <jlayton@redhat.com>
      Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
      adb5c247
    • J
      vfs: make path_openat take a struct filename pointer · 669abf4e
      Jeff Layton 提交于
      ...and fix up the callers. For do_file_open_root, just declare a
      struct filename on the stack and fill out the .name field. For
      do_filp_open, make it also take a struct filename pointer, and fix up its
      callers to call it appropriately.
      
      For filp_open, add a variant that takes a struct filename pointer and turn
      filp_open into a wrapper around it.
      Signed-off-by: NJeff Layton <jlayton@redhat.com>
      Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
      669abf4e
    • J
      vfs: turn do_path_lookup into wrapper around struct filename variant · 873f1eed
      Jeff Layton 提交于
      ...and make the user_path callers use that variant instead.
      Signed-off-by: NJeff Layton <jlayton@redhat.com>
      Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
      873f1eed
    • J
      audit: allow audit code to satisfy getname requests from its names_list · 7ac86265
      Jeff Layton 提交于
      Currently, if we call getname() on a userland string more than once,
      we'll get multiple copies of the string and multiple audit_names
      records.
      
      Add a function that will allow the audit_names code to satisfy getname
      requests using info from the audit_names list, avoiding a new allocation
      and audit_names records.
      Signed-off-by: NJeff Layton <jlayton@redhat.com>
      Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
      7ac86265
    • J
      vfs: define struct filename and have getname() return it · 91a27b2a
      Jeff Layton 提交于
      getname() is intended to copy pathname strings from userspace into a
      kernel buffer. The result is just a string in kernel space. It would
      however be quite helpful to be able to attach some ancillary info to
      the string.
      
      For instance, we could attach some audit-related info to reduce the
      amount of audit-related processing needed. When auditing is enabled,
      we could also call getname() on the string more than once and not
      need to recopy it from userspace.
      
      This patchset converts the getname()/putname() interfaces to return
      a struct instead of a string. For now, the struct just tracks the
      string in kernel space and the original userland pointer for it.
      
      Later, we'll add other information to the struct as it becomes
      convenient.
      Signed-off-by: NJeff Layton <jlayton@redhat.com>
      Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
      91a27b2a
    • C
      arch/tile: enable interrupts in do_work_pending() · c19c6c95
      Chris Metcalf 提交于
      All the called functions expect interrupts to be enabled, and
      now one of them has started to warn about it, so make it correct.
      Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>
      c19c6c95
    • C
      Merge tag 'tags/disintegrate-tile-20121009' into for-linus · d20f2f83
      Chris Metcalf 提交于
      UAPI Disintegration 2012-10-09
      d20f2f83
    • O
      mcs7830: Fix link state detection · dabdaf0c
      Ondrej Zary 提交于
      The device had an undocumented "feature": it can provide a sequence of
      spurious link-down status data even if the link is up all the time.
      A sequence of 10 was seen so update the link state only after the device
      reports the same link state 20 times.
      Signed-off-by: NOndrej Zary <linux@rainbow-software.org>
      Reported-by: NMichael Leun <lkml20120218@newton.leun.net>
      Tested-by: NMichael Leun <lkml20120218@newton.leun.net>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      dabdaf0c
    • A
      net: add doc for in4_pton() · 93ac0ef0
      Amerigo Wang 提交于
      It is not easy to use in4_pton() correctly without reading
      its definition, so add some doc for it.
      
      Cc: David S. Miller <davem@davemloft.net>
      Signed-off-by: NCong Wang <amwang@redhat.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      93ac0ef0
    • A
      net: add doc for in6_pton() · 28194fcd
      Amerigo Wang 提交于
      It is not easy to use in6_pton() correctly without reading
      its definition, so add some doc for it.
      
      Cc: David S. Miller <davem@davemloft.net>
      Signed-off-by: NCong Wang <amwang@redhat.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      28194fcd
    • S
      vti: fix sparse bit endian warnings · 8437e761
      stephen hemminger 提交于
      Use be32_to_cpu instead of htonl to keep sparse happy.
      Signed-off-by: NStephen Hemminger <shemminger@vyatta.org>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      8437e761
    • A
      tcp: resets are misrouted · 4c675258
      Alexey Kuznetsov 提交于
      After commit e2446eaa ("tcp_v4_send_reset: binding oif to iif in no
      sock case").. tcp resets are always lost, when routing is asymmetric.
      Yes, backing out that patch will result in misrouting of resets for
      dead connections which used interface binding when were alive, but we
      actually cannot do anything here.  What's died that's died and correct
      handling normal unbound connections is obviously a priority.
      
      Comment to comment:
      > This has few benefits:
      >   1. tcp_v6_send_reset already did that.
      
      It was done to route resets for IPv6 link local addresses. It was a
      mistake to do so for global addresses. The patch fixes this as well.
      
      Actually, the problem appears to be even more serious than guaranteed
      loss of resets.  As reported by Sergey Soloviev <sol@eqv.ru>, those
      misrouted resets create a lot of arp traffic and huge amount of
      unresolved arp entires putting down to knees NAT firewalls which use
      asymmetric routing.
      Signed-off-by: NAlexey Kuznetsov <kuznet@ms2.inr.ac.ru>
      4c675258
    • A
      alpha: switch to saner kernel_execve() semantics · 5522be6a
      Al Viro 提交于
      Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
      5522be6a
    • A
      arm: switch to saner kernel_execve() semantics · 9fff2fa0
      Al Viro 提交于
      Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
      9fff2fa0
    • A
      x86, um: convert to saner kernel_execve() semantics · 22e2430d
      Al Viro 提交于
      Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
      22e2430d
    • A
      infrastructure for saner ret_from_kernel_thread semantics · a74fb73c
      Al Viro 提交于
      * allow kernel_execve() leave the actual return to userland to
      caller (selected by CONFIG_GENERIC_KERNEL_EXECVE).  Callers
      updated accordingly.
      * architecture that does select GENERIC_KERNEL_EXECVE in its
      Kconfig should have its ret_from_kernel_thread() do this:
      	call schedule_tail
      	call the callback left for it by copy_thread(); if it ever
      returns, that's because it has just done successful kernel_execve()
      	jump to return from syscall
      IOW, its only difference from ret_from_fork() is that it does call the
      callback.
      * such an architecture should also get rid of ret_from_kernel_execve()
      and __ARCH_WANT_KERNEL_EXECVE
      
      This is the last part of infrastructure patches in that area - from
      that point on work on different architectures can live independently.
      Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
      a74fb73c
  2. 12 10月, 2012 14 次提交
    • L
      Merge tag 'stable/for-linus-3.7-rc0-tag' of... · ccff9b1d
      Linus Torvalds 提交于
      Merge tag 'stable/for-linus-3.7-rc0-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/konrad/xen
      
      Pull Xen fixes from Konrad Rzeszutek Wilk:
       "This has four bug-fixes and one tiny feature that I forgot to put
        initially in my tree due to oversight.
      
        The feature is for kdump kernels to speed up the /proc/vmcore reading.
        There is a ram_is_pfn helper function that the different platforms can
        register for.  We are now doing that.
      
        The bug-fixes cover some embarrassing struct pv_cpu_ops variables
        being set to NULL on Xen (but not baremetal).  We had a similar issue
        in the past with {write|read}_msr_safe and this fills the three
        missing ones.  The other bug-fix is to make the console output (hvc)
        be capable of dealing with misbehaving backends and not fall flat on
        its face.  Lastly, a quirk for older XenBus implementations that came
        with an ancient v3.4 hypervisor (so RHEL5 based) - reading of certain
        non-existent attributes just hangs the guest during bootup - so we
        take precaution of not doing that on such older installations.
      
        Feature:
         - Register a pfn_is_ram helper to speed up reading of /proc/vmcore.
        Bug-fixes:
         - Three pvops call for Xen were undefined causing BUG_ONs.
         - Add a quirk so that the shutdown watches (used by kdump) are not
           used with older Xen (3.4).
         - Fix ungraceful state transition for the HVC console."
      
      * tag 'stable/for-linus-3.7-rc0-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/konrad/xen:
        xen/pv-on-hvm kexec: add quirk for Xen 3.4 and shutdown watches.
        xen/bootup: allow {read|write}_cr8 pvops call.
        xen/bootup: allow read_tscp call for Xen PV guests.
        xen pv-on-hvm: add pfn_is_ram helper for kdump
        xen/hvc: handle backend CLOSED without CLOSING
      ccff9b1d
    • L
      Merge branch 'slab/urgent' of git://git.kernel.org/pub/scm/linux/kernel/git/penberg/linux · 3dc329ba
      Linus Torvalds 提交于
      Pull SLAB fix from Pekka Enberg:
       "This contains a lockdep false positive fix from Jiri Kosina I missed
        from the previous pull request."
      
      * 'slab/urgent' of git://git.kernel.org/pub/scm/linux/kernel/git/penberg/linux:
        mm, slab: release slab_mutex earlier in kmem_cache_destroy()
      3dc329ba
    • L
      Merge branch 'timers-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 03d3602a
      Linus Torvalds 提交于
      Pull timer core update from Thomas Gleixner:
       - Bug fixes (one for a longstanding dead loop issue)
       - Rework of time related vsyscalls
       - Alarm timer updates
       - Jiffies updates to remove compile time dependencies
      
      * 'timers-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        timekeeping: Cast raw_interval to u64 to avoid shift overflow
        timers: Fix endless looping between cascade() and internal_add_timer()
        time/jiffies: bring back unconditional LATCH definition
        time: Convert x86_64 to using new update_vsyscall
        time: Only do nanosecond rounding on GENERIC_TIME_VSYSCALL_OLD systems
        time: Introduce new GENERIC_TIME_VSYSCALL
        time: Convert CONFIG_GENERIC_TIME_VSYSCALL to CONFIG_GENERIC_TIME_VSYSCALL_OLD
        time: Move update_vsyscall definitions to timekeeper_internal.h
        time: Move timekeeper structure to timekeeper_internal.h for vsyscall changes
        jiffies: Remove compile time assumptions about CLOCK_TICK_RATE
        jiffies: Kill unused TICK_USEC_TO_NSEC
        alarmtimer: Rename alarmtimer_remove to alarmtimer_dequeue
        alarmtimer: Remove unused helpers & defines
        alarmtimer: Use hrtimer per-alarm instead of per-base
        alarmtimer: Implement minimum alarm interval for allowing suspend
      03d3602a
    • L
      Merge branch 'sched-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 0588f1f9
      Linus Torvalds 提交于
      Pull scheduler fixes from Ingo Molnar:
       "A CPU hotplug related crash fix and a nohz accounting fixlet."
      
      * 'sched-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        sched: Update sched_domains_numa_masks[][] when new cpus are onlined
        sched: Ensure 'sched_domains_numa_levels' is safe to use in other functions
        nohz: Fix one jiffy count too far in idle cputime
      0588f1f9
    • L
      Merge branch 'core-rcu-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 9d55ab71
      Linus Torvalds 提交于
      Pull RCU fixes from Ingo Molnar:
       "This tree includes a shutdown/cpu-hotplug deadlock fix and a
        documentation fix."
      
      * 'core-rcu-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        rcu: Advise most users not to enable RCU user mode
        rcu: Grace-period initialization excludes only RCU notifier
      9d55ab71
    • K
      xen/pv-on-hvm kexec: add quirk for Xen 3.4 and shutdown watches. · cb6b6df1
      Konrad Rzeszutek Wilk 提交于
      The commit 254d1a3f, titled
      "xen/pv-on-hvm kexec: shutdown watches from old kernel" assumes that the
      XenBus backend can deal with reading of values from:
       "control/platform-feature-xs_reset_watches":
      
          ... a patch for xenstored is required so that it
          accepts the XS_RESET_WATCHES request from a client (see changeset
          23839:42a45baf037d in xen-unstable.hg). Without the patch for xenstored
          the registration of watches will fail and some features of a PVonHVM
          guest are not available. The guest is still able to boot, but repeated
          kexec boots will fail."
      
      Sadly this is not true when using a Xen 3.4 hypervisor and booting a PVHVM
      guest. We end up hanging at:
      
        err = xenbus_scanf(XBT_NIL, "control",
                              "platform-feature-xs_reset_watches", "%d", &supported);
      
      This can easily be seen with guests hanging at xenbus_init:
      
      NX (Execute Disable) protection: active
      SMBIOS 2.4 present.
      DMI: Xen HVM domU, BIOS 3.4.0 05/13/2011
      Hypervisor detected: Xen HVM
      Xen version 3.4.
      Xen Platform PCI: I/O protocol version 1
      ... snip ..
      calling  xenbus_init+0x0/0x27e @ 1
      
      Reverting the commit or using the attached patch fixes the issue. This fix
      checks whether the hypervisor is older than 4.0 and if so does not try to
      perform the read.
      
      Fixes-Oracle-Bug: 14708233
      CC: stable@vger.kernel.org
      Acked-by: NOlaf Hering <olaf@aepfle.de>
      [v2: Added a comment in the source code]
      Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
      cb6b6df1
    • K
      xen/bootup: allow {read|write}_cr8 pvops call. · 1a7bbda5
      Konrad Rzeszutek Wilk 提交于
      We actually do not do anything about it. Just return a default
      value of zero and if the kernel tries to write anything but 0
      we BUG_ON.
      
      This fixes the case when an user tries to suspend the machine
      and it blows up in save_processor_state b/c 'read_cr8' is set
      to NULL and we get:
      
      kernel BUG at /home/konrad/ssd/linux/arch/x86/include/asm/paravirt.h:100!
      invalid opcode: 0000 [#1] SMP
      Pid: 2687, comm: init.late Tainted: G           O 3.6.0upstream-00002-gac264ac-dirty #4 Bochs Bochs
      RIP: e030:[<ffffffff814d5f42>]  [<ffffffff814d5f42>] save_processor_state+0x212/0x270
      
      .. snip..
      Call Trace:
       [<ffffffff810733bf>] do_suspend_lowlevel+0xf/0xac
       [<ffffffff8107330c>] ? x86_acpi_suspend_lowlevel+0x10c/0x150
       [<ffffffff81342ee2>] acpi_suspend_enter+0x57/0xd5
      
      CC: stable@vger.kernel.org
      Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
      1a7bbda5
    • K
      xen/bootup: allow read_tscp call for Xen PV guests. · cd0608e7
      Konrad Rzeszutek Wilk 提交于
      The hypervisor will trap it. However without this patch,
      we would crash as the .read_tscp is set to NULL. This patch
      fixes it and sets it to the native_read_tscp call.
      
      CC: stable@vger.kernel.org
      Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
      cd0608e7
    • I
      Merge branch 'rcu/urgent' of... · c1f8d901
      Ingo Molnar 提交于
      Merge branch 'rcu/urgent' of git://git.kernel.org/pub/scm/linux/kernel/git/paulmck/linux-rcu into core/rcu
      
      Pull RCU fixes from Paul E. McKenney.
      Signed-off-by: NIngo Molnar <mingo@kernel.org>
      c1f8d901
    • J
      vfs: unexport getname and putname symbols · 8e377d15
      Jeff Layton 提交于
      I see no callers in module code.
      Signed-off-by: NJeff Layton <jlayton@redhat.com>
      Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
      8e377d15
    • J
      acct: constify the name arg to acct_on · cfd4da17
      Jeff Layton 提交于
      Signed-off-by: NJeff Layton <jlayton@redhat.com>
      Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
      cfd4da17
    • J
      vfs: allocate page instead of names_cache buffer in mount_block_root · a608ca21
      Jeff Layton 提交于
      First, it's incorrect to call putname() after __getname_gfp() since the
      bare __getname_gfp() call skips the auditing code, while putname()
      doesn't.
      
      mount_block_root allocates a PATH_MAX buffer via __getname_gfp, and then
      calls get_fs_names to fill the buffer. That function can call
      get_filesystem_list which assumes that that buffer is a full page in
      size. On arches where PAGE_SIZE != 4k, then this could potentially
      overrun.
      
      In practice, it's hard to imagine the list of filesystem names even
      approaching 4k, but it's best to be safe. Just allocate a page for this
      purpose instead.
      
      With this, we can also remove the __getname_gfp() definition since there
      are no more callers.
      Signed-off-by: NJeff Layton <jlayton@redhat.com>
      Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
      a608ca21
    • J
      audit: overhaul __audit_inode_child to accomodate retrying · 4fa6b5ec
      Jeff Layton 提交于
      In order to accomodate retrying path-based syscalls, we need to add a
      new "type" argument to audit_inode_child. This will tell us whether
      we're looking for a child entry that represents a create or a delete.
      
      If we find a parent, don't automatically assume that we need to create a
      new entry. Instead, use the information we have to try to find an
      existing entry first. Update it if one is found and create a new one if
      not.
      Signed-off-by: NJeff Layton <jlayton@redhat.com>
      Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
      4fa6b5ec
    • J
      audit: optimize audit_compare_dname_path · e3d6b07b
      Jeff Layton 提交于
      In the cases where we already know the length of the parent, pass it as
      a parm so we don't need to recompute it. In the cases where we don't
      know the length, pass in AUDIT_NAME_FULL (-1) to indicate that it should
      be determined.
      Signed-off-by: NJeff Layton <jlayton@redhat.com>
      Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
      e3d6b07b