1. 21 2月, 2022 21 次提交
    • D
      random: fix locking in crng_fast_load() · 7c2fe2b3
      Dominik Brodowski 提交于
      crng_init is protected by primary_crng->lock, so keep holding that lock
      when incrementing crng_init from 0 to 1 in crng_fast_load(). The call to
      pr_notice() can wait until the lock is released; this code path cannot
      be reached twice, as crng_fast_load() aborts early if crng_init > 0.
      Signed-off-by: NDominik Brodowski <linux@dominikbrodowski.net>
      Reviewed-by: NEric Biggers <ebiggers@google.com>
      Signed-off-by: NJason A. Donenfeld <Jason@zx2c4.com>
      7c2fe2b3
    • J
      random: remove batched entropy locking · 77760fd7
      Jason A. Donenfeld 提交于
      Rather than use spinlocks to protect batched entropy, we can instead
      disable interrupts locally, since we're dealing with per-cpu data, and
      manage resets with a basic generation counter. At the same time, we
      can't quite do this on PREEMPT_RT, where we still want spinlocks-as-
      mutexes semantics. So we use a local_lock_t, which provides the right
      behavior for each. Because this is a per-cpu lock, that generation
      counter is still doing the necessary CPU-to-CPU communication.
      
      This should improve performance a bit. It will also fix the linked splat
      that Jonathan received with a PROVE_RAW_LOCK_NESTING=y.
      Reviewed-by: NSebastian Andrzej Siewior <bigeasy@linutronix.de>
      Reviewed-by: NDominik Brodowski <linux@dominikbrodowski.net>
      Reviewed-by: NEric Biggers <ebiggers@google.com>
      Suggested-by: NAndy Lutomirski <luto@kernel.org>
      Reported-by: NJonathan Neuschäfer <j.neuschaefer@gmx.net>
      Tested-by: NJonathan Neuschäfer <j.neuschaefer@gmx.net>
      Link: https://lore.kernel.org/lkml/YfMa0QgsjCVdRAvJ@latitude/Signed-off-by: NJason A. Donenfeld <Jason@zx2c4.com>
      77760fd7
    • E
      random: remove use_input_pool parameter from crng_reseed() · 5d58ea3a
      Eric Biggers 提交于
      The primary_crng is always reseeded from the input_pool, while the NUMA
      crngs are always reseeded from the primary_crng.  Remove the redundant
      'use_input_pool' parameter from crng_reseed() and just directly check
      whether the crng is the primary_crng.
      Signed-off-by: NEric Biggers <ebiggers@google.com>
      Signed-off-by: NJason A. Donenfeld <Jason@zx2c4.com>
      5d58ea3a
    • J
      random: make credit_entropy_bits() always safe · a49c010e
      Jason A. Donenfeld 提交于
      This is called from various hwgenerator drivers, so rather than having
      one "safe" version for userspace and one "unsafe" version for the
      kernel, just make everything safe; the checks are cheap and sensible to
      have anyway.
      Reported-by: NSultan Alsawaf <sultan@kerneltoast.com>
      Reviewed-by: NEric Biggers <ebiggers@google.com>
      Reviewed-by: NDominik Brodowski <linux@dominikbrodowski.net>
      Signed-off-by: NJason A. Donenfeld <Jason@zx2c4.com>
      a49c010e
    • J
      random: always wake up entropy writers after extraction · 489c7fc4
      Jason A. Donenfeld 提交于
      Now that POOL_BITS == POOL_MIN_BITS, we must unconditionally wake up
      entropy writers after every extraction. Therefore there's no point of
      write_wakeup_threshold, so we can move it to the dustbin of unused
      compatibility sysctls. While we're at it, we can fix a small comparison
      where we were waking up after <= min rather than < min.
      
      Cc: Theodore Ts'o <tytso@mit.edu>
      Suggested-by: NEric Biggers <ebiggers@kernel.org>
      Reviewed-by: NEric Biggers <ebiggers@google.com>
      Reviewed-by: NDominik Brodowski <linux@dominikbrodowski.net>
      Signed-off-by: NJason A. Donenfeld <Jason@zx2c4.com>
      489c7fc4
    • J
      random: use linear min-entropy accumulation crediting · c5704490
      Jason A. Donenfeld 提交于
      30e37ec5 ("random: account for entropy loss due to overwrites")
      assumed that adding new entropy to the LFSR pool probabilistically
      cancelled out old entropy there, so entropy was credited asymptotically,
      approximating Shannon entropy of independent sources (rather than a
      stronger min-entropy notion) using 1/8th fractional bits and replacing
      a constant 2-2/√𝑒 term (~0.786938) with 3/4 (0.75) to slightly
      underestimate it. This wasn't superb, but it was perhaps better than
      nothing, so that's what was done. Which entropy specifically was being
      cancelled out and how much precisely each time is hard to tell, though
      as I showed with the attack code in my previous commit, a motivated
      adversary with sufficient information can actually cancel out
      everything.
      
      Since we're no longer using an LFSR for entropy accumulation, this
      probabilistic cancellation is no longer relevant. Rather, we're now
      using a computational hash function as the accumulator and we've
      switched to working in the random oracle model, from which we can now
      revisit the question of min-entropy accumulation, which is done in
      detail in <https://eprint.iacr.org/2019/198>.
      
      Consider a long input bit string that is built by concatenating various
      smaller independent input bit strings. Each one of these inputs has a
      designated min-entropy, which is what we're passing to
      credit_entropy_bits(h). When we pass the concatenation of these to a
      random oracle, it means that an adversary trying to receive back the
      same reply as us would need to become certain about each part of the
      concatenated bit string we passed in, which means becoming certain about
      all of those h values. That means we can estimate the accumulation by
      simply adding up the h values in calls to credit_entropy_bits(h);
      there's no probabilistic cancellation at play like there was said to be
      for the LFSR. Incidentally, this is also what other entropy accumulators
      based on computational hash functions do as well.
      
      So this commit replaces credit_entropy_bits(h) with essentially `total =
      min(POOL_BITS, total + h)`, done with a cmpxchg loop as before.
      
      What if we're wrong and the above is nonsense? It's not, but let's
      assume we don't want the actual _behavior_ of the code to change much.
      Currently that behavior is not extracting from the input pool until it
      has 128 bits of entropy in it. With the old algorithm, we'd hit that
      magic 128 number after roughly 256 calls to credit_entropy_bits(1). So,
      we can retain more or less the old behavior by waiting to extract from
      the input pool until it hits 256 bits of entropy using the new code. For
      people concerned about this change, it means that there's not that much
      practical behavioral change. And for folks actually trying to model
      the behavior rigorously, it means that we have an even higher margin
      against attacks.
      
      Cc: Theodore Ts'o <tytso@mit.edu>
      Cc: Dominik Brodowski <linux@dominikbrodowski.net>
      Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
      Reviewed-by: NEric Biggers <ebiggers@google.com>
      Reviewed-by: NJean-Philippe Aumasson <jeanphilippe.aumasson@gmail.com>
      Signed-off-by: NJason A. Donenfeld <Jason@zx2c4.com>
      c5704490
    • J
      random: simplify entropy debiting · 9c07f578
      Jason A. Donenfeld 提交于
      Our pool is 256 bits, and we only ever use all of it or don't use it at
      all, which is decided by whether or not it has at least 128 bits in it.
      So we can drastically simplify the accounting and cmpxchg loop to do
      exactly this.  While we're at it, we move the minimum bit size into a
      constant so it can be shared between the two places where it matters.
      
      The reason we want any of this is for the case in which an attacker has
      compromised the current state, and then bruteforces small amounts of
      entropy added to it. By demanding a particular minimum amount of entropy
      be present before reseeding, we make that bruteforcing difficult.
      
      Note that this rationale no longer includes anything about /dev/random
      blocking at the right moment, since /dev/random no longer blocks (except
      for at ~boot), but rather uses the crng. In a former life, /dev/random
      was different and therefore required a more nuanced account(), but this
      is no longer.
      
      Behaviorally, nothing changes here. This is just a simplification of
      the code.
      
      Cc: Theodore Ts'o <tytso@mit.edu>
      Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
      Reviewed-by: NEric Biggers <ebiggers@google.com>
      Reviewed-by: NDominik Brodowski <linux@dominikbrodowski.net>
      Signed-off-by: NJason A. Donenfeld <Jason@zx2c4.com>
      9c07f578
    • J
      random: use computational hash for entropy extraction · 6e8ec255
      Jason A. Donenfeld 提交于
      The current 4096-bit LFSR used for entropy collection had a few
      desirable attributes for the context in which it was created. For
      example, the state was huge, which meant that /dev/random would be able
      to output quite a bit of accumulated entropy before blocking. It was
      also, in its time, quite fast at accumulating entropy byte-by-byte,
      which matters given the varying contexts in which mix_pool_bytes() is
      called. And its diffusion was relatively high, which meant that changes
      would ripple across several words of state rather quickly.
      
      However, it also suffers from a few security vulnerabilities. In
      particular, inputs learned by an attacker can be undone, but moreover,
      if the state of the pool leaks, its contents can be controlled and
      entirely zeroed out. I've demonstrated this attack with this SMT2
      script, <https://xn--4db.cc/5o9xO8pb>, which Boolector/CaDiCal solves in
      a matter of seconds on a single core of my laptop, resulting in little
      proof of concept C demonstrators such as <https://xn--4db.cc/jCkvvIaH/c>.
      
      For basically all recent formal models of RNGs, these attacks represent
      a significant cryptographic flaw. But how does this manifest
      practically? If an attacker has access to the system to such a degree
      that he can learn the internal state of the RNG, arguably there are
      other lower hanging vulnerabilities -- side-channel, infoleak, or
      otherwise -- that might have higher priority. On the other hand, seed
      files are frequently used on systems that have a hard time generating
      much entropy on their own, and these seed files, being files, often leak
      or are duplicated and distributed accidentally, or are even seeded over
      the Internet intentionally, where their contents might be recorded or
      tampered with. Seen this way, an otherwise quasi-implausible
      vulnerability is a bit more practical than initially thought.
      
      Another aspect of the current mix_pool_bytes() function is that, while
      its performance was arguably competitive for the time in which it was
      created, it's no longer considered so. This patch improves performance
      significantly: on a high-end CPU, an i7-11850H, it improves performance
      of mix_pool_bytes() by 225%, and on a low-end CPU, a Cortex-A7, it
      improves performance by 103%.
      
      This commit replaces the LFSR of mix_pool_bytes() with a straight-
      forward cryptographic hash function, BLAKE2s, which is already in use
      for pool extraction. Universal hashing with a secret seed was considered
      too, something along the lines of <https://eprint.iacr.org/2013/338>,
      but the requirement for a secret seed makes for a chicken & egg problem.
      Instead we go with a formally proven scheme using a computational hash
      function, described in sections 5.1, 6.4, and B.1.8 of
      <https://eprint.iacr.org/2019/198>.
      
      BLAKE2s outputs 256 bits, which should give us an appropriate amount of
      min-entropy accumulation, and a wide enough margin of collision
      resistance against active attacks. mix_pool_bytes() becomes a simple
      call to blake2s_update(), for accumulation, while the extraction step
      becomes a blake2s_final() to generate a seed, with which we can then do
      a HKDF-like or BLAKE2X-like expansion, the first part of which we fold
      back as an init key for subsequent blake2s_update()s, and the rest we
      produce to the caller. This then is provided to our CRNG like usual. In
      that expansion step, we make opportunistic use of 32 bytes of RDRAND
      output, just as before. We also always reseed the crng with 32 bytes,
      unconditionally, or not at all, rather than sometimes with 16 as before,
      as we don't win anything by limiting beyond the 16 byte threshold.
      
      Going for a hash function as an entropy collector is a conservative,
      proven approach. The result of all this is a much simpler and much less
      bespoke construction than what's there now, which not only plugs a
      vulnerability but also improves performance considerably.
      
      Cc: Theodore Ts'o <tytso@mit.edu>
      Cc: Dominik Brodowski <linux@dominikbrodowski.net>
      Reviewed-by: NEric Biggers <ebiggers@google.com>
      Reviewed-by: NGreg Kroah-Hartman <gregkh@linuxfoundation.org>
      Reviewed-by: NJean-Philippe Aumasson <jeanphilippe.aumasson@gmail.com>
      Signed-off-by: NJason A. Donenfeld <Jason@zx2c4.com>
      6e8ec255
    • L
      Linux 5.17-rc5 · cfb92440
      Linus Torvalds 提交于
      cfb92440
    • L
      Merge tag 'locking_urgent_for_v5.17_rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 3324e6e8
      Linus Torvalds 提交于
      Pull locking fix from Borislav Petkov:
       "Fix a NULL ptr dereference when dumping lockdep chains through
        /proc/lockdep_chains"
      
      * tag 'locking_urgent_for_v5.17_rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        lockdep: Correct lock_classes index mapping
      3324e6e8
    • L
      Merge tag 'x86_urgent_for_v5.17_rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 22217739
      Linus Torvalds 提交于
      Pull x86 fixes from Borislav Petkov:
      
       - Fix the ptrace regset xfpregs_set() callback to behave according to
         the ABI
      
       - Handle poisoned pages properly in the SGX reclaimer code
      
      * tag 'x86_urgent_for_v5.17_rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/ptrace: Fix xfpregs_set()'s incorrect xmm clearing
        x86/sgx: Fix missing poison handling in reclaimer
      22217739
    • L
      Merge tag 'sched_urgent_for_v5.17_rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 0b0894ff
      Linus Torvalds 提交于
      Pull scheduler fix from Borislav Petkov:
       "Fix task exposure order when forking tasks"
      
      * tag 'sched_urgent_for_v5.17_rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        sched: Fix yet more sched_fork() races
      0b0894ff
    • L
      Merge tag 'edac_urgent_for_v5.17_rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/ras/ras · 6e8e752f
      Linus Torvalds 提交于
      Pull EDAC fix from Borislav Petkov:
       "Fix a long-standing struct alignment bug in the EDAC struct allocation
        code"
      
      * tag 'edac_urgent_for_v5.17_rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/ras/ras:
        EDAC: Fix calculation of returned address and next offset in edac_align_ptr()
      6e8e752f
    • L
      Merge tag 'scsi-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi · e268d708
      Linus Torvalds 提交于
      Pull SCSI fixes from James Bottomley:
       "Three fixes, all in drivers.
      
        The ufs and qedi fixes are minor; the lpfc one is a bit bigger because
        it involves adding a heuristic to detect and deal with common but not
        standards compliant behaviour"
      
      * tag 'scsi-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi:
        scsi: ufs: core: Fix divide by zero in ufshcd_map_queues()
        scsi: lpfc: Fix pt2pt NVMe PRLI reject LOGO loop
        scsi: qedi: Fix ABBA deadlock in qedi_process_tmf_resp() and qedi_process_cmd_cleanup_resp()
      e268d708
    • L
      Merge tag 'dmaengine-fix-5.17' of git://git.kernel.org/pub/scm/linux/kernel/git/vkoul/dmaengine · 77478077
      Linus Torvalds 提交于
      Pull dmaengine fixes from Vinod Koul:
       "A bunch of driver fixes for:
      
         - ptdma error handling in init
      
         - lock fix in at_hdmac
      
         - error path and error num fix for sh dma
      
         - pm balance fix for stm32"
      
      * tag 'dmaengine-fix-5.17' of git://git.kernel.org/pub/scm/linux/kernel/git/vkoul/dmaengine:
        dmaengine: shdma: Fix runtime PM imbalance on error
        dmaengine: sh: rcar-dmac: Check for error num after dma_set_max_seg_size
        dmaengine: stm32-dmamux: Fix PM disable depth imbalance in stm32_dmamux_probe
        dmaengine: sh: rcar-dmac: Check for error num after setting mask
        dmaengine: at_xdmac: Fix missing unlock in at_xdmac_tasklet()
        dmaengine: ptdma: Fix the error handling path in pt_core_init()
      77478077
    • L
      Merge branch 'i2c/for-current' of git://git.kernel.org/pub/scm/linux/kernel/git/wsa/linux · dacec3e7
      Linus Torvalds 提交于
      Pull i2c fixes from Wolfram Sang:
       "Some driver updates, a MAINTAINERS fix, and additions to COMPILE_TEST
        (so we won't miss build problems again)"
      
      * 'i2c/for-current' of git://git.kernel.org/pub/scm/linux/kernel/git/wsa/linux:
        MAINTAINERS: remove duplicate entry for i2c-qcom-geni
        i2c: brcmstb: fix support for DSL and CM variants
        i2c: qup: allow COMPILE_TEST
        i2c: imx: allow COMPILE_TEST
        i2c: cadence: allow COMPILE_TEST
        i2c: qcom-cci: don't put a device tree node before i2c_add_adapter()
        i2c: qcom-cci: don't delete an unregistered adapter
        i2c: bcm2835: Avoid clock stretching timeouts
      dacec3e7
    • L
      Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/dtor/input · 961af9db
      Linus Torvalds 提交于
      Pull input fixes from Dmitry Torokhov:
      
       - a fix for Synaptics touchpads in RMI4 mode failing to suspend/resume
         properly because I2C client devices are now being suspended and
         resumed asynchronously which changed the ordering
      
       - a change to make sure we do not set right and middle buttons
         capabilities on touchpads that are "buttonpads" (i.e. do not have
         separate physical buttons)
      
       - a change to zinitix touchscreen driver adding more compatible
         strings/IDs
      
      * 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/dtor/input:
        Input: psmouse - set up dependency between PS/2 and SMBus companions
        Input: zinitix - add new compatible strings
        Input: clear BTN_RIGHT/MIDDLE on buttonpads
      961af9db
    • L
      Merge tag 'for-v5.17-rc' of git://git.kernel.org/pub/scm/linux/kernel/git/sre/linux-power-supply · 70d2bec7
      Linus Torvalds 提交于
      Pull power supply fixes from Sebastian Reichel:
       "Three regression fixes for the 5.17 cycle:
      
         - build warning fix for power-supply documentation
      
         - pointer size fix in cw2015 battery driver
      
         - OOM handling in bq256xx charger driver"
      
      * tag 'for-v5.17-rc' of git://git.kernel.org/pub/scm/linux/kernel/git/sre/linux-power-supply:
        power: supply: bq256xx: Handle OOM correctly
        power: supply: core: fix application of sizeof to pointer
        power: supply: fix table problem in sysfs-class-power
      70d2bec7
    • L
      Merge tag 'fs.mount_setattr.v5.17-rc4' of... · 7f25f041
      Linus Torvalds 提交于
      Merge tag 'fs.mount_setattr.v5.17-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/brauner/linux
      
      Pull mount_setattr test/doc fixes from Christian Brauner:
       "This contains a fix for one of the selftests for the mount_setattr
        syscall to create idmapped mounts, an entry for idmapped mounts for
        maintainers, and missing kernel documentation for the helper we split
        out some time ago to get and yield write access to a mount when
        changing mount properties"
      
      * tag 'fs.mount_setattr.v5.17-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/brauner/linux:
        fs: add kernel doc for mnt_{hold,unhold}_writers()
        MAINTAINERS: add entry for idmapped mounts
        tests: fix idmapped mount_setattr test
      7f25f041
    • L
      Merge tag 'pidfd.v5.17-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/brauner/linux · c1034d24
      Linus Torvalds 提交于
      Pull pidfd fix from Christian Brauner:
       "This fixes a problem reported by lockdep when installing a pidfd via
        fd_install() with siglock and the tasklisk write lock held in
        copy_process() when calling clone()/clone3() with CLONE_PIDFD.
      
        Originally a pidfd was created prior to holding any of these locks but
        this required a call to ksys_close(). So quite some time ago in
        6fd2fe49 ("copy_process(): don't use ksys_close() on cleanups") we
        switched to a get_unused_fd_flags() + fd_install() model.
      
        As part of that we moved fd_install() as late as possible. This was
        done for two main reasons. First, because we needed to ensure that we
        call fd_install() past the point of no return as once that's called
        the fd is live in the task's file table. Second, because we tried to
        ensure that the fd is visible in /proc/<pid>/fd/<pidfd> right when the
        task is visible.
      
        This fix moves the fd_install() to an even later point which means
        that a task will be visible in proc while the pidfd isn't yet under
        /proc/<pid>/fd/<pidfd>.
      
        While this is a user visible change it's very unlikely that this will
        have any impact. Nobody should be relying on that and if they do we
        need to come up with something better but again, it's doubtful this is
        relevant"
      
      * tag 'pidfd.v5.17-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/brauner/linux:
        copy_process(): Move fd_install() out of sighand->siglock critical section
      c1034d24
    • L
      Merge branch 'ucount-rlimit-fixes-for-v5.17' of... · 2d3409eb
      Linus Torvalds 提交于
      Merge branch 'ucount-rlimit-fixes-for-v5.17' of git://git.kernel.org/pub/scm/linux/kernel/git/ebiederm/user-namespace
      
      Pull ucounts fixes from Eric Biederman:
       "Michal Koutný recently found some bugs in the enforcement of
        RLIMIT_NPROC in the recent ucount rlimit implementation.
      
        In this set of patches I have developed a very conservative approach
        changing only what is necessary to fix the bugs that I can see
        clearly. Cleanups and anything that is making the code more consistent
        can follow after we have the code working as it has historically.
      
        The problem is not so much inconsistencies (although those exist) but
        that it is very difficult to figure out what the code should be doing
        in the case of RLIMIT_NPROC.
      
        All other rlimits are only enforced where the resource is acquired
        (allocated). RLIMIT_NPROC by necessity needs to be enforced in an
        additional location, and our current implementation stumbled it's way
        into that implementation"
      
      * 'ucount-rlimit-fixes-for-v5.17' of git://git.kernel.org/pub/scm/linux/kernel/git/ebiederm/user-namespace:
        ucounts: Handle wrapping in is_ucounts_overlimit
        ucounts: Move RLIMIT_NPROC handling after set_user
        ucounts: Base set_cred_ucounts changes on the real user
        ucounts: Enforce RLIMIT_NPROC not RLIMIT_NPROC+1
        rlimit: Fix RLIMIT_NPROC enforcement failure caused by capability calls in set_user
      2d3409eb
  2. 19 2月, 2022 13 次提交
  3. 18 2月, 2022 6 次提交