提交 · 19ad5f79f92e01375070009439c9f3ae22dc6b22 · openeuler / Kernel

14 9月, 2019 1 次提交

devices.txt: improve entry for comedi (char major 98) · d62e8055

由 Ian Abbott 提交于 9月 11, 2019

Describe how the comedi minor device numbers are split across comedi
devices and comedi subdevices.

Replace the current, long dead URL with an official URL for the Comedi
project.
Signed-off-by: NIan Abbott <abbotti@mev.co.uk>
Signed-off-by: NJonathan Corbet <corbet@lwn.net>

d62e8055

06 9月, 2019 1 次提交

Documentation: sysrq: don't recommend 'S' 'U' before 'B' · 209c3aa7

由 Adam Borowski 提交于 9月 03, 2019

This advice is obsolete and slightly harmful for filesystems from this
millenium: any modern filesystem can handle unexpected crashes without
requiring fsck -- and on the other hand, trying to write to the disk when
the kernel is in a bad state risks introducing corruption.

For ext2, any unsafe shutdown meant widespread breakage, but it's no longer
a reasonable filesystem for any non-special use.
Signed-off-by: NAdam Borowski <kilobyte@angband.pl>
Signed-off-by: NJonathan Corbet <corbet@lwn.net>

209c3aa7

01 8月, 2019 6 次提交

docs: fs: cifs: convert to ReST and add to admin-guide book · f139291c

由 Mauro Carvalho Chehab 提交于 7月 31, 2019

The filenames for cifs documentation is not using the same
convention as almost all Kernel documents is using. So,
rename them to a more appropriate name. Then, manually convert
the documentation files for CIFS to ReST.

By doing a manual conversion, we can preserve the original
author's style, while making it to look more like the other
Kernel documents.

Most of the conversion here is trivial. The most complex one was
the README file (which was renamed to usage.rst).
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>
Signed-off-by: NJonathan Corbet <corbet@lwn.net>

f139291c

docs: wimax: convert to ReST and add to admin-guide · ff497db2

由 Mauro Carvalho Chehab 提交于 7月 26, 2019

Manually convert wimax documentation to ReST and add theit
to the Kernel doc body, inside the admin-guide.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>
Signed-off-by: NJonathan Corbet <corbet@lwn.net>

ff497db2

docs: admin-guide: add auxdisplay files to it after conversion to ReST · 76b5a6e8

由 Mauro Carvalho Chehab 提交于 7月 26, 2019

Those two files describe userspace-faced information. While part of
it might fit on uAPI, it sounds to me that the admin guide is the
best place for them.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>
Acked-by: NMiguel Ojeda <miguel.ojeda.sandonis@gmail.com>
Signed-off-by: NJonathan Corbet <corbet@lwn.net>

76b5a6e8

Documentation: filesystems: Convert ufs.txt to reStructuredText format · 34d5f4f2

由 Shobhit Kukreti 提交于 7月 10, 2019

This converts the plain text documentation of ufs.txt to
reStructuredText format. Added to documentation build process
and verified with make htmldocs
Signed-off-by: NShobhit Kukreti <shobhitkukreti@gmail.com>
Reviewed-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>
Signed-off-by: NJonathan Corbet <corbet@lwn.net>

34d5f4f2

Documentation: filesystems: Convert jfs.txt to · ac841c4e

由 Shobhit Kukreti 提交于 7月 10, 2019

This converts the plain text documentation of jfs.txt to reStructuredText
format. Added to documentation build process and verified with
make htmldocs
Signed-off-by: NShobhit Kukreti <shobhitkukreti@gmail.com>
Reviewed-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>
Signed-off-by: NJonathan Corbet <corbet@lwn.net>

ac841c4e

docs: cgroup-v1/blkio-controller.rst: remove a CFQ left over · 23aa1648

由 Mauro Carvalho Chehab 提交于 7月 26, 2019

changeset fb5772cb ("blkio-controller.txt: Remove references to CFQ")
removed cgroup references to CFQ, but it kept one left. Get rid of
it.

Fixes: fb5772cb ("blkio-controller.txt: Remove references to CFQ")
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>
Signed-off-by: NJonathan Corbet <corbet@lwn.net>

23aa1648

31 7月, 2019 1 次提交

Documentation: filesystem: fix "Removed Sysctls" table · 38a449ff

由 Sheriff Esseson 提交于 7月 23, 2019

the "Removed Sysctls" section is a table - bring it alive with ReST.
Signed-off-by: NSheriff Esseson <sheriffesseson@gmail.com>
Signed-off-by: NJonathan Corbet <corbet@lwn.net>

38a449ff

23 7月, 2019 1 次提交

docs/vm: transhuge: fix typo in madvise reference · 74af0d0b

由 Jeremy Cline 提交于 7月 16, 2019

Fix an off-by-one typo in the transparent huge pages admin
documentation.
Signed-off-by: NJeremy Cline <jcline@redhat.com>
Acked-by: NMike Rapoport <rppt@linux.ibm.com>
Signed-off-by: NJonathan Corbet <corbet@lwn.net>

74af0d0b

17 7月, 2019 5 次提交

docs: remove extra conf.py files · 9fc3a18a

由 Mauro Carvalho Chehab 提交于 7月 14, 2019

Now that the latex_documents are handled automatically, we can
remove those extra conf.py files.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>

9fc3a18a

xen: Map "xen_nopv" parameter to "nopv" and mark it obsolete · b39b0497

由 Zhenzhong Duan 提交于 7月 11, 2019

Clean up unnecessory code after that operation.
Signed-off-by: NZhenzhong Duan <zhenzhong.duan@oracle.com>
Reviewed-by: NBoris Ostrovsky <boris.ostrovsky@oracle.com>
Cc: Juergen Gross <jgross@suse.com>
Cc: Stefano Stabellini <sstabellini@kernel.org>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Borislav Petkov <bp@alien8.de>
Signed-off-by: NJuergen Gross <jgross@suse.com>

b39b0497

x86: Add "nopv" parameter to disable PV extensions · 30978346

由 Zhenzhong Duan 提交于 7月 11, 2019

In virtualization environment, PV extensions (drivers, interrupts,
timers, etc) are enabled in the majority of use cases which is the
best option.

However, in some cases (kexec not fully working, benchmarking)
we want to disable PV extensions. We have "xen_nopv" for that purpose
but only for XEN. For a consistent admin experience a common command
line parameter "nopv" set across all PV guest implementations is a
better choice.

There are guest types which just won't work without PV extensions,
like Xen PV, Xen PVH and jailhouse. add a "ignore_nopv" member to
struct hypervisor_x86 set to true for those guest types and call
the detect functions only if nopv is false or ignore_nopv is true.
Suggested-by: NJuergen Gross <jgross@suse.com>
Signed-off-by: NZhenzhong Duan <zhenzhong.duan@oracle.com>
Reviewed-by: NJuergen Gross <jgross@suse.com>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Borislav Petkov <bp@alien8.de>
Cc: Jan Kiszka <jan.kiszka@siemens.com>
Cc: Boris Ostrovsky <boris.ostrovsky@oracle.com>
Cc: Stefano Stabellini <sstabellini@kernel.org>
Signed-off-by: NJuergen Gross <jgross@suse.com>

30978346

xen: remove tmem driver · 814bbf49

由 Juergen Gross 提交于 7月 14, 2019

The Xen tmem (transcendent memory) driver can be removed, as the
related Xen hypervisor feature never made it past the "experimental"
state and will be removed in future Xen versions (>= 4.13).

The xen-selfballoon driver depends on tmem, so it can be removed, too.
Signed-off-by: NJuergen Gross <jgross@suse.com>
Acked-by: NBoris Ostrovsky <boris.ostrovsky@oracle.com>
Signed-off-by: NJuergen Gross <jgross@suse.com>

814bbf49

vmcore: add a kernel parameter novmcoredd · c6c40533

由 Kairui Song 提交于 7月 16, 2019

Since commit 2724273e ("vmcore: add API to collect hardware dump in
second kernel"), drivers are allowed to add device related dump data to
vmcore as they want by using the device dump API.  This has a potential
issue, the data is stored in memory, drivers may append too much data
and use too much memory.  The vmcore is typically used in a kdump kernel
which runs in a pre-reserved small chunk of memory.  So as a result it
will make kdump unusable at all due to OOM issues.

So introduce new 'novmcoredd' command line option.  User can disable
device dump to reduce memory usage.  This is helpful if device dump is
using too much memory, disabling device dump could make sure a regular
vmcore without device dump data is still available.

[akpm@linux-foundation.org: tweak documentation]
[akpm@linux-foundation.org: vmcore.c needs moduleparam.h]
Link: http://lkml.kernel.org/r/20190528111856.7276-1-kasong@redhat.comSigned-off-by: NKairui Song <kasong@redhat.com>
Acked-by: NDave Young <dyoung@redhat.com>
Reviewed-by: NBhupesh Sharma <bhsharma@redhat.com>
Cc: Rahul Lakkireddy <rahul.lakkireddy@chelsio.com>
Cc: "David S . Miller" <davem@davemloft.net>
Cc: Eric Biederman <ebiederm@xmission.com>
Cc: Alexey Dobriyan <adobriyan@gmail.com>
Cc: Baoquan He <bhe@redhat.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

c6c40533

16 7月, 2019 1 次提交

Documentation: filesystem: Convert xfs.txt to ReST · 89b408a6

由 Sheriff Esseson 提交于 7月 15, 2019

Move xfs.txt to admin-guide, convert xfs.txt to ReST and broken references
Signed-off-by: NSheriff Esseson <sheriffesseson@gmail.com>
Reviewed-by: NMatthew Wilcox (Oracle) <willy@infradead.org>
Reviewed-by: NDarrick J. Wong <darrick.wong@oracle.com>
Signed-off-by: NDarrick J. Wong <darrick.wong@oracle.com>

89b408a6

15 7月, 2019 21 次提交

docs: don't use nested tables · eddeed12

由 Mauro Carvalho Chehab 提交于 7月 06, 2019

Nested tables aren't supported for pdf output on Sphinx 1.7.9:

admin-guide/laptops/sonypi:: nested tables are not yet implemented.
admin-guide/laptops/toshiba_haps:: nested tables are not yet implemented.
driver-api/nvdimm/btt:: nested tables are not yet implemented.
s390/debugging390:: nested tables are not yet implemented.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>
Acked-by: Andy Shevchenko <andy.shevchenko@gmail.com> # laptops

eddeed12

docs: gpio: add sysfs interface to the admin-guide · c2746a1e

由 Mauro Carvalho Chehab 提交于 6月 28, 2019

While this is stated as obsoleted, the sysfs interface described
there is still valid, and belongs to the admin-guide.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>
Acked-by: NLinus Walleij <linus.walleij@linaro.org>

c2746a1e

docs: add SPDX tags to new index files · 7e042736

由 Mauro Carvalho Chehab 提交于 6月 28, 2019

All those new files I added are under GPL v2.0 license.

Add the corresponding SPDX headers to them.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>

7e042736

docs: driver-api: add a series of orphaned documents · baa293e9

由 Mauro Carvalho Chehab 提交于 6月 27, 2019

There are lots of documents under Documentation/*.txt and a few other
orphan documents elsehwere that belong to the driver-API book.

Move them to their right place.

Reviewed-by: Cornelia Huck <cohuck@redhat.com> # vfio-related parts
Acked-by: Logan Gunthorpe <logang@deltatee.com> # switchtec
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>

baa293e9

docs: admin-guide: add a series of orphaned documents · 4f4cfa6c

由 Mauro Carvalho Chehab 提交于 6月 27, 2019

There are lots of documents that belong to the admin-guide but
are on random places (most under Documentation root dir).

Move them to the admin guide.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>
Acked-by: NAlexandre Belloni <alexandre.belloni@bootlin.com>
Acked-by: NBartlomiej Zolnierkiewicz <b.zolnierkie@samsung.com>

4f4cfa6c

docs: cgroup-v1: add it to the admin-guide book · da82c92f

由 Mauro Carvalho Chehab 提交于 6月 27, 2019

Those files belong to the admin guide, so add them.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>

da82c92f

docs: aoe: add it to the driver-api book · 83bbf6e1

由 Mauro Carvalho Chehab 提交于 6月 27, 2019

Those files belong to the admin guide, so add them.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>
Acked-by: NJustin Sanders <justin@coraid.com>

83bbf6e1

docs: blockdev: add it to the admin-guide · e7751617

由 Mauro Carvalho Chehab 提交于 6月 18, 2019

The blockdev book basically contains user-faced documentation.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>

e7751617

docs: admin-guide: add kdump documentation into it · 330d4810

由 Mauro Carvalho Chehab 提交于 6月 13, 2019

The Kdump documentation describes procedures with admins use
in order to solve issues on their systems.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>

330d4810

docs: admin-guide: add laptops documentation · 9e1cbede

由 Mauro Carvalho Chehab 提交于 6月 13, 2019

The docs under Documentation/laptops contain users specific
information.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>
Acked-by: NAndy Shevchenko <andy.shevchenko@gmail.com>

9e1cbede

docs: admin-guide: move sysctl directory to it · 57043247

由 Mauro Carvalho Chehab 提交于 4月 22, 2019

The stuff under sysctl describes /sys interface from userspace
point of view. So, add it to the admin-guide and remove the
:orphan: from its index file.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>

57043247

docs: device-mapper: move it to the admin-guide · 6cf2a73c

由 Mauro Carvalho Chehab 提交于 6月 18, 2019

The DM support describes lots of aspects related to mapped
disk partitions from the userspace PoV.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>

6cf2a73c

docs: namespace: move it to the admin-guide · bf6b7a74

由 Mauro Carvalho Chehab 提交于 6月 18, 2019

As stated at the documentation, this is meant to be for
users to better understand namespaces.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>

bf6b7a74

docs: perf: move to the admin-guide · 59809fe8

由 Mauro Carvalho Chehab 提交于 6月 18, 2019

The perf infrastructure is used for userspace to track issues.
At least a good part of what's described here is related to
it.

So, add it to the admin-guide.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>

59809fe8

docs: rapidio: add it to the driver API · d2bdd48a

由 Mauro Carvalho Chehab 提交于 6月 18, 2019

This is actually a subsystem description, with contains both
kAPI and uAPI.

While it should ideally be slplit, let's place it at driver-api,
as most things are related to kAPI and driver-specific info.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>

d2bdd48a

docs: block: convert to ReST · 898bd37a

由 Mauro Carvalho Chehab 提交于 4月 18, 2019

Rename the block documentation files to ReST, add an
index for them and adjust in order to produce a nice html
output via the Sphinx build system.

At its new index.rst, let's add a :orphan: while this is not linked to
the main index.rst file, in order to avoid build warnings.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>

898bd37a

docs: sysctl: convert to ReST · 53b95375

由 Mauro Carvalho Chehab 提交于 4月 18, 2019

Rename the /proc/sys/ documentation files to ReST, using the
README file as a template for an index.rst, adding the other
files there via TOC markup.

Despite being written on different times with different
styles, try to make them somewhat coherent with a similar
look and feel, ensuring that they'll look nice as both
raw text file and as via the html output produced by the
Sphinx build system.

At its new index.rst, let's add a :orphan: while this is not linked to
the main index.rst file, in order to avoid build warnings.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>

53b95375

docs: blockdev: convert to ReST · 39443104

由 Mauro Carvalho Chehab 提交于 4月 18, 2019

Rename the blockdev documentation files to ReST, add an
index for them and adjust in order to produce a nice html
output via the Sphinx build system.

The drbd sub-directory contains some graphs and data flows.
Add those too to the documentation.

At its new index.rst, let's add a :orphan: while this is not linked to
the main index.rst file, in order to avoid build warnings.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>

39443104

docs: laptops: convert to ReST · b02f1651

由 Mauro Carvalho Chehab 提交于 4月 18, 2019

Rename the laptops documentation files to ReST, add an
index for them and adjust in order to produce a nice html
output via the Sphinx build system.

At its new index.rst, let's add a :orphan: while this is not linked to
the main index.rst file, in order to avoid build warnings.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>
Acked-by: NAndy Shevchenko <andy.shevchenko@gmail.com>

b02f1651

docs: accounting: convert to ReST · c3123552

由 Mauro Carvalho Chehab 提交于 4月 17, 2019

Rename the accounting documentation files to ReST, add an
index for them and adjust in order to produce a nice html
output via the Sphinx build system.

At its new index.rst, let's add a :orphan: while this is not linked to
the main index.rst file, in order to avoid build warnings.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>

c3123552

docs: m68k: convert docs to ReST and rename to *.rst · 23e02422

由 Mauro Carvalho Chehab 提交于 4月 14, 2019

Convert the m68k kernel-options.txt file to ReST.

The conversion is trivial, as the document is already on a format
close enough to ReST. Just some small adjustments were needed in
order to make it both good for being parsed while keeping it on
a good txt shape.

At its new index.rst, let's add a :orphan: while this is not linked to
the main index.rst file, in order to avoid build warnings.
Signed-off-by: NMauro Carvalho Chehab <mchehab+samsung@kernel.org>

23e02422

13 7月, 2019 3 次提交

mm: security: introduce init_on_alloc=1 and init_on_free=1 boot options · 6471384a

由 Alexander Potapenko 提交于 7月 11, 2019

Patch series "add init_on_alloc/init_on_free boot options", v10.

Provide init_on_alloc and init_on_free boot options.

These are aimed at preventing possible information leaks and making the
control-flow bugs that depend on uninitialized values more deterministic.

Enabling either of the options guarantees that the memory returned by the
page allocator and SL[AU]B is initialized with zeroes.  SLOB allocator
isn't supported at the moment, as its emulation of kmem caches complicates
handling of SLAB_TYPESAFE_BY_RCU caches correctly.

Enabling init_on_free also guarantees that pages and heap objects are
initialized right after they're freed, so it won't be possible to access
stale data by using a dangling pointer.

As suggested by Michal Hocko, right now we don't let the heap users to
disable initialization for certain allocations.  There's not enough
evidence that doing so can speed up real-life cases, and introducing ways
to opt-out may result in things going out of control.

This patch (of 2):

The new options are needed to prevent possible information leaks and make
control-flow bugs that depend on uninitialized values more deterministic.

This is expected to be on-by-default on Android and Chrome OS.  And it
gives the opportunity for anyone else to use it under distros too via the
boot args.  (The init_on_free feature is regularly requested by folks
where memory forensics is included in their threat models.)

init_on_alloc=1 makes the kernel initialize newly allocated pages and heap
objects with zeroes.  Initialization is done at allocation time at the
places where checks for __GFP_ZERO are performed.

init_on_free=1 makes the kernel initialize freed pages and heap objects
with zeroes upon their deletion.  This helps to ensure sensitive data
doesn't leak via use-after-free accesses.

Both init_on_alloc=1 and init_on_free=1 guarantee that the allocator
returns zeroed memory.  The two exceptions are slab caches with
constructors and SLAB_TYPESAFE_BY_RCU flag.  Those are never
zero-initialized to preserve their semantics.

Both init_on_alloc and init_on_free default to zero, but those defaults
can be overridden with CONFIG_INIT_ON_ALLOC_DEFAULT_ON and
CONFIG_INIT_ON_FREE_DEFAULT_ON.

If either SLUB poisoning or page poisoning is enabled, those options take
precedence over init_on_alloc and init_on_free: initialization is only
applied to unpoisoned allocations.

Slowdown for the new features compared to init_on_free=0, init_on_alloc=0:

hackbench, init_on_free=1:  +7.62% sys time (st.err 0.74%)
hackbench, init_on_alloc=1: +7.75% sys time (st.err 2.14%)

Linux build with -j12, init_on_free=1:  +8.38% wall time (st.err 0.39%)
Linux build with -j12, init_on_free=1:  +24.42% sys time (st.err 0.52%)
Linux build with -j12, init_on_alloc=1: -0.13% wall time (st.err 0.42%)
Linux build with -j12, init_on_alloc=1: +0.57% sys time (st.err 0.40%)

The slowdown for init_on_free=0, init_on_alloc=0 compared to the baseline
is within the standard error.

The new features are also going to pave the way for hardware memory
tagging (e.g.  arm64's MTE), which will require both on_alloc and on_free
hooks to set the tags for heap objects.  With MTE, tagging will have the
same cost as memory initialization.

Although init_on_free is rather costly, there are paranoid use-cases where
in-memory data lifetime is desired to be minimized.  There are various
arguments for/against the realism of the associated threat models, but
given that we'll need the infrastructure for MTE anyway, and there are
people who want wipe-on-free behavior no matter what the performance cost,
it seems reasonable to include it in this series.

[glider@google.com: v8]
  Link: http://lkml.kernel.org/r/20190626121943.131390-2-glider@google.com
[glider@google.com: v9]
  Link: http://lkml.kernel.org/r/20190627130316.254309-2-glider@google.com
[glider@google.com: v10]
  Link: http://lkml.kernel.org/r/20190628093131.199499-2-glider@google.com
Link: http://lkml.kernel.org/r/20190617151050.92663-2-glider@google.comSigned-off-by: NAlexander Potapenko <glider@google.com>
Acked-by: NKees Cook <keescook@chromium.org>
Acked-by: Michal Hocko <mhocko@suse.cz>		[page and dmapool parts
Acked-by: James Morris <jamorris@linux.microsoft.com>]
Cc: Christoph Lameter <cl@linux.com>
Cc: Masahiro Yamada <yamada.masahiro@socionext.com>
Cc: "Serge E. Hallyn" <serge@hallyn.com>
Cc: Nick Desaulniers <ndesaulniers@google.com>
Cc: Kostya Serebryany <kcc@google.com>
Cc: Dmitry Vyukov <dvyukov@google.com>
Cc: Sandeep Patil <sspatil@android.com>
Cc: Laura Abbott <labbott@redhat.com>
Cc: Randy Dunlap <rdunlap@infradead.org>
Cc: Jann Horn <jannh@google.com>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Marco Elver <elver@google.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

6471384a

mm, memcg: introduce memory.events.local · 1e577f97

由 Shakeel Butt 提交于 7月 11, 2019

The memory controller in cgroup v2 exposes memory.events file for each
memcg which shows the number of times events like low, high, max, oom
and oom_kill have happened for the whole tree rooted at that memcg.
Users can also poll or register notification to monitor the changes in
that file. Any event at any level of the tree rooted at memcg will
notify all the listeners along the path till root_mem_cgroup. There are
existing users which depend on this behavior.

However there are users which are only interested in the events
happening at a specific level of the memcg tree and not in the events in
the underlying tree rooted at that memcg. One such use-case is a
centralized resource monitor which can dynamically adjust the limits of
the jobs running on a system. The jobs can create their sub-hierarchy
for their own sub-tasks. The centralized monitor is only interested in
the events at the top level memcgs of the jobs as it can then act and
adjust the limits of the jobs. Using the current memory.events for such
centralized monitor is very inconvenient. The monitor will keep
receiving events which it is not interested and to find if the received
event is interesting, it has to read memory.event files of the next
level and compare it with the top level one. So, let's introduce
memory.events.local to the memcg which shows and notify for the events
at the memcg level.

Now, does memory.stat and memory.pressure need their local versions. IMHO
no due to the no internal process contraint of the cgroup v2. The
memory.stat file of the top level memcg of a job shows the stats and
vmevents of the whole tree. The local stats or vmevents of the top level
memcg will only change if there is a process running in that memcg but v2
does not allow that. Similarly for memory.pressure there will not be any
process in the internal nodes and thus no chance of local pressure.

Link: http://lkml.kernel.org/r/20190527174643.209172-1-shakeelb@google.comSigned-off-by: NShakeel Butt <shakeelb@google.com>
Reviewed-by: NRoman Gushchin <guro@fb.com>
Acked-by: NJohannes Weiner <hannes@cmpxchg.org>
Acked-by: NMichal Hocko <mhocko@suse.com>
Cc: Vladimir Davydov <vdavydov.dev@gmail.com>
Cc: Chris Down <chris@chrisdown.name>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

1e577f97

mm, debug_pagealloc: use a page type instead of page_ext flag · 3972f6bb

由 Vlastimil Babka 提交于 7月 11, 2019

When debug_pagealloc is enabled, we currently allocate the page_ext
array to mark guard pages with the PAGE_EXT_DEBUG_GUARD flag.  Now that
we have the page_type field in struct page, we can use that instead, as
guard pages are neither PageSlab nor mapped to userspace.  This reduces
memory overhead when debug_pagealloc is enabled and there are no other
features requiring the page_ext array.

Link: http://lkml.kernel.org/r/20190603143451.27353-4-vbabka@suse.czSigned-off-by: NVlastimil Babka <vbabka@suse.cz>
Cc: Joonsoo Kim <iamjoonsoo.kim@lge.com>
Cc: Matthew Wilcox <willy@infradead.org>
Cc: "Kirill A. Shutemov" <kirill.shutemov@linux.intel.com>
Cc: Mel Gorman <mgorman@techsingularity.net>
Cc: Michal Hocko <mhocko@kernel.org>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

3972f6bb

openeuler / Kernel 接近 2 年 前同步成功

openeuler / Kernel
接近 2 年前同步成功