提交 · 9f201aba56b92c3daa4b76efae056ddbb80d91e6 · openeuler / Kernel

06 4月, 2019 3 次提交

habanalabs: prevent device PTE read/write during hard-reset · 9f201aba

由 Oded Gabbay 提交于 4月 06, 2019

During hard-reset, contexts are closed as part of the tear-down process.
After a context is closed, the driver cleans up the page tables of that
context in the device's DRAM. This action is both dangerous and
unnecessary.

It is unnecessary, because the device is going through a hard-reset, which
means the device's DRAM contents are no longer valid and the device's MMU
is being reset.

It is dangerous, because if the hard-reset came as a result of a PCI
freeze, this action may cause the entire host machine to hang.

Therefore, prevent all device PTE updates when a hard-reset operation is
pending.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

9f201aba

habanalabs: improve IOCTLs behavior when disabled or reset · 3f5398cf

由 Oded Gabbay 提交于 4月 06, 2019

This patch makes some improvement in how IOCTLs behave when the device is
disabled or under reset.

The new code checks, at the start of every IOCTL, if the device is
disabled or in reset. If so, it prints an appropriate kernel message and
returns -EBUSY to user-space.

In addition, the code modifies the location of where the
hard_reset_pending flag is being set or cleared:

1. It is now cleared immediately after the reset *tear-down* flow is
   finished but before the re-initialization flow begins.

2. It is being set in the remove function of the device, to make the
   behavior the same with the hard-reset flow

There are two exceptions to the disable or in reset check:

1. The HL_INFO_DEVICE_STATUS opcode in the INFO IOCTL. This opcode allows
   the user to inquire about the status of the device, whether it is
   operational, in reset or malfunction (disabled). If the driver will
   block this IOCTL, the user won't be able to retrieve the status in
   case of malfunction or in reset.

2. The WAIT_FOR_CS IOCTL. This IOCTL allows the user to inquire about the
   status of a CS. We want to allow the user to continue to do so, even if
   we started a soft-reset process because it will allow the user to get
   the correct error code for each CS he submitted.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

3f5398cf

habanalabs: all FD must be closed before removing device · caa3c8e5

由 Oded Gabbay 提交于 4月 06, 2019

This patch fixes a bug in the implementation of the function that removes
the device.

The bug can happen when the device is removed but not the driver itself
(e.g. remove by the OS due to PCI freeze in Power architecture).

In that case, there maybe open users that are calling IOCTLs while the
device is removed. This is a possible race condition that the driver must
handle. Otherwise, a kernel panic may occur.

This race is prevented in the hard-reset flow, because the driver makes
sure the users are closed before continuing with the hard-reset. This
race can not occur when the driver itself is removed because the OS makes
sure all the file descriptors are closed.

The fix is to make sure the open users close their file descriptors and if
they don't (after a certain amount of time), the driver sends them a
SIGKILL, because the remove of the device can't be stopped.

The patch re-uses the same code that is called from the hard-reset flow.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

caa3c8e5

04 4月, 2019 2 次提交

habanalabs: split mmu/no-mmu code paths in memory ioctl · 54303a1a

由 Oded Gabbay 提交于 4月 04, 2019

To make the memory ioctl code more readable, this patch moves the
legacy/debug code path of mmu-disabled to a separate function, which is
called (if necessary) from the main memory ioctl function.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

54303a1a

habanalabs: ASIC_AUTO_DETECT enum value is redundant · 29593840

由 Oded Gabbay 提交于 4月 04, 2019

This patch removes the enum value of ASIC_AUTO_DETECT because we can use
the validity of the pdev variable to know whether we have a real device or
a simulator. For a real device, we detect the asic type from the device ID
while for a simulator, the simulator code calls create_hdev() with the
specified ASIC type.

Set ASIC_INVALID as the first option in the enum to make sure that no
other enum value will receive the value 0 (which indicates a non-existing
entry in the simulator array).
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

29593840

02 4月, 2019 1 次提交

habanalabs: refactoring in goya.c · bedd1442

由 Oded Gabbay 提交于 4月 02, 2019

This patch does some refactoring in goya.c to make code more reusable
between goya code and the goya simulator code (which is not upstreamed).

In addition, the patch removes some dead functions from goya.c which are
not used by the current upstream code
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

bedd1442

03 4月, 2019 1 次提交

uapi/habanalabs: fix some comments in uapi file · 90027296

由 Oded Gabbay 提交于 4月 03, 2019

This patch adds a better explanation about the sequence number that is
returned per CS. It also fixes the comment about queue numbering rules.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

90027296

02 4月, 2019 3 次提交

habanalabs: add goya implementation for debug configuration · 8ba2876d

由 Omer Shpigelman 提交于 4月 01, 2019

This patch adds the ASIC-specific function for GOYA to configure the
coresight components.

Most of the components have an enabled/disabled flag, depending on whether
the user wants to enable the component or disable it.

For some of the components, such as ETR and SPMU, the user can also
request to read values from them. Those values are needed for the user to
parse the trace data.

The ETR configuration is also checked for security purposes, to make sure
the trace data is written to the device's DRAM.
Signed-off-by: NOmer Shpigelman <oshpigelman@habana.ai>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

8ba2876d

habanalabs: add new IOCTL for debug, tracing and profiling · 315bc055

由 Omer Shpigelman 提交于 4月 01, 2019

Habanalabs ASICs use the ARM coresight infrastructure to support debug,
tracing and profiling of neural networks topologies.

Because the coresight is configured using register writes and reads, and
some of the registers hold sensitive information (e.g. the address in
the device's DRAM where the trace data is written to), the user must go
through the kernel driver to configure this mechanism.

This patch implements the common code of the IOCTL and calls the
ASIC-specific function for the actual H/W configuration.

The IOCTL supports configuration of seven coresight components:
ETR, ETF, STM, FUNNEL, BMON, SPMU and TIMESTAMP

The user specifies which component he wishes to configure and provides a
pointer to a structure (located in its process space) that contains the
relevant configuration.

The common code copies the relevant data from the user-space to kernel
space and then calls the ASIC-specific function to do the H/W
configuration.

After the configuration is done, which is usually composed
of several IOCTL calls depending on what the user wanted to trace, the
user can start executing the topology. The trace data will be written to
the user's area in the device's DRAM.

After the tracing operation is complete, and user will call the IOCTL
again to disable the tracing operation. The user also need to read
values from registers for some of the components (e.g. the size of the
trace data in the device's DRAM). In that case, the user will provide a
pointer to an "output" structure in user-space, which the IOCTL code will
fill according the to selected component.
Signed-off-by: NOmer Shpigelman <oshpigelman@habana.ai>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

315bc055

habanalabs: remove extra semicolon · a1c92d1c

由 Oded Gabbay 提交于 4月 02, 2019

This patch removes an extra ; after the closing brackets of a while loop.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>
Reviewed-by: NMukesh Ojha <mojha@codeaurora.org>

a1c92d1c

01 4月, 2019 1 次提交

habanalabs: prevent CPU soft lockup on Palladium · e850b89f

由 Oded Gabbay 提交于 3月 31, 2019

Unmapping ptes in the device MMU on Palladium can take a long time, which
can cause a kernel BUG of CPU soft lockup.

This patch minimize the chances for this bug by sleeping a little between
unmapping ptes.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

e850b89f

31 3月, 2019 1 次提交

habanalabs: remove trailing blank line from EOF · 9336c021

由 Oded Gabbay 提交于 3月 31, 2019

GIT does not like extra blank lines at the end of the file, so this patch
removes those lines.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>
Reviewed-by: NMukesh Ojha <mojha@codeaurora.org>

9336c021

27 3月, 2019 1 次提交

habanalabs: improve error messages · cab8e3e2

由 Oded Gabbay 提交于 3月 27, 2019

This patch improves two error messages to help the user to
better understand what error occurred.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

cab8e3e2

24 3月, 2019 1 次提交

habanalabs: add device status option to INFO IOCTL · aa957088

由 Dalit Ben Zoor 提交于 3月 24, 2019

This patch adds a new opcode to INFO IOCTL that returns the device status.

This will allow users to query the device status in order to avoid sending
command submissions while device is in reset.
Signed-off-by: NDalit Ben Zoor <dbenzoor@habana.ai>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

aa957088

21 3月, 2019 1 次提交

habanalabs: allow user to modify TPC clock relaxation value · 9354c29e

由 Dalit Ben Zoor 提交于 3月 21, 2019

This patch allows the user to modify the TPC PLL clock relaxation value
on-the-fly in order to reduce power consumption.

To enable this, the patch removes the protection from the specific
register that controls this behavior.
Signed-off-by: NDalit Ben Zoor <dbenzoor@habana.ai>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

9354c29e

20 3月, 2019 1 次提交

habanalabs: set new golden value to tpc clock relaxation · a691a1eb

由 Dalit Ben Zoor 提交于 3月 20, 2019

On init or context switch, set TPC clock relaxation counter
register to a golden value.
Signed-off-by: NDalit Ben Zoor <dbenzoor@habana.ai>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

a691a1eb

17 3月, 2019 1 次提交

habanalabs: never fail hard reset of device · 0878a420

由 Oded Gabbay 提交于 3月 17, 2019

Hard-reset of our device should never fail, due to dangers of permanent
damage to the H/W.

This patch removes the last place in the reset path where the driver might
exit before doing the actual reset.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

0878a420

08 3月, 2019 1 次提交

habanalabs: keep track of the device's dma mask · d9973871

由 Oded Gabbay 提交于 3月 07, 2019

This patch refactors the code that is responsible to set the DMA mask for
the device.

Upon each change of the dma mask, the driver will save the new value that
was set. This is needed in order to make sure we don't try to increase the
mask a second time, in case we failed in the first time. This is
especially relevant for Power machines, as that may cause a change in
configuration of the TVT which will break the device.

Goya will first try to set the device's dma mask to 39 bits, so that the
memory that is allocated on the host machine for communication with the
device's cpu will be in a bus address which is lower then 39 bits. Later,
Goya will try to increase that mask to 48 bits, but only if setting the
mask to 39 bits was successful.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

d9973871

24 2月, 2019 1 次提交

habanalabs: add MMU shadow mapping · 66542c3b

由 Omer Shpigelman 提交于 2月 24, 2019

This patch adds shadow mapping to the MMU module. The shadow mapping
allows traversing the page table in host memory rather reading each PTE
from the device memory.
It brings better performance and avoids reading from invalid device
address upon PCI errors.
Only at the end of map/unmap flow, writings to the device are performed in
order to sync the H/W page tables with the shadow ones.
Signed-off-by: NOmer Shpigelman <oshpigelman@habana.ai>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

66542c3b

12 3月, 2019 1 次提交

habanalabs: Allow accessing DRAM virtual addresses via debugfs · d75bcf3e

由 Tomer Tayar 提交于 3月 12, 2019

The addr/data32 debugfs nodes currently permit the access to only physical
addresses of a device. This patch extends it and allows accessing also
device's DRAM virtual addresses.
Signed-off-by: NTomer Tayar <ttayar@habana.ai>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

d75bcf3e

07 3月, 2019 2 次提交

habanalabs: Add a printout with the name of a busy engine · c811f7bc

由 Tomer Tayar 提交于 3月 07, 2019

Print the name of a busy engine when checking if a device is idle.
The change is done mainly to help a user to pinpoint problems in his
topology's recipe.
Signed-off-by: NTomer Tayar <ttayar@habana.ai>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

c811f7bc

uapi/habanalabs: add some comments in habanalabs.h · e1266004

由 Oded Gabbay 提交于 3月 07, 2019

This patch adds two comments in uapi/habanalabs.h:
- From which queue id the internal queues begin
- Invalid values that can be returned in the seq field from the CS IOCTL
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

e1266004

06 3月, 2019 1 次提交

habanalabs: Remove unneeded function pointers · 393e5b55

由 Tomer Tayar 提交于 3月 06, 2019

Remove pointers to ASIC-specific functions and instead call the functions
explicitly as they are not accessed from outside the ASIC-specific files.
Signed-off-by: NTomer Tayar <ttayar@habana.ai>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

393e5b55

05 3月, 2019 2 次提交

habanalabs: Move PCI code into common file · b6f897d7

由 Tomer Tayar 提交于 3月 05, 2019

Move duplicated PCI-related code from ASIC-specific files into the common
pci.c file.
Signed-off-by: NTomer Tayar <ttayar@habana.ai>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

b6f897d7

habanalabs: ratelimit warnings at start of IOCTLs · 680cb399

由 Oded Gabbay 提交于 3月 05, 2019

At the start of some IOCTLs we check if the device is disabled or in reset.
If it is, we return -EBUSY and print a message to kernel log.

Because these IOCTLs can be called at very high frequency, use ratelimit
to avoid spamming the kernel log. Also use the same type of message -
dev_warn - in all the relevant IOCTLs.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

680cb399

28 2月, 2019 1 次提交

habanalabs: remove unused defines · e0a29952

由 Oded Gabbay 提交于 2月 28, 2019

This patch removes some old defines which are not in use anymore.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

e0a29952

04 3月, 2019 2 次提交

habanalabs: use EQ MSI/X ID per chip · c535bfdd

由 Oded Gabbay 提交于 3月 04, 2019

The Event Queue MSI/X ID is different per ASIC. This patch renames the
current define to have the GOYA_ prefix to mark it only for Goya. It also
moves it from the common armcp_if.h file to the ASIC specific goya_fw_if.h
file.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

c535bfdd

habanalabs: Move device CPU code into common file · 3110c60f

由 Tomer Tayar 提交于 3月 04, 2019

This patch moves the code that is responsible of the communication
vs. the F/W to a dedicated file. This will allow us to share the code
between different ASICs.
Signed-off-by: NTomer Tayar <ttayar@habana.ai>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

3110c60f

27 2月, 2019 1 次提交

habanalabs: remove implicit include from header files · 5eb42044

由 Dotan Barak 提交于 2月 27, 2019

This will prevent unneeded include of header files, which may increase
compilation time.
Signed-off-by: NDotan Barak <dbarak@habana.ai>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

5eb42044

24 2月, 2019 2 次提交

habanalabs: rename goya_non_fatal_events array to all events · b24ca458

由 Oded Gabbay 提交于 2月 24, 2019

The goya_non_fatal_events array actually contains all the possible events
the driver can receive from the F/W. Therefore, use a proper name
for the array.

The patch also adds missing event Ids to the goya_async_event_id enum.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

b24ca458

habanalabs: add new device CPU boot status · 0ca3b1b7

由 Igor Grinberg 提交于 2月 24, 2019

This patch adds a definition of a new status in the device CPU boot stages
and add the handling of the new status.
Signed-off-by: NIgor Grinberg <igrinberg@habana.ai>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

0ca3b1b7

02 4月, 2019 5 次提交

chardev: update comment based on the code · d358b173

由 Chengguang Xu 提交于 2月 15, 2019

The function comment of __register_chrdev_region()
is out of date, so update it based on the code.
Signed-off-by: NChengguang Xu <cgxu519@gmx.com>
Signed-off-by: NGreg Kroah-Hartman <gregkh@linuxfoundation.org>

d358b173

chardev: code cleanup for __register_chrdev_region() · 4b0be572

由 Chengguang Xu 提交于 2月 15, 2019

It's just code cleanup, not functional change.
Signed-off-by: NChengguang Xu <cgxu519@gmx.com>
Signed-off-by: NGreg Kroah-Hartman <gregkh@linuxfoundation.org>

4b0be572

chardev: add a check for given minor range · 4712d379

由 Chengguang Xu 提交于 2月 15, 2019

register_chrdev_region() carefully checks minor range
before calling __register_chrdev_region() but there is
another path from alloc_chrdev_region() which does not
check the range properly. So add a check for given minor
range in __register_chrdev_region().
Signed-off-by: NChengguang Xu <cgxu519@gmx.com>
Signed-off-by: NGreg Kroah-Hartman <gregkh@linuxfoundation.org>

4712d379

chardev: add additional check for minor range overlap · de36e16d

由 Chengguang Xu 提交于 2月 15, 2019

Current overlap checking cannot correctly handle
a case which is baseminor < existing baseminor &&
baseminor + minorct > existing baseminor + minorct.
Signed-off-by: NChengguang Xu <cgxu519@gmx.com>
Signed-off-by: NGreg Kroah-Hartman <gregkh@linuxfoundation.org>

de36e16d

VMCI: Use BIT() macro for bit definitions · 9a41691e

由 Vishnu DASA 提交于 3月 11, 2019

No functional changes, cleanup only.
Reviewed-by: NAdit Ranadive <aditr@vmware.com>
Reviewed-by: NJorgen Hansen <jhansen@vmware.com>
Signed-off-by: NVishnu Dasa <vdasa@vmware.com>
Signed-off-by: NGreg Kroah-Hartman <gregkh@linuxfoundation.org>

9a41691e

01 4月, 2019 2 次提交
- G
  Merge 5.1-rc3 into char-misc-next · 62fa7843
  由 Greg Kroah-Hartman 提交于 4月 01, 2019
```
We want the char-misc fixes in here as well.
Signed-off-by: NGreg Kroah-Hartman <gregkh@linuxfoundation.org>
```
  62fa7843
- L
  
  Linux 5.1-rc3 · 79a3aaa7
  由 Linus Torvalds 提交于 3月 31, 2019
  
  79a3aaa7
31 3月, 2019 2 次提交

Merge tag 'for-linus' of git://git.kernel.org/pub/scm/virt/kvm/kvm · 63fc9c23

由 Linus Torvalds 提交于 3月 31, 2019

Pull KVM fixes from Paolo Bonzini:
 "A collection of x86 and ARM bugfixes, and some improvements to
  documentation.

  On top of this, a cleanup of kvm_para.h headers, which were exported
  by some architectures even though they not support KVM at all. This is
  responsible for all the Kbuild changes in the diffstat"

* tag 'for-linus' of git://git.kernel.org/pub/scm/virt/kvm/kvm: (28 commits)
  Documentation: kvm: clarify KVM_SET_USER_MEMORY_REGION
  KVM: doc: Document the life cycle of a VM and its resources
  KVM: selftests: complete IO before migrating guest state
  KVM: selftests: disable stack protector for all KVM tests
  KVM: selftests: explicitly disable PIE for tests
  KVM: selftests: assert on exit reason in CR4/cpuid sync test
  KVM: x86: update %rip after emulating IO
  x86/kvm/hyper-v: avoid spurious pending stimer on vCPU init
  kvm/x86: Move MSR_IA32_ARCH_CAPABILITIES to array emulated_msrs
  KVM: x86: Emulate MSR_IA32_ARCH_CAPABILITIES on AMD hosts
  kvm: don't redefine flags as something else
  kvm: mmu: Used range based flushing in slot_handle_level_range
  KVM: export <linux/kvm_para.h> and <asm/kvm_para.h> iif KVM is supported
  KVM: x86: remove check on nr_mmu_pages in kvm_arch_commit_memory_region()
  kvm: nVMX: Add a vmentry check for HOST_SYSENTER_ESP and HOST_SYSENTER_EIP fields
  KVM: SVM: Workaround errata#1096 (insn_len maybe zero on SMAP violation)
  KVM: Reject device ioctls from processes other than the VM's creator
  KVM: doc: Fix incorrect word ordering regarding supported use of APIs
  KVM: x86: fix handling of role.cr4_pae and rename it to 'gpte_size'
  KVM: nVMX: Do not inherit quadrant and invalid for the root shadow EPT
  ...

63fc9c23

Merge branch 'x86-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 915ee0da

由 Linus Torvalds 提交于 3月 31, 2019

Pull x86 fixes from Thomas Gleixner:
 "A pile of x86 updates:

   - Prevent exceeding he valid physical address space in the /dev/mem
     limit checks.

   - Move all header content inside the header guard to prevent compile
     failures.

   - Fix the bogus __percpu annotation in this_cpu_has() which makes
     sparse very noisy.

   - Disable switch jump tables completely when retpolines are enabled.

   - Prevent leaking the trampoline address"

* 'x86-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
  x86/realmode: Make set_real_mode_mem() static inline
  x86/cpufeature: Fix __percpu annotation in this_cpu_has()
  x86/mm: Don't exceed the valid physical address space
  x86/retpolines: Disable switch jump tables when retpolines are enabled
  x86/realmode: Don't leak the trampoline kernel address
  x86/boot: Fix incorrect ifdeffery scope
  x86/resctrl: Remove unused variable

915ee0da

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功