Commits · 018e0e3594f7dcd029d258e368c485e742fa9cdb · openeuler / Kernel

21 Nov, 2019 · 14 commits

habanalabs: use defines for F/W files · da1342a0

Committed by Oded Gabbay on Nov 16, 2019

Make the code more concise and maintainable by using defines for the F/W
files.
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>
Reviewed-by: Omer Shpigelman <oshpigelman@habana.ai>

habanalabs: remove prints on successful device initialization · 7fbdc12b

Committed by Oded Gabbay on Nov 16, 2019

Successful device initialization is mentioned in the kernel log with the
message "Successfully added device to habanalabs driver". There is no point
in spamming the log with additional messages about successful queue
testing, which are implied by the above-mentioned message.
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>
Reviewed-by: Omer Shpigelman <oshpigelman@habana.ai>

habanalabs: prevent read/write from/to the device during hard reset · bc75d799

Committed by Omer Shpigelman on Nov 14, 2019

During hard reset we should not access the device, except for the necessary
reset operations, because the device might be stuck or unresponsive.
Signed-off-by: Omer Shpigelman <oshpigelman@habana.ai>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

habanalabs: split MMU properties to PCI/DRAM · 54bb6744

Committed by Omer Shpigelman on Nov 14, 2019

Split the properties used for MMU mappings to DRAM and PCI (host) types.
This is a prerequisite for future ASICs support.
Note that in Goya ASIC, the PMMU and DMMU are the same (except for page
sizes) as only one MMU mechanism is used for both of the mapping types.
Hence this patch should not have any effect on current behavior.
Signed-off-by: Omer Shpigelman <oshpigelman@habana.ai>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

habanalabs: type specific MMU cache invalidation · 7b6e4ea0

Committed by Omer Shpigelman on Nov 14, 2019

Add the ability to invalidate the necessary MMU cache only.
This ability is a prerequisite for future ASICs support.
Note that in Goya ASIC, a single cache is used for both host/DRAM
mappings and hence this patch should not have any effect on current
behavior.
Signed-off-by: Omer Shpigelman <oshpigelman@habana.ai>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

habanalabs: re-factor memory module code · 7f74d4d3

Committed by Omer Shpigelman on Aug 12, 2019

Some of the functions in the memory module code were too long and/or
contained multiple operations that are not always done together. Re-factor
the code by dividing those functions into smaller functions which are more
readable and maintainable.
Signed-off-by: Omer Shpigelman <oshpigelman@habana.ai>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

habanalabs: read F/W versions before failure · f05912d8

Committed by Oded Gabbay on Oct 20, 2019

Move the read of the F/W boot versions before exiting on possible failures
of the F/W boot. This will help debug boot failures as we will be able to
know the F/W boot version.
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>
Reviewed-by: Omer Shpigelman <oshpigelman@habana.ai>

habanalabs: expose card name in INFO IOCTL · 91edbf2c

Committed by Oded Gabbay on Oct 16, 2019

To enable userspace processes, e.g. management utilities, to display the
card name to the user, add the card name property to the HW_IP
structure that is copied to the user in the INFO IOCTL.
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

habanalabs: remove set but not used variable 'qman_base_addr' · 8d6de528

Committed by YueHaibing on Oct 16, 2019

Fixes gcc '-Wunused-but-set-variable' warning:

drivers/misc/habanalabs/goya/goya.c: In function 'goya_init_mme_cmdq':
drivers/misc/habanalabs/goya/goya.c:1536:6: warning:
 variable 'qman_base_addr' set but not used [-Wunused-but-set-variable]

It is never used, so it can be removed.
Reported-by: Hulk Robot <hulkci@huawei.com>
Signed-off-by: YueHaibing <yuehaibing@huawei.com>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

habanalabs: add opcode to INFO IOCTL to return clock rate · 62c1e124

Committed by Oded Gabbay on Oct 10, 2019

Add a new opcode to the INFO IOCTL to allow the user application to
retrieve the ASIC's current and maximum clock rate. The rate is
returned in MHz.
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>
Reviewed-by: Tomer Tayar <ttayar@habana.ai>

habanalabs: set TPC Icache to 16 cache lines · 8fdacf2a

Committed by Oded Gabbay on Oct 02, 2019

Reduce latency to memory during TPC kernel execution.
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>
Reviewed-by: Tomer Tayar <ttayar@habana.ai>

habanalabs: Add a new H/W queue type · cb596aee

Committed by Tomer Tayar on Oct 03, 2019

This patch adds support for a new H/W queue type.
This type of queue is for DMA and compute engine jobs, for which
completion notifications are sent by H/W.
A command buffer for this queue can be created either through the CB
IOCTL and using the retrieved CB handle, or by preparing a buffer on the
host or device SRAM/DRAM, and using the device address of that buffer.
The patch includes the handling of the 2 options, as well as the
initialization of the H/W queue and its job scheduling.
Signed-off-by: Tomer Tayar <ttayar@habana.ai>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

habanalabs: Mark queue as expecting CB handle or address · df762375

Committed by Tomer Tayar on Oct 03, 2019

Jobs on some queues must be provided with a handle to a driver command
buffer object, while for other queues, jobs must be provided with an
address to a command buffer.
Currently the distinction is made based on the queue type, which is less
flexible if the same queue type behaves differently on different
types of ASICs.
This patch adds a new queue property for this purpose, which is
configured per queue type per ASIC type.
Signed-off-by: Tomer Tayar <ttayar@habana.ai>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>
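
As a rough illustration of the idea in this commit, the sketch below attaches a per-queue flag to each queue instead of deriving the rule from the queue type. All names here (`queue_type`, `queue_props`, `requires_cb_handle`, `job_is_valid`) are hypothetical and chosen for this example; they are not taken from the driver source.

```c
#include <stdbool.h>

/* Hypothetical queue types, for illustration only. */
enum queue_type { QUEUE_TYPE_EXT, QUEUE_TYPE_INT, QUEUE_TYPE_HW };

/* Hypothetical per-queue properties, filled in by each ASIC's setup code. */
struct queue_props {
    enum queue_type type;
    bool requires_cb_handle; /* true: job carries a CB handle; false: a CB address */
};

/* A job is accepted only if it supplies what the queue expects. */
static bool job_is_valid(const struct queue_props *q, bool job_has_handle)
{
    return q->requires_cb_handle == job_has_handle;
}
```

With this shape, two ASICs can give the same queue type different values of the flag, which is the flexibility the commit message describes.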

habanalabs: handle F/W failure for sensor initialization · abb7e16f

Committed by Oded Gabbay on Sep 16, 2019

In case the F/W fails to initialize the thermal sensors, print an
appropriate error message to the kernel log and fail the device
initialization.
Reviewed-by: Tomer Tayar <ttayar@habana.ai>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

05 Sep, 2019 · 6 commits

habanalabs: stop using the acronym KMD · 4c172bbf

Committed by Oded Gabbay on Aug 30, 2019

We want to stop using the acronym KMD. Therefore, replace all locations
(except for register names we can't modify) where KMD is written to other
terms such as "Linux kernel driver" or "Host kernel driver", etc.
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>
Reviewed-by: Omer Shpigelman <oshpigelman@habana.ai>

habanalabs: display card name as sensors header · 0996bd1c

Committed by Oded Gabbay on Aug 30, 2019

To allow the user to use a custom file for the HWMON lm-sensors library
per card type, the driver needs to register the HWMON sensors with the
specific card type name.

The card name is supplied by the F/W running on the device. If the F/W is
old and doesn't supply a card name, a default card name is displayed as
the sensors group name.
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>
Reviewed-by: Omer Shpigelman <oshpigelman@habana.ai>

habanalabs: add uapi to retrieve aggregate H/W events · e9730763

Committed by Oded Gabbay on Aug 28, 2019

Add a new opcode to INFO IOCTL to retrieve aggregate H/W events, i.e. the
event counters are NOT cleared upon device reset, but count from the
loading of the driver.

Add the code to support it in the device event handling function.
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>
Reviewed-by: Omer Shpigelman <oshpigelman@habana.ai>
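
The difference between the two kinds of counters can be sketched as follows. This is a minimal user-space illustration, not the driver's code: `NUM_EVENTS`, `struct event_counters`, `record_event` and `on_device_reset` are names assumed for this example.

```c
#include <stdint.h>
#include <string.h>

#define NUM_EVENTS 8 /* hypothetical; the real number of events is ASIC-specific */

struct event_counters {
    uint32_t since_reset[NUM_EVENTS]; /* cleared on every device reset */
    uint32_t aggregate[NUM_EVENTS];   /* counts from driver load, never cleared */
};

/* Record one occurrence of an event in both counter sets. */
static void record_event(struct event_counters *c, unsigned int event_id)
{
    if (event_id >= NUM_EVENTS)
        return;
    c->since_reset[event_id]++;
    c->aggregate[event_id]++;
}

/* On device reset only the per-reset counters are zeroed. */
static void on_device_reset(struct event_counters *c)
{
    memset(c->since_reset, 0, sizeof(c->since_reset));
}
```

After two events, a reset, and one more event of the same type, `since_reset` holds 1 while `aggregate` holds 3.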

habanalabs: Make the Coresight timestamp perpetual · 413cf576

Committed by Tomer Tayar on Aug 27, 2019

The Coresight timestamp is enabled for a specific debug session using
the HL_DEBUG_OP_TIMESTAMP opcode of the debug IOCTL.
In order to have a perpetual timestamp that would be comparable between
various debug sessions, this patch moves the timestamp enablement to be
part of the HW initialization.
The HL_DEBUG_OP_TIMESTAMP opcode is now deprecated and shouldn't be
used. Old user-space that calls it won't see any change in the
behavior of the debug session.
Signed-off-by: Tomer Tayar <ttayar@habana.ai>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

habanalabs: Add descriptive name to PSOC app status register · 10d7de2c

Committed by Tomer Tayar on Aug 01, 2019

Add a meaningful name to the general PSOC application status register
which better describes its usage in keeping the HW state.
Signed-off-by: Tomer Tayar <ttayar@habana.ai>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

habanalabs: Add descriptive names to PSOC scratch-pad registers · 4095a176

Committed by Tomer Tayar on Aug 01, 2019

The PSOC scratch-pad registers are used for communication with the
device CPU. This patch adds new definitions for these registers which
are more descriptive than their general names.

The new set of definitions also gathers and documents the current usage
of the scratch-pad registers by the driver and the device CPU.
Signed-off-by: Tomer Tayar <ttayar@habana.ai>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

12 Aug, 2019 · 3 commits

habanalabs: fix device IRQ unmasking for BE host · b421d83a

Committed by Ben Segal on Aug 07, 2019

When unmasking IRQs inside the ASIC, the driver passes an array of all the
IRQs to unmask. The ASIC's CPU works in LE, so when running on a BE
host, the driver needs to do the proper endianness swapping when preparing
this array.

In addition, this patch also fixes the endianness of a couple of kernel log
debug messages that print values of packets.
Signed-off-by: Ben Segal <bpsegal20@gmail.com>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

habanalabs: fix endianness handling for internal QMAN submission · b9040c99

Committed by Oded Gabbay on Aug 08, 2019

The PQs of internal H/W queues (QMANs) can be located in different memory
areas for different ASICs. Therefore, when writing PQEs, we need to use
the correct function according to the location of the PQ. For example, if
the PQ is located in the device's memory (SRAM or DRAM), we need to use
memcpy_toio() so it works on architectures that have separate
address ranges for IO memory.

This patch makes the code that writes the PQE ASIC-specific so we
can handle this properly per ASIC.
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>
Tested-by: Ben Segal <bpsegal20@gmail.com>

habanalabs: fix endianness handling for packets from user · 213ad5ad

Committed by Ben Segal on Aug 01, 2019

Packets that arrive from the user and need to be parsed by the driver are
assumed to be in LE format.

This patch fixes all the places where the code handles these packets and
uses the correct endianness macros to handle them, as the driver handles the
packets in CPU format (LE or BE depending on the arch).
Signed-off-by: Ben Segal <bpsegal20@gmail.com>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

29 Jul, 2019 · 1 commit

habanalabs: fix host memory polling in BE architecture · 2aa4e410

Committed by Ben Segal on Jul 18, 2019

This patch fixes a bug in the host memory polling macro. The bug is that the
memory being polled can be written by the device, which always writes it
in LE. However, if the host is running Linux in BE mode, we need to
convert the value that was written by the device before matching it to the
required value that the caller has given to the macro.
Signed-off-by: Ben Segal <bpsegal20@gmail.com>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>
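
The conversion described in this commit can be illustrated with a small user-space sketch. It decodes the device-written little-endian word byte by byte before comparing it, so the comparison is correct on both LE and BE hosts. The function names `le32_to_host` and `poll_host_mem` are hypothetical; in kernel code the `le32_to_cpu()` helper serves this purpose, and the actual macro in the driver is not reproduced here.

```c
#include <stdbool.h>
#include <stdint.h>

/* Decode a 32-bit little-endian value byte by byte; host endianness does not matter. */
static uint32_t le32_to_host(const volatile uint8_t *p)
{
    return (uint32_t)p[0] | ((uint32_t)p[1] << 8) |
           ((uint32_t)p[2] << 16) | ((uint32_t)p[3] << 24);
}

/* Poll up to max_tries times until the device-written word equals 'expected'. */
static bool poll_host_mem(const volatile uint8_t *mem, uint32_t expected,
                          unsigned int max_tries)
{
    for (unsigned int i = 0; i < max_tries; i++) {
        if (le32_to_host(mem) == expected)
            return true;
    }
    return false;
}
```

Comparing the raw word without the conversion would only match on an LE host, which is the bug the commit message describes.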

01 Jul, 2019 · 3 commits

habanalabs: Add busy engines bitmask to HW idle IOCTL · e8960ca0

Committed by Tomer Tayar on Jul 01, 2019

The information which is currently provided as a response to the
"HL_INFO_HW_IDLE" IOCTL is merely a general boolean value.
This patch extends it and also provides a bitmask that indicates which
of the device engines are busy.
Signed-off-by: Tomer Tayar <ttayar@habana.ai>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>
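
A busy-engines bitmask of this kind could look roughly like the sketch below, with one bit per engine and the general boolean derived from the mask. The engine list and the names `engine_id`, `busy_engines_mask` and `device_is_idle` are assumptions made for this example; the real engine IDs are ASIC-specific and are not given in the commit message.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical engine indices, for illustration only. */
enum engine_id { ENGINE_DMA_0, ENGINE_DMA_1, ENGINE_TPC_0, ENGINE_MME_0, ENGINE_COUNT };

/* Build a mask with one bit set per busy engine. */
static uint32_t busy_engines_mask(const bool busy[ENGINE_COUNT])
{
    uint32_t mask = 0;

    for (int i = 0; i < ENGINE_COUNT; i++) {
        if (busy[i])
            mask |= (uint32_t)1 << i;
    }
    return mask;
}

/* The general boolean answer is simply "no bit is set". */
static bool device_is_idle(uint32_t mask)
{
    return mask == 0;
}
```

A caller that only needs the old boolean can ignore the mask, while a debugging tool can inspect individual bits to see which engine is still busy.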

habanalabs: Add debugfs node for engines status · 06deb86a

Committed by Tomer Tayar on Jul 01, 2019

Command submissions sent to the device are composed of command buffers
which are targeted to different device engines, like DMA and compute
entities. When a command submission gets stuck, knowing which engine
is stuck is crucial for debugging.
This patch adds a debugfs node that exports this information, by
displaying the engines' various registers that make up their idle/busy
status.
The information retrieval is based on the is_device_idle ASIC function.
The printout in this function, of the first detected busy engine, is
removed because it becomes redundant in the presence of the more
elaborate info of the new debugfs node.
Signed-off-by: Tomer Tayar <ttayar@habana.ai>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

habanalabs: Update the device idle check · ac6183ae

Committed by Tomer Tayar on Jul 01, 2019

The patch updates the device idle check:
- Add reading the DMA core status register, because it is possible that
  a QMAN has finished its work but the DMA itself is still running.
- Remove the MME shadow status check, as the MME ARCH status register
  includes the status of all MME shadows.
Signed-off-by: Tomer Tayar <ttayar@habana.ai>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

27 Jun, 2019 · 1 commit

habanalabs: don't reset device when getting VRHOT · 717261e1

Committed by Oded Gabbay on Jun 27, 2019

A VRHOT event from the F/W indicates the device has reached a temperature of
100 degrees Celsius. In this case, the driver should only print this
information to the kernel log. The device will shut itself down
automatically when reaching 125 degrees.
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

08 Jul, 2019 · 1 commit

habanalabs: use %pad for printing a dma_addr_t · f62fa0ce

Committed by Arnd Bergmann on Jul 08, 2019

dma_addr_t might be different sizes depending on the configuration,
so we cannot print it as %llx:

drivers/misc/habanalabs/goya/goya.c: In function 'goya_sw_init':
drivers/misc/habanalabs/goya/goya.c:698:21: error: format '%llx' expects argument of type 'long long unsigned int', but argument 4 has type 'dma_addr_t' {aka 'unsigned int'} [-Werror=format=]

Use the special %pad format string. This requires passing the
argument by reference.

Fixes: 2a51558c ("habanalabs: remove DMA mask hack for Goya")
Signed-off-by: Arnd Bergmann <arnd@arndb.de>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

16 Jun, 2019 · 1 commit

habanalabs: Allow accessing host mapped addresses via debugfs · 4a0ce776

Committed by Tomer Tayar on Jun 16, 2019

Allow using the addr/data32 debugfs nodes to access a device VA of
host mapped memory when the IOMMU is disabled.

Due to the possibly large amount of user host mapped memory, the
driver doesn't maintain a database with the host addresses per device VA.
When the IOMMU is disabled, this missing info is overcome by
simply using phys_to_virt(). However, this is not useful when the IOMMU
is enabled, and thus the enforced limitation.
Signed-off-by: Tomer Tayar <ttayar@habana.ai>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

29 May, 2019 · 4 commits

habanalabs: remove DMA mask hack for Goya · 2a51558c

Committed by Oded Gabbay on May 29, 2019

This patch removes the non-standard DMA mask setting for Goya. Now that
the device CPU goes through the MMU, we are not limited to allocating the
CPU accessible memory area in the address space of under 39 bits.
Therefore, we don't need to set the DMA mask twice during
initialization, a practice that does not work on the POWER architecture.

The patch sets the DMA mask to 48 bits once during the initialization. The
address of the CPU accessible memory area is configured to the MMU and the
matching VA is given to the device CPU.
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

habanalabs: set Goya CPU to use ASIC MMU · f09415f5

Committed by Oded Gabbay on May 29, 2019

This patch configures the Goya CPU to actually go through the MMU for
translation. The configuration is done after the configuration of the
relevant MMU mappings.
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

habanalabs: add MMU mappings for Goya CPU · 95b5a8b8

Committed by Oded Gabbay on May 29, 2019

This patch adds the necessary MMU mappings for the Goya CPU to access the
device DRAM and the host memory.

The first 256MB of the device DRAM is being mapped. That's where the F/W
is running.

The 2MB area located on the host memory for the purpose of communication
between the driver and the device CPU is also being mapped.
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

habanalabs: initialize device CPU queues after MMU init · 0b28d26b

Committed by Oded Gabbay on May 29, 2019

This patch changes the order of H/W IP initializations. The MMU needs to
be initialized before the device CPU queues, because the CPU will go
through the ASIC MMU in order to reach the host memory (where the queues
are located).
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

30 May, 2019 · 2 commits

habanalabs: restore unsecured registers default values · 5c823ae1

Committed by Dalit Ben Zoor on May 30, 2019

Unsecured registers can be changed by the user, and hence should be
restored to their default values on context switch.
Signed-off-by: Dalit Ben Zoor <dbenzoor@habana.ai>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

habanalabs: clear sobs and monitors in context switch · 9c46f7b1

Committed by Dalit Ben Zoor on May 30, 2019

On context switch we need to ensure that each user is not affected by
other users, so we need to clear sync objects and monitors in context
switch instead of in the restore_phase_topology function.
Signed-off-by: Dalit Ben Zoor <dbenzoor@habana.ai>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

25 May, 2019 · 1 commit

habanalabs: halt debug engines on user process close · 89225ce4

Committed by Omer Shpigelman on May 01, 2019

This patch fixes a potential bug where a user's process has closed
unexpectedly without disabling the debug engines. In that case, the debug
engines might continue running but because the user's MMU mappings are
going away, we will get page fault errors.

This behavior is also opposed to the general rule where nothing runs on
the device after the user process closes.

The patch stops the debug H/W engines upon process termination and thus
makes sure nothing runs on the device after the process goes away.
Signed-off-by: Omer Shpigelman <oshpigelman@habana.ai>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

17 May, 2019 · 1 commit

habanalabs: don't limit packet size for device CPU · cbb10f1e

Committed by Oded Gabbay on May 17, 2019

This patch removes a limitation on the maximum packet size that is read by
the device CPU as that limitation is not needed.

Therefore, the patch also removes an elaborate calculation that is based
on this limitation which is also not needed now. Instead, use a fixed
value for the memory pool size of the packets.
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

16 May, 2019 · 1 commit

habanalabs: support device memory memset > 4GB · ac742737

Committed by Oded Gabbay on May 16, 2019

This patch adds support to the goya memset function to perform memset to
device memory with a size larger than 4GB. In this case, we need to use
multiple LIN_DMA packets because a single packet supports up to 4GB.
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>
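
The splitting arithmetic behind this change can be sketched as below: a large memset is covered by several packets, each at most `max_chunk` bytes. The helper names `packets_needed` and `packet_size` are hypothetical, and the per-packet limit is left as a parameter because the exact chunk size the driver uses is not stated in the commit message.

```c
#include <stdint.h>

/* Number of packets needed to cover 'size' bytes with at most 'max_chunk' bytes each. */
static uint64_t packets_needed(uint64_t size, uint64_t max_chunk)
{
    if (max_chunk == 0)
        return 0;
    return size / max_chunk + (size % max_chunk != 0);
}

/* Size of packet 'idx' (0-based): every packet is full except possibly the last. */
static uint64_t packet_size(uint64_t size, uint64_t max_chunk, uint64_t idx)
{
    uint64_t left;

    if (max_chunk == 0 || idx >= packets_needed(size, max_chunk))
        return 0;
    left = size - idx * max_chunk; /* bytes not yet covered by earlier packets */
    return left < max_chunk ? left : max_chunk;
}
```

For example, a 5GB region with a 2GB per-packet limit needs three packets of 2GB, 2GB and 1GB.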

14 May, 2019 · 1 commit

habanalabs: print event name for fatal and non-RAZWI events · 460696ed

Committed by Omer Shpigelman on May 13, 2019

This patch improves the error reporting in case of fatal and non-RAZWI
events such that the event name is printed in addition to the IRQ number.
Signed-off-by: Omer Shpigelman <oshpigelman@habana.ai>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>