提交 · 8d6de52866dcf1d6cbdec9aa10f722dd43b2431f · openeuler / Kernel

21 11月, 2019 6 次提交

habanalabs: remove set but not used variable 'qman_base_addr' · 8d6de528

由 YueHaibing 提交于 10月 16, 2019

Fixes gcc '-Wunused-but-set-variable' warning:

drivers/misc/habanalabs/goya/goya.c: In function 'goya_init_mme_cmdq':
drivers/misc/habanalabs/goya/goya.c:1536:6: warning:
 variable 'qman_base_addr' set but not used [-Wunused-but-set-variable]

It is never used, so can be removed.
Reported-by: NHulk Robot <hulkci@huawei.com>
Signed-off-by: NYueHaibing <yuehaibing@huawei.com>
Reviewed-by: NOded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

8d6de528

habanalabs: add opcode to INFO IOCTL to return clock rate · 62c1e124

由 Oded Gabbay 提交于 10月 10, 2019

Add a new opcode to the INFO IOCTL to allow the user application to
retrieve the ASIC's current and maximum clock rate. The rate is
returned in MHz.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>
Reviewed-by: NTomer Tayar <ttayar@habana.ai>

62c1e124

habanalabs: set TPC Icache to 16 cache lines · 8fdacf2a

由 Oded Gabbay 提交于 10月 02, 2019

Reduce latency to memory during TPC kernel execution.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>
Reviewed-by: NTomer Tayar <ttayar@habana.ai>

8fdacf2a

habanalabs: Add a new H/W queue type · cb596aee

由 Tomer Tayar 提交于 10月 03, 2019

This patch adds a support for a new H/W queue type.
This type of queue is for DMA and compute engines jobs, for which
completion notification are sent by H/W.
Command buffer for this queue can be created either through the CB
IOCTL and using the retrieved CB handle, or by preparing a buffer on the
host or device SRAM/DRAM, and using the device address to that buffer.
The patch includes the handling of the 2 options, as well as the
initialization of the H/W queue and its jobs scheduling.
Signed-off-by: NTomer Tayar <ttayar@habana.ai>
Reviewed-by: NOded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

cb596aee

habanalabs: Mark queue as expecting CB handle or address · df762375

由 Tomer Tayar 提交于 10月 03, 2019

Jobs on some queues must be provided with a handle to a driver command
buffer object, while for other queues, jobs must be provided with an
address to a command buffer.
Currently the distinction is done based on the queue type, which is less
flexible if the same queue type behaves differently on different
types of ASICs.
This patch adds a new queue property for this target, which is
configured per queue type per ASIC type.
Signed-off-by: NTomer Tayar <ttayar@habana.ai>
Reviewed-by: NOded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

df762375

habanalabs: handle F/W failure for sensor initialization · abb7e16f

由 Oded Gabbay 提交于 9月 16, 2019

In case the F/W fails to initialize the thermal sensors, print an
appropriate error message to kernel log and fail the device
initialization.
Reviewed-by: NTomer Tayar <ttayar@habana.ai>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

abb7e16f

05 9月, 2019 6 次提交

habanalabs: stop using the acronym KMD · 4c172bbf

由 Oded Gabbay 提交于 8月 30, 2019

We want to stop using the acronym KMD. Therefore, replace all locations
(except for register names we can't modify) where KMD is written to other
terms such as "Linux kernel driver" or "Host kernel driver", etc.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>
Reviewed-by: NOmer Shpigelman <oshpigelman@habana.ai>

4c172bbf

habanalabs: display card name as sensors header · 0996bd1c

由 Oded Gabbay 提交于 8月 30, 2019

To allow the user to use a custom file for the HWMON lm-sensors library
per card type, the driver needs to register the HWMON sensors with the
specific card type name.

The card name is supplied by the F/W running on the device. If the F/W is
old and doesn't supply a card name, a default card name is displayed as
the sensors group name.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>
Reviewed-by: NOmer Shpigelman <oshpigelman@habana.ai>

0996bd1c

habanalabs: add uapi to retrieve aggregate H/W events · e9730763

由 Oded Gabbay 提交于 8月 28, 2019

Add a new opcode to INFO IOCTL to retrieve aggregate H/W events. i.e. the
events counters are NOT cleared upon device reset, but count from the
loading of the driver.

Add the code to support it in the device event handling function.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>
Reviewed-by: NOmer Shpigelman <oshpigelman@habana.ai>

e9730763

habanalabs: Make the Coresight timestamp perpetual · 413cf576

由 Tomer Tayar 提交于 8月 27, 2019

The Coresight timestamp is enabled for a specific debug session using
the HL_DEBUG_OP_TIMESTAMP opcode of the debug IOCTL.
In order to have a perpetual timestamp that would be comparable between
various debug sessions, this patch moves the timestamp enablement to be
part of the HW initialization.
The HL_DEBUG_OP_TIMESTAMP opcode turns to be deprecated and shouldn't be
used. Old user-space that will call it won't see any change in the
behavior of the debug session.
Signed-off-by: NTomer Tayar <ttayar@habana.ai>
Reviewed-by: NOded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

413cf576

habanalabs: Add descriptive name to PSOC app status register · 10d7de2c

由 Tomer Tayar 提交于 8月 01, 2019

Add a meaningful name to the general PSOC application status register
which better describes its usage in keeping the HW state.
Signed-off-by: NTomer Tayar <ttayar@habana.ai>
Reviewed-by: NOded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

10d7de2c

habanalabs: Add descriptive names to PSOC scratch-pad registers · 4095a176

由 Tomer Tayar 提交于 8月 01, 2019

The PSOC scratch-pad registers are used for communication with the
device CPU. This patch adds new definitions for these registers which
are more descriptive than their general names.

The new set of definitions also gathers and documents the current usage
of the scratch-pad registers by the driver and the device CPU.
Signed-off-by: NTomer Tayar <ttayar@habana.ai>
Reviewed-by: NOded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

4095a176

12 8月, 2019 3 次提交

habanalabs: fix device IRQ unmasking for BE host · b421d83a

由 Ben Segal 提交于 8月 07, 2019

When unmasking IRQs inside the ASIC, the driver passes an array of all the
IRQ to unmask. The ASIC's CPU is working in LE so when running in a BE
host, the driver needs to do the proper endianness swapping when preparing
this array.

In addition, this patch also fixes the endianness of a couple of kernel log
debug messages that print values of packets
Signed-off-by: NBen Segal <bpsegal20@gmail.com>
Reviewed-by: NOded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

b421d83a

habanalabs: fix endianness handling for internal QMAN submission · b9040c99

由 Oded Gabbay 提交于 8月 08, 2019

The PQs of internal H/W queues (QMANs) can be located in different memory
areas for different ASICs. Therefore, when writing PQEs, we need to use
the correct function according to the location of the PQ. e.g. if the PQ
is located in the device's memory (SRAM or DRAM), we need to use
memcpy_toio() so it would work in architectures that have separate
address ranges for IO memory.

This patch makes the code that writes the PQE to be ASIC-specific so we
can handle this properly per ASIC.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>
Tested-by: NBen Segal <bpsegal20@gmail.com>

b9040c99

habanalabs: fix endianness handling for packets from user · 213ad5ad

由 Ben Segal 提交于 8月 01, 2019

Packets that arrive from the user and need to be parsed by the driver are
assumed to be in LE format.

This patch fix all the places where the code handles these packets and use
the correct endianness macros to handle them, as the driver handles the
packets in CPU format (LE or BE depending on the arch).
Signed-off-by: NBen Segal <bpsegal20@gmail.com>
Reviewed-by: NOded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

213ad5ad

29 7月, 2019 1 次提交

habanalabs: fix host memory polling in BE architecture · 2aa4e410

由 Ben Segal 提交于 7月 18, 2019

This patch fix a bug in the host memory polling macro. The bug is that the
memory being polled can be written by the device, which always writes it
in LE. However, if the host is running Linux in BE mode, we need to
convert the value that was written by the device before matching it to the
required value that the caller has given to the macro.
Signed-off-by: NBen Segal <bpsegal20@gmail.com>
Reviewed-by: NOded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

2aa4e410

01 7月, 2019 3 次提交

habanalabs: Add busy engines bitmask to HW idle IOCTL · e8960ca0

由 Tomer Tayar 提交于 7月 01, 2019

The information which is currently provided as a response to the
"HL_INFO_HW_IDLE" IOCTL is merely a general boolean value.
This patch extends it and provides also a bitmask that indicates which
of the device engines are busy.
Signed-off-by: NTomer Tayar <ttayar@habana.ai>
Reviewed-by: NOded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

e8960ca0

habanalabs: Add debugfs node for engines status · 06deb86a

由 Tomer Tayar 提交于 7月 01, 2019

Command submissions sent to the device are composed of command buffers
which are targeted to different device engines, like DMA and compute
entities. When a command submission gets stuck, knowing in which engine
the stuck is, is crucial for debugging.
This patch adds a debugfs node that exports this information, by
displaying the engines' various registers that assemble their idle/busy
status.
The information retrieval is based on the is_device_idle ASIC function.
The printout in this function, of the first detected busy engine, is
removed because it becomes redundant in the presence of the more
elaborated info of the new debugfs node.
Signed-off-by: NTomer Tayar <ttayar@habana.ai>
Reviewed-by: NOded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

06deb86a

habanalabs: Update the device idle check · ac6183ae

由 Tomer Tayar 提交于 7月 01, 2019

The patch updates the device idle check:
- Add reading the DMA core status register, because it is possible that
  a QMAN has finished its work but the DMA itself is still running.
- Remove the MME shadow status check, as the MME ARCH status register
  includes the status of all MME shadows.
Signed-off-by: NTomer Tayar <ttayar@habana.ai>
Reviewed-by: NOded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

ac6183ae

27 6月, 2019 1 次提交

habanalabs: don't reset device when getting VRHOT · 717261e1

由 Oded Gabbay 提交于 6月 27, 2019

VRHOT event from the F/W indicates the device has reached a temperature of
100 Celsius degrees. In this case, the driver should only print this
information to the kernel log. The device will shutdown itself
automatically when reaching 125 degrees.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

717261e1

08 7月, 2019 1 次提交

habanalabs: use %pad for printing a dma_addr_t · f62fa0ce

由 Arnd Bergmann 提交于 7月 08, 2019

dma_addr_t might be different sizes depending on the configuration,
so we cannot print it as %llx:

drivers/misc/habanalabs/goya/goya.c: In function 'goya_sw_init':
drivers/misc/habanalabs/goya/goya.c:698:21: error: format '%llx' expects argument of type 'long long unsigned int', but argument 4 has type 'dma_addr_t' {aka 'unsigned int'} [-Werror=format=]

Use the special %pad format string. This requires passing the
argument by reference.

Fixes: 2a51558c ("habanalabs: remove DMA mask hack for Goya")
Signed-off-by: NArnd Bergmann <arnd@arndb.de>
Reviewed-by: NOded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

f62fa0ce

16 6月, 2019 1 次提交

habanalabs: Allow accessing host mapped addresses via debugfs · 4a0ce776

由 Tomer Tayar 提交于 6月 16, 2019

Allows using the addr/data32 debugfs nodes to access a device VA of a
host mapped memory when the IOMMU is disabled.

Due to the possible large amount of a user host mapped memory, the
driver doesn't maintain a database with the host addresses per device VA.
When the IOMMU is disabled, this missing info is being overcome by
simply using phys_to_virt(). However, this is not useful when the IOMMU
is enabled, and thus the enforced limitation.
Signed-off-by: NTomer Tayar <ttayar@habana.ai>
Reviewed-by: NOded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

4a0ce776

29 5月, 2019 4 次提交

habanalabs: remove DMA mask hack for Goya · 2a51558c

由 Oded Gabbay 提交于 5月 29, 2019

This patch removes the non-standard DMA mask setting for Goya. Now that
the device CPU goes through the MMU, we are not limited to allocating the
CPU accessible memory area in the address space of under 39 bits.
Therefore, we don't need to set the DMA masking twice during
initialization, a practice that is not working on POWER architecture.

The patch sets the DMA mask to 48 bits once during the initialization. The
address of the CPU accessible memory area is configured to the MMU and the
matching VA is given to the device CPU.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

2a51558c

habanalabs: set Goya CPU to use ASIC MMU · f09415f5

由 Oded Gabbay 提交于 5月 29, 2019

This patch configures the Goya CPU to actually go through the MMU for
translation. The configuration is done after the configuration of the
relevant MMU mappings.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

f09415f5

habanalabs: add MMU mappings for Goya CPU · 95b5a8b8

由 Oded Gabbay 提交于 5月 29, 2019

This patch adds the necessary MMU mappings for the Goya CPU to access the
device DRAM and the host memory.

The first 256MB of the device DRAM is being mapped. That's where the F/W
is running.

The 2MB area located on the host memory for the purpose of communication
between the driver and the device CPU is also being mapped.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

95b5a8b8

habanalabs: initialize device CPU queues after MMU init · 0b28d26b

由 Oded Gabbay 提交于 5月 29, 2019

This patch changes the order of H/W IP initializations. The MMU needs to
be initialized before the device CPU queues, because the CPU will go
through the ASIC MMU in order to reach the host memory (where the queues
are located).
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

0b28d26b

30 5月, 2019 2 次提交

habanalabs: restore unsecured registers default values · 5c823ae1

由 Dalit Ben Zoor 提交于 5月 30, 2019

unsecured registers can be changed by the user, and hence should be
restored to their default values in context switch
Signed-off-by: NDalit Ben Zoor <dbenzoor@habana.ai>
Reviewed-by: NOded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

5c823ae1

habanalabs: clear sobs and monitors in context switch · 9c46f7b1

由 Dalit Ben Zoor 提交于 5月 30, 2019

On context switch we need to ensure that each user is not be affected by
other user, so we need to clear sync objects and monitors in context
switch instead of in restore_phase_topology function.
Signed-off-by: NDalit Ben Zoor <dbenzoor@habana.ai>
Reviewed-by: NOded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

9c46f7b1

25 5月, 2019 1 次提交

habanalabs: halt debug engines on user process close · 89225ce4

由 Omer Shpigelman 提交于 5月 01, 2019

This patch fix a potential bug where a user's process has closed
unexpectedly without disabling the debug engines. In that case, the debug
engines might continue running but because the user's MMU mappings are
going away, we will get page fault errors.

This behavior is also opposed to the general rule where nothing runs on
the device after the user process closes.

The patch stops the debug H/W engines upon process termination and thus
makes sure nothing runs on the device after the process goes away.
Signed-off-by: NOmer Shpigelman <oshpigelman@habana.ai>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

89225ce4

17 5月, 2019 1 次提交

habanalabs: don't limit packet size for device CPU · cbb10f1e

由 Oded Gabbay 提交于 5月 17, 2019

This patch removes a limitation on the maximum packet size that is read by
the device CPU as that limitation is not needed.

Therefore, the patch also removes an elaborate calculation that is based
on this limitation which is also not needed now. Instead, use a fixed
value for the memory pool size of the packets.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

cbb10f1e

16 5月, 2019 1 次提交

habanalabs: support device memory memset > 4GB · ac742737

由 Oded Gabbay 提交于 5月 16, 2019

This patch adds support to the goya memset function to perform memset to
device memory with size larger then 4GB. In this case, we need to use
multiple LIN_DMA packets because a single packet supports up to 4GB.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

ac742737

14 5月, 2019 1 次提交

habanalabs: print event name for fatal and non-RAZWI events · 460696ed

由 Omer Shpigelman 提交于 5月 13, 2019

This patch improves the error reporting in case of fatal and non-RAZWI
events such that the event name is printed in addition to the IRQ number.
Signed-off-by: NOmer Shpigelman <oshpigelman@habana.ai>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

460696ed

12 5月, 2019 1 次提交

habanalabs: pass device pointer to asic-specific function · 921a465b

由 Oded Gabbay 提交于 5月 12, 2019

This patch adds a new parameter that is passed to the
add_end_of_cb_packets() asic-specific function.

The parameter is the pointer to the driver's device structure. The
function needs this pointer for future ASICs.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

921a465b

09 5月, 2019 3 次提交

habanalabs: change polling functions to macros · a08b51a9

由 Oded Gabbay 提交于 5月 09, 2019

This patch changes two polling functions to macros, in order to make their
API the same as the standard readl_poll_timeout so we would be able to
define the "condition for exit" when calling these macros.

This will simplify the code as it will eliminate the need to check both
for timeout and for the (cond) in the calling function.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

a08b51a9

habanalabs: remove redundant memory clear · 1f2c999b

由 Oded Gabbay 提交于 5月 09, 2019

The driver allocates memory for fence object with GFP_ZERO flag, so there
is no need to explicitly write 0 to the allocated object after the
allocation.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

1f2c999b

habanalabs: remove redundant CB size adjustment · cbe722e4

由 Oded Gabbay 提交于 5月 09, 2019

Driver-initiated DMA jobs are synchronized jobs, i.e. the driver polls on
fence object until the job is finished. There is no interrupt from the
device. Therefore, no need to add space for 2 * msg_prot packets to the
end of the CB. Only a single msg_prot is needed (to write the fence).
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

cbe722e4

08 5月, 2019 1 次提交

habanalabs: check to load F/W before boot status · 0c169b8a

由 Oded Gabbay 提交于 5月 08, 2019

This patch changes the order of checks when initializing the device CPU.
We want first to check if we need to load the F/W, and only if we need to,
then we want to check the status of the CPU boot program.
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

0c169b8a

05 5月, 2019 1 次提交

habanalabs: remove redundant CPU checks · 34a5fab7

由 Omer Shpigelman 提交于 5月 05, 2019

This patch removes redundant CPU availability checks in:
goya_test_queues() - will be done in goya_test_cpu_queue().
goya_ring_doorbell() - was done earlier in goya_send_cpu_message().
Signed-off-by: NOmer Shpigelman <oshpigelman@habana.ai>
Reviewed-by: NOded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

34a5fab7

02 5月, 2019 2 次提交

habanalabs: Update CPU DMA memory label name · 9f832fda

由 Tomer Tayar 提交于 5月 02, 2019

The CPU accessible DMA memory is general and not used only for PQ.
Accordingly, this patch renames the "free_cpu_pq_dma_mem" label with
"free_cpu_dma_mem".
Signed-off-by: NTomer Tayar <ttayar@habana.ai>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

9f832fda

habanalabs: Update CPU DMA pool label name · ba209e15

由 Tomer Tayar 提交于 5月 02, 2019

The CPU accessible DMA pool is general and not used only for PQ.
Accordingly, this patch rename the "free_cpu_pq_pool" label with
"free_cpu_accessible_dma_pool".
Signed-off-by: NTomer Tayar <ttayar@habana.ai>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

ba209e15

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功