1. 17 7月, 2014 10 次提交
    • B
      amdkfd: Add process queue manager module · 45102048
      Ben Goz 提交于
      The queue scheduler divides into two sections, one section is process bounded
      and the other section is device bounded.
      The process bounded section is handled by this module. The PQM handles usermode
      queue setup, updates and tear-down.
      
      v3:
      
      Used kernel parameter to limit queues per process instead of define
      Added use of doorbell address from user
      
      v4:
      
      Modified pqm_create_queue so that only when creating usermode queues the
      driver should return the queue properties to the userspace.
      
      Added an info message print when no more queues can be opened because of the
      queue per process limitation
      
      v5:
      
      Move amdkfd from drm/radeon/ to drm/amd/
      Various fixes
      Signed-off-by: NBen Goz <ben.goz@amd.com>
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      45102048
    • B
      amdkfd: Add packet manager module · 241f24f8
      Ben Goz 提交于
      The packet manager module builds PM4 packets for the sole use of the CP
      scheduler. Those packets are used by the HIQ to submit runlists to the CP.
      
      v3:
      
      Removed include of cik_mqds.h
      Changed lower_32/upper_32 calls to use linux macros
      Used new gart allocation functions
      Added documentation
      
      v5:
      
      Move amdkfd from drm/radeon/ to drm/amd/
      Change format of mqd structure to match latest KV firmware
      Add support for AQL queues creation to enable working with open-source HSA
      runtime
      Always chain runlist if you have more than 1 process or if you have
      over-subscription over the number of queues.
      Various fixes (typos, style)
      Signed-off-by: NBen Goz <ben.goz@amd.com>
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      241f24f8
    • B
      amdkfd: Add module parameter of scheduling policy · 31c21fec
      Ben Goz 提交于
      This patch adds a new parameter to the amdkfd driver. This parameter enables
      the user to select the scheduling policy of the CP. The choices are:
      
      * CP Scheduling with support for over-subscription
      * CP Scheduling without support for over-subscription
      * Without CP Scheduling
      
      Note that the third option (Without CP scheduling) is only for debug purposes
      and bringup of new H/W. As such, it is _not_ guaranteed to work at all times on
      all H/W versions.
      
      v3: Fixed description of parameter, changed the permissions to read_only, added
      a verification of the value and added documentation
      
      v5: Set default sched_policy to HWS as it is now supported by firmware
      Signed-off-by: NBen Goz <ben.goz@amd.com>
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      31c21fec
    • B
      amdkfd: Add kernel queue module · ed6e6a34
      Ben Goz 提交于
      The kernel queue module enables the amdkfd to establish kernel queues, not
      exposed to user space.
      
      The kernel queues are used for HIQ (HSA Interface Queue) and DIQ (Debug
      Interface Queue) operations
      
      v3: Removed use of internal typedefs and added use of the new gart allocation
      functions
      
      v4: Fixed a miscalculation in kernel queue wrapping
      
      v5:
      
      Move amdkfd from drm/radeon/ to drm/amd/
      Change format of mqd structure to match latest KV firmware
      Add support for AQL queues creation to enable working with open-source HSA
      runtime
      Add define for kernel queue size
      Various fixes
      Signed-off-by: NBen Goz <ben.goz@amd.com>
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      ed6e6a34
    • B
      amdkfd: Add mqd_manager module · 6e99df57
      Ben Goz 提交于
      The mqd_manager module handles MQD data structures.
      MQD stands for Memory Queue Descriptor, which is used by the H/W to
      keep the usermode queue state in memory.
      
      v3:
      
      Removed new typedefs
      Removed pragma pack 4
      Remove cik_mqds.h file
      Changed lower_32/upper_32 calls to use linux macros
      Used new gart allocation functions
      Added documentation
      
      v4:
      
      Added missing initialization of the addr field in init_mqd()
      
      Setting the hqd persistent.preload_req bit ON so that when queues switches
      on/off, their context will kept and read from the mqd when the cp reassign
      them, and thus the dispatched workload context kept consistent without any
      interrupts.
      
      v5:
      
      Move amdkfd from drm/radeon/ to drm/amd/
      Change format of mqd structure to match latest KV firmware
      Add support for AQL queues creation to enable working with open-source HSA
      runtime.
      Various fixes
      Signed-off-by: NBen Goz <ben.goz@amd.com>
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      6e99df57
    • B
      amdkfd: Add queue module · ed8aab45
      Ben Goz 提交于
      The queue module enables allocating and initializing queues uniformly.
      
      v3: Removed typedef and redundant memset call. Broke long pr_debug print to one
      liners and Added documentation.
      
      v5: Move amdkfd from drm/radeon/ to drm/amd/
      Signed-off-by: NBen Goz <ben.goz@amd.com>
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      ed8aab45
    • O
      amdkfd: Add binding/unbinding calls to amd_iommu driver · b17f068a
      Oded Gabbay 提交于
      This patch adds the functions to bind and unbind pasid
      from a device through the amd_iommu driver.
      
      The unbind function is called when the mm_struct of the
      process is released.
      
      The bind function is not called here because it is called
      only in the IOCTLs which are not yet implemented at this
      stage of the patchset.
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      b17f068a
    • O
      amdkfd: Add basic modules to amdkfd · 19f6d2a6
      Oded Gabbay 提交于
      This patch adds the process module and three helper modules:
      
      - kfd_process, which handles process which open /dev/kfd
      
      - kfd_doorbell, which provides helper functions for doorbell allocation,
        release and mapping to userspace
      
      - kfd_pasid, which provides helper functions for pasid allocation and release
      
      - kfd_aperture, which provides helper functions for managing the LDS, Local GPU
        memory and Scratch memory apertures of the process
      
      This patch only contains the basic kfd_process module, which doesn't contain
      the reference to the queue scheduler. This was done to allow easier code review.
      
      Also, this patch doesn't contain the calls to the IOMMU driver for binding the
      pasid to the device. Again, this was done to allow easier code review
      
      The kfd_process object is created when a process opens /dev/kfd and is closed
      when the mm_struct of that process is teared-down.
      
      v3:
      
      Removed kfd_vidmem.c file
      Replaced direct mmput call to mmu_notifier release
      Removed typedefs
      Moved bool field to end of the structure
      Added new kernel params for gart usage limitation
      Added initialization of sa manager
      Fixed debug messages
      Remove support for LDS in 32 bit
      Changed code to support mmap of doorbell pages from userspace
      Added documentation for apertures
      
      v4: Replaced RCU by SRCU for kfd_process list management
      
      v5:
      
      Move amdkfd from drm/radeon/ to drm/amd/
      Rename kfd_aperture.c to kfd_flat_memory.c
      Protect against multiple init calls
      MQD size is H/W dependent so moved it to device info structure
      Rename kfd_mem_obj structure's members
      Use delayed function for process tear-down
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      19f6d2a6
    • E
      amdkfd: Add topology module to amdkfd · 5b5c4e40
      Evgeny Pinchuk 提交于
      This patch adds the topology module to the driver. The topology is exposed to
      userspace through the sysfs.
      
      The calls to add and remove a device to/from topology are done by the radeon
      driver.
      
      v3:
      
      The CPU information, that is provided in the topology section of the amdkfd
      driver, is extracted from the CRAT table. Unlike the CPU information located
      in /sys/devices/system/cpu/cpu*, which is extracted from the SRAT table.
      
      While the CPU information provided by the CRAT and the SRAT tables might be
      identical, the node topology might be different. The SRAT table contains the
      topology of CPU nodes only. The CRAT table contains the topology of CPU and GPU
      nodes together (and can be interleaved). For example CPU node 1 in SRAT can be
      CPU node 3 in CRAT. Furthermore it's worth to mention that the CRAT table
      contains only HSA compatible nodes (nodes which are compliant with the HSA
      spec).
      
      To recap, amdkfd exposes a different kind of topology than the one exposed by
      /sys/devices/system/cpu/cpu even though it may contain similar information.
      
      v4:
      
      The topology module doesn't support uevent handling and doesn't notify the
      userspace about runtime modifications. It is up to the userspace to acquire
      snapshots of the topology information created by the amdkfd and exposed
      in sysfs.
      
      The following is an example of how the topology looks on a Kaveri A10-7850K
      system with amdkfd installed:
      
      /sys/devices/virtual/kfd/kfd/
      |
      --- topology/
            |
            |--- generation_id
            |--- system_properties
            |--- nodes/
                  |
                  |--- 0/
                       |
                       |--- gpu_id
                       |--- name
                       |--- properties
                       |--- caches/
                            |
                            |--- 0/
                                 |
                                 |--- properties
                            |--- 1/
                                 |
                                 |--- properties
                            |--- 2/
                                 |
                                 |--- properties
                       |--- io_links/
                            |
                       |--- mem_banks/
                            |
                            |--- 0/
                                 |
                                 |--- properties
                            |--- 1/
                                 |
                                 |--- properties
                            |--- 2/
                                 |
                                 |--- properties
                            |--- 3/
                                 |
                                 |--- properties
      
      v5:
      
      Move amdkfd from drm/radeon/ to drm/amd/
      
      Add a check if dev->gpu pointer is null before accessing it in the
      node_show function in kfd_topology.c
      This situation may occur when amdkfd is loaded and there is a GPU with a CRAT
      table, but that GPU isn't supported by amdkfd
      Signed-off-by: NEvgeny Pinchuk <evgeny.pinchuk@amd.com>
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      5b5c4e40
    • O
      amdkfd: Add amdkfd skeleton driver · 4a488a7a
      Oded Gabbay 提交于
      This patch adds the amdkfd skeleton driver. The driver does nothing except
      define a /dev/kfd device.
      
      It returns -ENODEV on all amdkfd IOCTLs.
      
      v3: Move bool field to the end of structure, removed the pmc ioctls and added
      a meaningful error message for ioctl error.
      
      v5:
      
      Create a new folder drm/amd and move amdkfd from drm/radeon/ to drm/amd/
      Remove scheduler_class from kfd_priv.h as it was never used
      Add skeleton implementation of the Get Version IOCTL
      
      v6:
      Update module version to the correct number and remove the "default m" from the
      Kconfig file
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      4a488a7a
  2. 16 7月, 2014 1 次提交
    • O
      amdkfd: Add IOCTL set definitions of amdkfd · b7facbae
      Oded Gabbay 提交于
      - KFD_IOC_GET_VERSION:
      	Retrieves the interface version of amdkfd
      
      - KFD_IOC_CREATE_QUEUE:
      	Creates a usermode queue that runs on a specific GPU device
      
      - KFD_IOC_DESTROY_QUEUE:
      	Destroys an existing usermode queue
      
      - KFD_IOC_SET_MEMORY_POLICY:
      	Sets the memory policy of the default and alternate aperture of the
              calling process
      
      - KFD_IOC_GET_CLOCK_COUNTERS:
      	Retrieves counters (timestamps) of CPU and GPU
      
      - KFD_IOC_GET_PROCESS_APERTURES:
      	Retrieves information about process apertures that were initialized
              during the open() call of the amdkfd device
      
      - KFD_IOC_UPDATE_QUEUE:
      	Updates configuration of an existing usermode queue
      
      v3: Remove pragma pack and pmc ioctls. Added parameter for doorbell offset and
      a comment on counters
      
      v5:
      
      Add define for AQL queues.
      Fix arguments of Get Version IOCTL
      Make IOCTL's structures to be the same size on 32/64 bit
      
      v6: Change the version of the amdkfd-thunk interface
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      b7facbae
  3. 15 7月, 2014 2 次提交
    • O
      Update MAINTAINERS and CREDITS files with amdkfd info · 16423d67
      Oded Gabbay 提交于
      v6: Update entries to reflect new name & location of driver
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      16423d67
    • O
      drm/radeon: Add radeon <--> amdkfd interface · e28740ec
      Oded Gabbay 提交于
      This patch adds the interface between the radeon driver and the amdkfd driver.
      The interface implementation is contained in radeon_kfd.c and radeon_kfd.h.
      
      The interface itself is represented by a pointer to struct
      kfd_dev. The pointer is located inside radeon_device structure.
      
      All the register accesses that amdkfd need are done using this interface. This
      allows us to avoid direct register accesses in amdkfd proper,  while also
      avoiding locking between amdkfd and radeon.
      
      The single exception is the doorbells that are used in both of the drivers.
      However, because they are located in separate pci bar pages, the danger of
      sharing registers between the drivers is minimal.
      
      Having said that, we are planning to move the doorbells as well to radeon.
      
      v3:
      
      Add interface for sa manager init and fini. The init function will allocate a
      buffer on system memory and pin it to the GART address space via the radeon sa
      manager.
      
      All mappings of buffers to GART address space are done via the radeon sa
      manager. The interface of allocate memory will use the radeon sa manager to sub
      allocate from the single buffer that was allocated during the init function.
      
      Change lower_32/upper_32 calls to use linux macros
      
      Add documentation for the interface
      
      v4:
      
      Change ptr field type in kgd_mem from uint32_t* to void* to match to type that
      is returned by radeon_sa_bo_cpu_addr
      
      v5:
      
      Change format of mqd structure to work with latest KV firmware
      Add support for AQL queues creation to enable working with open-source HSA
      runtime.
      Move generic kfd-->kgd interface and other generic kgd definitions to a generic
      header file that will be used by AMD's radeon and amdgpu drivers
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      e28740ec
  4. 14 7月, 2014 1 次提交
  5. 28 1月, 2014 1 次提交
  6. 12 2月, 2014 1 次提交
  7. 16 1月, 2014 1 次提交
  8. 10 11月, 2014 1 次提交
    • O
      iommu/amd: fix accounting of device_state · a015c1e9
      Oded Gabbay 提交于
      This patch fixes a bug in the accounting of the device_state.
      In the current code, the device_state was put (decremented) too many times,
      which sometimes lead to the driver getting stuck permanently in
      put_device_state_wait(). That happen because the device_state->count would go
      below zero, which is never supposed to happen.
      
      The root cause is that the device_state was decremented in put_pasid_state()
      and put_pasid_state_wait() but also in all the functions that call those
      functions. Therefore, the device_state was decremented twice in each of these
      code paths.
      
      The fix is to decouple the device_state accounting from the pasid_state
      accounting - remove the call to put_device_state() from the
      put_pasid_state() and the put_pasid_state_wait())
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      a015c1e9
  9. 13 11月, 2014 22 次提交