1. 17 7月, 2014 7 次提交
    • B
      amdkfd: Add kernel queue module · ed6e6a34
      Ben Goz 提交于
      The kernel queue module enables the amdkfd to establish kernel queues, not
      exposed to user space.
      
      The kernel queues are used for HIQ (HSA Interface Queue) and DIQ (Debug
      Interface Queue) operations
      
      v3: Removed use of internal typedefs and added use of the new gart allocation
      functions
      
      v4: Fixed a miscalculation in kernel queue wrapping
      
      v5:
      
      Move amdkfd from drm/radeon/ to drm/amd/
      Change format of mqd structure to match latest KV firmware
      Add support for AQL queues creation to enable working with open-source HSA
      runtime
      Add define for kernel queue size
      Various fixes
      Signed-off-by: NBen Goz <ben.goz@amd.com>
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      ed6e6a34
    • B
      amdkfd: Add mqd_manager module · 6e99df57
      Ben Goz 提交于
      The mqd_manager module handles MQD data structures.
      MQD stands for Memory Queue Descriptor, which is used by the H/W to
      keep the usermode queue state in memory.
      
      v3:
      
      Removed new typedefs
      Removed pragma pack 4
      Remove cik_mqds.h file
      Changed lower_32/upper_32 calls to use linux macros
      Used new gart allocation functions
      Added documentation
      
      v4:
      
      Added missing initialization of the addr field in init_mqd()
      
      Setting the hqd persistent.preload_req bit ON so that when queues switches
      on/off, their context will kept and read from the mqd when the cp reassign
      them, and thus the dispatched workload context kept consistent without any
      interrupts.
      
      v5:
      
      Move amdkfd from drm/radeon/ to drm/amd/
      Change format of mqd structure to match latest KV firmware
      Add support for AQL queues creation to enable working with open-source HSA
      runtime.
      Various fixes
      Signed-off-by: NBen Goz <ben.goz@amd.com>
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      6e99df57
    • B
      amdkfd: Add queue module · ed8aab45
      Ben Goz 提交于
      The queue module enables allocating and initializing queues uniformly.
      
      v3: Removed typedef and redundant memset call. Broke long pr_debug print to one
      liners and Added documentation.
      
      v5: Move amdkfd from drm/radeon/ to drm/amd/
      Signed-off-by: NBen Goz <ben.goz@amd.com>
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      ed8aab45
    • O
      amdkfd: Add binding/unbinding calls to amd_iommu driver · b17f068a
      Oded Gabbay 提交于
      This patch adds the functions to bind and unbind pasid
      from a device through the amd_iommu driver.
      
      The unbind function is called when the mm_struct of the
      process is released.
      
      The bind function is not called here because it is called
      only in the IOCTLs which are not yet implemented at this
      stage of the patchset.
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      b17f068a
    • O
      amdkfd: Add basic modules to amdkfd · 19f6d2a6
      Oded Gabbay 提交于
      This patch adds the process module and three helper modules:
      
      - kfd_process, which handles process which open /dev/kfd
      
      - kfd_doorbell, which provides helper functions for doorbell allocation,
        release and mapping to userspace
      
      - kfd_pasid, which provides helper functions for pasid allocation and release
      
      - kfd_aperture, which provides helper functions for managing the LDS, Local GPU
        memory and Scratch memory apertures of the process
      
      This patch only contains the basic kfd_process module, which doesn't contain
      the reference to the queue scheduler. This was done to allow easier code review.
      
      Also, this patch doesn't contain the calls to the IOMMU driver for binding the
      pasid to the device. Again, this was done to allow easier code review
      
      The kfd_process object is created when a process opens /dev/kfd and is closed
      when the mm_struct of that process is teared-down.
      
      v3:
      
      Removed kfd_vidmem.c file
      Replaced direct mmput call to mmu_notifier release
      Removed typedefs
      Moved bool field to end of the structure
      Added new kernel params for gart usage limitation
      Added initialization of sa manager
      Fixed debug messages
      Remove support for LDS in 32 bit
      Changed code to support mmap of doorbell pages from userspace
      Added documentation for apertures
      
      v4: Replaced RCU by SRCU for kfd_process list management
      
      v5:
      
      Move amdkfd from drm/radeon/ to drm/amd/
      Rename kfd_aperture.c to kfd_flat_memory.c
      Protect against multiple init calls
      MQD size is H/W dependent so moved it to device info structure
      Rename kfd_mem_obj structure's members
      Use delayed function for process tear-down
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      19f6d2a6
    • E
      amdkfd: Add topology module to amdkfd · 5b5c4e40
      Evgeny Pinchuk 提交于
      This patch adds the topology module to the driver. The topology is exposed to
      userspace through the sysfs.
      
      The calls to add and remove a device to/from topology are done by the radeon
      driver.
      
      v3:
      
      The CPU information, that is provided in the topology section of the amdkfd
      driver, is extracted from the CRAT table. Unlike the CPU information located
      in /sys/devices/system/cpu/cpu*, which is extracted from the SRAT table.
      
      While the CPU information provided by the CRAT and the SRAT tables might be
      identical, the node topology might be different. The SRAT table contains the
      topology of CPU nodes only. The CRAT table contains the topology of CPU and GPU
      nodes together (and can be interleaved). For example CPU node 1 in SRAT can be
      CPU node 3 in CRAT. Furthermore it's worth to mention that the CRAT table
      contains only HSA compatible nodes (nodes which are compliant with the HSA
      spec).
      
      To recap, amdkfd exposes a different kind of topology than the one exposed by
      /sys/devices/system/cpu/cpu even though it may contain similar information.
      
      v4:
      
      The topology module doesn't support uevent handling and doesn't notify the
      userspace about runtime modifications. It is up to the userspace to acquire
      snapshots of the topology information created by the amdkfd and exposed
      in sysfs.
      
      The following is an example of how the topology looks on a Kaveri A10-7850K
      system with amdkfd installed:
      
      /sys/devices/virtual/kfd/kfd/
      |
      --- topology/
            |
            |--- generation_id
            |--- system_properties
            |--- nodes/
                  |
                  |--- 0/
                       |
                       |--- gpu_id
                       |--- name
                       |--- properties
                       |--- caches/
                            |
                            |--- 0/
                                 |
                                 |--- properties
                            |--- 1/
                                 |
                                 |--- properties
                            |--- 2/
                                 |
                                 |--- properties
                       |--- io_links/
                            |
                       |--- mem_banks/
                            |
                            |--- 0/
                                 |
                                 |--- properties
                            |--- 1/
                                 |
                                 |--- properties
                            |--- 2/
                                 |
                                 |--- properties
                            |--- 3/
                                 |
                                 |--- properties
      
      v5:
      
      Move amdkfd from drm/radeon/ to drm/amd/
      
      Add a check if dev->gpu pointer is null before accessing it in the
      node_show function in kfd_topology.c
      This situation may occur when amdkfd is loaded and there is a GPU with a CRAT
      table, but that GPU isn't supported by amdkfd
      Signed-off-by: NEvgeny Pinchuk <evgeny.pinchuk@amd.com>
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      5b5c4e40
    • O
      amdkfd: Add amdkfd skeleton driver · 4a488a7a
      Oded Gabbay 提交于
      This patch adds the amdkfd skeleton driver. The driver does nothing except
      define a /dev/kfd device.
      
      It returns -ENODEV on all amdkfd IOCTLs.
      
      v3: Move bool field to the end of structure, removed the pmc ioctls and added
      a meaningful error message for ioctl error.
      
      v5:
      
      Create a new folder drm/amd and move amdkfd from drm/radeon/ to drm/amd/
      Remove scheduler_class from kfd_priv.h as it was never used
      Add skeleton implementation of the Get Version IOCTL
      
      v6:
      Update module version to the correct number and remove the "default m" from the
      Kconfig file
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      4a488a7a
  2. 15 7月, 2014 1 次提交
    • O
      drm/radeon: Add radeon <--> amdkfd interface · e28740ec
      Oded Gabbay 提交于
      This patch adds the interface between the radeon driver and the amdkfd driver.
      The interface implementation is contained in radeon_kfd.c and radeon_kfd.h.
      
      The interface itself is represented by a pointer to struct
      kfd_dev. The pointer is located inside radeon_device structure.
      
      All the register accesses that amdkfd need are done using this interface. This
      allows us to avoid direct register accesses in amdkfd proper,  while also
      avoiding locking between amdkfd and radeon.
      
      The single exception is the doorbells that are used in both of the drivers.
      However, because they are located in separate pci bar pages, the danger of
      sharing registers between the drivers is minimal.
      
      Having said that, we are planning to move the doorbells as well to radeon.
      
      v3:
      
      Add interface for sa manager init and fini. The init function will allocate a
      buffer on system memory and pin it to the GART address space via the radeon sa
      manager.
      
      All mappings of buffers to GART address space are done via the radeon sa
      manager. The interface of allocate memory will use the radeon sa manager to sub
      allocate from the single buffer that was allocated during the init function.
      
      Change lower_32/upper_32 calls to use linux macros
      
      Add documentation for the interface
      
      v4:
      
      Change ptr field type in kgd_mem from uint32_t* to void* to match to type that
      is returned by radeon_sa_bo_cpu_addr
      
      v5:
      
      Change format of mqd structure to work with latest KV firmware
      Add support for AQL queues creation to enable working with open-source HSA
      runtime.
      Move generic kfd-->kgd interface and other generic kgd definitions to a generic
      header file that will be used by AMD's radeon and amdgpu drivers
      Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
      e28740ec