- 28 8月, 2015 29 次提交
-
-
由 Alexandre Courbot 提交于
GM20B has a 512-channels FIFO similar to GK104. Signed-off-by: NAlexandre Courbot <acourbot@nvidia.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Alexandre Courbot 提交于
GK20A's initialization was based on GK104, but differences exist in the way the initial context is built and the initialization process itself. This patch follows the same initialization sequence as nvgpu performs to avoid bad surprises. Since the register bundles initialization also differ considerably from GK104, the register packs are now loaded from firmware files, again similarly to what is done with nvgpu. Signed-off-by: NAlexandre Courbot <acourbot@nvidia.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Alexandre Courbot 提交于
NVIDIA will officially start providing GR firmwares through linux-firmware for GPUs that require it. Change the GR firmware lookup function to use these files. Signed-off-by: NAlexandre Courbot <acourbot@nvidia.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
These signals and sources have been reverse engineered from CUPTI (Linux). Graphics signals exposed by PerfKit (Windows only) will be added later. I need to reverse engineer them and it's a bit painful. Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
gk104_pm_ctor() is equal to gf100_pm_ctor(). Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
This trivial patch makes thing more consistent since hardware signals names are prefixed by 'pcXX'. Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
This is going to be very useful for GF100+ because each GPC can have its own domain of counters. Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
These signals and sources have been reverse engineered from CUPTI (Linux). Graphics signals exposed by PerfKit (Windows only) will be added later. I need to reverse engineer them and it's a bit painful. This commit also adds a new class for GF108 and GF117. Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Ben Skeggs 提交于
Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Ben Skeggs 提交于
Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
These signals and sources have been reverse engineered from NVIDIA PerfKit (Windows) and CUPTI (Linux), they will be used to build complex hardware events from the userspace. This commit also adds a new class for GT200. Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
Signed-off-by: Samuel Pitoiset <samuel.pitoiset at gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
Configuring counters from the userspace require the kernel to handle some logic related to performance counters. Basically, it has to find a free slot to assign a counter, to handle extra counting modes like B4/B6 and it must return and error when it can't configure a counter. In my opinion, the kernel should not handle all of that logic but it should only write the configuration sent by the userspace without checking anything. In other words, it should overwrite the configuration even if it's already counting and do not return any errors. This patch allows the userspace to configure a domain instead of separate counters. This has the advantage to move all of the logic to the userspace. Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
This adds a new method NVIF_PERFCTR_V0_INIT which starts a batch of hardware counters for sampling. This will allow the userspace to start a monitoring session using the INIT method and to stop it with SAMPLE, for example before and after a frame is rendered. This commit temporarily breaks nv_perfmon but this is going to be fixed with the upcoming patch. Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
This allows to query the ID, the mask and the user-readable name of sources for each signal. Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
A source (or multiplexer) is a tuple addr+mask+shift which allows to control a block of signals. The maximum number of sources that a signal can define is arbitrary limited to 8 and this should be large enough. This patch allows to define multi-level of sources for a signal. Each different sources are stored to a global list and will be exposed to the userspace through the nvif interface in order to avoid conflicts. Signed-off-by: Samuel Pitoiset <samuel.pitoiset at gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
This signal index must be always allowed even if it's not clearly defined in a domain in order to monitor a counter like 0x03020100 because it's the default value of signals. Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
16 bits is large enough to store the maximum number of signals available for one domain (i.e. 256). Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
This will allow to configure performance counters with hardware signal indexes instead of user-readable names in an upcoming patch. Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
This allows to query the number of available domains, including the number of hardware counter and the number of signals per domain. Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
Since a new class has been introduced to query signals, we can now return an error when the userspace wants to monitor unknown signals. Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
This commit introduces the NVIF_IOCTL_NEW_V0_PERFMON class which will be used in order to query domains, signals and sources. This separates the querying and the counting interface. Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: NMartin Peres <martin.peres@free.fr> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
PDAEMON signals don't have to be exposed by the perfmon engine. Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: NMartin Peres <martin.peres@free.fr> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
- 14 8月, 2015 1 次提交
-
-
由 Alexandre Courbot 提交于
This reverts commit 1addc126 This commit seems to cause crashes in gk104_fifo_intr_runlist() by returning 0xbad0da00 when register 0x2a00 is read. Since this commit was intended for GM20B which is not completely supported yet, let's revert it for the time being. Reported-by: NEric Biggers <ebiggers3@gmail.com> Signed-off-by: NAlexandre Courbot <acourbot@nvidia.com> Tested-by: NAfzal Mohammed <afzal.mohd.ma@gmail.com> Signed-off-by: NDave Airlie <airlied@redhat.com>
-
- 27 7月, 2015 6 次提交
-
-
由 Thierry Reding 提交于
The return type of exec_lookup() is struct nvkm_output *, so it should return NULL rather than 0. Signed-off-by: NThierry Reding <treding@nvidia.com> Reviewed-by: NEmil Velikov <emil.l.velikov@gmail.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
There is always the possiblity that the ppm->context pointer would get partially updated and accidentally would equal ctx. This would allow two contexts to co-exist, which is not acceptable. Moving the test to the critical section takes care of this problem. Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: NMartin Peres <martin.peres@free.fr> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Samuel Pitoiset 提交于
This fixes a crash when multiple PM engine contexts are created. Signed-off-by: NSamuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: NMartin Peres <martin.peres@free.fr> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Alexandre Courbot 提交于
After submitting a GO_IDLE bundle, one must wait for GR to effectively be idle before submitting the next bundle. Failure to do so may result in undefined behavior in some rare cases. Signed-off-by: NAlexandre Courbot <acourbot@nvidia.com> Reported-by: NKary Jin <karyj@nvidia.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Alexandre Courbot 提交于
When emitting the ICMD bundle, wait on the bottom half (bit 3 of the GR_STATUS register) instead of upper half (bit 2) to make sure methods are effectively emitted. Signed-off-by: NAlexandre Courbot <acourbot@nvidia.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Alexandre Courbot 提交于
Kicking channels is part of their deactivation process. Maxwell chips are particularly sensitive to this, and can start fetching the previous pushbuffer of a recycled channel if this is not done. While we are at it, improve the channel preemption code to only wait for bit 20 of 0x002634 to turn to 0, as it is the bit indicating a preempt is pending. Signed-off-by: NAlexandre Courbot <acourbot@nvidia.com> Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
- 29 5月, 2015 1 次提交
-
-
由 Ben Skeggs 提交于
Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
- 14 4月, 2015 3 次提交
-
-
由 Ben Skeggs 提交于
Uncertain whether the GPC pack change is due to a newer driver version, or a legitimate difference from GM204. My GM204 has broken vram, so can't currently try a newer binary driver on it to confirm. Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Ben Skeggs 提交于
Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-
由 Ben Skeggs 提交于
Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
-