[cherry-pick]Fixed a bug of log_softmax: op input was modified to 'nan' (#32937) (#33436)

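While running op benchmark, we found that when the input size is below a certain threshold, the input tensor of the Python-side log_softmax API is overwritten with nan after the computation. The output itself is correct. Cherry-picked from #32937.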

61cae0df · Lijunhui · GitHub · 8461ab17 · 61cae0df

Showing with 2 additions and 2 deletions

paddle/fluid/operators/log_softmax_op.cu +2 -2

--- a/paddle/fluid/operators/log_softmax_op.cu
+++ b/paddle/fluid/operators/log_softmax_op.cu
@@ -104,7 +104,7 @@ __global__ void ComputeLogSoftmaxForwardInWarp(T *dst, const T *src,
 #pragma unroll
  for (int it = 0; it < warp_iter; ++it) {
    int element_index = thread_in_warp_idx + it * kernel_warp_size;
-    if (element_index < element_count) {
+    if (element_index < effective_element_count) {
      dst[batch_id * element_count + element_index] =
          static_cast<T>(elements[it] - max_value - sum);
    } else {
@@ -226,7 +226,7 @@ __global__ void ComputeLogSoftmaxBackwardInWarp(const T *output,
 #pragma unroll
  for (int iter = 0; iter < warp_iter; ++iter) {
    int element_index = thread_in_warp_idx + iter * kernel_warp_size;
-    if (element_index < element_count) {
+    if (element_index < effective_element_count) {
      grad_input[batch_id * element_count + element_index] = static_cast<T>(
          (grad_output_register[iter] - std::exp(output_register[iter]) * sum));
    }
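
The effect of the changed guard can be illustrated with a small host-side model. This is only a sketch under stated assumptions, not the kernel itself: it assumes that `effective_element_count` is `element_count` for rows with `batch_id < batch_size` and `0` for padding rows (its definition is not part of the hunks above), and that padding rows hold `-inf` in their registers. The function `CountClobberedSlots`, the sizes, and the flat `memory` buffer are hypothetical names introduced for illustration.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <limits>
#include <vector>

// Hypothetical CPU model of the store step in the warp kernel.
// The first batch_size rows of `memory` stand in for dst; the rows after
// them stand in for whatever neighbouring buffer follows dst in memory.
// Returns how many neighbouring slots end up holding nan.
int CountClobberedSlots(bool use_effective_count) {
  const int batch_size = 3;     // real rows (assumed value)
  const int launched_rows = 4;  // rows covered by the launch (assumed value)
  const int element_count = 4;  // elements per row (assumed value)
  const float sentinel = 1.0f;  // marks untouched neighbouring memory
  const float neg_inf = -std::numeric_limits<float>::infinity();

  std::vector<float> memory(
      static_cast<std::size_t>(launched_rows) * element_count, sentinel);

  for (int batch_id = 0; batch_id < launched_rows; ++batch_id) {
    // Assumption: padding rows get an effective count of 0.
    const int effective_element_count =
        (batch_id < batch_size) ? element_count : 0;
    const int bound =
        use_effective_count ? effective_element_count : element_count;

    for (int element_index = 0; element_index < element_count;
         ++element_index) {
      if (element_index < bound) {
        // Real rows: log_softmax of an all-zero row is -log(element_count).
        // Padding rows: registers hold -inf, and -inf - (-inf) is nan.
        const float value = (batch_id < batch_size)
                                ? -std::log(static_cast<float>(element_count))
                                : neg_inf - neg_inf;
        memory[static_cast<std::size_t>(batch_id) * element_count +
               element_index] = value;
      }
    }
  }

  // Count nan values that landed outside the real dst rows.
  int clobbered = 0;
  for (std::size_t i = static_cast<std::size_t>(batch_size) * element_count;
       i < memory.size(); ++i) {
    if (std::isnan(memory[i])) ++clobbered;
  }
  return clobbered;
}
```

In this model, `CountClobberedSlots(false)` (the old `element_count` guard) reports 4 neighbouring slots overwritten with nan, while `CountClobberedSlots(true)` (the new `effective_element_count` guard) reports 0. If the input tensor happens to sit right after the output in device memory, this would be consistent with the symptom described above, although the actual memory layout is not shown in this commit.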