Created by: dlkht
I tried distillation with the transformer model. After training for two batches, it reported the following error: Error: Tensor holds no memory. Call Tensor::mutable_data first. [Hint: holder_ should not be null.] at (/paddle/paddle/fluid/framework/tensor.cc:23) [operator < elementwise_div > error]
Is this caused by a memory problem?