Created by: ForFishes
Because the RecordedCudaMalloc
and RecordedCudaFree
need much time, we replace it with the max size of gpu memory. At the same moment, we add InputHelp
and InsRank
as input for grad in this op.
Created by: ForFishes
Because the RecordedCudaMalloc
and RecordedCudaFree
need much time, we replace it with the max size of gpu memory. At the same moment, we add InputHelp
and InsRank
as input for grad in this op.