- 03 12月, 2022 1 次提交
-
-
由 Yuang Liu 提交于
-
- 16 11月, 2022 1 次提交
-
-
由 Wang Xin 提交于
-
- 10 11月, 2022 1 次提交
-
-
由 PuQing 提交于
* rm fluid gpu_launch_config * fix type
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 12 4月, 2022 1 次提交
-
-
由 FlyingQianMM 提交于
add a inner loop for index_select_grad_init() in index_select op when dealing with large-shape data (#41563) * replace for with CUDA_KERNEL_LOOP for index_select_grad_init() in index_select op * use CUDA_KERNEL_LOOP_TYPE * fix code style * replace index_select_grad_init with SetConstant
-
- 26 3月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 25 3月, 2022 1 次提交
-
-
由 FlyingQianMM 提交于
* add maximum limit for grid of reduce, elementwise and gather * add {} after if
-
- 02 3月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* move gather.h gather.cu.h scatter.h scatter.cu.h to phi library * fix CI * fix rocm ci
-
- 20 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
- 19 2月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* Unify paddle/pten::framework::ddim into pten::ddim * fix paddle namespace * compile sucessfully * fix npu src file * fix conflict * fix conflict * fix tensorrt compiler error * fix conflict * fix conflict * fix tesst file conflict * fix conflict * fix mlu file conflict * fix mlu file conflict * fix cinn header file conflict * fix conflict * fix conflict * fix conflict * fix conflict
-
- 11 2月, 2022 1 次提交
-
-
由 Feiyu Chan 提交于
* move operators/math/math_function_* to pten/kernels/func * namespace from `paddle::operators::math` to `pten::funcs`
-
- 06 2月, 2022 1 次提交
-
-
由 Wilber 提交于
-
- 17 1月, 2022 1 次提交
-
-
由 Wilber 提交于
* add pten::Place data structure. * update ci problem * fix ci problem * update * using platform::Place=pten::Place * remove BOOST_GET_CONST for CPUPlace and GPUPlace * compile pass 25%. * compile pass 45% * compile pass 60% * remove boost_get for xpu npu mlu and ipu * compile pass on cpu and gpu. * fix compile problem * fix compile error. * update * fix ci problem * update * ci approve * fix ci problem * fix ci eager test problem * remove BOOST_GET_CONST * fix npu compile
-
- 03 12月, 2021 1 次提交
-
-
由 ronnywang 提交于
* refine structure for cuda and rocm * update * update * update * update
-
- 10 9月, 2021 1 次提交
-
-
由 Zeng Jinle 提交于
* fix scatter gather bug: * fix windows ci
-
- 01 7月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* fix safe bug of scatter/scatter_nd
-
- 22 1月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 11 7月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop * replace old macro & for condition, test=develop * polish details, test=develop
-
- 13 5月, 2020 1 次提交
-
-
由 ForFishes 提交于
fix scatter and scatter_nd op error message
-
- 11 5月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add new macro BOOST_GET_SAFELY & unittests, test=develop * add different macro type, test=develop * fix get macro type in executor, test=develop * four macro part change backup * using one macro for all case, test=develop * revert attribute change, test=develop * change to three func to solve gcc4.8 bug, test=develop * polish some details, test=develop
-
- 11 9月, 2019 1 次提交
-
-
由 Huihuang Zheng 提交于
TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory. We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton. Also added data_feed_proto to operator to fix CI in CPU compilation
-
- 04 9月, 2019 2 次提交
- 30 8月, 2019 1 次提交
-
-
由 ShenLiang 提交于
* fixed the code for coverage * fixed the document,test=document_preview test=develop
-
- 12 6月, 2019 1 次提交
-
-
由 wawltor 提交于
* test=develop The scatter op has a calc bug when the indices has same index, the scatter op use overwrite mode to calculate the same index, fix this bug by using the accumulate mode to calculate the same index.At the same time, the gather op has the same bug when the op calc the grad. And we use the lib of open-blas and eigen to optimize the time cost in accumulate mode. * test=develop Fix some code format problem, and the same time add the test case in gather and scatter op
-
- 25 5月, 2019 1 次提交
-
-
由 hutuxian 提交于
* gather_op support int64_t index by adding a template typename * add UT and rename typename test=develop
-
- 12 11月, 2018 1 次提交
-
-
由 Yibing Liu 提交于
* Add int type support for stack_op * Improve gather op to support index with shape N x 1 test=develop * Fix stack_op kernel's registry test=develop
-
- 12 2月, 2018 1 次提交
-
-
由 qingqing01 提交于
-
- 10 2月, 2018 2 次提交
- 26 12月, 2017 1 次提交
-
-
由 Luo Tao 提交于
-
- 04 10月, 2017 1 次提交
-
-
由 zchen0211 提交于
-
- 03 10月, 2017 1 次提交
-
-
由 zchen0211 提交于
-
- 29 9月, 2017 2 次提交