- 10 11月, 2022 1 次提交
-
-
由 PuQing 提交于
* rm fluid gpu_launch_config * fix type
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 12 4月, 2022 1 次提交
-
-
由 FlyingQianMM 提交于
add a inner loop for index_select_grad_init() in index_select op when dealing with large-shape data (#41563) * replace for with CUDA_KERNEL_LOOP for index_select_grad_init() in index_select op * use CUDA_KERNEL_LOOP_TYPE * fix code style * replace index_select_grad_init with SetConstant
-
- 26 3月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 25 3月, 2022 1 次提交
-
-
由 FlyingQianMM 提交于
* add maximum limit for grid of reduce, elementwise and gather * add {} after if
-
- 02 3月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* move gather.h gather.cu.h scatter.h scatter.cu.h to phi library * fix CI * fix rocm ci
-
- 20 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
- 19 2月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* Unify paddle/pten::framework::ddim into pten::ddim * fix paddle namespace * compile sucessfully * fix npu src file * fix conflict * fix conflict * fix tensorrt compiler error * fix conflict * fix conflict * fix tesst file conflict * fix conflict * fix mlu file conflict * fix mlu file conflict * fix cinn header file conflict * fix conflict * fix conflict * fix conflict * fix conflict
-
- 11 2月, 2022 1 次提交
-
-
由 Feiyu Chan 提交于
* move operators/math/math_function_* to pten/kernels/func * namespace from `paddle::operators::math` to `pten::funcs`
-
- 17 1月, 2022 1 次提交
-
-
由 Wilber 提交于
* add pten::Place data structure. * update ci problem * fix ci problem * update * using platform::Place=pten::Place * remove BOOST_GET_CONST for CPUPlace and GPUPlace * compile pass 25%. * compile pass 45% * compile pass 60% * remove boost_get for xpu npu mlu and ipu * compile pass on cpu and gpu. * fix compile problem * fix compile error. * update * fix ci problem * update * ci approve * fix ci problem * fix ci eager test problem * remove BOOST_GET_CONST * fix npu compile
-
- 03 12月, 2021 1 次提交
-
-
由 ronnywang 提交于
* refine structure for cuda and rocm * update * update * update * update
-
- 10 9月, 2021 1 次提交
-
-
由 Zeng Jinle 提交于
* fix scatter gather bug: * fix windows ci
-
- 08 9月, 2021 1 次提交
-
-
由 Zeng Jinle 提交于
* fix scatter_add_nd and gather bug * fix gather compile error
-
- 16 7月, 2021 1 次提交
-
-
由 Haohongxiang 提交于
-
- 13 7月, 2021 1 次提交
-
-
由 Haohongxiang 提交于
* Fix gather_op by adding OurOfRangeCheck for param[Index] * Code Optimization
-
- 11 6月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* fix gather shape bug * fix None * fix topo
-
- 10 12月, 2020 1 次提交
-
-
由 ShenLiang 提交于
-
- 09 11月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 23 8月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 11 7月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop * replace old macro & for condition, test=develop * polish details, test=develop
-
- 14 5月, 2020 1 次提交
-
-
由 ShenLiang 提交于
-
- 11 5月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add new macro BOOST_GET_SAFELY & unittests, test=develop * add different macro type, test=develop * fix get macro type in executor, test=develop * four macro part change backup * using one macro for all case, test=develop * revert attribute change, test=develop * change to three func to solve gcc4.8 bug, test=develop * polish some details, test=develop
-
- 11 9月, 2019 1 次提交
-
-
由 Huihuang Zheng 提交于
TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory. We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton. Also added data_feed_proto to operator to fix CI in CPU compilation
-
- 30 8月, 2019 1 次提交
-
-
由 ShenLiang 提交于
* fixed the code for coverage * fixed the document,test=document_preview test=develop
-
- 14 8月, 2019 1 次提交
-
-
由 chengduo 提交于
* fix gather op bug test=develop
-
- 25 5月, 2019 1 次提交
-
-
由 hutuxian 提交于
* gather_op support int64_t index by adding a template typename * add UT and rename typename test=develop
-
- 26 3月, 2019 1 次提交
-
-
由 Xin Pan 提交于
test=develop
-
- 12 11月, 2018 1 次提交
-
-
由 Yibing Liu 提交于
* Add int type support for stack_op * Improve gather op to support index with shape N x 1 test=develop * Fix stack_op kernel's registry test=develop
-
- 12 2月, 2018 1 次提交
-
-
由 qingqing01 提交于
-
- 10 2月, 2018 2 次提交
- 26 12月, 2017 1 次提交
-
-
由 Luo Tao 提交于
-
- 12 12月, 2017 1 次提交
-
-
由 QI JUN 提交于
There are mainly following fixes: - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place` - remove `eigen_device` interface in base class `DeviceContext` - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext` - remove unused `platform::EigenDeviceConverter` - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL` - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
-
- 04 10月, 2017 1 次提交
-
-
由 zchen0211 提交于
-
- 03 10月, 2017 1 次提交
-
-
由 zchen0211 提交于
-
- 29 9月, 2017 2 次提交