- 10 Dec, 2021 1 commit
-
-
Committed by CtfGo
-
- 09 Dec, 2021 6 commits
-
-
Committed by CtfGo
cinn_launch_op: cache scope and place on CinnLaunchContext to skip duplicate alloc/free callback construction
-
Committed by Chen Weihang
* refine the kernel register impl * fix cmake and symbol error * remove overload macro * polish details
-
Committed by jianghaicheng
-
Committed by Roc
-
Committed by Feng Xing
-
Committed by Chen Weihang
-
- 08 Dec, 2021 6 commits
-
-
Committed by CtfGo
1. add a subdirectory named `cinn` in the `paddle/fluid/operators` directory and move related files into it 2. separate the CinnLaunchContext class from `cinn_launch_op.h` and put it in a new independent file named `cinn_launch_context.h`, so that it can be included by others clearly.
-
Committed by YuanRisheng
* add alias kernel name * modify code as suggestions
-
Committed by wuhuanzhou
* save temp * add unittest, test=develop * fix ci error, test=develop * fix grad accuracy error, test=develop * fix unused error, test=develop * fix compilation error on Windows, test=develop * add unittest, test=develop * modify by review comment and add lerp_ * fix inplace api, test=develop * fix inplace api, test=develop * fix coverage error, test=develop
-
Committed by crystal
* add boardcast_sub * add boardcast_sub
-
Committed by Yanxing Shi
-
Committed by sneaxiy
* fix CUDA Graph H2D bug again * fix no return bug
-
- 07 Dec, 2021 2 commits
-
-
Committed by danleifeng
-
Committed by Zuza
* quantize slice op * correct test * fix code formatting
-
- 06 Dec, 2021 2 commits
-
-
Committed by Huihuang Zheng
1. Modify git tag for CINN 2. Support compile option "-DWITH_CINN=ON, -DWITH_TESTING=OFF"
-
Committed by Chen Weihang
* fix reshape move storage error * remove needless set type * alloc tensor by shared storage
-
- 03 Dec, 2021 2 commits
- 02 Dec, 2021 1 commit
-
-
Committed by furnace
* [NPU] add int64 support for scatter op * [NPU] delete debug codes * [NPU] optimize codes
-
- 01 Dec, 2021 4 commits
-
-
Committed by sneaxiy
* fix inplace addto pass * update * fix ut * improve ci coverage * fix musl ci compile error
-
Committed by TTerror
* add prior_box for kunlun * update * update CMakeLists
-
Committed by Feiyu Chan
* add angle_op
-
Committed by Huihuang Zheng
Modify ShareTensorWithCinnBuffer by callback to save memory
-
- 30 Nov, 2021 6 commits
-
-
Committed by Sylwester Fraczek
* refactoring matmul hierarchy * review fix * review fix * review_FIX-part2
-
Committed by Sławomir Siwek
* Add new unittests * Replace I with O channel for filter groups * Undo changes affecting other operators * Fix oneDNN namespace typo * Fix code format error
-
Committed by zhangbo9674
* add regularization and Nesterov for merged_momentum * refine unittest for use_nesterov attr * refine op check * refine code * fix bug * refine code of regularization_flag * delete useless code
-
Committed by Chen Weihang
-
Committed by Guoxia Wang
* support data_format='NHWC' for prelu channel mode
-
Committed by Yang
-
- 29 Nov, 2021 4 commits
-
-
Committed by chentianyu03
* add pten reduce kernel * add reduce_sum kernel * update attribute args and order * make out dtype undefined * fix empty input error * merge develop branch * rename sum as reduce function * rename sum as reduce function * fix reducekernelImpl args error * add reduce cuda kernel * modify dims type to const & * remove unused log * fix reduce_all out eigen function error * remove unused codes * add the missing sum api define and testcase * merge develop branch * fix sum test axis value error * replace pten mean kernel with reduce_mean * recover mean cuda to original implementation
-
Committed by TTerror
* add expand_v2/expand_as_v2 for kunlun * update expand_as_v2 * update expand_as_v2 * support float16/bool * update xpu.cmake
-
Committed by piotrekobiIntel
-
Committed by wanghuancoder
* support fetch lodtensor array, test=develop * refine, test=develop * refine, test=develop * refine, test=develop
-
- 27 Nov, 2021 1 commit
-
-
Committed by Aganlengzi
* [NPU] reorganization for device API abstraction * [NPU] delete old files * [NPU] fix npu_collective_helper * [NPU] fix collective_helper * [NPU] fix ut * [NPU] mod memory allocation and hccl_helper * [NPU] fix place_type * [NPU] split enforce.h * move acl* call into npu_info * merge conflict * fix merge * merge conflict * merge conflict
-
- 26 Nov, 2021 2 commits
-
-
Committed by zhaocaibei123
* test * test * rm test * update * update * update * add unittest * update * update save
-
Committed by Chen Weihang
-
- 25 Nov, 2021 3 commits
-
-
Committed by zyfncg
* add scalar and scalar_array * remove DenseTensor include from Scalar and ScalarArray * remove inner header from scalar_array * refactor the method of fill_constant and add some comment * add fill_constant kernel using ScalarArray * modify some prompt * remove fill_constant kernel with no shape
-
Committed by furnace
* [NPU] add int64 support for argsort op * [NPU] delete debug codes
-
Committed by furnace
* [NPU] add NPU kernel for prior_box op * [NPU] delete debug codes
-