- 08 Dec, 2021 13 commits
-
-
Committed by CtfGo
1. add a subdirectory named `cinn` in `paddle/fluid/operators` directory and move related files into it 2. separate CinnLaunchContext class from `cinn_launch_op.h` and put it in a new independent file named `cinn_launch_context.h`, so that it can be included by others clearly.
-
Committed by liutiexing
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * Fix RecordEvent Co-authored-by: Nliutiexing <liutiexing@google.com>
-
Committed by YuanRisheng
* add alias kernel name * modify code as suggestions
-
Committed by wuhuanzhou
* save temp * add unittest, test=develop * fix ci error, test=develop * fix grad accuracy error, test=develop * fix unused error, test=develop * fix compilation error on Windows, test=develop * add unittest, test=develop * modify by review comment and add lerp_ * fix inplace api, test=develop * fix inplace api, test=develop * fix coverage error, test=develop
-
Committed by WangXi
-
Committed by crystal
* add boardcast_sub * add boardcast_sub
-
Committed by From00
* Fix CUDAGraph bug for StreamSafeCUDAAllocator * Add CUDAGrapthAllocator check in multi-stream interface * Set FLAGS_use_stream_safe_cuda_allocator defaulted to false * Fix environment error for cmake * Fix cmake error * Add UT of GetAllocatorInterfaceTest * Add UT of CUDAGraphExceptionTest * Enhance CUDAGraphExceptionTest
-
Committed by chentianyu03
-
Committed by feng_shuai
fix: when ceil_model==true && Padding_algo!=SAME and (x-size)/stride is not an integer, the conversion is wrong (#37929)
-
Committed by wanghuancoder
* refine a test case, test=develop * publish python c api for eager, test=develop * revert modify about test_allclose_layer.py, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * delete numpy includes, use pybind11 numpy.h, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * suport eager error msg, and add grad test case, test=develop * refine, test=develop * refine, test=develop * generate eager core ops, only 4 ops, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop
-
Committed by Zhanlue Yang
* Rearranged Eager AutoCodeGen directory structure * Removed USE_OP in Eager AutoCodeGen * Enabled generation for Operators without Grad/Inputs/Outputs * Resolved operators without input * Fixed merge conflicts * Enabled Eager AutoCodeGen for 10+ more operators
-
Committed by Yanxing Shi
-
Committed by sneaxiy
* fix CUDA Graph H2D bug again * fix no return bug
-
- 07 Dec, 2021 14 commits
-
-
Committed by LiYuRio
-
Committed by Zhanlue Yang
* Debug * Fixed issue with reset_grad_inplace_version when used with clear_gradient & cross-batch accumulation * Rearranged interfaces * Fixed ci issues
-
Committed by danleifeng
-
Committed by JingZhuangzhuang
* multithread_memory_optimize
-
Committed by Huihuang Zheng
Paddle doesn't have to set runtime_include_dir when running CINN.
-
Committed by TTerror
* format xpu op list * format xpu op list * update xpu1 op list
-
Committed by wanghuancoder
* refine a test case, test=develop * rm python, test=develop * refine, test=develop * fix cmake generate error, and fix circular import, test=develop
-
Committed by zmxdream
* fix heter service. test=develop * fix heter section worker in debug mode
-
Committed by wenbin
don't exit if requested_size < size
-
Committed by Zuza
* quantize slice op * correct test * fix code formatting
-
Committed by jianghaicheng
-
Committed by Zhanlue Yang
-
Committed by Leo Chen
-
Committed by Yuang Liu
-
- 06 Dec, 2021 11 commits
-
-
Committed by Huihuang Zheng
1. Modify git tag for CINN 2. Support compile options "-DWITH_CINN=ON, -DWITH_TESTING=OFF"
-
Committed by heliqi
* add test_unsqueeze2_eltwise_fuse_pass * fix name conflict * rebuild CI
-
Committed by chentianyu03
-
Committed by zmxdream
* update * update. test=develop * fix. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * update. test=develop * fix ut. test=develop * fix ut. test=develop * fix sample code. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix paddle.rot90 doc. test=develop * update ut. test=develop * fix. test=develop * fix .test=develop * fix .test=develop * fix doc. test=develop
-
Committed by Yuang Liu
-
Committed by ronnywang
-
Committed by danleifeng
-
Committed by Leo Chen
* enable sequential_run for standalone_executor * add ut * fix ut
-
Committed by wenbin
* fix * Update stack_op_plugin.cu comments
-
Committed by Chen Weihang
* fix reshape move storage error * remove needless set type * alloc tensor by shared storage
-
Committed by kuizhiqing
-
- 04 Dec, 2021 1 commit
-
-
Committed by zmxdream
This PR fixes the distributed service.
-
- 03 Dec, 2021 1 commit
-
-
Committed by Yuang Liu
-