- 04 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 16 5月, 2022 1 次提交
-
-
由 liutiexing 提交于
* WorkQueue supports always_spinning option * update * update * fix stat
-
- 12 5月, 2022 1 次提交
-
-
由 Shuangchi He 提交于
-
- 01 5月, 2022 1 次提交
-
-
由 Lijunhui 提交于
-
- 27 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* unify utils naming style * polish details
-
- 26 4月, 2022 1 次提交
-
-
由 zyfncg 提交于
* optimize the performanece of sum api * optimize IsDenseTensorInput * remove debug log
-
- 25 4月, 2022 2 次提交
-
-
由 Chen Weihang 提交于
* init commit * remove two hash impl * fix bug * polish details * fix compile failed * fix compile failed * fix compile failed * add default kernel sig cache * fix get kernel arg defs error * remove kernel arg defs cache * fix origin op execute
-
由 tiancaishaonvjituizi 提交于
* fix bug of std::move and others * fix an compile error in debug mode * fix wrong copy assignment operator Signed-off-by: Ntiancaishaonvjituizi <452565578@qq.com> * reformat Signed-off-by: Ntiancaishaonvjituizi <452565578@qq.com> * reformat Signed-off-by: Ntiancaishaonvjituizi <452565578@qq.com> * fix ArrayRef constructor following llvm * fix format * fix conflict with master
-
- 24 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* add variant and replace any * split attribute
-
- 21 4月, 2022 2 次提交
-
-
由 liutiexing 提交于
* WorkQueue supports always_spinning option * update * update
-
由 Ruibiao Chen 提交于
* Support cinn_launch OP in standalone executor * Remove some redundant code
-
- 20 4月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* clear the listener after run * only sync variables in program * refine code * fit for lod_tensor_blocking_queue
-
- 19 4月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
[Phi]Separate AddKernel/DivideKernel/SubtractKernel/MultiplyKernel from ElementwiseKernel(Part1) (#41806) * seperate add/div/sub/mul from elementwise * delete code * fix compile bugs * deal with conflict * fix bugs when compile * fix windows unit test bug * fix ci converage bugs
-
- 18 4月, 2022 2 次提交
- 17 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* split phi and fluid infermeta context * resolve conflict * fix type error * optimize scheduling perf * spec small vector size * replace all grad var name * fix test failed * move init defalut signature * polish details * polish details * fix no init bug * init sig for tests * add init sig for infer * fix infrt error * fix infrt failed * fix kunlun error * fix infrt failed
-
- 15 4月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* split reduce_kernel * rm reduce_kernel in cmake * split reduce_grad kernels * fix cmake build error * format code * fix standalone_executor_test error
-
- 14 4月, 2022 2 次提交
-
-
由 Lijunhui 提交于
* regist elementwise_xxx
-
由 liutiexing 提交于
* executor perf statistics * fix ut * fix ut * fix ut * add ut * add ut
-
- 13 4月, 2022 1 次提交
-
-
由 zyfncg 提交于
* remove stack_grad infershape * fix bug of output with null * fix bug
-
- 12 4月, 2022 1 次提交
-
-
由 Leo Chen 提交于
-
- 09 4月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* fix bug that no thread is waked up when adding task to threadpool * fix typo
-
- 07 4月, 2022 1 次提交
-
-
由 liutiexing 提交于
* Profile Executors * update * fix ut * fix names * update * update
-
- 05 4月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* enable new executor by default * enable stream safe allocator * test=document_fix;test=coverage * do not use scope in op kernel * fit empty program for new executor * fix communication depend * fix test_sync_batch_norm * skip unsupported place * refine datatransfer * fit for dirtributed program * fix dependencpy * fix some ut
-
- 03 4月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* add concat_grad kernel * fix error * remove comment code * fix outs nullptr error * change to phi header * add concat_grad declare for standalone_executor_test
-
- 02 4月, 2022 2 次提交
- 01 4月, 2022 3 次提交
-
-
由 liutiexing 提交于
* fix mac c++ version * update * fix apple systems
-
由 Chen Weihang 提交于
* add cross_entropy_with_softmax phi kernel * remove softmax_with_cross_entropy kernel * add softmax_with_cross_entropy grad kernel * remove original op kernel * refine cross entropy impl * fix pointer error * revert kernel cu change * fix xpu failed * fix cinn failed * fix npu failed * add forward sig * add check_nan_inf for pt kernel * remove repeat cmake item * fix unittest error
-
由 Leo Chen 提交于
* move WaitEvent/RecordEvent into try-catch * refine supportNpu
-
- 31 3月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* fix bug that some op has no op_role attr * add mkldnn support for new executor * fit for mkldnn data_transfer * fit for mkldnn data_transfer
-
- 30 3月, 2022 1 次提交
-
- 29 3月, 2022 3 次提交
-
-
由 liutiexing 提交于
-
由 tianshuo78520a 提交于
This reverts commit e77a947e.
-
由 From00 提交于
-
- 28 3月, 2022 2 次提交
-
-
由 hong 提交于
* update * add forward case * update * update; test=develop * add some grad kernel; test=develop * move gpu kernel; test=develop * update * update; * update test; * fix selected rows bug; * add mix vector include ; * add mixed vector depen; test=develop * add logit grad signature; * polish code * fix bug; * add namespace for abs * revert code * not move softsign * revmove duplate register; * fix softsign bug * polish code * format * format * fix bug * remove cmake dep * add square sqrt selected rows support * update * remove clip norm * add standalone executor sqrt dep * standalone exec denp sqrt * remove sqrt op in cmkaelist * open some case
-
由 Leo Chen 提交于
-
- 27 3月, 2022 3 次提交
-
-
由 Leo Chen 提交于
* fit for mkldnn and inplace op * fix compile * refine ut * register op version * fix inplace op * fix transfer_layout
-
由 hong 提交于
* move slice to pten * merge develop; test=develop * fix slice bug; * update * update * fix error * update * fix bug * polish code * polish code * polish code * try to fix windows bug * add gpu compile flag; * try to fix * remov template; * polish code; * fix npu bug; * fix npu bug * fix npu bug; test=develop * fix slice bug; * remove no need dep
-
由 From00 提交于
* Make StreamSafeCUDAAllocator compatible with NaiveBestFit strategy * Set FLAGS_use_stream_safe_cuda_allocator to false * Update * Remove unnecessary code * Fix CI errors * Add UT
-