- 28 3月, 2022 3 次提交
-
-
由 JZ-LIANG 提交于
* add FP16 Pass * Support the auto completion of while_op * acc aligned
-
由 Leo Chen 提交于
-
由 Thunderbrook 提交于
* So Parser * add macro * add macro * slotrecord * add macro * code format
-
- 27 3月, 2022 5 次提交
-
-
由 Leo Chen 提交于
* fit for mkldnn and inplace op * fix compile * refine ut * register op version * fix inplace op * fix transfer_layout
-
由 TeFeng Chen 提交于
* support check data type and mutable_data with compiled infos in paddle with cinn * update cinn_instruction_run_op_test with multi data type
-
由 hong 提交于
* move slice to pten * merge develop; test=develop * fix slice bug; * update * update * fix error * update * fix bug * polish code * polish code * polish code * try to fix windows bug * add gpu compile flag; * try to fix * remov template; * polish code; * fix npu bug; * fix npu bug * fix npu bug; test=develop * fix slice bug; * remove no need dep
-
由 From00 提交于
* Make StreamSafeCUDAAllocator compatible with NaiveBestFit strategy * Set FLAGS_use_stream_safe_cuda_allocator to false * Update * Remove unnecessary code * Fix CI errors * Add UT
-
由 Jack Zhou 提交于
* add string tensor and case convert kernels * Add strings empty kernel; Reorganize the structure of case convert kernel * Add string infermeta * Update mutable_data of string tensor * rename kernel name * add string copy tmp * Fix strings copy device bug * add utf8 gpu converter * add string tensor c++ api * Remove mutable_data of string tensor * update string tensor interface * remove charcases_flag.h * remove some fluid headers * Add make_ddim * __HIPCC__ -> PADDLE_WITH_HIP * remove fluid headers * fix cpu compile * remove std::hash * Fix cudaMalloc * Remove strings/impl directory * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps * Add empty kernel test * Remove some comments * Modify lower/upper api encoding type: string->bool * STRING->PSTRING; Add CreateInferLikeMeta * Add code gen for C++ String API * remove strings_api_utils.h * Add ignore file (strings_api.h, strings_api.cc) * update strings gen script * change args order of case convert kernels * Add comments for pstring, StringTensor * cpstring_internal.h -> cpstring_impl.h * Update accordding to comments: 1. Remove fluid headers 2. paddle::platform::errors -> phi::errors 3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()' 4. Use camel code style * Remove all singletons in strings kernels * fix rocm compile * Fix py3 compile * Fix c++ coverage * 1. Add pstring proto type 2. Add StringTensor debug info 3. Rename case_convert_kernel to strings_lower_upper 4. Remove serialize derialize strings kernel * DataLayout::PSTRING -> DataLayout::PSTRING_UNION * Register pstring data type * Fix strings api gen * Fix dense tensor register pstring dtype * Fix error messages * remove line * add pstring unittest * remove test string api unitest * remove empty line * Remove some headers to decrease the size of executable file
-
- 25 3月, 2022 4 次提交
-
-
由 Aurelius84 提交于
* [Phi] Migrate Adam and Adamw into Phi * fix compile error and unittest ok * fix compile error and unittest ok * fix undefined reference to fLI::FLAGS * test depend on operator * fix cmake * fix xpu compile * fix infrt * fix amp_type_traits * fix amp_type_traits * modify according reviewer * modify according reviewer * fix dtype float16 * fix typo * fix Cmake * fix code style
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * Update ThreadDataRegistry Co-authored-by: Nliutiexing <liutiexing@google.com>
-
由 Chen Weihang 提交于
* move part sum op kernel * remove deprecated names
-
由 FlyingQianMM 提交于
-
- 24 3月, 2022 8 次提交
-
-
由 Leo Chen 提交于
-
由 jiangcheng 提交于
* fix build_cinn_pass internal var may be control var problem * add annotation and vlog by review advice
-
由 zyfncg 提交于
* support intermediate for saprse api * close intermediate in yaml * fix dygraph_api dep for eager
-
由 zhangbo9674 提交于
* approve amp for intermediate_dygraph * add amp_utils for intermediate_dygraph * add amp needcast check for mlu & npu * test unittest * add SetGradNode for set_stop_gradient && add checktensor for GradientHooks * refine code * refien unittest of imperative_amp for new dygraph * inplace api skip amp * add test_imperative_qat_amp for intermediate amp * refine code * refine test_amp ci strategy * refine unittest code * refine amp_utils code * refine amp getpromotetype for some special op * refine unittest code
-
由 joanna.wozna.intel 提交于
* Correct MultipleQuantizeSquash * Correct logging
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Add EventsWaiter * update * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * update * update Error MSG * update EventsWaiter * update Co-authored-by: Nliutiexing <liutiexing@google.com>
-
由 seemingwang 提交于
* extract sub-graph * graph-engine merging * fix * fix * fix heter-ps config * test performance * test performance * test performance * test * test * update bfs * change cmake
-
由 Chen Weihang 提交于
* add mul phi kernel * remove mul op kernel * remove original mul grad op * fix cinn test * fix dygraph test failed
-
- 23 3月, 2022 6 次提交
-
-
由 Leo Chen 提交于
-
由 zhaocaibei123 提交于
* fix benchmark and communicator config * fix bugs of the_one_ps * multi program and fix bug in optimizer * multi program in the_one_ps * public commcontext * ps optimizer multi programs * cvm & datanorm backend * fix dim * fix unittest * fix * the one ps merge * remove comm * add DownpourLiteWorker * all * fix * fix * device worker downpour lite * fix * fix bug in global shuffle * save inference model * fix & add log * fix * remove log * fix * fix save summary * fix * fix pscore * fix * fix * fix * fix * fix * remove logs * fix * fix * fix * fix * fix * add some comments * fix Co-authored-by: Nesythan <esythan@126.com>
-
由 liutiexing 提交于
* AddAwaitableTask for WorkQueue Co-authored-by: Nliutiexing <liutiexing@google.com>
-
由 YuanRisheng 提交于
* remove flatten infermeta * fix bugs when run inference ci * fix bugs when run inference ci * fix bugs when run ci * support infrt * inplace infershape code'
-
由 YuanRisheng 提交于
* move activation * fix bugs when run ce
-
由 Zhanlue Yang 提交于
* Removed redundant use of declarations.h * Fixed minor bug
-
- 22 3月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* async prepare deps * fix bug that std::future is not set * add ut * refine code * fix standalone ut * disable prof
-
- 21 3月, 2022 3 次提交
-
-
由 From00 提交于
* Move conv-transpose OPs to phi * Fix CI errors * Fix CI errors
-
由 Siming Dai 提交于
* gpu ps graph engine * remove logs * Add neighbor sampling method * Add actual_sample_size and offset for sampling * Delete Chinese comment * Fix code style Co-authored-by: seemingwang <zsasuke@qq.com>
-
由 Allen Guo 提交于
* sync changes * copy sOpNamescope * fix UTs * add authors Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NAllen Guo <alleng@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai> * fix code-format * fix compile error * add comments for feed_op Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>
-
- 18 3月, 2022 3 次提交
-
-
由 shentanyue 提交于
* add gelu * fix gelu * add log_softmax * add prelu kernel and prelu/gelu/logsoftmax infershape * fix * fix * fix * fix * fix ci * log_softmax rewrite * fix * fix * fix conflict * fix compile error * fix comment * fix * ci_fix Co-authored-by: NYan Li <liyan665@gmail.com>
-
由 Zhang Zheng 提交于
* first commit * fix compile error * support std::vector<std::srting> * fix * fix op support on GPU by chenweihang * pass test * infershape * add set_dtype * fix order * fix * unify the impl of dt and sr * fix
-
由 chentianyu03 提交于
* move reduce_mean_grad kernel into phi * move reduce_max/min_grad into phi * remove raw max/min grad kernel * fix bug * fix max/min grad error * move all reduce_grad kernel into one file * add prod grad kernel * add infermeta for prod kernel
-
- 17 3月, 2022 6 次提交
-
-
由 Chen Weihang 提交于
* move assign kernel init commit * change vec<tensor> to vec<tensor*> * support tensor array * support api declare * fix test_list failed * fix npu and xpu failed * fix infrt failed * remove assign array size in operator * move assign sr header into sr dir * add infermeta for assign * test op success * fix test_list failed * fix kunlun failed * add set host allocator in tests * support tensor array in arg ctx * open set layout in share_meta * fix meta tensor layout error * fix test failed
-
由 seemingwang 提交于
* extract sub-graph * graph-engine merging * fix * fix * fix heter-ps config
-
由 TeFeng Chen 提交于
-
由 zyfncg 提交于
-
由 YuanRisheng 提交于
-
由 baoachun 提交于
-
- 16 3月, 2022 1 次提交
-
-
由 Zuza 提交于
* Quantize elementwise mul op * Parametrize elementwise functions * Fix code formatting
-