- 29 3月, 2022 14 次提交
-
-
由 Aurelius84 提交于
* [Eager]Add sort-simple-yaml for automatically sort api|backward.yaml * remove it test=document_fix * refine * add more yaml * remove optional * fix infRT CI
-
由 Jacek Czaja 提交于
* - update of oneDNN to 2.5 * - changes to UT testing onednn verbose * - Update of oneDNN to 2.5.3 * - update onednn to 2.5.4
-
由 Aurelius84 提交于
-
由 Roc 提交于
* add random routing op add _random_routing api in utils add random routing ut * # This is a combination of 10 commits. # The first commit's message is: add expert count op add ut for expert_count # This is the 2nd commit message: update UT only for cuda # This is the 3rd commit message: fix for rocm # This is the 4th commit message: update ut # This is the 5th commit message: add moe module # This is the 6th commit message: add expert count op add ut for expert_count # This is the 7th commit message: update UT only for cuda # This is the 8th commit message: update ut # This is the 9th commit message: add moe module # This is the 10th commit message: make expert count private * add assign pos op * fix upper num name * add api _assign pos * add ut for assign pos op * update date * add op about moe gate update utils add limit by capacity op add ut for limit_by_capacity add ut for prune_gate_by_capacity add ut for limit_by_capacity add ut for prune_gate_by_capacity * fix for win * fix bugs in test_limit_by_capacity_op * update ut * update for test (timeout) * fix ut * update * update(fix) ut for win * moe apis in incubate * # This is a combination of 10 commits. # The first commit's message is: add expert count op add ut for expert_count # This is the 2nd commit message: update UT only for cuda # This is the 3rd commit message: fix for rocm # This is the 4th commit message: update ut # This is the 5th commit message: add moe module # This is the 6th commit message: add expert count op add ut for expert_count # This is the 7th commit message: update UT only for cuda # This is the 8th commit message: update ut # This is the 9th commit message: add moe module # This is the 10th commit message: make expert count private * add assign pos op * fix upper num name * add api _assign pos * add ut for assign pos op * update date * fix for win * update for test (timeout) * fix ut * update * fix ut for number count * add apis and utils * add gate apis * add moe and grad clip apis * update moe apis * add ops for moe gate * fix * update for base moe layer api * add random routing op add _random_routing api in utils add random routing ut * fix for dygraph * update with ranodm routing * update * fix ut for limit by capacity * update Co-authored-by: Nhlygit66666 <2570058140@qq.com>
-
由 wangxinxin08 提交于
* add elementwise sub and elementwise div in tensorrt op teller * add unittest of elementwise mul, sub and div
-
由 zhangkaihuo 提交于
-
由 tianshuo78520a 提交于
This reverts commit e77a947e.
-
由 shiyutang 提交于
* add_module_in_init_ * Update __init__.py * Update __init__.py
-
由 huangjun12 提交于
-
由 zhangyikun02 提交于
-
由 zn 提交于
-
由 0x45f 提交于
* Use _C_ops.yolov3_loss in eager mode for test_yolov3.py * fix code for test_yolov3_loss_op * remove useless import * Fix dygraph_mode flag
-
由 Aurelius84 提交于
* [Eager]Switch new Eager mode * switch into eager * fix typo
-
由 Jiabin Yang 提交于
-
- 28 3月, 2022 21 次提交
-
-
由 hong 提交于
* move momentum, rmsprop to phi; test=develop * update * update * update * update * udpate; test=develop * fix xpu npu bugs; test=develop * fix npu bug; test=develop * fix windows compile error; test=develop * fix windows compile error; test=develop * polish code; test=develop * fix conflict; test=develop * add meshgrid; * update * polish code * polish code; * fix bug * format; remove useless code * fix npu bug * fix bug
-
由 hong 提交于
* update * add forward case * update * update; test=develop * add some grad kernel; test=develop * move gpu kernel; test=develop * update * update; * update test; * fix selected rows bug; * add mix vector include ; * add mixed vector depen; test=develop * add logit grad signature; * polish code * fix bug; * add namespace for abs * revert code * not move softsign * revmove duplate register; * fix softsign bug * polish code * format * format * fix bug * remove cmake dep * add square sqrt selected rows support * update * remove clip norm * add standalone executor sqrt dep * standalone exec denp sqrt * remove sqrt op in cmkaelist * open some case
-
由 wangguanqun 提交于
* trainer and worker * delete commonsparsetable from gpups * delete vlog * codestyle * delete communicator from gpups
-
由 Haohongxiang 提交于
* add uts for EagerReducer * add more uts * fix bugs * fix bugs * modify * modify uts * fix bugs * update * update * update * solve conflicts and merge * add some other uts * modify time of uts * update * update * update * remove uts of resnet
-
由 Jiabin Yang 提交于
-
由 JZ-LIANG 提交于
* add FP16 Pass * Support the auto completion of while_op * acc aligned
-
由 chenjian 提交于
* no * fix bugs * fix doc according to review * fix api doc format * fix api doc according to review * fix bug and add unit test * fix record event bug
-
由 KP 提交于
-
由 caozhou 提交于
* fix code stype * update unitest
-
由 Linjie Chen 提交于
* fix outpu dtype of elementwise_div * fix ci * fix ci
-
由 danleifeng 提交于
* add fused_seqpool_cvm op;test=develop
-
由 Weilong Wu 提交于
-
由 Weilong Wu 提交于
* [Eager] Support SelectedRows in eager mode * Remove unnecessary codes * Adapt new dygraph flag
-
由 Ligoml 提交于
* update docs dtype(core.VarDesc.VarType) * fix code style, test=document_fix fix code style, test=document_fix Co-authored-by: NChen Long <1300851984@qq.com>
-
由 Guanghua Yu 提交于
* add adaround post-quant method
-
由 Zhanlue Yang 提交于
Enabled eager_mode for complex unit tests, except for test_complex_op.py and test_complex_view_op.py (#40887)
-
由 kuizhiqing 提交于
-
由 Aurelius84 提交于
* [Dy2Stat] Fix ForLoop Transformation with single return * [Dy2Stat] Fix ForLoop Transformation with single return
-
由 0x45f 提交于
* Refine test_lac.py for eager mode * refine code * Fix test_program_translator for eager
-
由 Aurelius84 提交于
* Fix bug while specifying target grad in high order gradient * add more unittest * add more unittest
-
由 Zhanlue Yang 提交于
-
- 27 3月, 2022 5 次提交
-
-
由 xiongkun 提交于
* first version, maybe many errors * refactor op_test * fix compare list * fix bg * fix bugs
-
由 Leo Chen 提交于
* fit for mkldnn and inplace op * fix compile * refine ut * register op version * fix inplace op * fix transfer_layout
-
由 hong 提交于
* move slice to pten * merge develop; test=develop * fix slice bug; * update * update * fix error * update * fix bug * polish code * polish code * polish code * try to fix windows bug * add gpu compile flag; * try to fix * remov template; * polish code; * fix npu bug; * fix npu bug * fix npu bug; test=develop * fix slice bug; * remove no need dep
-
由 Aganlengzi 提交于
* [NPU] fix npu cast ut * [NPU] fix npu cast ut
-
由 Jack Zhou 提交于
* add string tensor and case convert kernels * Add strings empty kernel; Reorganize the structure of case convert kernel * Add string infermeta * Update mutable_data of string tensor * rename kernel name * add string copy tmp * Fix strings copy device bug * add utf8 gpu converter * add string tensor c++ api * Remove mutable_data of string tensor * update string tensor interface * remove charcases_flag.h * remove some fluid headers * Add make_ddim * __HIPCC__ -> PADDLE_WITH_HIP * remove fluid headers * fix cpu compile * remove std::hash * Fix cudaMalloc * Remove strings/impl directory * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps * Add empty kernel test * Remove some comments * Modify lower/upper api encoding type: string->bool * STRING->PSTRING; Add CreateInferLikeMeta * Add code gen for C++ String API * remove strings_api_utils.h * Add ignore file (strings_api.h, strings_api.cc) * update strings gen script * change args order of case convert kernels * Add comments for pstring, StringTensor * cpstring_internal.h -> cpstring_impl.h * Update accordding to comments: 1. Remove fluid headers 2. paddle::platform::errors -> phi::errors 3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()' 4. Use camel code style * Remove all singletons in strings kernels * fix rocm compile * Fix py3 compile * Fix c++ coverage * 1. Add pstring proto type 2. Add StringTensor debug info 3. Rename case_convert_kernel to strings_lower_upper 4. Remove serialize derialize strings kernel * DataLayout::PSTRING -> DataLayout::PSTRING_UNION * Register pstring data type * Fix strings api gen * Fix dense tensor register pstring dtype * Fix error messages * remove line * add pstring unittest * remove test string api unitest * remove empty line * Remove some headers to decrease the size of executable file
-