- 31 Mar, 2022 3 commits
-
-
Committed by pangyoki
* support view strategy in eager_fluid state * little change * little change * optimize unittest * fix
-
Committed by pangyoki
-
Committed by pangyoki
* Supported Complex2Real Conversion for Eager Dygraph * Supported Complex2Real Conversion for Eager Dygraph * Enabled complex type promotion test for matmul_v2 * pylayer, test=develop * Fix CI issues * Support initializing specific grad tensors to zero for selected operators * finish forward, test=develop * create grad node finish, test=develop * Merged adj_edges_ with GradSlotMeta * Fixed minor issue * backward finish, start dbg, test=develop * Adjusted num runs * Recovered Eager performance tests configurations * Recovered Eager performance tests configurations * finish, test=develop * polish, test=develop * polish, test=develop * refine, test=develop * eager, test=develop * Adjusted performance tests configurations * Fixed Minor Issues with performance tests * [Phi] Fix macro name typo * support set_materialize_grads, test=develop * support mark_non_differentiable, test=develop * support once_differentiable, test=develop * refine, test=develop * refine, test=develop * Moved out Edge from GradSlotMeta * Fixed issues from merge * Fixed typo * Addressed review comments * Fixed merge issues * Fixed minor issues * Fixed minor issue * refine, test=develop * refine, test=develop * refine, test=develop * Fixed major issues and enabled auto_prune test cases * Fixed issues from merge * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * support inplace for pylayer Co-authored-by: jim19930609 <jim19930609@gmail.com> Co-authored-by: Wang Huan <wanghuan29@baidu.com> Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
-
- 30 Mar, 2022 30 commits
-
-
Committed by ziyoujiyi
* lazy import * log error
-
Committed by From00
-
Committed by 0x45f
-
Committed by zhaoyingli
* fix converter when sliced_shape is 1 * update unittest
-
Committed by ziyoujiyi
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud comm ready * correct pass not register * back * back * . * .
-
Committed by pangyoki
-
Committed by Roc
* add random routing op add _random_routing api in utils add random routing ut * # This is a combination of 10 commits. # The first commit's message is: add expert count op add ut for expert_count # This is the 2nd commit message: update UT only for cuda # This is the 3rd commit message: fix for rocm # This is the 4th commit message: update ut # This is the 5th commit message: add moe module # This is the 6th commit message: add expert count op add ut for expert_count # This is the 7th commit message: update UT only for cuda # This is the 8th commit message: update ut # This is the 9th commit message: add moe module # This is the 10th commit message: make expert count private * add assign pos op * fix upper num name * add api _assign pos * add ut for assign pos op * update date * add op about moe gate update utils add limit by capacity op add ut for limit_by_capacity add ut for prune_gate_by_capacity add ut for limit_by_capacity add ut for prune_gate_by_capacity * fix for win * fix bugs in test_limit_by_capacity_op * update ut * update for test (timeout) * fix ut * update * update(fix) ut for win * moe apis in incubate * # This is a combination of 10 commits. 
# The first commit's message is: add expert count op add ut for expert_count # This is the 2nd commit message: update UT only for cuda # This is the 3rd commit message: fix for rocm # This is the 4th commit message: update ut # This is the 5th commit message: add moe module # This is the 6th commit message: add expert count op add ut for expert_count # This is the 7th commit message: update UT only for cuda # This is the 8th commit message: update ut # This is the 9th commit message: add moe module # This is the 10th commit message: make expert count private * add assign pos op * fix upper num name * add api _assign pos * add ut for assign pos op * update date * fix for win * update for test (timeout) * fix ut * update * fix ut for number count * add apis and utils * add gate apis * add moe and grad clip apis * update moe apis * add ops for moe gate * fix * update for base moe layer api * add random routing op add _random_routing api in utils add random routing ut * fix for dygraph * update with random routing * update * fix ut for limit by capacity * update * update limit by capacity for easily to switch to single thread mode * update api docs Co-authored-by: hlygit66666 <2570058140@qq.com>
-
Committed by Haohongxiang
* fix uncontrolled randomness of op * fix bugs
-
Committed by From00
Add new APIs for GPU memory monitoring (max_memory_allocated, max_memory_reserved, memory_allocated, memory_reserved) (#38657) * Add new API memory_reserved * Add memory_allocated, max_memory_reserved and max_memory_allocated * Fix CI error * Fix CI error * Enhance UT * Add FLAGS_memory_stats_opt * Add STATS macro functions * Add StatAllocator * Fix CI errors * Add UT * Fix CI errors
-
Committed by caozhou
-
Committed by Zhanlue Yang
* [Refactor] refactored eager_gen.py PR #2 * [DoubleGrad PR #1] Decoupled code generation logics for Dygraph ForwardFunctions and GradNodes * Fixed minor issue * Adjusted logics of GenerateNodeCreationCodes and GenerateForwardDefinition * Fixed issues * Supported higher-order grad node generation * [DoubleGrad PR #4] Supported higher-order GradNode generation * Fixed yaml typo
-
Committed by 努力努力在努力丶
-
Committed by pangyoki
-
Committed by 0x45f
* Switch some dy2st UT to eager mode * Add UT
-
Committed by crystal
* remove set_value numpy * optimize code * optimize to_tensor * use common function Co-authored-by: root <root@yq01-sys-hic-k8s-v100-box-a225-0186.yq01.baidu.com>
-
Committed by Aurelius84
* [Yaml] Fix topk yaml compilation on Windows * fix make_shared * fix conflict
-
Committed by ykkk2333
* add bilinear interpolate v2 to xpu list and unittest, *test=kunlun * Delete ps_usr_print_log * Delete ps_usr_print_log * Delete xpu_op_test
-
Committed by Aurelius84
-
Committed by wanghuancoder
* dlpack eager, test=develop * eager test_base_layer, test=develop * fix error report, test=develop * eager _getitem_from_offset, test=develop * refine, test=develop * refine offset, test=develop * add test_inner test_outer, test=develop * refine, test=develop * refine, test=develop
-
Committed by xiongkun
* first version, maybe many errors * refactor op_test * fix compare list * fix bg * fix bugs * skip name
-
Committed by houj04
* swish and pow op for xpu. test=kunlun * fix code style. test=kunlun. * use pow_grad xdnn api. test=kunlun.
-
Committed by zhaocaibei123
-
Committed by pangyoki
* support inplace in tensor_method_setitem * delete bump_inplace_version * optimize inplace unittest * fix * fix setitem bug * update eager_generator * optimize inplace unittest * little change
-
Committed by Zhanlue Yang
-
Committed by zyfncg
* refactor code auto-gene for no_need_buffer * fix some bug * delete test code
-
Committed by pangyoki
* support view strategy in eager_final state * perfect reshape kernel * fix bugs of sig * add unittest for reshape_sig * fix bugs when run coverage * fix inplace bug in final_state eager_gen * fix python_c_gen * support view strategy for final state * fix order of out and xshape in reshape * fix Coverage_CI unittest timeout error * support reshape view * fix reshape_sig * fix yml and api_base Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
-
Committed by Zhang Ting
-
Committed by wawltor
* fix the argsort cpu * add the test case for the paddle.argsort
-
Committed by wanghuancoder
* Supported Complex2Real Conversion for Eager Dygraph * Supported Complex2Real Conversion for Eager Dygraph * Enabled complex type promotion test for matmul_v2 * pylayer, test=develop * Fix CI issues * Support initializing specific grad tensors to zero for selected operators * finish forward, test=develop * create grad node finish, test=develop * Merged adj_edges_ with GradSlotMeta * Fixed minor issue * backward finish, start dbg, test=develop * Adjusted num runs * Recovered Eager performance tests configurations * Recovered Eager performance tests configurations * finish, test=develop * polish, test=develop * polish, test=develop * refine, test=develop * eager, test=develop * Adjusted performance tests configurations * Fixed Minor Issues with performance tests * [Phi] Fix macro name typo * support set_materialize_grads, test=develop * support mark_non_differentiable, test=develop * support once_differentiable, test=develop * refine, test=develop * refine, test=develop * Moved out Edge from GradSlotMeta * Fixed issues from merge * Fixed typo * Addressed review comments * Fixed merge issues * Fixed minor issues * Fixed minor issue * refine, test=develop * refine, test=develop * refine, test=develop * Fixed major issues and enabled auto_prune test cases * Fixed issues from merge * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop Co-authored-by: jim19930609 <jim19930609@gmail.com> Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
-
- 29 Mar, 2022 7 commits
-
-
Committed by Aurelius84
* [Eager]Add sort-simple-yaml for automatically sort api|backward.yaml * remove it test=document_fix * refine * add more yaml * remove optional * fix infRT CI
-
Committed by Jacek Czaja
* - update of oneDNN to 2.5 * - changes to UT testing onednn verbose * - Update of oneDNN to 2.5.3 * - update onednn to 2.5.4
-
Committed by Aurelius84
-
Committed by Roc
* add random routing op add _random_routing api in utils add random routing ut * # This is a combination of 10 commits. # The first commit's message is: add expert count op add ut for expert_count # This is the 2nd commit message: update UT only for cuda # This is the 3rd commit message: fix for rocm # This is the 4th commit message: update ut # This is the 5th commit message: add moe module # This is the 6th commit message: add expert count op add ut for expert_count # This is the 7th commit message: update UT only for cuda # This is the 8th commit message: update ut # This is the 9th commit message: add moe module # This is the 10th commit message: make expert count private * add assign pos op * fix upper num name * add api _assign pos * add ut for assign pos op * update date * add op about moe gate update utils add limit by capacity op add ut for limit_by_capacity add ut for prune_gate_by_capacity add ut for limit_by_capacity add ut for prune_gate_by_capacity * fix for win * fix bugs in test_limit_by_capacity_op * update ut * update for test (timeout) * fix ut * update * update(fix) ut for win * moe apis in incubate * # This is a combination of 10 commits. 
# The first commit's message is: add expert count op add ut for expert_count # This is the 2nd commit message: update UT only for cuda # This is the 3rd commit message: fix for rocm # This is the 4th commit message: update ut # This is the 5th commit message: add moe module # This is the 6th commit message: add expert count op add ut for expert_count # This is the 7th commit message: update UT only for cuda # This is the 8th commit message: update ut # This is the 9th commit message: add moe module # This is the 10th commit message: make expert count private * add assign pos op * fix upper num name * add api _assign pos * add ut for assign pos op * update date * fix for win * update for test (timeout) * fix ut * update * fix ut for number count * add apis and utils * add gate apis * add moe and grad clip apis * update moe apis * add ops for moe gate * fix * update for base moe layer api * add random routing op add _random_routing api in utils add random routing ut * fix for dygraph * update with random routing * update * fix ut for limit by capacity * update Co-authored-by: hlygit66666 <2570058140@qq.com>
-
Committed by wangxinxin08
* add elementwise sub and elementwise div in tensorrt op teller * add unittest of elementwise mul, sub and div
-
Committed by zhangkaihuo
-
Committed by tianshuo78520a
This reverts commit e77a947e.
-