提交 · f6004ab908a0636765641cbb02bd89d97a5ae4cd · Crayon鑫 / Paddle

31 8月, 2021 4 次提交
- Q
  [NPU] fix cmake for ascend ci, test=develop (#35255) · f6004ab9
  由 Qi Li 提交于 8月 31, 2021
```
* [NPU] fix cmake for ascend ci, test=develop

* update paddle_build.sh scripts, test=allcase
```
  f6004ab9
- W
  
  update infer trt ut. (#35261) · 96e7d903
  由 Wilber 提交于 8月 31, 2021
  
  96e7d903
- A
  
  NPU add elementwise_mod (#35245) · 561841d2
  由 Aganlengzi 提交于 8月 31, 2021
  
  561841d2
- A
  
  NPU add fill_zeros_like kernel (#35246) · aaaa9965
  由 Aganlengzi 提交于 8月 31, 2021
  
  aaaa9965
30 8月, 2021 3 次提交
- X
  [Paddle Inference-TRT]Adding six op unittest codes of TRT INT8 (#35130) · 39565147
  由 xiaoxiaohehe001 提交于 8月 30, 2021
```
* add_op_unittest
```
  39565147
- Z
  [NPU] Add log_loss op (#35010) · b94d7ff3
  由 zhulei 提交于 8月 30, 2021
```
* [NPU] Add log_loss op

* [NPU] Add log_loss op

* [NPU] Add log_loss op
```
  b94d7ff3
- X
  Set value (#34886) · 37d281c9
  由 xiongkun 提交于 8月 30, 2021
```
* tmp

* Tile - Assign - Crop

* Finish the set value npu kernel and test case in npu

* improve the error message

* Modify according to zhangliujie

* code review
```
  37d281c9
27 8月, 2021 15 次提交
- J
  
  add uniform_ op and UT (#33934) · be29b8ee
  由 JYChen 提交于 8月 27, 2021
  
  be29b8ee
- X
  Add unpool2d op & Expose max_unpool2d API (#35056) · ceee71a0
  由 xiaoting 提交于 8月 27, 2021
```
* add maxunppol2d op, test=develop

* fix typo, test=develop

* fix unpool unitest, test=develop

* fix unpool code-example, test=develop

* fix for unpool_op_unittest,test=develop

* fix example code, test=develop

* add noqa:F401, test=develop

* fix converage, test=develop

* fix unitest for unpool, test=develop

* rename unpool2d to unpool, test=develop

* rename unpool2d to unpool, test=develop
```
  ceee71a0
- G
  sparse_momentum_op is used to save w@GRAD memory for gather_op (#34942) · 234ce932
  由 Guoxia Wang 提交于 8月 27, 2021
```
* sparse_momentum_op is used to save w@GRAD memory for gather_op when gather from a large parameter
```
  234ce932
- W
  
  [hybrid] Fix row parallel linear bias (#35186) · 1533d7e2
  由 WangXi 提交于 8月 27, 2021
  
  1533d7e2
- H
  
  Update test_cross_entropy_loss.py · e838cacf
  由 HydrogenSulfate 提交于 8月 26, 2021
  
  e838cacf
- H
  
  Update test_cross_entropy_loss.py · ee070fbd
  由 HydrogenSulfate 提交于 8月 16, 2021
  
  ee070fbd
- H
  
  Update test_cross_entropy_loss.py · 7afd7f3d
  由 HydrogenSulfate 提交于 8月 16, 2021
  
  7afd7f3d
- H
  
  Update test_cross_entropy_loss.py · f6dc4b6b
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  f6dc4b6b
- H
  
  Update test_cross_entropy_loss.py · 0bf32484
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  0bf32484
- H
  
  Update test_cross_entropy_loss.py · 23cc2142
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  23cc2142
- H
  
  Update test_cross_entropy_loss.py · d1a11056
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  d1a11056
- H
  
  Update test_cross_entropy_loss.py · c61027e8
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  c61027e8
- H
  
  fix weighted CE loss's bug · 12bcd023
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  12bcd023
- B
  add elementwise max grad op for npu (#34862) · 5310ceab
  由 baoachun 提交于 8月 27, 2021
```
* add elementwise max grad op for npu

* add elementwise max grad op for npu

* add elementwise max grad op for npu

* add elementwise max grad op for npu

* add elementwise max grad op for npu
```
  5310ceab
- W
  Polish the error message of paddle.slice. (#35179) · 669853f5
  由 WeiXin 提交于 8月 27, 2021
```
* polish the error message of paddle.slice.

* polish code.
```
  669853f5
26 8月, 2021 6 次提交
- W
  support tensor index. (#34824) · e7df47ec
  由 WeiXin 提交于 8月 26, 2021
```
* polish code

* polish code.

* polish code.

* polish code.

* polish code.
```
  e7df47ec
- A
  Support Multi-Stream, Single-Thread in New Executor (#35024) · 678a259a
  由 Aurelius84 提交于 8月 26, 2021
```
* Modify into QueueSync QueueAsync

* fix complie on MacOS

* fix pointer

* fix conflict

* polish unittest

* fix windows fetch error

* polish code according reviewer

* fix device_guard on CPU place
```
  678a259a
- B
  
  [NPU] Support npu kernel for StridedSlice op without grad (#34601) · fa6c59a4
  由 Bo Liu 提交于 8月 26, 2021
  
  fa6c59a4
- fix iscan python bug (#35148) · 223c01fd
  由 zhouweiwei2014 提交于 8月 26, 2021
  
  223c01fd
- S
  Add roi align op npu (#34973) · 289e1818
  由 shiyutang 提交于 8月 26, 2021
```
* add_roi_align_npu

* update

* update

* update
```
  289e1818
- W
  
  [Inference] Replace unordered_map with map to support subgraph stability (#35147) · a1aae040
  由 Wilber 提交于 8月 26, 2021
  
  a1aae040
25 8月, 2021 4 次提交
- J
  Fix for expand_v2 op (#35101) · 1f34f7ec
  由 jakpiase 提交于 8月 25, 2021
```
* temporary change

* fix for expand_v2

* changes after review, activated ppyolov inference test
```
  1f34f7ec
- A
  SGD BF16 functional test. (#34648) · d618de2d
  由 Adam Osewski 提交于 8月 25, 2021
```
* Enable BF16 for creating global tensor and reduce_mean.

* Functional test with small model.
```
  d618de2d
- R
  
  [NPU] add npu_one_hot_v2 (#34937) · d710c3a0
  由 ronnywang 提交于 8月 25, 2021
  
  d710c3a0
- T
  
  update elementwise api in kunlun (#35021) · ff96a7d5
  由 taixiurong 提交于 8月 25, 2021
  
  ff96a7d5
24 8月, 2021 8 次提交

L

add checker, test=develop (#35109) · 881e55e4
由 lilong12 提交于 8月 24, 2021

881e55e4
W

fix convert ut framework problem (#35112) · a8dee3bb
由 Wilber 提交于 8月 24, 2021

a8dee3bb

Add no_sync in data parallel for dynamic graph (#34740) · b09f4d7f

由 Haohongxiang 提交于 8月 24, 2021

* Add no_sync in data parallel for dynamic graph

* modify UT of no_sync

* delete test_parallel_dygraph_dataparallel_no_sync.py

* add test_parallel_dygraph_no_sync.py

* modify run_trainer_with_spawn in UTs

* Add UT of complex control flow in no_sync

* add specific descriptions and notes for no_sync

* check code style

* modify UT's TIMEOUT in CMakeLists.txt

b09f4d7f

A
Update LearningRate for test fit a line BF16 (#34653) · 36f7e751
由 Adam Osewski 提交于 8月 24, 2021
```
* Small corrections.

* Fix lr for bf16.

* Revert some changes.
```
36f7e751

[oneDNN] Concat refactoring and disabling caching (#35002) · d9c0f09b

由 Jacek Czaja 提交于 8月 24, 2021

* - concat refactoring draft

* - cmpilation fixes

* - yet another compilation fix

* - fix

* - compilation fix

* - fixes to compilation

* - another compilation fix

* - fix

* - Added overloaded AcquirePrimitiveDesc for concat

* - fix

* - reserve introduced

* - UT fixes

* - test concat int8 improved

* - fixes

* - fix to crash

* - lint fixes

* - fixes after review

* - some other fixes from review

d9c0f09b

R
[NPU] add conv_op_npu and test (#34055) · 00a269de
由 ronnywang 提交于 8月 24, 2021
```
* add conv_op_npu and test

* add more tests

* clean headers & support fp16

* update
```
00a269de
R
[NPU] add pool2 op and tests (#34770) · da261732
由 ronnywang 提交于 8月 24, 2021
```
* add pool2d_op_npu and test

* update

* update pool2d_backward_navie

* clean headers
```
da261732

Add auto completion module for auto parallel (#34813) · 93d862b0

由 Yulong Ao 提交于 8月 24, 2021

* add auto_parallel dir

* mv to paddle.distributed

* add shard_xx api

* add distributed attrs for var

* add ut, test=develop

* add dist

* update

* update

* update

* update

* update

* update, test=develop

* update, test=develop

* update, test=develop

* update, test=develop

* update, test=develop

* update, test=develop

* update, test=develop

* update

* update

* update

* update

* update

* update, test=develop

* update, test=develop

* update

* update

* delete unused proto

* resotre op_desc

* restore type_defs

* update var_desc

* remove dimss_mapping for proto_pybind

* update interface.py

* update framework.py

* update

* update

* add auto_parallel dir

* mv to paddle.distributed

* add shard_xx api

* add distributed attrs for var

* add ut, test=develop

* [WIP] Add the auto completion feature and related codes

* [WIP] Improve the auto completion and related codes

* [WIP] Make the auto completion to support data-parallel

* [WIP] Make the completion support mp and dp+mp

* [WIP] Refactor auto completion unit test for MLP

* [WIP] Refactor the implementation of DistributedOperatorImpl

* [WIP] Improve dims_mapping update rule and fix a bug

* [WIP] Support auto completion for one transformer decoder layer

* [WIP] Add a minor change

* [WIP] Fix a bug within the uint test

* Shard XShape tensor, add embedding completion and refactor code

* Add the distributed_operators dir to setup.py.in

* Improve the completion process and add the unittest for gpt

* fix process_mesh ut

* fix process_mesh ut

* update

* update, test=develop

* Add support for automatically completing distributed attrs of special ops

* update

* update

* update

* fix doc sample codes, test=develop

* improve coverage, test=develop

* add static_mode check, test=develop

* Model the cluster for cost model and physical mapping

* update, test=develop

* add set_placement, test=develop

* Add the check to make sure the candidate tensors' size is great than zero

* update doc, test=develop

* update doc, test=develop

* update doc, test=develop

* update doc, test=develop

* update, test=develop

* Auto mark dist attrs annotated by user

* update ndarray to nested list, test=develop

* update, test=develop

* Add auto-completion module for auto-parallel (based on PR#33804)

* Remove unnecessary files

* Remove unrelated files for the auto completion pr

* Update the unit test to improve the coverage

* Modify codes based on reviews

* Minor changes for CI

* Improve some codes based on new comments

* Fix bugs caused by shallow copy in attributes.py
* Imporve amend_distributed_attr_for_program in context.py
* Other changes for weihang's comments
Co-authored-by: Nsandyhouse <lilong12@baidu.com>

93d862b0

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致