- 18 Nov, 2021 12 commits
-
-
Committed by xiayanming
* fleet support elastic train * fleet support elastic train * support elastic * add unittest * fix unittest bug * fix unittest bug * fix unittest bug * fix unittest coverage * fix unittest coverage * fix unittest coverage * fix unittest coverage * fix unittest coverage * fix elastic bug * fix ci fail * fix ci fail * fix elastic bug * fix elastic bug * fix joint debugging bug * fix joint debugging bug * fix windows ci failed * fix windows ci failed * Optimize fleet elastic scale in/out * elastic support pre hook * add prehook unittest
-
Committed by YuanRisheng
* elementwise_add kernel refactor * fix compile bugs in elementwise_add refactor * fix compile bugs when run in npu/xpu * fix bugs when run unit test * fix bugs when run ci-windows * modify code as recommended * code format adjust * fix bugs when run ci * fix compile bug when run in ci-windows * elementwise_sub refactor * add PD_DLL_DECL for elementwise_sub * fix bugs when compile
-
Committed by zhangbo9674
-
Committed by Shang Zhizhou
-
Committed by zmx
-
Committed by Yuang Liu
-
Committed by LielinJiang
* fix cache * Fix unittest
-
Committed by LielinJiang
* Fix the slow running speed of kl_div when option reduction is set * fix unittest coverage
-
Committed by LielinJiang
-
Committed by tianshuo78520a
* add benchmark
-
Committed by Zhen Wang
* Add the `GetFetchNames` method in CinnGraphSymbolization. * Use unordered_set instead of vector as the type of fetch_var_names. * Reuse the definition of kCompilationKey. * Use CompileOptions to set fetch_var_ids. * Update the argument passing of GraphCompiler.Build. * Fix some bugs in CinnGraphSymbolization::GetFetchIds.
-
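Committed by zhangkaihuo
topk has two implementations: one based on cub and one hand-written kernel. The cub path obtains the top k by sorting, and measurements across multiple data sets show that it outperforms the hand-written kernel only when input_width >= 128 and k exceeds 75% of input_width.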
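The selection rule described in this commit message can be sketched in Python as follows. This is only an illustration of the stated heuristic, not the code from the commit: the function name `prefer_cub_sort` is hypothetical, and treating "exceeds 75%" as a strict inequality is an assumption.

```python
def prefer_cub_sort(input_width: int, k: int) -> bool:
    """Return True when the sort-based (cub) path is expected to win.

    Hypothetical helper: mirrors the rule in the commit message, where the
    cub path is chosen only for wide rows (input_width >= 128) and a large
    k (more than 75% of input_width). The strict '>' is an assumption.
    """
    # Integer arithmetic avoids floating-point rounding: k > 0.75 * width
    # is equivalent to 4 * k > 3 * width.
    return input_width >= 128 and 4 * k > 3 * input_width


# Wide row, large k: the sort-based path is preferred.
print(prefer_cub_sort(128, 100))
# Narrow row: the hand-written kernel is preferred regardless of k.
print(prefer_cub_sort(64, 64))
```

Everywhere outside that region, the rule falls back to the hand-written kernel, which matches the message's claim that sorting the whole row pays off only when most of the row is requested.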
-
- 17 Nov, 2021 20 commits
-
-
Committed by Sławomir Siwek
* Use oneDNN reorder instead of custom one * Fix whitespace typo * Fix Code format error * Incorporating feedback * Remove unnecessary reorder * Support GIOHW format * Fix code format error
-
Committed by Leo Chen
* init * add feed ops in python side * import LRScheduler * update_feed * refine code format
-
Committed by piotrekobiIntel
* Change first batch of mkldnn headers and namespace names to dnnl * Revert changes to tensor.h, which require approval * Format changes with pre-commit * Add int32 tests * Fix int32 tests and call GetDataFromTensor for int32 * Fix test
-
Committed by niuliling123
* Modify reduce_op.op.h for xpu2 with kernel primitive api
-
Committed by piotrekobiIntel
* upgrade oneDNN to v2.4-rc * Removed failing test * Revert "Removed failing test" This reverts commit 60e70e717fac2c86b7beb24dfa1343a5804ea455. * Remove most tests for debugging purposes * Update hash to oneDNN 2.4 * Revert test change * Update oneDNN to 2.4.2 * Update oneDNN to 2.4.3 * Change oneDNN version to 2.3 for Jenkins test * Revert "Change oneDNN version to 2.3 for Jenkins test" This reverts commit 0b176defc3b63f65dd0ba85873a018534f287000. * Update oneDNN to 2.4.4 * Change version of oneDNN to 2.3 for new Jenkins test * Revert "Change version of oneDNN to 2.3 for new Jenkins test" This reverts commit e005a0f78f2b41cdcf4d7de3a21df7f910b78268.
-
Committed by Aurelius84
-
Committed by 石晓伟
-
Committed by Sing_chan
-
Committed by YUNSHEN XIE
* remove test_hapi_hub from mac * fix format error
-
Committed by Chen Weihang
* add slice api impl of Tensor * fix test slice error
-
Committed by zhaocaibei123
-
Committed by zmx
* fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * refactor heter trainer. test=develop * fix. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix ut. test=develop * fix ut. test=develop * fix ut. test=develop
-
Committed by danleifeng
-
Committed by zhangchunle
-
Committed by Leo Chen
* copy beta pow to same place when skip_update=1 * fix xpu
-
Committed by zyfncg
-
Committed by LiYuRio
-
Committed by WangXi
-
Committed by Tongxin Bai
* [Einsum] correct output dimension errors due to single element tensors. * [Einsum] format polish.
-
Committed by xiongkun
* add * add BuildOperatorDependences * fix bug * add unittest for write after write * fix merge bug * fix
-
- 16 Nov, 2021 8 commits
-
-
Committed by Chen Weihang
-
Committed by arlesniak
* Added BF16 Pool2d grad * upstream pulled * fix for CI * fixes after review
-
Committed by danleifeng
-
Committed by Weilong Wu
-
Committed by Zeng Jinle
-
Committed by Weilong Wu
-
Committed by YuanRisheng
* reshape kernel refactor * fix compile bugs when run ci * support xpu for reshape * fix bugs when run unittest in kunlun ci * fix compile bugs when run kunlun * polish code according to suggestion * add api and unit test for reshape
-
Committed by zhangkaihuo
Add pure fp16 support for fused transformer.
-