- 16 Jun, 2022 8 commits
-
-
Committed by zhoutianzi666
* add squeeze2
* add squeeze
* add squeeze2,unsqueeze2
* merge develop
* fix format
* add conditions for squeeze2 and unsqueeze in op_teller
* merge develop
* add squeeze unsqueeze
* add squeeze unsqueeze
* add squeeze unsqueeze
* remove unsqueeze2_eltwise_fuse_pass
* add squeeze/unsqueeze
-
Committed by 津
* add unary
-
Committed by joanna.wozna.intel
-
Committed by jakpiase
-
Committed by zhangyikun02
-
Committed by Ruibiao Chen
* Support disable GC for some vars in standalone executor
* Setting skip_gc_vars in interprecore construction
-
Committed by Leo Chen
* fix xpu kp compilation
* add depends
-
Committed by Leo Chen
* lazy creating work queue
* fix dry_run
-
- 15 Jun, 2022 10 commits
-
-
Committed by huzhiqiang
-
Committed by Yiqun Liu
* Optimize prod's python implementation for dygraph.
* Change key_dim to head_dim.
* Add comment in unittest.
* Disable TF32 in unittest.
-
Committed by fwenguang
-
Committed by fwenguang
-
Committed by fwenguang
-
Committed by Guoxia Wang
-
Committed by zyfncg
* rename yaml file
* fix merge conflict
* fix infrt
-
Committed by zhouweiwei2014
* add some kernel(csr*dense->csr, dense*dense->csr) of SparseTensor matmul
* fix CI
* fix CI
* fix comment
* fix comment
-
Committed by Yiqun Liu
Use int64_t in GetGpuLaunchConfig1D and ElementwiseKernel as index type to support large tensor. (#43506)
* Change some data type from int to int64_t in GetGpuLaunchConfig1D to support large tensor.
* Use int64_t in ElementwiseKernel as index type to support large tensor.
-
Committed by Ruibiao Chen
* Refactor port.h
* Remove some unnecessary code
* Fix CI errors
-
- 14 Jun, 2022 16 commits
-
-
Committed by Jiabin Yang
* fix starganv2
* fix starganv2 stop_gradient end error
* fix edvr_starganv2
* fix mul kernel to fix optional ddx
* fix typo
-
Committed by Ruibiao Chen
* Support sequential run for standalone executor
* Add UTs
* Fix test_standalone_multiply_write
* Remove unnecessary UTs
-
Committed by cambriconhsq
-
Committed by zhaoying9105
* [MLU]: add elementwise_max mlu kernel
* [MLU]: add int32 support for elementwise maxk MLU kernel
-
Committed by zhaoying9105
-
Committed by tianshuo78520a
-
Committed by sneaxiy
-
Committed by Yuang Liu
-
Committed by Sing_chan
* open inference_ut;test=windows_ci_inference
* inference_ut need onnx;test=windows_ci_inference
* disable trt_split_converter_test; use higher parallel level
* too high parallel will cause ut timeout
-
Committed by Zhang Jun
-
Committed by xiongkun
* change logic for optimize
* modifty
* optimize the backward speed of EinsumOp
* add cache optimizer for einsum op
* EinsumOp: fix new dygraph mode error
* fix bug
* change Cache->InnerCache
* fix code
* fix
* add nan inf utils for einsum op
* add as_extra
* memory optimizer for einsum
* update code
-
Committed by Sing_chan
-
Committed by Sing_chan
-
Committed by Wilber
* cmake-lint
* update
-
Committed by zyfncg
-
Committed by Jiabin Yang
* fix custom op error
* fix code error
-
- 13 Jun, 2022 6 commits
-
-
Committed by qipengh
-
Committed by Chenxiao Niu
-
Committed by chenjian
* add serialization for new field in event node
* fix a bug
-
Committed by zhoutianzi666
-
Committed by 津
-
Committed by piotrekobi
-