- 01 7月, 2022 1 次提交
-
-
由 limingshu 提交于
* 2nd part of transpose update * add switch_auto_tune option. * add some changes according to Ci * refine the structure of auto_tune_base. * merge develop changes * reset the switch_set_range and change unittest of transpose auto-tune * change the kernel auto-tune logits
-
- 26 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 15 6月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
* Optimize prod's python implementation for dygraph. * Change key_dim to head_dim. * Add comment in unittest. * Disable TF32 in unittest.
-
- 08 6月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
* Polish codes and memory usage for fused_gate_attention. * Fix wrong reduce_dims in fused_gate_attention when computing gradient of nonbatched_bias.
-
- 30 5月, 2022 1 次提交
-
-
由 crystal 提交于
-