提交 · 81086cff59d1bebaf318dd04d785591c164666b9 · Oneflow-Inc / oneflow

27 10月, 2021 1 次提交

Matmul kernels use primitive (#6589) · 81086cff

由 Juncheng 提交于 10月 27, 2021

* Matmul kernels use primitive

* refine

* fix
Co-authored-by: Nguo ran <360112263@qq.com>
Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

81086cff

26 10月, 2021 2 次提交

J

fix build permute_test.cpp (#6608) · 686ac9e8
由 Juncheng 提交于 10月 26, 2021

686ac9e8

Dev Batch Permute (#6441) · bca2e098

由 ZZK 提交于 10月 26, 2021

* dev torch style permute kernel

* Refine

* fix batch permute launch condition

* fix batch permute dispatch logic

* remove redundant header file

* simplified check logic

* use permute primitives in transpose kernels

* fix batch permute logic and avoid mod

* remove redundant templates

* fix grid step

* add grid for loop to avoid the elementnum is too large

* fix bug when hw is not divided by tile size

* refine format

* add a copy kernel as a baseline

* remove annotation

* add copy kernel

* add sync

* use batch permute for profile

* add copy tile baseline

* simplify params for copy kernel

* add slow copy kernel

* use mul to instead mod and remove copy

* use movement size = 4 when h w is modify by 2

* Add temp process for half2

* add half2 specialized kernel

* remove redundant license

* simplified code

* fix format

* fix comment

* fix comment

* use bad for loop condition

* merge half2 in load

* fix bad for loop in batch permute

* refine

* use align storage

* refine

* fix comment

* fix comment

* fix format

* add const and remove redundant header file

* remove register macro

* refine cuda code

* fix guoran comment

* fix format

* fix some details

* remove cuda graph

* fix for 0d tensor
Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

bca2e098

23 10月, 2021 1 次提交

Fix SimplifyPermutation (#6600) · 3bcd09da

由 Juncheng 提交于 10月 23, 2021

* Fix SimplifyPermutation

* fix

* fix typo

* fix

* add test

* fix init
Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

3bcd09da

22 10月, 2021 1 次提交

Interface primitive::softmax (#6594) · 8d35f739

由 guo ran 提交于 10月 22, 2021

* interface primitive::softmax

* refine
Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

8d35f739

21 10月, 2021 4 次提交

J
Interface primitive::ElementwiseUnary (#6586) · 857e3f0b
由 Juncheng 提交于 10月 21, 2021
```
Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
```
857e3f0b

Remove primitive::CudaGraphSupport (#6584) · ba4ae534

由 Juncheng 提交于 10月 21, 2021

* Remove primitiveCudaGraphSupport

* fix
Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

ba4ae534

Elementwise function Apply2 (#6513) · ced4015b

由 Juncheng 提交于 10月 21, 2021

* Apply2

* fix

* CastFunctor::Apply2
Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

ced4015b

Matmul Primitive (#6571) · 0eca806d

由 Juncheng 提交于 10月 21, 2021

* Matmul Primitive

* refine

* fix cublasComputeType_t
Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

0eca806d

19 10月, 2021 1 次提交

Fix BroadcastMatmulFactory (#6550) · bb4989ed

由 Juncheng 提交于 10月 19, 2021

* Fix BroadcastMatmulFactory

* auto format by CI

* int64_t*=> const int64_t*
Co-authored-by: Noneflow-ci-bot <ci-bot@oneflow.org>
Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

bb4989ed

15 10月, 2021 1 次提交

MatmulPrimitive interface (#6462) · ab631be9

由 Juncheng 提交于 10月 15, 2021

* matmul api

* BlasTransposeType

* BatchMatmul/BroadcastMatmul

* fix

* fix

* enum=>enum class
Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

ab631be9

08 10月, 2021 1 次提交

Primitive based add kernel (#6332) · 0970c73d

由 Juncheng 提交于 10月 08, 2021

* Primitive based add kernel

* refine

* refine

* include

* create primitive in compute

* fix sole input
Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

0970c73d

01 10月, 2021 1 次提交

Primitive based cast kernel (#6271) · 97307255

由 Juncheng 提交于 10月 01, 2021

* Primitive based cast kernel

* refine

* refine

* include

* create primitve in compute

* typo
Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

97307255

30 9月, 2021 2 次提交

Add Scalar::Value (#6460) · ce150259

由 Juncheng 提交于 9月 30, 2021

* Add Scalar::Value

* As=>Value
Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

ce150259

add memory copy nd primitive (#6384) · 30e5b50f

由 guo ran 提交于 9月 30, 2021

* add memory_copy_nd primitive

* rm log

* reduce dim

* auto format by CI

* refine

* refine

* fix

* merge master

* refine

* refine
Co-authored-by: Noneflow-ci-bot <ci-bot@oneflow.org>
Co-authored-by: NJuncheng <liujuncheng1022@gmail.com>

30e5b50f

27 9月, 2021 1 次提交

PermutePrimitive (#6390) · 30bca281

由 Juncheng 提交于 9月 27, 2021

* PermutePrimitive

* refine

* refine

* Refine movement size (#6417)

* refine movement size

* fix

* refine

* refine

30bca281

23 9月, 2021 1 次提交

Add primitive/include (#6379) · 6701db43

由 Juncheng 提交于 9月 23, 2021

Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

6701db43

18 9月, 2021 2 次提交

Fill primitive (#6347) · 4bfa17d9

由 Juncheng 提交于 9月 18, 2021

* Fill primitive

* Set=>Fill

* 1024=>1023

* Check CUDA_VERSION
Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

4bfa17d9

Refine Lamb (#6334) · 840bbe91

由 Juncheng 提交于 9月 18, 2021

* Refine Lamb

* virtual=>override
Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

840bbe91

17 9月, 2021 1 次提交
- J
  Add primitive (#6312) · 06dc375f
  由 Juncheng 提交于 9月 17, 2021
```
* Add primitive

* refine
```
  06dc375f
13 9月, 2021 1 次提交

CudaGraphPrimitive (#6264) · bc2002a2

由 Juncheng 提交于 9月 13, 2021

Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

bc2002a2

11 9月, 2021 1 次提交

Add cast primitive (#6234) · 86f77141

由 Juncheng 提交于 9月 11, 2021

* Add cast primitive

* fix
Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

86f77141

09 9月, 2021 1 次提交

Add memset primitive (#6218) · 7382e4ac

由 Juncheng 提交于 9月 09, 2021

Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

7382e4ac

08 9月, 2021 1 次提交
- J
  Primitive based copy task node (#6195) · 5c667f5c
  由 Juncheng 提交于 9月 08, 2021
```
Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
```
  5c667f5c
07 9月, 2021 1 次提交

Primitive (#6183) · ba36de99

由 Juncheng 提交于 9月 07, 2021

* Add Primitive

* #ifdef WITH_CUDA
Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

ba36de99

Oneflow-Inc / oneflow 上一次同步 2 年多

Oneflow-Inc / oneflow
上一次同步 2 年多