- 02 Nov, 2021 3 commits
-
-
Committed by jackalcooper
-
Committed by jackalcooper
-
Committed by jackalcooper
-
- 01 Nov, 2021 9 commits
-
-
Committed by jackalcooper
-
Committed by jackalcooper
-
Committed by jackalcooper
-
Committed by jackalcooper
-
Committed by jackalcooper
-
Committed by jackalcooper
-
Committed by jackalcooper
-
Committed by jackalcooper
-
Committed by jackalcooper
-
- 30 Oct, 2021 4 commits
-
-
Committed by jackalcooper
-
Committed by jackalcooper
-
Committed by jackalcooper
-
Committed by jackalcooper
-
- 29 Oct, 2021 7 commits
-
-
Committed by jackalcooper
-
Committed by jackalcooper
-
Committed by jackalcooper
-
Committed by jackalcooper
-
Committed by jackalcooper
-
Committed by jackalcooper
-
Committed by jackalcooper
-
- 28 Oct, 2021 17 commits
-
-
Committed by jackalcooper
-
Committed by jackalcooper
-
Committed by jackalcooper
-
-
Committed by Shenghang Tsai
* add todo * refine * add attr * refine * refine * add todo * refine * add alias c1 for check-oneflow * fix * update scripts * refine * fix single client env reinit * add attr * save and pass mlir module * fix * restore module in kernel * lower in kernel * refien * add scf to std * update lit * fmt * add all passes * add alisas * refein * refein * add check * fix pass order * add TODO * refein * create jit exe * refein * fix arity * add check and rpint err * refein * refein * refein * refein * refein * refein * emiit c * working * revert * add err print * e2e works * refein * refein * refein * use STATIC_SWITCH_FUNC * add log * rename * use invoke packed * refein * add todo * refein * rm log * fix * refein * rm * refein * add scf to gpu * add cmake flag for cuda runner * add CMAKE_CUDA_COMPILER * refine * refien * register gpu kernel * refein * add gpu passes * refein * add * refine * add ptx to cubin pass * produce cubin * add gpu to llvm pass * refein * add log * refien * link mlir cuda runtime lib * add note * make gpu runner available in file check * rm unused * add to prevent break * fix with cuda * edit mlir by hand to have it run on cuda * rm useless * add todo * upgrade llvm * refein m,irror scripts * fix for llvm upgrade * refein cmake * fix * fix for llvm upgrade * remove unused headers * refeine * refein * refactor * add * refine * refine * cmake first class cuda support * refine * refine * refein * refine * refine * refine * refein * add todo * refine * pass shared lib path from py * prevent redef ONEFLOW_CMAKE_BUILD_TYPE * refine msg * fix fmt * fix fmt * fix fmt * refine * refueb * fix * refactor jit function outline * refein * rm debug log * rm unnecessary erase * use 75 * refein * add allowFoldingUnitDimReshapes * refine * Outline JIT func (#6542) * check in pass impl * add test * check in changes * add todo * extract func to create attrs * refine * refine and mv bert * refein LLVM_EXTERNAL_LIT * refine log user_op::AttrValueUtil::ToCppAttrValue * 
fix for nd_sbp * refine log * fix warnings * fix * leverage input_order and output_order * save lbn_segment_keys as input output order * refine * refein * add CUDATOOLKIT_BIN_ROOT * finish todo * finish todo * finish todo * add matmul * rm repetitive code * add log * add unary * add gather * refine and add gelu * fix loc * add mlir conv op (#6559) * add mlir conv op * fix conv2d tabelgen bug * fix merge compile error * fix comments * Update mlir-cuda-75.cmake * add mlir resnet50 test * add SI32ArrayAttr Co-authored-by: Shenghang Tsai <jackalcooper@gmail.com> * backport refactoring of translation * Add resnet50 mlir dialect part ops (#6607) * add scalar math ops tablegen * add pool ops * add bias_add op * fix comment * fix comment * code format * add reshape op * add reduce ops and restruct scalar math ops * fix bug * fix typo * address review * address review * rm loggin * address review * rm logging * backport variable rename * add flag ONEFLOW_MLIR_ENABLE_FUSERS Co-authored-by: Xiaoyu Zhang <35585791+BBuf@users.noreply.github.com>
-
Committed by oneflow-ci-bot
-
Committed by guo ran
Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
-
-
-
Committed by jackalcooper
-
Committed by Shenghang Tsai
* use git to clean dir * rm useless to trigger CI * trigger CI * refine * refine * refine * refine * fix typo PopulateOpAttribute
-
Committed by liufengwei0103
Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by jackalcooper
-
Committed by jackalcooper
-
Committed by Luyang
-
Committed by Yinggang Wang
* feat(autograd.Function): add base class define * format * feat(autograd.Function): cache FunctionOpExpr in AutogradFunctionBase and pass autograd.Function name to cpp * feat(autograd.Function): wrapper PyFunction to FType * fix(autograd.Function): fix wrapper function capture bug * feat(autograd.Function): support autograd.Function backward * feat(autograd.Function): refine apply return value * fix(autograd.Function): fix autograd.Function name bug * feat(autograd.Function): refine ctx python api * feat(*): refine apply interface * test(autograd.Function): fix ctx interface and add test * feat(autograd.Function): support mark_non_differentiable * align ctx.saved_tensors interface * docs(autograd.Function): export documentation * refine function names * refine interface * use py::args instead of py::object * refine code * fix(*): fix `func_name` variable conflict with CHECK_JUST * feat(autograd.Function): support static call * docs(autograd.Function): update documentation * refine code * add JUST Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-