1. 30 10月, 2021 1 次提交
    • H
      Refactor oneflow.Size (#6645) · 4be2b0a3
      Houjiang Chen 提交于
      * Refactor oneflow.Size
      
      * refine
      
      * add pybind11 caster
      
      * Support Shape cast
      
      * refine
      
      * fix size index
      
      * include size header if need export C++ Shape to Python.
      4be2b0a3
  2. 29 10月, 2021 1 次提交
  3. 28 10月, 2021 2 次提交
    • S
      IR round trip pass (#4138) · 6af4e70b
      Shenghang Tsai 提交于
      * add todo
      
      * refine
      
      * add attr
      
      * refine
      
      * refine
      
      * add todo
      
      * refine
      
      * add alias c1 for check-oneflow
      
      * fix
      
      * update scripts
      
      * refine
      
      * fix single client env reinit
      
      * add attr
      
      * save and pass mlir module
      
      * fix
      
      * restore module in kernel
      
      * lower in kernel
      
      * refien
      
      * add scf to std
      
      * update lit
      
      * fmt
      
      * add all passes
      
      * add alisas
      
      * refein
      
      * refein
      
      * add check
      
      * fix pass order
      
      * add TODO
      
      * refein
      
      * create jit exe
      
      * refein
      
      * fix arity
      
      * add check and rpint err
      
      * refein
      
      * refein
      
      * refein
      
      * refein
      
      * refein
      
      * refein
      
      * emiit c
      
      * working
      
      * revert
      
      * add err print
      
      * e2e works
      
      * refein
      
      * refein
      
      * refein
      
      * use STATIC_SWITCH_FUNC
      
      * add log
      
      * rename
      
      * use invoke packed
      
      * refein
      
      * add todo
      
      * refein
      
      * rm log
      
      * fix
      
      * refein
      
      * rm
      
      * refein
      
      * add scf to gpu
      
      * add cmake flag for cuda runner
      
      * add CMAKE_CUDA_COMPILER
      
      * refine
      
      * refien
      
      * register gpu kernel
      
      * refein
      
      * add gpu passes
      
      * refein
      
      * add
      
      * refine
      
      * add ptx to cubin pass
      
      * produce cubin
      
      * add gpu to llvm pass
      
      * refein
      
      * add log
      
      * refien
      
      * link mlir cuda runtime lib
      
      * add note
      
      * make gpu runner available in file check
      
      * rm unused
      
      * add to prevent break
      
      * fix with cuda
      
      * edit mlir by hand to have it run on cuda
      
      * rm useless
      
      * add todo
      
      * upgrade llvm
      
      * refein m,irror scripts
      
      * fix for llvm upgrade
      
      * refein cmake
      
      * fix
      
      * fix for llvm upgrade
      
      * remove unused headers
      
      * refeine
      
      * refein
      
      * refactor
      
      * add
      
      * refine
      
      * refine
      
      * cmake first class cuda support
      
      * refine
      
      * refine
      
      * refein
      
      * refine
      
      * refine
      
      * refine
      
      * refein
      
      * add todo
      
      * refine
      
      * pass shared lib path from py
      
      * prevent redef ONEFLOW_CMAKE_BUILD_TYPE
      
      * refine msg
      
      * fix fmt
      
      * fix fmt
      
      * fix fmt
      
      * refine
      
      * refueb
      
      * fix
      
      * refactor jit function outline
      
      * refein
      
      * rm debug log
      
      * rm unnecessary erase
      
      * use 75
      
      * refein
      
      * add allowFoldingUnitDimReshapes
      
      * refine
      
      * Outline JIT func (#6542)
      
      * check in pass impl
      
      * add test
      
      * check in changes
      
      * add todo
      
      * extract func to create attrs
      
      * refine
      
      * refine and mv bert
      
      * refein LLVM_EXTERNAL_LIT
      
      * refine log user_op::AttrValueUtil::ToCppAttrValue
      
      * fix for nd_sbp
      
      * refine log
      
      * fix warnings
      
      * fix
      
      * leverage input_order and output_order
      
      * save lbn_segment_keys as input output order
      
      * refine
      
      * refein
      
      * add CUDATOOLKIT_BIN_ROOT
      
      * finish todo
      
      * finish todo
      
      * finish todo
      
      * add matmul
      
      * rm repetitive code
      
      * add log
      
      * add unary
      
      * add gather
      
      * refine and add gelu
      
      * fix loc
      
      * add mlir conv op (#6559)
      
      * add mlir conv op
      
      * fix conv2d tabelgen bug
      
      * fix merge compile error
      
      * fix comments
      
      * Update mlir-cuda-75.cmake
      
      * add mlir resnet50 test
      
      * add SI32ArrayAttr
      Co-authored-by: NShenghang Tsai <jackalcooper@gmail.com>
      
      * backport refactoring of translation
      
      * Add resnet50 mlir dialect part ops (#6607)
      
      * add scalar math ops tablegen
      
      * add pool ops
      
      * add bias_add op
      
      * fix comment
      
      * fix comment
      
      * code format
      
      * add reshape op
      
      * add reduce ops and restruct scalar math ops
      
      * fix bug
      
      * fix typo
      
      * address review
      
      * address review
      
      * rm loggin
      
      * address review
      
      * rm logging
      
      * backport variable rename
      
      * add flag ONEFLOW_MLIR_ENABLE_FUSERS
      Co-authored-by: NXiaoyu Zhang <35585791+BBuf@users.noreply.github.com>
      6af4e70b
    • Y
      Feat autograd function impl (#6593) · e96a5259
      Yinggang Wang 提交于
      * feat(autograd.Function): add base class define
      
      * format
      
      * feat(autograd.Function): cache FunctionOpExpr in AutogradFunctionBase
                               and pass autograd.Function name to cpp
      
      * feat(autograd.Function): wrapper PyFunction to FType
      
      * fix(autograd.Function): fix wrapper function capture bug
      
      * feat(autograd.Function): support autograd.Function backward
      
      * feat(autograd.Function): refine apply return value
      
      * fix(autograd.Function): fix autograd.Function name bug
      
      * feat(autograd.Function): refine ctx python api
      
      * feat(*): refine apply interface
      
      * test(autograd.Function): fix ctx interface and add test
      
      * feat(autograd.Function): support mark_non_differentiable
      
      * align ctx.saved_tensors interface
      
      * docs(autograd.Function): export documentation
      
      * refine function names
      
      * refine interface
      
      * use py::args instead of py::object
      
      * refine code
      
      * fix(*): fix `func_name` variable conflict with CHECK_JUST
      
      * feat(autograd.Function): support static call
      
      * docs(autograd.Function): update documentation
      
      * refine code
      
      * add JUST
      Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
      e96a5259
  4. 26 10月, 2021 1 次提交
  5. 23 10月, 2021 1 次提交
  6. 15 10月, 2021 2 次提交
  7. 11 10月, 2021 1 次提交
  8. 09 10月, 2021 1 次提交
  9. 08 10月, 2021 1 次提交
  10. 30 9月, 2021 1 次提交
  11. 26 9月, 2021 1 次提交
  12. 24 9月, 2021 1 次提交
  13. 23 9月, 2021 1 次提交
  14. 22 9月, 2021 1 次提交
  15. 19 9月, 2021 1 次提交
  16. 18 9月, 2021 1 次提交
  17. 17 9月, 2021 1 次提交
  18. 15 9月, 2021 2 次提交
  19. 14 9月, 2021 1 次提交
  20. 13 9月, 2021 1 次提交
  21. 10 9月, 2021 2 次提交
  22. 09 9月, 2021 1 次提交
  23. 08 9月, 2021 1 次提交
  24. 05 9月, 2021 1 次提交
    • X
      transfer python function to c++ method (#6114) · 2c81c907
      Xiaoyu Zhang 提交于
      * Add tensor yaml, support export tensor functional api.
      
      * refine
      
      * Remove packed functor signature
      
      * remove unused file
      
      * Refine
      
      * refine
      
      * add activation op import
      
      * reinit oneflow init.py
      
      * add oneflow abs and exp
      
      * add oneflow abs and exp
      
      * add acos
      
      * add arccosh
      
      * add more op
      
      * add more ops
      
      * add more op
      
      * add more ops
      
      * add log1p
      
      * add more smaples
      
      * add more ops
      
      * add more ops
      
      * add more ops
      
      * add more ops
      
      * Complete tensor functional apis.
      
      * Fix pybind call
      
      * add more ops
      
      * add ops done
      
      * Add target of_functional_tensor_obj
      
      * Disable throw visibility warnings
      
      * fix target link
      
      * fix
      
      * fix incorrect use of flow.Tensor.
      
      * Fix error merge
      
      * fix
      
      * fix add unittest
      
      * refine
      
      * refine
      
      * fix
      
      * fix
      
      * add tensor doc
      
      * auto format by CI
      
      * refine
      
      * Fix
      
      * Add doc for python function
      
      * refine
      
      * add tensor method docstring
      
      * fix some bug
      
      * fix docs bug
      
      * Fix
      
      * auto format by CI
      
      * Tensor->tensor
      
      * Tensor->tensor
      
      * refine Tensor->tensor
      
      * fix
      
      * fix
      
      * fix
      
      * fix conflict
      
      * fix bug
      
      * fix ci bug
      
      * fix
      
      * delete diag op
      
      * fix conflict
      
      * Fix segment
      
      * fix
      
      * merge
      
      * merge
      
      * fix docstr bug
      Co-authored-by: Nhjchen2 <chenhoujiangcug@gmail.com>
      Co-authored-by: Noneflow-ci-bot <ci-bot@oneflow.org>
      Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
      2c81c907
  25. 04 9月, 2021 2 次提交
    • H
      Add tensor yaml, support export tensor functional api. (#6099) · bf2f7e9b
      Houjiang Chen 提交于
      * Add tensor yaml, support export tensor functional api.
      
      * refine
      
      * Remove packed functor signature
      
      * remove unused file
      
      * Refine
      
      * refine
      
      * Complete tensor functional apis.
      
      * Fix pybind call
      
      * Add target of_functional_tensor_obj
      
      * Disable throw visibility warnings
      
      * fix target link
      
      * fix
      
      * fix incorrect use of flow.Tensor.
      
      * Fix error merge
      
      * fix
      
      * fix add unittest
      
      * refine
      
      * refine
      
      * fix
      
      * fix
      
      * add tensor doc
      
      * auto format by CI
      
      * refine
      
      * Fix
      
      * refine
      
      * Fix
      
      * auto format by CI
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * Fix segment
      
      * fix
      
      * fix graph test
      Co-authored-by: Noneflow-ci-bot <ci-bot@oneflow.org>
      Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
      bf2f7e9b
    • G
      Add dtype bfloat16 (#5304) · 4aab561f
      guo ran 提交于
      * add data_type bfloat16
      
      * ifedf cu11
      
      * add cuda_pseudo_bfloat16
      
      * run unittes if cuda_version > 11000
      
      * refine
      
      * merge master
      
      * merge master
      
      * refine
      4aab561f
  26. 03 9月, 2021 2 次提交
    • L
      support flow.cuda.is_available (#6124) · ae7abaa3
      Luyang 提交于
      * support flow.cuda.is_available
      
      * export api docs
      
      * refine
      
      * fix docs
      
      * format
      
      * fix comments
      
      * export cuda.device_count
      
      * auto format by CI
      
      * Update python/oneflow/cuda/__init__.py
      Co-authored-by: Ndaquexian <daquexian566@gmail.com>
      
      * refine
      
      * refine
      Co-authored-by: Noneflow-ci-bot <ci-bot@oneflow.org>
      Co-authored-by: Ndaquexian <daquexian566@gmail.com>
      Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
      ae7abaa3
    • L
      Decompose nd sbp boxing (#5800) · 9c464a31
      Li Xinqi 提交于
      * GetBroadcastGroup
      
      * fix comment typo.
      
      * broadcast shape and dtype
      
      * 1) rm THREAD_LOCAL_CACHED; 2) fix bugs in ThreadLocal
      
      * fix wrong use of LocalRank
      
      * 1) a decorator for disabling recursive boxing call; 2) a decorator for checking consistent tensor meta.
      
      * don't set consistent_id when recursively calling eager consistent op interpreter.
      
      * decompose nd_sbp boxing
      
      * disable checking consistent tensor meta recursively.
      
      * GetDecomposableEquivalent
      
      * fix a unittest case bug
      
      * fix a bug in unittest
      
      * fix compiler complain
      
      * add unitests for CalcDecomposableEquivalentShapeAndNdSbpPair
      
      * InitNdSbpValidTransformationAxisSequence
      
      * DecomposeIntoNaiveTransformations
      
      * fix compiler complains
      
      * move several unitests in parallel_desc_test.cpp into placement_sbp_util_test.cpp
      
      * abstract_consistent_to_consistent_op_expr
      
      * fix compiler complaint
      
      * refactor consistent-to-consistent eager consisitent op interpreter
      
      * fix compiler complaint
      
      * refactor ConsistentToConsistentOpExpr
      
      * lazy interpreter (#5903)
      
      * fix bugs about consistent_id
      
      * refactor functional::ToConsistent
      
      * refactor GetNdSbp
      
      * fix compiler complaints
      
      * upgrade gtest and fix static check error
      
      * update head file index
      
      * fix bug
      
      * modify path of gtest lib
      
      * refactor NaiveNdSbpBoxingInterpreter to BoxingExpr(symmetric-nd-sbp-to-nd-sbp)
      
      * fix compiler complaints
      
      * Update gmock_headers.txt
      
      * Update gtest_headers.txt
      
      * fix bug about disable checking consistent meta in local to consistent functor
      
      * fix include bug
      Co-authored-by: qq_22305325's avatarclackhan <han_binbin@163.com>
      Co-authored-by: Nleaves-zwx <kunta0932@gmail.com>
      Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
      Co-authored-by: Nliufengwei <2472937968@qq.com>
      Co-authored-by: NTwice <i@twice.moe>
      Co-authored-by: NShenghang Tsai <jackalcooper@gmail.com>
      9c464a31
  27. 02 9月, 2021 1 次提交
  28. 01 9月, 2021 2 次提交
  29. 30 8月, 2021 1 次提交
  30. 29 8月, 2021 2 次提交
  31. 28 8月, 2021 2 次提交