- 16 Aug, 2021 6 commits
-
-
Committed by Xiaoyu Zhang
* restruct logsoftmax and abs test * add hardtanh test * refine batchnorm autotest * add meshgrid autotest * add pow autotest * add stack autotest * delete prelu useless code * change Stack Module Test Api * fix comments * fix softmax bug * fix sign bug * fix sign * auto format by CI Co-authored-by: oneflow-ci-bot <ci-bot@oneflow.org> Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by l702572275
* add fmod * add grad * fotmat and delete comment * delete comment * delete comment * delete line * rename and format * merge master * format * fix error , add data type * add int8 ,delete f16 * modified mod example * auto format by CI Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com> Co-authored-by: oneflow-ci-bot <ci-bot@oneflow.org>
-
Committed by tingkuanpei
* Support combined_margin_loss op in flow.nn.modules * Follow review comment to modify * Follow review comment to modify * auto format by CI Co-authored-by: Yao Chi <later@usopp.net> Co-authored-by: oneflow-ci-bot <ci-bot@oneflow.org> Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by GehangZhang
* fix conv1d conv2d conv3d useless args * fix showing bugs * add args for unsample * fix torch.float Co-authored-by: Yao Chi <later@usopp.net> Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by Zailiang
* functional.onehot added * functional api yaml updated * api yaml updated * add one_hot functional API * add Module * amend one_hot testcase * amend docstring * delete Module * amend review question * amend review question * amend flow.nn.functional.one_hot * amend functional init.py * delete one_hot import code * amend valueerror * auto format by CI * amend numclasses is -1 * add testcase * update test_one_hot * auto format by CI * amend one_hot param * amend docsting * amend onehot.py docstring * amend on_value and off_value * auto format by CI * amend docsting error * remove onehot * auto format by CI Co-authored-by: tangnana925 <85614052+tangnana925@users.noreply.github.com> Co-authored-by: tangnana <tnn_personal@163.com> Co-authored-by: oneflow-ci-bot <ci-bot@oneflow.org> Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com> Co-authored-by: MARD1NO <359521840@qq.com>
-
Committed by Houjiang Chen
* abstract InferSourceOpParallelDistribution * support specified device, add consistent arange. * format * Fix arange python api and unittest * Broadcast parallel default * Fix wrong merge * Fix error merge Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
- 15 Aug, 2021 10 commits
-
-
Committed by Juncheng
* Add cmake option USE_SYSTEM_NCCL nccl target * rm mark_as_advanced
-
Committed by Twice
Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by Shijie
* fix gather kernel 0 shape * recover module test Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by Li Xinqi
Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by ZZK
* add logical scalar kernel * add logical scalar op register * add functional api yaml * modify math functor * fix * reuse functor * fix * modify equal * modify greater * modify greater equal * modify less equal * modify less than * add not equal * modify not equal * fix format * remove partial sum * add newline * reuse base class * fix bin_op to binary_op * modify to Scalar * first restruct and anotate cuda * modify to no grad user op * restruct code and add dtype * export to pybind * remove redundant logic in python * bind python as false * remove annotation * fix dtype * support scalar in input or output * fix * Add magic method * add docs * auto format by CI * fix randn test * modify back * small fix * fix 0d tensor * auto format by CI * fix 0d test * auto format by CI * fix ddp bug Signed-off-by: daquexian <daquexian566@gmail.com> * fix to use is * remove [0] in ddp.py Signed-off-by: daquexian <daquexian566@gmail.com> * fix unittest * fix format * fix wrong unittest * skip free eager test * fix to use is not none Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com> Co-authored-by: oneflow-ci-bot <ci-bot@oneflow.org> Co-authored-by: daquexian <daquexian566@gmail.com> Co-authored-by: cheng cheng <472491134@qq.com>
-
Committed by cheng cheng
* Fix BUG of LazyInterpret FreeEagerTensor memory shared with regst * remove note * remove debug
-
Committed by liufengwei0103
Co-authored-by: ZZK <42901638+MARD1NO@users.noreply.github.com>
-
Committed by Tianyu Zhao
* Rename `ParallelDistribution` to `NdSbp` * Rename `ParallelDistribution` to `NdSbp` * Rename `ParallelDistribution` to `NdSbp` * auto format by CI * Rename `ParallelDistribution` to `NdSbp` Co-authored-by: oneflow-ci-bot <ci-bot@oneflow.org> Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by Juncheng
Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by Shenghang Tsai
-
- 14 Aug, 2021 13 commits
-
-
Committed by Bowen Chen
* add flow.rand * update docstr * update docstr * add consistent_rand, add more tests * update random op * refine * refine, add range and int type to uniform_kernel * refine * refine * update doc * update doc * Refactor UniformDistribution * fix Co-authored-by: hjchen2 <chenhoujiangcug@gmail.com> Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by Li Xinqi
* SyncAccessBlobByCallback * refactor capture-by-reference to capture-by-value * refactor InstructionsBuilder::SyncAccessBlobByCallback Co-authored-by: Houjiang Chen <chenhoujiangcug@gmail.com> Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by cheng cheng
Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by Shenghang Tsai
* cmake first class cuda support * refine * refien * refine * refein * refein * refeine * refine * refein * refine * refien * refgine * refien * refein * refein * rm useless * refien * refein * also link cuda libs if build static * refein * refien * add * Revert "add" This reverts commit d9e67ad1. * fix * refeine * retine Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by Yinggang Wang
* feat(Tensor): support Tensor.__bool__() * test(Tensor): add tensor to bool test * docs(Tensor): refine is_nonzero document * format * fix(Tensor): fix Tensor.__bool___ bug * auto format by CI * fix(instancenorm): fix merge bug * fix(*): fix merge bugs Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com> Co-authored-by: oneflow-ci-bot <ci-bot@oneflow.org> Co-authored-by: cheng cheng <472491134@qq.com>
-
Committed by liufengwei0103
* refine code * refine code * optimize code * refine code * refine * back up * add tensor.to func * make of_format * remove to in pyTensor * sync gpu data * refine * refine * refine * refine * refine * refine * refine * refine * refine * backup * refine * rebase * check in gen py * merge master and fix bugs * address pr comments * eager boxing * address pr comments * fix b2p error * auto format by CI * remove boxing * export sbp * add tensor to_consistent * /minor fix * minor fix * refine * remove useless head file * Fix optional * remove to in tensor.cpp * update * Support symbol placement type in functional. * add sbp and sbp list arg * refine * use functional * refactor CastConsistentOpExpr * to_consistent(flow.B) backward * Cache op expr * add EagerNcclOpKernelState * refine * refine * refine * refine * refine * refine * minor fix * capture OpInterpContext * unimplemented apply * add GetNdSbp * add mutex * refine * merge EagerConsistentTensorImpl::NewWithPhyTensor and EagerConsistentTensorImpl::NewWithoutPhyTensor into EagerConsistentTensorImpl::New * rename functiona SyncData to SyncMetaAndData * fix function yml * refine * refine * refine collective boxing * make of_format * of_format * add to_local to pybind * refactor EagerBoxingInterpreter * minor fix * optimize CastParallelDistribution * add placement_sbp_util * minor fix * eager boxing backward * minor fix * sync shape and data when tensor_to_local * fix rpc_token bugs * fix p2s backward bug * refactor AsyncRpcCtx * set logical_shape correctly * simplify implementation of consistent_tensor.to_local * refine * initialize rpc_token with zero * refactor grad functions of to_consistent/to_local * refine * reformat and address pr comment * reformat * add check_meta_consistency in consistent2sonsistent * refactor eager_nccl_reduce lernel * refine * refine to_consistent api * ban_non_pod_data_in_eager_boxing * refine * refine * refine * backup code * THREAD_LOCAL_CACHED * Delete thread_local_cache.h * bugfix: 
DeviceId4ParallelId -> MachineId4ParallelId * optimize * support tensor str * Init code and can print consistent * refine format * remove useless to_consistent and format * refine code and print according data * attempt to support multi rank when fetch data * Revert "attempt to support multi rank when fetch data" This reverts commit ae56afad. * skip if tensor is consistent * delete useless * add comment * delete useless * traversal data to determine if int_mode * if consistent, return [...] * refine * add test and fix bug * add more assertTrue and delete useless * getitem using integer return scalar when tensor shape is [1] * add test cast * refine * fix spelling mistake * add op test and enhance in parse device * fix bug * fix docstr test bug and support to print meta * refine * auto format by CI * fix docstr in clip_grad.py * fix docstr * fix docstr and bug * the input shape parameter of reshape changed * add with flow.no_grad when operate tensor * fix docstr Co-authored-by: clackhan <han_binbin@163.com> Co-authored-by: tsai <jackalcooper@gmail.com> Co-authored-by: Xinqi Li <lixinqi0703106@163.com> Co-authored-by: Li Xinqi <lixinqi2010@gmail.com> Co-authored-by: oneflow-ci-bot <ci-bot@oneflow.org> Co-authored-by: hjchen2 <chenhoujiangcug@gmail.com> Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com> Co-authored-by: wyg1997 <wyg19970408@gmail.com> Co-authored-by: cheng cheng <472491134@qq.com>
-
Committed by leaves-zwx
* refine code * optimize code * refine code * refine * back up * add tensor.to func * make of_format * remove to in pyTensor * sync gpu data * refine * refine * refine * refine * refine * refine * refine * refine * refine * backup * refine * rebase * check in gen py * merge master and fix bugs * address pr comments * eager boxing * address pr comments * fix b2p error * auto format by CI * remove boxing * export sbp * add tensor to_consistent * /minor fix * minor fix * refine * remove useless head file * Fix optional * remove to in tensor.cpp * update * Support symbol placement type in functional. * add sbp and sbp list arg * refine * use functional * refactor CastConsistentOpExpr * to_consistent(flow.B) backward * Cache op expr * add EagerNcclOpKernelState * refine * refine * refine * refine * refine * refine * minor fix * capture OpInterpContext * unimplemented apply * add GetNdSbp * add mutex * refine * merge EagerConsistentTensorImpl::NewWithPhyTensor and EagerConsistentTensorImpl::NewWithoutPhyTensor into EagerConsistentTensorImpl::New * rename functiona SyncData to SyncMetaAndData * fix function yml * refine * refine * refine collective boxing * make of_format * of_format * add to_local to pybind * refactor EagerBoxingInterpreter * minor fix * optimize CastParallelDistribution * add placement_sbp_util * minor fix * eager boxing backward * minor fix * sync shape and data when tensor_to_local * fix rpc_token bugs * fix p2s backward bug * refactor AsyncRpcCtx * set logical_shape correctly * simplify implementation of consistent_tensor.to_local * refine * initialize rpc_token with zero * refactor grad functions of to_consistent/to_local * refine * reformat and address pr comment * reformat * add check_meta_consistency in consistent2sonsistent * refactor eager_nccl_reduce lernel * refine * refine to_consistent api * ban_non_pod_data_in_eager_boxing * refine * refine * refine * backup code * THREAD_LOCAL_CACHED * Delete thread_local_cache.h * bugfix: 
DeviceId4ParallelId -> MachineId4ParallelId * optimize * minor fix * LazyInterpreterApplyImplForParallelCastOpExpr * rm eager constraint * c2c interp ctx with parallel info * multi client collective boxing * test_to_consistent * support to_consistent grad_sbp * AsConsistentTensor * pass bwd test * add multi graph test * add ConsistentToConsistentOpExpr * LazyConsistentToConsistent * interpret ConsistentToConsistentOpExpr * update test * rm useless code * auto format by CI * fix conflict * mod comment * add message for local_tensor.to_consistent() check and consistent_tensor.to_local() check in lazy * address review * fix conflict * rm check which limit placement changing * auto format by CI * fix nd_sbp * auto format by CI * refactor to.py * ConsistentToConsistentOpExpr catch free tensor * fix copy op's sbp inferring * refactor empty infer sbp * refactor constant infer sbp * mod coco reader sbp inferring * fix GetSbpFn * fix consistent_to * fix (#5857) Co-authored-by: leaves-zwx <kunta0932@gmail.com> * modify comments * add test_to_placement case * clear code * unready test * refactor with InferNdSbp4SrcOp * rm out-dated comment * tidy code * SBP str -> cfg::SbpParallel Co-authored-by: clackhan <han_binbin@163.com> Co-authored-by: tsai <jackalcooper@gmail.com> Co-authored-by: Xinqi Li <lixinqi0703106@163.com> Co-authored-by: Li Xinqi <lixinqi2010@gmail.com> Co-authored-by: oneflow-ci-bot <ci-bot@oneflow.org> Co-authored-by: hjchen2 <chenhoujiangcug@gmail.com> Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com> Co-authored-by: Liang Depeng <liangdepeng@gmail.com>
-
Committed by Li Xinqi
* GetBroadcastGroup * fix comment typo. * broadcast shape and dtype * 1) rm THREAD_LOCAL_CACHED; 2) fix bugs in ThreadLocal * fix wrong use of LocalRank * revert several code from master * fix compiler complain * merge master Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by Yinggang Wang
* fix(Optim): fix optimizer list parameters input bug * refine Optimizer constructor * format * test(*): refine graph_rmsprop test Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com> Co-authored-by: cheng cheng <472491134@qq.com>
-
Committed by Li Xinqi
* wait vm empty before exiting * it's scheduler's duty to do Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com> Co-authored-by: cheng cheng <472491134@qq.com>
-
Committed by Houjiang Chen
Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by Luyang
* add more datasets * add more transform funcs * export interface * export datasets interface * auto format by CI * fix docs * skip test * support DistributedSampler * refine * add more transform function * fix err import * fix comment * refine * add more transform test * refactor dataloader test * refine * add ddp test * refine * refine * add ddp test case * skil test * add ddp test case * add test case * refine * rm ddp test * remove ddp test * auto format by CI * format * update api docs * add utils.rst * auto format by CI * fix ddp grad size Signed-off-by: daquexian <daquexian566@gmail.com> * remove print Signed-off-by: daquexian <daquexian566@gmail.com> * refine as comments * refine * auto format by CI * auto format by CI * refine * add ddp test * auto format by CI * rm test case * fix reshape Co-authored-by: oneflow-ci-bot <ci-bot@oneflow.org> Co-authored-by: daquexian <daquexian566@gmail.com> Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by Shenghang Tsai
* change file * check in changes * refine * refein * refien * add * fix git * refein * refein * refine * refine * refine Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
- 13 Aug, 2021 11 commits
-
-
Committed by Yao Chi
* forward finished * add gradients for l2_normalize * add test case and refine docstring * add blank line in file end * refine testcase * refine doctest * remove nn.l2_normlize, nn.functional.l2_normalize remains
-
Committed by Houjiang Chen
* Optimize maybe. * revert * refine code style * maybe: fix either_ptr and shared_or_scalar * maybe: clang format * maybe: fix error for nvcc * maybe: fix either_ptr and shared_or_scalar * maybe: clang format * either_ptr: fix dtor * maybe: fix Co-authored-by: PragmaTwice <i@twice.moe> Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by lichunyou
* first commit * for head * cuda * complete * rename * revise * space line * solve segment error * review1 * repeat * auto format by CI * rm si file * rm files * rm space * rm space1 * rm space1 * auto format by CI Co-authored-by: mu <702572275@qq.com> Co-authored-by: oneflow-ci-bot <ci-bot@oneflow.org>
-
Committed by Li Xinqi
* GetBroadcastGroup * fix comment typo. * broadcast shape and dtype * 1) rm THREAD_LOCAL_CACHED; 2) fix bugs in ThreadLocal * fix wrong use of LocalRank * 1) a decorator for disabling recursive boxing call; 2) a decorator for checking consistent tensor meta. * don't set consistent_id when recursively calling eager consistent op interpreter. * refactor tensor_rpc_util.h * add GlobalProcessCtx::NodeId and GetParallelId4CurrentProcessCtx * fix compiler complain * fix compiler complain * address pr comments Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com> Co-authored-by: cheng cheng <472491134@qq.com>
-
Committed by cheng cheng
* Remove single_client api files * Remove Single-Client API in oneflow default python * revert oneflow.env * refine docs * fix watcher and delete useless python callback * Remove DestroyGlobalWatcher * Revert test_deconv for CI * fix test tol Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by Luyang
* make setitem device match * consistent tensor support * refine * update * refine * auto format by CI * refine * refine * auto format by CI Co-authored-by: oneflow-ci-bot <ci-bot@oneflow.org> Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by qq_22305325
* add_eager_boxing_and_op_interpreter_dispatch_error_info * refine * refine * refine * use ErrorString4Inputs * auto format by CI Co-authored-by: oneflow-ci-bot <ci-bot@oneflow.org> Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by Xiaoyu Zhang
* align reshape input param with pytorch * fix pixel_shuffle impl * fix ci error * fix reshape result bug * fix ci error Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by Juncheng
* Kernel CUDA Graphs Support * fix CPU build * log * refine Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by Shijie
* add narrow user op * add narrow functor * add narrow module * add narrow gradient func * add split functor and module * testcase * fix test bug * testcase * add doc and doctest * fix docs Co-authored-by: BBuf <1182563586@qq.com> Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-
Committed by QiangX-man
* add index_select op * add CI testCase, update index_select op * solve some problems of code reviewing * update CI test * update index_select, add oundary conditions * create html, changed code format * delete module, change for using function call * add annotation * solve several problems of reviewing code * add annotation of param range * modified tensor index to a longer tensor Co-authored-by: Zhenhua <1209435+hengzi@users.noreply.github.com> Co-authored-by: Yao Chi <later@usopp.net> Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
-