- 04 9月, 2018 4 次提交
-
-
由 qq_22305325 提交于
* add matmul & dot & multiply * optimize dot kernel * fix multiply kernel code style * optimize matmul kernel
-
由 Li Xinqi 提交于
-
由 qq_22305325 提交于
* add embedding look up infer blob desc * optimize inifer blob desc
-
由 qq_22305325 提交于
* add hinge loss * add hinge loss test * hack hinge loss * optimize hinge loss * optimize hinge loss * optimize hinge loss * optimize hinge loss
-
- 03 9月, 2018 2 次提交
- 02 9月, 2018 3 次提交
-
-
由 Li Xinqi 提交于
-
由 Li Xinqi 提交于
-
由 Jinhui Yuan 提交于
-
- 01 9月, 2018 2 次提交
-
-
由 Li Xinqi 提交于
-
由 Jinhui Yuan 提交于
-
- 31 8月, 2018 1 次提交
-
-
由 Juncheng 提交于
-
- 30 8月, 2018 1 次提交
-
-
由 Jinhui Yuan 提交于
-
- 29 8月, 2018 1 次提交
-
-
由 Jinhui Yuan 提交于
* sketch of merge reduce project * add reduce_concat, reduce_split in logical graph (#1160) * add reduce_concat, reduce_split in logical graph * init ReduceTaskNodes in CollectReduceTaskNodes * add CompTaskNode for ReduceConcat & ReduceSplit * set ReduceConcat/Split color index * copy blob desc from ReduceConcat in to ReduceSplit out * refine CollectReduceTaskNodes * SetMemSharing for ReduceConcat, ReduceSplit regst * complete ReduceConcat & ReduceSplit op * fill ReduceConcat & ReduceSplit kernel * simplify ReduceConcatCompActor * make ReduceScatter & ReduceSplit as input-wise actor * reduce_scatter & reduce_split use is_inplace * use ByteSizeOfBlobBody for reduce related packed blob * Fix dev merge reduce (#1168) * check concat and split occur simultaneously * fix ReduceScatter & ReduceSplit as Inputwise actor * ReduceConcat & ReduceSplit works * fix single gpu issue * Refactor reduce (#1170) * backup, not complete yet * remove reduce_id * rm useless comment * add reduce_graph (#1169) * add reduce_graph * fix iter * add IsLogicalNodeMergeable and fix bug * remove needless constructor calls * node VisualStr may conflict, using node_id_str instead * reduce group works (#1171) * refine * sort nodes in topo (#1172) * add reduce_group_size in job_conf, fix 121 config of ReduceSplit and MdUpdt * resolve code review issues (variable names) * refine variable names * Dev merge reduce rename reduce group (#1174) * ReduceGraph=>ChainLogicalGraph * rename Group=>Chain * reformat * use pointer instead of reference for mutable argument * format change * worker node only pull sub_plan (#1176) * log compile time * use c++11 member initialization syntax * FixPackedBlobDescOfProducedRegst for ReduceSplit * Dev merge reduce refine chain logical graph (#1177) * remove IsMerageable * split TryMergeOneChain and rename to TryMergeTwoChains * reformat * resolve review issues
-
- 27 8月, 2018 1 次提交
-
-
由 Jinhui Yuan 提交于
-
- 25 8月, 2018 2 次提交
-
-
由 Jinhui Yuan 提交于
-
由 Jinhui Yuan 提交于
* refactor EraseEmptyRegst (no dependence on weak_ptr) * weak_ptr -> shared_ptr * refine
-
- 24 8月, 2018 3 次提交
-
-
由 Li Xinqi 提交于
* gdb copy blob * make BnInOp2Blob called by gdb easily
-
由 strickland12 提交于
-
由 strickland12 提交于
* use resize() * use .size to calc bitset_num
-
- 22 8月, 2018 3 次提交
-
-
由 Li Xinqi 提交于
-
由 strickland12 提交于
* if UseRelayPlacement * judge if there is only one gpu parallel_conf * refine * fix naive error
-
由 strickland12 提交于
* use Special judgment in InitNodeProducedRegstAct * abandon kMdUpdtArea ActEvents
-
- 21 8月, 2018 1 次提交
-
-
由 Jinhui Yuan 提交于
* clear act_event_logger act_event_bin_filename * cluster_thrd_ids_key * simplify ofrecord_decoder multi-thread * let decoder use AllocateCpuThrdIdEvenly * let ofrecord_decoder use local thread pool
-
- 20 8月, 2018 6 次提交
-
-
由 Li Xinqi 提交于
-
由 Jinhui Yuan 提交于
-
由 Jinhui Yuan 提交于
* caching the cudnn conv algorithm to eliminate duplicate calculation * refine cudnn conv algo ctx cache
-
由 strickland12 提交于
-
由 strickland12 提交于
* rm collect kMdUpdtArea Ancestor * refine AddOrderingCtrlEdgeInSameChain * mv ChainGraph to SetChainIdAndOrderInGraphForEachNode * rm task_node ancestors * rm emplace()
-
由 Jinhui Yuan 提交于
* fix typo * rm useless IsThisMachineMaster * refine the var name of naive_plan, mem_shared_plan, improved_plan * refactor PushPlan and PullPlan * let master node broadcast subplans instead the whole plan * remove useless code * rm useless code * use total_mbn_name_key
-
- 19 8月, 2018 6 次提交
-
-
由 Li Xinqi 提交于
* backpropogate model_diff only if is trainable * bugfix: consume bw task node only if trainable * bugfix: connect md_updt and bw_node when bw_node is not null * bugfix: md_updt enter HandlerNormal only if there is model to train * set all op trainable = false when predicting
-
由 strickland12 提交于
* rm MdUpdt chain merge * use area_id == kMdUpdtArea * rm judgement * refine IsSubset
-
由 Jinhui Yuan 提交于
-
由 Jinhui Yuan 提交于
* refine act_id order condition * strict act id check (excluding model regst) * add TODO: figure out the ActNumForEachOutput of model regsts to MdSave area
-
由 Jinhui Yuan 提交于
* remove blob_inited check * fix inplace feature of reduce add actor and kernel * rm useless code * add EnableInplace, support CPU allreduce
-
由 Jinhui Yuan 提交于
-
- 18 8月, 2018 4 次提交
-
-
由 Jinhui Yuan 提交于
-
由 strickland12 提交于
* refine task_graph construtor * use const qualifier * add ordered_chain_nodes_
-
由 strickland12 提交于
-
由 Jinhui Yuan 提交于
* enable in_diff blobs of bw_add share mem * use default operator= for BlobDesc * blob's mem_shared_id -> blob_mem_id * minor refine * move mem_case util to memory_allocator.h * refine blob mem sharing * refine compute packed blob * add operator= for blob_desc
-