- 10 Jan, 2019 1 commit
-
-
Committed by Wu Yi
* clip softmax for fp16
* updates
* fuse xent support fp16 test=develop
* wip
* wip
* add simple row reduce
* wip fp16 accurate softmax
* add accurate softmax kernel for fp16 test=develop
* update test=develop
* fix cpu build test=develop
* update api.spec test=develop
* follow comments test=develop
* fix build test=develop
* fix trt build test=develop
* fix inference build test=develop
* fix merge test=develop
* update test=develop
* try fix build test=develop
* fix build test=develop
* rename real_exp test=develop
* fortest
* remove hacky kernels test=develop
* clean up test=develop
-
- 04 Jan, 2019 1 commit
-
-
Committed by Qiao Longfei
test=develop
-
- 26 Dec, 2018 1 commit
-
-
Committed by sneaxiy
test=develop
-
- 24 Dec, 2018 1 commit
-
-
Committed by whs
* Init slim.
* Remove distillation demo.
* Fix import errors. test=develop
* Fix some issues. test=develop
* Fix configs. test=develop
* Modify API.spec. test=develop
* Fix format. test=develop
* Fix format. test=develop
* Add some comments.
-
- 20 Dec, 2018 1 commit
-
-
Committed by tangwei12
* fix utils config, test=develop
* rewrite tools in lookup table utils, test=develop
* merge develop, test=develop
* add copy in setup, test=develop
* update api spec, test=develop
* add doc, test=develop
* prettifying doc, test=develop
* Update API.spec update api.spec, test=develop
* Update lookup_table_utils.py test=develop
* Update lookup_table_utils.py test=develop
* merge develop, test=develop
* merge develop, test=develop
* init fix, test=develop
* fix in downloads, test=develop
* fix in setup.in, test=develop
-
- 19 Dec, 2018 3 commits
- 17 Dec, 2018 4 commits
-
-
Committed by dongdaxiang
-
Committed by Yancey1989
-
Committed by dongdaxiang
test=develop
-
Committed by dongdaxiang
test=develop
-
- 14 Dec, 2018 4 commits
-
-
Committed by minqiyang
test=develop
-
Committed by Qiao Longfei
test=develop
-
Committed by minqiyang
test=develop
-
Committed by dengkaipeng
-
- 13 Dec, 2018 3 commits
-
-
Committed by SunGaofeng
* Add psroi_pool operator.
-
Committed by Yancey1989
-
Committed by Yancey1989
-
- 12 Dec, 2018 2 commits
-
-
Committed by sneaxiy
add API.spec test=develop
-
Committed by dengkaipeng
-
- 11 Dec, 2018 1 commit
-
-
Committed by dengkaipeng
-
- 10 Dec, 2018 2 commits
-
-
Committed by frankwhzhang
-
Committed by frankwhzhang
-
- 09 Dec, 2018 1 commit
-
-
Committed by frankwhzhang
-
- 07 Dec, 2018 1 commit
-
-
Committed by frankwhzhang
-
- 06 Dec, 2018 1 commit
-
-
Committed by frankwhzhang
-
- 05 Dec, 2018 1 commit
-
-
Committed by chengduo
* expose square test=develop
* fix activation test=develop
* Add square API test=develop
* add necessary op
* code refine
* fix API.spec test=develop
* fix unit test test=develop
* add unit test sparse_grad_clip test=develop
* fix API.spec test=develop
* remove mac test for test_gradient_clip test=develop
* remove selectedrows_mul_tensor test=develop
-
- 01 Dec, 2018 1 commit
-
-
Committed by phlrain
-
- 30 Nov, 2018 1 commit
-
-
Committed by Wang Guibao
* AsyncExecutor: C++ side
* Google naming conventions
* Rename MultiExecutor to AsyncExecutor
* pybind with async_executor
* Naming convention
* remove some flags and unused code
* add refactored file of async_executor and data_feed
* clear async executor interface and add data feed factory
* split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
* Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
* refine async_executor_refactor.cc
* add some files about datafeed
* Revert "add some files about datafeed" This reverts commit 8ee8133a.
* Interface rework
* add MultiSlotDataFeed
* Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
* update data_feed for add MultiSlotDataFeed
* update datafeed and async_executor to run bow_net demo
* fix bug that finish_set_filelist failed in multithread
* delete finish_binding_memory_(flag), because it can not be marked under the current interface
* Fix bug
* update async_executor.py for support set_use_slots
* update async_executor.py for support set_use_slots and set set_dense_slots
* fix bug that when the number of files is less than the number of threads, it will fetch nan
* remove redundant code, and make executor exit when set a illegal queue size
* add batch_size check
* add MultiSlotDesc
* Revert "add MultiSlotDesc" This reverts commit 2e72ebfa.
* add some checkpoint in DataFeedDesc
* add CheckFile function in MultiSlotDataFeed
* update something error info
* fix deadlock bug
* Fix fetch variable
* Merge error
* fix code style in async_executor
* using one lock blocking queue replace two lock blocking queue because of some bugs
* update code style
* add utest for data_feed
* Fix fetch var
* update utest for data_feed for multithread
* update SetFileList info
* fix bug in utest of data_feed
* Add comments for python
* Add comments for python code
* Fix pybind.cc with new pybind11 version
* add note for DataFeedDesc's set_use_slots function
* Add save_model
* update data_feed_test for multi-type
* add comment for executor_thread_worker
* Remove unused code
* update data_feed_test for generate test data file
* removed unnecessary interfaces and add comments
* c++ style check
* update data_feed.cc
* AsyncExecutor: C++ side Google naming conventions Rename MultiExecutor to AsyncExecutor pybind with async_executor Naming convention remove some flags and unused code add refactored file of async_executor and data_feed clear async executor interface and add data feed factory split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch refine async_executor_refactor.cc add some files about datafeed Revert "add some files about datafeed" This reverts commit 8ee8133a. add MultiSlotDataFeed Interface rework Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side update datafeed and async_executor to run bow_net demo update async_executor.py for support set_use_slots Fix bug update async_executor.py for support set_use_slots and set set_dense_slots fix bug that when the number of files is less than the number of threads, it will fetch nan remove redundant code, and make executor exit when set a illegal queue size add MultiSlotDesc Revert "add MultiSlotDesc" This reverts commit 2e72ebfa. add some checkpoint in DataFeedDesc Fix fetch variable fix code style in async_executor Fix fetch var add utest for data_feed Add comments for python update utest for data_feed for multithread fix bug in utest of data_feed Add comments for python code Fix pybind.cc with new pybind11 version add note for DataFeedDesc's set_use_slots function update data_feed_test for multi-type Add save_model update data_feed_test for generate test data file removed unnecessary interfaces and add comments add comment for executor_thread_worker Remove unused code update data_feed.cc c++ style check
* commit for code style
* commit for code style
* commit for code style
* commit for code style
* Comment away __init__ in async_executor.py
* clang-format fix test=develop
* use PADDLE_THROW instead of exit(-1); use unique_ptr to manage scope var in data_feed_test.cc
* commit for update code style
* commit for update code style
* Add async_executor demo; Remove some methods test=develop
* commit for update code style
* commit for update code style
* commit for update code style
* update API.spec
* AsyncExecutor test=develop
* AsyncExecutor test=develop
* AsyncExecutor test=develop
* AsyncExecutor test=develop
* Fix API.spec test=develop
* Fix API.spec test=develop
* Fix windows build error test=develop
* Fix windows build error test=develop
* Fix windows build error test=develop
* Fix windows build error test=develop
* Fix Windows Build test=develop
* Fix Windows Build test=develop
* Fix Windows Build test=develop
* Fix code style test=develop
* Fix code style test=develop
* update datafeed
* Fix code style test=develop
* update data_feed_test for test Tensor test=develop
* Fix code style test=develop
* Fix windows build failure test=develop
* Fix code style and windows build failure test=develop
* Fix PYTHON3.5 build failure test=develop
* AsyncExecutor API test=develop
-
- 29 Nov, 2018 6 commits
-
-
Committed by phlrain
-
Committed by Qiao Longfei
test=develop
-
Committed by Xin Pan
test=develop
-
Committed by qingqing01
* Enable BatchNorm to use global mean and variance during training
* Update doc and follow comments.
-
Committed by Xin Pan
test=develop
-
Committed by Xin Pan
test=develop
-
- 27 Nov, 2018 4 commits
-
-
Committed by JiabinYang
-
Committed by Qiao Longfei
-
Committed by tangwei12
* Fix truncated normal.
* Fix.
* Make nce support more distribution.
* Fix API.spec.
* Fix python API.
* Fix. test=develop
* Fix API.spec test=develop
* Fix sampler.
* Fix order of arguments in python API. test=develop
* NCE add selectedrows support
* NCE update weighted sampling
* fix bugs in nce_op, and assign_value_op optimized
* fix bugs in nce_op, revert assign_value_op
* nce_op optimize
* nce_op optimize
* nce_op optimize
* add selectedRows test later test=develop
* add selectedRows supported
* add selectedRows supported test=develop
* add selectedRows supported
* add nce selectedRows supported, test=develop
* add nce selectedRows supported
* add nce selectedRows supported, test=develop
* fix height in nce, test=develop
* add ut
* add ut, test=develop
* make AutoGrownIndex inline test=develop
* fix tiny error, test=develop
-
Committed by liuhongyu
-