Commits · 25a233e46d42f6cb76220d58d89f882723a7a4fc · 机器未来 / Paddle

05 Apr, 2020 1 commit

Add the matmul, elementwise_euqal, elementwise_sum ops to API2.0 (#23437) · 08e3d9c0

Committed by wawltor on Apr 05, 2020

* Add the matmul, elementwise_euqal, elementwise_sum ops to API2.0
* Fix the import message in common_ops_import
* Update the test case for mm

08e3d9c0

04 Apr, 2020 1 commit

Add allclose_op (#23335) · 56b50c97

Committed by Zhen Wang on Apr 04, 2020

* Add the allclose op, whose function is analogous to numpy.allclose: it returns True if two tensors are elementwise equal within a tolerance.

56b50c97
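
The commit above describes allclose as analogous to numpy.allclose. As a rough illustration only, the following pure-Python sketch applies the tolerance test documented for numpy.allclose, `|a - b| <= atol + rtol * |b|`, to flat lists of floats. The function name and the list-based inputs are assumptions made for this example; it is not the Paddle op's implementation and does not handle broadcasting or NaN.

```python
def allclose(a, b, rtol=1e-05, atol=1e-08):
    """Return True if every pair (x, y) satisfies |x - y| <= atol + rtol * |y|.

    Hypothetical helper for illustration: it works on flat Python lists,
    not on tensors, and mirrors only the documented numpy.allclose criterion.
    """
    if len(a) != len(b):
        raise ValueError("inputs must have the same length")
    return all(abs(x - y) <= atol + rtol * abs(y) for x, y in zip(a, b))


print(allclose([1.0, 2.0], [1.0, 2.0 + 1e-9]))  # True: difference is within tolerance
print(allclose([1.0, 2.0], [1.0, 2.1]))         # False: 0.1 exceeds the tolerance
```

Note that the criterion is asymmetric: the relative tolerance is scaled by the second argument, so swapping the inputs can change the result near the boundary.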

03 Apr, 2020 1 commit

update linspace, equal operators to API 2.0 (#23274) · a2e10930

Committed by channings on Apr 03, 2020

* update linspace, equal operators to API 2.0, test=develop

* equal support higher performance CUDA kernel, test=develop

* update comment of equal&linspace operator, test=develop

* update comment of equal&linspace operator, test=develop

a2e10930

23 Mar, 2020 1 commit

reorganize the paddle api test=develop (#23151) · 194a22c5

Committed by XiaoguangHu on Mar 23, 2020

194a22c5

19 Mar, 2020 1 commit

Add Support for Break and Continue in Dygraph to Static (#23067) · fb7b008a

Committed by Huihuang Zheng on Mar 19, 2020

1. Add support for Break and Continue in Dygraph to Static
2. Also add support for gast.Not in NodeTestTransformer
3. Also add support for logical op transformation in LoopTransformer

fb7b008a
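
As a hedged illustration of the idea behind the commit above, the sketch below shows how loops that use `break` and `continue` can be rewritten by hand into loops driven by boolean flags and a logical `not`. This is only a plain-Python analogy: the function names are made up for this example, and the code is not the output of Paddle's transformers, whose actual rewriting rules are not described in the commit message.

```python
def first_negative_with_break(values):
    """Original form: stop at the first negative value using `break`."""
    found = None
    for v in values:
        if v < 0:
            found = v
            break
    return found


def first_negative_without_break(values):
    """Rewritten form: a flag in the loop condition replaces `break`."""
    found = None
    stop = False
    i = 0
    while i < len(values) and not stop:
        v = values[i]
        if v < 0:
            found = v
            stop = True
        i += 1
    return found


def sum_non_negative_with_continue(values):
    """Original form: skip negative values using `continue`."""
    total = 0
    for v in values:
        if v < 0:
            continue
        total += v
    return total


def sum_non_negative_without_continue(values):
    """Rewritten form: the rest of the body is guarded by `not skip`."""
    total = 0
    for v in values:
        skip = v < 0
        if not skip:
            total += v
    return total
```

Both rewrites introduce `not` and `and` into conditions, which suggests (as an assumption, not a statement from the commit) why support for logical operators would accompany break/continue support.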

27 Jun, 2019 1 commit

supports collective communicated training (#18175) · b7128bac

Committed by HaoRen on Jun 27, 2019

* fix prepare context redundant code problem, optimize executor by caching create_varaiables
test=develop

* supports collective training in executor

* make fetch_list runnable with variables, add more unittest for use_program_cache
test=develop

* fix comment
test=develop

* use unique name for nccl_id

* supports output to stream in program_to_code

* insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code

* set op role in collective training

* add collective op role

* remove orig file

* add build optimizer by strategy

* add collective strategy

* refine collective strategy

* add multi-process role maker

* refine strategy building factory so that we can easily plug in more strategies

* scale loss grad in collective sgd transpiler

* add support for distributed fc

* code format

* revert some features for dist fc

* add support for distributed fc training

* test=develop
add collective op unittest standard

* test=develop
remove the test_collective directory

* test=develop
remove the test_collective directory

* remove slicegather test

* code format for reducescatter

* update attr of shard_index_op

* Modify macro nccl_helper

* remove test without distribute

* macro collective_helper

* macro update

* test=develop
update support python3.5

* test=develop change gpu memory use to 0.1 when test

* test=develop
update ut equal func

* test=develop
set flags to 1.5

* test=develop fix pickle dump in py35

* test=develop
fix divide in slice and add sync_comm_stream
update atol and rtol to 1e-05
rm shard_index op and test
modify read input from file to read from memory
remove origin_program in framework and add i/o in c_sync_calc_stream

* test=develop update unittest sync operator I/O

b7128bac

15 Aug, 2018 1 commit

Add print_function for all python files · 99d3f089

Committed by minqiyang on Aug 15, 2018

99d3f089

24 Feb, 2018 2 commits

replace paddle.v2.fluid by paddle.fluid in tests · bde090a9

Committed by Luo Tao on Feb 24, 2018

bde090a9

move Fluid API code out of V2 API code · b11956a0

Committed by Luo Tao on Feb 24, 2018

b11956a0

13 Feb, 2018 1 commit

Run Python OP tests in a single Python process to improve test time. (#8362) · cde6241a

Committed by Xin Pan on Feb 13, 2018

Currently, our tests run with 2 GPUs and the init time is absurdly long:
about 4s for each process. Because each OP test currently runs in a
separate process, that cost is paid once per test. This PR:

1. Creates the cmake function py_test_modules, which generates the
Makefile that runs a list of Python unittest modules in a single Python
process.

2. Moves all "python unittest compatible" files (i.e., files that use
the unittest package rather than being just a regular python file) from
fluid/tests to fluid/tests/unittests.

3. Makes cmake run all OP tests in fluid/tests/unittests in a single
process, except the time-consuming tests, which are separated into
different processes to utilize parallelism. Please make sure to use the
unittest package if you put a python test file in
fluid/tests/unittests.

4. Removes all exit(0) calls from fluid/tests/unittests/*.py. exit(0)
was used to disable a unittest, but that cannot be done when running all
tests in a single process, since it terminates the process without
running the other tests. Instead, the test is disabled in
fluid/tests/unittests/CMakeLists.txt, and a FIXME is added for each
disabled item. For all Python files in fluid/tests/unittests/, please
disable a unittest from fluid/tests/unittests/CMakeLists.txt instead of
adding exit(0) to the Python file.

5. Adds an option WITH_FAST_BUNDLE_TEST. When OFF, the unit tests run
in separate processes so that they can be tested individually.

cde6241a
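
The core idea of the commit above, running many unittest modules inside one Python process so the startup cost is paid once, can be sketched with the standard library alone. The test classes and the `run_in_single_process` helper below are hypothetical names invented for this example; the real py_test_modules cmake function and its generated commands are not shown in the commit message and may work differently.

```python
import unittest


class AddTest(unittest.TestCase):
    """Stand-in for one OP test module (hypothetical)."""

    def test_add(self):
        self.assertEqual(1 + 1, 2)


class MulTest(unittest.TestCase):
    """Stand-in for another OP test module (hypothetical)."""

    def test_mul(self):
        self.assertEqual(2 * 3, 6)


def run_in_single_process(test_cases):
    """Load several TestCase classes into one suite and run them in this process."""
    loader = unittest.TestLoader()
    suite = unittest.TestSuite(
        loader.loadTestsFromTestCase(case) for case in test_cases
    )
    return unittest.TextTestRunner(verbosity=0).run(suite)


# Both test classes share one interpreter, so any expensive
# initialization would happen only once.
result = run_in_single_process([AddTest, MulTest])
```

In a shared process like this, a module-level `exit(0)` in any test file would raise `SystemExit` and end the whole run, which is consistent with the commit's reason for disabling tests in CMakeLists.txt instead.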

12 Feb, 2018 1 commit

Fix the grammar in copyright. (#8403) · 24509f4a

Committed by qingqing01 on Feb 12, 2018

24509f4a

24 Jan, 2018 1 commit

Rename is_compile_gpu to is_compiled_with_cuda · d0a04757

Committed by Yang Yu on Jan 24, 2018

The English of the previous API is bad.

d0a04757

21 Jan, 2018 1 commit

"fix decode bug" (#7711) · e983cc90

Committed by dzhwinter on Jan 21, 2018

* "fix decode bug"

* "follow comment"

* "fix error"

* "fix hook bug"

* fix based on comment

* fix copyright

* fix based on comment

e983cc90

15 Jan, 2018 1 commit

Feature/hooks (#7513) · b9b75377

Committed by dzhwinter on Jan 15, 2018

* add copyright hook

* add copyright hook

* refine copyright hook

* "test copyright hook"

* fix check style

* fix ci

b9b75377

14 Nov, 2017 1 commit

Change framework to fluid (#5637) · 4adc8a7a

Committed by Qiao Longfei on Nov 14, 2017

* init commit

* change some dir name

4adc8a7a

21 Oct, 2017 1 commit

Global function, op_support_gpu (#4980) · 86437a8d

Committed by Yu Yang on Oct 20, 2017

86437a8d
