提交 · 29d8781240753bbe2c00a970288cc8fc18c8d422 · 机器未来 / Paddle

12 8月, 2019 2 次提交

G
Polish fleet API to support cuda collective mode and nccl2 mode. (#18966) · 29d87812
由 gongweibao 提交于 8月 12, 2019
```
Polish fleet API to support cuda collective mode and nccl2 mode
```
29d87812

Refine embedding Api doc (#18820) · 744279fe

由 Kevin 提交于 8月 12, 2019

* fix overflow by int32 mul test=develop

* fix reference nullptr

* fix codestyle test=develop

* modify to point in ContextProjectFunctor test=develop

* modify to point in ContextProjectFunctor test=develop

* modify . to -> test=develop

* refine embedding padding_idx doc test=develop

* fix math:padding_idx preview bug test=develop

* modify API.spec test=develop

* fix spell error test=develop

* refine dtype parm desc test=develop

744279fe

11 8月, 2019 2 次提交

add save cache model api in fleet& add slots shuffle in dataset module & add... · 9150cf50

由 yaoxuefeng 提交于 8月 11, 2019

add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871)

* add ctr related metric layer test=develop

* add save cache and slots shuffle test=develop

* add save cache and slots shuffle test=develop

* fix error

* fix error

* fix style for ci

* fix for comments

* change SlotsShuffle input to std::strinf for generality

* fix style

* fix style

* fix style

* fix style

* fix style

* fix style

* fix stylr

* fix style

* fix style

* fix style

* fix style

* fix style

* fix style

* fix style

* fix style

* fix style

* fix style

* fix style

* fix style

* fix style

* change non-const reference to pointer

* fix style

* fix style

* fix style test=develop

* fix style  test=develop

* add return ins num in ctr metric op

* change dtype to float in metric_op.py

* fix error test=develop

* fix style test=develop

* fix API spec

* fix API spec

* fix API spec test=develop

* add UT test=develop

9150cf50

Z

remove book_memory_optimization directory, test=develop (#19117) · c51eb6bb
由 Zeng Jinle 提交于 8月 11, 2019

c51eb6bb

10 8月, 2019 2 次提交

Try to deprecate unstable python memory optimize (#18983) · c194b0c8

由 Zeng Jinle 提交于 8月 10, 2019

* deprecate python memory optimize, test=develop

* remove memory_optimize in unittests, test=develop

* add unittests to deprecated interfaces, test=develop

c194b0c8

Datafeed support reading to cuda place directly. (#19071) · 5a80cc84

由 hutuxian 提交于 8月 10, 2019

* add a place field in DataFeed to denote which place it will feed data to.
* abstract the copy process in CopyToFeedTensor function
* add UT for float32 type and for CUDAPlace

5a80cc84

09 8月, 2019 6 次提交

C
prune the feed op in compiler (#18997) · 3f4c088a
由 chengduo 提交于 8月 09, 2019
```
test=develop
```
3f4c088a
C
Remove compile from PE (#19080) · d2360332
由 chengduo 提交于 8月 09, 2019
```
* remove compile from PE
test=develop
```
d2360332

add eye op, kernel and unitest test=develop (#18980) · 4397cb31

由 ShenLiang 提交于 8月 09, 2019

* add eye op,test=document_preview test=develop

* fix the API.spec, test=develop

* fix the document, test=document_preview test=develop

* add unitest for CI coverage, test=develop

4397cb31

Add trilinear_interp OP (#18711) · f86fead6

由 Kaipeng Deng 提交于 8月 09, 2019

* add trilinear interp. test=develop

* fix unittest. test=develop

* add python api and test_layers. test=develop

* refine API.spec. test=develop

* fix format. test=develop

* add python API test. test=develop

* format code. test=develop

* refine code strcuture. test=develop

* fix format

* fix doc. test=develop

* fix converage. test=develop

* fix format. test=develop

f86fead6

C
Enhance fuse optimization op pass (#19010) · 17d62ab2
由 chengduo 提交于 8月 09, 2019
```
* Enhance fuse optimization op pass
test=develop
```
17d62ab2

Add call stack info during compile time (#19067) · 21440b4d

由 chengduo 提交于 8月 09, 2019

* Add call stack info during runtime and compile time
test=develop

* Rename operator_call_stack
test=develop

* Add unit test
test=develop

* follow comment
test=develop

21440b4d

08 8月, 2019 4 次提交

add fleet util, add some interface in hdfs util (#18752) · a99bc64c

由 jiaqi 提交于 8月 08, 2019

* add fleet util (fleet/utils/fleet_util.py): functions for users' convenience
* add some interface in hdfs util : hdfs is_file、hdfs cat

a99bc64c

[WIP] Add Imdb train demo (#18895) · 4ad7c9d5

由 mapingshuo 提交于 8月 08, 2019

* add train demo for imdb text classification task

* make inference library release data_feed dataset dataset_factory data_feed_factory

* add String Data Generator

* new feature of train demo: save model params

* New feature of train demo: set training config using gflags

* change code style for CI

* add readme and dataset for imdb demo trainer

4ad7c9d5

W
update roi doc in roi_pool and roi_align (#19036) · e50f527f
由 wangguanzhong 提交于 8月 08, 2019
```
* update roi doc in roi_pool and roi_align, test=develop
```
e50f527f

Fix memory overwriting of tensors returned by executor (#19030) · 8f537354

由 Leo Chen 提交于 8月 08, 2019

* fix memory overlapping of fetch var (return of executor.run), test=develop

* fix wrong usage of ParallelExecutor in op_test, test=develop

* remove useless parameter and simplify code

* avoid tensor destruct untimely, test=develop

* add testcase independent of OpTest, test=develop

8f537354

07 8月, 2019 1 次提交
- K
  
  fix natural exp decay doc. test=develop (#19025) · 1f46253d
  由 Kaipeng Deng 提交于 8月 07, 2019
  
  1f46253d
06 8月, 2019 4 次提交

L

Fix ExponentialMovingAverage api bug in python3, test=develop (#18775) · e5b9753a
由 LielinJiang 提交于 8月 06, 2019

e5b9753a

Add var_conv_2d op (#18518) · e681d655

由 Kevin 提交于 8月 06, 2019

* fix overflow by int32 mul test=develop

* fix reference nullptr

* fix codestyle test=develop

* modify to point in ContextProjectFunctor test=develop

* modify to point in ContextProjectFunctor test=develop

* modify . to -> test=develop

* add var_conv_2d op test=develop

* edit api.spec test=develop

* ignore unittest if with_mkl=off test=develop

* fix python3 division test=develop

* fix ignore unittest bug test=develop

* remove useless code test=develop

* modify api.spec test=develop

* modify default_grad.spec test=develop

e681d655

C
Fix config description error in cuda_profiler function document (#18750) · 81fe02c3
由 Chen Weihang 提交于 8月 06, 2019
```
* fix profiler doc error, test=develop

* update API.spec, test=develop
```
81fe02c3
Z

reduce_unittest_time,test=develop (#19005) · 311f90f1
由 Zeng Jinle 提交于 8月 06, 2019

311f90f1

05 8月, 2019 6 次提交

L
fix dropout (#18965) · 5d9df8c8
由 lvmengsi 提交于 8月 05, 2019
```
Fix dropout in nn.py
```
5d9df8c8

fix g_param shape mismatch in WeightNormParamAttr (#18940) · 4da1c4f1

由 SunGaofeng 提交于 8月 05, 2019

* fix g_param shape mismatch in WeightNormParamAttr

* add comment to show why insert reshape in startup_program
test=develop

4da1c4f1

J

test=develop, fix memory leak in dygraph (#18998) · af63b118
由 Jiabin Yang 提交于 8月 05, 2019

af63b118

fix warpctc.dll not found issue (#18761) · a43a763b

由 liuwei1031 提交于 8月 05, 2019

* fix warpctc.dll not found issue, test=develop

* revert the linux platform change, test=develop

* delete warpctc_lib_path.h.in, test=develop

* add SetPySitePackagePath function

* fix warpctc.dylib not found issue on Mac, test=develop

* improve the paddle lib path setting logic, test=develop

* fix mac ci issue caused by test_warpctc_op unittest, test=develop

* tweak code, test=develop

a43a763b

C
Add checking for the fetch_list of Executor.run (#18957) · 01c7daad
由 chengduo 提交于 8月 05, 2019
```
* update exe.run
```
01c7daad
L
support tensor input for ctc align op (#18887) · faf6890b
由 Liufang Sang 提交于 8月 05, 2019
```
* test=develop support Tensor input for ctc_align_op

* test=develop add some comment
```
faf6890b

04 8月, 2019 1 次提交
- D
  make listen and server as exclusive run (#18990) · c97ea53c
  由 Dong Daxiang 提交于 8月 04, 2019
```
make listen and server as exclusive run 
```
  c97ea53c
02 8月, 2019 5 次提交

X
fix unalign of some examples (#18943) · 8ce90254
由 xsrobin 提交于 8月 02, 2019
```
* test=develop test=document_preview

* Update API.spec
```
8ce90254

Open gc by default (#18836) · 7ac748ad

由 Zeng Jinle 提交于 8月 02, 2019

* open gc by default, test=develop

* fix test_train_recognize_digits and disable gc when ngraph is enabled, test=develop

* fix conditional_block op eager deletion bug, test=develop

* add some comments to reviewers, test=develop

7ac748ad

H

fix expand op dtype build bugs; test=develop (#18932) · f745d6d9
由 hong 提交于 8月 02, 2019

f745d6d9

support filelist size < trainer num && fix pull dense (#18956) · 02c370c3

由 jiaqi 提交于 8月 02, 2019

* support filelist size < trainer num
* pull dense when stop, to make sure local dense params are same as pserver, so save paddle model will save dense model same as pserver
*  enable QueueDataset train same filelist for serveral times

02c370c3

石

Fusion: seqpool_cvm_concat (#18471) · ee2f296e

由石晓伟提交于 8月 02, 2019

* add fusion_seqpool_cvm_concat test=develop

* simplify pass, test=develop

* fix code style, test=develop

ee2f296e

01 8月, 2019 4 次提交

J
adjust ins weight according to nid slot (#18784) · 768059b3
由 jiaqi 提交于 8月 01, 2019
```
adjust ins weight according to nid slot , user can specify adjust_ins_weight in strategy
```
768059b3

Add the op of unique_with_counts, expand count function of the op unique (#18720) · 3ab1866c

由 wawltor 提交于 8月 01, 2019

* test=develop
Add the op of unique_with_counts, the op is calc the unqiue input of data, and output the corresponding indices and count of data.

* test=develop
Check the input and dtype in the op of unique_with_counts

* test=develop
test=document_preview
update the API.spec for `unique_with_counts`, at the same time, optimize the python api in the op of `unique_with_count`

* test=develop
test=document_preview
Fix some python api problem in the op of `unique_with_counts`, and change the error messsage in this op.

* Fix some API problem in the op of `unique_with_counts`
test=develop
test=document_preview

* test=develop
test=document_preview
Fix the api sample of op `unique_with_counts`, and update api.spec

3ab1866c

L
Fix depthwise conv gpu kernel bug (#18582) · 22fa4c2d
由 LielinJiang 提交于 8月 01, 2019
```
* fix depthwise conv gpu kernel bug, test=develop
* add more depthwise conv test, test=develop
```
22fa4c2d
W
Fix unitest of light nas. (#18931) · c92b78b0
由 whs 提交于 8月 01, 2019
```
test=develop
```
c92b78b0

31 7月, 2019 3 次提交

set fleet_send_batch_num a default value according to trainer num · 233746d8

由 jiaqi 提交于 7月 31, 2019

(1) set fleet_send_batch_num a default value according to trainer num， the previous 80000 is fixed，if trainer num is much less or larger than 100，global shuffle may have timeout error.

(2) fix load one table bug, add barrier

233746d8

C
[DyGraph] Make multi-card program faster (#18892) · 20859c08
由 chengduo 提交于 7月 31, 2019
```
* update parallel.py
test=develop
```
20859c08

Add center Loss Op Support (#18681) · 24f85431

由 HaoRen 提交于 7月 31, 2019

* support center loss
* change tensor copy  api to high level api tensorcopy

* test=develop rewrite the center_loss cuda_kernel to make it faster
and add document of the center loss api,also update test function

* test=document_preview test=develop
update document of center loss

* test=document_preview test=develop
modify API.spec modify test code remove nouse const_cast

24f85431

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致