提交 · 35b79ab86576bc9e224629d54ccf4a196c1e7c1d · 机器未来 / Paddle

27 11月, 2018 12 次提交
- Q
  
  fix infer compile test=develop · da387720
  由 Qiao Longfei 提交于 11月 27, 2018
  
  da387720
- C
  
  Add activation gelu (#14569) · 6c71c1f8
  由 Clementine 提交于 11月 27, 2018
  
  6c71c1f8
- C
  add ShareLoD for dropout_grad (#14616) · 6648f5ed
  由 chengduo 提交于 11月 27, 2018
```
test=develop
```
  6648f5ed
- Q
  
  fix compile problem test=develop · 92afbb92
  由 Qiao Longfei 提交于 11月 27, 2018
  
  92afbb92
- Q
  
  fix ci problem test=develop · 1edd435d
  由 Qiao Longfei 提交于 11月 27, 2018
  
  1edd435d
- T
  Make NCE_OP more efficient and support SelectedRows (#14469) · 56a4912b
  由 tangwei12 提交于 11月 27, 2018
```
* Fix truncated normal.

* Fix.

* Make nce support more distribution.

* Fix API.spec.

* Fix python API.

* Fix.
test=develop

* Fix API.spec
test=develop

* Fix sampler.

* Fix order of arguments in python API.
test=develop

* NCE add selectedrows support

* NCE update weighted sampling

* fix bugs in nce_op, and assign_value_op optimized

* fix bugs in nce_op, revert assign_value_op

* nce_op optimize

* nce_op optimize

* nce_op optimize

* add selectedRows test later

test=develop

* add selectedRows supported

* add selectedRows supported

test=develop

* add selectedRows supported

* add nce selectedRows supported, test=develop

* add nce selectedRows supported

* add nce selectedRows supported, test=develop

* fix height in nce, test=develop

* add ut

* add ut, test=develop

* make AutoGrownIndex inline
test=develop

* fix tinny error, test=develop
```
  56a4912b
- Q
  ctr reader can not be used in windows · f35f3fe7
  由 Qiao Longfei 提交于 11月 27, 2018
```
test=develop
```
  f35f3fe7
- P
  
  minor fix · 38715e6f
  由 peizhilin 提交于 11月 27, 2018
  
  38715e6f
- Q
  
  clean code test=develop · 6bef565d
  由 Qiao Longfei 提交于 11月 27, 2018
  
  6bef565d
- Q
  change log level · e7d1f524
  由 Qiao Longfei 提交于 11月 27, 2018
```
test=develop
```
  e7d1f524
- D
  
  add interp_method default bilinear. test=develop · bb489d4c
  由 dengkaipeng 提交于 11月 26, 2018
  
  bb489d4c
- D
  
  revert interpolate_op to bilinear_interp_op & nearest_interp_op. test=develop · 78f56391
  由 dengkaipeng 提交于 11月 26, 2018
  
  78f56391
26 11月, 2018 4 次提交

T
add comments and follow comments · 1f0291a5
由 tensor-tang 提交于 11月 26, 2018
```
test=develop
```
1f0291a5
Q
Transpose-Flatten-Concat fusion operator. (#14568) · 6224e61f
由 qingqing01 提交于 11月 26, 2018
```
* Transpose-Flatten-Concat fusion operator.
* Add unit testing and fix bug.
```
6224e61f

Fix save and load lookup table/optimizer vars (#14301) · 3639d99f

由 tangwei12 提交于 11月 26, 2018

*  fix mkdir conflict

*  fix load/save lookup tables

 test=develop

* add lookup_table_utils

* fix load optimize vars on pserver

* delete lookup table utils

* fix save and load lookup tables

* fix load optimizer var

* fix load optimizer var, test=develop

* fix python 3 style, test=develop

* move lookup_table_utils to contrib utils

3639d99f

Y
Use sub scope in tensor_array_to_tensor op. (#14524) · bf222f19
由 Yiqun Liu 提交于 11月 26, 2018
```
test=develop
```
bf222f19

25 11月, 2018 1 次提交
- G
  
  Add options to disable SO_REUSEPORT of grpc. (#14269) · c1bf9664
  由 gongweibao 提交于 11月 25, 2018
  
  c1bf9664
23 11月, 2018 5 次提交
- L
  
  add Set/GetCPUNumThreads api · e21edb26
  由 luotao1 提交于 11月 22, 2018
  
  e21edb26
- P
  add the bigobj option to NVCC compile · 445fff24
  由 peizhilin 提交于 11月 23, 2018
```
fix code style
```
  445fff24
- Q
  CUDA kernel for density_prior_box_op. (#14513) · 36f08eef
  由 qingqing01 提交于 11月 23, 2018
```
* CUDA kernel for density_prior_box_op.
* Support flatten to 2D.
```
  36f08eef
- T
  enable gru jitcode and refine act and lstm jitcode · 6a7f83d4
  由 tensor-tang 提交于 11月 23, 2018
```
test=develop
```
  6a7f83d4
- P
  
  rollback the format · 81bd7eef
  由 peizhilin 提交于 11月 23, 2018
  
  81bd7eef
22 11月, 2018 6 次提交

Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929) · 00b9e9a1

由 chengduo 提交于 11月 22, 2018

* refine cublase
test=develop

* code refine

* refine cublas

* add GEMME_EX

* add enable_cublas_tensor_op_math doc and add cublasCall
test=develop

* fix CublasCall for cuda version
test=develop

* fix error
test=develop

* fix GEMM_EX to be compatible with gcc 4.8
test=develop

* add GEMM_EX
test=develop

* to compatiable with gcc4.8
test=develop

00b9e9a1

P

fix unit test cases · 7c8c9dc9
由 peizhilin 提交于 11月 22, 2018

7c8c9dc9
T
enable peephole jitcode · 0c5ed5f6
由 tensor-tang 提交于 11月 22, 2018
```
test=develop
```
0c5ed5f6
T
init gru jitcode and fix lstm jitcode · e3b61cf5
由 tensor-tang 提交于 11月 22, 2018
```
test=develop
```
e3b61cf5
D
Group Norm (#13843) · ae7d2286
由 Dun 提交于 11月 22, 2018
```
Add group normalization operator.
```
ae7d2286

Windows/online (#14474) · d9a1f3e5

由 wopeizl 提交于 11月 22, 2018

* add recordio support

* disable the openblas multi-thread on windows since no support
adjust the python script

* code style

* code style
test=develop

* add create_recordio_file_reader back

* fix code style
test=develop

* fix the gtest.cmake on windows

* fix cc_test on windows

* fix the win build
test=develop

* remove fused compile support on windows
test=develop

* add the jit support
test=develop

* add the jit support, test=develop

* add the jit support, test=develop

* add the jit back
fix compile error on windows

* rollback test=develop

* test case fix

* disable DSO by default on windows

* exclude warpctc_op on windows

* exclude the dynload_warpctc out on windows
test=develop

* fix the scripts error
test=develop

* disable avx on windows by default
test=develop

* re-organize the cmake file

* disable mkl on windows by default

* add warp_ctc back

* fix the dependency

* fix the dependency

* fix the build issue on windows

* remove unsupported flag on windows

* code style

* code style
test=develop

* fix issue

* add profiler, parallel_executor back

* clean up the pre-definitions on windows

* fix build issue

* test=develop

d9a1f3e5

21 11月, 2018 4 次提交
- T
  add gru refer code and remove redundant avx code · 35620513
  由 tensor-tang 提交于 11月 21, 2018
```
test=develop
```
  35620513
- T
  jitkernel lstm refer support peephole · f9138608
  由 tensor-tang 提交于 11月 21, 2018
```
test=develop
```
  f9138608
- Y
  fix(Compile): fix depends error when compile op using cub · 3edd32d0
  由 Yu Yang 提交于 11月 21, 2018
```
some operators depend on cub and xxhash by header. The dependency should be declared explicitly rather than declared to pybind.

test=develop
```
  3edd32d0
- D
  Fix compling with cuDNN v5 · cda60311
  由 Dang Qingqing 提交于 11月 20, 2018
```
test=develop
```
  cda60311
20 11月, 2018 5 次提交
- T
  refine refer code and add lstm refer code · ce31deb7
  由 tensor-tang 提交于 11月 20, 2018
```
test=develop
```
  ce31deb7
- T
  
  add lstm jitcode · c2cfb03a
  由 tensor-tang 提交于 11月 20, 2018
  
  c2cfb03a
- Y
  
  Add the macro for NVCC (test=develop) · a906a361
  由 Yihua Xu 提交于 11月 20, 2018
  
  a906a361
- Y
  Revert "Remove the remnant code (test=develop)" · d91740ac
  由 Yihua Xu 提交于 11月 20, 2018
```
This reverts commit be506703.
```
  d91740ac
- Y
  
  Remove the remnant code (test=develop) · be506703
  由 Yihua Xu 提交于 11月 20, 2018
  
  be506703
19 11月, 2018 3 次提交

Q
Modify some infer-shape about detection operators in compile-time. (#14483) · 9eefd2c7
由 qingqing01 提交于 11月 19, 2018
```
* Modify some infer-shape in compile-time.
```
9eefd2c7

Optimize the layer_norm operator with AVX intrinsic function (#14417) · f4c869d8

由 Yihua Xu 提交于 11月 19, 2018

* Optimize layer_norm operator with AVX intrinsic functions

* Revert the wrong modifications

* Implement the jit kernel for layer_norm operator

* Add math headfile to fix the compile issue (test=develop)

* Add math headfile to fix the compile issue (test=develop)

* Fixed the intrinsic headfile issue (test=develop)

* Fix the conflicts (test=develop)

* Revert for CUDA compiler (test=develop)

* Fixed the cuda depency (test=develop)

* Fix the marco issues (test=develop)

f4c869d8

P

add warp_ctc back · 8443961a
由 peizhilin 提交于 11月 19, 2018

8443961a

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致