others_en.rst 2.2 KB
Newer Older
1 2 3 4 5 6

others
==================



Z
zq19 已提交
7
FLAGS_benchmark
8 9 10 11 12 13 14 15 16 17 18 19 20 21
**************************************
(since 0.12.0)

Used to do benchmark. If set, it will make scope delete synchronized, add some memory usage log, and synchronize all cuda kernel after kernel launches.

Values accepted
---------------
Bool. The default value is False.

Example
-------
FLAGS_benchmark=True will do some synchronizations to test benchmark.


Z
zq19 已提交
22
FLAGS_inner_op_parallelism
23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40
*******************************************
(since 1.3.0)

Most operators are working in single thread mode, but for some operator, use multi thread is more suitable. For Example, optimization op that optimize sparse gradient will be much faster to use multi thread. This flag is used to set the thread number inside an operator.

Values accepted
---------------
Int32. The default value is 0 which means that operator will not run in multi thread mode.

Example
-------
FLAGS_inner_op_parallelism=5 will set the thread number inside an operator to 5.

Note
-------
currently only sparse adam op supports inner_op_parallelism.


Z
zq19 已提交
41
FLAGS_max_body_size
42 43 44 45 46 47 48 49 50 51 52 53 54 55
*******************************************
(Since 1.0.0)

It controls the max message size in BRPC.

Values accepted
---------------
Int32. The default value is 2147483647.

Example
-------
FLAGS_max_body_size=2147483647 will set the BRPC message size to 2147483647.


Z
zq19 已提交
56
FLAGS_sync_nccl_allreduce
57 58 59 60 61 62 63 64 65 66 67 68 69 70
*******************************************
(since 1.3)

If the FLAGS_sync_nccl_allreduce is true, there will call `cudaStreamSynchronize(nccl_stream)` in allreduce_op_handle, this mode can get better performance in some scenarios.

Values accepted
---------------
Bool. The default value is True.

Example
-------
FLAGS_sync_nccl_allreduce=True will call `cudaStreamSynchronize(nccl_stream)` in allreduce_op_handle.


Z
zq19 已提交
71
FLAGS_tracer_profile_fname
72 73 74 75 76 77 78 79 80 81 82 83 84
*******************************************
(since 1.4.0)

FLAGS_tracer_profile_fname indicates the profiler filename for imperative tracer, which generated by gperftools. Only valid when compiled `WITH_PROFILER=ON`. Empty if disabled.

Values accepted
---------------
String. The default value is ("gperf").

Example
-------
FLAGS_tracer_profile_fname="gperf_profile_file" will set the profiler filename for imperative tracer to "gperf_profile_file".