- 12 Apr 2016, 17 commits
-
Committed by Derek Murray
Change: 119605636
-
Committed by A. Unique TensorFlower
Usage example: ./remote_test.sh --num-workers 3 --sync-replicas
Also changed:
1) In local and remote tests, let different workers contact separate GRPC sessions.
2) In local and remote tests, added the ability to specify the number of workers; previously it was hard-coded at 2. Usage example: ./remote_test.sh --num-workers 2 --sync-replicas
3) Used the device setter in mnist_replica.py.
Change: 119599547
-
Committed by Andrew Harp
Change: 119591021
-
Committed by Derek Murray
Change: 119589456
-
Committed by Dan Mané
Goals:
- Have enough of each summary type that tag grouping is useful. (Wound up recording e.g. mean, stddev, and min/max for each variable.)
- Use every summary type (adds images).
- Write to multiple directories so there are several "runs".
Change: 119585022
-
Committed by Dan Mané
Update bower dependencies. Also force URLs to lowercase. Change: 119584968
-
Committed by A. Unique TensorFlower
Change: 119572994
-
Committed by A. Unique TensorFlower
Move favicon to a data URI. Change: 119569013
-
Committed by A. Unique TensorFlower
Change: 119565375
-
Committed by A. Unique TensorFlower
Change: 119565115
-
Committed by A. Unique TensorFlower
Clarify that OUT_OF_RANGE is raised only when reaching the end of input for iterable contents. Change the few places where we incorrectly raised OUT_OF_RANGE to raise ILLEGAL_ARGUMENT instead. This will make code that catches the OUT_OF_RANGE exception more robust, as it won't get confused by spurious uses of the exception class. Change: 119560848
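The distinction can be sketched in plain Python. The error classes and the reader below are hypothetical stand-ins for illustration, not TensorFlow's actual error types:

```python
class OutOfRangeError(Exception):
    """Raised only when the end of iterable input is reached."""

class IllegalArgumentError(Exception):
    """Raised when the request itself is malformed, e.g. a negative count."""

class RecordReader:
    """Toy reader showing when each error is appropriate."""

    def __init__(self, records):
        self._records = list(records)
        self._pos = 0

    def read(self, n):
        if n < 0:
            # Not an end-of-input condition: the request itself is bad.
            raise IllegalArgumentError("read count must be non-negative")
        if self._pos >= len(self._records):
            # Genuine end of input: callers can safely catch this and stop.
            raise OutOfRangeError("no more records")
        out = self._records[self._pos:self._pos + n]
        self._pos += len(out)
        return out
```

A loop that catches only `OutOfRangeError` then terminates cleanly at end of input, without masking genuinely invalid requests.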
-
Committed by A. Unique TensorFlower
on the caller. This allows us to use the macros for other purposes than calling REGISTER_KERNEL, in particular in variadic template parameter lists. Update REGISTER_KERNEL_BUILDER accordingly to add a semicolon, so that existing code continues to compile. Change: 119551677
-
Committed by Eugene Brevdo
Change: 119549296
-
Committed by A. Unique TensorFlower
correctly. Change: 119549145
-
Committed by A. Unique TensorFlower
Change: 119544956
-
Committed by Geoffrey Irving
This is used only within Google. Change: 119543426
-
Committed by Geoffrey Irving
Dimension.__str__ now prints "?" for unknown dimensions and an integer otherwise. Previously this logic was contained in TensorShape.__str__. In addition, the exception produced by TensorShape.merge_with now encodes both shapes in full, so a message like "Dimensions 2 and 9 are not compatible" becomes "Shapes (?, 2) and (4, 9) are not compatible". Change: 119536897
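A minimal plain-Python sketch of the formatting behavior described above (these toy classes are illustrative stand-ins, not the real tf.Dimension / tf.TensorShape implementations):

```python
class Dimension:
    """Dimension with an optional statically known value; None means unknown."""

    def __init__(self, value=None):
        self.value = value

    def __str__(self):
        # "?" for an unknown dimension, the integer otherwise.
        return "?" if self.value is None else str(self.value)

class TensorShape:
    """Shape as a sequence of Dimensions; delegates formatting to Dimension."""

    def __init__(self, dims):
        self.dims = [Dimension(d) for d in dims]

    def __str__(self):
        return "(%s)" % ", ".join(str(d) for d in self.dims)
```

With the per-dimension logic in `Dimension.__str__`, error messages can format whole shapes like `(?, 2)` cheaply by joining the dimensions.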
-
- 11 Apr 2016, 5 commits
-
Committed by A. Unique TensorFlower
Change: 119533248
-
Committed by A. Unique TensorFlower
Use the Eigen ThreadPool instead of the TensorFlow one if TENSORFLOW_USE_EIGEN_THREADPOOL is defined. This will allow us to switch to the new non-blocking ThreadPool. Change: 119512280
-
Committed by Eugene Brevdo
The RNN performance bug:
* When passing sequence_length to rnn(), calculations were being performed past max_sequence_length.
This bug had one major side effect:
* It slowed down the calculation past max_sequence_length (it *should* return zeros for outputs and copy state through).
The calculations themselves were still correct: the state was still copied through and the output was still all zeros. But that calculation was performed via a vector-conditional select() instead of a single scalar cond(). As a result, a lot of extra copying was happening in both fw and backprop.
Thanks to Nat Roth (natusroth@gmail) for unearthing this bug.
**************
Also:
- Updates to benchmarks.py (allow more specific benchmarks, added support for --benchmarks=all).
- Cleaned up RNN benchmarks code a bit.
New and updated benchmarks:

Calculation: Static Unroll with Halved Sequence Length vs. Half Static Unroll
batch  full_t  units  gpu    dt(half_seq_len)  dt(unroll_half)  dt(half_seq_len)/dt(unroll_half)
128    50      256    False  0.164351          0.155019         1.060204
128    50      256    True   0.033295          0.028203         1.180550

Calculation: Static Unroll with Dynamic Flow LSTM vs. Dynamic Unroll LSTM
batch  max_t  units  gpu    dt(static)  dt(dynamic)  dt(dynamic)/dt(static)
256    50     512    False  1.759111    1.692570     0.962173
256    50     512    True   0.178953    0.190454     1.064269
256    50     256    False  0.533132    0.567228     1.063955
256    50     256    True   0.078298    0.085024     1.085905
256    50     128    False  0.220362    0.215350     0.977255
256    50     128    True   0.053379    0.059129     1.107723

Change: 119495675
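The fixed behavior can be sketched in plain Python: no step function runs at all past max(sequence_length), while within it each example that has already ended gets a zero output with its state copied through. All names below are illustrative, not the TF rnn() API; step_fn(x, state) returns (output, new_state):

```python
def rnn_with_seq_len(step_fn, inputs, sequence_length, init_state, zero_output):
    """Toy unrolled RNN over inputs[t][b] honoring per-example lengths."""
    batch = len(sequence_length)
    max_len = max(sequence_length)
    states = [init_state] * batch
    outputs = []
    for t, x_t in enumerate(inputs):
        if t >= max_len:
            # Past every sequence: emit zeros without calling step_fn at all
            # (the bug was doing per-example selects out here too).
            outputs.append([zero_output] * batch)
            continue
        step_out = []
        for b in range(batch):
            if t < sequence_length[b]:
                o, states[b] = step_fn(x_t[b], states[b])
            else:
                # This example already ended: zero output, keep state.
                o = zero_output
            step_out.append(o)
        outputs.append(step_out)
    return outputs, states
```

The scalar `t >= max_len` check plays the role of the single scalar cond(); the per-example branch inside plays the role of the vector select(), which is only needed up to max_len.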
-
Committed by Yuan Yu
Deprecated control_flow_ops.While. Use tf.while_loop. Change: 119488170
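The semantics of the replacement can be sketched as a toy eager-mode analogue in plain Python (this is not the graph-building tf.while_loop implementation, just its contract: thread loop variables through body while cond holds):

```python
def while_loop(cond, body, loop_vars):
    """Repeatedly apply body to loop_vars while cond(*loop_vars) is true."""
    while cond(*loop_vars):
        loop_vars = body(*loop_vars)
    return loop_vars

# Example: sum the integers 1..10.
total, i = while_loop(lambda total, i: i <= 10,
                      lambda total, i: (total + i, i + 1),
                      (0, 1))
```

In the real API, cond and body build graph subcomputations and loop_vars are tensors, but the returned values correspond to the final loop variables just as here.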
-
Committed by Yuan Yu
This is another step to make TensorFlow more interactive and flexible for users. It allows a tensor produced by a run call to stay "in place" so that a future run call can use it in place. To achieve this, a run call can now return a handle of a tensor to the client, which can then be fed to a subsequent run call. This feature is complementary to partial run, though there are some overlaps.
Here are a few properties of the current implementation:
1. Tensors are stored in the state of a session. The tensors are garbage collected if the client doesn't have a reference to the tensor or the session is closed.
2. There is no change to the current session API. We introduced two ops to manage the conversions between tensors and their handles. (There is a third op to garbage collect a tensor.) See the example below.
3. It fits quite well into the current feed-fetch design/implementation. It tries to reuse the graph (and caches) as much as possible so as to make things efficient.
Below is a simple example. More examples can be found in session_ops_test.py.

# Return a handle.
a = tf.constant(10)
b = tf.constant(5)
c = tf.mul(a, b)
h = tf.get_session_handle(c).eval()

# Feed a tensor handle.
f, x = tf.get_session_tensor(dtypes.int32)
y = tf.mul(x, 10)
result = sess.run(y, feed_dict={f: h.handle})  # result == 500

Change: 119481352
-
- 10 Apr 2016, 1 commit
-
Committed by Benoit Steiner
Change: 119458778
-
- 09 Apr 2016, 17 commits
-
Committed by A. Unique TensorFlower
Change: 119448828
-
Committed by A. Unique TensorFlower
Change: 119434669
-
Committed by Vijay Vasudevan
Change: 119431584
-
Committed by A. Unique TensorFlower
io_wrapper provides some functions that check whether the path is a GCS path and call the relevant functions from either gfile or gcs. This is *not* intended to be a general-purpose interface; it only implements the things that are necessary for loading events from GCS/GFile storage. We're doing this because having the entire event loader stack care about the difference between GCS and GFile is bad from an encapsulation perspective; this way, we can present one consistent interface. Change: 119427191
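The dispatch idea can be sketched in a few lines of plain Python. The function names and the `gs://` check below are illustrative assumptions, not the actual io_wrapper API:

```python
def is_gcs_path(path):
    """Illustrative check: treat gs:// URLs as GCS, everything else as gfile."""
    return path.startswith("gs://")

def list_events(path, gcs_list, gfile_list):
    """Route a listing call to the GCS backend or the local gfile backend.

    Callers see one consistent interface regardless of where the events live;
    gcs_list and gfile_list stand in for the two backend implementations.
    """
    backend = gcs_list if is_gcs_path(path) else gfile_list
    return backend(path)
```

Keeping the branch inside a small wrapper like this is what lets the rest of the event loader stack stay agnostic about the storage backend.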
-
Committed by A. Unique TensorFlower
Change: 119427077
-
Committed by A. Unique TensorFlower
due to internal issues. Change: 119424490
-
Committed by Derek Murray
Previously, if the port was undefined, an out-of-bounds access would be made. This change adds the appropriate checks. Change: 119424297
-
Committed by A. Unique TensorFlower
Consolidate linalg shape inference functions. Change: 119423897
-
Committed by David G. Andersen
Change: 119423048
-
Committed by Manjunath Kudlur
Change: 119420831
-
Committed by Eugene Brevdo
Change: 119419160
-
Committed by David G. Andersen
Change: 119416366
-
Committed by A. Unique TensorFlower
Change: 119409291
-
Committed by A. Unique TensorFlower
in addition to float. Explicitly exempt GPU effects that rely on atomics, for which we have no good solution for half yet. Add some fixes in various places (some in Eigen, some in kernels) to make it all compile. Note that there are still ops that don't _declare_ half support (those that use "allnumbertypes" or similar do, those that use "float, double" don't); these will be fixed in a forthcoming commit. Change: 119409234
-
Committed by Geoffrey Irving
Change: 119407560
-
Committed by Eugene Brevdo
This implementation is a bit more efficient than the previous one because the first write just performs a shallow copy; only on an aggregation is any new memory allocated. For read-many semantics, the operations read, pack, and concat must be called with the parameter clear_after_read=False. By default, the flag is set to True; this means a read will remove the reference to the underlying Tensor in the TensorArray to reclaim memory in the runtime. Change: 119404140
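The clear_after_read semantics can be sketched with a toy plain-Python class (an illustrative stand-in, not TensorFlow's TensorArray implementation):

```python
class TensorArray:
    """Toy array of values with optional read-once semantics."""

    def __init__(self, clear_after_read=True):
        self._items = {}
        self.clear_after_read = clear_after_read

    def write(self, index, value):
        # Shallow: just store the reference, no copy of the value itself.
        self._items[index] = value

    def read(self, index):
        value = self._items[index]
        if self.clear_after_read:
            # Drop the stored reference so the runtime could reclaim the
            # memory; a second read of this index then yields nothing.
            self._items[index] = None
        return value
```

With the default of clear_after_read=True, each index supports a single read; passing clear_after_read=False keeps the reference alive for read-many use, at the cost of holding the memory.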
-
Committed by A. Unique TensorFlower
Makes "-mfpu=neon" depend on CPU type. Change: 119399482
-