1. 10 Aug 2017 — 27 commits
  2. 09 Aug 2017 — 13 commits
    • Relax tolerance to fix OSS test failure on macOS. · e3034efc
      A. Unique TensorFlower committed
      PiperOrigin-RevId: 164728247
    • Consider nested computations when checking whether an instruction is removable from a computation · b4001ea6
      HyoukJoong Lee committed
      This prevents DCE from removing a while instruction that contains a send/recv instruction.
      
      PiperOrigin-RevId: 164722478
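XLA's actual HLO data structures are not reproduced here, but the removability check the commit describes can be sketched in plain Python (the `Instruction` class, `opcode` strings, and `SIDE_EFFECTING` set below are illustrative assumptions, not the real XLA API): an instruction is safe to dead-code-eliminate only if neither it nor any instruction inside its nested computations has side effects such as send/recv.

```python
# Hypothetical, simplified stand-ins for XLA HLO instructions: an
# instruction may carry nested computations (e.g. a while loop's
# condition and body), each modeled here as a list of instructions.
class Instruction:
    def __init__(self, opcode, nested_computations=()):
        self.opcode = opcode
        self.nested_computations = nested_computations

SIDE_EFFECTING = {"send", "recv"}

def is_removable(instr):
    """True only if neither this instruction nor anything in its
    nested computations has side effects."""
    if instr.opcode in SIDE_EFFECTING:
        return False
    return all(is_removable(nested)
               for comp in instr.nested_computations
               for nested in comp)

# A while loop whose body contains a recv must not be eliminated,
# even though the while instruction itself looks side-effect-free.
body = [Instruction("add"), Instruction("recv")]
loop = Instruction("while", nested_computations=[body])
print(is_removable(Instruction("add")))  # True
print(is_removable(loop))                # False
```

Without the recursion into nested computations, the `while` would look removable and DCE would silently drop the send/recv pair inside it.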
    • Remove obsolete advice on BUILD flags · acceb27d
      A. Unique TensorFlower committed
      PiperOrigin-RevId: 164718342
    • slot_creator: fix bugs when handling dynamic-shaped variables/tensors · e6e7ee49
      James Qin committed
      PiperOrigin-RevId: 164686075
    • Remove newlines from the tf.nn.dynamic_rnn args list. · 26e628b8
      RJ Ryan committed
      Prevents bad formatting: https://www.tensorflow.org/versions/r1.2/api_docs/python/tf/nn/dynamic_rnn
      
      PiperOrigin-RevId: 164675585
    • Use a hand-crafted filter instead of a regexp, since regexps don't always work properly on some platforms · bb23f540
      Benoit Steiner committed
      
      PiperOrigin-RevId: 164665656
    • Infer shapes for RestoreV2 and RestoreSlice ops when the shape_and_slice input is present. · 8e6c372f
      Yuefeng Zhou committed
      PiperOrigin-RevId: 164660701
    • Use unique names in the batch_function decorator. · 8d23b781
      Alexandre Passos committed
      PiperOrigin-RevId: 164659904
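The commit itself is not shown, but the general pattern it names — a decorator that assigns each decorated function a distinct name so repeated decorations never collide — can be sketched with a module-level counter (the `batch_function` and `batch_name` names below are illustrative, not TensorFlow's implementation):

```python
import itertools

# Illustrative sketch of per-decoration unique naming: each decorated
# function gets a fresh numeric suffix, so two decorations can never
# produce the same name even for identically named functions.
_counter = itertools.count()

def batch_function(func):
    unique_name = "batched_%s_%d" % (func.__name__, next(_counter))
    def wrapper(*args, **kwargs):
        return func(*args, **kwargs)
    wrapper.batch_name = unique_name
    return wrapper

@batch_function
def f(x):
    return x + 1

@batch_function
def g(x):
    return x * 2

print(f.batch_name)  # e.g. "batched_f_0"
print(g.batch_name)  # e.g. "batched_g_1"
```

The counter, rather than the function name alone, is what guarantees uniqueness when the same function (or two functions with the same `__name__`) is decorated more than once.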
    • Make plugin_data an optional field of SummaryMetadata · 4c60c962
      A. Unique TensorFlower committed
      Every summary op writes data for a single plugin to process, so each SummaryMetadata proto should have a single optional PluginData field instead of a repeated one. This removes much of the complexity from the TensorBoard logic that loops over plugin data, and it simplifies the SQL schema, which can now enforce a one-to-one relationship between summary op and plugin.
      
      PiperOrigin-RevId: 164659570
    • Change the Cluster Resolver API: if no `credentials` are passed in... · de5034ae
      Frank Chen committed
      Change the Cluster Resolver API: if no `credentials` are passed in to the GCE and TPU Cluster Resolvers, the GoogleCredentials.get_application_default() credentials are used. Users who want to pass no credentials at all must pass `None` explicitly.
      
      PiperOrigin-RevId: 164659129
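The API distinction this commit describes — "argument omitted" falls back to application-default credentials, while an explicit `None` means "no credentials" — cannot be expressed with a plain `credentials=None` default. A common Python idiom is a private sentinel object; the sketch below illustrates the pattern only (the `ClusterResolver` class and `load_application_default` function are hypothetical stand-ins, not the real TensorFlow or GoogleCredentials API):

```python
# Sentinel-object idiom for distinguishing "omitted" from "explicit None".
_NOT_PASSED = object()

def load_application_default():
    # Hypothetical stand-in for GoogleCredentials.get_application_default().
    return "application-default-credentials"

class ClusterResolver:
    def __init__(self, credentials=_NOT_PASSED):
        if credentials is _NOT_PASSED:
            # Argument omitted: fall back to application defaults.
            credentials = load_application_default()
        # An explicit None is respected as "no credentials at all".
        self.credentials = credentials

print(ClusterResolver().credentials)                  # application defaults
print(ClusterResolver(credentials=None).credentials)  # None
```

Using `None` itself as the default would make the two cases indistinguishable, which is exactly why the commit requires users to pass `None` explicitly to opt out.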
    • [tf.contrib.data] Enable using step-local resources in Dataset.map()/filter(). · 865b92da
      Derek Murray committed
      This change ensures that the mapper/predicate function used in these transformations has its own ScopedStepContainer, thereby allowing the use of TensorArray resources (and operations that use them, such as control-flow ops) inside these functions.
      
      Fixes #11715.
      
      PiperOrigin-RevId: 164648309
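TensorFlow's ScopedStepContainer is C++ runtime machinery and is not reproduced here, but the idea — each invocation of the mapper/predicate gets its own container whose resources are created inside the step and destroyed when it ends — can be sketched as a pure-Python context manager (all names below, including `StepContainer` and `scoped_step_container`, are illustrative assumptions):

```python
import contextlib

# Illustrative per-step resource container: resources created while the
# container is active are owned by it and cleaned up when the step ends,
# so state never leaks between invocations.
class StepContainer:
    def __init__(self):
        self.resources = {}

    def create(self, name, value):
        self.resources[name] = value
        return value

    def cleanup(self):
        self.resources.clear()

@contextlib.contextmanager
def scoped_step_container():
    container = StepContainer()
    try:
        yield container
    finally:
        container.cleanup()  # per-invocation resources never outlive the step

def mapper(x):
    # Each element gets a fresh container, so a stateful helper
    # (a TensorArray-like accumulator here) is isolated per invocation.
    with scoped_step_container() as step:
        acc = step.create("accumulator", [])
        for i in range(x):
            acc.append(i * i)
        return sum(acc)

print([mapper(x) for x in [1, 2, 3]])  # [0, 1, 5]
```

The per-invocation scoping is what makes stateful resources safe inside `Dataset.map()`/`filter()`: without it, concurrent or repeated invocations would share (and corrupt) the same resource.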
    • Speed up tf.determinant by using LU factorization kernels from cuSolver for... · 389a7d43
      A. Unique TensorFlower committed
      Speed up tf.determinant by using LU factorization kernels from cuSolver for large matrices, instead of the batched LU factorization from cuBLAS, which is only suitable for small matrices.
      
      Speedup measured on Titan X (Maxwell):
      
      Shape            Before    After    Speedup
      ------------------------------------------------------
      (4, 4)          0.000159   0.000200 -26.35% (noise)
      (16, 16)        0.000198   0.000190   3.59%
      (64, 64)        0.000592   0.000538   9.10%
      (128, 128)      0.001348   0.001376  -2.14%
      (200, 200)      0.003201   0.002882   9.94%
      (256, 256)      0.005096   0.003373  33.81%
      (1024, 1024)    0.169690   0.012452  92.66%
      (2, 512, 512)   0.023370   0.012243  47.61%
      (2, 1024, 1024) 0.178757   0.025198  85.90%
      (4, 4, 4)       0.000121   0.000128  -5.79%
      (4, 16, 16)     0.000212   0.000190   9.95%
      (4, 64, 64)     0.000499   0.000514  -3.01%
      (4, 128, 128)   0.001276   0.001214   4.79%
      (4, 256, 256)   0.004364   0.004314   1.14%
      (4, 512, 512)   0.025031   0.024956   0.30%
      (4, 1024, 1024) 0.184210   0.052858  71.31%
      (8, 512, 512)   0.026542   0.026502   0.15%
      (8, 1024, 1024) 0.186145   0.185988   0.08%
      (65, 4, 4)      0.000152   0.000142   6.05%
      (65, 16, 16)    0.000197   0.000194   1.52%
      (65, 64, 64)    0.000559   0.000549   1.79%
      (65, 128, 128)  0.001326   0.001308   1.29%
      (65, 256, 256)  0.005495   0.005525  -0.55%
      (65, 512, 512)  0.034147   0.034662  -1.51%
      (513, 4, 4)     0.000144   0.000195 -35.42% (noise)
      (513, 16, 16)   0.000207   0.000200   3.38%
      (513, 64, 64)   0.001502   0.001490   0.79%
      (513, 256, 256) 0.033428   0.032933   1.48%
      (513, 512, 512) 0.234707   0.216858   7.60%
      
      PiperOrigin-RevId: 164633730
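The cuSolver kernels themselves are GPU code, but the underlying math the commit relies on is standard: after an LU factorization with partial pivoting, det(A) = (−1)^(number of row swaps) × the product of U's diagonal. A minimal pure-Python sketch (no GPU, no batching — just the identity being exploited):

```python
# Determinant via LU factorization with partial pivoting.
# Each row swap flips the determinant's sign; the eliminated matrix's
# diagonal is the diagonal of U, whose product gives |det|.
def det_via_lu(a):
    n = len(a)
    a = [row[:] for row in a]  # work on a copy
    swaps = 0
    for k in range(n):
        # Partial pivoting: move the largest remaining entry into the pivot.
        p = max(range(k, n), key=lambda i: abs(a[i][k]))
        if a[p][k] == 0.0:
            return 0.0  # singular matrix
        if p != k:
            a[k], a[p] = a[p], a[k]
            swaps += 1
        for i in range(k + 1, n):
            m = a[i][k] / a[k][k]
            for j in range(k, n):
                a[i][j] -= m * a[k][j]
    det = -1.0 if swaps % 2 else 1.0
    for k in range(n):
        det *= a[k][k]
    return det

print(det_via_lu([[1.0, 2.0], [3.0, 4.0]]))  # ≈ -2.0
```

The O(n³) elimination cost is the same in both the cuBLAS-batched and cuSolver paths; the speedups in the table come from how well each kernel uses the GPU at a given matrix size, not from a different algorithm.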
    • Speed up GPU version of tf.matrix_inverse by using LU factorization kernels... · e57e11b7
      A. Unique TensorFlower committed
      Speed up the GPU version of tf.matrix_inverse by using LU factorization kernels from cuSolver and a hand-written matrix-identity kernel, instead of the batched LU factorization from cuBLAS, which is only suitable for small matrices.
      
      Speedup measured on Titan X (Maxwell):
      
      Shape           adjoint    Before    After    Speedup
      ------------------------------------------------------
      (4, 4)          noadjoint  0.000204  0.000193   5.3%
      (16, 16)        noadjoint  0.000360  0.000186  48.3%
      (256, 256)      noadjoint  0.013830  0.003852  72.1%
      (1024, 1024)    noadjoint  0.647639  0.015075  97.6%
      (513, 4, 4)     noadjoint  0.000219  0.000192  12.3%
      (513, 16, 16)   noadjoint  0.000293  0.000195  33.4%
      (513, 256, 256) noadjoint  0.120573  0.120175   0.3%
      (4, 4)          adjoint    0.000201  0.000193   3.9%
      (16, 16)        adjoint    0.000282  0.000185  34.4%
      (256, 256)      adjoint    0.013028  0.003391  73.9%
      (1024, 1024)    adjoint    0.647752  0.014341  97.7%
      (513, 4, 4)     adjoint    0.000221  0.000197  10.8%
      (513, 16, 16)   adjoint    0.000384  0.000205  46.6%
      (513, 256, 256) adjoint    0.131402  0.130616   0.6%
      
      PiperOrigin-RevId: 164623298
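As with the determinant change, the math behind LU-based inversion can be sketched without any GPU code: factor A = LU, then for each column e_i of the identity solve L y = e_i by forward substitution and U x = y by back substitution; the solutions x are the columns of A⁻¹. The sketch below omits pivoting for brevity and therefore assumes nonzero leading pivots (a real implementation, like cuSolver's, pivots):

```python
# Matrix inverse via LU factorization: A = LU, then solve against each
# column of the identity. No pivoting (assumes nonzero leading pivots).
def lu_inverse(a):
    n = len(a)
    u = [row[:] for row in a]
    l = [[float(i == j) for j in range(n)] for i in range(n)]
    for k in range(n):  # Doolittle elimination: unit-diagonal L, upper U
        for i in range(k + 1, n):
            l[i][k] = u[i][k] / u[k][k]
            for j in range(k, n):
                u[i][j] -= l[i][k] * u[k][j]
    inv_cols = []
    for c in range(n):
        e = [float(i == c) for i in range(n)]
        # Forward substitution: L y = e (L has unit diagonal).
        y = [0.0] * n
        for i in range(n):
            y[i] = e[i] - sum(l[i][j] * y[j] for j in range(i))
        # Back substitution: U x = y.
        x = [0.0] * n
        for i in reversed(range(n)):
            x[i] = (y[i] - sum(u[i][j] * x[j] for j in range(i + 1, n))) / u[i][i]
        inv_cols.append(x)
    # inv_cols holds columns of the inverse; transpose into rows.
    return [[inv_cols[c][r] for c in range(n)] for r in range(n)]

print(lu_inverse([[4.0, 7.0], [2.0, 6.0]]))  # ≈ [[0.6, -0.7], [-0.2, 0.4]]
```

Solving n triangular systems against the identity is also where the hand-written matrix-identity kernel mentioned in the commit fits: it materializes the right-hand-side columns on the GPU before the cuSolver triangular solves.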