Commits · 815c7a760ec383d16fbba5fd08525c519a051a79 · xxadev / tensorflow

18 Mar, 2017 · 27 commits

Use dynamic shape op to handle inputs of partially known shape. · 815c7a76
Committed by Yao Zhang on Mar 17, 2017
```
Change: 150489919
```

Convert from raw pointers to std::unique_ptr · 39c03fab

Committed by Brennan Saeta on Mar 17, 2017

In order to clarify ownership, this change moves the remote devices
from unmanaged raw pointers to std::unique_ptr. In the process, I have
removed some confusing comments regarding ownership that are now
unnecessary, as the types are both correct and enforced by the compiler.
Change: 150489013

Support deepcopy in _SparseColumn. · 29293fb6
Committed by A. Unique TensorFlower on Mar 17, 2017
```
Change: 150488705
```

Integrated the grappler optimizers in TensorFlow. · 2cc1e156
Committed by Benoit Steiner on Mar 17, 2017
```
Change: 150488108
```

Add tf.op_scope -> tf.name_scope (including argument reorder) to the TF upgrade script · 0db3d5ab
Committed by Neal Wu on Mar 17, 2017
```
Change: 150487597
```

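The argument reorder mentioned in this commit can be pictured with a small sketch. It assumes the pre-1.0 signature `tf.op_scope(values, name, default_name=None)` and the 1.0 signature `tf.name_scope(name, default_name=None, values=None)`. The helper names `reorder_op_scope_args` and `rewrite_call` are hypothetical and are not taken from the upgrade script, which may work quite differently.

```python
def reorder_op_scope_args(args):
    """Map positional op_scope arguments onto name_scope's order.

    Assumed signatures (not stated in the commit message):
      tf.op_scope(values, name, default_name=None)
      tf.name_scope(name, default_name=None, values=None)
    `args` holds the source text of each positional argument.
    """
    if not 2 <= len(args) <= 3:
        raise ValueError("op_scope takes 2 or 3 positional arguments")
    values, name = args[0], args[1]
    default_name = args[2] if len(args) == 3 else "None"
    # name_scope expects the values list last.
    return [name, default_name, values]


def rewrite_call(args):
    """Render the rewritten call as source text (hypothetical helper)."""
    return "tf.name_scope(%s)" % ", ".join(reorder_op_scope_args(args))


three_args = rewrite_call(["[a, b]", "name", '"add"'])
two_args = rewrite_call(["[a, b]", "name"])
```

With three arguments, `values` moves from first to last position; with two, an explicit `None` fills the `default_name` slot so that `values` still lands in the third position.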
Explicitly retry failures of individual deletes in DeleteRecursively. · 6a9236e0
Committed by Alexey Surkov on Mar 17, 2017
```
Otherwise these failures aren't currently covered by any retry logic.
Change: 150486764
```

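The idea of retrying each individual delete inside a recursive delete can be sketched as follows. This is an illustration on the local file system, not the TensorFlow implementation: `retry`, `delete_recursively`, and their parameters are hypothetical names, and the sketch assumes a tree of plain files and directories (no symlinks).

```python
import os
import tempfile
import time


def retry(fn, attempts=3, delay_seconds=0.0):
    """Call fn(), retrying on OSError up to `attempts` times in total."""
    for attempt in range(attempts):
        try:
            return fn()
        except OSError:
            if attempt == attempts - 1:
                raise
            time.sleep(delay_seconds)


def delete_recursively(root, remove_file=os.remove, remove_dir=os.rmdir):
    """Delete `root` bottom-up, wrapping every single delete in retry()."""
    for dirpath, _dirnames, filenames in os.walk(root, topdown=False):
        for filename in filenames:
            path = os.path.join(dirpath, filename)
            retry(lambda p=path: remove_file(p))
        # Subdirectories were visited (and removed) earlier in the walk.
        retry(lambda d=dirpath: remove_dir(d))


# Usage: the first file delete fails once, then succeeds on retry.
root = tempfile.mkdtemp()
os.makedirs(os.path.join(root, "a", "b"))
with open(os.path.join(root, "a", "b", "f.txt"), "w") as handle:
    handle.write("x")

calls = {"count": 0}


def flaky_remove(path):
    calls["count"] += 1
    if calls["count"] == 1:
        raise OSError("simulated transient failure")
    os.remove(path)


delete_recursively(root, remove_file=flaky_remove)
```

Retrying per delete, rather than retrying the whole recursive operation, means one transient failure does not force the walk to start over.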
Test graph initialization logic in Estimator. · 2f46b74c
Committed by Mustafa Ispir on Mar 17, 2017
```
Change: 150479545
```

Update AdamOptimizer's documentation to reflect its behavior · 86fae6bd

Committed by A. Unique TensorFlower on Mar 17, 2017

The docstring incorrectly claimed that momentum was ignored when a variable
slice wasn't used in the sparse version of the algorithm.
Change: 150477769

Add None check for seq_len_mask before reshape. · fb1c4cd8
Committed by A. Unique TensorFlower on Mar 17, 2017
```
Change: 150477638
```

Add ExportRunMetadata in queue runner and ExportCostGraph in coordinator. · 547a5402
Committed by Yuefeng Zhou on Mar 17, 2017
```
Make the queue runner own the metadata and mutex.
Change: 150475730
```

[Tensorflow] Add check fail when user passes a tensor with nullptr to lookup. · ca170f34
Committed by A. Unique TensorFlower on Mar 17, 2017
```
Change: 150474503
```

Android: Added download models into build.gradle for android example · 360f449d
Committed by A. Unique TensorFlower on Mar 17, 2017
```
Change: 150471440
```

[XLA] Add mapping from HloInstruction to CallGraphNode. · d687f9bb

Committed by Mark Heffernan on Mar 17, 2017

Make mapping from calling instruction to CallSite one-to-one by extending CallSite to handle more than one called computation. This enables instructions like kWhile which call two computations to be represented as a single CallSite. Also add a mapping from instruction to CallSite in CallGraphNode to enable fast call site lookup.

Also, include a few other opportunistic improvements:
* Change CallGraph::Build factory to return a std::unique_ptr. This enables,
  for example, more convenient use of CallGraph as a data member to a class.
* Change a few uses of unordered_set/map to FlatSet/Map.
Change: 150469958
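The data-structure change described in this commit can be pictured with a minimal sketch. It is written in Python for brevity and is not the XLA code: the class and method names mirror the terms in the message, but their fields and signatures here are assumptions.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class CallSite:
    """One calling instruction and every computation it calls."""
    instruction: str
    called_computations: List[str]


@dataclass
class CallGraphNode:
    """Call sites of one computation, indexed by calling instruction."""
    computation: str
    callsites: List[CallSite] = field(default_factory=list)
    _by_instruction: Dict[str, CallSite] = field(default_factory=dict)

    def add_callsite(self, instruction, called_computations):
        # One-to-one: an instruction may own at most one call site.
        if instruction in self._by_instruction:
            raise ValueError("instruction already has a call site")
        site = CallSite(instruction, list(called_computations))
        self.callsites.append(site)
        self._by_instruction[instruction] = site
        return site

    def get_callsite(self, instruction):
        """Dictionary lookup; returns None for non-calling instructions."""
        return self._by_instruction.get(instruction)


node = CallGraphNode("entry")
# A while-style instruction calls two computations but gets one call site.
node.add_callsite("while.1", ["condition", "body"])
site = node.get_callsite("while.1")
```

Letting a call site hold a list of called computations is what keeps the instruction-to-call-site mapping one-to-one for instructions such as kWhile.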

Fix link in `constant_op.py`. · c76bc807
Committed by Mark Daoust on Mar 17, 2017
```
Change: 150466902
```

Add an option to omit contrib tests in ci build. · 64522550
Committed by Gunhan Gulsoy on Mar 17, 2017
```
Change: 150463151
```

Adding the pip smoke test as the 8th test in sanity. · 9ffd0374
Committed by A. Unique TensorFlower on Mar 17, 2017
```
Change: 150462131
```

Improve fused instruction dump · 026936f5
Committed by A. Unique TensorFlower on Mar 17, 2017
```
Change: 150460833
```

Java: Constructor for creating Sessions with a ConfigProto · 2b4eca40

Committed by Asim Shankar on Mar 17, 2017

And also convenience functions for feeding and fetching using handles to
outputs/operations instead of their names.
Change: 150460697

Remove the unnecessary RTLD_GLOBAL dlopen flag in the remaining tests. · 490a2235
Committed by Jonathan Hseu on Mar 17, 2017
```
Change: 150460215
```

Added data type info to conv autotune parameters. · f4b237f8
Committed by Yangzihao Wang on Mar 17, 2017
```
Change: 150459431
```

Improves performance of tf.matmul(a, b, ...) for dense tensors on NVIDIA GPUs... · 49f14738

Committed by A. Unique TensorFlower on Mar 17, 2017

Improves performance of tf.matmul(a, b, ...) for dense tensors on NVIDIA GPUs in the following cases:

a) If the inner-most dimension of b is 1, i.e. the operation is (possibly a batch of) matrix*vector multiplication(s). This is accomplished by calling Cublas GEMV rather than GEMM. This speeds up large matrix-vector products by about 4x.

b) If one or more dimensions are unknown at graph construction time but the operation is in fact either a single matrix*matrix or matrix*vector multiplication.

The following benchmark numbers, illustrating the improvements for matrix * vector products,
were measured on an NVIDIA Titan X (Maxwell) card.

Benchmark                                    Base (ns)  New (ns) Improvement
----------------------------------------------------------------------------
BM_Matmul_50_50_1_false_false_DT_FLOAT_gpu       18102     17056     +5.8%
BM_Matmul_50_50_1_true_false_DT_FLOAT_gpu        18108     16374     +9.6%
BM_Matmul_50_50_1_false_true_DT_FLOAT_gpu        18153     17173     +5.4%
BM_Matmul_50_50_1_true_true_DT_FLOAT_gpu         18150     15950    +12.1%
BM_Matmul_500_500_1_false_false_DT_FLOAT_gpu     64605     16874    +73.9%
BM_Matmul_500_500_1_true_false_DT_FLOAT_gpu      62810     17298    +72.5%
BM_Matmul_500_500_1_false_true_DT_FLOAT_gpu      60447     17014    +71.9%
BM_Matmul_500_500_1_true_true_DT_FLOAT_gpu       58443     16934    +71.0%
BM_Matmul_2000_2000_1_false_false_DT_FLOAT_gpu  343298     81898    +76.1%
BM_Matmul_2000_2000_1_true_false_DT_FLOAT_gpu   294738     63723    +78.4%
BM_Matmul_2000_2000_1_false_true_DT_FLOAT_gpu   300671     83650    +72.2%
BM_Matmul_2000_2000_1_true_true_DT_FLOAT_gpu    284540     63742    +77.6%
Change: 150456725
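The Improvement column appears to be the relative reduction in run time, (Base - New) / Base, which matches the rows above. A short sketch of that arithmetic, with hypothetical function names, also shows where the "about 4x" figure for large products comes from:

```python
def improvement_percent(base_ns, new_ns):
    """Relative reduction in run time, as a percentage of the base time."""
    return 100.0 * (base_ns - new_ns) / base_ns


def speedup(base_ns, new_ns):
    """How many times faster the new timing is than the base timing."""
    return float(base_ns) / new_ns


# Rows taken from the benchmark table in the commit message.
small = improvement_percent(18102, 17056)    # BM_Matmul_50_50_1_false_false
large = improvement_percent(343298, 81898)   # BM_Matmul_2000_2000_1_false_false
large_speedup = speedup(343298, 81898)
```

Rounded to one decimal place, `small` is 5.8, `large` is 76.1, and `large_speedup` is 4.2.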

Removes an unnecessary check that blocks using multihead with custom heads. · bb7cadbc
Committed by Zakaria Haque on Mar 17, 2017
```
Change: 150452316
```

Set producer version in the graph used by shape refiner to run constant folding. · 5a95c76c
Committed by A. Unique TensorFlower on Mar 17, 2017
```
Change: 150450219
```

Fix mis-spelling. · 8e138e72
Committed by A. Unique TensorFlower on Mar 17, 2017
```
Change: 150450082
```

[XLA] Add a test for the remainder of two scalar U32s. · 0151bae9
Committed by A. Unique TensorFlower on Mar 17, 2017
```
Change: 150449788
```

Copied global step tests from contrib to core. · b26720ca
Committed by Mustafa Ispir on Mar 17, 2017
```
Change: 150447439
```

Tested features, labels, and mode in Estimator.export · 55565d3d
Committed by Mustafa Ispir on Mar 17, 2017
```
Change: 150443246
```

17 Mar, 2017 · 13 commits
- Use e.errno instead of trying to except FileExistsError. · cf1a2324
  Committed by A. Unique TensorFlower on Mar 17, 2017
```
FileExistsError doesn't exist in Python 2.7.
Change: 150436736
```
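The pattern this commit describes can be sketched as below. `ensure_dir` is a hypothetical helper, not the function the commit changed; it only illustrates checking `e.errno` against `errno.EEXIST`, which works on both Python 2.7 and Python 3.

```python
import errno
import os
import tempfile


def ensure_dir(path):
    """Create `path` if it does not exist yet."""
    try:
        os.makedirs(path)
    except OSError as e:
        # FileExistsError is Python 3 only; e.errno is available on both.
        # Re-raise anything other than "a directory is already there".
        if e.errno != errno.EEXIST or not os.path.isdir(path):
            raise


target = os.path.join(tempfile.mkdtemp(), "logs")
ensure_dir(target)
ensure_dir(target)  # The second call hits EEXIST and is ignored.
```

On Python 3, `FileExistsError` is a subclass of `OSError`, so the same `except` clause catches it there as well.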
- - Update XLA for removal of TargetOptions::LessPreciseFPMADOption (LLVM r298023) · f63e985e
  Committed by A. Unique TensorFlower on Mar 17, 2017
```
- Removes getArgumentList() in favor of args() etc in XLA (LLVM r298010)
Change: 150427660
```
- Updates to RNNCells to allow easy storage of attention TensorArray in the state. · 03abac7f
  Committed by Eugene Brevdo on Mar 16, 2017
```
The main change is that RNNCells that wrap other RNNCells now override self.zero_state to call the wrapped cell's zero_state and then (maybe) perform some post-processing... instead of relying on the state_size property to provide all information about the state.

Also made zero_state calls create ops inside their own name scope.
Change: 150413265
```
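The zero_state delegation described in this commit can be pictured with a plain-Python sketch. Nothing here is TensorFlow code: `SimpleCell` and `AttentionWrapperSketch` are hypothetical stand-ins, the real `zero_state` also takes a dtype, and a list stands in for the attention TensorArray.

```python
class SimpleCell(object):
    """Stand-in for an RNN cell; not a TensorFlow class."""

    def __init__(self, num_units):
        self.state_size = num_units

    def zero_state(self, batch_size):
        return [[0.0] * self.state_size for _ in range(batch_size)]


class AttentionWrapperSketch(object):
    """Hypothetical wrapper that keeps extra, variable-length state."""

    def __init__(self, cell):
        self._cell = cell

    @property
    def state_size(self):
        # Describes only the fixed-size part of the state.
        return self._cell.state_size

    def zero_state(self, batch_size):
        # Delegate to the wrapped cell, then post-process the result.
        cell_state = self._cell.zero_state(batch_size)
        attention_history = []  # stand-in for an attention TensorArray
        return {"cell_state": cell_state,
                "attention_history": attention_history}


state = AttentionWrapperSketch(SimpleCell(3)).zero_state(batch_size=2)
```

Because the wrapper builds its initial state itself, the extra entry does not have to be expressible through `state_size`.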
- Initial cut of documentation for tf.contrib.seq2seq · 9cc50983
  Committed by Eugene Brevdo on Mar 16, 2017
```
Change: 150400474
```
- Added an option to disable the collection of detailed statistics in grappler · 5135546b
  Committed by Benoit Steiner on Mar 16, 2017
```
Change: 150397471
```
- tfdbg: Created a GRPC-based hook that streams debugger-related events. · 9c7e4964
  Committed by A. Unique TensorFlower on Mar 16, 2017
```
Change: 150396376
```
- A simple script to test TF contrib. · 89792d70
  Committed by Gunhan Gulsoy on Mar 16, 2017
```
Change: 150389857
```
- Sleep forever to trigger the timeout consistently · a93d70c5
  Committed by Benoit Steiner on Mar 16, 2017
```
Change: 150388263
```
- Fix separable convolution bias check · a0ca4bcb
  Committed by A. Unique TensorFlower on Mar 16, 2017
```
Change: 150385615
```
- [Tensorflow] Expose API to lookup TensorSlice. · c092f31c
  Committed by A. Unique TensorFlower on Mar 16, 2017
```
Change: 150384503
```
- Replace OpRegistryInterface* with FunctionLibraryDefinition in Graph. · 433c8c89
  Committed by Skye Wanderman-Milne on Mar 16, 2017
```
This is a first step towards supporting functions in C++ graph construction, e.g. being able to import GraphDefs with functions.
Change: 150382046
```
- Update opensource vulcanized HTML file for Tensorboard. · 57737d10
  Committed by A. Unique TensorFlower on Mar 16, 2017
```
This update contains refinements to the charts in the scalars dashboard.
Change: 150380169
```
- Let the user view health pills at any step. · 1316eeb6
  Committed by A. Unique TensorFlower on Mar 16, 2017
```
This involves adding a toggle to the health pills info box in the graph visualizer. When that toggle is enabled, Tensorboard makes a request for health pills at step X when the user moves the slider.

This feature can be very slow because it requires reading from disk. Viewing health pills at, say, step 100,000 could take minutes to an hour. We must design ways to make this faster (for instance, have the debugger write events at a much greater frequency only after it encounters a bad value).
Change: 150379929
```

xxadev / tensorflow: up to date with the fork's source project
