- 06 6月, 2019 5 次提交
-
-
由 Gaurav Jain 提交于
PiperOrigin-RevId: 251659257
-
由 Andy Ly 提交于
PiperOrigin-RevId: 251659253
-
由 Karmel Allison 提交于
PiperOrigin-RevId: 251656649
-
由 Bruce Fontaine 提交于
Add alternative in high level TPU embedding API to not use feature columns, but to use the mid level API FeatureConfig and TableConfig instead. PiperOrigin-RevId: 251656574
-
由 Justin Lebar 提交于
The AllReduce HLO has the ability to say, do an all-reduce on (say) GPUs [0,1] and separately on GPUs [2,3]. Previously this was not implemented (and in fact we incorrectly just ignored this and would do an all-reduce across all four GPUs). PiperOrigin-RevId: 251654069
-
- 05 6月, 2019 35 次提交
-
-
由 A. Unique TensorFlower 提交于
PiperOrigin-RevId: 251651006
-
由 Bixia Zheng 提交于
Extend the Philox bit generator and the Three Fry bit generator to support F64. Change UniformF32Distribution to UniformFloatingpointDistribution to add support for F64. Similarly, change NormalF32Distribution to NormalFloatingpointDistribution. Modify the tf2xla bridge to support F64 stateless random ops and stateful random ops. Modify the stateless random op tests and the stateful random op tests to test F64. PiperOrigin-RevId: 251649864
-
由 Justin Lebar 提交于
TupleThunk registers a callback to free the host buffer (i.e. the CPU memory that holds the pointers) after the H2D memcpy completes. Unfortunately these callbacks are slow. We add a new routine to StreamExecutor which runs a callback on the next BlockHostUntilDone, and we use this to free the buffer. This means that the buffer will stay live for a bit longer, but in practice these buffers are tiny, less than 32 bytes. We also use this in ConvolutionThunk and CudnnBatchnormThunk. Previously these were unsoundly relying on an optimization in the GPU driver (?) in which it seemed to eagerly copy the host memory. PiperOrigin-RevId: 251646672
-
由 A. Unique TensorFlower 提交于
PiperOrigin-RevId: 251646660
-
由 Justin Lebar 提交于
This makes it far easier to add a new parameter, which I will do in a later change. PiperOrigin-RevId: 251643760
-
由 Edward Loper 提交于
PiperOrigin-RevId: 251640087
-
由 A. Unique TensorFlower 提交于
PiperOrigin-RevId: 251637948
-
由 A. Unique TensorFlower 提交于
PiperOrigin-RevId: 251630341
-
由 A. Unique TensorFlower 提交于
PiperOrigin-RevId: 251613307
-
由 A. Unique TensorFlower 提交于
Previously more precision was required than is supported by 16 bit floats. PiperOrigin-RevId: 251612304
-
由 A. Unique TensorFlower 提交于
PiperOrigin-RevId: 251603109
-
由 A. Unique TensorFlower 提交于
PiperOrigin-RevId: 251596458
-
由 Peter Buchlovsky 提交于
PiperOrigin-RevId: 251596345
-
由 A. Unique TensorFlower 提交于
PiperOrigin-RevId: 251594608
-
由 A. Unique TensorFlower 提交于
Tests were passing when we don't actually support all the functionality. PiperOrigin-RevId: 251592402
-
由 Suharsh Sivakumar 提交于
Also fix incorrectly set version while i am here. PiperOrigin-RevId: 251583288
-
由 Anna R 提交于
PiperOrigin-RevId: 251581726
-
由 A. Unique TensorFlower 提交于
Add API in RecursiveCompilabilityChecker to access un-compilable node if node is inside a potentially recursive function body node. PiperOrigin-RevId: 251581339
-
由 Ayush Dubey 提交于
PiperOrigin-RevId: 251580137
-
由 Vojtech Bardiovsky 提交于
This was caused by wrong alignment of arguments and default values. Also add one more assert to improve debuggability. PiperOrigin-RevId: 251580104
-
由 Pavithra Vijay 提交于
PiperOrigin-RevId: 251579891
-
由 Smit Hinsu 提交于
PiperOrigin-RevId: 251578768
-
由 A. Unique TensorFlower 提交于
Save more temporary memory in remapper and memory optimizers. Get rid of some code duplication in constant_folding.cc. PiperOrigin-RevId: 251578727
-
由 A. Unique TensorFlower 提交于
PiperOrigin-RevId: 251577571
-
由 Anna R 提交于
PiperOrigin-RevId: 251572054
-
由 Thomas O'Malley 提交于
PiperOrigin-RevId: 251571946
-
由 Suharsh Sivakumar 提交于
PiperOrigin-RevId: 251571788
-
由 Thomas O'Malley 提交于
`tf.config.experimental_run_functions_eagerly` PiperOrigin-RevId: 251570860
-
由 Sourabh Bajaj 提交于
Support model.fit and evaluate in 2.0 with TPUStrategy using the experimental_run + train_on_batch API. PiperOrigin-RevId: 251570029
-
由 Zhenyu Tan 提交于
PiperOrigin-RevId: 251569439
-
由 Yu-Cheng Ling 提交于
PiperOrigin-RevId: 251568444
-
由 Rachel Lim 提交于
PiperOrigin-RevId: 251566961
-
由 A. Unique TensorFlower 提交于
Exposes the already-implemented variable `scatter_div`, `scatter_mul`, `scatter_min`, and `scatter_max` operations in the variables API. Also fixes some documentation, and adds tests that directly check the var.scatter_* methods. PiperOrigin-RevId: 251565839
-
由 A. Unique TensorFlower 提交于
PiperOrigin-RevId: 251565800
-
由 Igor Ganichev 提交于
PiperOrigin-RevId: 251565676
-