- 19 12月, 2019 1 次提交
-
-
由 Pooya Davoodi 提交于
-
- 18 12月, 2019 39 次提交
-
-
由 Pooya Davoodi 提交于
This becomes only effective if stride>1.
-
由 William Chargin 提交于
Now that TensorBoard 2.1.0 has been released, our nightlies have moved up to 2.2.x alphas. PiperOrigin-RevId: 286091571 Change-Id: I33a2b679bc91ec9e46423a5eea298a01a5df7a10
-
由 Hye Soo Yang 提交于
PY3 migration - Remove PY2 test since there is PY3 equivalent: //third_party/tensorflow/python/distribute/cluster_resolver:tpu_cluster_resolver_py_test PiperOrigin-RevId: 286091355 Change-Id: I70930ca8ab0d0379370e9045b3823a84deb48d0b
-
由 Jeremy Lau 提交于
PiperOrigin-RevId: 286090321 Change-Id: I5a560ace9189881e77d03d817aec4e21cd6ff45d
-
由 Mihai Maruseac 提交于
Reverts previous debug changes PiperOrigin-RevId: 286085811 Change-Id: Ie9407e5356c012dbb8c39657aa601c7ea7c8a08c
-
由 A. Unique TensorFlower 提交于
PiperOrigin-RevId: 286085066 Change-Id: I3814e2575937909437d783f0cdc0642013be9c01
-
由 River Riddle 提交于
GCC is unable to properly implicitly capture 'this' in generic lambdas. This bug is not fixed until 7.1.0: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=67274 PiperOrigin-RevId: 286083427 Change-Id: Id52437119a123a0b8f20d7af4b8f7067fdab0e86
-
由 A. Unique TensorFlower 提交于
PiperOrigin-RevId: 286082855 Change-Id: I1b48ab9ef7341de8fdb5cdf08a2ed2491ebfd2c3
-
由 A. Unique TensorFlower 提交于
Speedup with AVX+FMA (Skylake CPU, GTX 1080 GPU): CPU time: Benchmark Base (ns) New (ns) Improvement ------------------------------------------------------------------------------- BM_cpu_Round_DT_DOUBLE/4k 42894 39024 +9.0% BM_cpu_Round_DT_DOUBLE/32k 145162 61488 +57.6% BM_cpu_Round_DT_DOUBLE/256k 1385888 574402 +58.6% BM_cpu_Round_DT_DOUBLE/1M 6302318 4565979 +27.6% BM_gpu_Round_DT_DOUBLE/4k 45002 41022 +8.8% BM_gpu_Round_DT_DOUBLE/32k 40663 39132 +3.8% BM_gpu_Round_DT_DOUBLE/256k 50817 53454 -5.2% BM_gpu_Round_DT_DOUBLE/1M 106437 105017 +1.3% BM_cpu_Round_DT_FLOAT/4k 38844 33209 +14.5% BM_cpu_Round_DT_FLOAT/32k 61744 46989 +23.9% BM_cpu_Round_DT_FLOAT/256k 713001 248703 +65.1% BM_cpu_Round_DT_FLOAT/1M 3503077 1996549 +43.0% BM_gpu_Round_DT_FLOAT/4k 40797 38140 +6.5% BM_gpu_Round_DT_FLOAT/32k 45468 38946 +14.3% BM_gpu_Round_DT_FLOAT/256k 40970 38779 +5.3% BM_gpu_Round_DT_FLOAT/1M 67983 65995 +2.9% Wall time: Benchmark Base (ns) New (ns) Improvement ------------------------------------------------------------------------------- BM_cpu_Round_DT_DOUBLE/4k 14611 12279 +16.0% BM_cpu_Round_DT_DOUBLE/32k 41195 29524 +28.3% BM_cpu_Round_DT_DOUBLE/256k 72282 63039 +12.8% BM_cpu_Round_DT_DOUBLE/1M 154311 136727 +11.4% BM_gpu_Round_DT_DOUBLE/4k 16410 13868 +15.5% BM_gpu_Round_DT_DOUBLE/32k 13808 13105 +5.1% BM_gpu_Round_DT_DOUBLE/256k 20339 20115 +1.1% BM_gpu_Round_DT_DOUBLE/1M 75939 73861 +2.7% BM_cpu_Round_DT_FLOAT/4k 12103 9505 +21.5% BM_cpu_Round_DT_FLOAT/32k 28781 16516 +42.6% BM_cpu_Round_DT_FLOAT/256k 57667 44565 +22.7% BM_cpu_Round_DT_FLOAT/1M 108124 92402 +14.5% BM_gpu_Round_DT_FLOAT/4k 13893 12788 +8.0% BM_gpu_Round_DT_FLOAT/32k 17032 13086 +23.2% BM_gpu_Round_DT_FLOAT/256k 13996 12984 +7.2% BM_gpu_Round_DT_FLOAT/1M 37907 36085 +4.8% PiperOrigin-RevId: 286072830 Change-Id: Ifaa42e47b8893e34bba5618e19b8400e20f54e2c
-
由 A. Unique TensorFlower 提交于
PiperOrigin-RevId: 286067170 Change-Id: I0e0406a811a144f1ca3a77141478d8de86074d56
-
由 A. Unique TensorFlower 提交于
PiperOrigin-RevId: 286066745 Change-Id: I775209bd6c73aa87381a602f45ab33c7d1ec7ef6
-
由 River Riddle 提交于
PiperOrigin-RevId: 286066371 Change-Id: Ibb7f822b4e6847ceee6c1cc94d7764670fbb4161
-
由 A. Unique TensorFlower 提交于
https://gitlab.com/libeigen/eigen/commit/7252163335f56f23fcc7381c1efdea47161005fa PiperOrigin-RevId: 286065705 Change-Id: I4b59a2a6873e6188241ca607a4e7ca9981ba012c
-
由 Gaurav Jain 提交于
PiperOrigin-RevId: 286059397 Change-Id: Iaa771195df8c1f1cc4dc33a136c6eb75a9181f44
-
由 Yifei Feng 提交于
PiperOrigin-RevId: 286058486 Change-Id: Ice2c0d44120c6adf365389bec77f9c99c9281916
-
由 Dan Moldovan 提交于
Remove circular dependency between pyct and utils. This was undetected for a while, but an upcoming change detects it. PiperOrigin-RevId: 286057471 Change-Id: Ib8c8f5fe30af1c49d798515c0b3ddfb155a1e2f2
-
由 Zhenyu Tan 提交于
PiperOrigin-RevId: 286055835 Change-Id: If5253109acfbf43be21eb09b2f32599cabdbf2e9
-
由 Mihai Maruseac 提交于
Follow-up from previous change, will also be rolledback in ~1hr. PiperOrigin-RevId: 286055062 Change-Id: Ib0ea7293fc0b9ae99f8da9c98ae11ae8a9e35c35
-
由 Ken Franko 提交于
Adds coverage for multiple distribution strategies for partial datasets including a regression test for an control assert inside function. PiperOrigin-RevId: 286048436 Change-Id: I14e0c34a18730035bb5a506ef301e987823358c3
-
由 Anna R 提交于
imported at the top of the file and error is saved. Then, the error is printed inside create_local_cluster function. This creates extra references to upstream stack frames preventing them from getting garbage collected. Fixes #33376. PiperOrigin-RevId: 286048421 Change-Id: If48312d3f1236d6b53389642140b29dd05b65f2a
-
由 Hye Soo Yang 提交于
PY3 migration - Remove PY2 test since there is PY3 equivalent: //third_party/tensorflow/python/tpu/client:client_py2_test PiperOrigin-RevId: 286045890 Change-Id: Icae6371496d55e287163343bf61b27499eac04b4
-
由 Prakalp Srivastava 提交于
Custom support is required in the exporter because the shape expected by the XlaBuilder API for creating Infeed instruction is inferred from the first element of the result type tuple. PiperOrigin-RevId: 286045831 Change-Id: I945af265c5ad0d7bce163bc677061ed9a770240e
-
由 Feng Liu 提交于
The quant dim of fully connected is set to -1 to indicate that it doesn't require per-channel quantization. The quant spec generator binary should be able to handle it. PiperOrigin-RevId: 286042736 Change-Id: I0ef652f72c029ae1818f6728bcb5f72d163f77bf
-
由 A. Unique TensorFlower 提交于
Add pattern rewrite which splits a vector TransferWriteOp into slices according to the unrolling/slicing scheme of its InsertSlicesOp operand. PiperOrigin-RevId: 286042578 Change-Id: I67d39a3a696f7e4c69443f16c6ed7becbc902c20
-
由 Mahesh Ravishankar 提交于
The inline interface uses two methods to check legality of inling: 1) Can a region be inlined into another. 2) Can an operation be inlined into another. Setting the former to true, allows the inliner to use the second for legality checks. Add this method to the SPIR-V dialect inlining interface. PiperOrigin-RevId: 286041734 Change-Id: I17c0e50c2fc744e43663e2c2c5999e48597550ed
-
由 A. Unique TensorFlower 提交于
PiperOrigin-RevId: 286040953 Change-Id: Iecef32081cb5036f6af21d7f93b4372e377c3db2
-
由 Brian Atkinson 提交于
The flag was exported un-intentionally due to an internal refactoring. PiperOrigin-RevId: 286038710 Change-Id: I1c3e160e2274925a60a08ecd0aff8f8313160adb
-
由 Mihai Maruseac 提交于
This will get rolled back before new nightly pip get to be released PiperOrigin-RevId: 286037584 Change-Id: I1ecf8cf74353de72bdf969054ffffe0265ec1b80
-
由 A. Unique TensorFlower 提交于
PiperOrigin-RevId: 286034751 Change-Id: I11f7a8a4407163b8fe75f236a4e9f8cf8323376e
-
由 A. Unique TensorFlower 提交于
The lowering of MemRef types to the LLVM dialect is connected to the underlying runtime representation of structured memory buffers. It has changed several times in the past and reached the current state of a LLVM structured-typed descriptor containing two pointers and all sizes. In several reported use cases, a different, often simpler, lowering scheme is required. For example, lowering statically-shaped memrefs to bare LLVM pointers to simplify aliasing annotation. Split the pattern population functions into those include memref-related operations and the remaining ones. Users are expected to extend TypeConverter::convertType to handle the memref types differently. PiperOrigin-RevId: 286030610 Change-Id: Ibd7b38ca29f76d6eff9329f52d913a006d2a1358
-
由 A. Unique TensorFlower 提交于
PiperOrigin-RevId: 286030391 Change-Id: I9c3a6f6104b23929b36078c4eca36da79eb245ce
-
由 Robert David 提交于
PiperOrigin-RevId: 286027269 Change-Id: Ic37a825ec9b897e539a17946e3ed88f943f9efc5
-
由 A. Unique TensorFlower 提交于
The syntax for LLVM dialect types changed twice since this document was introduced. First, the quoted types are only prefixed with the dialect name `!llvm` rather than with `!llvm.type`. Second, for types that are simple enough (e.g., MLIR identifiers), the pretty form can be used instead of the quoted form. The relevant commits updated the dialect documentation, but not the conversion documentation. Use the valid type names in the conversion documentation. PiperOrigin-RevId: 286026153 Change-Id: Ib17798d4147922377b4d8866f9f8af47546cee31
-
由 Sanjoy Das 提交于
PiperOrigin-RevId: 286025069 Change-Id: I9d44df7209afcd95e0cfea92093fcd7a2ddf02fe
-
由 Ayush Dubey 提交于
This change fixes a subtle bug in `CompleteGroupLocal`, described below. After this change, `collective_ops_gpu_test_gpu` is no longer flaky. During group resolution, the first device to be resolved creates a `GroupRec` in `group_table_`, and initializes this `GroupRec` with the group's key, size, and device type. Subsequent devices access this group rec and check that the group attributes match. Because of the way mutexes are locked, before this change it was possible to skip the device type check as illustrated in the following example. Imagine a group with 2 devices, CPU:0 and GPU:0. CPU:0 creates the `gr` and initializes its type to CPU. It releases `group_mu_` but before it can lock `gr->mu` GPU:0 locks it first. GPU:0 then skips the device type check because `device_set` is empty. This is incorrect, and the desired behavior is to return an internal error for a collective group with different device types. This change removes the `device_set` is empty check because it is not needed. PiperOrigin-RevId: 286024716 Change-Id: Ib54defe5326426ab5cd1b692067b2b207f7a1d7e
-
由 A. Unique TensorFlower 提交于
This function has become redundant with MemRefDescriptor::getElementType and is no longer necessary. Use the MemRefDescriptor pervasively to concentrate descriptor-related logic in one place and drop the utility function. PiperOrigin-RevId: 286024168 Change-Id: I49880ed90488610887d9b3f5cd0bcbc7be235ca3
-
由 A. Unique TensorFlower 提交于
The conversion procedure has been updated to reflect the most recent MemRef descriptor proposal, but the documentation was only updated for the type conversion, omitting the address computation section. Make sure the two sections agree. PiperOrigin-RevId: 286022684 Change-Id: I4f6d8e0dcbee2ef5d856cacb4a90dd709d4721da
-
由 Juhyun Lee 提交于
PiperOrigin-RevId: 286021437 Change-Id: Ieb05e52c3ec2dc0f4df47a596e958acf57c54817
-
由 A. Unique TensorFlower 提交于
PiperOrigin-RevId: 286020854 Change-Id: I6a1eaf2fb3ea7ec1e769edc00f3310b9d04a8292
-