提交 · e2d43895f93bc5890b46efaf576c66182ee84131 · wux_labs / Tensorflow

14 10月, 2022 40 次提交

Update Eigen to commit:3bb6a48d8c171cf20b5f8e48bfb4e424fbd4f79e · e2d43895

由 A. Unique TensorFlower 提交于 10月 13, 2022

CHANGELOG
=========
3bb6a48d8 - Fix bug atan2
14c847dc0 - Refactor special values test for pow, and add a similar test for atan2
462758e8a - Don'\''t use generic sign function for sign(complex) unless it is vectorizable
c0d6a7261 - Use pnegate(pzero(x)) as a generic way to generate -0.0. Some compiler do not handle the literal -0.0 properly in fastmath mode.
7846c7387 - Eigen/Sparse: fix warnings -Wunused-but-set-variable
316754487 - Handle NaN inputs to atan2.
72db3f0fa - Remove references to M_PI_2 and M_PI_4.
d6bc06259 - Remove reference to EIGEN_HAS_CXX11_MATH.
5ceed0d57 - Guard GCC-specific pragmas with "#ifdef EIGEN_COMP_GNUC"
528b68674 - [clang-format] Add a few macros to AttributeMacros
e95c4a837 - Simpler range reduction strategy for atan<float>().
80efbfded - Unconditionally enable CXX11 math.
e5794873c - Replace assert with eigen_assert.
7d6a9925c - Fix 4x4 inverse when compiling with -Ofast.
1414a76fa - Only vectorize atan<double> for Altivec if VSX is available.
c475228b2 - Vectorize atan() for double.
1e1848fdb - Add a vectorized implementation of atan2 to Eigen.

* Switch TensorFlow to using the new fast atan2 in Eigen.
* Get rid of local implementations since Eigen is now guaranteed to support C++11 math.

PiperOrigin-RevId: 481044760

e2d43895

Update TFRT dependency to use revision · 65f0ce33

由 A. Unique TensorFlower 提交于 10月 13, 2022

http://github.com/tensorflow/runtime/commit/97d12a118984d820c14bdad0971704d95d99f411.

PiperOrigin-RevId: 481043847

65f0ce33

Integrate LLVM at llvm/llvm-project@1fda6f6859aa · 41785055

由 A. Unique TensorFlower 提交于 10月 13, 2022

Updates LLVM usage to match
[1fda6f6859aa](https://github.com/llvm/llvm-project/commit/1fda6f6859aa)

PiperOrigin-RevId: 481042584

41785055

R
xla/python/py_values.cc GIL-not-held fix: `DevicePutResult` can be copied only with the GIL held. · 8bab1b5a
由 Ralf W. Grosse-Kunstleve 提交于 10月 13, 2022
```
PiperOrigin-RevId: 481042256
```
8bab1b5a

Allow NaN values in TFLite calibrator · 3d977755

由 Jaesung Chung 提交于 10月 13, 2022

For tf.DivNoNan operators, the calibrator should NaN values since the lowered
form of TFLite will encounter NaN value by design.

PiperOrigin-RevId: 481039479

3d977755

M
[XNNPACK] Document support for Slice operator · b076fa72
由 Marat Dukhan 提交于 10月 13, 2022
```
PiperOrigin-RevId: 481035658
```
b076fa72
A
Refactor libtftpu.h from core/tpu to compiler/xla/stream_exxecutor · f7336e1f
由 A. Unique TensorFlower 提交于 10月 13, 2022
```
PiperOrigin-RevId: 481034607
```
f7336e1f
T
[XLA:SPMD] Extend Iota matching heuristics for gather/scatter index-parallel partitioning. · 2b3b5084
由 Tongfei Guo 提交于 10月 13, 2022
```
PiperOrigin-RevId: 481030210
```
2b3b5084

Updating determinism check in svd op. Now the op works deterministically when... · 29f9aa33

由 A. Unique TensorFlower 提交于 10月 13, 2022

Updating determinism check in svd op. Now the op works deterministically when enabled, except in the case that the input matrix has column size 1.

PiperOrigin-RevId: 481018683

29f9aa33

A
Updates the security policy support for old releases. · e03e43dc
由 A. Unique TensorFlower 提交于 10月 13, 2022
```
PiperOrigin-RevId: 481016434
```
e03e43dc

Add support for TF_SigmoidGrad op as a MLIR legalization pass. · 05e7f51e

由 Mohammadreza Heydary 提交于 10月 13, 2022

This CL introduces a new legalization pattern that rewrites sigmoid_grad
operation as a `grad = dy * y * (1 - y)`, where `y = sigmoid(x)`.

PiperOrigin-RevId: 481015565

05e7f51e

[xla:runtime] Make XlaRuntimeCpuExecutable compatible with AOT · 8187a2a2

由 Anlun Xu 提交于 10月 13, 2022

JitExecutable is not required for AOT compilation, so XlaRuntimeCpuExecutable can own a runtime::Executable directly.

PiperOrigin-RevId: 481012743

8187a2a2

F
Move CaptureSnapshot to FunctionType · bbe022e9
由 Faizan Muhammad 提交于 10月 13, 2022
```
PiperOrigin-RevId: 481010554
```
bbe022e9
D
Move core/util/tensor_bundle/byte_swap_array.{cc,h} to TSL, update usage in XLA · 2d3eb407
由 David Dunleavy 提交于 10月 13, 2022
```
PiperOrigin-RevId: 481008739
```
2d3eb407
A
[xla:runtime] Add CpuCompiler::Export · e20f8814
由 Anlun Xu 提交于 10月 13, 2022
```
PiperOrigin-RevId: 481007346
```
e20f8814
D
Move compiler/xla/experimental to compiler/xla/hlo/experimental, update users · 53fa1d27
由 David Dunleavy 提交于 10月 13, 2022
```
PiperOrigin-RevId: 481006537
```
53fa1d27
F
Use FunctionType in FunctionCacheKey · b5cc8c77
由 Faizan Muhammad 提交于 10月 13, 2022
```
PiperOrigin-RevId: 481006100
```
b5cc8c77
U
Removed call to function_eager's register method. · e4a52e12
由 Umer Javed 提交于 10月 13, 2022
```
PiperOrigin-RevId: 481004699
```
e4a52e12
R
[NFC] Add missing std headers · 8b333007
由 Rahul Joshi 提交于 10月 13, 2022
```
PiperOrigin-RevId: 481001377
```
8b333007
P
disable flaky test (port_test) · dcc677c2
由 Pankaj Kanwar 提交于 10月 13, 2022
```
PiperOrigin-RevId: 480996400
```
dcc677c2
A
[xla:runtime] Add protobuf for CPU executable and CpuXlaRuntimeAotCompilationResult · 45b66a13
由 Anlun Xu 提交于 10月 13, 2022
```
PiperOrigin-RevId: 480996180
```
45b66a13

Update TFRT dependency to use revision · bda96ae9

由 A. Unique TensorFlower 提交于 10月 13, 2022

http://github.com/tensorflow/runtime/commit/36b7795640a99e44a9ab246c5495be5f99736c99.

PiperOrigin-RevId: 480985467

bda96ae9

A
[lite] When constructing a subgraph, use control dependencies from the model's... · 61bd2c65
由 A. Unique TensorFlower 提交于 10月 13, 2022
```
[lite] When constructing a subgraph, use control dependencies from the model's metadata, if present.

PiperOrigin-RevId: 480984280
```
61bd2c65
S
Update to the Ops compatibility overview · 164d5975
由 Soo Sung 提交于 10月 13, 2022
```
PiperOrigin-RevId: 480983235
```
164d5975
F
Add type hierarchy logic to FunctionType · 2b003310
由 Faizan Muhammad 提交于 10月 13, 2022
```
PiperOrigin-RevId: 480977676
```
2b003310
E
[xla:runtime] Always keep frame pointer when compiling xla executable · d1a41c23
由 Eugene Zhulenev 提交于 10月 13, 2022
```
PiperOrigin-RevId: 480974257
```
d1a41c23

Changes to shrink generated code size substantially. · 0544e5ae

由 Jeffrey A. Dean 提交于 10月 13, 2022

Use std::function objects rather than creating thousands of separate copies of
very large routines in shape_util.h

Refactored code to have an internal helper ForEachState struct, and to have
separate code for the parallel vs. non-parallel versions of the core
ForEachInternal functionality (compiler wasn't smart enough to track the
parallel bit through the std::optional object with the ThreadPool, and so
was emitting parallel code even for calls with the non-parallel variant,
and due to templatization, there were thousands of copies of these routines.

Avoid inlining some large routines in literal.h

Avoid inlining some automatically generated constructors for ShapeIndex, ProgramShape,
etc. in shape.{h,cc}

Avoid inlining large routines on non-OK paths in status_macros.h

Changes drop generated text size for a large binary by about 3.0 MB (~1.4%).

PiperOrigin-RevId: 480973483

0544e5ae

Merge pull request #55780 from... · 3771f6e1

由 TensorFlower Gardener 提交于 10月 13, 2022

Merge pull request #55780 from ROCmSoftwarePlatform:google_upstream_remove_rocm_build_flag_nextafter_op

PiperOrigin-RevId: 480973150

3771f6e1

[XLA] Fix operand->tuple sharding prop to correctly handle empty tuples · f2f5fcd5

由 Rahul Joshi 提交于 10月 13, 2022

- Unify the code for handling operand->tuple sharding propagation when no
  sharding is present vs refining the existing sharding (and reuse the code
  to refine existing sharding).
- This also fixes an issue with handling empty tuple sub-elements, which are
  essentially not counted in the top-level tuple elements of the tuple sharding
  (since the code the refines existing sharding handles this correctly)

PiperOrigin-RevId: 480971521

f2f5fcd5

L
Small comment adjustment. · f5a5d487
由 Luke Boyer 提交于 10月 13, 2022
```
PiperOrigin-RevId: 480968915
```
f5a5d487
T
Merge pull request #57979 from ROCmSoftwarePlatform:fixed_gpu_kernel_tiling_test_2 · df1d9f90
由 TensorFlower Gardener 提交于 10月 13, 2022
```
PiperOrigin-RevId: 480958124
```
df1d9f90

[SPMD] Fix HloInstruction::HasSideEffects to be more aggressive with collectives · f7d6896e

由 Rahul Joshi 提交于 10月 13, 2022

 - consider collective communication operations with channel_id as side
   effecting only in non-spmd mode.
 - Also handle all collective operations in the function.
 - Change SPMD partitioning test to verify that any collective generated by
   partitioning does not have sharding.

PiperOrigin-RevId: 480953154

f7d6896e

B
Add support for simulated quantization to the TPUEmbedding API. · 6ae51c00
由 Bruce Fontaine 提交于 10月 13, 2022
```
PiperOrigin-RevId: 480950049
```
6ae51c00
H
Save preemption notifier instance to context distributed manager. · 27cd2a2d
由 Haoyu Zhang 提交于 10月 13, 2022
```
PiperOrigin-RevId: 480949877
```
27cd2a2d
A
Roll forward Move XPlane Proto to TSL. · 54e58314
由 A. Unique TensorFlower 提交于 10月 13, 2022
```
PiperOrigin-RevId: 480944931
```
54e58314

Cleanup tfl.resize lowering to tosa.resize · 4ff58c26

由 Robert Suderman 提交于 10月 13, 2022

Existing lowering missed cases where the width/height of
the input or output are 1. These cases were difficult to
address in the current implementation so they were cleaned
up. Then power-of-2 specific code was removed as it was
easier to just depend on GCD to do the right thing.

PiperOrigin-RevId: 480936509

4ff58c26

Parameterize some test cases to make use of sharding more effectively · 79584a00

由 Zhi An Ng 提交于 10月 13, 2022

These 2 tests cases are long running, because they test a cartesian product of dtypes * transpose * adjoint * shapes (2 * 4 * 4 * 3 = 96). These 2 test cases are the bottlenecks in the entire test suite finishing. By converting them into parameterized test cases, each of the case in the product becomes its own test case, and can run on different shards.

PiperOrigin-RevId: 480935083

79584a00

A
Cache free var detection result based on function qualname and its module name · f0cd16cd
由 A. Unique TensorFlower 提交于 10月 13, 2022
```
PiperOrigin-RevId: 480932683
```
f0cd16cd

[mlir][tosa] Adds MHLO -> TOSA legalizations for iota · 7d3e3c14

由 Jenni Kilduff 提交于 10月 13, 2022

This creates a const op filled with [0, 1, 2...iotaSize] values, then tiles it to the iota result shape

PiperOrigin-RevId: 480930548

7d3e3c14

J
Remove tensorflow.bzl from tsl/framework · 09165bd3
由 Jake Harmon 提交于 10月 13, 2022
```
PiperOrigin-RevId: 480928553
```
09165bd3