Commits · e03e43dc1c8969d8ccfd2fa30c2d3075121c0392 · wux_labs / Tensorflow

14 Oct, 2022 · 40 commits
- A
  Updates the security policy support for old releases. · e03e43dc
  Authored by A. Unique TensorFlower on Oct 13, 2022
```
PiperOrigin-RevId: 481016434
```
  e03e43dc
- M
  Add support for TF_SigmoidGrad op as a MLIR legalization pass. · 05e7f51e
  Authored by Mohammadreza Heydary on Oct 13, 2022
```
This CL introduces a new legalization pattern that rewrites the sigmoid_grad
operation as `grad = dy * y * (1 - y)`, where `y = sigmoid(x)`.

PiperOrigin-RevId: 481015565
```
  05e7f51e
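
The identity quoted in the commit message above can be checked numerically. The following is a minimal sketch in plain Python, not the MLIR pattern itself; the helper names `sigmoid`, `sigmoid_grad`, and `numeric_grad` are hypothetical and exist only for this illustration.

```python
import math


def sigmoid(x):
    """Logistic function y = 1 / (1 + exp(-x))."""
    return 1.0 / (1.0 + math.exp(-x))


def sigmoid_grad(y, dy):
    """Gradient w.r.t. x, given y = sigmoid(x) and the upstream gradient dy.

    This is the expression from the commit message: dy * y * (1 - y).
    """
    return dy * y * (1.0 - y)


def numeric_grad(x, eps=1e-6):
    """Central finite difference of sigmoid at x (used only as a cross-check)."""
    return (sigmoid(x + eps) - sigmoid(x - eps)) / (2.0 * eps)


x = 0.5
analytic = sigmoid_grad(sigmoid(x), dy=1.0)
numeric = numeric_grad(x)
```

Because the rewritten form depends only on `y` and `dy`, it can be built from elementwise multiply and subtract operations without re-evaluating the exponential.
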
- A
  [xla:runtime] Make XlaRuntimeCpuExecutable compatible with AOT · 8187a2a2
  Authored by Anlun Xu on Oct 13, 2022
```
JitExecutable is not required for AOT compilation, so XlaRuntimeCpuExecutable can own a runtime::Executable directly.

PiperOrigin-RevId: 481012743
```
  8187a2a2
- F
  Move CaptureSnapshot to FunctionType · bbe022e9
  Authored by Faizan Muhammad on Oct 13, 2022
```
PiperOrigin-RevId: 481010554
```
  bbe022e9
- D
  Move core/util/tensor_bundle/byte_swap_array.{cc,h} to TSL, update usage in XLA · 2d3eb407
  Authored by David Dunleavy on Oct 13, 2022
```
PiperOrigin-RevId: 481008739
```
  2d3eb407
- A
  [xla:runtime] Add CpuCompiler::Export · e20f8814
  Authored by Anlun Xu on Oct 13, 2022
```
PiperOrigin-RevId: 481007346
```
  e20f8814
- D
  Move compiler/xla/experimental to compiler/xla/hlo/experimental, update users · 53fa1d27
  Authored by David Dunleavy on Oct 13, 2022
```
PiperOrigin-RevId: 481006537
```
  53fa1d27
- F
  Use FunctionType in FunctionCacheKey · b5cc8c77
  Authored by Faizan Muhammad on Oct 13, 2022
```
PiperOrigin-RevId: 481006100
```
  b5cc8c77
- U
  Removed call to function_eager's register method. · e4a52e12
  Authored by Umer Javed on Oct 13, 2022
```
PiperOrigin-RevId: 481004699
```
  e4a52e12
- R
  [NFC] Add missing std headers · 8b333007
  Authored by Rahul Joshi on Oct 13, 2022
```
PiperOrigin-RevId: 481001377
```
  8b333007
- P
  disable flaky test (port_test) · dcc677c2
  Authored by Pankaj Kanwar on Oct 13, 2022
```
PiperOrigin-RevId: 480996400
```
  dcc677c2
- A
  [xla:runtime] Add protobuf for CPU executable and CpuXlaRuntimeAotCompilationResult · 45b66a13
  Authored by Anlun Xu on Oct 13, 2022
```
PiperOrigin-RevId: 480996180
```
  45b66a13
- A
  Update TFRT dependency to use revision · bda96ae9
  Authored by A. Unique TensorFlower on Oct 13, 2022
```
http://github.com/tensorflow/runtime/commit/36b7795640a99e44a9ab246c5495be5f99736c99.

PiperOrigin-RevId: 480985467
```
  bda96ae9
- A
  [lite] When constructing a subgraph, use control dependencies from the model's... · 61bd2c65
  Authored by A. Unique TensorFlower on Oct 13, 2022
```
[lite] When constructing a subgraph, use control dependencies from the model's metadata, if present.

PiperOrigin-RevId: 480984280
```
  61bd2c65
- S
  Update to the Ops compatibility overview · 164d5975
  Authored by Soo Sung on Oct 13, 2022
```
PiperOrigin-RevId: 480983235
```
  164d5975
- F
  Add type hierarchy logic to FunctionType · 2b003310
  Authored by Faizan Muhammad on Oct 13, 2022
```
PiperOrigin-RevId: 480977676
```
  2b003310
- E
  [xla:runtime] Always keep frame pointer when compiling xla executable · d1a41c23
  Authored by Eugene Zhulenev on Oct 13, 2022
```
PiperOrigin-RevId: 480974257
```
  d1a41c23
- J
  Changes to shrink generated code size substantially. · 0544e5ae
  Authored by Jeffrey A. Dean on Oct 13, 2022
```
Use std::function objects rather than creating thousands of separate copies of
very large routines in shape_util.h

Refactored code to have an internal helper ForEachState struct, and to have
separate code for the parallel vs. non-parallel versions of the core
ForEachInternal functionality (compiler wasn't smart enough to track the
parallel bit through the std::optional object with the ThreadPool, and so
was emitting parallel code even for calls with the non-parallel variant,
and, due to templatization, there were thousands of copies of these routines).

Avoid inlining some large routines in literal.h

Avoid inlining some automatically generated constructors for ShapeIndex, ProgramShape,
etc. in shape.{h,cc}

Avoid inlining large routines on non-OK paths in status_macros.h

Changes drop generated text size for a large binary by about 3.0 MB (~1.4%).

PiperOrigin-RevId: 480973483
```
  0544e5ae
- T
  Merge pull request #55780 from... · 3771f6e1
  Authored by TensorFlower Gardener on Oct 13, 2022
```
Merge pull request #55780 from ROCmSoftwarePlatform:google_upstream_remove_rocm_build_flag_nextafter_op

PiperOrigin-RevId: 480973150
```
  3771f6e1
- R
  [XLA] Fix operand->tuple sharding prop to correctly handle empty tuples · f2f5fcd5
  Authored by Rahul Joshi on Oct 13, 2022
```
- Unify the code for handling operand->tuple sharding propagation when no
  sharding is present vs refining the existing sharding (and reuse the code
  to refine existing sharding).
- This also fixes an issue with handling empty tuple sub-elements, which are
  essentially not counted in the top-level tuple elements of the tuple sharding
  (since the code that refines existing sharding handles this correctly).

PiperOrigin-RevId: 480971521
```
  f2f5fcd5
- L
  Small comment adjustment. · f5a5d487
  Authored by Luke Boyer on Oct 13, 2022
```
PiperOrigin-RevId: 480968915
```
  f5a5d487
- T
  Merge pull request #57979 from ROCmSoftwarePlatform:fixed_gpu_kernel_tiling_test_2 · df1d9f90
  Authored by TensorFlower Gardener on Oct 13, 2022
```
PiperOrigin-RevId: 480958124
```
  df1d9f90
- R
  [SPMD] Fix HloInstruction::HasSideEffects to be more aggressive with collectives · f7d6896e
  Authored by Rahul Joshi on Oct 13, 2022
```
 - Consider collective communication operations with channel_id as
   side-effecting only in non-SPMD mode.
 - Also handle all collective operations in the function.
 - Change SPMD partitioning test to verify that any collective generated by
   partitioning does not have sharding.

PiperOrigin-RevId: 480953154
```
  f7d6896e
- B
  Add support for simulated quantization to the TPUEmbedding API. · 6ae51c00
  Authored by Bruce Fontaine on Oct 13, 2022
```
PiperOrigin-RevId: 480950049
```
  6ae51c00
- H
  Save preemption notifier instance to context distributed manager. · 27cd2a2d
  Authored by Haoyu Zhang on Oct 13, 2022
```
PiperOrigin-RevId: 480949877
```
  27cd2a2d
- A
  Roll forward Move XPlane Proto to TSL. · 54e58314
  Authored by A. Unique TensorFlower on Oct 13, 2022
```
PiperOrigin-RevId: 480944931
```
  54e58314
- R
  Cleanup tfl.resize lowering to tosa.resize · 4ff58c26
  Authored by Robert Suderman on Oct 13, 2022
```
Existing lowering missed cases where the width/height of
the input or output are 1. These cases were difficult to
address in the current implementation so they were cleaned
up. Then power-of-2 specific code was removed as it was
easier to just depend on GCD to do the right thing.

PiperOrigin-RevId: 480936509
```
  4ff58c26
- Z
  Parameterize some test cases to make use of sharding more effectively · 79584a00
  Authored by Zhi An Ng on Oct 13, 2022
```
These 2 test cases are long-running because they test a Cartesian product of dtypes * transpose * adjoint * shapes (2 * 4 * 4 * 3 = 96). They are the bottleneck that keeps the entire test suite from finishing sooner. By converting them into parameterized test cases, each case in the product becomes its own test case and can run on a different shard.

PiperOrigin-RevId: 480935083
```
  79584a00
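
The effect described in the commit message above can be illustrated with a small sketch. It uses only the standard-library `unittest` module and generates test methods by hand; TensorFlow's own tests commonly use `absl.testing.parameterized` instead, and nothing here is taken from the actual change. The parameter values, `check_case`, and both test classes are hypothetical; only the axis sizes (2 * 4 * 4 * 3 = 96) come from the commit message.

```python
import itertools
import unittest

# Hypothetical parameter axes; only their sizes (2, 4, 4, 3) follow the commit message.
DTYPES = ["float32", "float64"]
TRANSPOSES = list(itertools.product([False, True], repeat=2))  # 4 combinations
ADJOINTS = list(itertools.product([False, True], repeat=2))    # 4 combinations
SHAPES = [(1, 1), (2, 3), (4, 4)]

ALL_CASES = list(itertools.product(DTYPES, TRANSPOSES, ADJOINTS, SHAPES))


def check_case(dtype, transpose, adjoint, shape):
    """Placeholder for the real per-combination check."""
    return len(shape) == 2


class MonolithicTest(unittest.TestCase):
    """One long-running test method that loops over the whole product."""

    def test_all_combinations(self):
        for case in ALL_CASES:
            self.assertTrue(check_case(*case))


class ParameterizedTest(unittest.TestCase):
    """Receives one generated test method per combination (see loop below)."""


def _make_test(case):
    def test(self):
        self.assertTrue(check_case(*case))
    return test


for index, case in enumerate(ALL_CASES):
    setattr(ParameterizedTest, f"test_case_{index:02d}", _make_test(case))


def count_tests(cls):
    """Number of individually schedulable tests the loader finds in cls."""
    return unittest.defaultTestLoader.loadTestsFromTestCase(cls).countTestCases()
```

With the monolithic form a test runner sees a single unit of work, so sharding cannot split it; with one method per combination, the same checks can be distributed across shards.
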
- A
  Cache free var detection result based on function qualname and its module name · f0cd16cd
  Authored by A. Unique TensorFlower on Oct 13, 2022
```
PiperOrigin-RevId: 480932683
```
  f0cd16cd
- J
  [mlir][tosa] Adds MHLO -> TOSA legalizations for iota · 7d3e3c14
  Authored by Jenni Kilduff on Oct 13, 2022
```
This creates a const op filled with [0, 1, 2...iotaSize] values, then tiles it to the iota result shape

PiperOrigin-RevId: 480930548
```
  7d3e3c14
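
The "constant, then tile" idea in the commit message above can be sketched in plain Python on a flat row-major buffer. This is an illustration only, not the legalization code: the function name `iota_via_const_and_tile` is hypothetical, and the sketch assumes the constant holds the values `0` through `iotaSize - 1`.

```python
from math import prod


def iota_via_const_and_tile(shape, iota_dim):
    """Return a flat row-major buffer where each element equals its index along iota_dim.

    Step 1: build a 1-D constant [0, 1, ..., shape[iota_dim] - 1].
    Step 2: tile it. Each value is repeated across the trailing (inner)
            dimensions, and the whole run is repeated across the leading
            (outer) dimensions.
    """
    const = list(range(shape[iota_dim]))
    inner = prod(shape[iota_dim + 1:])  # elements per step along iota_dim
    outer = prod(shape[:iota_dim])      # repetitions of the whole run
    return [v for _ in range(outer) for v in const for _ in range(inner)]


rows_vary = iota_via_const_and_tile((2, 3), iota_dim=0)
cols_vary = iota_via_const_and_tile((2, 3), iota_dim=1)
```

For a `(2, 3)` result, `iota_dim=1` yields the row `0, 1, 2` repeated twice, while `iota_dim=0` yields three zeros followed by three ones.
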
- J
  Remove tensorflow.bzl from tsl/framework · 09165bd3
  Authored by Jake Harmon on Oct 13, 2022
```
PiperOrigin-RevId: 480928553
```
  09165bd3
- A
  Refactor out some code in xla_compile_on_demand as util methods. · d54b4d21
  Authored by A. Unique TensorFlower on Oct 13, 2022
```
PiperOrigin-RevId: 480927625
```
  d54b4d21
- A
  [pjrt:cpu] Add SerializeExecutable and DeserializeExecutable to TfrtCpuClient · 2259ae05
  Authored by Anlun Xu on Oct 13, 2022
```
PiperOrigin-RevId: 480927276
```
  2259ae05
- T
  Merge pull request #58066 from tensorflow:rthadur-patch-1 · c8a8d954
  Authored by TensorFlower Gardener on Oct 13, 2022
```
PiperOrigin-RevId: 480926841
```
  c8a8d954
- Z
  [XNNPACK] Support Slice node in XNNPACK delegate · 582e2037
  Authored by Zhi An Ng on Oct 13, 2022
```
PiperOrigin-RevId: 480926009
```
  582e2037
- C
  Roll forward Move XPlane Proto to TSL. · 86596848
  Authored by Clive Verghese on Oct 13, 2022
```
PiperOrigin-RevId: 480922000
```
  86596848
- A
  [tfrt:jit] Allow for tiling matmul in all possible dimensions · c4f3a3ef
  Authored by A. Unique TensorFlower on Oct 13, 2022
```
Tiling `linalg.matmul` with any (< 3) number of tile sizes.

PiperOrigin-RevId: 480921681
```
  c4f3a3ef
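
As a rough illustration of tiling a matmul with a variable number of tile sizes, the sketch below loops over tiles of the `m`, `n`, and `k` dimensions in plain Python. It is not the TFRT JIT pass. The function `tiled_matmul` is hypothetical, and the conventions that a missing tile size or a size of 0 leaves that dimension untiled, and that up to three sizes are accepted, are assumptions made for this example.

```python
def tiled_matmul(a, b, tile_sizes=()):
    """Multiply a (m x k) by b (k x n), iterating tile by tile.

    tile_sizes supplies up to three sizes for the (m, n, k) dimensions.
    Assumption for this sketch: a missing or zero size means "do not tile
    this dimension", i.e. use its full extent as a single tile.
    """
    m, k = len(a), len(a[0])
    n = len(b[0])
    full = (m, n, k)
    tm, tn, tk = [
        tile_sizes[i] if i < len(tile_sizes) and tile_sizes[i] > 0 else full[i]
        for i in range(3)
    ]
    c = [[0] * n for _ in range(m)]
    # Outer loops walk over tiles; inner loops compute within one tile.
    for i0 in range(0, m, tm):
        for j0 in range(0, n, tn):
            for k0 in range(0, k, tk):
                for i in range(i0, min(i0 + tm, m)):
                    for j in range(j0, min(j0 + tn, n)):
                        acc = c[i][j]
                        for p in range(k0, min(k0 + tk, k)):
                            acc += a[i][p] * b[p][j]
                        c[i][j] = acc
    return c


A = [[1, 2, 3], [4, 5, 6]]
B = [[7, 8], [9, 10], [11, 12]]
untiled = tiled_matmul(A, B)
tiled_m_only = tiled_matmul(A, B, tile_sizes=(1,))
tiled_all = tiled_matmul(A, B, tile_sizes=(2, 1, 2))
```

Every choice of tile sizes produces the same result matrix; only the iteration order and the granularity of the work change.
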
- T
  Merge pull request #57519 from tatwaichong:conv3d · a06ca422
  Authored by TensorFlower Gardener on Oct 13, 2022
```
PiperOrigin-RevId: 480919007
```
  a06ca422
- E
  [xla:cpu-next] Extract XlaFrameworkMapping into its own file · 9ebdeae3
  Authored by Emilio Cota on Oct 13, 2022
```
So that it can be included from header files without bringing
in cpu_executable as a dependency.

PiperOrigin-RevId: 480917177
```
  9ebdeae3
- B
  Include gpu_delegate_native_jni.cc in a filegroup alongside gpu_delegate_jni.cc · 7f798a56
  Authored by Bernhard Bauer on Oct 13, 2022
```
The two files typically need to be consumed jointly, as gpu_delegate_native_jni.cc contains the native code needed for (successfully) initializing the native part of GpuDelegate.

PiperOrigin-RevId: 480914567
```
  7f798a56