| `spark.yarn.shuffle.stopOnFailure` | `false` | 是否在 Spark Shuffle Service 初始化出现故障时停止 NodeManager。This prevents application failures caused by running containers on NodeManagers where the Spark Shuffle Service is not running. |
| `spark.acls.enable` | false | 是否开启 Spark acls。如果开启了,它检查用户是否有权限去查看或修改 job。Note this requires the user to be known, so if the user comes across as null no checks are done. UI 利用使用过滤器验证和设置用户。 |
| `spark.network.crypto.enabled` | false | Enable encryption using the commons-crypto library for RPC and block transfer service. Requires `spark.authenticate` to be enabled. |
使用 multiple input streams(多个输入流)/ receivers(接收器)接收数据的替代方法是明确 repartition(重新分配) input data stream(输入数据流)(使用 `inputStream.repartition(<number of partitions>)` )。这会在 further processing(进一步处理)之前将 received batches of data(收到的批次数据) distributes(分发)到集群中指定数量的计算机.
使用 multiple input streams(多个输入流)/ receivers(接收器)接收数据的替代方法是明确 repartition(重新分配)input data stream(输入数据流)(使用 `inputStream.repartition(<number of partitions>)` )。这会在 further processing(进一步处理)之前将 received batches of data(收到的批次数据)distributes(分发)到集群中指定数量的计算机.
### Level of Parallelism in Data Processing(数据处理中的并行度水平)
| _Spark 1.1 或更早版本,_ 或者 <br> _Spark 1.2 或者没有 write ahead logs 的更高的版本_ | Buffered data lost with unreliable receivers(unreliable receivers 的缓冲数据丢失)<br> Zero data loss with reliable receivers(reliable receivers 的零数据丢失) <br> At-least once semantics(至少一次性语义) | Buffered data lost with unreliable receivers(unreliable receivers 的缓冲数据丢失) <br> Past data lost with all receivers(所有的 receivers 的过去的数据丢失) <br> Undefined semantics(未定义语义) |
| _Spark 1.2 或者带有 write ahead logs 的更高版本_ | Zero data loss with reliable receivers(reliable receivers 的零数据丢失)<br> At-least once semantics(至少一次性语义) | Zero data loss with reliable receivers and files(reliable receivers 和 files 的零数据丢失) <br> At-least once semantics(至少一次性语义) |
| _Spark 1.1 或更早版本,_ 或者 <br> _Spark 1.2 或者没有 write ahead logs 的更高的版本_ | Buffered data lost with unreliable receivers(unreliable receivers 的缓冲数据丢失)<br> Zero data loss with reliable receivers(reliable receivers 的零数据丢失)<br> At-least once semantics(至少一次性语义)| Buffered data lost with unreliable receivers(unreliable receivers 的缓冲数据丢失)<br> Past data lost with all receivers(所有的 receivers 的过去的数据丢失)<br> Undefined semantics(未定义语义)|
| _Spark 1.2 或者带有 write ahead logs 的更高版本_ | Zero data loss with reliable receivers(reliable receivers 的零数据丢失)<br> At-least once semantics(至少一次性语义)| Zero data loss with reliable receivers and files(reliable receivers 和 files 的零数据丢失)<br> At-least once semantics(至少一次性语义)|