1. 21 10月, 2015 14 次提交
  2. 20 10月, 2015 12 次提交
    • M
    • M
      [FLINK-2865] remove upper direct memory size bound · 6c44d93d
      Maximilian Michels 提交于
      - set the upper bound to Long.MAX_VALUE
      
      For YARN, we set it to the calculated maximum container size (no need to fix).
      6c44d93d
    • T
      [FLINK-2354] [runtime] Replace old StateHandleProvider by StateStorageHelper... · a6890b28
      Till Rohrmann 提交于
      [FLINK-2354] [runtime] Replace old StateHandleProvider by StateStorageHelper in ZooKeeperStateHandleStore
      
      The old StateHandleProvider used in ZooKeeperStateHandleStore had to be replaced because the state backend implementation has changed. Since the new state backend could not be used anymore, a new StateStorageHelper interface has been created. The default implementation FileSystemStateStorageHelper stores the given state onto the specified file system and returns a FileSerializableStateHandle.
      
      Various fixes due to rebasing.
      a6890b28
    • T
      [FLINK-2852] [test-stability] Fix ScalaShellITSuite and ScalaShellLocalStartupITCase · 630798d3
      Till Rohrmann 提交于
      Changes test program to use an int accumulator which is checked at the end of the program.
      This avoids to look for the "Job status changed to FINISHED." string in the stdout output,
      which can sometimes not be printed if the JobExecutionResult arrives earlier than the
      JobStatusChanged message at the JobClientActor.
      630798d3
    • T
      [FLINK-2793] [runtime-web] Rework JobManagerRetriever to avoid race conditions · e3ad9621
      Till Rohrmann 提交于
      The JobManagerRetriever sets the new leaderGatewayPortFuture directly in the notifyLeaderAddress
      method instead of in one of the futures. This avoids race conditions between multiple futures
      which finish in a different order than they were started. Furthermore, this replaces promises
      by futures where a promise is not needed.
      
      Add logging statement
      
      Fix WebRuntimeMonitorITCase to use random port and proper state backend
      
      Add ChannelHandler.Sharable to RuntimeMonitorHandler
      
      Remove sanity check from WebInfoServer to let it work on Yarn
      e3ad9621
    • U
      [FLINK-2793] [runtime-web] Redirect to leader in non-standalone mode · 77fc0cc4
      Ufuk Celebi 提交于
      Squashes:
      5a88d5e [tests] Add HttpTestClient for testing HTTP responses
      656d6d6 Split WebMonitor and LeaderRetrievalService start up
      a7e8da8 Move generated /web files to src/main/resources
      
      Add comment to webMonitorPort attribute and make line breaks more Scalaesque
      
      Don't block on leader retrieval and only resolve associated job manager once
      
      Make JobManagerRetriever independent of redirecting logic
      
      This closes #1202.
      77fc0cc4
    • T
      [FLINK-2804] [runtime] Add blocking job submission support for HA · d18f5809
      Till Rohrmann 提交于
      The JobClientActor is now repsonsible for receiving the JobStatus updates from
      a newly elected leader. It uses the LeaderRetrievalService to be notified about
      new leaders. The actor can only be used to submit a single job to the JM. Once
      it received a job from the Client it tries to send it to the current leader.
      If no leader is available, a connection timeout is triggered. If the job could
      be sent to the JM, a submission timeout is triggered if the JobClientActor does
      not receive a JobSubmitSuccess message within the timeout interval. If the
      connection to the leader is lost after having submitted a job, a connection
      timeout is triggered if the JobClientActor cannot reconnect to another JM within
      the timeout interval. The JobClient simply awaits on the completion of the
      returned future to the SubmitJobAndWait message.
      
      Added test cases for JobClientActor exceptions
      
      This closes #1249.
      d18f5809
    • U
      [FLINK-2805] [blobmanager] Write JARs to file state backend for recovery · c3a4d1d9
      Ufuk Celebi 提交于
      Move StateBackend enum to top level and org.apache.flink.runtime.state
      
      Abstract blob store in blob server for recovery
      
      This closes #1227.
      c3a4d1d9
    • T
      [FLINK-2354] [runtime] Remove state changing futures in JobManager · c2989f2b
      Till Rohrmann 提交于
      Internal actor states must only be modified within the actor thread.
      This avoids all the well-known issues coming with concurrency.
      
      Fix RemoveCachedJob by introducing RemoveJob
      
      Fix JobManagerITCase
      
      Add removeJob which maintains the job in the SubmittedJobGraphStore
      
      Make revokeLeadership not remove the jobs from the state backend
      
      Fix shading problem with curator by hiding CuratorFramework in ChaosMonkeyITCase
      c2989f2b
    • U
      [FLINK-2354] [runtime] Add job graph and checkpoint recovery · 73c73e92
      Ufuk Celebi 提交于
      This closes #1153.
      73c73e92
    • U
    • U
  3. 19 10月, 2015 9 次提交
  4. 18 10月, 2015 5 次提交