1. 20 5月, 2014 11 次提交
    • A
      CLUSTER RESET implemented. · 5efa5501
      antirez 提交于
      The new command is able to reset a cluster node so that it starts again
      as a fresh node. By default the command performs a soft reset (the same
      as calling it as CLUSTER RESET SOFT), and the following steps are
      performed:
      
      1) All slots are set as unassigned.
      2) The list of known nodes is flushed.
      3) Node is set as master if it is a slave.
      
      When an hard reset is performed with CLUSTER RESET HARD the following
      additional operations are performed:
      
      4) A new Node ID is created at random.
      5) Epochs are set to 0.
      
      CLUSTER RESET is useful both when the sysadmin wants to reconfigure a
      node with a different role (for example turning a slave into a master)
      and for testing purposes.
      
      It also may play a role in automatically provisioned Redis Clusters,
      since it allows to reset a node back to the initial state in order to be
      reconfigured.
      5efa5501
    • A
      Remove trailing spaces from cluster.c file. · 687f84a3
      antirez 提交于
      687f84a3
    • A
      Cluster test: added function assert_cluster_state. · d2790240
      antirez 提交于
      d2790240
    • A
      be159490
    • A
      Cluster: better handling of stolen slots. · b8a71e5a
      antirez 提交于
      The previous code handling a lost slot (by another master with an higher
      configuration for the slot) was defensive, considering it an error and
      putting the cluster in an odd state requiring redis-cli fix.
      
      This was changed, because actually this only happens either in a
      legitimate way, with failovers, or when the admin messed with the config
      in order to reconfigure the cluster. So the new code instead will try to
      make sure that the keys stored match the new slots map, by removing all
      the keys in the slots we lost ownership from.
      
      The function that deletes the keys from the lost slots is called only
      if the node does not lose all its slots (resulting in a reconfiguration
      as a slave of the node that got ownership). This is an optimization
      since the replication code will anyway flush all the instance data in
      a faster way.
      b8a71e5a
    • A
      cluster.tcl: fix redis links leak in refresh_nodes_map. · f727000f
      antirez 提交于
      f727000f
    • A
      cluster.tcl: saner error handling. · 4db4a5d5
      antirez 提交于
      Better handling of connection errors in order to update the table and
      recovery, populate the startup nodes table after fetching the list of
      nodes.
      
      More work to do about it, it is still not as reliable as
      redis-rb-cluster implementation which is the minimal reference
      implementation for Redis Cluster clients.
      4db4a5d5
    • A
      d9227938
    • A
      e10ee072
    • A
      Cluster: forced failover implemented. · e84dcabf
      antirez 提交于
      Using CLUSTER FAILOVER FORCE it is now possible to failover a master in
      a forced way, which means:
      
      1) No check to understand if the master is up is performed.
      2) No data age of the slave is checked. Evan a slave with very old data
         can manually failover a master in this way.
      3) No chat with the master is attempted to reach its replication offset:
         the master can just be down.
      e84dcabf
    • A
      Cluster: bypass data_age check for manual failovers. · b5cdd42b
      antirez 提交于
      Automatic failovers only happen in Redis Cluster if the slave trying to
      be elected was disconnected from its master for no more than 10 times
      the node-timeout value. However there should be no such a check for
      manual failovers, since these are initiated by the sysadmin that, in
      theory, knows what she is doing when a slave is selected to be promoted.
      b5cdd42b
  2. 19 5月, 2014 2 次提交
  3. 12 5月, 2014 10 次提交
  4. 09 5月, 2014 3 次提交
  5. 08 5月, 2014 3 次提交
    • A
      Sentinel: log when a failover will be attempted again. · 6eadeb72
      antirez 提交于
      When a Sentinel performs a failover (successful or not), or when a
      Sentinel votes for a different Sentinel trying to start a failover, it
      sets a min delay before it will try to get elected for a failover.
      
      While not strictly needed, because if multiple Sentinels will try
      to failover the same master at the same time, only one configuration
      will eventually win, this serialization is practically very useful.
      Normal failovers are cleaner: one Sentinel starts to failover, the
      others update their config when the Sentinel performing the failover
      is able to get the selected slave to move from the role of slave to the
      one of master.
      
      However currently this timeout was implicit, so users could see
      Sentinels not reacting, after a failed failover, for some time, without
      giving any feedback in the logs to the poor sysadmin waiting for clues.
      
      This commit makes Sentinels more verbose about the delay: when a master
      is down and a failover attempt is not performed because the delay has
      still not elaped, something like that will be logged:
      
          Next failover delay: I will not start a failover
          before Thu May  8 16:48:59 2014
      6eadeb72
    • A
      Sentinel: generate +config-update-from event when a new config is received. · c738f0f4
      antirez 提交于
      This event makes clear, before the switch-master event is generated,
      that a Sentinel received a configuration update from another Sentinel.
      c738f0f4
    • A
      REDIS_ENCODING_EMBSTR_SIZE_LIMIT set to 39. · d8ffd2b2
      antirez 提交于
      The new value is the limit for the robj + SDS header + string +
      null-term to stay inside the 64 bytes Jemalloc arena in 64 bits
      systems.
      d8ffd2b2
  6. 07 5月, 2014 11 次提交