提交 · 237188bf1b0e90911b96ecffa948e0e2d6402514 · 别团等shy哥发育 / redis

26 6月, 2014 2 次提交

Cluster: Add CLUSTER SLOTS command · 237188bf

由 Matt Stancliff 提交于 6月 16, 2014

CLUSTER SLOTS returns a Redis-formatted mapping from
slot ranges to IP/Port pairs serving that slot range.

The outer return elements group return values by slot ranges.

The first two entires in each result are the min and max slots for the range.

The third entry in each result is guaranteed to be either
an IP/Port of the master for that slot range - OR - null
if that slot range, for some reason, has no master

The 4th and higher entries in each result are replica instances
for the slot range.

Output comparison:
127.0.0.1:7001> cluster nodes
f853501ec8ae1618df0e0f0e86fd7abcfca36207 127.0.0.1:7001 myself,master - 0 0 2 connected 4096-8191
5a2caa782042187277647661ffc5da739b3e0805 127.0.0.1:7005 slave f853501ec8ae1618df0e0f0e86fd7abcfca36207 0 1402622415859 6 connected
6c70b49813e2ffc9dd4b8ec1e108276566fcf59f 127.0.0.1:7007 slave 26f4729ca0a5a992822667fc16b5220b13368f32 0 1402622415357 8 connected
2bd5a0e3bb7afb2b56a2120d3fef2f2e4333de1d 127.0.0.1:7006 slave 32adf4b8474fdc938189dba00dc8ed60ce635b0f 0 1402622419373 7 connected
5a9450e8279df36ff8e6bb1c139ce4d5268d1390 127.0.0.1:7000 master - 0 1402622418872 1 connected 0-4095
32adf4b8474fdc938189dba00dc8ed60ce635b0f 127.0.0.1:7002 master - 0 1402622419874 3 connected 8192-12287
5db7d05c245267afdfe48c83e7de899348d2bdb6 127.0.0.1:7004 slave 5a9450e8279df36ff8e6bb1c139ce4d5268d1390 0 1402622417867 5 connected
26f4729ca0a5a992822667fc16b5220b13368f32 127.0.0.1:7003 master - 0 1402622420877 4 connected 12288-16383

127.0.0.1:7001> cluster slots
1) 1) (integer) 0
   2) (integer) 4095
   3) 1) "127.0.0.1"
      2) (integer) 7000
   4) 1) "127.0.0.1"
      2) (integer) 7004
2) 1) (integer) 12288
   2) (integer) 16383
   3) 1) "127.0.0.1"
      2) (integer) 7003
   4) 1) "127.0.0.1"
      2) (integer) 7007
3) 1) (integer) 4096
   2) (integer) 8191
   3) 1) "127.0.0.1"
      2) (integer) 7001
   4) 1) "127.0.0.1"
      2) (integer) 7005
4) 1) (integer) 8192
   2) (integer) 12287
   3) 1) "127.0.0.1"
      2) (integer) 7002
   4) 1) "127.0.0.1"
      2) (integer) 7006

237188bf

Cluster: myself->ip autodiscovery. · 8fdc857a

由 antirez 提交于 6月 25, 2014

Instead of having an hardcoded IP address in the node configuration, we
autodiscover it via MEET messages for automatic update when the node is
restarted with a different IP address.

This mechanism was discussed in the context of PR #1782.

8fdc857a

23 6月, 2014 1 次提交

Add REDIS_BIND_ADDR access macro · a2632f26

由 Matt Stancliff 提交于 4月 24, 2014

We need to access (bindaddr[0] || NULL) in a few places, so centralize
access with a nice macro.

a2632f26

21 6月, 2014 7 次提交
- A
  
  Cluster: clear NOADDR flag when updating node address. · 9eb02c6a
  由 antirez 提交于 6月 20, 2014
  
  9eb02c6a
- A
  
  Cluster: fix an error message when logging failover auth denied. · 6b566ab7
  由 antirez 提交于 6月 10, 2014
  
  6b566ab7
- A
  
  Cluster: better comment for clusterSendFailoverAuthIfNeeded() epoch test. · 326145b9
  由 antirez 提交于 6月 10, 2014
  
  326145b9
- A
  
  Cluster: log granted failover authorizations. · 78a687ec
  由 antirez 提交于 6月 10, 2014
  
  78a687ec
- A
  
  Cluster: log configEpoch updates to myself. · 61910a67
  由 antirez 提交于 6月 10, 2014
  
  61910a67
- A
  
  Cluster: log when a master denies a failover auth. · 111a5556
  由 antirez 提交于 6月 10, 2014
  
  111a5556
- A
  
  Cluster: cluster_my_epoch added to CLUSTER INFO output. · 5cdefb7c
  由 antirez 提交于 6月 10, 2014
  
  5cdefb7c
07 6月, 2014 2 次提交

Cluster: check that configEpoch never goes back. · 8b059f06

由 antirez 提交于 6月 07, 2014

Since there are ways to alter the configEpoch outside of the failover
procedure (for exampel CLUSTER SET-CONFIG-EPOCH and via the configEpoch
collision resolution algorithm), make always sure, before replacing our
configEpoch with a new one, that it is greater than the current one.

8b059f06

Cluster: SET-CONFIG-EPOCH should update currentEpoch. · 67029323

由 antirez 提交于 6月 07, 2014

SET-CONFIG-EPOCH, used by redis-trib at cluster creation time, failed to
update the currentEpoch, making it possible after a failover for a
server to set its configEpoch to a value smaller than the current one
(since configEpochs are obtained using currentEpoch).

The bug totally break the Redis Cluster algorithms and protocols
allowing for permanent split brain conditions about the slots
configuration as shown in issue #1799.

67029323

26 5月, 2014 1 次提交

Cluster: always allow ok -> fail switch in clusterUpdateState(). · eea0d41f

由 antirez 提交于 5月 26, 2014

There is a time defined by REDIS_CLUSTER_WRITABLE_DELAY where fail -> ok
switch is not possible after startup as a master for some time, however
the contrary (ok -> fail) should always be possible.

eea0d41f

23 5月, 2014 1 次提交
- A
  Cluster: slave validity factor is now user configurable. · 33f63ff9
  由 antirez 提交于 5月 22, 2014
```
Check the commit changes in the example redis.conf for more information.
```
  33f63ff9
20 5月, 2014 10 次提交

A
Cluster: use clusterSetNodeAsMaster() during slave failover. · 8c6e8680
由 antirez 提交于 5月 15, 2014
```
clusterHandleSlaveFailover() was reimplementing what
clusterSetNodeAsMaster() without any good reason.
```
8c6e8680

Cluster: clear todo_before_sleep flags when executing actions. · 41a72416

由 antirez 提交于 5月 15, 2014

Thanks to this change, when there is some code like:

    clusterDoBeforeSleep(CLUSTER_TODO_UPDATE_STATE|...);
    ... and later before returning to the event loop ...
    clusterUpdateState();

The clusterUpdateState() function will clar the flag and will not be
repeated in the clusterBeforeSleep() function. This especially important
for config save/fsync flags which are slow to execute and not a good
idea to repeat without a good reason.

This is implemented for all the CLUSTER_TODO flags.

41a72416

A

Fixed typo in CLUSTER RESET implementation. · 5685d15a
由 antirez 提交于 5月 15, 2014

5685d15a

CLUSTER RESET implemented. · 5efa5501

由 antirez 提交于 5月 15, 2014

The new command is able to reset a cluster node so that it starts again
as a fresh node. By default the command performs a soft reset (the same
as calling it as CLUSTER RESET SOFT), and the following steps are
performed:

1) All slots are set as unassigned.
2) The list of known nodes is flushed.
3) Node is set as master if it is a slave.

When an hard reset is performed with CLUSTER RESET HARD the following
additional operations are performed:

4) A new Node ID is created at random.
5) Epochs are set to 0.

CLUSTER RESET is useful both when the sysadmin wants to reconfigure a
node with a different role (for example turning a slave into a master)
and for testing purposes.

It also may play a role in automatically provisioned Redis Clusters,
since it allows to reset a node back to the initial state in order to be
reconfigured.

5efa5501

A

Remove trailing spaces from cluster.c file. · 687f84a3
由 antirez 提交于 5月 15, 2014

687f84a3
A

Cluster: don't accept cluster bus connections during startup. · be159490
由 antirez 提交于 5月 14, 2014

be159490

Cluster: better handling of stolen slots. · b8a71e5a

由 antirez 提交于 5月 14, 2014

The previous code handling a lost slot (by another master with an higher
configuration for the slot) was defensive, considering it an error and
putting the cluster in an odd state requiring redis-cli fix.

This was changed, because actually this only happens either in a
legitimate way, with failovers, or when the admin messed with the config
in order to reconfigure the cluster. So the new code instead will try to
make sure that the keys stored match the new slots map, by removing all
the keys in the slots we lost ownership from.

The function that deletes the keys from the lost slots is called only
if the node does not lose all its slots (resulting in a reconfiguration
as a slave of the node that got ownership). This is an optimization
since the replication code will anyway flush all the instance data in
a faster way.

b8a71e5a

A

Cluster: fixed data_age computation / check integer overflow. · e10ee072
由 antirez 提交于 5月 12, 2014

e10ee072

Cluster: forced failover implemented. · e84dcabf

由 antirez 提交于 5月 12, 2014

Using CLUSTER FAILOVER FORCE it is now possible to failover a master in
a forced way, which means:

1) No check to understand if the master is up is performed.
2) No data age of the slave is checked. Evan a slave with very old data
   can manually failover a master in this way.
3) No chat with the master is attempted to reach its replication offset:
   the master can just be down.

e84dcabf

Cluster: bypass data_age check for manual failovers. · b5cdd42b

由 antirez 提交于 5月 12, 2014

Automatic failovers only happen in Redis Cluster if the slave trying to
be elected was disconnected from its master for no more than 10 times
the node-timeout value. However there should be no such a check for
manual failovers, since these are initiated by the sysadmin that, in
theory, knows what she is doing when a slave is selected to be promoted.

b5cdd42b

12 5月, 2014 2 次提交
- A
  RESTORE: reply with -BUSYKEY special error code. · 0a707bab
  由 antirez 提交于 5月 12, 2014
```
The error when the target key is busy was a generic one, while it makes
sense to be able to distinguish between the target key busy error and
the others easily.
```
  0a707bab
- A
  CLUSTER MEET: better error messages when address is invalid. · fafe29cd
  由 antirez 提交于 5月 09, 2014
```
Fixes issue #1734.
```
  fafe29cd
09 5月, 2014 2 次提交

Cluster: bulk-accept new nodes connections. · 83c92d50

由 antirez 提交于 5月 09, 2014

The same change was operated for normal client connections. This is
important for Cluster as well, since when a node rejoins the cluster,
when a partition heals or after a restart, it gets flooded with new
connection attempts by all the other nodes trying to form a full
mesh again.

83c92d50

A

Cluster: clusterAcceptHandler() comments updated to match the code. · ed18587f
由 antirez 提交于 5月 09, 2014

ed18587f

05 5月, 2014 1 次提交

CLUSTER SET-CONFIG-EPOCH implemented. · c4c7389f

由 antirez 提交于 4月 29, 2014

Initially Redis Cluster accepted that after cluster creation all the
nodes were at configEpoch 0, evolving from zero as failovers happen.

However later the semantic was made more strict in order to make sure a
cluster has always all the master nodes with a different configEpoch,
which is more robust in some corner case (especially resulting from
errors by the system administrator).

To assign different configEpochs to different nodes at startup was a
task performed naturally by the config conflicts resolution algorithm
(see the Cluster specification). However this works well only for small
clusters or when there are actually just a few collisions, since it is
designed for exceptional cases.

When a large cluster is created hundred of nodes can be at epoch 0, so
the conflict resolution code is slow to provide an unique config to each
node. For this reason this new command was introduced. It can be called
only when a node is totally fresh: no other nodes known, and configEpoch
set to zero, so it is safe even against misuses.

redis-trib will use the new command in order to start the cluster
already setting an incremental unique config to every node.

c4c7389f

29 4月, 2014 2 次提交

clusterLoadConfig() REDIS_ERR retval semantics refined. · b008863e

由 antirez 提交于 4月 24, 2014

We should return REDIS_ERR to signal we can't read the configuration
because there is no config file only after checking errno, othewise
we risk to rewrite an existing file that was not accessible for some
other reason.

b008863e

Lock nodes.conf to avoid multiple processes using the same file. · 71d71814

由 antirez 提交于 4月 24, 2014

This was a common source of problems among users.
The solution adopted is not bullet-proof as if the user deletes the
nodes.conf file manually, and starts a new instance with the same
nodes.conf file path, two instances will use the same file. However
following this reasoning the user may drop a nuclear bomb into the
datacenter as well.

71d71814

23 4月, 2014 1 次提交
- K
  
  fix cluster node description showing wrong slot allocation · 19ac9af0
  由 kingsumos 提交于 4月 22, 2014
  
  19ac9af0
16 4月, 2014 8 次提交

Add casting to match printf format. · 437cddee

由 antirez 提交于 4月 07, 2014

adjustOpenFilesLimit() and clusterUpdateSlotsWithConfig() that were
assuming uint64_t is the same as unsigned long long, which is true
probably for all the systems out there that we target, but still GCC
emitted a warning since technically they are two different types.

437cddee

A
Cluster: last_vote_epoch -> lastVoteEpoch. · 8c6ce3dd
由 antirez 提交于 3月 27, 2014
```
Use cammel case for epochs that are persisted on disk.
```
8c6ce3dd
A
Cluster: save/restore vars that must persist after recovery. · 1080e272
由 antirez 提交于 3月 27, 2014
```
This fixes issue #1479.
```
1080e272

Cluster: handshake "already known" error logged to VERBOSE. · 47fbbd9f

由 antirez 提交于 3月 26, 2014

This is not really an error but something that always happens for
example when creating a new cluster, or if the sysadmin rejoins manually
a node that is already known.

Since useless logs don't help, moved to VERBOSE level.

47fbbd9f

Cluster: clusterHandleConfigEpochCollision() fixed. · 6875a158

由 antirez 提交于 3月 26, 2014

New config epochs must always be obtained incrementing the currentEpoch,
that is itself guaranteed to be >= the max configEpoch currently known
to the node.

6875a158

A

Cluster: better logging for clusterUpdateSlotsConfigWith(). · 74fd89b3
由 antirez 提交于 3月 26, 2014

74fd89b3
A
Cluster: CLUSTER SETSLOT implementation comment updated. · cf8f72c1
由 antirez 提交于 3月 25, 2014
```
Update the comment since the implementation details changed.
```
cf8f72c1

Cluster: configEpoch collisions resolution. · 431573ae

由 antirez 提交于 3月 25, 2014

The slave election in Redis Cluster guarantees that slaves promoted to
masters always end with unique config epochs, however failures during
manual reshardings, software bugs and operational errors may in theory
cause two nodes to have the same configEpoch.

This commit introduces a mechanism to eventually always end with different
configEpochs if a collision ever happens.

As a (wanted) side effect, this also ensures that after a new cluster
is created, all nodes will end with a different configEpoch automatically.

431573ae

别团等shy哥发育 / redis 与 Fork 源项目一致

别团等shy哥发育 / redis
与 Fork 源项目一致