提交 · 258d377d8ee428698e9b8fa323bd32612786954a · Turbo码先生 / redis

18 3月, 2014 6 次提交

A

Sentinel test: 02 unit better coverage + refactoring. · 258d377d
由 antirez 提交于 3月 18, 2014

258d377d
A

Sentinel test: foreach_instance_id implements 'break'. · 58f104e2
由 antirez 提交于 3月 18, 2014

58f104e2
A

Sentinel: instance_is_killed proc added to sentinel.tcl. · 2586ea76
由 antirez 提交于 3月 18, 2014

2586ea76
A

Sentinel: propagate down-after-ms changes to slaves and sentinels. · 218cc5fc
由 antirez 提交于 3月 18, 2014

218cc5fc

Sentinel: down-after-milliseconds is not master-specific. · bb6d8501

由 antirez 提交于 3月 18, 2014

addReplySentinelRedisInstance() modified so that this field is displayed
for all the kind of instances: Sentinels, Masters, Slaves.

bb6d8501

Sentinel failure detection implementation improved. · ae0b7680

由 antirez 提交于 3月 17, 2014

Failure detection in Sentinel is ping-pong based. It used to work by
remembering the last time a valid PONG reply was received, and checking
if the reception time was too old compared to the current current time.

PINGs were sent at a fixed interval of 1 second.

This works in a decent way, but does not scale well when we want to set
very small values of "down-after-milliseconds" (this is the node
timeout basically).

This commit reiplements the failure detection making a number of
changes. Some changes are inspired to Redis Cluster failure detection
code:

* A new last_ping_time field is added in representation of instances.
  If non zero, we have an active ping that was sent at the specified
  time. When a valid reply to ping is received, the field is zeroed
  again.
* last_ping_time is not reset when we reconnect the link or send a new
  ping, so from our point of view it represents the time we started
  waiting for the instance to reply to our pings without receiving a
  reply.
* last_ping_time is now used in order to check if the instance is
  timed out. This means that we can have a node timeout of 100
  milliseconds and yet the system will work well since the new check is
  not bound to the period used to send pings.
* Pings are now sent every second, or often if the value of
  down-after-milliseconds is less than one second. With a lower limit of
  10 HZ ping frequency.
* Link reconnection code was improved. This is used in order to try to
  reconnect the link when we are at 50% of the node timeout without a
  valid reply received yet. However the old code triggered unnecessary
  reconnections when the node timeout was very small. Now that should be
  ok.

The new code passes the tests but more testing is needed and more unit
tests stressing the failure detector, so currently this is merged only
in the unstable branch.

ae0b7680

15 3月, 2014 3 次提交
- A
  Sentinel: use CLIENT SETNAME when connecting to Redis. · 3a2ff556
  由 antirez 提交于 3月 15, 2014
```
This makes debugging / monitoring of Sentinels simpler since you can
identify sentinels in CLIENT LIST output of Redis instances.
```
  3a2ff556
- S
  Merge pull request #1608 from mattsta/fix-sentinel-current-epoch-segfault · c65b75e7
  由 Salvatore Sanfilippo 提交于 3月 14, 2014
```
Fix segfault from accessing array out of bounds
```
  c65b75e7
- M
  Fix segfault from accessing array out of bounds · 584052ee
  由 Matt Stancliff 提交于 3月 14, 2014
```
argc == 2; argv[2] == crash
```
  584052ee
14 3月, 2014 3 次提交

Sentinel: be safe under crash-recovery assumptions. · ed813863

由 antirez 提交于 3月 14, 2014

Sentinel's main safety argument is that there are no two configurations
for the same master with the same version (configuration epoch).

For this to be true Sentinels require to be authorized by a majority.
Additionally Sentinels require to do two important things:

* Never vote again for the same epoch.
* Never exchange an old vote for a fresh one.

The first prerequisite, in a crash-recovery system model, requires to
persist the master->leader_epoch on durable storage before to reply to
messages. This was not the case.

We also make sure to persist the current epoch in order to never reply
to stale votes requests from other Sentinels, after a recovery.

The configuration is persisted by making use of fsync(), this is
considered in the context of this code a good enough guarantee that
after a restart our durable state is restored, however this may not
always be the case depending on the kind of hardware and operating
system used.

ed813863

Sentinel: fake PUBLISH command to receive HELLO messages. · 36509402

由 antirez 提交于 3月 14, 2014

Now the way HELLO messages are received is unified.
Now it is no longer needed for Sentinels to converge to the higher
configuration for a master to be able to chat via some Redis instance,
the are able to directly exchanges configurations.

Note that this commit does not include the (trivial) change needed to
send HELLO messages to Sentinel instances as well, since for an error I
committed the change in the previous commit that refactored hello
messages processing into a separated function.

36509402

A

Sentinel: HELLO processing refactored into sentinelProcessHelloMessage(). · 9dfe426f
由 antirez 提交于 3月 14, 2014

9dfe426f

13 3月, 2014 2 次提交
- A
  
  Cluster: flag the transaction as dirty for the new redirections. · 133fccb0
  由 antirez 提交于 3月 11, 2014
  
  133fccb0
- A
  
  Linenoise updated, multiline mode enabled in redis-cli. · 429aff4e
  由 antirez 提交于 3月 13, 2014
  
  429aff4e
11 3月, 2014 9 次提交

A
redis-trib: call MIGRATE via r.client.call as fix for redis-rb API changes. · cc11d103
由 antirez 提交于 3月 11, 2014
```
See issue #1593.

Thanks to @badboy for suggesting the direct client.call fix.
```
cc11d103
A
redis-trib: new subcommand 'call'. Exec command in all nodes. · df32eb68
由 antirez 提交于 3月 11, 2014
```
Example:

./redis-trib.rb call 192.168.1.11:7000 config get cluster-node-timeout
```
df32eb68

redis-trib: create subcommand is now able to assign spare slaves. · 2e5c394f

由 antirez 提交于 3月 11, 2014

Example: if the user will try to configure a cluster with 9 nodes,
asking for 1 slave for master, redis-trib will configure a 4 masters
cluster with 1 slave each as usually, but this time will assign the
spare node as a slave of one of the masters.

2e5c394f

Cluster: update node configEpoch on UPDATE messages. · e26f4486

由 antirez 提交于 3月 11, 2014

The UPDATE message contains the configEpoch of the node configuration
advertised in the packet. Update it if needed.

e26f4486

Cluster: set slot error if we receive an update for a busy slot. · a2ff9091

由 antirez 提交于 3月 11, 2014

By manually modifying nodes configurations in random ways, it is possible
to create the following scenario:

A is serving keys for slot 10
B is manually configured to serve keys for slot 10

A receives an update from B (or another node) where it is informed that
the slot 10 is now claimed by B with a greater configuration epoch,
however A still has keys from slot 10.

With this commit A will put the slot in error setting it in IMPORTING
state, so that redis-trib can detect the issue.

a2ff9091

A

Cluster: clarified a comment in clusterUpdateSlotsConfigWith(). · 1ed0ad77
由 antirez 提交于 3月 11, 2014

1ed0ad77
A

Cluster: flush importing/migrating state when master is turned into slave. · 8287945f
由 antirez 提交于 3月 11, 2014

8287945f
A

Cluster: clusterCloseAllSlots() added. · 2e8e0ad4
由 antirez 提交于 3月 11, 2014

2e8e0ad4

DEBUG ERROR implemented. · 8eae54aa

由 antirez 提交于 3月 10, 2014

The new "error" subcommand of the DEBUG command can reply with an user
selected error, specified as its sole argument:

    DEBUG ERROR "LOADING please wait..."

The error is generated just prefixing the command argument with a "-"
character, and replacing newlines with spaces (since error replies can't
include newlines).

The goal of the command is to help in Client libraries unit tests by
making simple to simulate a command call triggering a given error.

8eae54aa

10 3月, 2014 17 次提交

DEBUG CMDKEYS: provide some guarantee to getKeysFromCommand(). · 2705306b

由 antirez 提交于 3月 10, 2014

getKeysFromCommand() is designed to be called with the command arguments
passing the basic arity checks described in the command table.

DEBUG CMDKEYS must provide the same guarantees for calling
getKeysFromCommand() to be safe.

2705306b

A
Cluster: make sortGetKeys() able to handle multiple STORE options. · 5b864617
由 antirez 提交于 3月 10, 2014
```
It does not make sense to pass multiple store options, so, better to
handle it ;-)
```
5b864617

DEBUG CMDKEYS added for getKeysFromCommand() testing. · c4ef1d64

由 antirez 提交于 3月 10, 2014

Examples:

    redis 127.0.0.1:6379> debug cmdkeys set foo bar
    1) "foo"
    redis 127.0.0.1:6379> debug cmdkeys mget a b c
    1) "a"
    2) "b"
    3) "c"
    redis 127.0.0.1:6379> debug cmdkeys zunionstore foo 2 a b
    1) "a"
    2) "b"
    3) "foo"
    redis 127.0.0.1:6379> debug cmdkeys ping
    (empty list or set)

c4ef1d64

Cluster: don't allow BY option of SORT as well. · 3e1d7726

由 antirez 提交于 3月 10, 2014

There is the exception of a "constant" BY pattern that is used in order
to signal to don't sort at all. In this case no lookup is needed so it
is possible to support this case in Cluster mode.

3e1d7726

A

Cluster: SORT get keys helper implemented. · 04cf02e8
由 antirez 提交于 3月 10, 2014

04cf02e8
A

Cluster: evalGetKeys() fixed: was not setting keys count. · 21765c85
由 antirez 提交于 3月 10, 2014

21765c85
A
Cluster: don't allow GET option in cluster mode. · 03344196
由 antirez 提交于 3月 10, 2014
```
The commit also refactors a bit the error handling during SORT option
parsing.
```
03344196
A

Fixed memory leak in SORT LIMIT option argument parsing on error. · 8caecc9a
由 antirez 提交于 3月 10, 2014

8caecc9a
A

Cluster: getKeysFromCommand() top comment improved. · ef5e7fba
由 antirez 提交于 3月 10, 2014

ef5e7fba

Cluster: evalGetKey() added for EVAL/EVALSHA. · c0e818ab

由 antirez 提交于 3月 10, 2014

Previously we used zunionInterGetKeys(), however after this function was
fixed to account for the destination key (not needed when the API was
designed for "diskstore") the two set of commands can no longer be served
by an unique keys-extraction function.

c0e818ab

A

Cluster: getKeysFromCommand() and related: top-comments added. · caf7b9b4
由 antirez 提交于 3月 10, 2014

caf7b9b4

Cluster: getKeysFromCommand() API cleaned up. · 787b2970

由 antirez 提交于 3月 10, 2014

This API originated from the "diskstore" experiment, not for Redis
Cluster itself, so there were legacy/useless things trying to
differentiate between keys that are going to be overwritten and keys
that need to be fetched from disk (preloaded).

All useless with Cluster, so removed with the result of code
simplification.

787b2970

A
Cluster: some zunionInterGetKeys() comment trimmed. · 55b88e00
由 antirez 提交于 3月 10, 2014
```
Everything was pretty clear again from the initial statements.
```
55b88e00
S
Merge pull request #1586 from mattsta/fix-zunioninterstorekeys · aca6cb52
由 Salvatore Sanfilippo 提交于 3月 10, 2014
```
Fix key extraction for z{union,inter}store
```
aca6cb52

Cluster: abort on port too high error. · c1a7d3e6

由 antirez 提交于 3月 10, 2014

It also fixes multi-line comment style to be consistent with the rest of
the code base.

Related to #1555.

c1a7d3e6

S
Merge pull request #1555 from mattsta/cluster-port-error-out · 442b06db
由 Salvatore Sanfilippo 提交于 3月 10, 2014
```
Cluster port error out
```
442b06db

Cluster: be explicit about passing NULL as bind addr for connect. · ed8c5523

由 antirez 提交于 3月 10, 2014

The code was already correct but it was using that bindaddr[0] is set to
NULL as a side effect of current implementation if no bind address is
configured. This is not guarnteed to hold true in the future.

ed8c5523

Turbo码先生 / redis 与 Fork 源项目一致

Turbo码先生 / redis
与 Fork 源项目一致