1. 13 Feb, 2014 (4 commits)
    • Update cached time in rdbLoad() callback. · 3c1672da
      Committed by antirez
      server.unixtime and server.mstime are cached, less precise timestamps
      that we use whenever we don't need an accurate time representation
      and a syscall would be too slow for the number of calls we require.
      
      An example is the initialization and update of the last interaction
      time with the client, which is used for timeouts.
      
      However rdbLoad() can take a long time to load the DB, and it did
      not update the cached time while loading. This resulted in the bug
      described in issue #1535: during replication the slave loads the DB
      and creates the redisClient representation of its master, but the
      cached timestamp is so old that the master, under certain
      conditions, is immediately sensed as already "timed out".
      
      Thanks to @yoav-steinberg and Redis Labs Inc for the bug report and
      analysis.
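      
      The caching pattern the message describes, as a minimal
      self-contained sketch (update_cached_time(), cached_unixtime and
      cached_mstime are illustrative stand-ins, not the actual Redis
      identifiers); the fix amounts to also calling the refresh from the
      callback that fires periodically while rdbLoad() reads the file:
      
          #include <stdio.h>
          #include <time.h>
          #include <sys/time.h>
          
          /* Stand-ins for the server.unixtime / server.mstime caches. */
          static time_t cached_unixtime;
          static long long cached_mstime;
          
          static long long mstime_now(void) {
              struct timeval tv;
              gettimeofday(&tv, NULL);
              return ((long long)tv.tv_sec) * 1000 + tv.tv_usec / 1000;
          }
          
          /* Refresh the cached clocks. Normally this runs from the server
           * cron; the fix also invokes it from the RDB loading progress
           * callback so the cache cannot go stale during a long load. */
          static void update_cached_time(void) {
              cached_unixtime = time(NULL);
              cached_mstime = mstime_now();
          }
          
          int main(void) {
              update_cached_time();
              printf("unixtime=%ld mstime=%lld\n",
                     (long)cached_unixtime, cached_mstime);
              return 0;
          }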
    • Log when CONFIG REWRITE goes bad. · 116617c5
      Committed by antirez
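      
      Based only on the title, a hedged sketch of the behavior change
      (rewrite_config() is an illustrative stand-in, not the actual
      rewriteConfig() internals): when the config rewrite fails, report
      the error loudly instead of failing silently.
      
          #include <errno.h>
          #include <stdio.h>
          #include <string.h>
          
          /* Stand-in for a config rewrite that can fail; returns -1 and
           * leaves the reason in errno. */
          static int rewrite_config(const char *path) {
              FILE *fp = fopen(path, "w");
              if (fp == NULL) return -1;
              fclose(fp);
              return 0;
          }
          
          int main(void) {
              if (rewrite_config("/nonexistent/redis.conf") == -1) {
                  /* The point of the commit: make the failure visible. */
                  fprintf(stderr, "CONFIG REWRITE failed: %s\n",
                          strerror(errno));
                  return 1;
              }
              return 0;
          }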
    • Test: regression for issue #1549. · 767846dc
      Committed by antirez
      It was verified that, after reverting the commit that fixes the bug,
      the test no longer passes.
    • Fix script cache bug in the scripting engine. · 14143fbe
      Committed by antirez
      This commit fixes a serious Lua scripting replication issue,
      described in Github issue #1549. The root cause of the problem is
      that scripts were put inside the script cache, on the assumption
      that slaves and the AOF already contained them, even when the
      scripts produced no changes in the data set and were therefore not
      actually propagated to the AOF/slaves.
      
      Example:
      
          eval "if tonumber(KEYS[1]) > 0 then redis.call('incr', 'x') end" 1 0
      
      Then:
      
          evalsha <sha1 step 1 script> 1 0
      
      At this step the sha1 of the script is added to the replication
      script cache (the script is marked as known to the slaves) and the
      EVALSHA command is transformed to EVAL. However the call is not
      dirty (there are no changes to the DB), so it is not propagated to
      the slaves. Then the script is called again:
      
          evalsha <sha1 step 1 script> 1 1
      
      At this step the master finds the script already in the replication
      script cache and doesn't transform the command to EVAL. This time
      the call is dirty and is propagated to the slaves, but they fail to
      evaluate the script as they don't have it in their script cache.
      
      The fix is trivial and just uses the new API to force the propagation of
      the executed command regardless of the dirty state of the data set.
      
      Thank you to @minus-infinity on Github for finding the issue,
      understanding the root cause, and fixing it.
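      
      A minimal model of the bug and the fix (all names are illustrative,
      not the actual Redis API; the real patch forces propagation of the
      executed command when its sha1 is first added to the replication
      script cache):
      
          #include <stdbool.h>
          #include <stdio.h>
          
          struct script_call {
              bool dirty;            /* did the script modify the dataset? */
              bool force_propagate;  /* the fix: set when the sha1 is first
                                      * added to the replication cache */
          };
          
          static bool propagated_to_slaves(const struct script_call *c) {
              /* Before the fix only the dirty check existed, so the first
               * (clean) call never reached the slaves even though its sha1
               * was already marked as known to them. */
              return c->dirty || c->force_propagate;
          }
          
          int main(void) {
              struct script_call first  = { .dirty = false,
                                            .force_propagate = true };
              struct script_call second = { .dirty = true,
                                            .force_propagate = false };
              printf("first call propagated: %d\n",
                     propagated_to_slaves(&first));
              printf("second call propagated: %d\n",
                     propagated_to_slaves(&second));
              return 0;
          }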
  2. 12 Feb, 2014 (2 commits)
    • AOF write error: retry with a frequency of 1 hz. · 0296aab6
      Committed by antirez
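      
      Based only on the title, a hedged sketch of a 1 Hz retry gate of the
      kind a server cron can use (names and structure are assumptions, not
      the actual patch):
      
          #include <stdio.h>
          
          /* The cron runs many times per second; the AOF retry fires at
           * most once per second while the AOF is in the error state. */
          static long long last_retry_ms = 0;
          
          static void cron_tick(long long now_ms, int aof_in_error) {
              if (aof_in_error && now_ms - last_retry_ms >= 1000) {
                  last_retry_ms = now_ms;
                  printf("retrying AOF write at t=%lldms\n", now_ms);
              }
          }
          
          int main(void) {
              for (long long t = 0; t <= 3500; t += 100) cron_tick(t, 1);
              return 0;
          }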
    • AOF: don't abort on write errors unless fsync is 'always'. · dd73a7bf
      Committed by antirez
      A system similar to the RDB write error handling is used: when we
      can't write to the AOF file, writes are no longer accepted until we
      are able to write again.
      
      For fsync == always we still abort on errors, since there is
      currently no easy way to otherwise avoid replying to the user with
      success, and that would violate the contract of only acknowledging
      data already secured on disk.
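      
      An illustrative model of the policy described above (identifiers are
      stand-ins, not the actual Redis code):
      
          #include <stdio.h>
          #include <stdlib.h>
          
          enum fsync_policy { FSYNC_NO, FSYNC_EVERYSEC, FSYNC_ALWAYS };
          
          static int aof_last_write_ok = 1;
          
          static void on_aof_write_error(enum fsync_policy policy) {
              if (policy == FSYNC_ALWAYS) {
                  /* We can't rule out having already acknowledged data
                   * that never reached the disk, so abort. */
                  fprintf(stderr, "AOF error with fsync=always, exiting\n");
                  exit(1);
              }
              /* Otherwise enter the error state: reject new writes until
               * a later retry (see the 1 Hz retry above) succeeds. */
              aof_last_write_ok = 0;
          }
          
          static int can_accept_write_commands(void) {
              return aof_last_write_ok;
          }
          
          int main(void) {
              on_aof_write_error(FSYNC_EVERYSEC);
              printf("writes accepted: %d\n", can_accept_write_commands());
              return 0;
          }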
  3. 11 Feb, 2014 (20 commits)
  4. 10 Feb, 2014 (9 commits)
  5. 08 Feb, 2014 (2 commits)
  6. 07 Feb, 2014 (1 commit)
  7. 05 Feb, 2014 (2 commits)
    • Check for EAGAIN in sendBulkToSlave(). · 970de3e9
      Committed by antirez
      Sometimes an OS X master with a Linux slave over a slow link caused
      a strange error: OS X called the writable handler for the socket,
      but apparently there was no room in the socket buffer to accept the
      write. The write(2) call returned an EAGAIN error that was not
      checked, so the failed write was always treated as a connection
      reset, which was unfortunate since the bulk transfer then has to
      start again from scratch.
      
      Also, more errors in the same code path are now logged at the
      WARNING level.
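      
      The corrected pattern, as a self-contained sketch (write_chunk() is
      illustrative; the real function is sendBulkToSlave()): a -1 return
      from write(2) with errno set to EAGAIN means "no buffer space, retry
      on the next writable event", not a broken connection.
      
          #include <errno.h>
          #include <stdio.h>
          #include <string.h>
          #include <unistd.h>
          
          static int write_chunk(int fd, const void *buf, size_t len) {
              ssize_t nwritten = write(fd, buf, len);
              if (nwritten == -1) {
                  if (errno == EAGAIN || errno == EWOULDBLOCK)
                      return 0;   /* not an error: try again later */
                  /* A real error: worth logging at WARNING level. */
                  fprintf(stderr, "Write error sending DB: %s\n",
                          strerror(errno));
                  return -1;
              }
              return (int)nwritten;
          }
          
          int main(void) {
              const char msg[] = "hello\n";
              return write_chunk(STDOUT_FILENO, msg, sizeof(msg) - 1) < 0;
          }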
    • Cluster: fixed MF condition in clusterHandleSlaveFailover(). · 04fe000b
      Committed by antirez
      For a manual failover we need a manual failover to actually be in
      progress, and mf_can_start to be true (master offset received and
      matched).
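      
      A hedged sketch of the shape of the check (field names follow the
      message; the exact expression in clusterHandleSlaveFailover() may
      differ):
      
          #include <stdio.h>
          
          /* Minimal stand-in for the relevant cluster state fields. */
          struct cluster_state {
              long long mf_end;    /* nonzero while a manual failover is
                                    * in progress */
              int mf_can_start;    /* master offset received and matched */
          };
          
          static int mf_actionable(const struct cluster_state *cs) {
              /* Both conditions must hold, per the commit message. */
              return cs->mf_end != 0 && cs->mf_can_start;
          }
          
          int main(void) {
              struct cluster_state cs = { .mf_end = 1, .mf_can_start = 0 };
              printf("actionable: %d\n", mf_actionable(&cs));
              return 0;
          }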