1. 06 Jul, 2017 · 3 commits
  2. 27 Jun, 2017 · 1 commit
    • RDB module values serialization format version 2. · 5af0fc0c
      committed by antirez
      The original RDB serialization format was not parsable without the
      module loaded, because the structure was managed only by the module
      itself. Moreover, RDB is a streaming protocol in the sense that it is
      both produced in an append-only fashion and sometimes sent directly
      to the socket (in the case of diskless replication).
      
      The fact that module values cannot be parsed without the relevant
      module loaded is a problem in many ways: RDB checking tools must have
      the modules loaded even for doing things not involving the value at
      all, like splitting an RDB into N RDBs by key or the like, or just
      checking the RDB for sanity.
      
      In theory module values could be just a blob of data with a prefixed
      length in order for us to be able to skip them. However prefixing the
      values with a length would mean one of the following:
      
      1. To be able to write some data at a previous offset. This breaks
      streaming.
      2. To buffer values before outputting them. This breaks performance.
      3. To have some chunked RDB output format. This breaks simplicity.
      
      Moreover, the above solution would still make module values a totally
      opaque matter, with the following problems:
      
      1. The RDB check tool could just skip the value without being able to
      at least check the general structure. For datasets composed mostly of
      module values this means checking only the outer level of the RDB,
      without actually doing any check on most of the data itself.
      2. It would not be possible to do any recovery or processing of data
      for which a module no longer exists in the future, or is unknown.
      
      So this commit implements a different solution. The modules RDB
      serialization API is composed of well defined calls to store integers,
      floats, doubles or strings. After this commit, the parts generated by
      the module API have a one-byte prefix for each of the above emitted
      parts, and there is a final EOF byte as well. So even if we don't know
      exactly how to interpret a module value, we can always parse it at a
      high level, check the overall structure, understand the types used to
      store the information, and easily skip the whole value.
      
      The change is backward compatible: older RDB files can still be
      loaded since the new encoding has a new RDB type: MODULE_2 (of value
      7). The commit also implements the ability to check RDB files for
      sanity, taking advantage of the new feature.
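      
      To make the high-level parsing concrete, here is a minimal sketch of
      how a checking tool could skip a whole MODULE_2 value without the
      module loaded. The opcode numbers and the fixed-width encodings are
      simplifying assumptions for the example, not the exact rdb.c format:
      
          /* Skip one MODULE_2 value: read a one-byte opcode per part
           * until the EOF opcode. Opcode values and encodings here are
           * illustrative assumptions, not the actual rdb.c constants. */
          #include <stdint.h>
          #include <stdio.h>
          
          enum {
              PART_EOF = 0,    /* End of the module value. */
              PART_INT = 1,    /* Assume an 8-byte integer follows. */
              PART_DOUBLE = 2, /* Assume an 8-byte double follows. */
              PART_STRING = 3  /* Assume 4-byte length + bytes follow. */
          };
          
          /* Returns 0 on success, -1 if the value is malformed. */
          int skip_module_value(FILE *fp) {
              int opcode;
              while ((opcode = fgetc(fp)) != EOF) {
                  if (opcode == PART_EOF) return 0;
                  if (opcode == PART_INT || opcode == PART_DOUBLE) {
                      if (fseek(fp, 8, SEEK_CUR)) return -1;
                  } else if (opcode == PART_STRING) {
                      uint8_t b[4];
                      if (fread(b, 1, 4, fp) != 4) return -1;
                      uint32_t len = (uint32_t)b[0] | ((uint32_t)b[1] << 8) |
                                     ((uint32_t)b[2] << 16) |
                                     ((uint32_t)b[3] << 24);
                      if (fseek(fp, (long)len, SEEK_CUR)) return -1;
                  } else {
                      return -1; /* Unknown part type: corrupt value. */
                  }
              }
              return -1; /* Stream ended before the EOF opcode. */
          }
      
      Even a tool that cannot interpret the parts can validate the overall
      structure this way, which is what the new sanity check relies on.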
      5af0fc0c
  3. 11 May, 2017 · 7 commits
  4. 18 Apr, 2017 · 3 commits
    • Make more obvious why there was issue #3843. · a5b66da8
      committed by antirez
      a5b66da8
    • Fix modules blocking commands wakeup delay. · f60d6f09
      committed by antirez
      If a thread unblocks a client blocked in a module command by using the
      RedisModule_UnblockClient() API, the event loop may not be woken up
      until the next timeout of the multiplexing API or the next unrelated
      I/O operation on other clients. We actually want the client to be
      served ASAP, so a mechanism is needed for the unblocking API to inform
      Redis that there is a client to serve ASAP.
      
      This commit fixes the issue using the old trick of the pipe: when a
      client needs to be unblocked, a byte is written to a pipe. When we run
      the list of clients blocked in modules, we consume all the bytes
      written to the pipe. Writes and reads are performed while holding the
      mutex, so no race is possible in which we consume bytes related to a
      wakeup request for a client that was not yet put into the list of
      clients to unblock.
      
      It was verified that after the fix the server handles the blocked
      clients with the expected short delay.
      
      Thanks to @dvirsky for understanding there was such a problem and
      reporting it.
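      
      For reference, a minimal sketch of the self-pipe pattern described
      above follows. The queue stands in for the real list of module-blocked
      clients, and all names are illustrative rather than the actual
      module.c code:
      
          #include <fcntl.h>
          #include <pthread.h>
          #include <unistd.h>
          
          #define QUEUE_MAX 1024
          static int queue[QUEUE_MAX]; /* Pending client ids (placeholder). */
          static int queue_len = 0;
          static int wakeup_pipe[2];   /* [0] = read end, [1] = write end. */
          static pthread_mutex_t mtx = PTHREAD_MUTEX_INITIALIZER;
          
          void wakeup_init(void) {
              pipe(wakeup_pipe);
              /* Non-blocking read end so draining the pipe never stalls.
               * The event loop registers wakeup_pipe[0] for readability,
               * so the multiplexing call returns as soon as a byte lands. */
              fcntl(wakeup_pipe[0], F_SETFL, O_NONBLOCK);
          }
          
          void unblock_client(int client_id) { /* Callable from any thread. */
              pthread_mutex_lock(&mtx);
              if (queue_len < QUEUE_MAX) queue[queue_len++] = client_id;
              write(wakeup_pipe[1], "x", 1);   /* Wake the event loop. */
              pthread_mutex_unlock(&mtx);
          }
          
          void serve_unblocked_clients(void) { /* Runs in the event loop. */
              char buf[128];
              pthread_mutex_lock(&mtx);
              /* Drain under the mutex: every byte consumed matches a
               * client already queued, since writers hold the same mutex. */
              while (read(wakeup_pipe[0], buf, sizeof(buf)) > 0);
              while (queue_len > 0) {
                  int client_id = queue[--queue_len];
                  (void)client_id; /* ... serve the client here ... */
              }
              pthread_mutex_unlock(&mtx);
          }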
      f60d6f09
    • Fixed free of blocked client before referring to it. · 17250409
      committed by Dvir Volk
      17250409
  5. 22 Feb, 2017 · 1 commit
    • Use SipHash hash function to mitigate HashDoS attempts. · ba647598
      committed by antirez
      This change attempts to switch to a hash function which mitigates
      the effects of the HashDoS attack (a denial of service attack trying
      to force data structures into worst case behavior), while at the same
      time providing Redis with a hash function that does not expect the
      input data to be word aligned, a condition no longer true now that
      sds.c strings have a variable length header.
      
      Note that even with a hash function for which collisions cannot be
      generated without knowing the seed, implementation details or an
      indirect exposure of the seed (for example the ability to add elements
      to a Set and check the order in which Redis returns them with
      SMEMBERS) may sometimes make the attacker's life simpler in the
      process of trying to guess the correct seed. However the next step
      would be to switch to a log(N) data structure when too many items in
      a single bucket are detected: this seems like overkill in the case of
      Redis.
      
      SPEED REGRESSION TESTS:
      
      In order to verify that switching from MurmurHash to SipHash had
      no impact on speed, a set of benchmarks involving fast insertion
      of 5 million keys was performed.
      
      The results show Redis with SipHash in high pipelining conditions
      to be about 4% slower compared to the previous hash function.
      However this could partially be related to the fact that the current
      implementation does not attempt to hash whole words at a time but
      reads single bytes, in order to have an output which is endian-neutral
      and at the same time works on systems where unaligned memory accesses
      are a problem.
      
      Further x86 specific optimizations should be tested; the function
      may easily reach the same level as MurmurHash2 if a few optimizations
      are performed.
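      
      As an illustration of the byte-at-a-time reads mentioned above, a
      64-bit little-endian load can be written so that it is both
      endian-neutral and safe where unaligned word access faults. This is a
      generic sketch, not the actual siphash.c code:
      
          #include <stdint.h>
          
          /* Endian-neutral, alignment-safe 64-bit little-endian load:
           * each byte is read individually and shifted into place, so
           * the result is identical on any platform. A direct cast like
           * *(uint64_t *)p would be faster on x86 but is undefined
           * behavior on strict-alignment architectures. */
          static uint64_t load64_le(const uint8_t *p) {
              return  (uint64_t)p[0]        | ((uint64_t)p[1] << 8)  |
                     ((uint64_t)p[2] << 16) | ((uint64_t)p[3] << 24) |
                     ((uint64_t)p[4] << 32) | ((uint64_t)p[5] << 40) |
                     ((uint64_t)p[6] << 48) | ((uint64_t)p[7] << 56);
          }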
      ba647598
  6. 12 Jan, 2017 · 1 commit
  7. 16 Dec, 2016 · 1 commit
  8. 14 Dec, 2016 · 1 commit
    • Replication: fix the infamous key leakage of writable slaves + EXPIRE. · c65dfb43
      committed by antirez
      BACKGROUND AND USE CASE
      
      Redis slaves are normally read only, however they support a "writable"
      mode which is very handy when scaling reads on slaves that actually
      need write operations in order to access data. For instance imagine
      having slaves replicating certain Sets keys from the master. When
      accessing the data on the slave, we want to perform intersections
      between such Sets values. However we don't want to compute the
      intersection each time: caching it for some time is often a good idea.
      
      To do so, it is possible to set up a slave as a writable slave, and
      perform the intersection on the slave side, perhaps setting a TTL on
      the resulting key so that it will expire after some time.
      
      THE BUG
      
      Problem: in order to have consistent replication, expiring keys in
      Redis replication is up to the master, which synthesizes DEL
      operations to send in the replication stream. However slaves logically
      expire keys by hiding them from read attempts of clients, so that if
      the master did not promptly send a DEL, the client still sees
      logically expired keys as non existing.
      
      Because slaves don't actively expire keys by actually evicting them,
      but just mask them from the POV of read operations, if a key is
      created in a writable slave and an expire is set, the key will be
      leaked forever:
      
      1. No DEL will be received from the master, which does not know about
      such a key at all.
      
      2. No eviction will be performed by the slave, since eviction must be
      disabled there: it's up to the master, otherwise consistency of data
      is lost.
      
      THE FIX
      
      In order to fix the problem, the slave should be able to tag, in some
      way, keys that were created on the slave side and have an expire set.
      
      My solution involved using a unique additional dictionary, created by
      the writable slave only if needed. The dictionary is obviously keyed
      by the key name that we need to track: all the keys that are set with
      an expire directly by a client writing to the slave are tracked.
      
      The value in the dictionary is a bitmap of all the DBs where such a
      key name needs to be tracked, so that we can use a single dictionary
      to track keys in all the DBs used by the slave (this actually limits
      the solution to the first 64 DBs, but the default with Redis is to
      use 16 DBs).
      
      The cost of this solution, in both complexity and CPU, is small, and
      actually zero when the feature is not used. The slave-side eviction is
      encapsulated in code which is not coupled with the rest of the Redis
      core, except for the hook used to track the keys.
      
      TODO
      
      I'm doing the first smoke tests to see if the feature works as expected:
      so far so good. Unit tests should be added before merging into the
      4.0 branch.
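      
      A sketch of the bitmap idea described above follows. The fixed table
      stands in for the real dictionary, and all names are illustrative
      rather than the actual Redis code:
      
          #include <stdint.h>
          #include <string.h>
          
          /* One entry per tracked key name; 'dbs' packs one bit per DB
           * index. A single 64-bit mask covers DBs 0..63, which is why
           * the fix is limited to the first 64 DBs (the default Redis
           * configuration uses 16). */
          struct tracked { char name[64]; uint64_t dbs; };
          static struct tracked table[256];
          static int table_len = 0;
          
          /* Record that <name> received an expire in DB <dbid> via a
           * client writing to the slave. */
          void track_slave_key(const char *name, int dbid) {
              for (int i = 0; i < table_len; i++) {
                  if (strcmp(table[i].name, name) == 0) {
                      table[i].dbs |= 1ULL << dbid; /* Add this DB's bit. */
                      return;
                  }
              }
              if (table_len < 256) {
                  strncpy(table[table_len].name, name,
                          sizeof(table[0].name) - 1);
                  table[table_len].name[sizeof(table[0].name) - 1] = '\0';
                  table[table_len++].dbs = 1ULL << dbid;
              }
          }
          
          /* The periodic slave-side cycle checks only the DBs whose bit
           * is set and evicts the key there once it is expired. */
          int is_tracked_in_db(const char *name, int dbid) {
              for (int i = 0; i < table_len; i++)
                  if (strcmp(table[i].name, name) == 0)
                      return (int)((table[i].dbs >> dbid) & 1);
              return 0;
          }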
      c65dfb43
  9. 05 Dec, 2016 · 1 commit
  10. 30 Nov, 2016 · 1 commit
  11. 24 Nov, 2016 · 1 commit
  12. 01 Nov, 2016 · 1 commit
  13. 13 Oct, 2016 · 2 commits
  14. 07 Oct, 2016 · 4 commits
  15. 06 Oct, 2016 · 2 commits
    • Module: Ability to get context from IO context. · 152c1b68
      committed by antirez
      It was noted by @dvirsky that it is not possible to use string
      functions when writing the AOF file. This sometimes is critical since
      the command rewriting may need to be built in the context of the AOF
      callback, and without access to a context, given the limited set of
      types that the AOF production functions accept, this can be an issue.
      
      Moreover there are other needs that we can't anticipate, regarding
      the ability to use the Redis Modules APIs with a context in order to
      build the representations to emit in the AOF / RDB.
      
      Because of this a new API was added that allows the user to get a
      temporary context from the IO context. The context is automatically
      released, if obtained, when the RDB / AOF callback returns.
      
      Calling the function to get the context multiple times always returns
      the same one, since it is invalid to have more than a single context.
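      
      A sketch of how a module data type could use this inside its AOF
      rewrite callback, where the type and command names are illustrative:
      
          #include "redismodule.h"
          
          struct mytype { long long counter; };
          
          void MyTypeAofRewrite(RedisModuleIO *aof, RedisModuleString *key,
                                void *value) {
              struct mytype *o = value;
              /* Temporary context, auto-released when this callback
               * returns; calling RedisModule_GetContextFromIO() again
               * would return the same context. */
              RedisModuleCtx *ctx = RedisModule_GetContextFromIO(aof);
              RedisModuleString *count =
                  RedisModule_CreateStringFromLongLong(ctx, o->counter);
              RedisModule_EmitAOF(aof, "MYTYPE.SET", "ss", key, count);
          }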
      152c1b68
    • Copyright notice added to module.c. · 72279e3e
      committed by antirez
      72279e3e
  16. 03 Oct, 2016 · 1 commit
  17. 02 Oct, 2016 · 1 commit
  18. 21 Sep, 2016 · 1 commit
  19. 14 Sep, 2016 · 1 commit
    • dict.c: introduce dictUnlink(). · afcbcc0e
      committed by oranagra
      Notes by @antirez:
      
      This patch was picked from a larger commit by Oran and adapted to
      change the API a bit. The basic idea is to avoid double lookups when
      there is a need to use the value of the deleted entry.
      
      BEFORE:
      
          entry = dictFind( ... ); /* 1st lookup. */
          /* Do something with the entry. */
          dictDelete(...);         /* 2nd lookup. */
      
      AFTER:
      
          entry = dictUnlink( ... ); /* 1st lookup. */
          /* Do something with the entry. */
          dictFreeUnlinkedEntry(entry); /* No lookups! */
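      
      For instance, a caller that needs the value before freeing the entry
      could look like this (a sketch using the signatures shown above;
      consume() is a hypothetical placeholder):
      
          dictEntry *entry = dictUnlink(d, key); /* 1st and only lookup. */
          if (entry != NULL) {
              void *val = dictGetVal(entry);     /* Use the value... */
              consume(val);                      /* ...before freeing. */
              dictFreeUnlinkedEntry(entry);      /* No 2nd lookup. */
          }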
      afcbcc0e
  20. 09 Sep, 2016 · 1 commit
  21. 04 Aug, 2016 · 1 commit
  22. 03 Aug, 2016 · 1 commit
  23. 02 Aug, 2016 · 1 commit
    • Modules: StringAppendBuffer() and ability to retain strings. · 7829e4ed
      committed by antirez
      RedisModule_StringRetain() allows, when automatic memory management
      is on, to keep string objects alive after the callback returns. It
      can also be used in order to rely on Redis' reference counting of
      objects inside modules.
      
      The reason why this is useful is that sometimes, when implementing
      new data types, we want to reference RedisModuleString objects inside
      the module's private data structures, so those string objects must
      remain valid after the callback returns even if not referenced inside
      the Redis key space.
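      
      A sketch of the intended usage, with illustrative type and command
      names, is shown below:
      
          #include "redismodule.h"
          
          struct mynode { RedisModuleString *name; };
          
          int MyTypeAdd_RedisCommand(RedisModuleCtx *ctx,
                                     RedisModuleString **argv, int argc) {
              if (argc != 3) return RedisModule_WrongArity(ctx);
              RedisModule_AutoMemory(ctx); /* Automatic memory management. */
          
              struct mynode *n = RedisModule_Alloc(sizeof(*n));
              /* Without this call, argv[2] would be released when the
               * callback returns; retaining it keeps the reference in
               * 'n' valid afterwards. */
              RedisModule_StringRetain(ctx, argv[2]);
              n->name = argv[2];
          
              /* ... link 'n' into the key's private data structure ... */
              return RedisModule_ReplyWithSimpleString(ctx, "OK");
          }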
      7829e4ed
  24. 23 Jun, 2016 · 2 commits