README.md 20.1 KB
Newer Older
1
This README is just a fast *quick start* document. You can find more detailed documentation at [redis.io](https://redis.io).
2

A
antirez 已提交
3 4 5
What is Redis?
--------------

6
Redis is often referred as a *data structures* server. What this means is that Redis provides access to mutable data structures via a set of commands, which are sent using a *server-client* model with TCP sockets and a simple protocol. So different processes can query and modify the same data structures in a shared way.
A
antirez 已提交
7 8 9 10 11

Data structures implemented into Redis have a few special properties:

* Redis cares to store them on disk, even if they are always served and modified into the server memory. This means that Redis is fast, but that is also non-volatile.
* Implementation of data structures stress on memory efficiency, so data structures inside Redis will likely use less memory compared to the same data structure modeled using an high level programming language.
12
* Redis offers a number of features that are natural to find in a database, like replication, tunable levels of durability, cluster, high availability.
A
antirez 已提交
13

J
Jan-Erik Rediger 已提交
14
Another good example is to think of Redis as a more complex version of memcached, where the operations are not just SETs and GETs, but operations to work with complex data types like Lists, Sets, ordered data structures, and so forth.
A
antirez 已提交
15 16 17

If you want to know more, this is a list of selected starting points:

A
antirez 已提交
18 19 20 21
* Introduction to Redis data types. http://redis.io/topics/data-types-intro
* Try Redis directly inside your browser. http://try.redis.io
* The full list of Redis commands. http://redis.io/commands
* There is much more inside the Redis official documentation. http://redis.io/documentation
A
antirez 已提交
22

23 24
Building Redis
--------------
25 26

Redis can be compiled and used on Linux, OSX, OpenBSD, NetBSD, FreeBSD.
A
antirez 已提交
27 28
We support big endian and little endian architectures, and both 32 bit
and 64 bit systems.
29 30

It may compile on Solaris derived systems (for instance SmartOS) but our
31
support for this platform is *best effort* and Redis is not guaranteed to
32
work as well as in Linux, OSX, and \*BSD there.
33 34 35 36 37 38 39 40 41

It is as simple as:

    % make

You can run a 32 bit Redis binary using:

    % make 32bit

S
Saurabh Jha 已提交
42
After building Redis, it is a good idea to test it using:
43 44 45

    % make test

46
Fixing build problems with dependencies or cached build options
47 48
---------

A
antirez 已提交
49
Redis has some dependencies which are included into the `deps` directory.
S
Fixup  
Saurabh Jha 已提交
50 51
`make` does not automatically rebuild dependencies even if something in
the source code of dependencies changes.
52 53 54 55 56 57 58 59 60 61 62

When you update the source code with `git pull` or when code inside the
dependencies tree is modified in any other way, make sure to use the following
command in order to really clean everything and rebuild from scratch:

    make distclean

This will clean: jemalloc, lua, hiredis, linenoise.

Also if you force certain build options like 32bit target, no C compiler
optimizations (for debugging purposes), and other similar build time options,
63
those options are cached indefinitely until you issue a `make distclean`
64
command.
65

66 67 68 69 70
Fixing problems building 32 bit binaries
---------

If after building Redis with a 32 bit target you need to rebuild it
with a 64 bit target, or the other way around, you need to perform a
71
`make distclean` in the root directory of the Redis distribution.
72 73 74 75 76

In case of build errors when trying to build a 32 bit binary of Redis, try
the following steps:

* Install the packages libc6-dev-i386 (also try g++-multilib).
77 78
* Try using the following command line instead of `make 32bit`:
  `make CFLAGS="-m32 -march=native" LDFLAGS="-m32"`
79 80 81 82

Allocator
---------

P
Pieter Noordhuis 已提交
83 84 85 86 87
Selecting a non-default memory allocator when building Redis is done by setting
the `MALLOC` environment variable. Redis is compiled and linked against libc
malloc by default, with the exception of jemalloc being the default on Linux
systems. This default was picked because jemalloc has proven to have fewer
fragmentation problems than libc malloc.
88

P
Pieter Noordhuis 已提交
89
To force compiling against libc malloc, use:
90

P
Pieter Noordhuis 已提交
91
    % make MALLOC=libc
92

P
Pieter Noordhuis 已提交
93
To compile against jemalloc on Mac OS X systems, use:
94

P
Pieter Noordhuis 已提交
95
    % make MALLOC=jemalloc
96 97 98 99 100 101 102

Verbose build
-------------

Redis will build with a user friendly colorized output by default.
If you want to see a more verbose output use the following:

103
    % make V=1
A
antirez 已提交
104

105 106 107 108 109 110 111
Running Redis
-------------

To run Redis with the default configuration just type:

    % cd src
    % ./redis-server
S
Saurabh Jha 已提交
112

113 114 115 116 117 118
If you want to provide your redis.conf, you have to run it using an additional
parameter (the path of the configuration file):

    % cd src
    % ./redis-server /path/to/redis.conf

S
Saurabh Jha 已提交
119
It is possible to alter the Redis configuration by passing parameters directly
120 121
as options using the command line. Examples:

122
    % ./redis-server --port 9999 --replicaof 127.0.0.1 6379
123 124 125 126 127
    % ./redis-server /etc/redis/6379.conf --loglevel debug

All the options in redis.conf are also supported as options using the command
line, with exactly the same name.

128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145
Playing with Redis
------------------

You can use redis-cli to play with Redis. Start a redis-server instance,
then in another terminal try the following:

    % cd src
    % ./redis-cli
    redis> ping
    PONG
    redis> set foo bar
    OK
    redis> get foo
    "bar"
    redis> incr mycounter
    (integer) 1
    redis> incr mycounter
    (integer) 2
146
    redis>
147

A
antirez 已提交
148
You can find the list of all the available commands at http://redis.io/commands.
149

150 151 152 153 154 155 156
Installing Redis
-----------------

In order to install Redis binaries into /usr/local/bin just use:

    % make install

A
antirez 已提交
157
You can use `make PREFIX=/some/other/directory install` if you wish to use a
158 159 160 161 162 163 164 165 166
different destination.

Make install will just install binaries in your system, but will not configure
init scripts and configuration files in the appropriate place. This is not
needed if you want just to play a bit with Redis, but if you are installing
it the proper way for a production system, we have a script doing this
for Ubuntu and Debian systems:

    % cd utils
167
    % ./install_server.sh
168 169 170 171 172 173

The script will ask you a few questions and will setup everything you need
to run Redis properly as a background daemon that will start again on
system reboots.

You'll be able to stop and start Redis using the script named
A
antirez 已提交
174
`/etc/init.d/redis_<portnumber>`, for instance `/etc/init.d/redis_6379`.
175

176
Code contributions
S
Saurabh Jha 已提交
177
-----------------
178 179 180 181

Note: by contributing code to the Redis project in any form, including sending
a pull request via Github, a code fragment or patch via private email or
public discussion groups, you agree to release your code under the terms
J
Jan-Erik Rediger 已提交
182
of the BSD license that you can find in the [COPYING][1] file included in the Redis
183 184
source distribution.

J
Jan-Erik Rediger 已提交
185
Please see the [CONTRIBUTING][2] file in this source distribution for more
186 187
information.

J
Jan-Erik Rediger 已提交
188 189
[1]: https://github.com/antirez/redis/blob/unstable/COPYING
[2]: https://github.com/antirez/redis/blob/unstable/CONTRIBUTING
190 191 192 193 194 195 196 197

Redis internals
===

If you are reading this README you are likely in front of a Github page
or you just untarred the Redis distribution tar ball. In both the cases
you are basically one step away from the source code, so here we explain
the Redis source code layout, what is in each file as a general idea, the
A
antirez 已提交
198
most important functions and structures inside the Redis server and so forth.
S
Saurabh Jha 已提交
199 200
We keep all the discussion at a high level without digging into the details
since this document would be huge otherwise and our code base changes
S
Fixup  
Saurabh Jha 已提交
201
continuously, but a general idea should be a good starting point to
202 203 204 205 206 207 208
understand more. Moreover most of the code is heavily commented and easy
to follow.

Source code layout
---

The Redis root directory just contains this README, the Makefile which
S
Fixup  
Saurabh Jha 已提交
209 210
calls the real Makefile inside the `src` directory and an example
configuration for Redis and Sentinel. You can find a few shell
211 212 213 214
scripts that are used in order to execute the Redis, Redis Cluster and
Redis Sentinel unit tests, which are implemented inside the `tests`
directory.

S
Fixup  
Saurabh Jha 已提交
215
Inside the root are the following important directories:
216 217 218

* `src`: contains the Redis implementation, written in C.
* `tests`: contains the unit tests, implemented in Tcl.
S
Saurabh Jha 已提交
219
* `deps`: contains libraries Redis uses. Everything needed to compile Redis is inside this directory; your system just needs to provide `libc`, a POSIX compatible interface and a C compiler. Notably `deps` contains a copy of `jemalloc`, which is the default allocator of Redis under Linux. Note that under `deps` there are also things which started with the Redis project, but for which the main repository is not `anitrez/redis`. An exception to this rule is `deps/geohash-int` which is the low level geocoding library used by Redis: it originated from a different project, but at this point it diverged so much that it is developed as a separated entity directly inside the Redis repository.
220 221 222 223

There are a few more directories but they are not very important for our goals
here. We'll focus mostly on `src`, where the Redis implementation is contained,
exploring what there is inside each file. The order in which files are
A
antirez 已提交
224
exposed is the logical one to follow in order to disclose different layers
225 226 227
of complexity incrementally.

Note: lately Redis was refactored quite a bit. Function names and file
S
Saurabh Jha 已提交
228
names have been changed, so you may find that this documentation reflects the
229
`unstable` branch more closely. For instance in Redis 3.0 the `server.c`
S
Fixup  
Saurabh Jha 已提交
230
and `server.h` files were named to `redis.c` and `redis.h`. However the overall
A
antirez 已提交
231
structure is the same. Keep in mind that all the new developments and pull
232 233
requests should be performed against the `unstable` branch.

J
Joe Pettersson 已提交
234
server.h
235 236
---

S
Saurabh Jha 已提交
237
The simplest way to understand how a program works is to understand the
238 239 240 241 242
data structures it uses. So we'll start from the main header file of
Redis, which is `server.h`.

All the server configuration and in general all the shared state is
defined in a global structure called `server`, of type `struct redisServer`.
S
Saurabh Jha 已提交
243
A few important fields in this structure are:
244 245 246 247

* `server.db` is an array of Redis databases, where data is stored.
* `server.commands` is the command table.
* `server.clients` is a linked list of clients connected to the server.
248
* `server.master` is a special client, the master, if the instance is a replica.
249

S
Saurabh Jha 已提交
250
There are tons of other fields. Most fields are commented directly inside
251 252 253 254
the structure definition.

Another important Redis data structure is the one defining a client.
In the past it was called `redisClient`, now just `client`. The structure
S
Saurabh Jha 已提交
255
has many fields, here we'll just show the main ones:
256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272

    struct client {
        int fd;
        sds querybuf;
        int argc;
        robj **argv;
        redisDb *db;
        int flags;
        list *reply;
        char buf[PROTO_REPLY_CHUNK_BYTES];
        ... many other fields ...
    }

The client structure defines a *connected client*:

* The `fd` field is the client socket file descriptor.
* `argc` and `argv` are populated with the command the client is executing, so that functions implementing a given Redis command can read the arguments.
S
Saurabh Jha 已提交
273
* `querybuf` accumulates the requests from the client, which are parsed by the Redis server according to the Redis protocol and executed by calling the implementations of the commands the client is executing.
274 275 276 277
* `reply` and `buf` are dynamic and static buffers that accumulate the replies the server sends to the client. These buffers are incrementally written to the socket as soon as the file descriptor is writable.

As you can see in the client structure above, arguments in a command
are described as `robj` structures. The following is the full `robj`
A
antirez 已提交
278
structure, which defines a *Redis object*:
279 280 281 282 283 284 285 286 287 288 289 290

    typedef struct redisObject {
        unsigned type:4;
        unsigned encoding:4;
        unsigned lru:LRU_BITS; /* lru time (relative to server.lruclock) */
        int refcount;
        void *ptr;
    } robj;

Basically this structure can represent all the basic Redis data types like
strings, lists, sets, sorted sets and so forth. The interesting thing is that
it has a `type` field, so that it is possible to know what type a given
S
Saurabh Jha 已提交
291
object has, and a `refcount`, so that the same object can be referenced
292
in multiple places without allocating it multiple times. Finally the `ptr`
S
Fixup  
Saurabh Jha 已提交
293
field points to the actual representation of the object, which might vary
294 295 296 297 298 299
even for the same type, depending on the `encoding` used.

Redis objects are used extensively in the Redis internals, however in order
to avoid the overhead of indirect accesses, recently in many places
we just use plain dynamic strings not wrapped inside a Redis object.

S
Saurabh Jha 已提交
300
server.c
301 302 303 304 305 306 307 308
---

This is the entry point of the Redis server, where the `main()` function
is defined. The following are the most important steps in order to startup
the Redis server.

* `initServerConfig()` setups the default values of the `server` structure.
* `initServer()` allocates the data structures needed to operate, setup the listening socket, and so forth.
S
Saurabh Jha 已提交
309
* `aeMain()` starts the event loop which listens for new connections.
310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325

There are two special functions called periodically by the event loop:

1. `serverCron()` is called periodically (according to `server.hz` frequency), and performs tasks that must be performed from time to time, like checking for timedout clients.
2. `beforeSleep()` is called every time the event loop fired, Redis served a few requests, and is returning back into the event loop.

Inside server.c you can find code that handles other vital things of the Redis server:

* `call()` is used in order to call a given command in the context of a given client.
* `activeExpireCycle()` handles eviciton of keys with a time to live set via the `EXPIRE` command.
* `freeMemoryIfNeeded()` is called when a new write command should be performed but Redis is out of memory according to the `maxmemory` directive.
* The global variable `redisCommandTable` defines all the Redis commands, specifying the name of the command, the function implementing the command, the number of arguments required, and other properties of each command.

networking.c
---

326
This file defines all the I/O functions with clients, masters and replicas
327 328 329 330
(which in Redis are just special clients):

* `createClient()` allocates and initializes a new client.
* the `addReply*()` family of functions are used by commands implementations in order to append data to the client structure, that will be transmitted to the client as a reply for a given command executed.
S
Saurabh Jha 已提交
331
* `writeToClient()` transmits the data pending in the output buffers to the client and is called by the *writable event handler* `sendReplyToClient()`.
332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356
* `readQueryFromClient()` is the *readable event handler* and accumulates data from read from the client into the query buffer.
* `processInputBuffer()` is the entry point in order to parse the client query buffer according to the Redis protocol. Once commands are ready to be processed, it calls `processCommand()` which is defined inside `server.c` in order to actually execute the command.
* `freeClient()` deallocates, disconnects and removes a client.

aof.c and rdb.c
---

As you can guess from the names these files implement the RDB and AOF
persistence for Redis. Redis uses a persistence model based on the `fork()`
system call in order to create a thread with the same (shared) memory
content of the main Redis thread. This secondary thread dumps the content
of the memory on disk. This is used by `rdb.c` to create the snapshots
on disk and by `aof.c` in order to perform the AOF rewrite when the
append only file gets too big.

The implementation inside `aof.c` has additional functions in order to
implement an API that allows commands to append new commands into the AOF
file as clients execute them.

The `call()` function defined inside `server.c` is responsible to call
the functions that in turn will write the commands into the AOF.

db.c
---

357
Certain Redis commands operate on specific data types, others are general.
358 359
Examples of generic commands are `DEL` and `EXPIRE`. They operate on keys
and not on their values specifically. All those generic commands are
360 361 362 363 364 365 366 367 368 369
defined inside `db.c`.

Moreover `db.c` implements an API in order to perform certain operations
on the Redis dataset without directly accessing the internal data structures.

The most important functions inside `db.c` which are used in many commands
implementations are the following:

* `lookupKeyRead()` and `lookupKeyWrite()` are used in order to get a pointer to the value associated to a given key, or `NULL` if the key does not exist.
* `dbAdd()` and its higher level counterpart `setKey()` create a new key in a Redis database.
A
antirez 已提交
370
* `dbDelete()` removes a key and its associated value.
371 372 373 374 375 376 377
* `emptyDb()` removes an entire single database or all the databases defined.

The rest of the file implements the generic commands exposed to the client.

object.c
---

378
The `robj` structure defining Redis objects was already described. Inside
379 380 381 382 383 384 385 386 387 388 389 390 391 392
`object.c` there are all the functions that operate with Redis objects at
a basic level, like functions to allocate new objects, handle the reference
counting and so forth. Notable functions inside this file:

* `incrRefcount()` and `decrRefCount()` are used in order to increment or decrement an object reference count. When it drops to 0 the object is finally freed.
* `createObject()` allocates a new object. There are also specialized functions to allocate string objects having a specific content, like `createStringObjectFromLongLong()` and similar functions.

This file also implements the `OBJECT` command.

replication.c
---

This is one of the most complex files inside Redis, it is recommended to
approach it only after getting a bit familiar with the rest of the code base.
393
In this file there is the implementation of both the master and replica role
394 395
of Redis.

396 397
One of the most important functions inside this file is `replicationFeedSlaves()` that writes commands to the clients representing replica instances connected
to our master, so that the replicas can get the writes performed by the clients:
398 399 400 401
this way their data set will remain synchronized with the one in the master.

This file also implements both the `SYNC` and `PSYNC` commands that are
used in order to perform the first synchronization between masters and
402
replicas, or to continue the replication after a disconnection.
403 404 405 406

Other C files
---

407
* `t_hash.c`, `t_list.c`, `t_set.c`, `t_string.c` and `t_zset.c` contains the implementation of the Redis data types. They implement both an API to access a given data type, and the client commands implementations for these data types.
408 409 410 411
* `ae.c` implements the Redis event loop, it's a self contained library which is simple to read and understand.
* `sds.c` is the Redis string library, check http://github.com/antirez/sds for more information.
* `anet.c` is a library to use POSIX networking in a simpler way compared to the raw interface exposed by the kernel.
* `dict.c` is an implementation of a non-blocking hash table which rehashes incrementally.
412 413 414 415
* `scripting.c` implements Lua scripting. It is completely self contained from the rest of the Redis implementation and is simple enough to understand if you are familar with the Lua API.
* `cluster.c` implements the Redis Cluster. Probably a good read only after being very familiar with the rest of the Redis code base. If you want to read `cluster.c` make sure to read the [Redis Cluster specification][3].

[3]: http://redis.io/topics/cluster-spec
416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434

Anatomy of a Redis command
---

All the Redis commands are defined in the following way:

    void foobarCommand(client *c) {
        printf("%s",c->argv[1]->ptr); /* Do something with the argument. */
        addReply(c,shared.ok); /* Reply something to the client. */
    }

The command is then referenced inside `server.c` in the command table:

    {"foobar",foobarCommand,2,"rtF",0,NULL,0,0,0,0,0},

In the above example `2` is the number of arguments the command takes,
while `"rtF"` are the command flags, as documented in the command table
top comment inside `server.c`.

435 436
After the command operates in some way, it returns a reply to the client,
usually using `addReply()` or a similar function defined inside `networking.c`.
437

B
Benjamin Holst 已提交
438
There are tons of commands implementations inside the Redis source code
439 440 441
that can serve as examples of actual commands implementations. To write
a few toy commands can be a good exercise to familiarize with the code base.

S
Saurabh Jha 已提交
442 443 444
There are also many other files not described here, but it is useless to
cover everything. We want to just help you with the first steps.
Eventually you'll find your way inside the Redis code base :-)
445 446

Enjoy!