1. 25 3月, 2014 4 次提交
    • M
      Cluster: Fix trib create when masters==replicas · ee18135a
      Matt Stancliff 提交于
      This bug was introduced in 2e5c394f during a refactor.
      
      It took me a while to understand what was going on with
      the code, so I've refactored it further by:
        - Replacing boolean values with meaningful symbols
        - Replacing 'i' with a meaningful variable name
        - Adding the proper abort check
        - Factoring out now duplicated conditionals
        - Adding optional verbose logging (we're inside *four*
          different looping constructs, so it takes a while to
          figure out where all the moving parts are)
        - Updating comment for the section
      
      This fixes a problem when the number of master instances
      equaled the number of replica instances.  Before, when
      there were equal numbers of both, nodes_count would go to
      zero, but the while loop would spin in i < @replicas because
      i would never be updated (because the nodes_list of each ip
      was length == 0, which triggered an endless loop of
      next -> i = 0 -> 0 < 1? -> true -> next -> i = 0 ...)
      
      Thanks to carlo who found this problem at:
      https://groups.google.com/forum/#!topic/redis-db/_WVVqDw5B7c
      ee18135a
    • J
      Fixed a few typos. · 9fa96697
      Jan-Erik Rediger 提交于
      9fa96697
    • M
      Cluster: remove variable causing warning · a7478079
      Matt Stancliff 提交于
      GCC-4.9 warned about this, but clang didn't.
      
      This commit fixes warning:
      sentinel.c: In function 'sentinelReceiveHelloMessages':
      sentinel.c:2156:43: warning: variable 'master' set but not used [-Wunused-but-set-variable]
           sentinelRedisInstance *ri = c->data, *master;
      a7478079
    • J
      Finally fix the `install_server.sh` script. · 46904ae2
      Jan-Erik Rediger 提交于
      Includes changes from a dozen bug reports and pull requests.
      Was tested on Ubuntu, Debian and CentOS.
      46904ae2
  2. 24 3月, 2014 4 次提交
    • A
      Sample and cache RSS in serverCron(). · 2dd8c462
      antirez 提交于
      Obtaining the RSS (Resident Set Size) info is slow in Linux and OSX.
      This slowed down the generation of the INFO 'memory' section.
      
      Since the RSS does not require to be a real-time measurement, we
      now sample it with server.hz frequency (10 times per second by default)
      and use this value both to show the INFO rss field and to compute the
      fragmentation ratio.
      
      Practically this does not make any difference for memory profiling of
      Redis but speeds up the INFO call significantly.
      2dd8c462
    • A
      sdscatvprintf(): Try to use a static buffer. · ae211be9
      antirez 提交于
      For small content the function now tries to use a static buffer to avoid
      a malloc/free cycle that is too costly when the function is used in the
      context of performance critical code path such as INFO output generation.
      
      This change was verified to have positive effects in the execution speed
      of the INFO command.
      ae211be9
    • A
      Cache uname() output across INFO calls. · 571d6b01
      antirez 提交于
      Uname was profiled to be a slow syscall. It produces always the same
      output in the context of a single execution of Redis, so calling it at
      every INFO output generation does not make too much sense.
      
      The uname utsname structure was modified as a static variable. At the
      same time a static integer was added to check if we need to call uname
      the first time.
      571d6b01
    • A
      sdscatvprintf(): guess buflen using format length. · 06b2a389
      antirez 提交于
      sdscatvprintf() uses a loop where it tries to output the formatted
      string in a buffer of the initial length, if there was not enough room,
      a buffer of doubled size is tried and so forth.
      
      The initial guess for the buffer length was very poor, an hardcoded
      "16". This caused the printf to be processed multiple times without a
      good reason. Given that printf functions are already not fast, the
      overhead was significant.
      
      The new heuristic is to use a buffer 4 times the length of the format
      buffer, and 32 as minimal size. This appears to be a good balance for
      typical uses of the function inside the Redis code base.
      
      This change improved INFO command performances 3 times.
      06b2a389
  3. 21 3月, 2014 23 次提交
    • A
      Add test-lru.rb to utils. · fd70c68c
      antirez 提交于
      This is a program useful to evaluate the Redis LRU algorithm behavior.
      fd70c68c
    • A
      Use getLRUClock() instead of server.lruclock to create objects. · 24c35024
      antirez 提交于
      Thanks to Matt Stancliff for noticing this error. It was in the original
      code but somehow I managed to remove the change from the commit...
      24c35024
    • A
      The default maxmemory policy is now noeviction. · 7f274e3b
      antirez 提交于
      This is safer as by default maxmemory should just set a memory limit
      without any key to be deleted, unless the policy is set to something
      more relaxed.
      7f274e3b
    • A
      Use 24 bits for the lru object field and improve resolution. · a219339e
      antirez 提交于
      There were 2 spare bits inside the Redis object structure that are now
      used in order to enlarge 4x the range of the LRU field.
      
      At the same time the resolution was improved from 10 to 1 second: this
      still provides 194 days before the LRU counter overflows (restarting from
      zero).
      
      This is not a problem since it only causes lack of eviction precision for
      objects not touched for a very long time, and the lack of precision is
      only temporary.
      a219339e
    • A
      Default LRU samples is now 5. · 9928b531
      antirez 提交于
      9928b531
    • A
      Use new dictGetRandomKeys() API to get samples for eviction. · fd5e8c01
      antirez 提交于
      The eviction quality degradates a bit in my tests, but since the API is
      faster, it allows to raise the number of samples, and overall is a win.
      fd5e8c01
    • A
      struct dictEntry -> dictEntry. · 10c8d862
      antirez 提交于
      10c8d862
    • A
      Added dictGetRandomKeys() to dict.c: mass get random entries. · 26292670
      antirez 提交于
      This new function is useful to get a number of random entries from an
      hash table when we just need to do some sampling without particularly
      good distribution.
      
      It just jumps at a random place of the hash table and returns the first
      N items encountered by scanning linearly.
      
      The main usefulness of this function is to speedup Redis internal
      sampling of the key space, for example for key eviction or expiry.
      26292670
    • A
      LRU eviction pool implementation. · c641074a
      antirez 提交于
      This is an improvement over the previous eviction algorithm where we use
      an eviction pool that is persistent across evictions of keys, and gets
      populated with the best candidates for evictions found so far.
      
      It allows to approximate LRU eviction at a given number of samples
      better than the previous algorithm used.
      c641074a
    • A
      Fix OBJECT IDLETIME return value converting to seconds. · c9ac817c
      antirez 提交于
      estimateObjectIdleTime() returns a value in milliseconds now, so we need
      to scale the output of OBJECT IDLETIME to seconds.
      c9ac817c
    • A
      Obtain LRU clock in a resolution dependent way. · 205c2ccc
      antirez 提交于
      For testing purposes it is handy to have a very high resolution of the
      LRU clock, so that it is possible to experiment with scripts running in
      just a few seconds how the eviction algorithms works.
      
      This commit allows Redis to use the cached LRU clock, or a value
      computed on demand, depending on the resolution. So normally we have the
      good performance of a precomputed value, and a clock that wraps in many
      days using the normal resolution, but if needed, changing a define will
      switch behavior to an high resolution LRU clock.
      205c2ccc
    • A
      Specify lruclock in redisServer structure via REDIS_LRU_BITS. · 561e7934
      antirez 提交于
      The padding field was totally useless: removed.
      561e7934
    • A
      Specify LRU resolution in milliseconds. · 8f0b7491
      antirez 提交于
      8f0b7491
    • A
      Set LRU parameters via REDIS_LRU_BITS define. · 63aacbe8
      antirez 提交于
      63aacbe8
    • A
      Unify stats reset for CONFIG RESETSTAT / initServer(). · 8b6a674a
      antirez 提交于
      Now CONFIG RESETSTAT makes sure to reset all the fields, and in the
      future it will be simpler to avoid missing new fields.
      8b6a674a
    • A
      Sentinel: sentinelRefreshInstanceInfo() minor refactoring. · 128dcee4
      antirez 提交于
      Test sentinel.tilt condition on top and return if it is true.
      This allows to remove the check for the tilt condition in the remaining
      code paths of the function.
      128dcee4
    • A
      Sentinel test: 02 unit better coverage + refactoring. · b104015f
      antirez 提交于
      b104015f
    • A
      Sentinel test: foreach_instance_id implements 'break'. · f5f281f9
      antirez 提交于
      f5f281f9
    • A
      Sentinel: instance_is_killed proc added to sentinel.tcl. · f308f677
      antirez 提交于
      f308f677
    • A
      0774d492
    • A
      Sentinel: down-after-milliseconds is not master-specific. · a86e24de
      antirez 提交于
      addReplySentinelRedisInstance() modified so that this field is displayed
      for all the kind of instances: Sentinels, Masters, Slaves.
      a86e24de
    • A
      Sentinel failure detection implementation improved. · 9997b51f
      antirez 提交于
      Failure detection in Sentinel is ping-pong based. It used to work by
      remembering the last time a valid PONG reply was received, and checking
      if the reception time was too old compared to the current current time.
      
      PINGs were sent at a fixed interval of 1 second.
      
      This works in a decent way, but does not scale well when we want to set
      very small values of "down-after-milliseconds" (this is the node
      timeout basically).
      
      This commit reiplements the failure detection making a number of
      changes. Some changes are inspired to Redis Cluster failure detection
      code:
      
      * A new last_ping_time field is added in representation of instances.
        If non zero, we have an active ping that was sent at the specified
        time. When a valid reply to ping is received, the field is zeroed
        again.
      * last_ping_time is not reset when we reconnect the link or send a new
        ping, so from our point of view it represents the time we started
        waiting for the instance to reply to our pings without receiving a
        reply.
      * last_ping_time is now used in order to check if the instance is
        timed out. This means that we can have a node timeout of 100
        milliseconds and yet the system will work well since the new check is
        not bound to the period used to send pings.
      * Pings are now sent every second, or often if the value of
        down-after-milliseconds is less than one second. With a lower limit of
        10 HZ ping frequency.
      * Link reconnection code was improved. This is used in order to try to
        reconnect the link when we are at 50% of the node timeout without a
        valid reply received yet. However the old code triggered unnecessary
        reconnections when the node timeout was very small. Now that should be
        ok.
      
      The new code passes the tests but more testing is needed and more unit
      tests stressing the failure detector, so currently this is merged only
      in the unstable branch.
      9997b51f
    • A
      Sentinel: use CLIENT SETNAME when connecting to Redis. · 04cd7f77
      antirez 提交于
      This makes debugging / monitoring of Sentinels simpler since you can
      identify sentinels in CLIENT LIST output of Redis instances.
      04cd7f77
  4. 15 3月, 2014 3 次提交
    • A
      redis-trib: call MIGRATE via r.client.call as fix for redis-rb API changes. · 32f188cd
      antirez 提交于
      See issue #1593.
      
      Thanks to @badboy for suggesting the direct client.call fix.
      32f188cd
    • M
      Fix segfault from accessing array out of bounds · 0c8f0079
      Matt Stancliff 提交于
      argc == 2; argv[2] == crash
      0c8f0079
    • A
      Sentinel: be safe under crash-recovery assumptions. · 668a95a5
      antirez 提交于
      Sentinel's main safety argument is that there are no two configurations
      for the same master with the same version (configuration epoch).
      
      For this to be true Sentinels require to be authorized by a majority.
      Additionally Sentinels require to do two important things:
      
      * Never vote again for the same epoch.
      * Never exchange an old vote for a fresh one.
      
      The first prerequisite, in a crash-recovery system model, requires to
      persist the master->leader_epoch on durable storage before to reply to
      messages. This was not the case.
      
      We also make sure to persist the current epoch in order to never reply
      to stale votes requests from other Sentinels, after a recovery.
      
      The configuration is persisted by making use of fsync(), this is
      considered in the context of this code a good enough guarantee that
      after a restart our durable state is restored, however this may not
      always be the case depending on the kind of hardware and operating
      system used.
      668a95a5
  5. 14 3月, 2014 2 次提交
    • A
      Sentinel: fake PUBLISH command to receive HELLO messages. · ca2a3b32
      antirez 提交于
      Now the way HELLO messages are received is unified.
      Now it is no longer needed for Sentinels to converge to the higher
      configuration for a master to be able to chat via some Redis instance,
      the are able to directly exchanges configurations.
      
      Note that this commit does not include the (trivial) change needed to
      send HELLO messages to Sentinel instances as well, since for an error I
      committed the change in the previous commit that refactored hello
      messages processing into a separated function.
      ca2a3b32
    • A
  6. 13 3月, 2014 1 次提交
  7. 11 3月, 2014 3 次提交