h2. Caching with Rails: An overview

This guide teaches you how to avoid expensive round-trips to your database and return what web clients need in the shortest time possible.

After reading this guide, you should be able to use and configure:

* Page, action, and fragment caching
* Sweepers
* Alternative cache stores
* Conditional GET support

endprologue.

h3. Basic Caching

This is an introduction to the three types of caching techniques that Rails provides by default, without the use of any third-party plugins.

To start playing with caching you'll want to ensure that +config.action_controller.perform_caching+ is set to +true+ if you're running in development mode. This flag is normally set in the corresponding +config/environments/*.rb+ file. Caching is disabled by default for development and test, and enabled for production.

<ruby>
config.action_controller.perform_caching = true
</ruby>

h4. Page Caching

Page caching is a Rails mechanism which allows the request for a generated page to be fulfilled by the web server (e.g. Apache or nginx) without ever having to go through the Rails stack. This is very fast. Unfortunately, it can't be applied to every situation (such as pages that need authentication), and since the web server is just serving a file from the filesystem, cache expiration is an issue that needs to be dealt with.

So, how do you enable this super-fast cache behavior? Suppose you have a controller called +ProductsController+ and an +index+ action that lists all the products:

<ruby>
class ProductsController < ApplicationController

  caches_page :index

  def index
    @products = Product.all
  end

end
</ruby>

The first time anyone requests +/products+, Rails will generate a file called +products.html+. The web server will then look for that file before it passes the next request for +/products+ to your Rails application.

By default, the page cache directory is set to +Rails.public_path+ (which is usually the +public+ folder). You can change it with the configuration setting +config.action_controller.page_cache_directory+. Changing the default from +public+ helps avoid naming conflicts, since you may want to put other static HTML in +public+, but it requires reconfiguring the web server so that it knows where to serve the cached files from.

The Page Caching mechanism automatically adds a +.html+ extension to requests for pages that do not have an extension, which makes it easy for the web server to find those pages. You can change the extension with the configuration setting +config.action_controller.page_cache_extension+.

To expire this page when a new product is added, we could extend our example controller like this:

<ruby>
class ProductsController < ApplicationController

  caches_page :index

  def index
    @products = Product.all
  end

  def create
    expire_page :action => :index
  end

end
</ruby>

If you want a more complicated expiration scheme, you can use cache sweepers to expire cached objects when things change. This is covered in the section on Sweepers.

Note: Page caching ignores all parameters. For example +/products?page=1+ will be written out to the filesystem as +products.html+ with no reference to the +page+ parameter. Thus, if someone requests +/products?page=2+ later, they will get the cached first page. Be careful when page caching GET parameters in the URL!
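
To make the mapping from request to cached file concrete, here is a minimal sketch in plain Ruby. It is not the Rails implementation: +page_cache_file+ is a hypothetical helper written for this illustration, and the +public+ directory and +.html+ extension simply mirror the defaults described above.

```ruby
require "uri"

# Hypothetical helper (not a Rails API): approximates where a page-cached
# response for the given URL would be written on disk.
def page_cache_file(url, cache_dir = "public", extension = ".html")
  path = URI.parse(url).path                # the query string is dropped here
  path = "/index" if path.empty? || path == "/"
  path += extension if File.extname(path).empty?
  File.join(cache_dir, path)
end

puts page_cache_file("/products")           # public/products.html
puts page_cache_file("/products?page=1")    # public/products.html
puts page_cache_file("/products?page=2")    # public/products.html
puts page_cache_file("/feed.xml")           # public/feed.xml
```

Because both paginated URLs map to the same file, whichever page is requested first is the one every later visitor receives.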

h4. Action Caching

One of the issues with Page Caching is that you cannot use it for pages that need to restrict access in some way. This is where Action Caching comes in. Action Caching works like Page Caching, except that the incoming web request does go from the web server to the Rails stack and Action Pack, so that before filters can be run on it before the cache is served. This allows authentication and other restrictions to be run while still serving the result of the output from a cached copy.

Clearing the cache works in exactly the same way as with Page Caching.

Let's say you only wanted authenticated users to call actions on +ProductsController+.

<ruby>
class ProductsController < ApplicationController

  before_filter :authenticate
  caches_action :index

  def index
    @products = Product.all
  end

  def create
    expire_action :action => :index
  end

end
</ruby>

You can also use +:if+ (or +:unless+) to pass a Proc that specifies when the action should be cached. Also, you can use +:layout => false+ to cache without the layout, so that dynamic information in the layout, such as the logged-in user's details or the number of items in the cart, can be left uncached. This feature is available as of Rails 2.2.

You can modify the default action cache path by passing a +:cache_path+ option. This will be passed directly to +ActionCachePath.path_for+. This is handy for actions with multiple possible routes that should be cached differently. If a block is given, it is called with the current controller instance.

Finally, if you are using memcached, you can also pass +:expires_in+. In fact, all parameters not used by +caches_action+ are sent to the underlying cache store.

h4. Fragment Caching

Life would be perfect if we could get away with caching the entire contents of a page or action and serving it out to the world. Unfortunately, dynamic web applications usually build pages from a variety of components, not all of which have the same caching characteristics. To handle pages where different parts need to be cached and expired differently, Rails provides a mechanism called Fragment Caching.

Fragment Caching allows a fragment of view logic to be wrapped in a cache block and served out of the cache store when the next request comes in.

As an example, if you wanted to show all the orders placed on your website in real time and didn't want to cache that part of the page, but did want to cache the part of the page which lists all products available, you could use this piece of code:

<ruby>
<% Order.find_recent.each do |o| %>
  <%= o.buyer.name %> bought <%= o.product.name %>
<% end %>

<% cache do %>
  All available products:
  <% Product.all.each do |p| %>
    <%= link_to p.name, product_url(p) %>
  <% end %>
<% end %>
</ruby>

The cache block in our example will bind to the action that called it and is written out to the same place as the Action Cache, which means that if you want to cache multiple fragments per action, you should provide an +action_suffix+ to the cache call:

<ruby>
<% cache(:action => 'recent', :action_suffix => 'all_products') do %>
  All available products:
<% end %>
</ruby>

You can expire it using the +expire_fragment+ method, like so:

<ruby>
expire_fragment(:controller => 'products', :action => 'recent', :action_suffix => 'all_products')
</ruby>

If you don't want the cache block to bind to the action that called it, you can also use globally keyed fragments by calling the +cache+ method with a key, like so:

<ruby>
<% cache('all_available_products') do %>
  All available products:
<% end %>
</ruby>

This fragment is then available to all actions in the +ProductsController+ using the key and can be expired the same way:

<ruby>
expire_fragment('all_available_products')
</ruby>
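
The render-once, serve-from-store, expire cycle described in this section can be sketched in plain Ruby. This is only an illustration under stated assumptions: +FragmentCacheSketch+ is a hypothetical class, a Hash stands in for the cache store, and the rendered string is made up.

```ruby
# Hypothetical stand-in for a fragment cache (not a Rails class); a Hash
# plays the role of the cache store.
class FragmentCacheSketch
  def initialize
    @store = {}
  end

  # Returns the cached fragment for the key, rendering it with the block
  # only on a cache miss.
  def cache(key)
    @store.fetch(key) { @store[key] = yield }
  end

  # Removes the fragment so the next call to #cache renders it again.
  def expire_fragment(key)
    @store.delete(key)
  end
end

renders = 0
fragments = FragmentCacheSketch.new
render_products = lambda do
  renders += 1
  "All available products: Widget, Gadget"
end

fragments.cache("all_available_products", &render_products)  # miss: renders
fragments.cache("all_available_products", &render_products)  # hit: served from the store
fragments.expire_fragment("all_available_products")
fragments.cache("all_available_products", &render_products)  # miss again after expiry
puts renders  # 2
```

The expensive block runs only when the key is absent, which is why expiring the right key at the right time matters so much.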

h4. Sweepers

Cache sweeping is a mechanism which allows you to get around having a ton of +expire_{page,action,fragment}+ calls in your code. It does this by moving all the work required to expire cached content into an +ActionController::Caching::Sweeper+ class. This class is an Observer and looks for changes to an object via callbacks, and when a change occurs it expires the caches associated with that object in an around or after filter.

Continuing with our Product controller example, we could rewrite it with a sweeper like this:

<ruby>
class ProductSweeper < ActionController::Caching::Sweeper
  observe Product # This sweeper is going to keep an eye on the Product model

  # If our sweeper detects that a Product was created call this
  def after_create(product)
    expire_cache_for(product)
  end

  # If our sweeper detects that a Product was updated call this
  def after_update(product)
    expire_cache_for(product)
  end

  # If our sweeper detects that a Product was deleted call this
  def after_destroy(product)
    expire_cache_for(product)
  end

  private

  def expire_cache_for(product)
    # Expire the index page now that we added a new product
    expire_page(:controller => 'products', :action => 'index')

    # Expire a fragment
    expire_fragment('all_available_products')
  end
end
</ruby>

You may notice that the actual product gets passed to the sweeper, so if we were caching the edit action for each product, we could add an expire call which specifies the page we want to expire:

<ruby>
  expire_action(:controller => 'products', :action => 'edit', :id => product)
</ruby>

Then we add it to our controller to tell it to call the sweeper when certain actions are called. So, if we wanted to expire the cached content for the index and edit actions when the create action was called, we could do the following:

<ruby>
class ProductsController < ApplicationController

  before_filter :authenticate
  caches_action :index
  cache_sweeper :product_sweeper

  def index
    @products = Product.all
  end

end
</ruby>
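
The observer idea behind sweepers can be shown without Rails. In the sketch below, every name (+SketchProduct+, +SketchSweeper+, the cache keys) is hypothetical, and a plain Hash stands in for the cache; it only illustrates how a model callback can trigger expiry so that controllers do not have to.

```ruby
# Hypothetical stand-ins, not Rails classes: a model that notifies observers
# and a sweeper that reacts by expiring cache entries.
class SketchProduct
  @observers = []

  class << self
    attr_reader :observers

    # Creates a product and notifies every registered observer.
    def create(name)
      product = new(name)
      observers.each { |observer| observer.after_create(product) }
      product
    end
  end

  attr_reader :name

  def initialize(name)
    @name = name
  end
end

class SketchSweeper
  def initialize(cache)
    @cache = cache
  end

  # Called by the model whenever a product is created.
  def after_create(_product)
    @cache.delete("all_available_products")
  end
end

cache = { "all_available_products" => "Widget", "site_footer" => "(c) Example" }
SketchProduct.observers << SketchSweeper.new(cache)

SketchProduct.create("Gadget")
p cache.keys  # ["site_footer"]
```

Only the entry that depends on the product list is removed; unrelated cached content stays in place.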

h4. SQL Caching

Query caching is a Rails feature that caches the result set returned by each query so that if Rails encounters the same query again for that request, it will use the cached result set as opposed to running the query against the database again.

For example:

<ruby>
class ProductsController < ApplicationController

  def index
    # Run a find query
    @products = Product.all

    # ...

    # Run the same query again
    @products = Product.all
  end

end
</ruby>

The second time the same query is run, it's not actually going to hit the database. The first time the result is returned from the query it is stored in the query cache (in memory), and the second time it's pulled from memory.

However, it's important to note that query caches are created at the start of an action and destroyed at the end of that action and thus persist only for the duration of the action.  If you'd like to store query results in a more persistent fashion, you can in Rails by using low level caching.
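
The lifetime of the query cache can be illustrated with a small plain-Ruby sketch. +SketchConnection+ and its methods are hypothetical names chosen for this example, not the Active Record implementation; the block passed to +cache+ plays the role of a single action.

```ruby
# Hypothetical stand-in for a database connection with a per-action query cache.
class SketchConnection
  attr_reader :database_hits

  def initialize
    @database_hits = 0
    @query_cache = nil
  end

  # Enables the query cache for the duration of the block (think: one
  # action), then throws it away.
  def cache
    @query_cache = {}
    yield
  ensure
    @query_cache = nil
  end

  # Runs the query, or returns the memoized result when the cache is active.
  def select_all(sql)
    return run_query(sql) unless @query_cache
    @query_cache.fetch(sql) { @query_cache[sql] = run_query(sql) }
  end

  private

  def run_query(sql)
    @database_hits += 1
    ["result of: #{sql}"]
  end
end

connection = SketchConnection.new
connection.cache do
  connection.select_all("SELECT * FROM products")  # hits the database
  connection.select_all("SELECT * FROM products")  # served from memory
end
connection.select_all("SELECT * FROM products")    # new "action": hits the database again
puts connection.database_hits  # 2
```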

h3. Cache Stores

Rails provides different stores for the cached data created by action and fragment caches. Page caches are always stored on disk.

Rails 2.1 and above provide +ActiveSupport::Cache::Store+ which can be used to cache strings. Some cache store implementations, like +MemoryStore+, are able to cache arbitrary Ruby objects, but don't count on every cache store to be able to do that.

The default cache stores provided with Rails include:

1) +ActiveSupport::Cache::MemoryStore+: A cache store implementation which stores everything in memory in the same process. If you're running multiple Ruby on Rails server processes (which is the case if you're using mongrel_cluster or Phusion Passenger), then your Rails server process instances won't be able to share cache data with each other. If your application never performs manual cache item expiry (e.g. when you're using generational cache keys), then using +MemoryStore+ is fine. Otherwise, consider carefully whether you should be using this cache store.

+MemoryStore+ is not only able to store strings, but also arbitrary Ruby objects.

+MemoryStore+ is not thread-safe. Use +SynchronizedMemoryStore+ instead if you need thread-safety.
                                      
<ruby>
ActionController::Base.cache_store = :memory_store
</ruby>

2) +ActiveSupport::Cache::FileStore+: Cached data is stored on disk. This is the default store, and its default path is +tmp/cache+. It works well for all types of environments and allows all processes running from the same application directory to access the cached content. If +tmp/cache+ does not exist, the default store becomes +MemoryStore+.

<ruby>
ActionController::Base.cache_store = :file_store, "/path/to/cache/directory"
</ruby>

3) +ActiveSupport::Cache::DRbStore+: Cached data is stored in a separate shared DRb process that all servers communicate with. This works for all environments and only keeps one cache around for all processes, but requires that you run and manage a separate DRb process.

<ruby>
ActionController::Base.cache_store = :drb_store, "druby://localhost:9192"
</ruby>

4) +ActiveSupport::Cache::MemCacheStore+: Works like +DRbStore+, but uses Danga's +memcached+ instead. Rails uses the bundled +memcache-client+ gem by default. This is currently the most popular cache store for production websites.

Special features:

* Clustering and load balancing. One can specify multiple memcached servers, and +MemCacheStore+ will load balance between all available servers. If a server goes down, then +MemCacheStore+ will ignore it until it goes back online.
* Time-based expiry support. See +write+ and the +:expires_in+ option.
* Per-request in memory cache for all communication with the +memcached+ server(s).

It also accepts a hash of additional options:

* +:namespace+: specifies a string that will automatically be prepended to keys when accessing the memcached store.
* +:readonly+: a boolean value that when set to true will make the store read-only, with an error raised on any attempt to write.
* +:multithread+: a boolean value that adds thread safety to read/write operations - it is unlikely you'll need to use this option as the Rails threadsafe! method offers the same functionality.

The read and write methods of the +MemCacheStore+ accept an options hash too. When reading, you can specify +:raw => true+ to prevent the object from being unmarshaled (by default this is false, which means the raw value in the cache is passed to +Marshal.load+ before being returned to you).

When writing to the cache it is also possible to specify +:raw => true+, which means the value is not passed to +Marshal.dump+ before being stored in the cache (by default this is false).

The write method also accepts an +:unless_exist+ flag, which determines whether the memcached add (when true) or set (when false) method is used to store the item in the cache, and an +:expires_in+ option that specifies the time-to-live for the cached item in seconds.

<ruby>
ActionController::Base.cache_store = :mem_cache_store, "localhost"
</ruby>

5) +ActiveSupport::Cache::SynchronizedMemoryStore+: Like +MemoryStore+ but thread-safe.

<ruby>
ActionController::Base.cache_store = :synchronized_memory_store
</ruby>

6) +ActiveSupport::Cache::CompressedMemCacheStore+: Works just like the regular +MemCacheStore+ but uses GZip to decompress/compress on read/write.

<ruby>
ActionController::Base.cache_store = :compressed_mem_cache_store, "localhost"
</ruby>

7) Custom store: You can define your own cache store (new in Rails 2.1).

<ruby>
ActionController::Base.cache_store = MyOwnStore.new("parameter")
</ruby>

NOTE: +config.cache_store+ can be used in place of +ActionController::Base.cache_store+ in your +Rails::Initializer.run+ block in +environment.rb+
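
The +:expires_in+ and +:unless_exist+ write options described for +MemCacheStore+ above can be illustrated with a toy store. This is a sketch under stated assumptions: +SketchStore+ is a hypothetical, Hash-backed class that only mimics the behaviour of those two options, and its clock is injectable so expiry can be shown without sleeping. A real custom store would more likely build on +ActiveSupport::Cache::Store+.

```ruby
# Hypothetical in-memory store, not a Rails class.
class SketchStore
  Entry = Struct.new(:value, :expires_at)

  def initialize(clock = -> { Time.now })
    @clock = clock
    @entries = {}
  end

  def read(key)
    entry = @entries[key]
    return nil if entry.nil?
    if entry.expires_at && @clock.call >= entry.expires_at
      @entries.delete(key)  # expired entries behave like misses
      return nil
    end
    entry.value
  end

  # options[:expires_in]   - time-to-live in seconds
  # options[:unless_exist] - when true, only write if the key is absent
  #                          (similar in spirit to the memcached add command)
  def write(key, value, options = {})
    return false if options[:unless_exist] && !read(key).nil?
    expires_at = options[:expires_in] ? @clock.call + options[:expires_in] : nil
    @entries[key] = Entry.new(value, expires_at)
    true
  end
end

now = Time.at(0)
store = SketchStore.new(-> { now })

store.write("city", "Duckburgh", :expires_in => 60)
store.write("city", "Gotham", :unless_exist => true)  # ignored: key already present
puts store.read("city")  # Duckburgh

now += 61                # move the fake clock past the time-to-live
p store.read("city")     # nil
```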

In addition to all of this, Rails also adds the +ActiveRecord::Base#cache_key+ method that generates a key using the class name, +id+ and +updated_at+ timestamp (if available).

You can access these cache stores at a low level for storing queries and other objects. Here's an example:

<ruby>
Rails.cache.read("city")   # => nil
Rails.cache.write("city", "Duckburgh")
Rails.cache.read("city")   # => "Duckburgh"
</ruby>
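
Combining low-level reads and writes with a key built from the class name, +id+ and +updated_at+ timestamp gives you keys that change whenever the record changes. The sketch below is plain Ruby and makes several assumptions: +SketchRecord+ is a hypothetical type, the key format only approximates what +cache_key+ produces, and a Hash stands in for +Rails.cache+.

```ruby
# Hypothetical record type; the key format below is an approximation of
# the one generated by ActiveRecord::Base#cache_key.
SketchRecord = Struct.new(:collection, :id, :updated_at) do
  def cache_key
    return "#{collection}/new" if id.nil?
    return "#{collection}/#{id}" if updated_at.nil?
    "#{collection}/#{id}-#{updated_at.utc.strftime('%Y%m%d%H%M%S')}"
  end
end

cache = {}  # stands in for Rails.cache
product = SketchRecord.new("products", 5, Time.utc(2009, 5, 2, 12, 0, 0))

cache[product.cache_key] = "rendered product"
puts product.cache_key      # products/5-20090502120000

product.updated_at = Time.utc(2009, 5, 3, 9, 30, 0)  # the record changes
puts product.cache_key      # products/5-20090503093000
p cache[product.cache_key]  # nil, so the stale entry is never read
```

With keys like these the old entry is simply never asked for again, which is why this style suits stores where manual expiry is awkward.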

h3. Conditional GET support

Conditional GETs are a feature of the HTTP specification that provides a way for web servers to tell browsers that the response to a GET request hasn't changed since the last request and can be safely pulled from the browser cache.

They work by using the +HTTP_IF_NONE_MATCH+ and +HTTP_IF_MODIFIED_SINCE+ headers to pass back and forth both a unique content identifier and the timestamp of when the content was last changed. If the browser makes a request where the content identifier (ETag) or last-modified timestamp matches the server's version, then the server only needs to send back an empty response with a not modified status.

It is the server's (i.e. our) responsibility to look for a last modified timestamp and the if-none-match header and determine whether or not to send back the full response. With conditional GET support in Rails this is a pretty easy task:

<ruby>
class ProductsController < ApplicationController

  def show
    @product = Product.find(params[:id])

    # If the request is stale according to the given timestamp and etag value
    # (i.e. it needs to be processed again) then execute this block
    if stale?(:last_modified => @product.updated_at.utc, :etag => @product)
      respond_to do |wants|
        # ... normal response processing
      end
    end

    # If the request is fresh (i.e. it's not modified) then you don't need to do
    # anything. The default render checks for this using the parameters
    # used in the previous call to stale? and will automatically send a
    # :not_modified.  So that's it, you're done.
  end
end
</ruby>

If you don't have any special response processing and are using the default rendering mechanism (i.e. you're not using +respond_to+ or calling +render+ yourself), then you've got an easy helper in +fresh_when+:

<ruby>
class ProductsController < ApplicationController

  # This will automatically send back a :not_modified if the request is fresh,
  # and will render the default template (show.*) if it's stale.

  def show
    @product = Product.find(params[:id])
    fresh_when :last_modified => @product.published_at.utc, :etag => @product
  end
end
</ruby>
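
The freshness decision described in this section can be sketched in plain Ruby. The helpers +etag_for+ and +request_fresh?+ are hypothetical and are not what Rails calls internally; the sketch only assumes that the ETag is some digest of a content key and that the two request headers arrive under the names used above.

```ruby
require "digest/md5"
require "time"

# Hypothetical helper, not a Rails API: builds a quoted ETag from a key.
def etag_for(key)
  '"' + Digest::MD5.hexdigest(key) + '"'
end

# Returns true when the client's cached copy is still valid, i.e. the server
# may answer with 304 Not Modified and an empty body.
def request_fresh?(headers, etag, last_modified)
  none_match     = headers["HTTP_IF_NONE_MATCH"]
  modified_since = headers["HTTP_IF_MODIFIED_SINCE"]
  return false if none_match.nil? && modified_since.nil?

  etag_matches = none_match.nil? || none_match == etag
  not_modified = modified_since.nil? || Time.httpdate(modified_since) >= last_modified
  etag_matches && not_modified
end

last_modified = Time.utc(2009, 5, 2, 12, 0, 0)
etag = etag_for("products/5-20090502120000")

first_visit = {}
revisit = {
  "HTTP_IF_NONE_MATCH"     => etag,
  "HTTP_IF_MODIFIED_SINCE" => last_modified.httpdate
}

puts request_fresh?(first_visit, etag, last_modified)                  # false
puts request_fresh?(revisit, etag, last_modified)                      # true
puts request_fresh?(revisit, etag_for("changed"), last_modified + 60)  # false
```

When the check returns true the server can skip rendering entirely, which is where the time saving comes from.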

h3. Advanced Caching

Along with the built-in mechanisms outlined above, a number of excellent plugins exist to help with finer grained control over caching. These include Chris Wanstrath's excellent cache_fu plugin (more info "here":http://errtheblog.com/posts/57-kickin-ass-w-cachefu) and Evan Weaver's interlock plugin (more info "here":http://blog.evanweaver.com/articles/2007/12/13/better-rails-caching/). Both of these plugins play nice with memcached and are a must-see for anyone seriously considering optimizing their caching needs.

Also the new "Cache money":http://github.com/nkallen/cache-money/tree/master plugin is supposed to be mad cool. 

h3. References

* "Scaling Rails Screencasts":http://railslab.newrelic.com/scaling-rails
* "RailsEnvy, Rails Caching Tutorial, Part 1":http://www.railsenvy.com/2007/2/28/rails-caching-tutorial
* "RailsEnvy, Rails Caching Tutorial, Part 2":http://www.railsenvy.com/2007/3/20/ruby-on-rails-caching-tutorial-part-2
* "ActiveSupport::Cache documentation":http://api.rubyonrails.org/classes/ActiveSupport/Cache.html
* "Rails 2.1 integrated caching tutorial":http://thewebfellas.com/blog/2008/6/9/rails-2-1-now-with-better-integrated-caching

h3. Changelog

"Lighthouse ticket":http://rails.lighthouseapp.com/projects/16213-rails-guides/tickets/10-guide-to-caching

* May       02, 2009: Formatting cleanups
* April     26, 2009: Clean up typos in submitted patch
* April      1, 2009: Made a bunch of small fixes
* February  22, 2009: Beefed up the section on cache_stores
* December  27, 2008: Typo fixes
* November  23, 2008: Incremental updates with various suggested changes and formatting cleanup
* September 15, 2008: Initial version by Aditya Chadha