Commits · 7b75ca100de247a40c78da89e28f4d71e0635b95 · 张重言 / rails

01 Jun, 2016 1 commit

Make Active Record emit significantly smaller YAML · c4cb6862

Committed by Sean Griffin on May 31, 2016

This reduces the size of a YAML encoded Active Record object by ~80%,
depending on the number of columns. The previous encoding did a number
of wasteful things; removing them resulted in the following wins:

- We were emitting the result of `attributes_before_type_cast` as a hack
  to work around some laziness issues
- The name of an attribute was emitted multiple times, since the
  attribute objects were in a hash keyed by the name. We now store them
  in an array instead, and reconstruct the hash using the name
- The types were included for every attribute. This would use backrefs
  if multiple objects were encoded, but really we don't need to include
  it at all unless it differs from the type at the class level. (The
  only time that will occur is if the field is the result of a custom
  select clause)
- `original_attribute:` was included over and over and over again since
  the ivar is almost always `nil`. We've added a custom implementation
  of `encode_with` on the attribute objects to ensure we don't write the
  key when the field is `nil`.
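The last bullet relies on Psych's `encode_with` hook. As a minimal sketch, assuming a stand-in class named `YamlSketchAttribute` (hypothetical, not the Active Record implementation), omitting a key whose value is `nil` could look like this:

```ruby
require "yaml"

# Hypothetical stand-in for an attribute object; not the Active Record class.
class YamlSketchAttribute
  attr_reader :name, :value_before_type_cast, :original_attribute

  def initialize(name, value_before_type_cast, original_attribute = nil)
    @name = name
    @value_before_type_cast = value_before_type_cast
    @original_attribute = original_attribute
  end

  # Psych calls this hook when dumping; only write keys that carry information.
  def encode_with(coder)
    coder["name"] = name
    coder["value_before_type_cast"] = value_before_type_cast
    coder["original_attribute"] = original_attribute unless original_attribute.nil?
  end

  # Psych calls this hook when loading; a missing key simply reads as nil.
  def init_with(coder)
    @name = coder["name"]
    @value_before_type_cast = coder["value_before_type_cast"]
    @original_attribute = coder["original_attribute"]
  end
end

# Newer Psych versions refuse arbitrary classes in YAML.load, so prefer
# unsafe_load when it exists. Only do this with trusted input.
def sketch_yaml_load(yaml)
  YAML.respond_to?(:unsafe_load) ? YAML.unsafe_load(yaml) : YAML.load(yaml)
end

yaml = YAML.dump(YamlSketchAttribute.new("title", "Hello"))
restored = sketch_yaml_load(yaml)
```

In this sketch the dumped document has no `original_attribute` key at all, and the restored object still reports `nil` for it.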

This isn't without a cost though. Since we're no longer including the
types, an object can find itself in an invalid state if the type changes
on the class after serialization. This is the same as 4.1 and earlier,
but I think it's worth noting.

I was worried that I'd introduce some new state bugs as a result of
doing this, so I've added an additional test that asserts that mutations
are not lost as a result of YAML round tripping.

Fixes #25145

12 May, 2016 1 commit

Define ActiveRecord::Attribute::Null#type_cast · 556e530d

Committed by Matthew Erhard on May 11, 2016

Using ActiveRecord::Base.attribute to declare an attribute with a default value on a model where the attribute is not backed by the database would raise a NotImplementedError when model.save is called.

The error originates from https://github.com/rails/rails/blob/59d252196b36f6afaafd231756d69ea21537cf5d/activerecord/lib/active_record/attribute.rb#L84.
This is called from https://github.com/rails/rails/blob/59d252196b36f6afaafd231756d69ea21537cf5d/activerecord/lib/active_record/attribute.rb#L46 on an ActiveRecord::Attribute::Null object.

This commit corrects the behavior by implementing ActiveRecord::Attribute::Null#type_cast.

With ActiveRecord::Attribute::Null#type_cast defined, ActiveRecord::Attribute::Null#value (https://github.com/rails/rails/blob/59d252196b36f6afaafd231756d69ea21537cf5d/activerecord/lib/active_record/attribute.rb#L173..L175) can be replaced with its super method (https://github.com/rails/rails/blob/59d252196b36f6afaafd231756d69ea21537cf5d/activerecord/lib/active_record/attribute.rb#L36..L40).

fixes #24979
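The shape of the fix can be sketched in plain Ruby. The class names below are hypothetical and heavily simplified; they only illustrate why defining the `type_cast` hook on the null object lets the inherited `value` method work:

```ruby
# Hypothetical, simplified attribute hierarchy; not the Active Record source.
class NullSketchAttribute
  attr_reader :name, :value_before_type_cast

  def initialize(name, value_before_type_cast)
    @name = name
    @value_before_type_cast = value_before_type_cast
  end

  # The cast value is computed lazily by delegating to the subclass hook.
  def value
    @value = type_cast(value_before_type_cast) unless defined?(@value)
    @value
  end

  # Subclasses are expected to override this hook.
  def type_cast(_value)
    raise NotImplementedError
  end
end

# Null object for an attribute that has no backing column.
class NullSketchNull < NullSketchAttribute
  def initialize(name)
    super(name, nil)
  end

  # Defining the hook means the inherited #value needs no override.
  def type_cast(*)
    nil
  end
end

NullSketchNull.new("nickname").value # nil instead of NotImplementedError
```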

02 Oct, 2015 1 commit

Further encapsulate dirty checking on `Attribute` · 07723c23

Committed by Sean Griffin on Sep 28, 2015

We can skip the allocation of a full `AttributeSet` by changing the
semantics of how we structure things. Instead of comparing two separate
`AttributeSet` objects, an `Attribute` is now a singly linked list of
every change that has happened to it. Since the attribute objects are
immutable, to apply the changes we simply need to copy the head of the
list.
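A minimal sketch of the linked-list idea, assuming a hypothetical `ChainSketchAttribute` class (the method names are chosen for illustration and the real objects carry more state):

```ruby
# Hypothetical immutable attribute; each write returns a new node that
# points back at the node it replaced.
class ChainSketchAttribute
  attr_reader :value, :original_attribute

  def initialize(value, original_attribute = nil)
    @value = value
    @original_attribute = original_attribute
    freeze
  end

  # Assigning never mutates; it prepends a new head to the list.
  def with_value_from_user(new_value)
    self.class.new(new_value, self)
  end

  # Walk to the tail of the list to find the value we started with.
  def original_value
    original_attribute ? original_attribute.original_value : value
  end

  def changed?
    value != original_value
  end

  # "Applying" the changes: copy the head and drop the history behind it.
  def forgetting_assignment
    self.class.new(value)
  end
end

loaded = ChainSketchAttribute.new(1)
edited = loaded.with_value_from_user(2).with_value_from_user(3)
saved  = edited.forgetting_assignment
```

Here `edited.changed?` is true and `edited.original_value` is `1`, while `saved` keeps the value `3` with no pending change.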

It's worth noting that this causes one subtle change in the behavior of
AR. When a record is saved successfully, the `before_type_cast` version
of everything will be what was sent to the database. I honestly think
these semantics make more sense, as we could have just as easily had the
DB do `RETURNING *` and updated the record with those if we had things
like timestamps implemented at the DB layer.

This brings our performance closer to 4.2, but we're still not quite
there.

29 Sep, 2015 1 commit

Inline `Attribute#original_value` · e950c4b4

Committed by Sean Griffin on Sep 28, 2015

The external uses of this method have been removed, and I'd like to
internally re-use that name, as I'm planning to encapsulate `changed?`
into the attribute object itself.

25 Sep, 2015 1 commit

Clean up the implementation of AR::Dirty · 8e633e50

Committed by Sean Griffin on Sep 24, 2015

This moves a bit more of the logic required for dirty checking into the
attribute objects. I had hoped to remove the `with_value_from_database`
stuff, but unfortunately just calling `dup` on the attribute objects
isn't enough, since the values might contain deeply nested data
structures. I think this can be cleaned up further.
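The problem with `dup` can be seen with plain Ruby values. This sketch does not show how `with_value_from_database` works; it only illustrates why a shallow copy is not enough, and `Marshal` is used here purely as an assumed way to get a deep copy:

```ruby
original = { "tags" => ["ruby"] }

# dup copies the outer Hash only; the nested Array is still shared.
shallow = original.dup
shallow["tags"] << "rails"
original["tags"] # now also contains "rails"

# A deep copy (here via Marshal, as an illustration) is independent.
deep = Marshal.load(Marshal.dump(original))
deep["tags"] << "yaml"
original["tags"] # unaffected by the change to `deep`
```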

This makes most dirty checking lazy, and reduces the number of object
allocations and the amount of CPU time spent when assigning a value. This
opens the door to (but doesn't quite finish) bringing the performance of
writes to a level comparable to 4.1.

18 Feb, 2015 3 commits

`type_cast_from_user` -> `cast` · 9ca6948f

Committed by Sean Griffin on Feb 17, 2015

`type_cast_for_database` -> `serialize` · 1455c4c2

Committed by Sean Griffin on Feb 17, 2015

`Type#type_cast_from_database` -> `Type#deserialize` · 4a3cb840

Committed by Sean Griffin on Feb 17, 2015
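To show how the three renamed methods relate to each other, here is a sketch of a type object written without Rails. `ListSketchType` is hypothetical; only the method names `cast`, `serialize`, and `deserialize` come from the commits above:

```ruby
# Hypothetical type that stores a list of strings as a comma-separated column.
class ListSketchType
  # cast: user input -> Ruby value (formerly type_cast_from_user)
  def cast(value)
    case value
    when nil then nil
    when String then value.split(",").map(&:strip)
    else Array(value).map(&:to_s)
    end
  end

  # serialize: Ruby value -> database value (formerly type_cast_for_database)
  def serialize(value)
    value.nil? ? nil : value.join(",")
  end

  # deserialize: database value -> Ruby value (formerly type_cast_from_database)
  def deserialize(value)
    value.nil? ? nil : value.split(",")
  end
end

type = ListSketchType.new
type.deserialize(type.serialize(type.cast("a, b")))
```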

01 Feb, 2015 1 commit

Attribute assignment and type casting has nothing to do with columns · 70ac0729

Committed by Sean Griffin on Jan 30, 2015

It's finally finished!!!!!!! The reason the Attributes API was kept
private in 4.2 was due to some publicly visible implementation details.
It was previously implemented by overloading `columns` and
`columns_hash`, to make them return column objects which were modified
with the attribute information.

This meant that those methods LIED! We didn't change the database
schema. We changed the attribute information on the class. That is
wrong! It should be the other way around, where schema loading just
calls the attributes API for you. And now it does!

Yes, this means that there is nothing that happens in automatic schema
loading that you couldn't manually do yourself. (There's still some
funky cases where we hit the connection adapter that I need to handle,
before we can turn off automatic schema detection entirely.)
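The relationship described above, where schema loading is just a caller of the attributes API, can be sketched as follows. Everything here (`SchemaSketchModel`, `load_schema`, the symbol types) is a hypothetical simplification, not the Active Record code:

```ruby
class SchemaSketchModel
  def self.attribute_types
    @attribute_types ||= {}
  end

  # Hypothetical stand-in for the attributes API: register a name and a type.
  def self.attribute(name, type)
    attribute_types[name.to_s] = type
  end

  # Hypothetical schema loader: it has no special powers, it only calls
  # `attribute` for every column it is told about.
  def self.load_schema(columns)
    columns.each { |name, type| attribute(name, type) }
  end
end

# Automatic path: columns discovered from the (pretend) database.
class SchemaSketchPost < SchemaSketchModel; end
SchemaSketchPost.load_schema("id" => :integer, "title" => :string)

# Manual path: the same result, declared by hand.
class SchemaSketchComment < SchemaSketchModel; end
SchemaSketchComment.attribute(:id, :integer)
SchemaSketchComment.attribute(:title, :string)
```

Both classes end up with identical attribute information, which is the point of the paragraph above.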

There were a few weird test failures caused by this that had to be
fixed. The main source came from the fact that the attribute methods are
now defined in terms of `attribute_names`, which has a clause like
`return [] unless table_exists?`. I don't *think* this is an issue,
since the only place this caused failures were in a fake adapter which
didn't override `table_exists?`.

Additionally, there were a few cases where tests were failing because a
migration was run, but the model was not reloaded. I'm not sure why
these started failing from this change; I might need to clear an
additional cache in `reload_schema_from_cache`. Again, since this is not
normal usage, and it's expected that `reset_column_information` will be
called after the table is modified, I don't think it's a problem.

Still, test failures that were unrelated to the change are worrying, and
I need to dig into them further.

Finally, I spent a lot of time debugging issues with the mutex used in
`define_attribute_methods`. I think we can just remove that method
entirely, and define the attribute methods *manually* in the call to
`define_attribute`, which would simplify the code *tremendously*.

OK, now to make this damn thing public, and work on moving it up to
Active Model.

28 Jan, 2015 2 commits

Remove Relation#bind_params · b06f64c3

Committed by Sean Griffin on Jan 27, 2015

`bound_attributes` is now used universally across the board, removing
the need for the conversion layer. These changes are mostly mechanical,
with the exception of the log subscriber. Additionally, we had to
implement `hash` on the attribute objects, so they could be used as a
key for query caching.
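For an object to work as a Hash key, Ruby needs `hash` and `eql?` to agree. A minimal sketch, assuming a hypothetical `BoundSketchAttribute` class and a plain Hash standing in for the query cache:

```ruby
# Hypothetical bound attribute; equal name and value means equal object.
class BoundSketchAttribute
  attr_reader :name, :value

  def initialize(name, value)
    @name = name
    @value = value
  end

  def ==(other)
    self.class == other.class && name == other.name && value == other.value
  end
  alias_method :eql?, :==

  # Must return the same number for objects that are eql? to each other.
  def hash
    [self.class, name, value].hash
  end
end

# A plain Hash stands in for the query cache, keyed by SQL plus binds.
query_cache = {}
query_cache[["SELECT * FROM posts WHERE id = ?", [BoundSketchAttribute.new("id", 1)]]] = :cached_rows
```

A later lookup with a freshly built but equal `BoundSketchAttribute` finds the same entry; without `hash` and `eql?` it would miss.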

All subclasses of `Attribute` should be private constants · 3a551b97

Committed by Sean Griffin on Jan 27, 2015

21 Jan, 2015 1 commit

Introduce `ActiveRecord::Base#accessed_fields` · be9b6803

Committed by Sean Griffin on Jan 20, 2015

This method can be used to see all of the fields on a model which have
been read. This can be useful during development mode to quickly find
out which fields need to be selected. For performance critical pages, if
you are not using all of the fields of a database, an easy performance
win is only selecting the fields which you need. By calling this method
at the end of a controller action, it's easy to determine which fields
need to be selected.

While writing this, I also noticed a place for an easy performance win
internally which I had been wanting to introduce. You cannot mutate a
field which you have not read. Therefore, we can skip the calculation of
in place changes if we have never read from the field. This can
significantly speed up methods like `#changed?` if any of the fields
have an expensive mutable type (like `serialize`)

```
Calculating -------------------------------------
 #changed? with serialized column (before)
                       391.000  i/100ms
 #changed? with serialized column (after)
                         1.514k i/100ms
-------------------------------------------------
 #changed? with serialized column (before)
                          4.243k (± 3.7%) i/s -     21.505k
 #changed? with serialized column (after)
                         16.789k (± 3.2%) i/s -     84.784k
```
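Both ideas in this commit, tracking which fields were read and skipping in-place change detection for fields that were never read, can be sketched without Rails. `AccessSketchRecord` and its methods are hypothetical simplifications:

```ruby
require "set"

class AccessSketchRecord
  def initialize(raw)
    @raw = raw          # values as loaded; never handed out directly
    @values = {}        # copies handed to callers, created on first read
    @accessed = Set.new
  end

  # Reading a field records the access and returns a mutable copy.
  def read(name)
    @accessed << name
    @values[name] ||= @raw.fetch(name).dup
  end

  def accessed_fields
    @accessed.to_a
  end

  # A field that was never read cannot have been mutated in place,
  # so only the accessed fields need to be compared.
  def changed_in_place?
    @accessed.any? { |name| @values[name] != @raw[name] }
  end
end

record = AccessSketchRecord.new("tags" => ["a"], "title" => "Hello")
record.read("tags") << "b"
record.accessed_fields   # only "tags" was read
record.changed_in_place? # compares "tags" only, never touches "title"
```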

15 Jan, 2015 1 commit

Only use the `_before_type_cast` in the form when from user input · d8e71041

Committed by Sean Griffin on Jan 14, 2015

While we don't want to change the form input when validations fail,
blindly using `_before_type_cast` will cause the input to display the
wrong data for any type which does additional work on database values.
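The rule in the title can be sketched as a single branch. The struct and helper below are hypothetical; they only illustrate choosing between the raw input and the cast value depending on where the attribute came from:

```ruby
# Hypothetical attribute snapshot: raw value, cast value, and its origin.
FormSketchAttribute = Struct.new(:value_before_type_cast, :value, :came_from_user)

# Pick what a form input should display for this attribute.
def sketch_form_value(attribute)
  if attribute.came_from_user
    # Redisplay exactly what the user typed, even if validation failed.
    attribute.value_before_type_cast
  else
    # Loaded from the database: the raw column value may not be presentable.
    attribute.value
  end
end

typed  = FormSketchAttribute.new("12abc", 12, true)
loaded = FormSketchAttribute.new("---\n- a\n", ["a"], false)

sketch_form_value(typed)  # the user's original text
sketch_form_value(loaded) # the deserialized Ruby value
```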

17 Dec, 2014 1 commit

`update_column` take ruby-land input, not database-land input · dd8b5fb9

Committed by Sean Griffin on Dec 16, 2014

In the case of serialized columns, we would expect the unserialized
value as input, not the serialized value. The original issue which made
this distinction, #14163, introduced a bug. If you passed serialized
input to the method, it would double serialize when it was sent to the
database. You would see the wrong input upon reloading, or get an error
if you had a specific type on the serialized column.

To put it another way, `update_column` is a special case of
`update_all`, which would take `['a']` and not `['a'].to_yaml`, but you
would not pass data from `params` to it.
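The double serialization described above is easy to reproduce with YAML alone. In this sketch `sketch_serialize` is a hypothetical stand-in for what a serialized column does before writing to the database:

```ruby
require "yaml"

# Hypothetical stand-in for the serialization step of a serialized column.
def sketch_serialize(value)
  YAML.dump(value)
end

ruby_land     = ["a"]
database_land = sketch_serialize(ruby_land)

stored_correctly = sketch_serialize(ruby_land)     # input was Ruby-land
stored_twice     = sketch_serialize(database_land) # input was already YAML

YAML.load(stored_correctly) # an Array again
YAML.load(stored_twice)     # a String of YAML, not an Array
```

Reloading the doubly serialized value yields a String, which is the "wrong input upon reloading" the message refers to.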

Fixes #18037

17 Aug, 2014 1 commit

Implement `_was` and `changes` for in-place mutations of AR attributes · 877ea784

Committed by Sean Griffin on Jul 12, 2014

16 Aug, 2014 1 commit

Implement `==` on `Type::Value` and `Attribute` · bc153cff

Committed by Sean Griffin on Aug 15, 2014

This was a small self contained piece of the refactoring that I am
working on, which required these objects to be comparable.

26 Jun, 2014 3 commits

Move writing unknown column exception to null attribute · 3bc314e6

Committed by Sean Griffin on Jun 26, 2014

Making this change revealed several subtle bugs related to models with
no primary key, and anonymous classes. These have been fixed as well,
with regression tests added.
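Moving the exception onto a null object means the collection no longer needs a conditional for unknown names. A sketch under assumed names (`WriteSketchSet`, `WriteSketchNull`, and the error class are all hypothetical):

```ruby
class WriteSketchUnknownAttributeError < StandardError; end

# A known attribute: writing returns a new object holding the new value.
class WriteSketchAttribute
  attr_reader :name, :value

  def initialize(name, value)
    @name = name
    @value = value
  end

  def with_value_from_user(new_value)
    self.class.new(name, new_value)
  end
end

# The null attribute: reading is harmless, writing raises.
class WriteSketchNull
  def initialize(name)
    @name = name
  end

  def value
    nil
  end

  def with_value_from_user(_new_value)
    raise WriteSketchUnknownAttributeError, "can't write unknown attribute `#{@name}`"
  end
end

class WriteSketchSet
  def initialize(attributes)
    @attributes = attributes
  end

  # Unknown names fall back to the null attribute instead of nil.
  def [](name)
    @attributes.fetch(name) { WriteSketchNull.new(name) }
  end

  # No branching here: the attribute object decides what a write means.
  def write(name, value)
    @attributes[name] = self[name].with_value_from_user(value)
  end
end
```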

`Attribute` should know about its name · bb7bc499

Committed by Sean Griffin on Jun 22, 2014

This allows using polymorphism for the behavior where uninitialized
attributes raise an exception.

Encapsulate the creation of `Attribute` objects · 14b1208d

Committed by Sean Griffin on Jun 22, 2014

This will make it less painful to add additional properties, which
should persist across writes, such as `name`.

Conflicts:
	activerecord/lib/active_record/attribute_set.rb

25 Jun, 2014 1 commit

Move behavior of `read_attribute` to `AttributeSet` · a89f8a92

Committed by Sean Griffin on Jun 22, 2014

Moved `Builder` to its own file, as it started looking very weird once I
added private methods to the `AttributeSet` class and the `Builder`
class started to grow.

I would like to refactor `fetch_value` to

```ruby
self[name].value(&block)
```

But that requires the attributes to know about their name, which they
currently do not.
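Why the attribute needs its name for that refactoring becomes clearer in a sketch. All class names here are hypothetical; the idea is that only the uninitialized attribute uses the block, and it has to pass its own name to it:

```ruby
# A loaded attribute ignores any block and returns its value.
class FetchSketchLoaded
  def initialize(name, value)
    @name = name
    @value = value
  end

  def value
    @value
  end
end

# An uninitialized attribute hands its name to the caller's block.
class FetchSketchUninitialized
  def initialize(name)
    @name = name
  end

  def value
    yield @name if block_given?
  end
end

class FetchSketchSet
  def initialize(attributes)
    @attributes = attributes
  end

  def [](name)
    @attributes.fetch(name) { FetchSketchUninitialized.new(name) }
  end

  # The one-liner from the commit message, made possible by named attributes.
  def fetch_value(name, &block)
    self[name].value(&block)
  end
end

set = FetchSketchSet.new("title" => FetchSketchLoaded.new("title", "Hello"))
set.fetch_value("title") { |n| "missing #{n}" }
set.fetch_value("body")  { |n| "missing #{n}" }
```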

24 Jun, 2014 1 commit

add missing `:nodoc:` for recent refactorings. [ci skip] · b27e856d

Committed by Yves Senn on Jun 24, 2014

Adding `# :nodoc:` to the parent `class` / `module` is not going
to ignore nested classes or modules.

There is a modifier `# :nodoc: all` but sadly the containing class
or module will continue to be in the docs.
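Given the two points above, the practical pattern is to mark every nested constant individually. A sketch with hypothetical names:

```ruby
# The directive on the outer module does not cover Builder,
# so the nested class carries its own `# :nodoc:` as well.
module NodocSketchAttributeSet # :nodoc:
  class Builder # :nodoc:
    def build
      {}
    end
  end
end
```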

/cc @sgrif

17 Jun, 2014 1 commit

Refactor in-place dirty checking to use the attribute object · 218105f5

Committed by Sean Griffin on Jun 16, 2014

14 Jun, 2014 1 commit

Introduce an Attribute object to handle the type casting dance · 6f08db05

Committed by Sean Griffin on Jun 07, 2014

There's a lot more that can be moved to these, but this felt like a good
place to introduce the object. Plans are:

- Remove all knowledge of type casting from the columns, beyond a
  reference to the cast_type
- Move type_cast_for_database to these objects
- Potentially make them mutable, introduce a state machine, and have
  dirty checking handled here as well
- Move `attribute`, `decorate_attribute`, and anything else that
  modifies types to mess with this object, not the columns hash
- Introduce a collection object to manage these, reduce allocations, and
  not require serializing the types
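As a rough picture of what such an object does, here is a sketch that pairs a raw value with a type and casts lazily. The class names are hypothetical, and the type uses the later `cast` and `deserialize` method names rather than the ones in use when this commit was written:

```ruby
# Hypothetical integer type with separate paths for the two input sources.
class CastSketchIntegerType
  def cast(input)
    input.to_s.strip.empty? ? nil : input.to_i
  end

  def deserialize(raw)
    raw.nil? ? nil : Integer(raw)
  end
end

# Holds the raw value and remembers which conversion applies to it.
class CastSketchAttribute
  attr_reader :value_before_type_cast

  def self.from_database(raw, type)
    new(raw, type, :deserialize)
  end

  def self.from_user(input, type)
    new(input, type, :cast)
  end

  def initialize(value_before_type_cast, type, cast_method)
    @value_before_type_cast = value_before_type_cast
    @type = type
    @cast_method = cast_method
  end

  # Cast on first read and remember the result.
  def value
    @value = @type.public_send(@cast_method, @value_before_type_cast) unless defined?(@value)
    @value
  end
end

type = CastSketchIntegerType.new
CastSketchAttribute.from_database("42", type).value
CastSketchAttribute.from_user(" 7 ", type).value
```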
