提交 · d72fd1e45b192ab507f8170ceec1328c2aae7bb1 · 张重言 / ruby

15 4月, 2020 11 次提交
- N
  Added rb_syserr_new_path · d72fd1e4
  由 Nobuyoshi Nakada 提交于 4月 15, 2020
```
Similar to rb_syserr_fail_path, but just returns the created
exception instance instead of raising it.
```
  d72fd1e4
- N
  Create succ_index_table as a part of `iseq_setup` · 69b3e0ac
  由 Nobuyoshi Nakada 提交于 4月 15, 2020
```
With compiling `CPDEBUG >= 2`, `rb_iseq_disasm` segfaults if this
table has not been created.  Also `ibf_load_iseq_each` calls
`rb_iseq_insns_info_encode_positions`.
```
  69b3e0ac
- N
  
  Added test for `debug_level:` option of `RubyVM::InstructionSequence.compile` · a9567cc2
  由 Nobuyoshi Nakada 提交于 4月 15, 2020
  
  a9567cc2
- T
  
  This test is not testing attr_writer · 1dad9fa5
  由 Takashi Kokubun 提交于 4月 14, 2020
  
  1dad9fa5
- T
  Invalidate fastpath when calling attr_reader by super · 79f3403b
  由 Takashi Kokubun 提交于 4月 14, 2020
```
The same bug as 8355a998 existed in attr_reader too.
```
  79f3403b
- T
  Invalidate fastpath when calling attr_writer by super · 8355a998
  由 Takashi Kokubun 提交于 4月 14, 2020
```
We started to use fastpath on invokesuper when a method is not refinements
since 5c276818, but we shouldn't have used fastpath for attr_writer either.

`cc->aux_.attr_index` is for an actual receiver class, while we store
its superclass in `cc->klass` and therefore there's no way to properly
invalidate attr_writer's inline cache when it's called by super.

[Bug #16785]

I suspect the same bug also exists in attr_reader. I'll address that in
another commit.
```
  8355a998
- N
  
  Disassemble nop-inserted list · 2c12d111
  由 Nobuyoshi Nakada 提交于 4月 15, 2020
  
  2c12d111
- N
  
  Show heading for update_catch_except_flags · de2afcbf
  由 Nobuyoshi Nakada 提交于 4月 15, 2020
  
  de2afcbf
- N
  
  Shrink diassembled result string · f9822d17
  由 Nobuyoshi Nakada 提交于 4月 15, 2020
  
  f9822d17
- G
  
  * 2020-04-15 [ci skip] · 0d6737cb
  由 git 提交于 4月 15, 2020
  
  0d6737cb
- N
  
  Disallow line-continuation before R-assign · 478135f4
  由 Nobuyoshi Nakada 提交于 4月 15, 2020
  
  478135f4
14 4月, 2020 12 次提交

N
Removed duplicate value_expr checks · a520ee47
由 Nobuyoshi Nakada 提交于 4月 14, 2020
```
`arg_rhs` has the same check and is always a non-void value
expression.
```
a520ee47

由 Takashi Kokubun 提交于 4月 13, 2020

MJIT_CC seems not defined
https://ci.appveyor.com/project/ruby/ruby/builds/32161572/job/u5sw8yn4in87heki

f8886265

N
[ruby/date] Suppress -Wchar-subscripts warnings by Cygwin gcc 9.3.0 · 7a85d31c
由 Nobuyoshi Nakada 提交于 4月 14, 2020
```
https://github.com/ruby/date/commit/9968eb69f0
```
7a85d31c
T

Add missing call in 70b7304f · e6677503
由 Takashi Kokubun 提交于 4月 13, 2020

e6677503

Ignore AppVeyor vs120's pdb corruption · 70b7304f

由 Takashi Kokubun 提交于 4月 13, 2020

We tried to fix this like https://github.com/ruby/ruby/pull/3029, but it
didn't work. The failure has never been helpful for developing MJIT, and
currently it's not prioritized to be fixed. Until we try to figure out
the root cause on AppVeyor vs120, let's optionally disable testing when
the random corruption happens.

70b7304f

Unify vm benchmark prefixes to vm_ (#3028) · f883d621

由 Takashi Kokubun 提交于 4月 13, 2020

The vm1_ prefix and vm2_ had had special meaning until
820ad9cb and
12068aa4. AFAIK there's no special
meaning in vm3_ prefix.

As they have confused people (like "In `benchmark` what is difference
between `vm1_`, `vm2_` and `vm3_`"), I'd like to remove the obsoleted
prefix as we obviated that two years ago.

f883d621

N
Make y.output in ripper unlocalized [ci skip] · 9fa24018
由 Nobuyoshi Nakada 提交于 4月 14, 2020
```
Often it is easy to search, grep, etc from command line, for
debugging purpose.
```
9fa24018
K

Add {Regexp,String}#match with block to call-seq [ci skip] · c79e3a59
由 Kazuhiro NISHIYAMA 提交于 4月 14, 2020

c79e3a59

Make vm_call_cfunc_with_frame a fastpath (#3027) · 310ef9f4

由 Takashi Kokubun 提交于 4月 13, 2020

when there's no need to call CALLER_SETUP_ARG and CALLER_REMOVE_EMPTY_KW_SPLAT
(i.e. !rb_splat_or_kwargs_p(ci) && !calling->kw_splat).

Micro benchmark:
```
$ benchmark-driver -v --rbenv 'before;after' benchmark/vm_send_cfunc.yml --repeat-count=4
before: ruby 2.8.0dev (2020-04-13T23:45:05Z master b9d3ceee) [x86_64-linux]
after: ruby 2.8.0dev (2020-04-14T00:48:52Z no-splat-fastpath 418d363722) [x86_64-linux]
Calculating -------------------------------------
                         before       after
       vm_send_cfunc    69.585M     88.724M i/s -    100.000M times in 1.437097s 1.127096s

Comparison:
                    vm_send_cfunc
               after:  88723605.2 i/s
              before:  69584737.1 i/s - 1.28x  slower
```

Optcarrot:
```
$ benchmark-driver -v --rbenv 'before;after' benchmark.yml --repeat-count=12 --output=all
before: ruby 2.8.0dev (2020-04-13T23:45:05Z master b9d3ceee) [x86_64-linux]
after: ruby 2.8.0dev (2020-04-14T00:48:52Z no-splat-fastpath 418d363722) [x86_64-linux]
Calculating -------------------------------------
                                       before                 after
Optcarrot Lan_Master.nes    50.76119601545175     42.73858236484051 fps
                            50.76388649761503     51.04211379912850
                            50.80930672252514     51.39455790755538
                            50.90236000778749     51.75656936556145
                            51.01744746340430     51.86875277356489
                            51.06495279015112     51.88692482485558
                            51.07785337168974     51.93429603190578
                            51.20163525187862     51.95768145071314
                            51.34671771913112     52.45577266040274
                            51.35918340835583     52.53163888762858
                            51.46641337418146     52.62172484121034
                            51.50835463462257     52.85064021113239
```

310ef9f4

Unwrap vm_call_cfunc indirection on JIT · b9d3ceee

由 Takashi Kokubun 提交于 4月 13, 2020

for VM_METHOD_TYPE_CFUNC.

This has been known to decrease optcarrot fps:

```
$ benchmark-driver -v --rbenv 'before --jit;after --jit' benchmark.yml --repeat-count=24 --output=all
before --jit: ruby 2.8.0dev (2020-04-13T16:25:13Z master fb40495c) +JIT [x86_64-linux]
after --jit: ruby 2.8.0dev (2020-04-13T23:23:11Z mjit-inline-c bdcd06d159) +JIT [x86_64-linux]
Calculating -------------------------------------
                                 before --jit           after --jit
Optcarrot Lan_Master.nes    66.38132676191719     67.41369177299630 fps
                            69.42728743772243     68.90327567263054
                            72.16028300263211     69.62605130880686
                            72.46631319102777     70.48818243767207
                            73.37078877002490     70.79522887347566
                            73.69422431217367     70.99021920193194
                            74.01471487018695     74.69931965402584
                            75.48685183295630     74.86714575949016
                            75.54445264507932     75.97864419721677
                            77.28089738169756     76.48908637569581
                            78.04183397891302     76.54320932488021
                            78.36807984096562     76.59407262898067
                            78.92898762543574     77.31316743361343
                            78.93576483233765     77.97153484180480
                            79.13754917503078     77.98478782102325
                            79.62648945850653     78.02263322726446
                            79.86334213878064     78.26333724045934
                            80.05100635898518     78.60056756355614
                            80.26186843769584     78.91082645644468
                            80.34205717020330     79.01226659142263
                            80.62286066044338     79.32733939423721
                            80.95883033058557     79.63793060542024
                            80.97376819251613     79.73108936622778
                            81.23050939202896     80.18280109433088
```

and I deleted this capability in an early stage of YARV-MJIT development:
https://github.com/k0kubun/yarv-mjit/commit/0ab130feeefc2b9078a1077e4fec93b3f5e45d07

I suspect either of the following things could be the cause:

* Directly calling vm_call_cfunc requires more optimization effort in GCC,
  resulting in 30ms-ish compilation time increase for such methods and
  decreasing the number of methods compiled in a benchmarked period.

* Code size increase => icache miss hit

These hypotheses could be verified by some methodologies. However, I'd
like to introduce this regardless of the result because this blocks
inlining C method's definition.

I may revert this commit when I give up to implement inlining C method
definition, which requires this change.

Microbenchmark-wise, this gives slight performance improvement:

```
$ benchmark-driver -v --rbenv 'before --jit;after --jit' benchmark/mjit_send_cfunc.yml --repeat-count=4
before --jit: ruby 2.8.0dev (2020-04-13T16:25:13Z master fb40495c) +JIT [x86_64-linux]
after --jit: ruby 2.8.0dev (2020-04-13T23:23:11Z mjit-inline-c bdcd06d159) +JIT [x86_64-linux]
Calculating -------------------------------------
                     before --jit  after --jit
     mjit_send_cfunc      41.961M      56.489M i/s -    100.000M times in 2.383143s 1.770244s

Comparison:
                  mjit_send_cfunc
         after --jit:  56489372.5 i/s
        before --jit:  41961388.1 i/s - 1.35x  slower
```

b9d3ceee

G

* 2020-04-14 [ci skip] · fb40495c
由 git 提交于 4月 14, 2020

fb40495c
B
Add a a list of cases for which clock_getres() has been observed to be inaccurate · a6f7458e
由 Benoit Daloze 提交于 4月 13, 2020
```
* See [Bug #16740]
```
a6f7458e

13 4月, 2020 7 次提交

B

Improve Hash documentation. · c28e230a
由 Burdette Lamar 提交于 4月 13, 2020

c28e230a
N

Allow simple R-assign in endless def · 67bcac87
由 Nobuyoshi Nakada 提交于 4月 13, 2020

67bcac87

卜

delete CACHELINE · 5dc6080c

由卜部昌平提交于 4月 13, 2020

Since https://github.com/ruby/ruby/pull/2888 this macro is no longer
used in any place.

5dc6080c

卜

include what you use. · c37a357c

由卜部昌平提交于 4月 12, 2020

This reverts commit 443389ef.
This reverts commit d94960f2.

Inclusion of header files must be explicit.  Every file shall directly
include what is necessary.

https://github.com/include-what-you-use/include-what-you-use says:

> When every file includes what it uses, then it is possible to edit any
> file and remove unused headers, without fear of accidentally breaking
> the upwards dependencies of that file. It also becomes easy to
> automatically track and update dependencies in the source code.

Though we don't use iwyu itself, the principle quoted above is a good
thing that we can agree.

Now that include guards were added to every and all of the headers
inside of our project this changeset does not increase compile time, at
least on my machine.

c37a357c

卜

add #include guard hack · 4ff3f205

由卜部昌平提交于 4月 10, 2020

According to MSVC manual (*1), cl.exe can skip including a header file
when that:

- contains #pragma once, or
- starts with #ifndef, or
- starts with #if ! defined.

GCC has a similar trick (*2), but it acts more stricter (e. g. there
must be _no tokens_ outside of #ifndef...#endif).

Sun C lacked #pragma once for a looong time.  Oracle Developer Studio
12.5 finally implemented it, but we cannot assume such recent version.

This changeset modifies header files so that each of them include
strictly one #ifndef...#endif.  I believe this is the most portable way
to trigger compiler optimizations. [Bug #16770]

*1: https://docs.microsoft.com/en-us/cpp/preprocessor/once
*2: https://gcc.gnu.org/onlinedocs/cppinternals/Guard-Macros.html

4ff3f205

T

Add MJIT_COUNTER macro to dump total_calls · a3f6f679
由 Takashi Kokubun 提交于 4月 12, 2020

a3f6f679

Avoid UB with flexible array member · 82fdffc5

由 Alan Wu 提交于 4月 12, 2020

Accessing past the end of an array is technically UB. Use C99 flexible
array member instead to avoid the UB and simplify allocation size
calculation.

See also: DCL38-C in the SEI CERT C Coding Standard

82fdffc5

12 4月, 2020 8 次提交

G

* 2020-04-13 [ci skip] · f2c3848a
由 git 提交于 4月 13, 2020

f2c3848a
N
Make rb_scan_args implementations same · 4c8e3f12
由 Nobuyoshi Nakada 提交于 4月 12, 2020
```
between rb_scan_args_set and rb_scan_args_assign +
rb_scan_args_result.
```
4c8e3f12
N

Honor COLUMNS [Feature #16754] · a07cbacd
由 Nobuyoshi Nakada 提交于 4月 12, 2020

a07cbacd
N

Hightlight usage [Feature #16754] · cc68d2fb
由 Nobuyoshi Nakada 提交于 3月 27, 2020

cc68d2fb
N

Set up environment variable for pager [Feature #16754] · 3825662d
由 Nobuyoshi Nakada 提交于 4月 12, 2020

3825662d
N

PAGER without fork&exec too [Feature #16754] · e6551d83
由 Nobuyoshi Nakada 提交于 2月 04, 2020

e6551d83

View the help message with PAGER [Feature #16754] · f22c4ff3

由 Nobuyoshi Nakada 提交于 2月 04, 2020

View the help message wth pager designed by RUBY_PAGER or PAGER
environment variable, unless that value is empty.

f22c4ff3

Enable fastpath on invokesuper (#3021) · 5c276818

由 Takashi Kokubun 提交于 4月 11, 2020

Fastpath has not been used for invokesuper since it has set vm_call_super_method on every invocation.
Because it seems to be blocked only by refinements, try enabling fastpath on invokesuper when cme is not for refinements.

While this patch itself should be helpful for VM performance, a part of this patch's motivation is to unblock inlining invokesuper on JIT. 

$ benchmark-driver -v --rbenv 'before;after' benchmark/vm2_super.yml --repeat-count=4
before: ruby 2.8.0dev (2020-04-11T15:19:58Z master a01bda59) [x86_64-linux]
after: ruby 2.8.0dev (2020-04-12T02:00:08Z invokesuper-fastpath c171984ee3) [x86_64-linux]
Calculating -------------------------------------
                         before       after
           vm2_super    20.031M     32.860M i/s -      6.000M times in 0.299534s 0.182593s

Comparison:
                        vm2_super
               after:  32859885.2 i/s
              before:  20031097.3 i/s - 1.64x  slower

5c276818

11 4月, 2020 2 次提交
- G
  
  * 2020-04-12 [ci skip] · a01bda59
  由 git 提交于 4月 12, 2020
  
  a01bda59
- N
  
  Relaxed of R-assign value to arg · 022c7bbe
  由 Nobuyoshi Nakada 提交于 4月 12, 2020
  
  022c7bbe