提交 · 11b3c588cea43cec164ff31e0d3c72952f515d5a · OpenXiangShan / XiangShan

26 3月, 2021 2 次提交
- A
  Pass enablePerf to BlockInclusiveCache. · 11b3c588
  由 Allen 提交于 3月 26, 2021
```
L2 and L3 Only enablePerf when XSCore enables perf.
```
  11b3c588
- A
  
  Add performance counters for L2. · c5c804af
  由 Allen 提交于 3月 26, 2021
  
  c5c804af
25 3月, 2021 4 次提交

Refactor XSPerf, now we have three XSPerf Functions. · 408a32b7

由 Allen 提交于 3月 25, 2021

XSPerfAccumulate: sum up performance values.
XSPerfHistogram: count the occurrence of performance values, split them
into bins, so that we can estimate their distribution.
XSPerfMax: get max of performance values.

408a32b7

Added several performance counters to L1DCache. · e0a152a4

由 Allen 提交于 3月 25, 2021

Not tested yet.

Added:
* L1 MSHR occupation
* L1 MSHR latency
* L1 Load Miss latency
* L1 Store latency
* L1 Store occupation
* L1 Load req count

e0a152a4

A

Add a TransactionLatencyCounter to utils. · 125034f7
由 Allen 提交于 3月 25, 2021

125034f7

Add a new apply function to XSPerf. · cb4c13a1

由 Allen 提交于 3月 25, 2021

Now we can put a performance value into several bins and count them.
In this way, we can get a distribution of this performance value.

cb4c13a1

24 3月, 2021 3 次提交
- L
  
  RS: every rs has its own iqSize now (#710) · 61704268
  由 Lemover 提交于 3月 24, 2021
  
  61704268
- L
  
  ReservationStation: fixed incorrect use of 'pc' (#709) · f432c814
  由 ljw 提交于 3月 24, 2021
  
  f432c814
- Y
  
  TLTimer: change default freq to 1000000 (#708) · 298aa395
  由 Yinan Xu 提交于 3月 24, 2021
  
  298aa395
23 3月, 2021 1 次提交
- Y
  
  sbuffer: init flush counter to avoid X state (#707) · a1b789cf
  由 Yinan Xu 提交于 3月 23, 2021
  
  a1b789cf
22 3月, 2021 8 次提交
- Y
  
  jump: use lower 39bits of target pc to generate isMisPred (#706) · 5b914e39
  由 Yinan Xu 提交于 3月 22, 2021
  
  5b914e39
- L
  
  Beu: separate l1plus and icache (#705) · 4e3ce935
  由 ljw 提交于 3月 22, 2021
  
  4e3ce935
- Y
  Merge pull request #704 from RISCVERS/update-soc · 6d78a15a
  由 Yinan Xu 提交于 3月 22, 2021
```
Update SoC and emu configurations
```
  6d78a15a
- Y
  
  github,ci: reduce used cores · 7e587639
  由 Yinan Xu 提交于 3月 22, 2021
  
  7e587639
- Y
  
  makefile: use larger --output-split to reduce cpp files · ffd5ea39
  由 Yinan Xu 提交于 3月 22, 2021
  
  ffd5ea39
- Y
  Merge pull request #699 from RISCVERS/add-beu · eb021a4b
  由 Yinan Xu 提交于 3月 22, 2021
```
Add bus error unit and connect ecc errors to beu
```
  eb021a4b
- Z
  MissQueue: add perf cnt for inflight entries in maximum (#700) · 83d6150b
  由 zhanglinjuan 提交于 3月 22, 2021
```
* MissQueue: add perf cnt for inflight entries in maximum

* MissQueue: max_inflight ignores cycles when missQueue is empty
```
  83d6150b
- L
  
  RS: add some signals' init value (#703) · fb9ab422
  由 Lemover 提交于 3月 22, 2021
  
  fb9ab422
21 3月, 2021 1 次提交
- Y
  
  top: add TLXbar below L3 · 329e267d
  由 Yinan Xu 提交于 3月 21, 2021
  
  329e267d
20 3月, 2021 1 次提交
- Y
  PMA: change the reserved off-chip address space to RW · 3111281e
  由 Yinan Xu 提交于 3月 20, 2021
```
This allows the software to determine whether an address
can be read or written.
```
  3111281e
19 3月, 2021 8 次提交
- J
  
  L1plusCache: add error io. · bc72443c
  由 jinyue110 提交于 3月 19, 2021
  
  bc72443c
- J
  
  ICache: add error IO · ab219f87
  由 jinyue110 提交于 3月 19, 2021
  
  ab219f87
- L
  
  Top: add beu · 2e3a956e
  由 LinJiawei 提交于 3月 19, 2021
  
  2e3a956e
- L
  
  Soc: insert a buffer between L3 and dram · 953a0310
  由 LinJiawei 提交于 3月 19, 2021
  
  953a0310
- L
  
  Dcache: connect ecc to beu(not tested) · 312f3607
  由 LinJiawei 提交于 3月 19, 2021
  
  312f3607
- L
  
  Merge remote-tracking branch 'origin/master' into add-beu · 99c2c3fa
  由 LinJiawei 提交于 3月 19, 2021
  
  99c2c3fa
- L
  
  Dcache: optimize way selection (#697) · 97301f30
  由 ljw 提交于 3月 19, 2021
  
  97301f30
- Y
  
  Add XSCoreWithL2 to wrap XSCore,L2 into a module (#696) · 6c4d7a40
  由 Yinan Xu 提交于 3月 19, 2021
  
  6c4d7a40
18 3月, 2021 2 次提交
- L
  
  Soc: connect beu and cores · 9637c0c6
  由 LinJiawei 提交于 3月 18, 2021
  
  9637c0c6
- L
  
  Soc: add bus error unit · 0584d3a8
  由 LinJiawei 提交于 3月 18, 2021
  
  0584d3a8
14 3月, 2021 1 次提交
- S
  btb: use single port sram to meet timing constraints (#692) · 8f6a1237
  由 Steve Gou 提交于 3月 14, 2021
```
* add perf counters for btb and ubtb
* update btb only on not hit or jalr mispredicts to reduce write stalls
```
  8f6a1237
13 3月, 2021 3 次提交
- Y
  
  emu: add --stat-cycles to dump statistics periodically (#690) · e834a6fe
  由 Yinan Xu 提交于 3月 13, 2021
  
  e834a6fe
- Y
  
  Update github ci scripts (#691) · a9d16859
  由 Yinan Xu 提交于 3月 13, 2021
  
  a9d16859
- L
  RS & DTLB: fix bug of dtlb's hit perf counter (#689) · ee46cd6e
  由 Lemover 提交于 3月 13, 2021
```
just record the tlb result(access and miss) of first issue by add
signal isFirstIssue (isFirstIssue = cntCountQueue(i) === 0.U)
```
  ee46cd6e
12 3月, 2021 3 次提交
- L
  
  RS: set tailPtr to 0 when flush (#686) · 9db43ee7
  由 Lemover 提交于 3月 12, 2021
  
  9db43ee7
- Z
  DCache: optimize situations when ldu and mainPipe contend for read port (#688) · a7817148
  由 zhanglinjuan 提交于 3月 12, 2021
```
* DCacheWrapper: MainPipe use read port 1 to ease congestion

* MainPipe: do not consider congestion with ldu0 read when disabling fast wakeup
```
  a7817148
- L
  
  RS: fix bug of wrong enq and deq perf counter (#683) · 7d0fb725
  由 Lemover 提交于 3月 12, 2021
  
  7d0fb725
11 3月, 2021 3 次提交

ci-runner: only specify a numa node for performance stability (#685) · ac54e310

由 Yinan Xu 提交于 3月 11, 2021

Previously we use numactl to specify both nodes and cpus for emu.
However, when other processes are using the same cpu, verilated emu
suffers from huge performance degradation. To avoid these scenarios,
we only specify the numa node to achieve a more stable performance.

ac54e310

Add support for a simple version of move elimination (#682) · aac4464e

由 Yinan Xu 提交于 3月 11, 2021

In this commit, we add support for a simpler version of move elimination.

The original instruction sequences are:
move r1, r0
add r2, r1, r3

The optimized sequnces are:
move pr1, pr0
add pr2, pr0, pr3 # instead of add pr2, pr1, pr3

In this way, add can be issued once r0 is ready and move seems to be eliminated.

aac4464e

Y

WaitTable: use 2-bit counter and optimize XORFold logic (#681) · e6e4a58d
由 Yinan Xu 提交于 3月 11, 2021

e6e4a58d

OpenXiangShan / XiangShan 10 个月 前同步成功

OpenXiangShan / XiangShan
10 个月前同步成功