Commits · e0374b1cef8835167a76d86691da910632f16c45 · OpenXiangShan / XiangShan

05 Feb, 2023 1 commit
- MMU: Add Fake L1 TLB (#1888) · e0374b1c
  Committed by Haoyuan Feng on Feb 05, 2023
  
  e0374b1c
04 Feb, 2023 1 commit
- Merge pull request #1875 from OpenXiangShan/ftq_c_flush · f5ecdd4e
  Committed by Steve Gou on Feb 04, 2023
  
  f5ecdd4e
19 Jan, 2023 1 commit
- L2TLB: fix page cache assert when pte_ppn access fault (#1882) · dd7fe201
  Committed by Haoyuan Feng on Jan 19, 2023
  
  dd7fe201
18 Jan, 2023 1 commit
- PTW: raise access fault when ppn high 20 bits is not zero (#1881) · 0d94d540
  Committed by Haoyuan Feng on Jan 18, 2023
  
  0d94d540
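
As a rough illustration of the check named in the commit title above, the following Python sketch flags a PTE whose PPN has any of its upper 20 bits set. The 44-bit PPN width is the Sv39 PTE field width; the function and constant names are hypothetical, and this is not the PTW's actual Chisel logic.

```python
PTE_PPN_BITS = 44                      # width of the PPN field in an Sv39 PTE
HIGH_BITS = 20                         # "high 20 bits" from the commit title
LOW_BITS = PTE_PPN_BITS - HIGH_BITS    # PPN bits that are allowed to be non-zero


def ppn_access_fault(ppn):
    """Return True if any of the upper HIGH_BITS of the PPN is set (sketch)."""
    if not 0 <= ppn < (1 << PTE_PPN_BITS):
        raise ValueError("ppn does not fit in the PTE PPN field")
    return (ppn >> LOW_BITS) != 0
```

Under these assumptions, a PPN that fits in the low 24 bits passes, and any larger PPN would be reported as an access fault.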
16 Jan, 2023 1 commit

MMU: Add L1TLB and L2TLB Resp difftest (#1879) · 5ab1b84d

Committed by Haoyuan Feng on Jan 16, 2023

* L2TLB: Add L2TLB Resp Check in difftest

* L1TLB: Add L1TLB Resp Check in difftest

* L2TLB: Do not Check Resp with difftest when access fault

* Update difftest

5ab1b84d

12 Jan, 2023 1 commit
- break ifuwbptr dependency · 2448f137
  Committed by Guokai Chen on Dec 04, 2022
  
  2448f137
11 Jan, 2023 2 commits
- fix cfiVec (#1842) · 3f88c020
  Committed by Guokai Chen on Jan 11, 2023
  
  3f88c020
- PTW: Add PTW refill check in difftest (#1872) · 9c26bab7
  Committed by Haoyuan Feng on Jan 11, 2023
  
  9c26bab7
04 Jan, 2023 1 commit

dcache: setup way predictor framework (#1857) · 144422dc

Committed by Maxpicca on Jan 04, 2023

This commit sets up a basic dcache way predictor framework and a dummy predictor.
A Way Predictor Unit (WPU) module has been added to dcache. Dcache data SRAMs
have been reorganized to support it.

The dummy predictor is disabled by default.

In addition, the dcache bank conflict check has been optimized. This may cause timing problems,
which are to be fixed in the future.

* ideal wpu

* BankedDataArray: change architecture to reduce bank_conflict

* BankedDataArray: add db analysis

* Merge: the rest

* BankedDataArray: change the logic of rrl_bank_conflict, which increases the number of rw_bank_conflict

* Load Logic: changed to be as expected

Read data is delayed by one cycle so that the selection can be made.
Write data is also delayed by one cycle before the write operation is performed.

* fix: ecc check error

* update the gitignore

* WPU: add regular wpu and change the replay mechanism

* WPU: fix refill fail bug, but a new addiw fail bug appears

* WPU: temporarily turn off for the PR

* WPU: fix all bugs

* loadqueue: fix the initialization of replayCarry

* bankeddataarray: fix the bug

* DCacheWrapper: fix bug

* ready-to-run: correct the version

* WayPredictor: comments clean

* BankedDataArray: fix ecc_bank bug

* Parameter: set the enable signal of wpu

144422dc

03 Jan, 2023 1 commit
- PTW: Fix bug when resp valid but not fire (#1871) · 2a906a65
  Committed by Haoyuan Feng on Jan 03, 2023
  
  2a906a65
02 Jan, 2023 3 commits
- Switch to asynchronous reset for all modules (#1867) · 67ba96b4
  Committed by Yinan Xu on Jan 02, 2023
```
This commit changes the reset of all modules to asynchronous style,
including changes to the initialization values of some registers.
Asynchronously reset registers must have constant reset values.
```
  67ba96b4
- Bump difftest to fix resource leak problem (#1866) · 01a51437
  Committed by Yinan Xu on Jan 02, 2023
  
  01a51437
- PTW: Fix mem_addr_update when sfence (#1868) · d826bce1
  Committed by Haoyuan Feng on Jan 02, 2023
```
* PTW: Fix a bug when sfence

* PTW: Fix mem_addr_update when sfence
```
  d826bce1
28 Dec, 2022 1 commit

lq: Remove LQ data (#1862) · 683c1411

Committed by happy-lx on Dec 28, 2022

This PR removes the data from the lq.

All cache-miss load instructions are replayed by the lq, and a forward path for the D channel
and mshr is added to the pipeline.
Uncache loads get special treatment: their data is no longer stored in the datamodule
but in a separate register. ldout is used only for uncache writeback, and only ldout0
is used. The priority is adjusted so that the replayed instruction has the highest priority in S0.

Future work:
1. fix `milc` perf loss
2. remove data from MSHRs

* difftest: monitor cache miss latency

* lq, ldu, dcache: remove lq's data

* lq's data is no longer used
* replay cache miss load from lq (use counter to delay)
* if dcache's mshr gets refill data, wake up lq's missed load
* uncache load will writeback to ldu using ldout_0
* ldout_1 is no longer used

* lq, ldu: add forward port

* forward D and mshr in load S1, get result in S2
* remove useless code logic in loadQueueData

* misc: revert monitor

683c1411

25 Dec, 2022 1 commit

Separate Utility submodule from XiangShan (#1861) · 3c02ee8f

Committed by wakafa on Dec 25, 2022

* misc: add utility submodule

* misc: adjust to new utility framework

* bump utility: revert resetgen

* bump huancun

3c02ee8f

21 Dec, 2022 2 commits
- MMU: Add ChiselDB and Fake PTW (#1858) · 5afdf73c
  Committed by Haoyuan Feng on Dec 21, 2022
```
* L2TLB: Fix a bug of Prefetcher

* MMU: Add ChiselDB

* MMU: Add Fake PTW

* MMU: Fix ChiselDB for dual core
```
  5afdf73c
- l2tlb: fix bug that sfence fail to flush global sp entries (#1859) · 42a7f20f
  Committed by bugGenerator on Dec 21, 2022
  
  42a7f20f
15 Dec, 2022 1 commit

modified ptw and keep performance from dropping (#1835) · 44b79566

Committed by Xiaokun-Pei on Dec 15, 2022

* modified ptw and keep performance from dropping

* fixed a bug in ptw

* fixed the bug in ptw

* fixed ptw: the bug where emu goes wrong at the third cycle and the bug caused by sfence in the MC test

44b79566

11 Dec, 2022 1 commit
- vlsu: define vlsu io (#1853) · cea88ff8
  Committed by William Wang on Dec 11, 2022
  
  cea88ff8
08 Dec, 2022 1 commit

ldu: add st-ld violation re-execute (#1849) · 16c3b0b7

Committed by sfencevma on Dec 08, 2022

* lsu: add st-ld violation re-execute

* misc: update vio check comments in LQ

Co-authored-by: Lyn <lyn@Lyns-MacBook-Pro.local>
Co-authored-by: William Wang <zeweiwang@outlook.com>

16c3b0b7

07 Dec, 2022 1 commit

Uncache: optimize write operation (#1844) · 37225120

Committed by sfencevma on Dec 07, 2022

This commit adds an uncache write buffer to accelerate uncache writes.

For an uncacheable address range, the atomic bit in the PMA now indicates
that uncache writes in this range should not use the uncache write buffer.

Note that XiangShan does not support atomic insts in uncacheable address ranges.
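
The rule described above can be sketched in Python as a small decision function. This is only an illustration under stated assumptions: the `PmaEntry` layout, the address ranges, and the function names are all hypothetical and are not taken from XiangShan's PMA configuration.

```python
from collections import namedtuple

# Hypothetical PMA entries; the ranges and attribute values are made up.
PmaEntry = namedtuple("PmaEntry", ["base", "size", "cacheable", "atomic"])

PMA_TABLE = [
    PmaEntry(0x10000000, 0x10000000, cacheable=False, atomic=False),
    PmaEntry(0x38000000, 0x00010000, cacheable=False, atomic=True),
    PmaEntry(0x80000000, 0x80000000, cacheable=True, atomic=True),
]


def lookup(addr):
    """Return the PMA entry covering addr in this sketch's table."""
    for entry in PMA_TABLE:
        if entry.base <= addr < entry.base + entry.size:
            return entry
    raise ValueError("address not covered by the sketch's PMA table")


def uses_uncache_write_buffer(addr):
    """Uncache writes are buffered unless the range's atomic bit is set."""
    entry = lookup(addr)
    return (not entry.cacheable) and (not entry.atomic)
```

In this model the atomic bit acts purely as an opt-out marker for uncacheable ranges; cacheable ranges never reach the uncache path at all.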

* uncache: optimize write operation

* pma: add atomic config

* uncache: assign hartId

* remove some pma atomic

* extend peripheral id width

Co-authored-by: Lyn <lyn@Lyns-MacBook-Pro.local>

37225120

05 Dec, 2022 1 commit
- ROB, difftest: add robidx support (#1845) · b211808b
  Committed by happy-lx on Dec 05, 2022
```
* bump difftest and wire extra signals (robidx, lqidx, sqidx etc)
from ROB to difftest
```
  b211808b
02 Dec, 2022 2 commits

Replay all load instructions from LQ (#1838) · a760aeb0

Committed by happy-lx on Dec 02, 2022

This intermediate architecture replays all load instructions from LQ.
An independent load replay queue will be added later.

Performance loss caused by changing of load replay sequences will be
analyzed in the future.

* memblock: load queue based replay

* replay load from load queue rather than RS
* use counters to delay replay logic

* memblock: refactor priority

* lsq-replay has higher priority than try pointchasing

* RS: remove load store rs's feedback port

* ld-replay: a new path for fast replay

* when fast replay is needed, wire it to the loadqueue; it will be selected
this cycle and replayed to load pipeline s0 in the next cycle

* memblock: refactor load S0

* move all the select logic from lsq to load S0
* split a tlbReplayDelayCycleCtrl out of loadqueue to speed up
generating emu

* loadqueue: parameterize replay
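
The "use counters to delay replay logic" idea from the list above can be modeled with a short Python sketch. It assumes each entry waiting for replay carries a countdown and that the oldest ready entry is picked each cycle; the class, the selection policy, and the delay values are hypothetical and much simpler than the real load queue.

```python
class ReplayEntry:
    def __init__(self, idx, delay):
        self.idx = idx          # queue position; a lower idx means older here
        self.counter = delay    # cycles to wait before the entry may replay


def step(entries):
    """Advance one cycle: replay at most one ready entry, then age the rest."""
    ready = [e for e in entries if e.counter == 0]
    chosen = min(ready, key=lambda e: e.idx) if ready else None
    if chosen is not None:
        entries.remove(chosen)
    for e in entries:
        if e.counter > 0:
            e.counter -= 1
    return None if chosen is None else chosen.idx


# Three loads waiting for replay with different (made-up) delays.
entries = [ReplayEntry(0, 2), ReplayEntry(1, 0), ReplayEntry(2, 1)]
order = [step(entries) for _ in range(4)]
```

In this toy run the replay order follows the counters rather than program order, which is one way the replay sequence can differ from an RS-based scheme.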

a760aeb0

mmu: increase mmu timeout to 10000 (#1839) · 914b8455
Committed by Haoyuan Feng on Dec 02, 2022

914b8455

30 Nov, 2022 1 commit
- rob, mmu: fix bug of not specifying signal width (#1840) · f3034303
  Committed by Haoyuan Feng on Nov 30, 2022
```
Co-authored-by: Yinan Xu <xuyinan@ict.ac.cn>
```
  f3034303
22 Nov, 2022 1 commit
- Merge pull request #1831 from OpenXiangShan/nanhu-lsu-timing-to-master · 5da19fb3
  Committed by William Wang on Nov 22, 2022
```
Rebase nanhu lsu timing opt to master
```
  5da19fb3
21 Nov, 2022 1 commit
- ci: bump ready-to-run nemu · 688bb537
  Committed by William Wang on Nov 21, 2022
  
  688bb537
19 Nov, 2022 13 commits

lsu: fix nanhu cherry-pick conflict · 34ffc2fb
Committed by William Wang on Nov 05, 2022

34ffc2fb

atom: lr should raise load misalign exception · 8c343485
Committed by William Wang on Oct 31, 2022

8c343485

ci: add extra pmp test · b4edc553
Committed by William Wang on Oct 31, 2022

b4edc553

csr: medeleg write should have 0xb3ff mask · 5e4ec482

Committed by William Wang on Oct 29, 2022

According to the RISC-V manual, exception code 14 is reserved.

See https://github.com/OpenXiangShan/NEMU/commit/9800da6a5e660dae5411c9b303833bc84bc04db4
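
To make the mask concrete, the Python sketch below decodes `0xb3ff` into the exception codes whose delegation bits stay writable. The write model (a plain AND with the mask) and the helper names are assumptions for illustration, not XiangShan's CSR implementation.

```python
MEDELEG_MASK = 0xB3FF  # mask value from the commit title


def write_medeleg(wdata):
    """Sketch of a masked write: bits outside the mask read back as 0."""
    return wdata & MEDELEG_MASK


def delegable_codes(mask=MEDELEG_MASK, width=16):
    """List the exception codes whose delegation bit is writable."""
    return [code for code in range(width) if (mask >> code) & 1]


blocked = [code for code in range(16) if code not in delegable_codes()]
```

With this mask, `blocked` contains codes 10, 11 and 14, so a write that tries to set bit 14 leaves it at zero.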

5e4ec482

Fix atom inst pmp inplementation (#1813) · 0fedb24c

Committed by William Wang on Nov 19, 2022

* atom: fix atom inst storeAccessFault gen logic

* atom, pmp: atom access !r addr should raise SAF

* atom: lr should raise load access fault

0fedb24c

dcache: fix replace & probeAck TtoB perm problem (#1791) · b8f6ff86

Committed by William Wang on Sep 26, 2022

* chore: fix WBQEntryReleaseUpdate bundle naming

There is no real hardware change

* dcache: fix replace & probeAck TtoB perm problem

When dcache replaces a cacheline, it moves that cacheline's data to the
writeback queue and waits until the refill data comes. When the refill data
comes, it writes the dcache data array and updates the meta for that cacheline,
then wakes up the cacheline release req and writes the data to the l2 cache.

In the previous design, if a probe request comes before the real l1 to l2 release
req, it can be merged into the same writeback queue entry. The probe req
updates dcache meta in mainpipe s3 and is then merged into the writeback queue.
However, for a probe TtoB req, the following problem may happen:

1) a replace req waits for refill in writeback queue entry X
2) a probe TtoB req enters mainpipe s3 and sets the cacheline coh to B
3) the probe TtoB req is merged into writeback queue entry X
4) writeback queue entry X is woken up and does probeack immediately (TtoN)
5) refill data for the replace req comes from l2; a refill req enters mainpipe
and updates dcache meta (sets the coh of the cacheline being replaced to N)

Between 4) and 5), l2 thinks that the l1 coh is N, but the l1 coh is actually B,
which is the problem.

Temp patch for nanhu:

Now all probe reqs do an extra check. If a req is a TtoB probe req and the
corresponding cacheline release req is already in the writeback queue, we set
dcache meta coh to N. As we do set block in dcache mainpipe, we can do
that check safely when the probe req is in mainpipe.
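
The five-step sequence and the temp patch can be replayed with a toy Python model. It tracks only two values, the coh stored in l1 meta and the coh that l2 believes l1 holds; the function name and the string encoding of the states are hypothetical, and no real TileLink or dcache behavior beyond the steps listed above is modeled.

```python
def run_scenario(patched):
    """Replay steps 1) to 5) and report whether l1 and l2 disagree in between."""
    l1_coh = "T"            # l1 meta for the cacheline being replaced
    l2_view = "T"           # what l2 believes l1 holds
    release_in_wbq = True   # 1) replace req waits in writeback queue entry X

    # 2) probe TtoB reaches mainpipe s3 and updates the meta
    if patched and release_in_wbq:
        l1_coh = "N"        # temp patch: release already queued, go to N
    else:
        l1_coh = "B"        # previous design: plain TtoB downgrade

    # 3) + 4) probe is merged into entry X, which sends ProbeAck TtoN
    l2_view = "N"
    mismatch = l1_coh != l2_view

    # 5) refill arrives and sets the replaced cacheline's coh to N
    l1_coh = "N"
    return mismatch, l1_coh, l2_view
```

Calling `run_scenario(False)` reports a mismatch in the window between steps 4) and 5), while `run_scenario(True)` does not; both end with l1 and l2 agreeing on N.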

b8f6ff86

dcache: optimize data sram read fanout (#1784) · a19ae480
Committed by William Wang on Sep 22, 2022

a19ae480

ldu: fix replay from fetch signal for missed load (#1780) · 4b7b4cc9

Committed by William Wang on Sep 12, 2022

When writing back a missed load, io.ldout.bits.uop.ctrl.replayInst
should not be overwritten by the load pipeline replay check result
`s3_need_replay_from_fetch`.

4b7b4cc9

dcache: do not use mp s2_ready to gen data_read.valid (#1756) · 774f100a
Committed by William Wang on Sep 03, 2022
```
* dcache: remove data read resp data_dup_0

* dcache: do not use mp s2_ready to gen data_read.valid
```
774f100a

MemBlock: add pipeline for reqs between lsq and uncache (#1760) · a86e4de7
Committed by zhanglinjuan on Sep 01, 2022

a86e4de7

ld,rs: optimize load-load forward timing (#1762) · 74fe3640
Committed by Yinan Xu on Sep 01, 2022
```
Move imm addition to stage 0.
```
74fe3640

ldu: remove dcache sram data from forwardData (#1754) · cc24c304

Committed by William Wang on Aug 31, 2022

forwardData for the load queue does not need data from dcache sram.
In this way, we remove the load queue data wdata fanin from all dcache
data srams.

cc24c304

Optimize buffers between L1 and L2 · 2fd089ae

Committed by Yinan Xu on Aug 30, 2022

* remove 2 buffers from l1i to l2
* add 1 buffer between l2 and xbar

Latency changes:
* L1D to L2: +1
* L1I to L2: -1
* PTW to L2: +1

2fd089ae

OpenXiangShan / XiangShan synced successfully 10 months ago
