Commits · a0db5a4b97ef9e25ea3e3ae7c5e68a06ff2b9f66 · OpenXiangShan / XiangShan

20 Jun, 2022 · 2 commits
- decode: parallel fusion decoder and rat read (#1588) · a0db5a4b
  Committed by Yinan Xu on Jun 20, 2022
  
  a0db5a4b
- ssit: pipeline update logic to reduce fanin (#1583) · 2f0b133c
  Committed by William Wang on Jun 20, 2022
  
  2f0b133c
18 Jun, 2022 · 2 commits

decode: do not set lsrc of LUI for better timing (#1586) · a19215dd

Committed by Yinan Xu on Jun 18, 2022

This commit changes the lsrc/psrc of LUI in dispatch instead of
decode to optimize the timing of lsrc in DecodeStage, which is
critical for the rename table.

For timing, lsrc/ldest should be taken directly from instr. Fused
instructions still change lsrc/ldest; this will be optimized later.

a19215dd

perfcnt: keep strict regularity of perf counter name (#1585) · d18dc7e6

Committed by wakafa on Jun 18, 2022

* buspmu: avoid inner space in perf-cnt name

* perfcnt: judge regularity of perfname

* perfcnt: fix some irregular perfname

* bump huancun
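
A name check of this kind could be sketched as follows. This is only an illustration: the exact rule enforced by the commit is not stated here, so the pattern (letters, digits and underscores only) and the function names are assumptions.

```python
import re

# Hypothetical rule: only letters, digits and underscores, so a name
# can never contain an inner space. The real rule may differ.
PERF_NAME_PATTERN = re.compile(r"[A-Za-z0-9_]+")


def is_regular_perf_name(name: str) -> bool:
    """Return True if the whole name matches the assumed pattern."""
    return PERF_NAME_PATTERN.fullmatch(name) is not None


def irregular_names(names):
    """Collect the names that would be rejected by the check."""
    return [n for n in names if not is_regular_perf_name(n)]
```

Running `irregular_names` over all registered counter names would list the ones that need renaming.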

d18dc7e6

17 Jun, 2022 · 1 commit
- l2tlb: fix ecc width (#1584) · 5197bac8
  Committed by Ziyue-Zhang on Jun 17, 2022
  
  5197bac8
11 Jun, 2022 · 1 commit
- ICache: fix DataArray non-ecc width (#1579) · e5f1252b
  Committed by Guokai Chen on Jun 11, 2022
  
  e5f1252b
09 Jun, 2022 · 1 commit
- ftq: should use jmpOffset instead of cfiIndex when assigning (#1561) · ae409b75
  Committed by Steve Gou on Jun 09, 2022
```
last_may_be_rvi_call in case that a call comes after a taken branch
```
  ae409b75
06 Jun, 2022 · 3 commits

discard iprefetch req when resource busy · e8747464
Committed by Jenius on Jun 06, 2022

e8747464

delete 500 cycle wait · 19d62fa1

Committed by Jenius on Jun 06, 2022

* add SRAM ready (resetfinish) condition for *Array (metaArray/dataArray) req.ready

19d62fa1

fix bugs in IFU and delete 500-cycle ready · 625ecd17

Committed by Jenius on Jun 06, 2022

* fix mmio_resend_af wrong assignment
* fix wb_half_flush missOffset (using wb_lastIdx instead of PredictWidth - 1)
* change pipeline ready condition (this_ready = this_stage_fire || this_stage_empty)
* delete 500-cycle ready condition (toICache(*).ready means the SRAM has been reset and is ready for read)
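
The ready condition from the list above can be modelled in a few lines. This is a behavioural sketch only, not the IFU code: the function names, the list-of-valid-bits representation and the assumption that a stage fires when it is valid and its successor is ready are all illustrative.

```python
def stage_ready(valid: bool, fire: bool) -> bool:
    """this_ready = this_stage_fire || this_stage_empty."""
    empty = not valid
    return fire or empty


def ready_chain(valids, sink_ready: bool):
    """Compute ready for each stage, from the last stage back to the first.

    Assumption: a stage fires when it holds valid data and the next
    stage (or the sink, for the last stage) is ready.
    """
    ready = [False] * len(valids)
    next_ready = sink_ready
    for i in reversed(range(len(valids))):
        fire = valids[i] and next_ready
        ready[i] = stage_ready(valids[i], fire)
        next_ready = ready[i]
    return ready
```

With this condition an empty stage always accepts new data, so a bubble in the middle of a stalled pipeline lets the stages before it advance.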

625ecd17

02 Jun, 2022 · 1 commit

ittage: we should write new target when alloc · 3b7c55f8

Committed by Lingrui98 on Jun 02, 2022

Previous logic checked the value of old_ctr to select between old target and
new target when updating ittage table. However, when we need to alloc a new
entry, the value of old_ctr is X because we do not reset ittage table. So we
would definitely write an X to the target field, which is the output of the
mux, as the selector is X.
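
The effect can be reproduced with a tiny model of an unknown (X) value in simulation. This is a sketch under assumptions: the selection condition `old_ctr == 0`, the `alloc` flag and all function names are hypothetical stand-ins, since the actual ittage update logic is not shown here.

```python
X = "X"  # stands for an unknown value, as in 4-state simulation


def mux(sel, a, b):
    """Return a when sel is 1 and b when sel is 0.

    With an unknown selector the result is unknown unless both inputs agree.
    """
    if sel == X:
        return a if a == b else X
    return a if sel == 1 else b


def old_update(old_ctr, old_target, new_target):
    """Previous behaviour as described: the selector depends only on old_ctr."""
    sel = X if old_ctr == X else int(old_ctr == 0)  # hypothetical condition
    return mux(sel, new_target, old_target)


def fixed_update(alloc, old_ctr, old_target, new_target):
    """On allocation, write the new target without consulting old_ctr."""
    if alloc:
        return new_target
    return old_update(old_ctr, old_target, new_target)
```

For a freshly allocated entry, `old_update(X, X, 0x80001000)` yields `X`, while `fixed_update(True, X, X, 0x80001000)` yields the new target.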

3b7c55f8

31 May, 2022 · 1 commit

fix for chipsalliance/rocket-chip#2967 (#1562) · 361e6d51

Committed by Jiuyang Liu on May 31, 2022

* fix for chipsalliance/rocket-chip#2967

* decode: fix width of BitPat(?) in decode logic

Co-authored-by: Yinan Xu <xuyinan@ict.ac.cn>

361e6d51

29 May, 2022 · 1 commit
- <bug-fix>: fix f3 mmio write back override bug (#1567) · bccc5520
  Committed by Jenius on May 29, 2022
  
  bccc5520
26 May, 2022 · 1 commit
- fix for chipsalliance/chisel3#2496 (#1563) · 005e809b
  Committed by Jiuyang Liu on May 26, 2022
  
  005e809b
25 May, 2022 · 1 commit
- ubtb: fix write waymask of fallThruPred · 9f956ac4
  Committed by Lingrui98 on May 25, 2022
  
  9f956ac4
11 May, 2022 · 2 commits

Fix vcs simulation support, support manually set ram_size (#1551) · 25ac26c6

Committed by William Wang on May 11, 2022

* difftest: disable runahead to make vcs happy

* difftest: bump huancun to make vcs happy

* difftest: bump difftest and ready-to-run

* difftest support ramsize and paddr base config
* 8GB/16GB nemu so are provided by ready-to-run

* ci: update nightly ci, manually set ram_size

* difftest: bump huancun to make vcs happy

* difftest,nemu: support run-time assign mem size

* ci: polish nightly ci script

25ac26c6

rob: don't set hasWFI if there're exceptions (#1550) · d2df63c3

Committed by Yinan Xu on May 11, 2022

An instruction with exceptions may have arbitrary instr values and
may be decoded as a WFI instruction, which causes errors.
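
A minimal sketch of the gating described above, assuming a flag that marks an instruction as carrying an exception. The function names are hypothetical; only the WFI encoding (`0x10500073`) comes from the RISC-V privileged specification.

```python
WFI_ENCODING = 0x10500073  # RISC-V encoding of the WFI instruction


def is_wfi(instr: int) -> bool:
    """Decode check on the raw instruction bits."""
    return instr == WFI_ENCODING


def has_wfi(instr: int, has_exception: bool) -> bool:
    """Only treat the entry as WFI when it carries no exception.

    The instr bits of an excepting instruction may be arbitrary, so they
    must not be trusted to identify a WFI.
    """
    return is_wfi(instr) and not has_exception
```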

d2df63c3

09 May, 2022 · 3 commits
- CSR: Fix WFI to support debug interrupts (#1547) · 4ede3fe2
  Committed by Li Qianruo on May 09, 2022
  
  4ede3fe2
- ICache: add difftest-Refill test (#1548) · 41cb8b61
  Committed by Jenius on May 09, 2022
  
  41cb8b61
- fix bugs of tage-sc (#1533) · e82f7653
  Committed by Steve Gou on May 09, 2022
```
* sc: fix a performance bug

* tage: fix number of use-alt-on-na counters

* tage: update provider u-bit according to provider results
```
  e82f7653
07 May, 2022 · 1 commit
- pass reset vector from SimTop (#1545) · c4b44470
  Committed by Guokai Chen on May 07, 2022
  
  c4b44470
06 May, 2022 · 2 commits

feat: parameterize load store (#1527) · 46f74b57

Committed by Haojin Tang on May 06, 2022

* feat: parameterize load/store pipeline, etc.

* fix: use LoadPipelineWidth rather than LoadQueueSize

* fix: parameterize `rdataPtrExtNext`

* SBuffer: fix idx update logic

* atomic: parameterize atomic logic in `MemBlock`

* StoreQueue: update allow enque requirement

* feat: support one load/store pipeline

* feat: parameterize `EnsbufferWidth`

* chore: resharp codes for better generated name

46f74b57

chore: remove sc too many fail assertion (#1514) · 5d6ad649
Committed by William Wang on May 06, 2022
```
* chore: remove sc too many fail assertion

* chore: use XSWarn()
```
5d6ad649

05 May, 2022 · 2 commits

assert: fix dcache mp s1_way_en assertion (#1530) · 7459e344
Committed by William Wang on May 05, 2022
```
s1_tag_match_way is valid iff tag_read.valid and meta_read.valid in s0
for the same req
```
7459e344

csr: init status.fs to 01 · 80dd83d8

Committed by Yinan Xu on Mar 27, 2022

XiangShan does not support fs=0 because when fs=0, no floating-point
state is accessible. Spike supports fs=0. To diff with Spike,
we temporarily set fs to 1 at initialization.
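
For reference, FS is the two-bit field at bits 14:13 of `mstatus`, where 0 means Off and 1 means Initial. A small sketch of setting that field follows; the helper names are made up, and a base value of 0 for the remaining `mstatus` bits is an assumption used only for illustration.

```python
FS_SHIFT = 13                 # mstatus.FS occupies bits 14:13
FS_MASK = 0b11 << FS_SHIFT


def set_fs(mstatus: int, fs: int) -> int:
    """Return mstatus with the FS field replaced by fs (0..3)."""
    return (mstatus & ~FS_MASK) | ((fs & 0b11) << FS_SHIFT)


def get_fs(mstatus: int) -> int:
    """Extract the FS field."""
    return (mstatus >> FS_SHIFT) & 0b11


# Assumed reset value: everything else zero, FS = 01 (Initial).
RESET_MSTATUS = set_fs(0, 0b01)
```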

80dd83d8

04 May, 2022 · 2 commits

csr: check WFI and other illegal instructions · 5d669833
Committed by Yinan Xu on May 04, 2022

5d669833

rob: WFI depends on mip&mie only · 5c95ea2e

Committed by Yinan Xu on May 04, 2022

This commit fixes the implementation of WFI. The WFI instruction
waits in the ROB until an interrupt might need servicing.

According to the RISC-V manual, the WFI must be unaffected by the
global interrupt bits in `mstatus` and the delegation register
`mideleg`.
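
The wake-up condition described above reduces to a single bitwise test. The sketch below uses a made-up function name and deliberately takes neither `mstatus` nor `mideleg` as input; the bit positions of MSIP, MTIP and MEIP follow the RISC-V privileged specification.

```python
MSIP = 1 << 3    # machine software interrupt bit
MTIP = 1 << 7    # machine timer interrupt bit
MEIP = 1 << 11   # machine external interrupt bit


def wfi_should_wake(mip: int, mie: int) -> bool:
    """WFI resumes when any interrupt is both pending and locally enabled.

    mstatus (global enable) and mideleg (delegation) are intentionally
    not parameters: they must not affect the result.
    """
    return (mip & mie) != 0
```

For example, a pending timer interrupt wakes the core only if the timer bit is also set in `mie`, whatever the global enable bit says.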

5c95ea2e

29 Apr, 2022 · 1 commit
- difftest: add support for the WFI instruction · f37600a6
  Committed by Yinan Xu on Apr 29, 2022
  
  f37600a6
28 Apr, 2022 · 1 commit

core,rob: support the WFI instruction · b6900d94

Committed by Yinan Xu on Apr 28, 2022

The RISC-V WFI instruction was previously decoded as NOP. This commit
adds support for the real wait-for-interrupt (WFI).

We add a state_wfi FSM in the ROB. After WFI leaves the ROB, the next
instruction will wait in the ROB until an interrupt.
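
The behaviour can be illustrated with a two-state model. This is a rough sketch, not the ROB implementation: the class, its methods and the single boolean standing in for state_wfi are assumptions made for the example.

```python
class RobWfiModel:
    """Toy model: commits block after a WFI until an interrupt arrives."""

    def __init__(self):
        self.in_wfi = False  # stands in for the state_wfi FSM

    def commit(self, is_wfi: bool) -> bool:
        """Try to commit the head instruction; return True on success."""
        if self.in_wfi:
            return False      # the next instruction waits in the ROB
        if is_wfi:
            self.in_wfi = True  # WFI itself leaves, then the FSM waits
        return True

    def tick(self, interrupt_pending: bool) -> None:
        """Leave the wait state once an interrupt shows up."""
        if interrupt_pending:
            self.in_wfi = False
```

A short run: committing a WFI succeeds, the following commit attempts fail, and they succeed again after `tick(True)`.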

b6900d94

25 Apr, 2022 · 2 commits

Fix a bug in dual-core difftest (#1538) · 4d5d2702

Committed by wakafa on Apr 25, 2022

* difftest: fix false-positive difftest intRF writeback, adapt to new difftest API

* csr: skip mip difftest

* bump difftest

* bump difftest

4d5d2702

fix some typos (#1537) · 1c746d3a
Committed by cui fliter on Apr 25, 2022
```
* fix some typos
Signed-off-by: cuishuang <imcusg@gmail.com>
```
1c746d3a

14 Apr, 2022 · 1 commit

mmu.l2tlb: divide missqueue into 'missqueue' and llptw (#1522) · 92e3bfef

Committed by Lemover on Apr 14, 2022

Old missqueue: both a slot for cache-miss reqs and a memory accessor.
Problem: these two functions are entirely different, which makes mq hard to handle with a single select policy.
Solution: split these two functions into two modules.
  new MissQueue: only holds reqs that miss the page cache and need to re-req the cache; a simple flushable queue
  llptw: last-level ptw, only accesses ptes; a priorityMux queue

* mmu: rename PTW.scala to L2TLB.scala

* mmu: rename PTW to L2TLB

* mmu: rename PtwFsm to PTW

* mmu.l2tlb: divide missqueue into 'missqueue' and llptw

Old missqueue: both a slot for cache-miss reqs and a memory accessor.
Problem: these two functions are entirely different, which makes mq hard
  to handle with a single select policy.
Solution: split these two functions into two modules.
  new MissQueue: only holds reqs that miss the page cache and need to
  re-req the cache
  llptw: last-level ptw, only accesses ptes

* mmu.l2tlb: syntax bug that misses io assign

* mmu.l2tlb: fix bug that mistakes ptw's block signal

92e3bfef

02 Apr, 2022 · 1 commit

mem: reduce refill to use latency (#1401) · 09203307

Committed by William Wang on Apr 02, 2022

* mem: optimize missq reject to lq timing

The DCache replay request is quite slow to generate, as it needs to compare
the load address with the addresses in all valid miss queue entries.

Now we delay the use of the replay request from the data cache.
The replay request will not influence the normal execution flow until
load_s3 (1 cycle after load_s2, load result writeback to RS).

It is worth mentioning that "select refilling inst for load
writeback" will be disabled if dcacheRequireReplay was set in the
last cycle.

* dcache: compare probe block addr instead of full addr

* mem: do not replay from RS when ldld vio or fwd failed

A ld-ld violation or forward failure makes a normal load inst replay
from fetch. If the TLB hits and a ld-ld violation / forward failure happens,
we write back that inst immediately. Meanwhile, such insts will not be
replayed from rs.

It should fix "mem: optimize missq reject to lq timing"

* mem: fix replay from rs condition

* mem: reduce refill to use latency

This commit carefully updates the lq entry flags in load_s3 to avoid extra
refill delay. It removes the extra refill delay introduced by #1375
without harming memblock timing.

In #1375, we delayed load refill when dcache miss queue entry fails
to accept a miss. #1375 exchanges performance for better timing.

* mem: fix rs feedback priority

When dataInvalid && mshrFull, a successful refill should not cancel
rs replay.

09203307

01 Apr, 2022 · 1 commit

l2tlb.cache: store invalid entries(only super entries) into sp to avoid mem access waste (#1518) · 8d8ac704

Committed by Lemover on Apr 01, 2022

Corner case that makes l2tlb performance drop sharply:
The core may issue mis-speculated memory accesses, which can cause tlb misses and ptw reqs to l2tlb.
In l2tlb, these reqs may still miss, and may even hit invalid ptes, which are not stored in l2tlb.cache.
If the relevant ptes are invalid, these reqs are held by the miss queue and wait for the page walker to perform
page table walks one by one. This is too slow and raises the timeout assert in l2tlb.missqueue.

Solution:
Store invalid entries (super entries only) into sp.
The bad news is that sp only has 16 entries, so invalid entries will pollute sp as well.
The good news is that the invalid reqs are always in the same super page, so a single entry is mostly enough.

* l2tlb.cache: sp entries now handles invalid entries

* l2tlb.cache: fix syntax error, forgot to assign some signals

8d8ac704

31 Mar, 2022 · 1 commit
- Bump chisel to 3.5.0 · 9658ce50
  Committed by LinJiawei on Mar 25, 2022
  
  9658ce50
30 Mar, 2022 · 1 commit
- sram-tlb: change SRAMTemplate & when tlb refill, just resp a miss/fast_miss (#1504) · 70083794
  Committed by Lemover on Mar 30, 2022
```
* bump huancun

* sram: fix sram, keep rdata when w.valid

* tlb: when refill, just return miss at next cycle, rm unused assert
```
  70083794
28 Mar, 2022 · 1 commit
- IPrefetch: fix address align width of p0_vaddr (#1508) · d6b06a99
  Committed by Jay on Mar 28, 2022
```
iprefetch uses vaddr instead of paddr.
```
  d6b06a99
27 Mar, 2022 · 1 commit
- sq: fix use of OHToUInt (#1505) · e41db104
  Committed by happy-lx on Mar 27, 2022
  
  e41db104
23 Mar, 2022 · 2 commits
- IFU <bug-fix>: deal with itlb miss for resend (#1488) · c3b2d83a
  Committed by Jay on Mar 23, 2022
```
* IFU <bug-fix>: deal with itlb miss for resend

* IFU <bug fix>: enable crossPageFault for resend-pf
Co-authored-by: DeltaZero <lacrosseelis@gmail.com>
```
  c3b2d83a
- Fix typo (#1480) · 91e3488a
  Committed by Leway Colin on Mar 23, 2022
  
  91e3488a

OpenXiangShan / XiangShan · synced successfully 9 months ago
