提交 · a0db5a4b97ef9e25ea3e3ae7c5e68a06ff2b9f66 · OpenXiangShan / XiangShan

20 6月, 2022 1 次提交
- Y
  
  decode: parallel fusion decoder and rat read (#1588) · a0db5a4b
  由 Yinan Xu 提交于 6月 20, 2022
  
  a0db5a4b
18 6月, 2022 1 次提交

decode: do not set lsrc of LUI for better timing (#1586) · a19215dd

由 Yinan Xu 提交于 6月 18, 2022

This commit changes the lsrc/psrc of LUI in dispatch instead of
decode to optimize the timing of lsrc in DecodeStage, which is
critical for rename table.

lsrc/ldest should be directly get from instr for the timing. Fused
instructions change lsrc/ldest now, which will be optimized later.

a19215dd

31 5月, 2022 1 次提交

fix for chipsalliance/rocket-chip#2967 (#1562) · 361e6d51

由 Jiuyang Liu 提交于 5月 31, 2022

* fix for chipsalliance/rocket-chip#2967

* decode: fix width of BitPat(?) in decode logic
Co-authored-by: NYinan Xu <xuyinan@ict.ac.cn>

361e6d51

26 5月, 2022 1 次提交
- J
  
  fix for chipsalliance/chisel3#2496 (#1563) · 005e809b
  由 Jiuyang Liu 提交于 5月 26, 2022
  
  005e809b
11 5月, 2022 2 次提交

Fix vcs simulation support, support manually set ram_size (#1551) · 25ac26c6

由 William Wang 提交于 5月 11, 2022

* difftest: disable runahead to make vcs happy

* difftest: bump huancun to make vcs happy

* difftest: bump difftest and ready-to-run

* difftest support ramsize and paddr base config
* 8GB/16GB nemu so are provided by ready-to-run

* ci: update nightly ci, manually set ram_size

* difftest: bump huancun to make vcs happy

* difftest,nemu: support run-time assign mem size

* ci: polish nightly ci script

25ac26c6

rob: don't set hasWFI if there're exceptions (#1550) · d2df63c3

由 Yinan Xu 提交于 5月 11, 2022

An instruction with exceptions may have arbitrary instr values and
may be decoded into WFI instructions, which cause errors.

d2df63c3

09 5月, 2022 1 次提交
- L
  
  CSR: Fix WFI to support debug interrupts (#1547) · 4ede3fe2
  由 Li Qianruo 提交于 5月 09, 2022
  
  4ede3fe2
06 5月, 2022 1 次提交

feat: parameterize load store (#1527) · 46f74b57

由 Haojin Tang 提交于 5月 06, 2022

* feat: parameterize load/store pipeline, etc.

* fix: use LoadPipelineWidth rather than LoadQueueSize

* fix: parameterize `rdataPtrExtNext`

* SBuffer: fix idx update logic

* atomic: parameterize atomic logic in `MemBlock`

* StoreQueue: update allow enque requirement

* feat: support one load/store pipeline

* feat: parameterize `EnsbufferWidth`

* chore: resharp codes for better generated name

46f74b57

05 5月, 2022 1 次提交

csr: init status.fs to 01 · 80dd83d8

由 Yinan Xu 提交于 3月 27, 2022

XiangShan does not support fs=0 because when fs=0, all floating-point
states are not accessible. Spike supports fs=0. To diff with Spike,
we temporarily set fs to 1 when initialized.

80dd83d8

04 5月, 2022 2 次提交

Y

csr: check WFI and other illegal instructions · 5d669833
由 Yinan Xu 提交于 5月 04, 2022

5d669833

rob: WFI depends on mip&mie only · 5c95ea2e

由 Yinan Xu 提交于 5月 04, 2022

This commit fixes the implementation of WFI. The WFI instruction
waits in the ROB until an interrupt might need servicing.

According to the RISC-V manual, the WFI must be unaffected by the
global interrupt bits in `mstatus` and the delegation register
`mideleg`.

5c95ea2e

29 4月, 2022 1 次提交
- Y
  
  difftest: add support for the WFI instruction · f37600a6
  由 Yinan Xu 提交于 4月 29, 2022
  
  f37600a6
28 4月, 2022 1 次提交

core,rob: support the WFI instruction · b6900d94

由 Yinan Xu 提交于 4月 28, 2022

The RISC-V WFI instruction is previously decoded as NOP. This commit
adds support for the real wait-for-interrupt (WFI).

We add a state_wfi FSM in the ROB. After WFI leaves the ROB, the next
instruction will wait in the ROB until an interrupt.

b6900d94

25 4月, 2022 2 次提交

Fix a bug in dual-core difftest (#1538) · 4d5d2702

由 wakafa 提交于 4月 25, 2022

* difftest: fix false-positive difftest intRF writeback, adapt to new difftest API

* csr: skip mip difftest

* bump difftest

* bump difftest

4d5d2702

C
fix some typos (#1537) · 1c746d3a
由 cui fliter 提交于 4月 25, 2022
```
* fix some typos
Signed-off-by: Ncuishuang <imcusg@gmail.com>
```
1c746d3a

02 4月, 2022 1 次提交

mem: reduce refill to use latency (#1401) · 09203307

由 William Wang 提交于 4月 02, 2022

* mem: optimize missq reject to lq timing

DCache replay request is quite slow to generate, as it need to compare
load address with address in all valid miss queue entries.

Now we delay the usage of replay request from data cache.
Now replay request will not influence normal execuation flow until
load_s3 (1 cycle after load_s2, load result writeback to RS).

It is worth mentioning that "select refilling inst for load
writeback" will be disabled if dcacheRequireReplay in the
last cycle.

* dcache: compare probe block addr instead of full addr

* mem: do not replay from RS when ldld vio or fwd failed

ld-ld violation or forward failure will let an normal load inst replay
from fetch. If TLB hit and ld-ld violation / forward failure happens,
we write back that inst immediately. Meanwhile, such insts will not be
replayed from rs.

It should fix "mem: optimize missq reject to lq timing"

* mem: fix replay from rs condition

* mem: reduce refill to use latency

This commit update lq entry flag carefully in load_s3 to avoid extra
refill delay. It will remove the extra refill delay introduced by #1375
without harming memblock timing.

In #1375, we delayed load refill when dcache miss queue entry fails
to accept a miss. #1375 exchanges performance for better timing.

* mem: fix rs feedback priority

When dataInvalid && mshrFull, a succeed refill should not cancel
rs replay.

09203307

31 3月, 2022 1 次提交
- L
  
  Bump chisel to 3.5.0 · 9658ce50
  由 LinJiawei 提交于 3月 25, 2022
  
  9658ce50
24 2月, 2022 2 次提交
- Y
  
  std: delay fp regfile read for one cycle (#1473) · 783011be
  由 Yinan Xu 提交于 2月 24, 2022
  
  783011be
- Y
  
  busyTable: make a copy for store fp data (#1474) · 4d51b769
  由 Yinan Xu 提交于 2月 24, 2022
  
  4d51b769
14 2月, 2022 1 次提交
- S
  
  ctrl,ftq: move pc and target calculation in redirect generator to ftq (#1463) · 2e1be6e1
  由 Steve Gou 提交于 2月 14, 2022
  
  2e1be6e1
23 1月, 2022 2 次提交
- W
  
  csr: fix xret mode check (#1440) · cb8f1780
  由 William Wang 提交于 1月 23, 2022
  
  cb8f1780
- L
  
  pmp: fix bug of l locks cfg's modification (#1438) · ff1b5dbb
  由 Lemover 提交于 1月 23, 2022
  
  ff1b5dbb
14 1月, 2022 1 次提交
- W
  
  difftest: latch difftestloadevent signal (#1423) · 75c2f5ae
  由 wakafa 提交于 1月 14, 2022
  
  75c2f5ae
09 1月, 2022 1 次提交

rob: block commit when exceptions are valid (#1419) · 983f3e23

由 Yinan Xu 提交于 1月 09, 2022

This commit fixes the block_commit condition when an instruction has
exception but labeled flushPipe. Previously such an instruction will
commit normally.

983f3e23

07 1月, 2022 4 次提交
- W
  
  trigger: add addr trigger for atom insts · bbd4b852
  由 William Wang 提交于 1月 06, 2022
  
  bbd4b852
- L
  
  Fix ROB enq and writeback logic not considering trigger hits · 0e5209d0
  由 Li Qianruo 提交于 1月 06, 2022
  
  0e5209d0
- L
  Fix stepie · 052ee9a1
  由 Li Qianruo 提交于 1月 05, 2022
```
Previously the stepie bit won't take effect
```
  052ee9a1
- Y
  difftest: delay commit and regfile for two cycles (#1417) · bde9b502
  由 Yinan Xu 提交于 1月 07, 2022
```
CSRs are updated later after instructions commit from ROB. Thus, we
need to delay difftest commit for several cycles.
```
  bde9b502
05 1月, 2022 1 次提交

Debug mode: various bug fixes (#1412) · d7dd1af1

由 Li Qianruo 提交于 1月 05, 2022

* Reduce trigger hit wires that goes into exceptiongen
* Fix frontend triggers rewriting hit wire
* Retrieved some accidentally dropped changes in branch dm-debug (mainly fixes to debug mode)
* Fix dmode in tdata1
* Fix ebreaks not causing exception in debug mode
* Fix dcsr field bugs
* Fix faulty distributed tEnable
* Fix store triggers not using vaddr
* Fix store trigger rewriting hit vector
* Initialize distributed tdata registers in MemBlock and Frontend to zero
* Fix load trigger select bit in mcontrol
* Fix singlestep bit valid in debug mode
* Mask all interrupts in debug mode

d7dd1af1

01 1月, 2022 2 次提交

mem: split L1CacheErrorInfo and L1BusErrorUnitInfo, fix ecc error (#1409) · 0f59c834

由 William Wang 提交于 1月 01, 2022

* mem: fix error csr update

* dcache: l2 error will now trigger atom error

* chore: fix cache error debug decoder

* mem: split L1CacheErrorInfo and L1BusErrorUnitInfo

0f59c834

Fix marchid value for hart CSR configuration (#1411) · e1b773ea

由 Luo Jia 提交于 1月 01, 2022

XiangShan has registered an marchid of 25: https://github.com/riscv/riscv-isa-manual/blob/master/marchid.md .
This value should be returned from CSR `marchid`.

e1b773ea

30 12月, 2021 1 次提交
- R
  
  add reset value of distribute trigger csrs at memory and frontend block. · 27802204
  由 rvcoresjw 提交于 12月 30, 2021
  
  27802204
29 12月, 2021 3 次提交

J
ICache: add parity check enable and prefetch enable control registers (#1406) · ecccf78f
由 Jay 提交于 12月 29, 2021
```
* Add Prefetch and Parity enable register for ICache

* Add ICache parity enable control for pipe
```
ecccf78f
L

csr: add one/two cycle for signals customCtrl/tlb/csrUpdate (#1405) · c7f0997b
由 Lemover 提交于 12月 29, 2021

c7f0997b

dispatch: block enq when previous instructions have exception (#1400) · 3a6db8a3

由 Yinan Xu 提交于 12月 29, 2021

This commit adds blocking logic for instructions when they enter
dispatch queues. If previous instructions have exceptions, any
following instructions should be enter dispatch queue.

Consider the following case. If uop(0) has an exception and is a load.
If uop(1) does not have an exception and is a load as well. Then the
allocation logic in dispatch queue will allocate an entry for both
uop(0) and uop(1). However, uop(0) will not set enq.valid and leave
the entry in dispatch queue empty. uop(1) will be allocated in dpq.
In dispatch queue, pointers are updated according to the real number
of instruction enqueue, which is one. While the second is actually
allocated. This causes errors.

3a6db8a3

28 12月, 2021 1 次提交

mem: refactor l1 error implementation (#1391) · 9ef181f4

由 William Wang 提交于 12月 28, 2021

* dcache: add source info in L1CacheErrorInfo

* ICache: fix valid signal and add source/opType

* dcache: fix bug in ecc error

* mem,csr: send full L1CacheErrorInfo to CSR

* icache: provide cache error info for CSR

* dcache: force resp hit if tag ecc error happens

* mem: reorg l1 cache error report path

Now dcache tag error will force trigger a hit

* dcache: fix readline ecc check error

* dcache: mainpipe will not be influenced by tag error

* dcache: fix data ecc check error

* dcache: if coh state is Nothing, do not raise error
Co-authored-by: Nzhanglinjuan <zhanglinjuan20s@ict.ac.cn>
Co-authored-by: NJinYue <jinyue20s@ict.ac.cn>

9ef181f4

26 12月, 2021 1 次提交
- Y
  atomic: fix exception valid after #1392 (#1395) · 207ef628
  由 Yinan Xu 提交于 12月 26, 2021
```
Valid should be set to true after atomic.exception.valid and cleared
after redirect is valid.
```
  207ef628
24 12月, 2021 1 次提交

atomics: delay exception.valid for more cycles (#1392) · 231d3399

由 Yinan Xu 提交于 12月 24, 2021

Exception address is used serveral cycles after flush. We delay it
by more cycles to ensure its flush safety.

231d3399

22 12月, 2021 1 次提交

mem: optimize missq reject to lq timing (#1375) · 6b6d88e6

由 William Wang 提交于 12月 22, 2021

* mem: optimize missq reject to lq timing

DCache replay request is quite slow to generate, as it need to compare
load address with address in all valid miss queue entries.

Now we delay the usage of replay request from data cache.
Now replay request will not influence normal execution flow until
load_s3 (1 cycle after load_s2, load result writeback to RS).

Note1: It is worth mentioning that "select refilling inst for load
writeback" will be disabled if dcacheRequireReplay in the
last cycle.

Note2: ld-ld violation or forward failure will let an normal load inst replay
from fetch. If TLB hit and ld-ld violation / forward failure happens,
we write back that inst immediately. Meanwhile, such insts will not be
replayed from rs.

* dcache: compare probe block addr instead of full addr

6b6d88e6

21 12月, 2021 1 次提交

lsq: add LsqEnqCtrl to optimize enqueue timing (#1380) · 10551d4e

由 Yinan Xu 提交于 12月 21, 2021

This commit adds an LsqEnqCtrl module to add one more clock cycle
between dispatch and load/store queue.

LsqEnqCtrl maintains the lqEnqPtr/sqEnqPtr and lqCounter/sqCounter.
They are used to determine whether load/store queue can accept new
instructions. After that, instructions are sent to load/store queue.
This module decouples queue allocation and real enqueue.

Besides, uop storage in load/store queue are optimized. In dispatch,
only robIdx is required. Other information is naturally conveyed in
the pipeline and can be stored later in load/store queue if needed.
For example, exception vector, trigger, ftqIdx, pdest, etc are
unnecessary before the instruction leaves the load/store pipeline.

10551d4e

OpenXiangShan / XiangShan 9 个月 前同步成功

OpenXiangShan / XiangShan
9 个月前同步成功