提交 · df5b4b8e4c8c57d66a4b3cfef025a03fdc4199d0 · OpenXiangShan / XiangShan

18 12月, 2021 1 次提交
- Y
  
  csr: optimize exception and trapTarget timing (#1372) · df5b4b8e
  由 Yinan Xu 提交于 12月 18, 2021
  
  df5b4b8e
17 12月, 2021 3 次提交

pmp: add static pmp check that stored in tlb entries (#1366) · 5b7ef044

由 Lemover 提交于 12月 17, 2021

* memblock: regnext ptw's resp

* pmp: timing optimization from tlb.sram.ppn to pmp, add static pmp check

long latency: tlb's sram may be slow to gen ppn, ppn to pmp may be
long latency.
Solution: add static pmp check.

Fatal problem: pmp grain is smalled than TLB pages(4KB, 2MB, 1GB)
Solution: increase pmp'grain to 4K, for 4K entries, pre-check pmp and
store the result into tlb storage. For super pages, still dynamic check
that translation and check.

* pmp: change pmp grain to 4KB, change pma relative init config

* bump ready-to-run, update nemu so for pmp grain

* bump ready-to-run, update nemu so for pmp grain again

  update pmp unit test. The old test assumes that pmp grain is less than 512bit.

5b7ef044

Y

csr: use zext pc when vm is disabled (#1361) · bd1f1bf3
由 Yinan Xu 提交于 12月 17, 2021

bd1f1bf3
J
Change default L3 size to 6MB (#1365) · 0fbed464
由 Jiawei Lin 提交于 12月 17, 2021
```
* Change L3 to 6MB

* Bump huancun
```
0fbed464

16 12月, 2021 3 次提交

Y

rename: check valid condition for lui (#1368) · 89c0fb0a
由 Yinan Xu 提交于 12月 16, 2021

89c0fb0a

dcache: remove redundant ecc array (#1358) · 77decb47

由 zhanglinjuan 提交于 12月 16, 2021

* dcache: fix bug in ecc check

* dcache: remove redundant ecc array

* CacheInstruction: fix typo

* dcache: fix bugs in cache instruction on ecc

* MetaArray: wrap ecc array as a single module

77decb47

Fix false hit bug after IFU timing optimization (#1367) · a1351e5d

由 Jay 提交于 12月 16, 2021

* fix invalidTakenFault use wrong seqTarget

* IFU: fix oversize bug

* ctrl: mark all flushes as level.flush for frontend

This commit changes how flushes behave for frontend.

When ROB commits an instruction with a flush, we notify the frontend
of the flush without the commit.

Flushes to frontend may be delayed by some cycles and commit before
flush causes errors. Thus, we make all flush reasons to behave the
same as exceptions for frontend, that is, RedirectLevel.flush.

* IFU: exclude lastTaken situation when judging beyond fetch
Co-authored-by: NYinan Xu <xuyinan@ict.ac.cn>

a1351e5d

15 12月, 2021 4 次提交

Debug Mode: support difftest with spike (#1363) · f1c56d6c

由 Li Qianruo 提交于 12月 15, 2021

* Debug Mode: support basic difftest with spike

* Debug Mode: fix some bugs

Bugs fixed are:
1. All interrupts and exceptions cause debug mode to enter park loop
2. Debug interrupt ignored due to flushPipe

f1c56d6c

W

mem: writeback atom exception from store wb port 0 (#1353) · 858c53d7
由 William Wang 提交于 12月 15, 2021

858c53d7
L
mmpma: fix mmpma's read/write decoupled logic (#1354) · cef5c4b4
由 Lemover 提交于 12月 15, 2021
```
* mmpma: fix read/write io decoupled logic

* pma: fix init pma config
```
cef5c4b4

rename: add fused lui and load (#1356) · fd7603d9

由 Yinan Xu 提交于 12月 15, 2021

This commit adds fused load support by bypassing LUI results to load.

For better timing, detection is done at the rename stage. Imm is stored
in psrc(1), psrc(0) and imm.

fd7603d9

14 12月, 2021 6 次提交

H

README: fix a typo (#1357) · 6a326a79
由 Haojin Tang 提交于 12月 14, 2021

6a326a79
Y

difftest: move sc_valid to AtomicsUnit (#1350) · e13d224a
由 Yinan Xu 提交于 12月 14, 2021

e13d224a

dp2: out.bits does not depend on lsq.canAccept (#1352) · 74ca315b

由 Yinan Xu 提交于 12月 14, 2021

This commit optimizes Dispatch2Rs timing by ignoring lsq.canAccept
when sending bits to reservation stations.

74ca315b

Optimize IFU and PreDecode timing (#1347) · 2a3050c2

由 Jay 提交于 12月 14, 2021

* ICache: add ReplacePipe for Probe & Release

* remove ProbeUnit

* Probe & Release enter ReplacePipe

* fix bugs when running Linux on MinimalConfig

* TODO: set conflict for ReplacePipe

* ICache: fix ReplacePipe invalid write bug

* chores: code clean up

* IFU: optimize timing

* PreDecode: separate into 2 module for timing optimization

* IBuffer: add enqEnable to replace valid for timing

* IFU/ITLB: optimize timing

* IFU: calculate cut_ptr in f1

* TLB: send req in f1 and wait resp in f2

* ICacheMainPipe: add tlb miss logic in s0

* Optimize IFU timing

* IFU: fix lastHalfRVI bug

* IFU: fix performance bug

* IFU: optimize MMIO commit timing

* IFU: optmize trigger timing and add frontendTrigger

* fix compile error

* IFU: fix mmio stuck bug

2a3050c2

Z

dcache: fix bug in ecc check (#1349) · dd95524e
由 zhanglinjuan 提交于 12月 14, 2021

dd95524e

csr: update mtval/stval according to the trap mode (#1344) · 7c071650

由 Yinan Xu 提交于 12月 14, 2021

This commit changes the condition to update mtval and stval.

According to the RISC-V spec, when a trap is taken into M/S-mode,
mtval/stval is either set to zero or written wrih exception-specific
information to assist software in handling the trap.

Previously in XiangShan, mtval/stval is updated depending on the
current priviledge mode, which is incorrect.

7c071650

13 12月, 2021 3 次提交

Optimize dcache timing (#1332) · 69790076

由 zhanglinjuan 提交于 12月 13, 2021

* MissQueue: loose merging condition to ease timing stress

* MissQueue: remove grant_beats

* MissQueue: compare block addr, not the whole addr bits

* dcache: optimize timing for generating ready to sbuffer
Co-authored-by: NWilliam Wang <zeweiwang@outlook.com>

69790076

Y
Merge pull request #1345 from OpenXiangShan/fix-soft-prefetch · 979fa9bc
由 Yinan Xu 提交于 12月 13, 2021
```
mem: fix soft prefetch
```
979fa9bc

SoC: insert more buffers into mmio path (#1329) · be340b14

由 Jiawei Lin 提交于 12月 13, 2021

* SoC: add axi4spliter

* pmp: add apply method to reduce loc

* pma: add PMA used in axi4's spliter

* Fix package import

* pma: re-write tl-pma, put tl-pma into AXI4Spliter

* pma: add memory mapped pma

* soc: rm dma port, rm axi4spliter, mv mmpma out of spliter

* csr: clear mstatus.mprv when mstatus.mpp != ModeM at xret

* csr: fix write mask for mstatus, mepc and sepc

This commit fixes the write mask for mstatus, mepc and sepc.

According to the RISC-V instruction manual, for RV64 systems,
the SXL and UXL fields are WARL fields that control the value of
XLEN for S-mode and U-mode, respectively. For RV64 systems, if
S-mode is not supported, then SXL is hardwired to zero. For RV64
systems, if U-mode is not supported, then UXL is hardwired to zero.

Besides, mepc[0] and sepc[0] should be hardwired to zero.

* wb,load: delay load fp for one cycle

* csr: add mconfigptr, but hardwire to 0 now

* bump huancun

* csr: add *BE to mstatusStruct which are hardwired to 0

* Remove unused files

* csr: fix bug of xret clear mprv

* bump difftest

* ci: add unit test, xret clear mstatus.mprv when xpp is not M

* bump ready-to-run

* mem,atomics: delay exception info for one cycle

* SoC: insert more buffers into mmio path

* SoC: insert buffer between l3_xbar and l3_banked_xbar

* Optimze l3->ddr path

* Bump huancun
Co-authored-by: NZhangZifei <zhangzifei20z@ict.ac.cn>
Co-authored-by: NYinan Xu <xuyinan@ict.ac.cn>
Co-authored-by: Nwangkaifan <wangkaifan@ict.ac.cn>

be340b14

12 12月, 2021 5 次提交

W

mem: replay soft prefetch if tlb miss · c707f0c8
由 William Wang 提交于 12月 12, 2021

c707f0c8

L2/L3: fix prefetch train address (#1339) · 459ad1b2

由 Jiawei Lin 提交于 12月 12, 2021

* L2/L3: fix prefetch train address

* HuanCun: update SRAMTemplate

* Config: Keep the client dir capacity of L3 twice the L2

* Bump huancun

459ad1b2

W

csr: add soft_prefetch_enable to smblockctl · d10a581e
由 William Wang 提交于 12月 12, 2021

d10a581e
W
mem: soft prefetch will not be replayed · 690158b0
由 William Wang 提交于 12月 12, 2021
```
Soft prefetch will be always marked as "load hit"
```
690158b0

csr: add vectored trap mode (#1343) · 68b89fcb

由 Yinan Xu 提交于 12月 12, 2021

All bits for stvec and mtvec are writable in XiangShan.

According to the RISC-V spec, {m,s}tvec[1:0] are MODE bits. When
MODE=Vectored, all synchronous exceptions into M/S mode cause the pc
to be set to the address in the BASE field, whereas interrupts cause
the pc to be set to the address in the BASE field plus four times
the interrupt cause number.

If XiangShan decides to not support vectored mode, {m,s}tvec[1:0]
should be hardwired to zero.

68b89fcb

11 12月, 2021 4 次提交

jump: set the LSB of the target to zero (#1342) · 1a389dfd

由 Yinan Xu 提交于 12月 11, 2021

According to RISC-V spec, for the JALR instruction, its target address
is obtained by adding the sign-extended 12-bit I-immediate to the
register rs1, then setting the least-significant bit of the result
to zero.

1a389dfd

Y

csr: delay fflags and dirty_fs for better timing (#1341) · 7181c0c1
由 Yinan Xu 提交于 12月 11, 2021

7181c0c1

mmu: timing optimization of ptwfilter's recv and issue & storeunit's mmio (#1326) · 2c2c1588

由 Lemover 提交于 12月 11, 2021

* TLB: when miss, regnext the req sent to ptw

* PTWFilter: timing optimzation of do_iss that ignore ptwResp's filter

* StoreUnit: logic optimization of from s2_mmio to s2_out_valid

* ptwfilter: when issue but filtered, clear the v bit

special case that
ptw.resp clear all the duplicate req when arrive to filter
ptw_resp is the RegNext of ptw.resp and it filters ptw.req
when ptw_resp filter the req but ptw.resp not filter the tlb_req to
stop do_enq, then the v bit of the req will not be cleared ever.

It will be more correct to fliter the entries and tlb_req with ptw_resp,
but the timing restriction says no. So just use the confusing trick
to slove the complicate corner case.

2c2c1588

core: delay csrCtrl for two cycles (#1336) · 6f688dac

由 Yinan Xu 提交于 12月 11, 2021

This commit adds DelayN(2) to some CSR-related signals, including
control bits to ITLB, DTLB, PTW, etc.

To avoid accessing the ITLB before control bits change, we also need
to delay the flush for two cycles. We assume branch misprediction or
memory violation does not cause csrCtrl to change.

6f688dac

10 12月, 2021 4 次提交

icache: support data/tag r/w op (#1337) · 70899835

由 William Wang 提交于 12月 10, 2021

* mem,cacheop: fix read data writeback

* mem,cacheop: rename cacheop state bits

These bits are different from w_*, s_* bits in cache

* mem: enable icache op feedback

* icache: update cache op implementation

* chore: remove cache op logic from XSCore.scala

70899835

W

dcache: fix lrsc_locked_block check (#1334) · 8b538b51
由 William Wang 提交于 12月 10, 2021

8b538b51

core: refactor hardware performance counters (#1335) · 1ca0e4f3

由 Yinan Xu 提交于 12月 10, 2021

This commit optimizes the coding style and timing for hardware
performance counters.

By default, performance counters are RegNext(RegNext(_)).

1ca0e4f3

bump huancun (#1322) · 1dc3a3a0

由 wakafa 提交于 12月 10, 2021

* bump huancun

* bump huancun

* Bump huancun
Co-authored-by: NLinJiawei <linjiawei20s@ict.ac.cn>

1dc3a3a0

09 12月, 2021 2 次提交

J

ICache: send ProbeAck when Probe NToN (#1331) · 1d4a76ae
由 Jay 提交于 12月 09, 2021

1d4a76ae

core: refactor writeback parameters (#1327) · 6ab6918f

由 Yinan Xu 提交于 12月 09, 2021

This commit adds WritebackSink and WritebackSource parameters for
multiple modules. These traits hide implementation details from
other modules by defining IO-related functions in modules.

By using WritebackSink, ROB is able to choose the writeback sources.
Now fflags and exceptions are connected from exe units to reduce write
ports and optimize timing.

Further optimizations on write-back to RS and better coding style to
be added later.

6ab6918f

08 12月, 2021 4 次提交

csr: add write mask to satp.ppn & xstatus.xs (#1323) · 705cbec3

由 Lemover 提交于 12月 08, 2021

* csr.satp: add r/w mask of ppn part

* ci: add unit test, satp should concern PADDRBITS

* csr.xstatus: XS field is ready-only

* bump ready-to-run

* bump ready-to-run, update nemu so

* fix typo

705cbec3

dcache: optimize refill block timing (#1320) · b36dd5fd

由 William Wang 提交于 12月 08, 2021

Now we RegNext(refill_req) for 1 cycle. It will provide more
time for refillShouldBeBlocked calcuation

b36dd5fd

Fix dcache probe (#1324) · 53e88463

由 William Wang 提交于 12月 08, 2021

* dcache: give probe the highest priority

* dcache: fix block probe logic

* dcache: give replace_req higher priority

53e88463

R

update f2_mmio update logic (#1325) · c0b2b8e9
由 rvcoresjw 提交于 12月 08, 2021

c0b2b8e9

07 12月, 2021 1 次提交

dcache: fix read data cache op (#1319) · b6358f8f

由 William Wang 提交于 12月 07, 2021

* mem,cacheop: fix read data writeback

* mem,cacheop: rename cacheop state bits

These bits are different from w_*, s_* bits in cache

b6358f8f

OpenXiangShan / XiangShan 12 个月 前同步成功

OpenXiangShan / XiangShan
12 个月前同步成功