1. 14 12月, 2021 5 次提交
    • Y
      difftest: move sc_valid to AtomicsUnit (#1350) · e13d224a
      Yinan Xu 提交于
      e13d224a
    • Y
      dp2: out.bits does not depend on lsq.canAccept (#1352) · 74ca315b
      Yinan Xu 提交于
      This commit optimizes Dispatch2Rs timing by ignoring lsq.canAccept
      when sending bits to reservation stations.
      74ca315b
    • J
      Optimize IFU and PreDecode timing (#1347) · 2a3050c2
      Jay 提交于
      * ICache: add ReplacePipe for Probe & Release
      
      * remove ProbeUnit
      
      * Probe & Release enter ReplacePipe
      
      * fix bugs when running Linux on MinimalConfig
      
      * TODO: set conflict for ReplacePipe
      
      * ICache: fix ReplacePipe invalid write bug
      
      * chores: code clean up
      
      * IFU: optimize timing
      
      * PreDecode: separate into 2 module for timing optimization
      
      * IBuffer: add enqEnable to replace valid for timing
      
      * IFU/ITLB: optimize timing
      
      * IFU: calculate cut_ptr in f1
      
      * TLB: send req in f1 and wait resp in f2
      
      * ICacheMainPipe: add tlb miss logic in s0
      
      * Optimize IFU timing
      
      * IFU: fix lastHalfRVI bug
      
      * IFU: fix performance bug
      
      * IFU: optimize MMIO commit timing
      
      * IFU: optmize trigger timing and add frontendTrigger
      
      * fix compile error
      
      * IFU: fix mmio stuck bug
      2a3050c2
    • Z
      dcache: fix bug in ecc check (#1349) · dd95524e
      zhanglinjuan 提交于
      dd95524e
    • Y
      csr: update mtval/stval according to the trap mode (#1344) · 7c071650
      Yinan Xu 提交于
      This commit changes the condition to update mtval and stval.
      
      According to the RISC-V spec, when a trap is taken into M/S-mode,
      mtval/stval is either set to zero or written wrih exception-specific
      information to assist software in handling the trap.
      
      Previously in XiangShan, mtval/stval is updated depending on the
      current priviledge mode, which is incorrect.
      7c071650
  2. 13 12月, 2021 3 次提交
    • Z
      Optimize dcache timing (#1332) · 69790076
      zhanglinjuan 提交于
      * MissQueue: loose merging condition to ease timing stress
      
      * MissQueue: remove grant_beats
      
      * MissQueue: compare block addr, not the whole addr bits
      
      * dcache: optimize timing for generating ready to sbuffer
      Co-authored-by: NWilliam Wang <zeweiwang@outlook.com>
      69790076
    • Y
      Merge pull request #1345 from OpenXiangShan/fix-soft-prefetch · 979fa9bc
      Yinan Xu 提交于
      mem: fix soft prefetch
      979fa9bc
    • J
      SoC: insert more buffers into mmio path (#1329) · be340b14
      Jiawei Lin 提交于
      * SoC: add axi4spliter
      
      * pmp: add apply method to reduce loc
      
      * pma: add PMA used in axi4's spliter
      
      * Fix package import
      
      * pma: re-write tl-pma, put tl-pma into AXI4Spliter
      
      * pma: add memory mapped pma
      
      * soc: rm dma port, rm axi4spliter, mv mmpma out of spliter
      
      * csr: clear mstatus.mprv when mstatus.mpp != ModeM at xret
      
      * csr: fix write mask for mstatus, mepc and sepc
      
      This commit fixes the write mask for mstatus, mepc and sepc.
      
      According to the RISC-V instruction manual, for RV64 systems,
      the SXL and UXL fields are WARL fields that control the value of
      XLEN for S-mode and U-mode, respectively. For RV64 systems, if
      S-mode is not supported, then SXL is hardwired to zero. For RV64
      systems, if U-mode is not supported, then UXL is hardwired to zero.
      
      Besides, mepc[0] and sepc[0] should be hardwired to zero.
      
      * wb,load: delay load fp for one cycle
      
      * csr: add mconfigptr, but hardwire to 0 now
      
      * bump huancun
      
      * csr: add *BE to mstatusStruct which are hardwired to 0
      
      * Remove unused files
      
      * csr: fix bug of xret clear mprv
      
      * bump difftest
      
      * ci: add unit test, xret clear mstatus.mprv when xpp is not M
      
      * bump ready-to-run
      
      * mem,atomics: delay exception info for one cycle
      
      * SoC: insert more buffers into mmio path
      
      * SoC: insert buffer between l3_xbar and l3_banked_xbar
      
      * Optimze l3->ddr path
      
      * Bump huancun
      Co-authored-by: NZhangZifei <zhangzifei20z@ict.ac.cn>
      Co-authored-by: NYinan Xu <xuyinan@ict.ac.cn>
      Co-authored-by: Nwangkaifan <wangkaifan@ict.ac.cn>
      be340b14
  3. 12 12月, 2021 5 次提交
  4. 11 12月, 2021 4 次提交
    • Y
      jump: set the LSB of the target to zero (#1342) · 1a389dfd
      Yinan Xu 提交于
      According to RISC-V spec, for the JALR instruction, its target address
      is obtained by adding the sign-extended 12-bit I-immediate to the
      register rs1, then setting the least-significant bit of the result
      to zero.
      1a389dfd
    • Y
      csr: delay fflags and dirty_fs for better timing (#1341) · 7181c0c1
      Yinan Xu 提交于
      7181c0c1
    • L
      mmu: timing optimization of ptwfilter's recv and issue & storeunit's mmio (#1326) · 2c2c1588
      Lemover 提交于
      * TLB: when miss, regnext the req sent to ptw
      
      * PTWFilter: timing optimzation of do_iss that ignore ptwResp's filter
      
      * StoreUnit: logic optimization of from s2_mmio to s2_out_valid
      
      * ptwfilter: when issue but filtered, clear the v bit
      
      special case that
      ptw.resp clear all the duplicate req when arrive to filter
      ptw_resp is the RegNext of ptw.resp and it filters ptw.req
      when ptw_resp filter the req but ptw.resp not filter the tlb_req to
      stop do_enq, then the v bit of the req will not be cleared ever.
      
      It will be more correct to fliter the entries and tlb_req with ptw_resp,
      but the timing restriction says no. So just use the confusing trick
      to slove the complicate corner case.
      2c2c1588
    • Y
      core: delay csrCtrl for two cycles (#1336) · 6f688dac
      Yinan Xu 提交于
      This commit adds DelayN(2) to some CSR-related signals, including
      control bits to ITLB, DTLB, PTW, etc.
      
      To avoid accessing the ITLB before control bits change, we also need
      to delay the flush for two cycles. We assume branch misprediction or
      memory violation does not cause csrCtrl to change.
      6f688dac
  5. 10 12月, 2021 4 次提交
  6. 09 12月, 2021 2 次提交
    • J
      ICache: send ProbeAck when Probe NToN (#1331) · 1d4a76ae
      Jay 提交于
      1d4a76ae
    • Y
      core: refactor writeback parameters (#1327) · 6ab6918f
      Yinan Xu 提交于
      This commit adds WritebackSink and WritebackSource parameters for
      multiple modules. These traits hide implementation details from
      other modules by defining IO-related functions in modules.
      
      By using WritebackSink, ROB is able to choose the writeback sources.
      Now fflags and exceptions are connected from exe units to reduce write
      ports and optimize timing.
      
      Further optimizations on write-back to RS and better coding style to
      be added later.
      6ab6918f
  7. 08 12月, 2021 4 次提交
  8. 07 12月, 2021 3 次提交
  9. 06 12月, 2021 9 次提交
  10. 05 12月, 2021 1 次提交