github-mirrors/cva6 - Blizzard Finnegan's Codeforge

mirror of https://github.com/openhwgroup/cva6.git synced 2025-04-22 21:27:10 -04:00

Author	SHA1	Message	Date
Guillaume Chauvon	3ce44b1b4e	Spyglass clean up: multiple change to remove Spyglass warnings (#2727 ) Some checks are pending bender-up-to-date / bender-up-to-date (push) Waiting to run Details ci / build-riscv-tests (push) Waiting to run Details ci / execute-riscv64-tests (push) Blocked by required conditions Details ci / execute-riscv32-tests (push) Blocked by required conditions Details Multiple changes to clean up code and remove Spyglass warnings. Co-authored-by: JeanRochCoulon <jean-roch.coulon@thalesgroup.com>	2025-01-23 08:32:31 +01:00
AngelaGonzalezMarino	c389382c89	Altera opt 2 (#2602 ) The second optimization for Altera FPGA is to move the BHT to LUTRAM. Same as before, the reason why the optimization previously done for Xilinx is not working, is that in that case asynchronous RAM primitives are used, and Altera does not support asynchronous RAM. Therefore, this optimization consists in using synchronous RAM for the BHT. The main changes to the existing code are: New RAM module to infer synchronous RAM in altera with 2 independent read ports and one write port (SyncThreePortRam.sv) Changes in the frontend.sv file: modify input to vpc_i port of BHT, by advancing the address to read, in order to compensate for the delay of synchronous RAM. Changes in the bht.sv file: This case is more complex because of the logic operations that need to be performed inside the BHT. First, the pc pointed by bht_update_i is read from the memory, modified according to the saturation counter and valid bit, and finally written again in the memory. The prediction output is given based on the vpc_i. With asynchronous memory, the new data written via update_i is available one clock cycle after writing it. So, if vpc_i tries to read the address that was previously written by update_i, everything is fine. However, in the case of synchronous memory there are three clock cycles of latency (one for reading the pc content (read port 1), another one for writing it, and another one for reading in the other port (read port 0)). For this reason, there is the need to adapt the design to these new latency constraints: First, there is the need for a delay on the address write of the synchronous RAM, to wait for the previous pc read and store the right modified data. Once this is solved, similarly to the FIFO case, there is the need for an auxiliary buffer that will store the data written in the FIFO, allowing to have it available 2 clock cycles after the update_i was valid. This is because after having the correct data, the RAM takes 2 clock cycles until data can be available in the output (one clock cycle for writing and one for reading). Finally, there is a multiplexer in the output that permits to deliver the correct prediction providing the data from the update logic (1 cycle of delay), the auxiliary register (2 cycles of delay), or the RAM (3 or more cycles of delay), depending on the delay since the update_i was valid (i.e. written to the memory).	2024-11-21 23:36:18 +01:00
AngelaGonzalezMarino	33c5d77bd8	Altera opt 1 (#2592 ) Some checks failed bender-up-to-date / bender-up-to-date (push) Has been cancelled Details ci / build-riscv-tests (push) Has been cancelled Details ci / execute-riscv64-tests (push) Has been cancelled Details ci / execute-riscv32-tests (push) Has been cancelled Details The first optimization for Altera FPGA is to move the instruction queue to LUTRAM. The reason why the optimization previously done for Xilinx is not working, is that in that case asynchronous RAM primitives are used, and Altera does not support asynchronous RAM. Therefore, this optimization consists in using synchronous RAM for the instruction queue and FIFOs inside wt axi adapter. The main changes to the existing code are: New RAM module to infer synchronous RAM in altera with independent read and write ports (SyncDpRam_ind_r_w.sv) Changes inside cva6_fifo_v3 to adapt to the use of synchronous RAM instead of asynchronous: When the FIFO is not empty, next data is always read and available at the output hiding the reading latency introduced by synchronous RAM (similar to fall-through approach). This is a simplification that is possible because in a FIFO we always know what is the next address to be read. When data is read right after write, we can’t use the previous method because there is a latency to first write the data in the FIFO, and then to read it. For this reason, in the new design there is an auxiliary register used to hide this latency. This is used only if the FIFO is empty, so we detect when the word written is first word, and keep it in this register. If the next cycle comes a read, the data out is taken from the aux register. Afterwards the data is already available in the RAM and can be read continuously as in the first case. All this is only used inf FpgaAlteraEn parameter is enabled, otherwise the previous implementation with asynchronous RAM applies (when FpgaEn is set), or the register based implementation (when FpgaEn is not set).	2024-11-15 14:34:15 +01:00
CoralieAllioux	205872acc6	Fix initialization of memory array in simulation (#2259 )	2024-06-14 14:34:08 +02:00
Florian Zaruba	38e8c059b2	Parameterization and other fixes for downstream project (#1950 ) * Bender fixes and switch to `cva6_fifo_v3` * cfg: Fix verilator warnings * Bender: Fix yml * acc_dispatcher: Add `csr_addr_i` * parameterization: Fox AXI_USER_EN warning * wb_cache: Fix Verilator Lint warnings * cva6_fifo_v3: Add to Flist * parameterization: Address review concerns * Switch to `cva6_fifo_v3` * tracer: Remove tracer interface The interface made a bunch of problems with the typedefs so I've removed it. --------- Co-authored-by: JeanRochCoulon <jean-roch.coulon@thalesgroup.com> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2024-04-05 13:02:18 +02:00
Côme	987c645bb7	Parametrization step 3 (#1935 ) This is the third step for #1451. Many values are moved but not all values are moved yet * move NR_SB_ENTRIES & TRANS_ID_BITS * remove default rvfi_instr_t from spike.sv * fifo_v3: ariane_pkg::FPGA_EN becomes a param * move FPGA_EN * inline wt_cache_pkg::L15_SET_ASSOC * move wt_cache_pkg::L15_WAY_WIDTH * inline wt_cache_pkg::L1I_SET_ASSOC * inline wt_cache_pkg::L1D_SET_ASSOC * move wt_cache_pkg::DCACHE_CL_IDX_WIDTH * move ICACHE_TAG_WIDTH * move DCACHE_TAG_WIDTH * move ICACHE_INDEX_WIDTH * move ICACHE_SET_ASSOC * use ICACHE_SET_ASSOC_WIDTH instead of $clog2(ICACHE_SET_ASSOC) * move DCACHE_NUM_WORDS * move DCACHE_INDEX_WIDTH * move DCACHE_OFFSET_WIDTH * move DCACHE_BYTE_OFFSET * move DCACHE_DIRTY_WIDTH * move DCACHE_SET_ASSOC_WIDTH * move DCACHE_SET_ASSOC * move CONFIG_L1I_SIZE * move CONFIG_L1D_SIZE * move DCACHE_LINE_WIDTH * move ICACHE_LINE_WIDTH * move ICACHE_USER_LINE_WIDTH * move DCACHE_USER_LINE_WIDTH * DATA_USER_WIDTH = DCACHE_USER_WIDTH * move DCACHE_USER_WIDTH * move FETCH_USER_WIDTH * move FETCH_USER_EN * move LOG2_INSTR_PER_FETCH * move INSTR_PER_FETCH * move FETCH_WIDTH * transform SSTATUS_SD and SMODE_STATUS_READ_MASK into functions * move [SM]_{SW,TIMER,EXT}_INTERRUPT into a structure * move SV * move vm_mode_t to config_pkg * move MODE_SV * move VPN2 * move PPNW * move ASIDW * move ModeW * move XLEN_ALIGN_BYTES * move DATA_USER_EN * format: apply verible	2024-03-15 17:21:34 +00:00
Nils Wistoff	29eae1ca22	tech_cells_generic: Upgrade to v0.2.13 (#1676 )	2023-12-06 11:16:58 +01:00
Nils Wistoff	3833439fb7	fpu: ⬆️ Update FPU version (#1102 ) Co-authored-by: Matteo Perotti <mperotti@iis.ee.ethz.ch> Co-authored-by: Matheus Cavalcante <matheusd@iis.ee.ethz.ch> Co-authored-by: JeanRochCoulon <jean-roch.coulon@thalesgroup.com>	2023-04-14 23:53:45 +02:00
sébastien jacq	3b55657552	BHT optimized for fpga (#1039 ) It counts and saves the saturation bits in D flip-flops for the ASIC version and in a three-port asynchronous read memory for the FPGA version. FPGA flushing is not supported because the frontend module flushing signal is not connected.	2023-02-01 16:37:44 +01:00
sébastien jacq	32abc1ccda	Add Fifo v3 to optimize fpga implementation in resource size (#1032 )	2023-01-23 12:33:44 +01:00
Zbigniew Chamski	b44a696bbe	Vendorize fpga-support submodule. (#1030 )	2023-01-16 11:40:04 +01:00
Zbigniew Chamski	17ccfc42f4	Vendorize corev_apu submodules referenced by CVA6 core. (#1015 )	2022-12-13 12:20:36 +01:00
Zbigniew Chamski	8a5898dce4	Vendorize CVA6 core submodules (common_cells, FPU with related sub-modules) (#1007 )	2022-12-09 11:07:12 +01:00

13 commits