github-mirrors/ibex - Blizzard Finnegan's Codeforge

mirror of https://github.com/lowRISC/ibex.git synced 2025-06-28 17:24:11 -04:00

Author	SHA1	Message	Date
Yuichi Sugiyama	46bef34dee	[rtl] Fix hazard detection issues for Pointer Authentication With the writeback stage enabled we execute the PAC/AUT instruction before the required data is written to the register file. For example, when the load instruction precedes AUT instruction, AUT instruction is started before the loaded data is written to the register file. It is a problem that the hazard detection (stall_ld_hz) using rf_ren_a/b_o was not active for PAC/AUT instruction. Also, I change codes not to activate pa_pac_en or pa_aut_en when load hazard occurs.	2020-07-15 17:02:42 +02:00
Yuichi Sugiyama	f13ac7b8b9	[rtl] Add Pointer Authentication	2020-07-15 17:02:42 +02:00
ganoam	1aa4d5a32b	[bitmanip] Optimizations and Parametrization This commit contains some final optimizations regarding the bit manipulation extension as well as the parametrization into a balanced version and a full performance version. Balanced Version: * Supports ZBB, ZBS, ZBF and ZBT extensions * Dual cycle instructions: ror[i], rol, cmov, cmix fsl, fsr[i] * Everything else completes in a single cycle. Full Version: * Supports all 32b sub extensions. * Dual cycle instructions: ror[i], rol, cmov, cmix fsl, fsr[i], crc32[c], bext, bdep * Everything else completes in a single cycle. Notable Changes: * bext/bdep are now multi-cycle: Sharing additional register with multiplier module * grev/gorc instructions are implemented in separate structures rather than sharing the shifter or butterfly network. * Speed up decision on using rs1 or rs3 for alu_operand_a by introducing single-bit register, to identify ternary instructions in their first cycle. * Introduce enumerated parameter to chose bit manipulation implementation Signed-off-by: ganoam <gnoam@live.com>	2020-06-26 14:43:24 +02:00
Bert Pieters	4eece98875	[rtl] Remove use of `define in decoder Fixes #30 Signed-off-by: Bert Pieters <bert.pieters@gmail.com>	2020-06-18 15:38:05 +01:00
ganoam	66687e927c	[bitmanip] Add ZBR instruction group This commit implements the Bit Manipulation Extension ZBR instruction group: crc32[c].[bhw]. CRC-32 (CRC-32/ISO-HDLC) and CRC-32C (CRC-32/ISCSI) are directly implemented. The CRC operation solves the following equation using binary polynomial arithmetic: rev(rd)(x) = rev(rs1)(x) * xn mod {1, P}(x), where {1,P}(x) denotes the crc polynomial. Using barret reduction one can write this as rd = (rs1 >> n) ^ rev(rev( (rs1 << (32-1)) cx rev(mu)) cx P) ^-- cycle 0--------------------^ ^-- cycle 1 ------------------------------------------^ Where cx denotes carry-less multiplication and mu = polydiv(x64, {1,P}), omitting the MSB (bit 32). The implementation increases area consumption by ~0.6kGE for synthesis with relaxed timing constraints. With tight timing constraints that is ~1.6kGE. There is no significant impact on frequency. Signed-off-by: ganoam <gnoam@live.com>	2020-05-22 17:21:03 +02:00
ganoam	f173e2baba	[bitmanip] Add ZBC instruction group This commit implements the Bit Manipulation Extension ZBC instruction group: clmul[rh] (carry-less multiply [reverse][high]) Carry-less multiplication can be understood as multiplication based on the addition interpreted as the bit-wise xor operation. Example: 1101 X 1011 = 1111111: 1011 X 1101 ----------- 1101 xor 1101 --------- 10111 xor 0000 ---------- 010111 xor 1101 ----------- 1111111 Architectural details: A 32 x 32-bit array [ operand_b[i] ? (operand_a << i) : '0 for i in 0 ... 31 ] is generated. The entries of the array are pairwise 'xor-ed' together in a 5-stage binary tree. The area increase when synthesized with relaxed timing constraints is 1.6-1.7kGE. Timing figures are improve by 0.1 ns for the 3-stage configuration and worsen by 0.04ns for the 2-stage implementation. This suggests fluctuations due to the heuristic nature of the synthesis tools. Signed-off-by: ganoam <gnoam@live.com>	2020-05-19 10:38:38 +02:00
ganoam	9bd3350bb3	[bitmanip] Add sext.b/h instructions This commit implements the Bit Manipulation Extension sign-extend instructions: sext.b (sign-extend byte) and sext.h (sign-extend half word). The implementation is basically a one-liner, duplicating the msb of the byte / half-word into the msb of the output register. Signed-off-by: ganoam <gnoam@live.com>	2020-05-14 22:03:45 +02:00
ganoam	fac404a6f3	[bitmanip] Add ZBF instruction group This commit implements the Bit Manipulation Extension ZBF instruction group, which consists only of the one instruction bfp (bit-field place). This instruction places a field of length len < 16 from rs2 in rs1 at offset off. Architectureal details: The implementation works exactly the same as proposed by Claire Wolf in her reference implementation. 1. bfp_mask = slo(o, len) 2. bfp_result = (rs1 & ~(bfp_mask << off)) \| (rs2 & bfp_mask) << off ^------ shifter-^ The existing shifter structure is shared for the indicated operation. Impact on area: * When synthesizing without the B-extension, the 2 stage design seems to move the timing bottleneck, leading to optimizations which result in an area increase by 1 kGE, when synthesized with tight timing constraints. For the 3 stage configuration there is no change. When synthesized with relaxed timing constraints there is no significant change in either configuration. * With the B-extension enabled, the area increase for tight timing constraints is 1.1-1.2 kGE. For relaxed timing constraints that is ~0.4kGE Impact on timing: No significant impact. Signed-off-by: ganoam <gnoam@live.com>	2020-05-14 21:34:49 +02:00
ganoam	0afd000a09	[bitmanip] Add ZBE Instruction Group This commit implements the Bit Manipulation Extension ZBE instruction group: bext (bit extract) and bdep (bit deposit). Architectural details: * bext/bdep: A new butterfly and inverse butterfly network is implemented. The generation of its controlbits depend on a parallel prefix bitcount of the deposit / extract mask. * bitcounter: The path for bext / bdep instructions traverses the bit counter and the butterfly network, resulting in both a larger delay and area. To mitigate the bitcounter has been changed from a serial bit counter to a radix-2 tree structure. * grev/gorc: Zbp instructions general reverse and general or-combine have as of yet shared the shifters reversal structure. It has proven benefitial to area and timing to reuse the novel butterfly network instead The butterfly network itself consumes ~3.5kGE and ~1.1kGE for synthesis with tight and relaxed timing constraints respectively. Including the optimizations of the bitcounter and grev/gorc, the overall change in area consumption is +4.6kGE (+1.2kGE) and +3.3kGE (+1.1kGE) for synthesis with tight (relaxed) timing constraints for 2- and 3-stage configurations respectively. For tight timing constraints that is a growth by around ~10%, for relaxed ~5%. The impact on the maximum frequency is negligable. Signed-off-by: ganoam <gnoam@live.com>	2020-05-14 16:43:19 +02:00
Pirmin Vogel	3922b2582f	[rtl] Rework generation and use of mult/div_sel/en Signed-off-by: Pirmin Vogel <vogelpi@lowrisc.org>	2020-05-01 17:29:59 +02:00
ganoam	a68923a404	[bitmanip] Add ZBP Instruction Group This commit implements the Bit Manipulation Extension ZBP instruction group: grev[i] (generalized reverse), gorc[i] (generalized or-combine) and [un]shfl[i] (generalized shuffle) and all of their pseudo-instructions. Architectural details: * grev / gorc: The shifter structure features only a right shift structure. In order to perform a left shift therefore the operand needs to be reversed, shifted and reversed again. The architecture of the back-reversal is implemented in stages which are activated using the general reverse / orcombine operand, or a signal marking left-shifts. * shfl / unshfl: Also known as zip / unzip or interlace / uninterlace operation. These instructions are implemented in their own structure using a permutation networ of 6 stages. 4 stages thereof implement the shuffle permutations. the first and last stage is the flip stage, which effectively reverse s the order of the inner stages, for unshuffle operations. Signed-off-by: ganoam <gnoam@live.com>	2020-04-29 11:10:44 +02:00
ganoam	133fef2c2f	[bitmanip] Add ZBS Instruction Group This commit implements the Bit Manipulation Extension SBS instruction group: sbset[i], sbclr[i], sbinv[i] and sbext[i]. These instructions set, clear, invert or extract bit rs1[rs2] or rs1[imm] for reg-reg and reg-imm instructions respectively. Archtectural details: * A multiplexer is added to the shifter structure in order to chose between 32'h1, used for the single-bit instructions as summarized below, and regular operand_b input. * Dedicated bitwise-logic blocks are introduced for multicycle shifts and cmix instructions (fsr, fsl, ror, rol), single-bit instructions (sbset, sbclr, sbinv, sbext), and stanard-ALU and zbb instructions (or, and xor, orn, andn, xnor). Instruction details: All of the zbs instructions rely on sharing the existing shifter structure. The instructions are carried out in one cycle. * sbset, sbclr, sbinv: shift_result = 32'h1 << rs2[4:0]; singlebit_result = rs1 [\|, ^ , &~] shift_result; * sbext: shift_result = rs1 >> rs2[4:0]; singlebit_result = {31'0,shift_resutl[0]}; Signed-off-by: ganoam <gnoam@live.com>	2020-04-24 08:32:30 +02:00
ganoam	06f50ddeac	Bugfix: Generate Erroneous Illegal Insn This commit fixes three possible cases for erroneous generation of illegal instruction signals. Also, the bit-slices considered for decoding ALU instructions are corrected to better reflect their encoding specifications. * Fix decoding of orc_b in illegal_insn generation. * Insn[31] is no longer checked for generation of illegal instructions: This bit is part of the rs3 register adress for ternary bitmanipulation instructions (zbt). * Correct bit-slicing for ALU reg-immediate instructions according to specification: immediates are encoded in the range insn[26:20] in all cases. Where a shift-amount is encoded, bits [26:25] will have no effect, but will no longer generate illegal instructions. Signed-off-by: ganoam <gnoam@live.com>	2020-04-17 13:39:38 +02:00
ganoam	4cb77b8121	[bitmanip] Add ZBT Instruction Group This commits implements the Bit Manipulateion Extension ZBT instruction group: cmix, cmov, fsr[i] and fsl. Those are instructions depend on three ALU operands. Completeion of these instructions takes 2 clock cycles. Additionally, the rotation shifts rol and ror are made multicycle instructions. All multicycle instructions take exactly two cycles to complete. Architectural additions: * Multicycle Stage Register in ID stage. multicycle_op_stage_reg * Decoder generates alu_multicycle signal, to stall pipeline * For all ternary instructions: 1. cycle: connect alu operands a and b to rs1 and rs2 respectively 2. cycle: connect operands a and be to rs3 and rs2 respectively * Reduce the physical size of the shifter from 64 bit to 63 bit: 32-bit operand + 1 bit for arithmetic / one-shift * Make rotation shifts multicycle instructions. Instruction Details: * cmov: 1. store operand a (rs1) in stage reg. 2. return stage reg output (rs2) or rs3. if rs2 != 0 the output (rs1) is already known in the first cycle. -> variable latency implementation is possible. * cmix: 1. store rs1 & rs2 in stage reg 2. return stage_reg_q \| (rs2 & ~rs3) reusing bwlogic from zbb * rol/ror: (here: ror) shift_amt = rs2 & 31; shift_amt_compl = (32 - shift_amt) & 31 1. store (rs1 >> shift_amt) in stage reg 2. return (rs1 << shift_amt_compl) \| stage_reg_q * fsl/fsr: For funnel shifts, the order of applying the shift amount or its complement is determined by bit [5] of shift_amt. Pseudocode for fsr: shift_amt = rs2 & 63 shift_amt_compl = (32 - shift_amt[4:0]) 1. if (shift_amt >= 33): store (rs1 >> shift_amt_compl[4:0]) in stage reg else if (shift_amt <0 && shift_amt <= 31): store (rs1 << shift_amt[4:0]) in stage reg else if (shift_amt == 32 \|\| shift_amt == 0): store rs1 in stage reg 2. if (shift_amt >= 33): return stage_reg_q \| (rs3 << shift_amt[4:0]) else if (shift_amt <0 && shift_amt <= 31): return stage_reg_q \| (rs3 >> shift_amt_compl[4:0]) else if (shift_amt == 32): return rs3 else if (shift_amt == 0): return rs1 Signed-off-by: ganoam <gnoam@live.com>	2020-04-16 14:03:35 +02:00
Tom Roberts	97a50d7f12	[rtl] Add fixed time execution of branches - A new parameter and a run-time control bit (DataIndTiming and data_ind_timing) enabling different behaviour for running security critical code sections. - In the new mode, all branches act as if taken, with not-taken branches executing as a branch to the next instruction. - This should give similar execution time/power characteristics regardless of the branch condition. - Note that with the BranchTargetALU, branches stall an extra cycle in secure mode to avoid factoring the branch-taken decision into the branch target address mux. Signed-off-by: Tom Roberts <tomroberts@lowrisc.org>	2020-04-13 14:27:40 +01:00
ganoam	8a26111f40	[bitmanip] Add ZBB Instruction Group This commit implements the Bit Manipulation Extension ZBB instruction group: clz, ctz, pcnt, slo, sro, rol, ror, rev, rev8, orcb, pack packu, packh, min, max, andn, orn, and xnor. * Bit counting instructions clz, ctz and pcnt can be implemented to share much of the architecture: clz: Count Leading Zeros. Counts the number of 0 bits at the MSB end of the argument. ctz: Count Trailing Zeros. Counts the number of 0 bits at the LSB end of the argument. pcnt: Counts the number of set bits of the argument. The implementation uses: - 32 one bit adders, counting the set bits of a signal bitcnt_bits, starting from the LSB end. - For pcnt the argument is fed directly into bitcnt_bits. - For clz, the operand is reversed such that leading zeros are located at the LSB end of bitcnt_bits. - For ctz and clz: counter enable signal for 1-bit counter i is high, if the previous enable signal, and its corresponting bitcnt_bit was high. * Instructions sll[i], srl[i],slo[i], sro[i], rol, ror[i], rev, rev8 and orc.b are summarized as shifting instructions and related: The following instructions are slight variations of the existing base spec's sll, srl and sra instructions. - slo[i] and sro[i]: shift left/right ones: similar to shift-logical operations from base spec, but shifting in ones instead of zeros. - rol and ror[i]: rotate left/right ones: circular shift operations. shifting in values from the oposite end of the operand instead of zeros. Those instructions can be implemented, sharing the base spec's shifting structure. In order to support rotate operations, a 64-bit shifting structure is needed. In the existing ALU, hardware is described only for right shifts. For left shifts the operand is initially reversed, right shifted and the result is reversed back. This gives rise to an additional resource sharing oportunity for some more zbb operations: - rev: bitwise reversal. - rev8: byte-order swap. - orc.b: byte-wise reverse and or-combine. * Instructions min, max: For the B-extension's min/max instructions, we can share the existing comparison operations. The result is obtained by activating the comparison structure accordingly and multiplexing the operands using the comparison result. * Logic-with-negate instructions andn, orn, xnor: For the B-extension's logic-with-negate instructions we can share the structures of the base spec's logic structures already present for 'xnor', 'or' and 'and' instructions as well as the conditionally negated b operand generated for subtraction operations. * Instructions pack, packu, packh: For the pack, packh and packu instructions I don't see any opportunities for resource sharing. However, the architecture is quite simple. - pack: pack the lower halves of rs1 and rs2 into rd, with rs1 in the lower half and rs2 in the upper half. - packu: pack the upper halves of rs1 and rs2 into rd, with rs1 in the lower half and rs2 in the upper half. - packh: pack the LSB bytes of rs1 and rs2 into rd, with rs1 in the lower half and rs2 in the upper half. Signed-off-by: ganoam <gnoam@live.com>	2020-03-27 17:13:26 +01:00
Tom Roberts	624ef41462	[rtl] Extend BT ALU to be used for all jumps - Create separate operand muxes for the branch/jump target ALU - Complete jump instructions in one cycle when BT ALU configured Signed-off-by: Tom Roberts <tomroberts@lowrisc.org>	2020-03-25 15:25:22 +00:00
Tom Roberts	c054a63c3d	[rtl] Instantiate instruction cache - Add parameters and actual instantiation of icache - Add a custom CSR in the M-mode custom RW range to enable the cache - Wire up the cache invalidation signal to trigger on fence.i Signed-off-by: Tom Roberts <tomroberts@lowrisc.org>	2020-03-23 12:57:31 +00:00
Greg Chadwick	89e5fc11ed	[RTL] Add configurable third pipeline stage The third pipeline stage is a new writeback stage. Ibex can now be configured as the original two stage design or the new three stage design using the `WritebackStage` parameter in ibex_core. This defaults to 0 (giving the original two stage design). The three stage design is EXPERIMENTAL In the three stage design all register write back occurs in the third, final stage. This allows a cycle for responses to loads and stores so when the memory system can respond in a single cycle there will be no stall. This offers significant performance benefits. Documentation of the three stage design is still to be written so existing documentation applies to the two stage design only as various aspects of Ibex behaviour will change in the three stage design. Signed-off-by: Greg Chadwick <gac@lowrisc.org>	2020-03-06 15:29:14 +00:00
Greg Chadwick	5d16a5b104	[rtl] Introduce default clk/reset to prim_assert This mirrors the change made in OpenTitan (https://github.com/lowRISC/opentitan/pull/1485)	2020-02-10 09:42:52 +00:00
Greg Chadwick	3fcede8a13	[rtl] Comment and naming tweaks	2020-01-31 09:32:20 +00:00
Greg Chadwick	b52aacf91b	[rtl] Add multdiv_sel signal to decode multdiv_sel signals the mult/div operand should be selected for the ALU inputs. Previously the mult_en/div_en signals were used but these factor in whether the instruction is actually happening which is not relevant for the mux select. The dedicated select signal gives better timing.	2020-01-31 09:32:20 +00:00
Greg Chadwick	486bf45711	[rtl] Replicate instruction flops to reduce fanout Adds a second set of instruction flops that are used to determine ALU operation and operand selection. This reduces fanout from the instruction flops and so helps timing.	2020-01-31 09:32:20 +00:00
Greg Chadwick	639964514c	[RTL] Added seperate ALU for branch target On branches now compute target same cycle as the condition. This removes a stall cycle from all taken conditional branches.	2020-01-31 09:32:20 +00:00
Daniel Mlynek	9d232d1cde	Include assert macros when they are used prim_assert.sv is a file containing assertion macros (defines). Previously, prim_assert.sv was compiled as normal SystemVerilog file. This made the defines available for the whole compilation unit as soon as they were defined. Since all cores using prim_assert depended (in fusesoc) on the lowrisc:prim:assert core, prim_assert was always compiled first, and the defines were visible in subsequent files. All of that is only true if all files end up in one comilation unit. The SV standard states that what makes up a compilation unit is tool-defined, but also states that typically, passing multiple files (or a file list/.f file) to a single tool invocation means that all files end up in one compilation unit; if the tool is called multiple times, then the files end up in separate compilation units. Edalize (the fusesoc backend) doesn't guarantee either behavior, and so it happens that for Vivado, Verilator, Cadence and Synopsys simulators, all files are compiled into a single compilation unit. But for Riviera, each file is a separate compilation unit. To avoid relying on the definition of compilation units, and to do the generally right thing (TM), this commit changes the code to always include the prim_assert.sv file when it is used in a source file. Include guards are introduced in the prim_assert.sv file to avoid defining things twice.	2020-01-28 14:46:48 +00:00
Pirmin Vogel	0778008f33	[rtl] Remove X assignments, add SVAs for selector signals This commit replaces all X assignments in the RTL with defined values. In addition, SystemVerilog Assertions are added to catch invalid signal values in simulation. A new file containing the corresponding assertion macros is added as well. Signed-off-by: Pirmin Vogel <vogelpi@lowrisc.org>	2019-12-20 10:09:09 +01:00
Greg Chadwick	11749c7e4d	[rtl] Implement FENCE.I Fixes #391	2019-11-27 08:47:26 +00:00
Philipp Wagner	7eee24c094	Mention CREDITS.md in license header	2019-08-27 18:10:02 +01:00
Philipp Wagner	14b8f88957	Replace author credits in files with CREDITS.md We currently have a documentation block at the beginning of each file, containing author credits and module-level documentation. The module-level documentation is retained for historic reasons and duplicated with the newer comments below it. For the authors, maintaining author credits in the file is error-prone, as this information gets outdated very soon. A more reliable way to see who modified a file is to use the history information in git. Additionally, we now have the CREDITS.md file, which lists all contributors, even the ones which don't appear in the git history (e.g. because the code was copied and commited by someone else).	2019-08-27 18:10:02 +01:00
Ivan Ribeiro	2a47344dd1	Remove unused signal data_reg_offset	2019-08-20 14:45:47 +01:00
Pirmin Vogel	e1a7dcf37f	ID stage: rework CSR-related pipeline flushes This commit clarifies why CSR-related pipeline flushes are needed (e.g. when enabling interrupts), only introduces them when doing the critical modifications (write/set bits in `mstatus` and `mie` CSRs for enabling interrupts, reads and clears are uncritical for these CSRs), and makes sure the controller is actually able to start handling interrupts while doing a CSR-related pipeline flush. This resolves lowrisc/ibex#6.	2019-08-16 13:28:52 +01:00
Pirmin Vogel	a7f344b02a	Avoid linting errors	2019-07-29 15:52:42 +01:00
Philipp Wagner	428d057c4a	Rename ibex_[tracer_]define to ibex_[tracer_]pkg This file doesn't contain defines any more, but a normal SV package. The diff is best viewed without whitespace changes, as the reindents cause a lof of diff noise. Fixes lowrisc/ibex#173	2019-07-19 11:34:40 +01:00
Pirmin Vogel	e4383589eb	Decoder: remove reg-reg load, make LWU illegal The custom reg-reg load instruction was added in the original design but is no longer needed. This commit removes it. Also, load instructions with `instr[14:12] == 3'b110` are now decoded as illegal. This resolves #25.	2019-07-19 11:29:33 +01:00
Pirmin Vogel	04f4cd44f6	Decoder: avoid setting PC upon illegal JALR instructions Without this commit, the PC is still set to a possible wrong jump target on illegal JALR instructions ultimately causing the wrong PC being saved to `mepc` during the illegal instruction exception. This bug has been reported by @taoliug. This commit resolves #170.	2019-07-18 15:52:26 +01:00
Pirmin Vogel	5925449848	Decoder: fix signaling and detection of illegal instructions This commit fixes two bugs in the decoder: 1. For illegal branch condition selections, the illegal instruction condition must be signaled as long as the instruction is being executed and not just during the first cycle, as the controller cannot interrupt multicycle instructions. 2. Illegal instructions should also be signaled when `instr[28]` is set for register-register ALU operations. Previously, these were not signaled as the original design used `instr[28]` to encode custom bit- manipulation instructions. These bugs were discovered by @taoliug. This resolves issue #163.	2019-07-17 11:04:15 +01:00
Pirmin Vogel	fd6426f52c	Replace `pipe_flush` signal by `wfi_insn` This signal is triggered iff a `wfi` instruction is decoded.	2019-07-11 17:35:12 +01:00
Tobias Wölfel	1e2d5b7eda	Rename opcode OP-IMM Align name with RISC-V spec.	2019-07-11 10:04:56 +01:00
Tobias Wölfel	27d2addaa4	Rename fence opcode (#138 )	2019-07-11 10:04:56 +01:00
Pirmin Vogel	8b4e337118	Decoder: fix drive conflict, set `jump_set_o` for JALR This commit solves problems reported by @jeanlouismaquet in #130.	2019-07-10 15:24:21 +01:00
Pirmin Vogel	5093e0a0d2	Explicitly set `alu_op_x_mux_sel` in decoder To make the regfile address check in RV32E not triggering false alarms when accessing reg x16 - x31, the decoder should only set the muxes for the regfile output when the instruction actually requires the regfile output. It is cleaner to explicitly set the `alu_op_x_mux_sel` signals to `OP_X_REG_X` instead of relying on the default assignment if they are really needed.	2019-07-09 17:56:58 +01:00
Pirmin Vogel	9ed49b9a17	Do not select regfile in decoder if not required Otherwise, the regfile address check in RV32E can trigger false alarms for some instructions and arguments (e.g. immediates).	2019-07-09 17:56:58 +01:00
Pirmin Vogel	a3bfcd0f04	Decoder: enable checking of register file addresses for RV32E	2019-07-09 17:56:58 +01:00
Pirmin Vogel	2478b5f978	Check `rs1` & `rd` for non-CSR system instructions For `ecall`, `ebreak`, `mret`, `dret`, `wfi` these two fields must be 0. If they are not zero, the decoder generates an illegal instruction exception.	2019-07-09 17:08:21 +01:00
Pirmin Vogel	1174603e76	Rework ID stage and decoder This commit moves logic directly related to the decoder from the ID stage into the decoder. This logic includes: - Generation of immediates and decoder-based mux selectors - Generation of register file addresses - CSR operand check and manipulation depending on value in `rs1` - Register file address check for RV32E (still disabled) The muxes themselves stay in the ID stage as their control signals also depend also on other, non-decoder-based signals (LSU, EX, WB FSM).	2019-07-09 09:46:48 +01:00
Pirmin Vogel	7924431144	Rework register file write data mux	2019-07-09 09:46:48 +01:00
Pirmin Vogel	9e46bc50d5	Move `jump_set` generation from WB FSM into decoder	2019-07-09 09:46:48 +01:00
Pirmin Vogel	24ab4ae65c	Remove decoder MUX signals for jump and branch These signals do not need to be generated by the WB FSM inside the ID stage and be fed back into the decoder. They simply depend on whether the instruction is new (we execute for the first cycle) or not.	2019-07-09 09:46:48 +01:00
Pirmin Vogel	3eb147dbc2	Prevent illegal instructions from propagating out of decoder This commit makes sure that if any instruction is detected as being illegal inside the decoder, the decoder does not set the control signals to let the illegal instruction affect the register file, LSU, EX, WB, CSRs. Previously, this was only the case for some but but not all instructions. Note that this is not sufficient to prevent instructions detected as illegal elsewhere from affecting the processor state. For example, when using RV32E, an instruction can be detected to use unavailable registers outside the decoder in the ID stage. But it is cleaner to handle all illegal instructions detected in the decoder similarly.	2019-07-09 09:46:48 +01:00
Pirmin Vogel	d973618ce8	Move logic to ignore decoder output out of decoder into ID stage The decoder shall decode the instruction only. The handling of stalls is not related to instruction decoding.	2019-07-09 09:46:48 +01:00

1 2

69 commits