github-mirrors/ibex - Blizzard Finnegan's Codeforge

mirror of https://github.com/lowRISC/ibex.git synced 2025-04-22 12:57:13 -04:00

Author	SHA1	Message	Date
Hailin	129af7a53b	[rtl] Add memory and memory result interfaces This commit adds memory interface and memory result interface of the RISC-V Extension Interface.	2023-03-31 15:29:50 +02:00
Pirmin Vogel	92a95cac32	[bitmanip] Add new configuration option for OpenTitan Earl Grey RV32BOTEarlGrey selects the Zba, Zbb, Zbc, Zbs sub-extensions from v.1.0.0 of the bitmanip spec and the Zbf, Zbp, Zbr, Zbt sub-extensions from draft v.0.93. Zbe (bcompress/bdecompress) is supported by RV32BFull only. Signed-off-by: Pirmin Vogel <vogelpi@lowrisc.org>	2021-12-16 14:18:00 +01:00
Pirmin Vogel	c78acac8cc	[rtl, bitmanip] Add xperm.[nbh] instruction (Zbp, draft v.0.93) Signed-off-by: Pirmin Vogel <vogelpi@lowrisc.org>	2021-12-06 11:14:49 +01:00
Pirmin Vogel	16d6f5ea2b	[rtl, bitmanip] Align Zbb implementation with draft v.0.93 and v.1.0.0 This invovles the following changes: - Rename pcnt to cpop - Switch encoding of max and minu - Remove rev from Balanced version, only available in Full version via grev (Zbp) - Include sext.b/h (previously in Zb_tmp) - Remove slo[i] and sro[i] from Balanced version, only available in Full version (Zbp) Signed-off-by: Pirmin Vogel <vogelpi@lowrisc.org>	2021-12-03 22:43:05 +01:00
Pirmin Vogel	e765b4dfec	[rtl, bitmanip] Align Zbs implementation with draft v.0.93 and v.1.0.0 This only involves dropping the `s` from the instruction names, i.e., sbext becomes bext etc. Signed-off-by: Pirmin Vogel <vogelpi@lowrisc.org>	2021-12-03 22:43:05 +01:00
Pirmin Vogel	71b43a83e2	[rtl, bitmanip] Rename bext/bdep to bcompress/bdecompress This change is related to the bitmanip draft version 0.94. It's needed as in draft version 0.93 as well as in version 1.00 sbext from Zbs changes to bext, leading to two completely different instructions having the same name. Signed-off-by: Pirmin Vogel <vogelpi@lowrisc.org>	2021-12-03 22:43:05 +01:00
Michael Munday	c35472abb9	[bitmanip][zba] Add support for Zba (address calculation) extension Add support for the Zba extension added in v0.93 of the bit manipulation specification (unchanged in v1.0.0). The new instructions added are: - sh1add: rd = (rs1 << 1) + rs2 - sh2add: rd = (rs1 << 2) + rs2 - sh3add: rd = (rs1 << 3) + rs2 The instructions are single cycle and have been implemented using the adder in the ALU. Signed-off-by: Michael Munday <mike.munday@lowrisc.org>	2021-11-01 09:58:01 +00:00
Philipp Wagner	a25790abf9	[style] Indent module header with two spaces Both the parameter and the port list in a module header should be indented with two spaces, according to our style guide.	2021-08-31 15:30:28 +02:00
Philipp Wagner	be27bc8bcf	[style] Fix whitespace issues around operators Our style guide typically require a single whitespace around operators.	2021-08-31 15:30:28 +02:00
Greg Chadwick	3e7720e403	[rtl] Lint fixes for Ascent lint issues	2021-03-01 09:52:57 +00:00
Tom Roberts	d977e146ba	[rtl] Various small lint fixes - Move various unused signal fixes from the waiver file to the rtl, so that all tools can pick them up. - Fix some oversize line issues. - Fix some signed / unsigned casting issues. - Remove some extraneous semicolons. Signed-off-by: Tom Roberts <tomroberts@lowrisc.org>	2020-10-27 11:29:35 +00:00
Philipp Wagner	67e7417749	Fix Verible lint issues Fix all remaining issues reported by Verible lint. It turns out that #965 undid some of the fixes in `ibex_alu.sv` that were done in #980 around the `SHUFFLE_`/`FLIP_` signals.	2020-07-03 12:20:32 +01:00
ganoam	1aa4d5a32b	[bitmanip] Optimizations and Parametrization This commit contains some final optimizations regarding the bit manipulation extension as well as the parametrization into a balanced version and a full performance version. Balanced Version: * Supports ZBB, ZBS, ZBF and ZBT extensions * Dual cycle instructions: ror[i], rol, cmov, cmix fsl, fsr[i] * Everything else completes in a single cycle. Full Version: * Supports all 32b sub extensions. * Dual cycle instructions: ror[i], rol, cmov, cmix fsl, fsr[i], crc32[c], bext, bdep * Everything else completes in a single cycle. Notable Changes: * bext/bdep are now multi-cycle: Sharing additional register with multiplier module * grev/gorc instructions are implemented in separate structures rather than sharing the shifter or butterfly network. * Speed up decision on using rs1 or rs3 for alu_operand_a by introducing single-bit register, to identify ternary instructions in their first cycle. * Introduce enumerated parameter to chose bit manipulation implementation Signed-off-by: ganoam <gnoam@live.com>	2020-06-26 14:43:24 +02:00
Michael Schaffner	ae547c8d30	[top_pkg] Fix style lint warnings Signed-off-by: Michael Schaffner <msf@google.com>	2020-06-22 20:52:15 +01:00
ganoam	66687e927c	[bitmanip] Add ZBR instruction group This commit implements the Bit Manipulation Extension ZBR instruction group: crc32[c].[bhw]. CRC-32 (CRC-32/ISO-HDLC) and CRC-32C (CRC-32/ISCSI) are directly implemented. The CRC operation solves the following equation using binary polynomial arithmetic: rev(rd)(x) = rev(rs1)(x) * xn mod {1, P}(x), where {1,P}(x) denotes the crc polynomial. Using barret reduction one can write this as rd = (rs1 >> n) ^ rev(rev( (rs1 << (32-1)) cx rev(mu)) cx P) ^-- cycle 0--------------------^ ^-- cycle 1 ------------------------------------------^ Where cx denotes carry-less multiplication and mu = polydiv(x64, {1,P}), omitting the MSB (bit 32). The implementation increases area consumption by ~0.6kGE for synthesis with relaxed timing constraints. With tight timing constraints that is ~1.6kGE. There is no significant impact on frequency. Signed-off-by: ganoam <gnoam@live.com>	2020-05-22 17:21:03 +02:00
ganoam	f173e2baba	[bitmanip] Add ZBC instruction group This commit implements the Bit Manipulation Extension ZBC instruction group: clmul[rh] (carry-less multiply [reverse][high]) Carry-less multiplication can be understood as multiplication based on the addition interpreted as the bit-wise xor operation. Example: 1101 X 1011 = 1111111: 1011 X 1101 ----------- 1101 xor 1101 --------- 10111 xor 0000 ---------- 010111 xor 1101 ----------- 1111111 Architectural details: A 32 x 32-bit array [ operand_b[i] ? (operand_a << i) : '0 for i in 0 ... 31 ] is generated. The entries of the array are pairwise 'xor-ed' together in a 5-stage binary tree. The area increase when synthesized with relaxed timing constraints is 1.6-1.7kGE. Timing figures are improve by 0.1 ns for the 3-stage configuration and worsen by 0.04ns for the 2-stage implementation. This suggests fluctuations due to the heuristic nature of the synthesis tools. Signed-off-by: ganoam <gnoam@live.com>	2020-05-19 10:38:38 +02:00
ganoam	9bd3350bb3	[bitmanip] Add sext.b/h instructions This commit implements the Bit Manipulation Extension sign-extend instructions: sext.b (sign-extend byte) and sext.h (sign-extend half word). The implementation is basically a one-liner, duplicating the msb of the byte / half-word into the msb of the output register. Signed-off-by: ganoam <gnoam@live.com>	2020-05-14 22:03:45 +02:00
ganoam	fac404a6f3	[bitmanip] Add ZBF instruction group This commit implements the Bit Manipulation Extension ZBF instruction group, which consists only of the one instruction bfp (bit-field place). This instruction places a field of length len < 16 from rs2 in rs1 at offset off. Architectureal details: The implementation works exactly the same as proposed by Claire Wolf in her reference implementation. 1. bfp_mask = slo(o, len) 2. bfp_result = (rs1 & ~(bfp_mask << off)) \| (rs2 & bfp_mask) << off ^------ shifter-^ The existing shifter structure is shared for the indicated operation. Impact on area: * When synthesizing without the B-extension, the 2 stage design seems to move the timing bottleneck, leading to optimizations which result in an area increase by 1 kGE, when synthesized with tight timing constraints. For the 3 stage configuration there is no change. When synthesized with relaxed timing constraints there is no significant change in either configuration. * With the B-extension enabled, the area increase for tight timing constraints is 1.1-1.2 kGE. For relaxed timing constraints that is ~0.4kGE Impact on timing: No significant impact. Signed-off-by: ganoam <gnoam@live.com>	2020-05-14 21:34:49 +02:00
ganoam	0afd000a09	[bitmanip] Add ZBE Instruction Group This commit implements the Bit Manipulation Extension ZBE instruction group: bext (bit extract) and bdep (bit deposit). Architectural details: * bext/bdep: A new butterfly and inverse butterfly network is implemented. The generation of its controlbits depend on a parallel prefix bitcount of the deposit / extract mask. * bitcounter: The path for bext / bdep instructions traverses the bit counter and the butterfly network, resulting in both a larger delay and area. To mitigate the bitcounter has been changed from a serial bit counter to a radix-2 tree structure. * grev/gorc: Zbp instructions general reverse and general or-combine have as of yet shared the shifters reversal structure. It has proven benefitial to area and timing to reuse the novel butterfly network instead The butterfly network itself consumes ~3.5kGE and ~1.1kGE for synthesis with tight and relaxed timing constraints respectively. Including the optimizations of the bitcounter and grev/gorc, the overall change in area consumption is +4.6kGE (+1.2kGE) and +3.3kGE (+1.1kGE) for synthesis with tight (relaxed) timing constraints for 2- and 3-stage configurations respectively. For tight timing constraints that is a growth by around ~10%, for relaxed ~5%. The impact on the maximum frequency is negligable. Signed-off-by: ganoam <gnoam@live.com>	2020-05-14 16:43:19 +02:00
ganoam	a68923a404	[bitmanip] Add ZBP Instruction Group This commit implements the Bit Manipulation Extension ZBP instruction group: grev[i] (generalized reverse), gorc[i] (generalized or-combine) and [un]shfl[i] (generalized shuffle) and all of their pseudo-instructions. Architectural details: * grev / gorc: The shifter structure features only a right shift structure. In order to perform a left shift therefore the operand needs to be reversed, shifted and reversed again. The architecture of the back-reversal is implemented in stages which are activated using the general reverse / orcombine operand, or a signal marking left-shifts. * shfl / unshfl: Also known as zip / unzip or interlace / uninterlace operation. These instructions are implemented in their own structure using a permutation networ of 6 stages. 4 stages thereof implement the shuffle permutations. the first and last stage is the flip stage, which effectively reverse s the order of the inner stages, for unshuffle operations. Signed-off-by: ganoam <gnoam@live.com>	2020-04-29 11:10:44 +02:00
ganoam	133fef2c2f	[bitmanip] Add ZBS Instruction Group This commit implements the Bit Manipulation Extension SBS instruction group: sbset[i], sbclr[i], sbinv[i] and sbext[i]. These instructions set, clear, invert or extract bit rs1[rs2] or rs1[imm] for reg-reg and reg-imm instructions respectively. Archtectural details: * A multiplexer is added to the shifter structure in order to chose between 32'h1, used for the single-bit instructions as summarized below, and regular operand_b input. * Dedicated bitwise-logic blocks are introduced for multicycle shifts and cmix instructions (fsr, fsl, ror, rol), single-bit instructions (sbset, sbclr, sbinv, sbext), and stanard-ALU and zbb instructions (or, and xor, orn, andn, xnor). Instruction details: All of the zbs instructions rely on sharing the existing shifter structure. The instructions are carried out in one cycle. * sbset, sbclr, sbinv: shift_result = 32'h1 << rs2[4:0]; singlebit_result = rs1 [\|, ^ , &~] shift_result; * sbext: shift_result = rs1 >> rs2[4:0]; singlebit_result = {31'0,shift_resutl[0]}; Signed-off-by: ganoam <gnoam@live.com>	2020-04-24 08:32:30 +02:00
ganoam	4cb77b8121	[bitmanip] Add ZBT Instruction Group This commits implements the Bit Manipulateion Extension ZBT instruction group: cmix, cmov, fsr[i] and fsl. Those are instructions depend on three ALU operands. Completeion of these instructions takes 2 clock cycles. Additionally, the rotation shifts rol and ror are made multicycle instructions. All multicycle instructions take exactly two cycles to complete. Architectural additions: * Multicycle Stage Register in ID stage. multicycle_op_stage_reg * Decoder generates alu_multicycle signal, to stall pipeline * For all ternary instructions: 1. cycle: connect alu operands a and b to rs1 and rs2 respectively 2. cycle: connect operands a and be to rs3 and rs2 respectively * Reduce the physical size of the shifter from 64 bit to 63 bit: 32-bit operand + 1 bit for arithmetic / one-shift * Make rotation shifts multicycle instructions. Instruction Details: * cmov: 1. store operand a (rs1) in stage reg. 2. return stage reg output (rs2) or rs3. if rs2 != 0 the output (rs1) is already known in the first cycle. -> variable latency implementation is possible. * cmix: 1. store rs1 & rs2 in stage reg 2. return stage_reg_q \| (rs2 & ~rs3) reusing bwlogic from zbb * rol/ror: (here: ror) shift_amt = rs2 & 31; shift_amt_compl = (32 - shift_amt) & 31 1. store (rs1 >> shift_amt) in stage reg 2. return (rs1 << shift_amt_compl) \| stage_reg_q * fsl/fsr: For funnel shifts, the order of applying the shift amount or its complement is determined by bit [5] of shift_amt. Pseudocode for fsr: shift_amt = rs2 & 63 shift_amt_compl = (32 - shift_amt[4:0]) 1. if (shift_amt >= 33): store (rs1 >> shift_amt_compl[4:0]) in stage reg else if (shift_amt <0 && shift_amt <= 31): store (rs1 << shift_amt[4:0]) in stage reg else if (shift_amt == 32 \|\| shift_amt == 0): store rs1 in stage reg 2. if (shift_amt >= 33): return stage_reg_q \| (rs3 << shift_amt[4:0]) else if (shift_amt <0 && shift_amt <= 31): return stage_reg_q \| (rs3 >> shift_amt_compl[4:0]) else if (shift_amt == 32): return rs3 else if (shift_amt == 0): return rs1 Signed-off-by: ganoam <gnoam@live.com>	2020-04-16 14:03:35 +02:00
ganoam	8a26111f40	[bitmanip] Add ZBB Instruction Group This commit implements the Bit Manipulation Extension ZBB instruction group: clz, ctz, pcnt, slo, sro, rol, ror, rev, rev8, orcb, pack packu, packh, min, max, andn, orn, and xnor. * Bit counting instructions clz, ctz and pcnt can be implemented to share much of the architecture: clz: Count Leading Zeros. Counts the number of 0 bits at the MSB end of the argument. ctz: Count Trailing Zeros. Counts the number of 0 bits at the LSB end of the argument. pcnt: Counts the number of set bits of the argument. The implementation uses: - 32 one bit adders, counting the set bits of a signal bitcnt_bits, starting from the LSB end. - For pcnt the argument is fed directly into bitcnt_bits. - For clz, the operand is reversed such that leading zeros are located at the LSB end of bitcnt_bits. - For ctz and clz: counter enable signal for 1-bit counter i is high, if the previous enable signal, and its corresponting bitcnt_bit was high. * Instructions sll[i], srl[i],slo[i], sro[i], rol, ror[i], rev, rev8 and orc.b are summarized as shifting instructions and related: The following instructions are slight variations of the existing base spec's sll, srl and sra instructions. - slo[i] and sro[i]: shift left/right ones: similar to shift-logical operations from base spec, but shifting in ones instead of zeros. - rol and ror[i]: rotate left/right ones: circular shift operations. shifting in values from the oposite end of the operand instead of zeros. Those instructions can be implemented, sharing the base spec's shifting structure. In order to support rotate operations, a 64-bit shifting structure is needed. In the existing ALU, hardware is described only for right shifts. For left shifts the operand is initially reversed, right shifted and the result is reversed back. This gives rise to an additional resource sharing oportunity for some more zbb operations: - rev: bitwise reversal. - rev8: byte-order swap. - orc.b: byte-wise reverse and or-combine. * Instructions min, max: For the B-extension's min/max instructions, we can share the existing comparison operations. The result is obtained by activating the comparison structure accordingly and multiplexing the operands using the comparison result. * Logic-with-negate instructions andn, orn, xnor: For the B-extension's logic-with-negate instructions we can share the structures of the base spec's logic structures already present for 'xnor', 'or' and 'and' instructions as well as the conditionally negated b operand generated for subtraction operations. * Instructions pack, packu, packh: For the pack, packh and packu instructions I don't see any opportunities for resource sharing. However, the architecture is quite simple. - pack: pack the lower halves of rs1 and rs2 into rd, with rs1 in the lower half and rs2 in the upper half. - packu: pack the upper halves of rs1 and rs2 into rd, with rs1 in the lower half and rs2 in the upper half. - packh: pack the LSB bytes of rs1 and rs2 into rd, with rs1 in the lower half and rs2 in the upper half. Signed-off-by: ganoam <gnoam@live.com>	2020-03-27 17:13:26 +01:00
Greg Chadwick	b52aacf91b	[rtl] Add multdiv_sel signal to decode multdiv_sel signals the mult/div operand should be selected for the ALU inputs. Previously the mult_en/div_en signals were used but these factor in whether the instruction is actually happening which is not relevant for the mux select. The dedicated select signal gives better timing.	2020-01-31 09:32:20 +00:00
Pirmin Vogel	275c3310fa	[rtl] Remove unused set-less-or-equal-than ALU ops These operations (ALU_SLET, ALU_SLETU) are remnants of of the original OpenRISC design. RISC-V just has set-less-than (ALU_SLT, ALU_SLTU). This resolves lowRISC/ibex#432 reported by @udinator.	2019-11-01 11:51:25 +00:00
Pirmin Vogel	0331ed61b1	[rtl/alu] Remove unused comparison operations These operations (ALU_GT, ALU_GTU, ALU_LE, ALU_LEU) are remnants of of the original OpenRISC design. RISC-V does not have these instructions and instead implements the operations by reversing operands for ALU_LT, ALU_LTU, ALU_GE, ALU_GEU. This resolves lowRISC/ibex#420 reported by @udinator.	2019-10-25 13:58:59 +01:00
Philipp Wagner	7eee24c094	Mention CREDITS.md in license header	2019-08-27 18:10:02 +01:00
Philipp Wagner	14b8f88957	Replace author credits in files with CREDITS.md We currently have a documentation block at the beginning of each file, containing author credits and module-level documentation. The module-level documentation is retained for historic reasons and duplicated with the newer comments below it. For the authors, maintaining author credits in the file is error-prone, as this information gets outdated very soon. A more reliable way to see who modified a file is to use the history information in git. Additionally, we now have the CREDITS.md file, which lists all contributors, even the ones which don't appear in the git history (e.g. because the code was copied and commited by someone else).	2019-08-27 18:10:02 +01:00
Philipp Wagner	428d057c4a	Rename ibex_[tracer_]define to ibex_[tracer_]pkg This file doesn't contain defines any more, but a normal SV package. The diff is best viewed without whitespace changes, as the reindents cause a lof of diff noise. Fixes lowrisc/ibex#173	2019-07-19 11:34:40 +01:00
Pirmin Vogel	db75d30547	Replace non-unique `case` constructs by `unique case` Our coding guidelines require the usage of `unique case` constructs with proper `default` cases. This commit implements this change and also makes sure that potential `'X` are propagated.	2019-05-21 15:22:21 +01:00
Pirmin Vogel	e87d0cc3e9	Add comment to clarify that the MSB in the shifter can safely be ignored	2019-05-15 16:07:25 +01:00
Pirmin Vogel	29107c5283	Use unsigned type for part select to avoid linting errors	2019-05-15 16:07:25 +01:00
Pirmin Vogel	f84ed3ecd3	Use lower_snake_case for code block labels	2019-05-15 10:42:48 +01:00
Pirmin Vogel	708f9287e4	Declare result of signed arithmetic as signed to avoid linting error	2019-05-10 11:11:19 +01:00
Pirmin Vogel	6ac7f844a1	Replace hard coded parameters and signal widths by enum types	2019-05-07 13:30:07 +01:00
Pirmin Vogel	a2184f9ec7	Align ALU comparison op naming to instruction names By default, ops are signed in RISC-V. As opposed to the U suffix for designating an op as being unsigned, there is no suffix for designating ops as being signed.	2019-05-07 13:30:07 +01:00
Philipp Wagner	f9ad280d0c	Cleanup includes and defines - Move ibex_tracer_defines.sv and ibex_defines.sv out of the 'include' directory, since these files are not actually included. - Remove ibex_config.sv, it's mostly unused code. The remaining defines, SYNTHESIS, ASIC_SYNTHESIS, TRACE_EXECUTION, and CHECK_MISALIGNED should be set through command-line flags to the simulation/synthesis tools. Initial version by Nils Gräf.	2019-05-03 17:30:29 +01:00
Alex Bradbury	7e81f6f4eb	Standardise "subheadings" in code comments The code base made extensive use of ASCII art headings/subheadings in comments to delineate code. Switch to a more space efficient and easier to edit format: ///////// // Foo // /////////	2019-04-26 15:09:00 +01:00
Scott Johnson	6b0475744d	Code cleanup Fix errors and warnings reported by lint tools, and clean up the code according to our coding style. Move all imports into the module.	2019-04-26 15:09:00 +01:00
Philipp Wagner	8813f57624	Add Doxygen-style module descriptions We leave the existing longer headers in place for author information.	2019-04-26 15:08:30 +01:00
Alex Bradbury	27e68bd76e	Convert from Solderpad to standard Apache 2.0 license This change has been informed by advice from the lowRISC legal committee. The Solderpad 0.51 license states "the Licensor permits any Work licensed under this License, at the option of the Licensee, to be treated as licensed under the Apache License Version 2.0". We use this freedom to convert license markings to Apache 2.0. This commit ensures that we retain all authorship and copyright attribution information.	2019-04-26 15:05:17 +01:00
Philipp Wagner	e9e5a719bc	Move RTL code into rtl/ directory This gives us a bit of space in the source tree for documentation, verification, utilities, and much more.	2019-04-26 15:05:17 +01:00

42 commits