github-mirrors/ibex - Blizzard Finnegan's Codeforge

mirror of https://github.com/lowRISC/ibex.git synced 2025-04-24 13:57:19 -04:00

Author	SHA1	Message	Date
Philipp Wagner	a25790abf9	[style] Indent module header with two spaces Both the parameter and the port list in a module header should be indented with two spaces, according to our style guide.	2021-08-31 15:30:28 +02:00
Philipp Wagner	b5011ecec6	[style] Format module instantiations in tabular format The style guide mandates tabular format in port expressions in module instantiations (https://github.com/lowRISC/style-guides/blob/master/VerilogCodingStyle.md#module-instantiation). The style guide also mandates a two-space indentation for ports and parameters in module instantiations. Apply the formatting produced by verible-format to match our style guide.	2021-08-31 15:30:28 +02:00
Pirmin Vogel	2ef5e5e3f2	Add a single `RV32M` enum parameter to select multiplier implementation This commit replaces the previous combination of `RV32M` bit parameter used to en/disable the M extension and the `MultiplierImplementation` used to select the multiplier implementation by a single enum parameter. Signed-off-by: Pirmin Vogel <vogelpi@lowrisc.org>	2020-08-20 11:50:08 +02:00
ganoam	1aa4d5a32b	[bitmanip] Optimizations and Parametrization This commit contains some final optimizations regarding the bit manipulation extension as well as the parametrization into a balanced version and a full performance version. Balanced Version: * Supports ZBB, ZBS, ZBF and ZBT extensions * Dual cycle instructions: ror[i], rol, cmov, cmix, fsl, fsr[i] * Everything else completes in a single cycle. Full Version: * Supports all 32b sub extensions. * Dual cycle instructions: ror[i], rol, cmov, cmix, fsl, fsr[i], crc32[c], bext, bdep * Everything else completes in a single cycle. Notable Changes: * bext/bdep are now multi-cycle: Sharing additional register with multiplier module * grev/gorc instructions are implemented in separate structures rather than sharing the shifter or butterfly network. * Speed up decision on using rs1 or rs3 for alu_operand_a by introducing single-bit register, to identify ternary instructions in their first cycle. * Introduce enumerated parameter to choose bit manipulation implementation Signed-off-by: ganoam <gnoam@live.com>	2020-06-26 14:43:24 +02:00
Tom Roberts	a5ae9f4995	[rtl] Add data-independent timing to multdiv_fast - No early return on divide by zero Signed-off-by: Tom Roberts <tomroberts@lowrisc.org>	2020-05-15 10:19:55 +01:00
Tom Roberts	d19189ba43	[rtl] data-independent execution for multdiv_slow - Remove all early exits from multiply and divide operations when in fixed time execution mode. Signed-off-by: Tom Roberts <tomroberts@lowrisc.org>	2020-05-15 10:19:55 +01:00
Tom Roberts	0ba0ad5a43	[rtl] multdiv_slow general tidy-up - Correct some typos and fix various lint / style guide issues - No functional changes Signed-off-by: Tom Roberts <tomroberts@lowrisc.org>	2020-05-15 10:19:55 +01:00
Pirmin Vogel	3922b2582f	[rtl] Rework generation and use of mult/div_sel/en Signed-off-by: Pirmin Vogel <vogelpi@lowrisc.org>	2020-05-01 17:29:59 +02:00
Pirmin Vogel	511c59db18	[rtl] Switch multdiv_en to multdiv_sel where possible Signed-off-by: Pirmin Vogel <vogelpi@lowrisc.org>	2020-05-01 17:29:59 +02:00
ganoam	999735568e	Use Shared Imd Val Reg with Multdiv Slow This commit adds support for the shared immediate value register in the id_stage for the slow implementation of the multdiv module. Register accum_window_q is now stored in the intermediate value register. Signed-off-by: ganoam <gnoam@live.com>	2020-04-23 12:14:45 +02:00
ganoam	4cb77b8121	[bitmanip] Add ZBT Instruction Group This commit implements the Bit Manipulation Extension ZBT instruction group: cmix, cmov, fsr[i] and fsl. These instructions depend on three ALU operands. Completion of these instructions takes 2 clock cycles. Additionally, the rotation shifts rol and ror are made multicycle instructions. All multicycle instructions take exactly two cycles to complete. Architectural additions: * Multicycle Stage Register in ID stage. multicycle_op_stage_reg * Decoder generates alu_multicycle signal, to stall pipeline * For all ternary instructions: 1. cycle: connect alu operands a and b to rs1 and rs2 respectively 2. cycle: connect operands a and b to rs3 and rs2 respectively * Reduce the physical size of the shifter from 64 bit to 63 bit: 32-bit operand + 1 bit for arithmetic / one-shift * Make rotation shifts multicycle instructions. Instruction Details: * cmov: 1. store operand a (rs1) in stage reg. 2. return stage reg output (rs2) or rs3. if rs2 != 0 the output (rs1) is already known in the first cycle. -> variable latency implementation is possible. * cmix: 1. store rs1 & rs2 in stage reg 2. return stage_reg_q | (rs2 & ~rs3) reusing bwlogic from zbb * rol/ror: (here: ror) shift_amt = rs2 & 31; shift_amt_compl = (32 - shift_amt) & 31 1. store (rs1 >> shift_amt) in stage reg 2. return (rs1 << shift_amt_compl) | stage_reg_q * fsl/fsr: For funnel shifts, the order of applying the shift amount or its complement is determined by bit [5] of shift_amt. Pseudocode for fsr: shift_amt = rs2 & 63 shift_amt_compl = (32 - shift_amt[4:0]) 1. if (shift_amt >= 33): store (rs1 >> shift_amt_compl[4:0]) in stage reg else if (shift_amt > 0 && shift_amt <= 31): store (rs1 << shift_amt[4:0]) in stage reg else if (shift_amt == 32 || shift_amt == 0): store rs1 in stage reg 2. if (shift_amt >= 33): return stage_reg_q | (rs3 << shift_amt[4:0]) else if (shift_amt > 0 && shift_amt <= 31): return stage_reg_q | (rs3 >> shift_amt_compl[4:0]) else if (shift_amt == 32): return rs3 else if (shift_amt == 0): return rs1 Signed-off-by: ganoam <gnoam@live.com>	2020-04-16 14:03:35 +02:00
ganoam	8a26111f40	[bitmanip] Add ZBB Instruction Group This commit implements the Bit Manipulation Extension ZBB instruction group: clz, ctz, pcnt, slo, sro, rol, ror, rev, rev8, orcb, pack, packu, packh, min, max, andn, orn, and xnor. * Bit counting instructions clz, ctz and pcnt can be implemented to share much of the architecture: clz: Count Leading Zeros. Counts the number of 0 bits at the MSB end of the argument. ctz: Count Trailing Zeros. Counts the number of 0 bits at the LSB end of the argument. pcnt: Counts the number of set bits of the argument. The implementation uses: - 32 one bit adders, counting the set bits of a signal bitcnt_bits, starting from the LSB end. - For pcnt the argument is fed directly into bitcnt_bits. - For clz, the operand is reversed such that leading zeros are located at the LSB end of bitcnt_bits. - For ctz and clz: counter enable signal for 1-bit counter i is high, if the previous enable signal, and its corresponding bitcnt_bit was high. * Instructions sll[i], srl[i], slo[i], sro[i], rol, ror[i], rev, rev8 and orc.b are summarized as shifting instructions and related: The following instructions are slight variations of the existing base spec's sll, srl and sra instructions. - slo[i] and sro[i]: shift left/right ones: similar to shift-logical operations from base spec, but shifting in ones instead of zeros. - rol and ror[i]: rotate left/right ones: circular shift operations. shifting in values from the opposite end of the operand instead of zeros. Those instructions can be implemented, sharing the base spec's shifting structure. In order to support rotate operations, a 64-bit shifting structure is needed. In the existing ALU, hardware is described only for right shifts. For left shifts the operand is initially reversed, right shifted and the result is reversed back. This gives rise to an additional resource sharing opportunity for some more zbb operations: - rev: bitwise reversal. - rev8: byte-order swap. - orc.b: byte-wise reverse and or-combine. * Instructions min, max: For the B-extension's min/max instructions, we can share the existing comparison operations. The result is obtained by activating the comparison structure accordingly and multiplexing the operands using the comparison result. * Logic-with-negate instructions andn, orn, xnor: For the B-extension's logic-with-negate instructions we can share the structures of the base spec's logic structures already present for 'xnor', 'or' and 'and' instructions as well as the conditionally negated b operand generated for subtraction operations. * Instructions pack, packu, packh: For the pack, packh and packu instructions I don't see any opportunities for resource sharing. However, the architecture is quite simple. - pack: pack the lower halves of rs1 and rs2 into rd, with rs1 in the lower half and rs2 in the upper half. - packu: pack the upper halves of rs1 and rs2 into rd, with rs1 in the lower half and rs2 in the upper half. - packh: pack the LSB bytes of rs1 and rs2 into rd, with rs1 in the lower half and rs2 in the upper half. Signed-off-by: ganoam <gnoam@live.com>	2020-03-27 17:13:26 +01:00
Tom Roberts	624ef41462	[rtl] Extend BT ALU to be used for all jumps - Create separate operand muxes for the branch/jump target ALU - Complete jump instructions in one cycle when BT ALU configured Signed-off-by: Tom Roberts <tomroberts@lowrisc.org>	2020-03-25 15:25:22 +00:00
Greg Chadwick	89e5fc11ed	[RTL] Add configurable third pipeline stage The third pipeline stage is a new writeback stage. Ibex can now be configured as the original two stage design or the new three stage design using the `WritebackStage` parameter in ibex_core. This defaults to 0 (giving the original two stage design). The three stage design is EXPERIMENTAL. In the three stage design all register write back occurs in the third, final stage. This allows a cycle for responses to loads and stores so when the memory system can respond in a single cycle there will be no stall. This offers significant performance benefits. Documentation of the three stage design is still to be written so existing documentation applies to the two stage design only as various aspects of Ibex behaviour will change in the three stage design. Signed-off-by: Greg Chadwick <gac@lowrisc.org>	2020-03-06 15:29:14 +00:00
ganoam	48c4b6a5ea	[rtl] Add Single Cycle Multiplier targeting FPGA * Integrate option to implement a multiplier using 3 parallel 17 bit multipliers in order to compute MUL instructions in 1 cycle and MULH in 2 cycles. * Add parameter SingleCycleMultiply to select single cycle multiplication. The single cycle multiplication capability is intended for FPGA targets. Using three parallel multiplication units improves performance of multiplication operations at the cost of DSP primitives. For ASIC targets, the area consumed by the multiplication structure will grow approximately 3-4x. The functionality is selected within the module using the parameter `SingleCycleMultiply`. From the top level it can be chosen by setting the parameter `MultiplierImplementation` to 'single_cc'. Signed-off-by: ganoam <gnoam@live.com>	2020-02-11 16:09:41 +01:00
Greg Chadwick	3fcede8a13	[rtl] Comment and naming tweaks	2020-01-31 09:32:20 +00:00
Greg Chadwick	b52aacf91b	[rtl] Add multdiv_sel signal to decode multdiv_sel signals the mult/div operand should be selected for the ALU inputs. Previously the mult_en/div_en signals were used but these factor in whether the instruction is actually happening which is not relevant for the mux select. The dedicated select signal gives better timing.	2020-01-31 09:32:20 +00:00
Greg Chadwick	639964514c	[RTL] Added separate ALU for branch target On branches now compute target same cycle as the condition. This removes a stall cycle from all taken conditional branches.	2020-01-31 09:32:20 +00:00
Pascal Cotret	e5cf0c0fcf	Error synthesis in Vivado	2019-10-28 20:36:37 +00:00
Marek Pikuła	294849bb18	[RTL] Add MultiplierImplementation parameter in top level	2019-10-24 14:33:24 +01:00
Philipp Wagner	7eee24c094	Mention CREDITS.md in license header	2019-08-27 18:10:02 +01:00
Philipp Wagner	14b8f88957	Replace author credits in files with CREDITS.md We currently have a documentation block at the beginning of each file, containing author credits and module-level documentation. The module-level documentation is retained for historic reasons and duplicated with the newer comments below it. For the authors, maintaining author credits in the file is error-prone, as this information gets outdated very soon. A more reliable way to see who modified a file is to use the history information in git. Additionally, we now have the CREDITS.md file, which lists all contributors, even the ones which don't appear in the git history (e.g. because the code was copied and committed by someone else).	2019-08-27 18:10:02 +01:00
Philipp Wagner	428d057c4a	Rename ibex_[tracer_]define to ibex_[tracer_]pkg This file doesn't contain defines any more, but a normal SV package. The diff is best viewed without whitespace changes, as the reindents cause a lot of diff noise. Fixes lowrisc/ibex#173	2019-07-19 11:34:40 +01:00
Pirmin Vogel	1082b85ff0	Rework interaction between EX block and ID stage The EX block actually signals when its output is valid, and not when it is ready to accept new input. The LSU valid signal is not needed inside the EX block and can thus be fed directly to the ID stage.	2019-07-09 09:46:48 +01:00
Pirmin Vogel	f84ed3ecd3	Use lower_snake_case for code block labels	2019-05-15 10:42:48 +01:00
Pirmin Vogel	0894cfb0db	Use UpperCamelCase for config parameters, specify type	2019-05-15 10:42:48 +01:00
Pirmin Vogel	fde6e505df	Make sure all inputs/outputs use _i/_o suffixes	2019-05-15 10:42:48 +01:00
Pirmin Vogel	6ac7f844a1	Replace hard coded parameters and signal widths by enum types	2019-05-07 13:30:07 +01:00
Philipp Wagner	f9ad280d0c	Cleanup includes and defines - Move ibex_tracer_defines.sv and ibex_defines.sv out of the 'include' directory, since these files are not actually included. - Remove ibex_config.sv, it's mostly unused code. The remaining defines, SYNTHESIS, ASIC_SYNTHESIS, TRACE_EXECUTION, and CHECK_MISALIGNED should be set through command-line flags to the simulation/synthesis tools. Initial version by Nils Gräf.	2019-05-03 17:30:29 +01:00
Alex Bradbury	7e81f6f4eb	Standardise "subheadings" in code comments The code base made extensive use of ASCII art headings/subheadings in comments to delineate code. Switch to a more space efficient and easier to edit format: ///////// // Foo // /////////	2019-04-26 15:09:00 +01:00
Philipp Wagner	c4c4b8f1e4	Make RV32E/RV32M parameters bits These parameters are boolean, use the "bit" data type to ensure their value range.	2019-04-26 15:09:00 +01:00
Scott Johnson	6b0475744d	Code cleanup Fix errors and warnings reported by lint tools, and clean up the code according to our coding style. Move all imports into the module.	2019-04-26 15:09:00 +01:00
Philipp Wagner	8813f57624	Add Doxygen-style module descriptions We leave the existing longer headers in place for author information.	2019-04-26 15:08:30 +01:00
Alex Bradbury	27e68bd76e	Convert from Solderpad to standard Apache 2.0 license This change has been informed by advice from the lowRISC legal committee. The Solderpad 0.51 license states "the Licensor permits any Work licensed under this License, at the option of the Licensee, to be treated as licensed under the Apache License Version 2.0". We use this freedom to convert license markings to Apache 2.0. This commit ensures that we retain all authorship and copyright attribution information.	2019-04-26 15:05:17 +01:00
Philipp Wagner	e9e5a719bc	Move RTL code into rtl/ directory This gives us a bit of space in the source tree for documentation, verification, utilities, and much more.	2019-04-26 15:05:17 +01:00
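
The rol/ror entry in commit 4cb77b8121 above splits a rotate into two cycles: cycle 1 stores `rs1 >> shift_amt` in the stage register, and cycle 2 ORs it with `rs1 << shift_amt_compl`. The following is a minimal behavioural sketch of that description in Python, not the RTL itself. The function names and the 32-bit mask are illustrative assumptions; only `shift_amt`, `shift_amt_compl` and `stage_reg_q` come from the commit message.

```python
MASK32 = 0xFFFFFFFF  # assumption: model 32-bit registers by masking Python ints


def ror_two_cycle(rs1: int, rs2: int) -> int:
    """Rotate right, following the two-cycle steps from the commit message."""
    rs1 &= MASK32
    shift_amt = rs2 & 31
    shift_amt_compl = (32 - shift_amt) & 31

    # Cycle 1: store the right-shifted operand in the stage register.
    stage_reg_q = rs1 >> shift_amt

    # Cycle 2: OR the left-shifted operand with the stage register.
    return ((rs1 << shift_amt_compl) & MASK32) | stage_reg_q


def ror_reference(rs1: int, rs2: int) -> int:
    """Single-step rotate right, used only to cross-check the sketch."""
    rs1 &= MASK32
    s = rs2 & 31
    return ((rs1 >> s) | (rs1 << (32 - s))) & MASK32


if __name__ == "__main__":
    print(hex(ror_two_cycle(0x12345678, 8)))
```

For a shift amount of zero both partial results equal `rs1`, so the OR still returns `rs1`. That is why the complement is masked with 31 rather than left as 32.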

35 commits
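
Commit 48c4b6a5ea above says MUL completes in one cycle using three parallel multipliers, but does not spell out how their outputs are combined. One plausible reading, offered here as an assumption and not as the Ibex implementation, is that the low 32 bits of a 32x32 product need only three of the four 16-bit partial products, because the high-by-high term lands entirely above bit 31. The sketch below uses unsigned 16-bit halves and ignores the 17th bit, which presumably carries sign handling for MULH; the function name is hypothetical.

```python
MASK32 = 0xFFFFFFFF  # assumption: model 32-bit registers by masking Python ints


def mul_low32(a: int, b: int) -> int:
    """Low 32 bits of a*b built from three 16x16 partial products."""
    a &= MASK32
    b &= MASK32
    al, ah = a & 0xFFFF, a >> 16
    bl, bh = b & 0xFFFF, b >> 16

    # Three independent products; these could be evaluated in parallel.
    p_ll = al * bl
    p_lh = al * bh
    p_hl = ah * bl
    # ah * bh would be shifted left by 32 bits, so it never affects the result.

    return (p_ll + ((p_lh + p_hl) << 16)) & MASK32


if __name__ == "__main__":
    print(hex(mul_low32(0x12345678, 0x9ABCDEF0)))
```

The upper half of the product (MULH) does depend on the high-by-high term, which is consistent with the commit's statement that MULH needs a second cycle.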