cranelift

Commit Graph

Author	SHA1	Message	Date
Alex Crichton	d03f20e0e0	x64: Add non-SSSE3 lowerings of `iadd_pairwise` (#6561 ) This commit adds lowerings to have lowering rules for these instructions on the x64 backend when the `phadd{w,d}` instructions are not available. Additionally this implements `iadd_pairwise` for i8x16 types which while not used by wasm enables running the CLIF runtest on x64.	1 year ago
Alex Crichton	8bec98da28	Fix some beta warnings in the C API (#6578 ) This'll make the future upgrade to Rust 1.71 that much more smoother.	1 year ago
Afonso Bordado	1d6044311d	fuzzgen: Add `bb_padding_log2` option (#6575 )	1 year ago
Alex Crichton	aef1f57d13	Update adapter build (#6573 ) * Update adapter build * Rename the binary artifact to `wasi_snapshot_preview1.wasm` and update build scripts to account for this. * Update documentation to mention difference between reactor/command builds. Closes #6569 * More renaming	1 year ago
Afonso Bordado	f51643db64	riscv64: Add `isub` Widening Instructions (#6555 ) These are similar to the instructions added in #6542. The only difference is that the rules are not commutative so we have slightly fewer of them.	1 year ago
Saúl Cabrera	a50c49724e	winch(x64) Add support for if/else (#6550 ) * winch(x64) Add support for if/else This change adds the necessary building blocks to support control flow; this change also adds support for the `If` / `Else` operators. This change does not include multi-value support. The idea is to add support for multi-value across the compiler (functions and blocks) as a separate future change. The general gist of the change is to track the presence of control flow frames as part of the code generation context and emit the corresponding labels as and instructions as control flow blocks are found. * PR review * Allocate 64 slots for `ControlStackFrames` * Explicitly track else branches through an else entry in `ControlStackFrame`	1 year ago
Alex Crichton	3a81e450f9	x64: Move some constants directly in-line with ISLE (#6564 ) I believe that historically it was difficult to write a 128-bit constant in ISLE but nowadays ISLE supports `u128` integer literals so it's now possible to do that. This commit moves some existing constants in `x64/lower/isle.rs` into `lower.isle` directly to more easily understand them when reading over instruction lowerings by avoiding having to context switch between ISLE and Rust to understand the value of a constant.	1 year ago
Alex Crichton	7f108b1e3a	cranelift: Remove the `fcvt_low_from_sint` instruction (#6565 ) * cranelift: Remove the `fcvt_low_from_sint` instruction This commit removes this instruction since it's a combination of `swiden_low` plus `fcvt_from_sint`. This was used by the WebAssembly `f64x2.convert_low_i32x4_s` instruction previously but the corresponding unsigned variant of the instruction, `f64x2.convert_low_i32x4_u`, used a `uwiden_low` plus `fcvt_from_uint` combo. To help simplify Cranelift's instruction set and to make these two instructions mirrors of each other the Cranelift instruction is removed. The s390x and AArch64 backend lowering rules for this instruction could simply be deleted as the previous combination of the `swiden_low` and `fcvt_from_sint` lowering rules produces the same code. The x64 backend moved its lowering to a special case of the `fcvt_from_sint` lowering. * Fix cranelift-fuzzgen build	1 year ago
Alex Crichton	a986ce9682	egraphs: Lift `splat` outside of int-to-float conversions (#6563 ) This commit adds a targeted optimization aimed at fixing #6562 as a temporary measure for now. The "real" fix for #6562 is to add a full lowering of `fcvt_from_uint` to the x64 backend, but for now adding this rule should fix the specific issue cropping up. Closes #6562	1 year ago
Alex Crichton	9f3bf5c53b	Trim the size of the gh-pages branch (#6539 ) No need to retain a full history of this branch as it's purely a function of the latest commit, so configure some options to throw away its history.	1 year ago
Bobby Holley	5610cbf710	Bump cargo-vet to 0.7.0 (#6544 ) * Bump cargo-vet to 0.7.0. * Prune exemptions. * Import fermyon.	1 year ago
Jeffrey Charles	c26a3cf66f	Add clz and ctz instructions to Winch (#6557 )	1 year ago
Doug A	1cabece50b	Update mod.rs (#6559 ) update docs: prepend func call with `call_`	1 year ago
Nick Fitzgerald	e105aa385e	Cranelift: Get non-tail calls working for the "tail" calling convention (#6500 ) Co-authored-by: Jamey Sharp <jsharp@fastly.com>	1 year ago
Jamey Sharp	c3f40209dd	cranelift: don't enable trace-log feature by default (#6549 ) In #5382 ("egraph support: rewrite to work in terms of CLIF data structures"), we added the `trace-log` feature to the set of default features for `cranelift-codegen`. I think this was an accident, probably added while debugging and overlooked when cleaning up to merge. So let's undo that change. Fixes #6548.	1 year ago
Afonso Bordado	46826c6273	riscv64: Add `iadd` Widening Addition Instructions (#6542 ) Adds the widening add instructions from the V spec. These are `vwadd{u,}.{w,v}{v,x}`. This also adds a bunch of rules to try to match these instructions. And some of these end up being quite complex. Rules that match `{u,s}widen_high` are the same as their `{u,s}widen_low` counterparts but they first do a `vslidedown` of half the vector, to bring the top lanes down. `uwiden_low` rules are the same as the `swiden_low` rules, but they use `vwaddu.` instead of `vwadd.` which is the unsigned version of the instruction. Now, in each of these groups of rules we have a few different instructions. `vwadd.wv` does a 2SEW = 2SEW + SEW, this just means that the elements in the RHS vector are first sign extended before doing the addition. The only trick here is that since the result is 2SEW we must use a vstate type that has half the element size as the type that we want to end up with. So to end up with a i32x4 `iadd` we need to pass in a i16x4 type as a vstate type. `vwadd.vv` does 2SEW = SEW + SEW, so as long as both sides are extended we can use this instruction. Again we must pass in a type with half the element size. `vwadd.wx` and `vwadd.vx` do the same thing, but the RHS is expected to be a extended and splatted X register, so we try to match exactly that. To make these rules more applicable I've previously added some egraph rules (#6533) that convert `{u,s}widen_{low,high}` into `splat+{u,s}extend`, this way we only have to try to match the splat version, which reduces the number of rules. All of these rules use `vstate_mf2`. This is sets the LMUL setting to 1/2, meaning that at most we will read half of the source vector registers, and the result is guaranteed to fit in a single destination register. Otherwise the CPU could have to write the result into multiple register, which is something that the ISA supports, but adds a bunch of constraints that we dont need here.	1 year ago
Jamey Sharp	afd9aced3b	Audit some of the cargo vet backlog (#6536 ) Joint session between myself, pchickey, elliottt, itsrainy, and fitzgen.	1 year ago
Afonso Bordado	1d0565ba87	riscv64: Implement `{u,s}widen_{low,high}` and `load+extend` instructions (#6534 ) * riscv64: Add SIMD Load+Extends * riscv64: Add SIMD `{u,s}widen_{low,high}` * riscv64: Add `gen_slidedown_half` This isn't really necessary yet, but we are going to make a lot of use for it in the widening arithmetic instructions, so might as well add it now. * riscv64: Add multi widen SIMD instructions * riscv64: Typo Fix	1 year ago
Jeffrey Charles	f5fafba809	Add integer binary instructions to Winch (#6538 ) * Add integer binary instructions to Winch * Use handle_invalid_operand_combination and load_constant	1 year ago
Saúl Cabrera	7229ba9048	winch: Fix `CodeGenContext::pop_to_reg` (#6535 ) This commit fixes the implementation of `pop_to_reg`. In the previous implementation, whenever a specific register was requested as the destination register and a register-to-register moved happened the source register was never marked as free. This issue became more evident with more complex programs involving control flow and division for example.	1 year ago
Afonso Bordado	b357b1b1e9	egraphs: Transform `{u,s}widen_{low,high}+splat` into `splat+{u,s}extend` (#6533 )	1 year ago
Alex Crichton	8c530771d9	x64: Avoid using `movddup` without SSSE3 (#6496 ) * x64: Avoid using `movddup` without SSSE3 Update the lowerings for 64-bit splats to use `pshufd` instead, as LLVM does. * Disable runtest for now	1 year ago
Trevor Elliott	1eb8d750ee	Release notes for 10.0.0 (#6521 ) * Initial release notes * Apply suggestions from code review Co-authored-by: Pat Hickey <pat@moreproductive.org> Co-authored-by: Rainy Sinclair <844493+itsrainy@users.noreply.github.com> --------- Co-authored-by: Pat Hickey <pat@moreproductive.org> Co-authored-by: Rainy Sinclair <844493+itsrainy@users.noreply.github.com>	1 year ago
Saúl Cabrera	5c42fb490d	cranelift: Expose `MachLabel` (#6529 ) This is the first of a series of patches to support control flow in Winch. This change exposes `MachLabel` from cranelift for it to be consumed by Winch's `MacroAssembler` and `Assembler`.	1 year ago
Afonso Bordado	579918c2d6	riscv64: Implement SIMD `swizzle` and `shuffle` (#6515 ) * riscv64: Implement SIMD `swizzle` * riscv64: Implement SIMD `shuffle` * wasmtime: Enable more RISC-V SIMD tests * riscv64: Add TODO issue numbers * riscv64: Fix trailing newline issues	1 year ago
Nick Fitzgerald	81cd998350	A couple small Winch cleanups (#6526 ) * Winch: remove tabs, use spaces * Winch: remove unnecessary mutable references	1 year ago
Afonso Bordado	bea401414e	riscv64: Fix wrong move instruction in `select` (#6518 )	1 year ago
Afonso Bordado	cca5726781	fuzzgen: Fix timeout in interpreter vs interpreter mode (#6520 )	1 year ago
wasmtime-publish	b3fd185390	Bump Wasmtime to 11.0.0 (#6519 ) Co-authored-by: Wasmtime Publish <wasmtime-publish@users.noreply.github.com>	1 year ago
Iceber Gu	5ac3858527	fix the link to the published binaries of the component adapter (#6522 ) Signed-off-by: Iceber Gu <wei.cai-nat@daocloud.io>	1 year ago
Afonso Bordado	f7ae056a0a	riscv64: Implement SIMD shifts, `v{all,any}_true` and `vhigh_bits` (#6507 ) * riscv64: Add SIMD shifts * riscv64: Implement SIMD `vall_true` * riscv64: Implement SIMD `vany_true` * riscv64: Add SIMD `vhigh_bits` * wasmtime: Enable more RISC-V SIMD tests	1 year ago
Jamey Sharp	176935e75e	aarch64: Use Imm12 in more cases (#6512 ) The previous implementation of Imm12::maybe_from_u64 did not match the constant values 0xfff or 0xfff000, even though those are expressible in the aarch64 12-bit immediate format. Also the explicit test for 0 was unnecessary; it's a valid example of all bits outside the least-significant 12 bits being 0.	1 year ago
Alex Crichton	550a16f539	Fix a soundness issue with the component model and async (#6509 ) * Force `execute_across_threads` to use multiple threads Currently this uses tokio's `spawn_blocking` but that will reuse threads in its thread pool. Instead spawn a thread and perform a single poll on that thread to force lots of fresh threads to be used and ideally stress TLS management further. * Add a guard against using stale stacks This commit adds a guard to Wasmtime's async support to double-check that when a call to `poll` is finished that the currently active TLS activation pointer does not point to the stack that is being switched off of. This is attempting to be a bit of a defense-in-depth measure to prevent stale pointers from sticking around in TLS. This is currently happening and causing #6493 which can result in unsoundness but currently is manifesting as a crash. * Fix a soundness issue with the component model and async This commit addresses #6493 by fixing a soundness issue with the async implementation of the component model. This issue has been presence since the inception of the addition of async support to the component model and doesn't represent a recent regression. The underlying problem is that one of the base assumptions of the trap handling code is that there's only one single activation in TLS that needs to be pushed/popped when a stack is switched (e.g. a fiber is switched to or from). In the case of the component model there might be two activations: one for an invocation of a component function and then a second for an invocation of a `realloc` function to return results back to wasm (e.g. in the case an imported function returns a list). This problem is fixed by changing how TLS is managed in the presence of fibers. Previously when a fiber was suspended it would pop a single activation from the top of the stack and save that to get pushed when the fiber was resumed. This has the benefit of maintaining an entire linked list of activations for the current thread but has the problem above where it doesn't handle a fiber with multiple activations on it. Instead now TLS management is done when a fiber is resumed instead of suspended. Instead of pushing/popping a single activation the entire linked list of activations is tracked for a particular fiber and stored within the fiber itself. In this manner resuming a fiber will push all activations onto the current thread and suspending a fiber will pop all activations for the fiber (and store them as a new linked list in the fiber's state itself). This end result is that all activations on a fiber should now be managed correctly, regardless of how many there are. The main downside of this commit is that fiber suspension and resumption is more complicated, but the hope there is that fiber suspension typically corresponds with I/O not being ready or similar so the order of magnitude of TLS operations isn't too significant compared to the I/O overhead. Closes #6493 * Review comments * Fix restoration during panic	1 year ago
Jamey Sharp	1d4686de54	Allow async yield from epoch interruption callback (#6464 ) * Allow async yield from epoch interruption callback When an epoch interruption deadline arrives, previously it was possible to yield to the async executor, or to invoke a callback on the wasm stack, but not both. This changes the API to allow callbacks to run and then request yielding to the async executor. * Fix Wasmtime C API implementation	1 year ago
Jeffrey Charles	ace1388f60	Use two operands for Winch's masm cmp_with_set method (#6511 )	1 year ago
Benjamin Bouvier	112e52d722	Upgrade file per thread logger to 0.2.0 (#6503 ) * Upgrade file-per-thread-logger to v0.2.0 Signed-off-by: Benjamin Bouvier <public@benj.me> * Update audits too Signed-off-by: Benjamin Bouvier <public@benj.me> --------- Signed-off-by: Benjamin Bouvier <public@benj.me>	1 year ago
Jeffrey Charles	0893f7c741	Add support to Winch for i*.eqz instructions (#6508 )	1 year ago
Jeffrey Charles	2b20db1ce7	Refactor Winch x64 asm operand checks (#6506 )	1 year ago
Jan-Justin van Tonder	4cf3a7f688	cranelift-interpreter: Fix panic when bitcasting SIMD values (#6379 ) Fixes #5915	1 year ago
Afonso Bordado	f206732db9	riscv64: Cleanup shift rules (#6476 ) * riscv64: Add shifts compile tests * riscv64: Cleanup `ishl` rules * riscv64: Improve `ishl` uextend rules * riscv64: Cleanup `ushr` rules * riscv64: Improve `ushr` uextend rules * riscv64: Improve `sshr` rules * riscv64: Change shift rules priorities * riscv64: Remove trailing whitespace in shift tests	1 year ago
Nick Fitzgerald	dd211c593d	Cranelift: Remove unused parameter for `lower_return` in ISLE (#6499 ) Co-authored-by: Jamey Sharp <jsharp@fastly.com>	1 year ago
Nick Fitzgerald	f4bce01354	Cranelift: Fix typo: "ajust" -> "adjust" (#6501 ) Co-authored-by: Jamey Sharp <jsharp@fastly.com>	1 year ago
Brendan Burns	fc11f56318	Re-enable wasi-http for recent wit bindgen changes. Renable tests. (#6495 ) * Re-enable wasi-http * Address comments.	1 year ago
Nick Fitzgerald	d1a7628290	Cranelift: clean up `adjust_stack_and_nominal_sp` helper (#6497 ) Stop passing around sign booleans and just use the signed integer type we already have at hand.	1 year ago
Alex Crichton	d28986ed5d	Use Rust 1.69.0 instead of 1.70.0 (#6502 ) * Use Rust 1.69.0 instead of 1.70.0 Try to fix some recent CI breakage prtest:full * Don't force stable for test-programs	1 year ago
Alex Crichton	eb47240157	Fix `execute_across_threads` in tests (#6494 ) This commit fixes the helper function `execute_across_threads` in tests to actually execute across threads. This function was refactored in #3975 and accidentally introduced a bug where the future provided was polled once and then dropped, cancelling it instead of executing it to completion.	1 year ago
Remo Senekowitsch	9fcdc7a68e	fuzz: Insert random instructions (#6407 ) * Fix fuel consumption of ControlPlane::shuffle Co-authored-by: Falk Zwimpfer <24669719+FalkZ@users.noreply.github.com> Co-authored-by: Moritz Waser <mzrw.dev@pm.me> * Insert random instructions during lowering Co-authored-by: Falk Zwimpfer <24669719+FalkZ@users.noreply.github.com> Co-authored-by: Moritz Waser <mzrw.dev@pm.me> * add documentation for get_arbitrary Co-authored-by: Falk Zwimpfer <24669719+FalkZ@users.noreply.github.com> Co-authored-by: Moritz Waser <mzrw.dev@pm.me> * Fix zero-sized version of get_arbitrary Co-authored-by: Falk Zwimpfer <24669719+FalkZ@users.noreply.github.com> Co-authored-by: Moritz Waser <mzrw.dev@pm.me> * Insert ints and floats Co-authored-by: Falk Zwimpfer <24669719+FalkZ@users.noreply.github.com> Co-authored-by: Moritz Waser <mzrw.dev@pm.me> * fix inserting of floats Co-authored-by: Falk Zwimpfer <24669719+FalkZ@users.noreply.github.com> Co-authored-by: Moritz Waser <mzrw.dev@pm.me> * improve abstraction of MachInst::gen_imm_f64 Co-authored-by: Falk Zwimpfer <24669719+FalkZ@users.noreply.github.com> Co-authored-by: Moritz Waser <mzrw.dev@pm.me> --------- Co-authored-by: Falk Zwimpfer <24669719+FalkZ@users.noreply.github.com> Co-authored-by: Moritz Waser <mzrw.dev@pm.me>	1 year ago
Nick Fitzgerald	4e821d504b	Cranelift: Add the ability to pop stack while returning (#6478 ) This is necessary for implementing callee-pops calling conventions, as is required for tail calls. This is just a small part of tail calls, and doesn't implement everything, but is a good piece to land on its own so that eventual PR isn't so huge. Co-authored-by: Jamey Sharp <jsharp@fastly.com>	1 year ago
Andrew Brown	9aacf31ad2	wasi-threads: check module shape at spawn time (#6492 ) This change relaxes what kinds of modules can be run when wasi-threads is enabled via `--wasi-modules experimental-wasi-threads`. Previously, as reported in #6153, simple modules that made no use of thread spawning or shared memories were preemptively rejected when the wasi-threads context was created. This is too restrictive. Instead, this change does the following: - it moves the check for whether a module is valid according to the wasi-threads specification to the point a new thread is spawned; this resolves #6153 - as noted in #6153, this change also adds a better error message indicating that wasi-threads expects a shared memory import - the way this is implemented also improves the module instantiation: by constructing an `InstancePre` once when the `WasiThreadsCtx` is built, we might shave off a bit of time from the "spawn a thread" call; this supercedes a similar effort in #5741	1 year ago
Alex Crichton	8fb41ca4f9	x64: Don't require SSE4.1 for `enable_simd` (#6489 ) This commit removes the SSE4.1 requirement for the `enable_simd` CLIF feature. This means that the new baseline required is SSSE3 for the WebAssembly SIMD proposal. Many existing tests for codegen were all updated to explicitly enable `has_sse41` and runtests were updated to test with and without SSE 4.1. Wasmtime's fuzzing is additionally updated to flip the SSE4.1 feature to enable fuzz-testing this.	1 year ago

1 2 3 4 5 ...

11965 Commits (a90f625f09f8d1036c0808deebed25b9b2f75054) All Branches Search

11965 Commits (a90f625f09f8d1036c0808deebed25b9b2f75054)

All Branches