cranelift

Commit Graph

Author	SHA1	Message	Date
Trevor Elliott	aba239e9b8	Fix handling of jumps in bugpoint (#5794 ) Fixes #5792	2 years ago
Afonso Bordado	76539ef9f2	cranelift: Optimize `select+icmp` into `{s,u}{min,max}` (#5546 ) * cranelift: Optimize `select+icmp` into `{s,u}{min,max}` * cranelift: Add generic egraph icmp reverse rule * cranelift: Optimize `vselect+icmp` into `{s,u}{min,max}` * cranelift: Optimize some `vselect+fcmp` into `f{min,max}_pseudo` * cranelift: Add inverted forms of min/max rules	2 years ago
Trevor Elliott	f0137c2618	x64: Fix the formatting for `andn` (#5789 ) * Print AluRmRVex instructions with the destination last * Update andn tests	2 years ago
Alex Crichton	0037b71b11	Use xmm_rm_r more frequently in x64 backend (#5787 ) This updates the signatures of the `xmm_rm_r` helper function and then updates existing users and migrates other users to the helper now that the type information is no longer required.	2 years ago
Ulrich Weigand	305000d14b	s390x: Fix instruction encoding and disassembly format bugs (#5786 ) - Fix encoding of the AHY instruction. - Fix disassembly format of FIEBR, FIDBR, and LEDBRA instructions.	2 years ago
Ulrich Weigand	e10094dcd6	s390x: Support scalar min/max clif instructions (#5762 ) We don't have ISA instructions for that, so simply expand them to icmp + select. Also enable fuzzing for those clif instructions now.	2 years ago
Alex Crichton	255fd6be0a	Update world-selection in `bindgen!` macro (#5779 ) * Update world-selection in `bindgen!` macro Inspired by bytecodealliance/wit-bindgen#494 specifying a world or document to bindgen is now optional as it's inferred if there's only one `default world` in a package's documents. * Add cargo-vet entry	2 years ago
Alphyr	cb150d37ce	Update dependencies (#5513 )	2 years ago
Alex Crichton	49a89f91e5	Add cargo-vet entries for dependency update (#5778 ) This adds vet entries for the updates being performed in #5513	2 years ago
Alex Crichton	b5e9fb710b	Improve type imports into components (#5777 ) This commit fixes a panic related to type imports where an import of a type didn't correctly declare the new type index on the Wasmtime side of things. Additionally this plumbs more support throughout Wasmtime to support type imports, namely that they do not need to be supplied through a `Linker`. This additionally implements a feature where empty instances, even transitively, do not need to be supplied by a Wasmtime embedder. This means that instances which only have types, for example, do not need to be supplied into a `Linker` since no runtime information for them is required anyway. Closes #5775	2 years ago
Koute	e40a838beb	Prevent trampoline entrypoints from being stripped out during LTO (#5773 ) This works around a `rustc` bug where compiling with LTO will sometimes strip out some of the trampoline entrypoint symbols resulting in a linking failure.	2 years ago
Nick Fitzgerald	6df3bbbe60	Cranelift: Collapse double extends into a single extend (#5772 )	2 years ago
Saúl Cabrera	91c8114f00	winch: Add support for integer multiplication in x64. (#5769 ) This commit adds support for the `<i32\|i64>.mul` WebAssembly instructions in x64.	2 years ago
Trevor Elliott	19f337e29b	Move the default block to the front of the underlying jump table storage (#5770 ) The new api on JumpTableData makese it easy to keep the default label first, and that shrinks the diff in #5731 a bit.	2 years ago
Lukas Forst	6cddc923f3	Expose wasmtime_store_limiter in the c-api (#5761 )	2 years ago
Alex Crichton	a0a97f5e8f	Add (bnot (bxor x y)) lowerings for s390x/aarch64 (#5763 ) * Add (bnot (bxor x y)) lowerings for s390x/aarch64 I originally thought that s390x's original lowering in #5709, but as was rightfully pointed out `(bnot (bxor x y))` is equivalent to `(bxor x (bnot y))` so the special lowering for one should apply as a special lowering for the other. For the s390x and aarch64 backend that have already have a fused lowering of the bxor/bnot add a lowering additionally for the bnot/bxor combination. * Add bnot(bxor(..)) tests for s390x 128-bit sizes	2 years ago
Trevor Elliott	d99783fc91	Move default blocks into jump tables (#5756 ) Move the default block off of the br_table instrution, and into the JumpTable that it references.	2 years ago
Alex Crichton	49613be393	Update wasm-tools crates (#5757 ) * Update wasm-tools crates Pulls in a new component binary format which should hopefully be the last update for awhile. * Update cargo vet configuration	2 years ago
Ivan Font	de68cc1726	Add support for WASI sockets to C API (#5624 ) * Add support for WASI sockets to C API Add support for WASI sockets in the C API by adding a new API to handle preopening sockets for clients. This uses HashMap instead of Vec for preopened sockets to identify if caller has called in more than once with the same FD number. If so, then we return false so caller is given hint that they are attempting to overwrite an already existing socket FD. * Apply suggestions from code review Co-authored-by: Peter Huene <peter@huene.dev> * s/stdlistener/listener/ --------- Co-authored-by: Peter Huene <peter@huene.dev>	2 years ago
Trevor Elliott	15fe9c7c93	Inline jump tables in parsed br_table instructions (#5755 ) As jump tables are used by at most one br_table instruction, inline their definition in those instructions instead of requiring them to be declared as function-level metadata.	2 years ago
bjorn3	202d3af16a	Remove the unused sigid argument purpose (#5753 )	2 years ago
Amanieu d'Antras	a2d356d45e	Add `JITBuilder::with_flags` constructor (#5751 ) This allows custom flags to be set (e.g. `opt-level`) while still leaving most of of the boilerplate to select the native target to the `JITBuilder`.	2 years ago
Saúl Cabrera	7c5c7e4b6d	winch: Add full support for integer `sub` and `add` instructions (#5737 ) This patch adds complete support for the `sub` and `add` WebAssembly instructions for x64, and complete support for the `add` WebAssembly instruction for aarch64. This patch also refactors how the binary operations get constructed within the `VisitOperator` trait implementation. The refactor adds methods in the `CodeGenContext` to abstract all the common steps to emit binary operations, making this process less repetitive and less brittle (e.g. omitting to push the resulting value to the stack, or omitting to free registers after used). This patch also improves test coverage and refactors the filetests directory to make it easier to add tests for other instructions.	2 years ago
Ayomide Bamidele	9637840b4b	Update AMD and generic x86 CPU presets to match LLVM (#5575 ) * Add x86 AMD and generic presets * Fix typos * Add zen2 definition	2 years ago
Trevor Elliott	34ec4b4e44	Reuse `inst_predicates::visit_block_succs` in more places (#5750 ) Following up from #5730, replace some explicit matching over branch instructions with a use of inst_predicates::visit_block_succs.	2 years ago
Andrew Brown	cacc416080	wasi-threads: fix import name (#5748 ) * wasi-threads: fix import name As @TerrorJack pointed out in #5484, that PR implements an older name--`thread_spawn`. This change uses the now-official name from the specification--`thread-spawn`. * fix: update name in test	2 years ago
Alex Crichton	46fe366756	Fix a missing `async_trait` annotation in `bindgen!` (#5747 ) Closes #5743	2 years ago
Trevor Elliott	b0b3f67cb0	Move jump tables to the DataFlowGraph (#5745 ) Move the storage for jump tables off of FunctionStencil and onto DataFlowGraph. This change is in service of #5731, making it easier to access the jump table data in the context of helpers like inst_values.	2 years ago
Jamey Sharp	7bf89683e9	Generalize and/or/xor optimizations (#5744 ) * Generalize `n ^ !n` optimization to more types * Generalize `x & -1` optimization to more types Also mark the `x & x` rewrite to `subsume`. * Cranelift: Optimize x\|!x and x&!x to constants These cases are much like the existing x^!x rules.	2 years ago
Trevor Elliott	d71c9458dc	Make `DataFlowGraph::blocks` public (#5740 ) Similar to when we exposed the DataFlowGraph::insts field through a restrictive newtype, expose DataFlowGraph::blocks through an interface that allows a restrictive set of operations. This field being public now allows us to avoid a rematch in ssa construction, and simplifies the implementation of adding a block argument to a block referenced by a br_table instruction.	2 years ago
Jamey Sharp	f3b408d5e2	Algebraic opts: Reuse `iconst 0` from LHS (#5724 ) We don't need to spend time going through the GVN map to dedup a newly-constructed `iconst 0` when we already matched that value on the left-hand side of these rules. Also, mark these rules as subsuming any others since we can't do better than reducing an expression to a constant.	2 years ago
Trevor Elliott	116e5a665f	Bump regalloc2 to 0.6.0 (#5742 ) * Bump regalloc2 * Certify regalloc2 0.6.0	2 years ago
Andrew Brown	121094054b	doc: update Wasm threads support status (#5739 )	2 years ago
Trevor Elliott	3343cf80e9	Add assertions for matches that used to use analyze_branch (#5733 ) Following up from #5730, add debug assertions to ensure that new branch instructions don't slip through matches that used to use analyze_branch.	2 years ago
Nick Fitzgerald	317cc51337	Rename `VMCallerCheckedAnyfunc` to `VMCallerCheckedFuncRef` (#5738 ) At some point what is now `funcref` was called `anyfunc` and the spec changed, but we didn't update our internal names. This does that. Co-authored-by: Jamey Sharp <jsharp@fastly.com>	2 years ago
Andrew Brown	edfa10d607	wasi-threads: an initial implementation (#5484 ) This commit includes a set of changes that add initial support for `wasi-threads` to Wasmtime: * feat: remove mutability from the WasiCtx Table This patch adds interior mutability to the WasiCtx Table and the Table elements. Major pain points: * `File` only needs `RwLock<cap_std::fs::File>` to implement `File::set_fdflags()` on Windows, because of [1] * Because `File` needs a `RwLock` and `RwLockGuard` cannot be hold across an `.await`, The `async` from `async fn num_ready_bytes(&self)` had to be removed Because `File` needs a `RwLock` and `RwLockGuard` cannot be dereferenced in `pollable`, the signature of `fn pollable(&self) -> Option<rustix::fd::BorrowedFd>` changed to `fn pollable(&self) -> Option<Arc<dyn AsFd + '_>>` [1] `da238e324e/src/fs/fd_flags.rs (L210-L217)` wasi-threads: add an initial implementation This change is a first step toward implementing `wasi-threads` in Wasmtime. We may find that it has some missing pieces, but the core functionality is there: when `wasi::thread_spawn` is called by a running WebAssembly module, a function named `wasi_thread_start` is found in the module's exports and called in a new instance. The shared memory of the original instance is reused in the new instance. This new WASI proposal is in its early stages and details are still being hashed out in the [spec] and [wasi-libc] repositories. Due to its experimental state, the `wasi-threads` functionality is hidden behind both a compile-time and runtime flag: one must build with `--features wasi-threads` but also run the Wasmtime CLI with `--wasm-features threads` and `--wasi-modules experimental-wasi-threads`. One can experiment with `wasi-threads` by running: ```console $ cargo run --features wasi-threads -- \ --wasm-features threads --wasi-modules experimental-wasi-threads \ <a threads-enabled module> ``` Threads-enabled Wasm modules are not yet easy to build. Hopefully this is resolved soon, but in the meantime see the use of `THREAD_MODEL=posix` in the [wasi-libc] repository for some clues on what is necessary. Wiggle complicates things by requiring the Wasm memory to be exported with a certain name and `wasi-threads` also expects that memory to be imported; this build-time obstacle can be overcome with the `--import-memory --export-memory` flags only available in the latest Clang tree. Due to all of this, the included tests are written directly in WAT--run these with: ```console $ cargo test --features wasi-threads -p wasmtime-cli -- cli_tests ``` [spec]: https://github.com/WebAssembly/wasi-threads [wasi-libc]: https://github.com/WebAssembly/wasi-libc This change does not protect the WASI implementations themselves from concurrent access. This is already complete in previous commits or left for future commits in certain cases (e.g., wasi-nn). * wasi-threads: factor out process exit logic As is being discussed [elsewhere], either calling `proc_exit` or trapping in any thread should halt execution of all threads. The Wasmtime CLI already has logic for adapting a WebAssembly error code to a code expected in each OS. This change factors out this logic to a new function, `maybe_exit_on_error`, for use within the `wasi-threads` implementation. This will work reasonably well for CLI users of Wasmtime + `wasi-threads`, but embedders will want something better in the future: when a `wasi-threads` threads fails, they may not want their application to exit. Handling this is tricky, because it will require cancelling the threads spawned by the `wasi-threads` implementation, something that is not trivial to do in Rust. With this change, we defer that work until later in order to provide a working implementation of `wasi-threads` for experimentation. [elsewhere]: https://github.com/WebAssembly/wasi-threads/pull/17 * review: work around `fd_fdstat_set_flags` In order to make progress with wasi-threads, this change temporarily works around limitations induced by `wasi-common`'s `fd_fdstat_set_flags` to allow `&mut self` use in the implementation. Eventual resolution is tracked in https://github.com/bytecodealliance/wasmtime/issues/5643. This change makes several related helper functions (e.g., `set_fdflags`) take `&mut self` as well. * test: use `wait`/`notify` to improve `threads.wat` test Previously, the test simply executed in a loop for some hardcoded number of iterations. This changes uses `wait` and `notify` and atomic operations to keep track of when the spawned threads are done and join on the main thread appropriately. * various fixes and tweaks due to the PR review --------- Signed-off-by: Harald Hoyer <harald@profian.com> Co-authored-by: Harald Hoyer <harald@profian.com> Co-authored-by: Alex Crichton <alex@alexcrichton.com>	2 years ago
Trevor Elliott	2c8425998b	Refactor matches that used to consume BranchInfo (#5734 ) Explicitly borrow the instruction data, and use a mutable borrow to avoid rematch.	2 years ago
Raekye	fdd4a778fc	Fix ABI of jitted function in cranelift-jit example. (#5736 )	2 years ago
Alex Crichton	72962c9f08	Add some minor souper-harvested optimizations (#5735 ) I was playing around with souper recently on some wasms I had lying around and these are some optimization opportunities that popped out which seemed easy-enough to add to the egraph-based optimizations.	2 years ago
Brendan Burns	08403c9915	Update base64 dependency to 0.21.0 (#5702 ) * Update base64 to 0.21.0 * Update code for base64 0.21.0	2 years ago
Chris Fallin	673b448cfe	Cranelift DFG: make inst clone deep-clone varargs lists. (#5727 ) When investigating #5716, I found that rematerialization of a `call`, in addition to blowing up for other reasons, caused aliasing of the varargs list (the `EntityList` in the `ListPool`), such that editing the args of the second copy of the call instruction inadvertently updated the first as well. This PR modifies `DataFlowGraph::clone_inst` so that it always clones the varargs list if present. This shouldn't have any functional impact on Cranelift today, because we don't rematerialize any instructions with varargs; but it's important to get it right to avoid a bug later!	2 years ago
Trevor Elliott	c8a6adf825	Remove analyze_branch and BranchInfo (#5730 ) We don't have overlap in behavior for branch instructions anymore, so we can remove analyze_branch and instead match on the InstructionData directly. Co-authored-by: Jamey Sharp <jamey@minilop.net>	2 years ago
Chris Fallin	75ae976adc	egraphs: fix accidental remat of call. (#5726 ) In the provided test case in #5716, the result of a call was then added to 0. We have a rewrite rule that sets the remat-bit on any add of a value and a constant, because these frequently appear (e.g. from address offset calculations) and this can frequently reduce register pressure (one long-lived base vs. many long-lived base+offset values). Separately, we have an algebraic rule that `x+0` rewrites to `x`. The result of this was that we had an eclass with the remat bit set on the add, but the add was also union'd into the call. We pick the latter during extraction, because it's cheaper not to do the add at all; but we still get the remat bit, and try to remat a call (!), which blows up later. This PR fixes the logic to look up the "best value" for a value (i.e., whatever extraction determined), and look up the remat bit on that node, not the canonical node. (Why did the canonical node become the iadd and not the call? Because the former had a lower value-number, as an accident of IR construction; we don't impose any requirements on the input CLIF's value-number ordering, and I don't think this breaks any of the important acyclic properties, even though there is technically a dependence from a lower-numbered to a higher-numbered node. In essence one can think of them as having "virtual numbers" in any true topologically-sorted order, and the only place the actual integer indices matter should be in choosing the "canonical ID", which is just used for dedup'ing, modulo this bug.) Fixes #5716.	2 years ago
bjorn3	16afefdab1	Some refactorings to the ISLE parser (#5693 ) * Use is_ascii_digit and is_ascii_hexdigit in the ISLE lexer * Use range pattern in ISLE lexer * Use a couple of shorthands in the ISLE parser * Use parse_ident instead of symbol + str_to_ident * Introduce token eating api This is a non-fatal version of the take api * Rename take to expect and add expect_ prefixes to several methods * Review comments	2 years ago
Trevor Elliott	e9c05622c0	Keep reachable jump tables (#5721 ) Instead of identifying unused branch tables by looking for unused blocks inside of them, track used branch tables while traversing reachable blocks. This introduces an extra allocation of an EntitySet to track the used jump tables, but as those are few and this function runs once per ir::Function, the allocation seems reasonable.	2 years ago
Jamey Sharp	65c1f654f2	Cranelift: Only build iconst for ints <= 64 bits (#5723 ) I audited the egraph "algebraic" optimization rules for any which construct an `iconst` on the right-hand side of the rule. In these cases we need to constrain the type passed to `iconst` to be both `fits_in_64` and `ty_int`, because `iconst` is not defined on other types.	2 years ago
Alex Crichton	284fec127a	Remove explicit `S` type from component functions (#5722 ) I ended up forgetting this as part of #5275.	2 years ago
Saúl Cabrera	939b6ea933	winch: Fix retrieving function signature for compilation (#5725 ) This commit fixes an incorrect usage of `func_type_at` to retrieve a defined function signature and instead uses `function_at` to retrieve the signature. Additionally it enhances `winch-tools` `compile` and `test` commands to handle modules with multiple functions correctly.	2 years ago
Alex Crichton	1390882b56	Update release notes for 6.0.0 (#5719 )	2 years ago
Alex Crichton	de0e0bea3f	Legalize `b{and,or,xor}_not` into component instructions (#5709 ) * Remove trailing whitespace in `lower.isle` files * Legalize the `band_not` instruction into simpler form This commit legalizes the `band_not` instruction into `band`-of-`bnot`, or two instructions. This is intended to assist with egraph-based optimizations where the `band_not` instruction doesn't have to be specifically included in other bit-operation-patterns. Lowerings of the `band_not` instruction have been moved to a specialization of the `band` instruction. * Legalize `bor_not` into components Same as prior commit, but for the `bor_not` instruction. * Legalize bxor_not into bxor-of-bnot Same as prior commits. I think this also ended up fixing a bug in the s390x backend where `bxor_not x y` was actually translated as `bnot (bxor x y)` by accident given the test update changes. * Simplify not-fused operands for riscv64 Looks like some delegated-to rules have special-cases for "if this feature is enabled use the fused instruction" so move the clause for testing the feature up to the lowering phase to help trigger other rules if the feature isn't enabled. This should make the riscv64 backend more consistent with how other backends are implemented. * Remove B{and,or,xor}Not from cost of egraph metrics These shouldn't ever reach egraphs now that they're legalized away. * Add an egraph optimization for `x^-1 => ~x` This adds a simplification node to translate xor-against-minus-1 to a `bnot` instruction. This helps trigger various other optimizations in the egraph implementation and also various backend lowering rules for instructions. This is chiefly useful as wasm doesn't have a `bnot` equivalent, so it's encoded as `x^-1`. * Add a wasm test for end-to-end bitwise lowerings Test that end-to-end various optimizations are being applied for input wasm modules. * Specifically don't self-update rustup on CI I forget why this was here originally, but this is failing on Windows CI. In general there's no need to update rustup, so leave it as-is. * Cleanup some aarch64 lowering rules Previously a 32/64 split was necessary due to the `ALUOp` being different but that's been refactored away no so there's no longer any need for duplicate rules. * Narrow a x64 lowering rule This previously made more sense when it was `band_not` and rarely used, but be more specific in the type-filter on this rule that it's only applicable to SIMD types with lanes. * Simplify xor-against-minus-1 rule No need to have the commutative version since constants are already shuffled right for egraphs * Optimize band-of-bnot when bnot is on the left Use some more rules in the egraph algebraic optimizations to canonicalize band/bor/bxor with a `bnot` operand to put the operand on the right. That way the lowerings in the backends only have to list the rule once, with the operand on the right, to optimize both styles of input. * Add commutative lowering rules * Update cranelift/codegen/src/isa/x64/lower.isle Co-authored-by: Jamey Sharp <jamey@minilop.net> --------- Co-authored-by: Jamey Sharp <jamey@minilop.net>	2 years ago

1 2 3 4 5 ...

10919 Commits (4c88acbb897c3843c5f0f11eb0f1b0729656e18b) All Branches Search

10919 Commits (4c88acbb897c3843c5f0f11eb0f1b0729656e18b)

All Branches