cranelift

Commit Graph

Author	SHA1	Message	Date
Saúl Cabrera	426c49b8e3	winch: Use aarch64 backend for code emission. (#5652 ) This patch introduces basic aarch64 code generation by using `cranelift-codegen`'s backend. This commit does not: * Change the semantics of the code generation * Adds support for other Wasm instructions The most notable change in this patch is how addressing modes are handled at the MacroAssembler layer: instead of having a canonical address representation, this patch introduces the addressing mode as an associated type in the MacroAssembler trait. This approach has the advantage that gives each ISA enough flexiblity to describe the addressing modes and their constraints in isolation without having to worry on how a particular addressing mode is going to affect other ISAs. In the case of Aarch64 this becomes useful to describe indexed addressing modes (particularly from the stack pointer). This patch uses the concept of a shadow stack pointer (x28) as a workaround to Aarch64's stack pointer 16-byte alignment. This constraint is enforced by: * Introducing specialized addressing modes when using the real stack pointer; this enables auditing when the real stack pointer is used. As of this change, the real stack pointer is only used in the function's prologue and epilogue. * Asserting that the real stack pointer is not used as a base for addressing modes. * Ensuring that at any point during the code generation process where the stack pointer changes (e.g. when stack space is allocated / deallocated) the value of the real stack pointer is copied into the shadow stack pointer.	2 years ago
Alex Crichton	a2a0a9ef5b	Update to the latest `wit-parser` (#5694 ) This notably pulls in support in WIT for types-in-worlds.	2 years ago
Alex Crichton	545749b279	Fix some wit-bindgen-related issues with generated bindings (#5692 ) * Prefix component-bindgen-generated-functions with `call_` This fixes clashes between Rust-native methods and the methods themselves. For example right now `new` is a Rust-generated function for constructing the wrapper but this can conflict with a world-exported function called `new`. Closes #5585 * Fix types being both shared and owned This refactors some inherited cruft from the original `wit-bindgen` repository to be more Wasmtime-specific and fixes a codegen case where a type was used in both a shared and an owned context. Closes #5688	2 years ago
Alex Crichton	63d80fc509	Remove the need to have a `Store` for an `InstancePre` (#5683 ) * Remove the need to have a `Store` for an `InstancePre` This commit relaxes a requirement of the `InstancePre` API, notably its construction via `Linker::instantiate_pre`. Previously this function required a `Store<T>` to be present to be able to perform type-checking on the contents of the linker, and now this requirement has been removed. Items stored within a linker are either a `HostFunc`, which has type information inside of it, or an `Extern`, which doesn't have type information inside of it. Due to the usage of `Extern` this is why a `Store` was required during the `InstancePre` construction process, it's used to extract the type of an `Extern`. This commit implements a solution where the type information of an `Extern` is stored alongside the `Extern` itself, meaning that the `InstancePre` construction process no longer requires a `Store<T>`. One caveat of this implementation is that some items, such as tables and memories, technically have a "dynamic type" where during type checking their current size is consulted to match against the minimum size required of an import. This no longer works when using `Linker::instantiate_pre` as the current size used is the one when it was inserted into the linker rather than the one available at instantiation time. It's hoped, however, that this is a relatively esoteric use case that doesn't impact many real-world users. Additionally note that this is an API-breaking change. Not only is the `Store` argument removed from `Linker::instantiate_pre`, but some other methods such as `Linker::define` grew a `Store` argument as the type needs to be extracted when an item is inserted into a linker. Closes #5675 * Fix the C API * Fix benchmark compilation * Add C API docs * Update crates/wasmtime/src/linker.rs Co-authored-by: Andrew Brown <andrew.brown@intel.com> --------- Co-authored-by: Andrew Brown <andrew.brown@intel.com>	2 years ago
Saúl Cabrera	f5f517e811	winch: Small clean-up for x64 (#5691 ) This commit contains a small set of clean up items for x64. Notably: * Adds filetests * Documents why 16 for the arg base offset abi implementation, for clarity. * Fixes a bug in the spill implementation caught while anlyzing the filetests results. The fix consists of emitting a load instead of a store into the scratch register before spiiling its value. * Remove dead code for pretty printing registers which is not needed anymore since we now have proper disassembly.	2 years ago
Trevor Elliott	446337c746	Generate an instance_pre wrapper in the component bindgen output (#5685 )	2 years ago
Jun Ryung Ju	9cd4146939	Implemented `b{and,or,xor}_not` bitops for ty_int_ref_scalar_64 type. (#5604 ) * Implemented `b{and,or,xor}_not` bitops for ty_int_ref_scalar_64 type. * Added tests.	2 years ago
Jamey Sharp	ac4d28f4dd	Constant-fold icmp instructions (#5666 ) We found examples of icmp instructions with both operands constant in spidermonkey.wasm.	2 years ago
Nick Fitzgerald	bdfb746548	Cranelift: Introduce the `return_call` and `return_call_indirect` instructions (#5679 ) * Cranelift: Introduce the `tail` calling convention This is an unstable-ABI calling convention that we will eventually use to support Wasm tail calls. Co-Authored-By: Jamey Sharp <jsharp@fastly.com> * Cranelift: Introduce the `return_call` and `return_call_indirect` instructions These will be used to implement tail calls for Wasm and any other language targeting CLIF. The `return_call_indirect` instruction differs from the Wasm instruction of the same name by taking a native address callee rather than a Wasm function index. Co-Authored-By: Jamey Sharp <jsharp@fastly.com> * Cranelift: Implement verification rules for `return_call[_indirect]` They must: * have the same return types between the caller and callee, * have the same calling convention between caller and callee, * and that calling convention must support tail calls. Co-Authored-By: Jamey Sharp <jsharp@fastly.com> * cargo fmt --------- Co-authored-by: Jamey Sharp <jsharp@fastly.com>	2 years ago
Nick Fitzgerald	ffbbfbffce	Cranelift: Rewrite `or(and(x, y), not(y)) => or(x, not(y))` again (#5684 ) This rewrite was introduced in #5676 and then reverted in #5682 due to a footgun where we accidentally weren't actually checking the `y == !z` precondition. This commit fixes the precondition check. It also fixes the arithmetic to be correctly masked to the value type's width. This reverts commit `268f6bfc1d`.	2 years ago
Alex Crichton	91b8a2c527	Always allocate `Instance` memory with `malloc` (#5656 ) This commit removes the pooling of `Instance` allocations from the pooling instance allocator. This means that the allocation of `Instance` (and `VMContext`) memory, now always happens through the system `malloc` and `free` instead of optionally being part of the pooling instance allocator. Along the way this refactors the `InstanceAllocator` trait so the pooling and on-demand allocators can share more structure with this new property of the implementation. The main rationale for this commit is to reduce the RSS of long-lived programs which allocate instances with the pooling instance allocator and aren't using the "next available" allocation strategy. In this situation the memory for an instance is never decommitted until the end of the program, meaning that eventually all instance slots will become occupied and resident. This has the effect of Wasmtime slowly eating more and more memory over time as each slot gets an instance allocated. By switching to the system allocator this should reduce the current RSS workload from O(used slots) to O(active slots), which is more in line with expectations.	2 years ago
Alex Crichton	8ffbb9cfd7	Reimplement the pooling instance allocation strategy (#5661 ) * Reimplement the pooling instance allocation strategy This commit is a reimplementation of the strategy by which the pooling instance allocator selects a slot for a module. Previously there was a choice amongst three different algorithms: "reuse affinity", "next available", and "random". The default was "reuse affinity" but some new data has come to light which shows that this may not always be a good default. Notably the pooling allocator will retain some memory per-slot in the pooling instance allocator, for example instance data or memory data if-so-configured. This means that a currently unused, but previously used, slot can contribute to the RSS usage of a program using Wasmtime. Consequently the RSS impact here is O(max slots) which can be counter-intuitive for embedders. This particularly affects "reuse affinity" because the algorithm for picking a slot when there are no affine slots is "pick a random slot", which means eventually all slots will get used. In discussions about possible ways to tackle this, an alternative to "pick a strategy" arose and is now implemented in this commit. Concretely the new allocation algorithm for a slot is now: * First pick the most recently used affine slot, if one exists. * Otherwise if the number of affine slots to other modules is above some threshold N then pick the least-recently used affine slot. * Otherwise pick a slot that's affine to nothing. The "N" in this algorithm is configurable and setting it to 0 is the same as the old "next available" strategy while setting it to infinity is the same as the "reuse affinity" algorithm. Setting it to something in the middle provides a knob to allow a modest "cache" of affine slots while not allowing the total set of slots used to grow too much beyond the maximal concurrent set of modules. The "random" strategy is now no longer possible and was removed to help simplify the allocator. * Resolve rustdoc warnings in `wasmtime-runtime` crate * Remove `max_cold` as it duplicates the `slot_state.len()` * More descriptive names * Add a comment and debug assertion * Add some list assertions	2 years ago
yuyang	cb3b6c621f	fix rotl.i16 with i128 shift value. (#5611 ) * fix issue 5523. * fix. * add missing issue file. * fix issue. * fix duplicate shamt_128. * issue 5523 add test target,and fix some wrong comment. * fix output file. * enable llvm_abi_extensions for regression test file.	2 years ago
Trevor Elliott	268f6bfc1d	Revert "Cranelift: Rewrite `or(and(x, y), not(y)) => or(x, not(y))` (#5676 )" (#5682 ) This reverts commit `8c9eb9939b`. Fixes #5680	2 years ago
yuyang	0c66a1bba7	Fix issue 5528 (#5605 ) * fix parameter error. * fix float convert to i8 and i16 should extract sign bit. * add missing regression test file. * using tmp register. * float convert i8 will consume more instructions. * fix worse inst emit size. * fix worst_case_size.	2 years ago
Nick Fitzgerald	8c9eb9939b	Cranelift: Rewrite `or(and(x, y), not(y)) => or(x, not(y))` (#5676 ) Co-authored-by: Rainy Sinclair <844493+itsrainy@users.noreply.github.com>	2 years ago
Trevor Elliott	e82995f03c	Add a convenience function for displaying a BlockCall (#5677 ) Add a display method to BlockCall that returns a std::fmt::Displayable result. Rework the display code in the write module of cranelift-codegen to use this method instead.	2 years ago
Nick Fitzgerald	253e28ca4f	Cranelift: Rewrite `(x>>k)<<k` into masking off the bottom `k` bits (#5673 ) * Cranelift: Rewrite `(x>>k)<<k` into masking off the bottom `k` bits * Add a runtest for exercising our rewrite of `(x >> k) << k` into masking	2 years ago
Alex Crichton	7f2c8e6344	Fix some warnings on nightly Rust (#5668 ) * Fix some warnings on nightly Rust Cargo is warning about the usage of workspace dependencies where the workspace declaration does not mention `default-features` but the dependency mentions `default-features`, so this explicitly turns off default features for `cranelift-codegen` at the workspace level and removes the explicit `default-features = false` at the manifest levels. * Explicitly enable default feature in wasmtime * Enable another feature	2 years ago
Nick Fitzgerald	7aa240e0f2	Cranelift: constant propagate shifts (#5671 ) Thanks to Souper for pointing out we weren't doing this!	2 years ago
Trevor Elliott	10fcd14287	Remove unused code from the write module (#5674 ) The DisplayValuesWithDelimiter struct is no longer used.	2 years ago
Kevin Rizzo	f110bd98d1	Making sure that new files in the winch filetests directory will cause a rebuild (#5672 )	2 years ago
Nick Fitzgerald	c9d1c068bc	Cranelift: Add egraph rule to rewrite `x * C ==> x << log2(C)` when `C` is a power of two (#5647 )	2 years ago
Jamey Sharp	61270cdaed	ISLE: reject multi-term rules with explicit priorities (#5663 ) In multi-terms, all matching rules fire. We treat the result as an unordered set of values, so setting rule priorities is meaningless. We want to prohibit relying on the rule match order in this case. Also, codegen can produce invalid Rust if rules with different priorities both match against a multi-term. We first documented this symptom in #5647. As far as I can figure, prohibiting rule priorities prevents all possible instances of that bug. At some point in the future we might decide we want to carefully define semantics for multi-term result ordering, at which point we can revisit this.	2 years ago
Alex Crichton	d61758e2e9	Pin release artifacts Rust toolchain (#5669 ) This fixes the build issue identified in #5664 at the toolchain level rather than working around it in our own build. The next step in fixing this will be to remove the nightly override in the future when the toolchain becomes stable.	2 years ago
Nick Fitzgerald	bf4d0e9212	Cranelift: Fix `select` condition harvesting (#5662 ) Souper requires an `i1` condition value, we don't and will implicitly check against 0. We were truncating conditions but should actually be doing the comparison against `0`.	2 years ago
Trevor Elliott	cc768f22a2	Debug the build step (#5664 ) Change the permissions on libwasmtime.a before copying it, to avoid errors stemming from new behavior in rustc-1.67.	2 years ago
Trevor Elliott	b5692db7ce	Remove boolean parameters from instruction builder functions (#5658 ) Remove the boolean parameters from the instruction builder functions, as they were only ever used with true. Additionally, change the returns and branches functions to imply terminates_block.	2 years ago
Nick Fitzgerald	e4fa355866	cranelift: Generate the correct souper size for comparisons in LHSes (#5659 )	2 years ago
Chris Fallin	f488d93c5a	Wasmtime: build release artifacts with `all-arch`. (#5657 ) This allows the `wasmtime` binary provided in our release artifacts to cross-compile: `wasmtime compile` can build a `.cwasm` for any platform that Wasmtime supports, not just the host platform. This may be useful in some deployment scenarios. We don't turn on `all-arch` by default because it increases build time and binary size of Wasmtime itself, and other embedders of the `wasmtime` crate won't necessarily want this; hence, we set it only as part of the CI build configuration. Fixes #5655.	2 years ago
Nick Fitzgerald	ffcd61b520	Cranelift: Harvest each Souper LHS into its own file (#5649 ) * Cranelift: Harvest each Souper LHS into its own file Souper only handles one input LHS at a time, so this makes it way easier to script. Don't need to try and parse each LHS. * Add audit of `arrayref` version 0.3.6 * Add audit of `constant_time_eq` version 0.2.4	2 years ago
Trevor Elliott	a5698cedf8	cranelift: Remove brz and brnz (#5630 ) Remove the brz and brnz instructions, as their behavior is now redundant with brif.	2 years ago
yuyang	77cf547f41	fix issue 5569. (#5612 ) * add regression test file. * fix issute5569. * enable code length check.	2 years ago
Thibault Charbonnier	e835255fbf	c-api: add Wasmtime version macros to wasmtime.h (#5651 ) * Add several `WASMTIME_VERSION_` macros to `wasmtime.h`. Update `scripts/publish.rs` * To set these macros as per the new version in `./Cargo.toml` during `./publish bump`. * To verify the macros match the version in `./Cargo.toml` during `./publish verify`. Fix #5635	2 years ago
Trevor Elliott	20a216923b	Fix an assertion failure with an empty Switch (#5650 ) Fix an error introduced in #5644, where an unsigned subtraction from zero was possible with an empty Switch structure. Additionally, missing the empty case caused us to not emit a branch to the default block. This PR fixes the issue by detecting the empty Switch case early, and emitting a jump.	2 years ago
Nick Fitzgerald	ffbcc67eb3	Cranelift: Consider shifts as "simple" arithmetic in egraph cost model (#5646 )	2 years ago
Trevor Elliott	b47006d432	Rework the switch module in cranelift-frontend in terms of brif (#5644 ) Rework the compilation strategy for switch to: * use brif instead of brz and brnz * generate tables inline, rather than delyaing them to after the decision tree has been generated * avoid allocating new vectors by using slices into the sorted contiguous ranges * avoid generating some unconditional jumps * output differences in test output using the similar crate for easier debugging	2 years ago
Saúl Cabrera	0f8393508a	cranelift-codegen: Expose `EmitState` and `EmitInfo` from aarch64 (#5640 ) This commit exposes `EmitState` and `EmitInfo` so that they can be consumed by Winch. This is a follow up to https://github.com/bytecodealliance/wasmtime/pull/5570, in which this should've been included.	2 years ago
Trevor Elliott	058d93bc64	Migrate cranelift-wasm to brif (#5638 ) Incrementally working towards removing brz and brnz completely.	2 years ago
Jamey Sharp	915801551b	Delete old cranelift-preopt crate (#5642 ) Most of these optimizations are in the egraph `cprop.isle` rules now, making a separate crate unnecessary. Also I think the `udiv` optimizations here are straight-up wrong (doing signed instead of unsigned division, and panicking instead of preserving traps on division by zero) so I'm guessing this crate isn't seriously used anywhere. At the least, bjorn3 confirms that cg_clif doesn't use this, and I've verified that Wasmtime doesn't either. Closes #1090.	2 years ago
Trevor Elliott	a181ad2932	Cleanup the use of `maybe_uextend` in the x64 lowerings (#5637 ) Use maybe_uextend for the brnz lowerings on x64.	2 years ago
Trevor Elliott	7926808e8e	riscv64: improve unordered comparison generated code (#5636 ) Improve the generated code for unordered floating point comparisons by negating the comparison and inveritng the branches. This allows us to pick the unordered versions, which generate significantly better code.	2 years ago
Alex Crichton	4ad86752de	Fix libcall relocations for precompiled modules (#5608 ) * Fix libcall relocations for precompiled modules This commit fixes some asserts and support for relocation libcalls in precompiled modules loaded from disk. In doing so this reworks how mmaps are managed for files from disk. All non-file-backed `Mmap` entries are read/write but file-backed versions were readonly. This commit changes this such that all `Mmap` objects, even if they're file-backed, start as read/write. The file-based versions all use copy-on-write to preserve the private-ness of the mapping. This is not functionally intended to change anything. Instead this should have some more memory writable after a module is loaded but the text section, for example, is still left as read/execute when loading is finished. Additionally this makes modules compiled in memory more consistent with modules loaded from disk. * Update a comment * Force images to become readonly during publish This marks compiled images as entirely readonly during the `CodeMemory::publish` step which happens just before the text section becomes executable. This ensures that all images, no matter where they come from, are guaranteed frozen before they start executing.	2 years ago
Alex Crichton	38bf38c514	Flag to rustdoc component support requires a feature (#5632 ) This helps render the information "officially" in documentation.	2 years ago
Alex Crichton	a7d0d00e57	Update wasm-tools crates (#5631 ) Nothing major pulled in here, but wanted to update to the latest versions which enable tail calls by default. When used in Wasmtime, however, the feature is disabled without the possibility of being enabled since it's not implemented.	2 years ago
Trevor Elliott	b58a197d33	cranelift: Add a conditional branch instruction with two targets (#5446 ) Add a conditional branch instruction with two targets: brif. This instruction will eventually replace brz and brnz, as it encompasses the behavior of both. This PR also changes the InstructionData layout for instruction formats that hold BlockCall values, taking the same approach we use for Value arguments. This allows branch_destination to return a slice to the BlockCall values held in the instruction, rather than requiring that we pattern match on InstructionData to fetch the then/else blocks. Function generation for fuzzing has been updated to generate uses of brif, and I've run the cranelift-fuzzgen target locally for hours without triggering any new failures.	2 years ago
bjorn3	ec6922ff24	Produce an error at runtime rather than at compile time for unsupported architectures in cranelift-native (#5627 )	2 years ago
Jamey Sharp	bfc6aad184	cranelift-isle: codegen from new IR (#5435 ) ISLE's existing code-generation strategy doesn't generate the most efficient matching order for rules. This PR completely replaces it. With this PR applied, wasmtime compile retires 2% fewer instructions on the pulldown-cmark and spidermonkey benchmarks from Sightglass. A dev build of cranelift-codegen from an empty target/ directory takes 2% less time. The build script, invoking ISLE, takes a little longer, but Rust can compile the generated code faster, so it balances out.	2 years ago
Jamey Sharp	fef9f64d2c	x86: Test paired udiv/urem (#5573 ) Ideally these pairs of CLIF instructions should emit a single x86 instruction, but they don't today. This test will tell us if somebody fixes that. Similar tests might make sense for imul/umulhi as well as signed versions, but I haven't tried that.	2 years ago
Alex Crichton	293005bd64	Fix calculation of param/result types in wit-bindgen (#5622 ) This commit fixes a bug in the `bindgen!` macro for components where previously the `param` and `result` properties weren't properly calculated depending on the structure of the type and which types were visited in which order. This is simplified to use a `LiveTypes` structure from the `wit-parser` crate and relies on that to do necessary recursion.	2 years ago

1 2 3 4 5 ...

10857 Commits (91c8114f00e2e84b6c9f9aa7ea0b49702ca820db) All Branches Search

10857 Commits (91c8114f00e2e84b6c9f9aa7ea0b49702ca820db)

All Branches