scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-22 17:40:34 +00:00

Author	SHA1	Message	Date
Wojciech Mitros	6d89d718d9	wasm: replace wasm programs with their source programs After recent changes, we are able to store only the C/Rust source code for Wasm programs, and only build them when necessary. This patch utilizes this opportunity by removing most of the currently stored raw Wasm programs, replacing them with C/Rust sources and adding them to the new build system.	2023-05-08 10:47:34 +02:00
Wojciech Mitros	0a34a54c73	test: extend capabilities of Wasm reading helper function Currently, we require that the Wasm file is named the same as the function. In the future we may want multiple functions with the same name, which we can't currently do due to this limitation. This patch allows specifying the function name, so that multiple files can have a function with the same name. Additionally, the helper method now escapes "'" characters, so that they can appear in future Wasm files.	2023-05-08 10:47:34 +02:00
Wojciech Mitros	d4851ccae7	treewide: rename the "xwasm" UDF language to "wasm" When the WASM UDFs were first introduced, the LANGUAGE required in the CQL statements to use them was "xwasm", because the ABI for the UDFs was still not specified and changes to it could be backwards incompatible. Now, the ABI is stabilized, but if backwards incompatible changes are made in the future, we will add a new ABI version for them, so the name "xwasm" is no longer needed and we can finally change it to "wasm". Closes #13089	2023-03-07 10:21:11 +02:00
Wojciech Mitros	6d2e785b5c	docs: update wasm.md The WASM UDF implementation has changed since the last time the docs were written. In particular, the Rust helper library has been released, and using it should be the recommended method. Some decisions that were only experimental at the start, were also "set in stone", so we should refer to them as such. The docs also contain some code examples. This patch adds tests for these examples to make sure that they are not wrong and misleading. Closes #12941	2023-02-28 20:59:25 +02:00
Wojciech Mitros	b25ee62f75	wasm udf: deserialize counters as integers Currently, because serialize_visitor::operator() is not implemented for counters, we cannot convert a counter returned by a WASM UDF to bytes when returning from wasm::run_script(). We could disallow using counters as WASM UDF return types, but an easier solution which we're already using in Lua UDFs is treating the returned counters as 64-bit integers when deserializing. This patch implements the latter approach and adds a test for it.	2023-02-13 14:24:20 +01:00
Wojciech Mitros	3b8bf1ae3a	test_wasm.py: add utility function for reading WASM UDF saved in files Currently, we're repeating the same os.path, open, read, replace each time we read a WASM UDF from a file. To reduce code bloat, this patch adds a utility function "read_function_from_file" that finds the file and reads it given a function name and an optional new name, for cases when we want to use a different name in cql (mostly for unique_names).	2023-02-13 14:24:20 +01:00
Wojciech Mitros	b8d28a95bf	wasm: add configuration options for instance cache and udf execution Different users may require different limits for their UDFs. This patch allows them to configure the size of their wasm instance cache, the maximum size of individual instances stored in the cache, the time after which the instances are evicted, the fuel that all wasm UDFs are allowed to consume before yielding (for the control of latency), the fuel that wasm UDFs are allowed to consume in total (to allow performing longer computations in the UDF without detecting an infinite loop) and the hard limit of the size of UDFs that are executed (to avoid large allocations).	2023-01-06 14:07:27 +01:00
Wojciech Mitros	9e6e8de38f	tests: prevent test_wasm from occasionally failing Some cases in test_wasm.py assumed that all cases are run in the same order every time and depended on values that should have been added to tables in previous cases. Because of that, they were sometimes failing. This patch removes this assumption by adding the missing inserts to the affected cases. Additionally, an assert that confirms the low miss rate of udfs is made more precise, and a comment is added to explain it clearly. Closes #11367	2022-08-25 11:32:06 +03:00
Wojciech Mitros	5590493abd	wasm: test instances reuse Add a test for a wasm aggregate function which uses the new metrics to check if the cache has been hit at least once. Also check that the cache can get reused on different queries, by testing that the number of queries is higher than the number of cache misses. Signed-off-by: Wojciech Mitros <wojciech.mitros@scylladb.com>	2022-07-20 18:19:25 +02:00
Wojciech Mitros	9281ba3919	wasm: reuse UDF instances When executing a wasm UDF, most of the time is spent on setting up the instance. To minimize its cost, we reuse the instance using wasm::instance_cache. This patch adds a wasm instance cache, that stores a wasmtime instance for each UDF and scheduling group. The instances are evicted using LRU strategy. The cache may store some entries for the UDF after evicting the instance, but they are evicted when the corresponding UDF is dropped, which greatly limits their number. The size of stored instances is estimated using the size of their WASM memories. In order to be able to read the size of memory, we require that the memory is exported by the client. Signed-off-by: Wojciech Mitros <wojciech.mitros@scylladb.com>	2022-07-20 18:19:22 +02:00
Wojciech Mitros	bfa3c0e734	test: move codes of UDFs compiled to WASM to test/resource After compiling to WASM, UDFs become much larger than the source code. When they're included in test_wasm.py, it becomes difficult to navigate in the file. Moving them to another place does not make understanding the test scripts harder, because the source code is still included. This problem will become even more severe when testing UDFs using WASI. Signed-off-by: Wojciech Mitros <wojciech.mitros@scylladb.com> Closes #10934	2022-07-03 17:37:21 +03:00
Wojciech Mitros	4fd289a78c	wasm: fix freeing in wasm UDFs using WASI To call a UDF that is using WASI, we need to properly configure the wasmtime instance that it will be called on. The configuration was missing from udf_cache::load(), so we add it here. The free function does not return any value, so we should use a calling method that does not expect any returns. This patch adds such a method and uses it. A test that did not pass without this fix and does pass after is added. Signed-off-by: Wojciech Mitros <wojciech.mitros@scylladb.com> Closes #10935	2022-07-01 07:57:45 +02:00
Avi Kivity	5fc093ad42	Merge 'wasm: manage memory using exports from the client' from Wojciech Mitros This patch adds importing the `malloc` and `free` method from the wasm client, and using them for allocating wasm memory for UDF arguments and freeing its result. When the methods are not exported, the old behaviour is used instead. To make that possible, this patch also includes a fix to the usage of pages in wasm memory (methods `size` and `grow`) that were used for allocating memory for arguments until now. (The source codes for the examples didn't work on my machine in their original form, so when updating paging I've also added small unrelated modifications) Tests:unit(dev) Closes #10234 * github.com:scylladb/scylla: wasm: add wasm ABI version 2 wasm: add WASI handling wasm: add documentation wasm: add _scylla_abi export for specifying abi for wasm udfs wasm: update ABI for passing parameters to wasm UDFs wasm: move common code to a separate function wasm: use wasm pages for wasm memory	2022-03-31 12:33:55 +03:00
Wojciech Mitros	56c5459c50	wasm: add null handling for wasm udf As the name suggests, for UDFs defined as RETURNS NULL ON NULL INPUT, we sometimes want to return nulls. However, currently we do not return nulls. Instead, we fail on the null check in init_arg_visitor. Fix by adding null handling before passing arguments, same as in lua. Signed-off-by: Wojciech Mitros <wojciech.mitros@scylladb.com> Closes #10298	2022-03-31 12:27:38 +03:00
Wojciech Mitros	93b082e8d9	wasm: add _scylla_abi export for specifying abi for wasm udfs Different languages may require different ABIs for passing parameters, etc. This patch adds a requirement for all wasm UDFs to export a _scylla_abi symbol that is a 32-bit integer with a value specifying the ABI version. Signed-off-by: Wojciech Mitros <wojciech.mitros@scylladb.com>	2022-03-30 19:37:11 +02:00
Wojciech Mitros	a7ee3ccf52	wasm: update ABI for passing parameters to wasm UDFs WebAssembly uses a 32-bit address space, while also having 64-bit integers as its native types. As a result, when passing the size of an object in memory and its address, they can be combined into one 64-bit value. As a bonus, if the object is null, we can signal it by passing -1 as its size. This patch implements handling of this new ABI and adjusts examples in test_wasm.py. Signed-off-by: Wojciech Mitros <wojciech.mitros@scylladb.com>	2022-03-30 17:13:25 +02:00
Wojciech Mitros	62761a7cf3	wasm: use wasm pages for wasm memory The memory.grow and memory.size wasm methods return the memory size in pages, and memory.grow takes its argument in the number of pages. A WebAssembly page has a size of 64KiB, so during memory allocation we have to divide our desired size in bytes by page size and round up. Similarly, when reading memory size we need to multiply the result by 64KiB to get the size in bytes. The change affects the current naive allocator for arguments when calling wasm UDFs and the examples in test_wasm.py - both commented code and compiled wasm in text representation. Signed-off-by: Wojciech Mitros <wojciech.mitros@scylladb.com>	2022-03-30 17:13:13 +02:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes were applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Piotr Sarna	88480ac504	cql-pytest: relax another condition for a failed wasm execution The previous commit already relaxed the condition for test_fib, but the same should be done for test_fib_called_on_null for an identical reason - more than 1 error can be expected in the case of calling a heavily recursive function, and either fuel exhaustion, or hitting the stack limit, or any other InvalidRequest exception should be accepted. Closes #9363	2021-09-23 14:11:02 +03:00
Piotr Sarna	dd9d6c081e	cql-pytest: relax error conditions for a failed wasm execution Originally, the expected failure for a recursive invocation test case was to expect that fuel gets exhausted, but it's also possible to hit a stack limit first. All errors are equally expected here as long as the execution is halted, so let's relax the condition and accept any wasm-related InvalidRequest errors. Closes #9361	2021-09-20 15:20:52 +03:00
Piotr Sarna	41b94d3cf3	cql-pytest: add wasm-based tests for user-defined functions A first set of wasm-based test cases is added. The tests include verifying that supported types work and that validation of the input wasm is performed.	2021-09-13 19:03:58 +02:00
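
Commits 62761a7cf3 and a7ee3ccf52 above describe two pieces of arithmetic: converting between bytes and 64KiB wasm pages, and combining an object's size and address into one 64-bit value with -1 as the size of a null. The following is a minimal Python sketch of that arithmetic, not the Scylla implementation. The function names are hypothetical, and the bit layout (size in the upper 32 bits, address in the lower 32 bits) is an assumption, since the commit messages do not state which half holds which field.

```python
from typing import Optional, Tuple

WASM_PAGE_SIZE = 64 * 1024  # a WebAssembly page is 64KiB
MASK32 = 0xFFFFFFFF


def bytes_to_pages(size_in_bytes: int) -> int:
    """Divide the desired size by the page size and round up."""
    return (size_in_bytes + WASM_PAGE_SIZE - 1) // WASM_PAGE_SIZE


def pages_to_bytes(pages: int) -> int:
    """Convert a page count (as returned by memory.size) to bytes."""
    return pages * WASM_PAGE_SIZE


def pack_arg(address: int, size: Optional[int]) -> int:
    """Combine size and address into one 64-bit value.

    Assumed layout: size in the upper 32 bits, address in the lower 32.
    A null object is signalled by size -1 (all ones in 32 bits).
    """
    size_bits = MASK32 if size is None else size & MASK32
    return (size_bits << 32) | (address & MASK32)


def unpack_arg(value: int) -> Tuple[int, Optional[int]]:
    """Inverse of pack_arg; returns (address, size), size None for null."""
    size_bits = (value >> 32) & MASK32
    address = value & MASK32
    return address, (None if size_bits == MASK32 else size_bits)
```

For example, a request for 65537 bytes needs two pages, because one page holds exactly 65536 bytes.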

21 Commits
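
Commit 9281ba3919 describes an instance cache keyed by UDF and scheduling group, evicted with an LRU strategy and sized by the instances' wasm memories, and commit 5590493abd tests it through hit and miss metrics. The sketch below illustrates that idea in Python under stated assumptions. It is not the wasm::instance_cache code: the class, its methods, and the counters are hypothetical names, and the size passed in stands in for the estimated wasm memory size.

```python
from collections import OrderedDict


class InstanceCache:
    """Toy LRU cache of UDF instances, bounded by total estimated size."""

    def __init__(self, max_total_size):
        self.max_total_size = max_total_size
        self.total_size = 0
        self.hits = 0
        self.misses = 0
        # (udf_name, scheduling_group) -> (instance, size); oldest first
        self._entries = OrderedDict()

    def get(self, udf_name, scheduling_group, create):
        """Return a cached instance, or build one with create().

        create() must return an (instance, size_in_bytes) pair.
        """
        key = (udf_name, scheduling_group)
        if key in self._entries:
            self._entries.move_to_end(key)  # mark as most recently used
            self.hits += 1
            return self._entries[key][0]
        self.misses += 1
        instance, size = create()
        self._entries[key] = (instance, size)
        self.total_size += size
        self._evict()
        return instance

    def _evict(self):
        # Drop least recently used entries until the cache fits again,
        # always keeping the entry that was just inserted.
        while self.total_size > self.max_total_size and len(self._entries) > 1:
            _, (_, size) = self._entries.popitem(last=False)
            self.total_size -= size

    def drop_udf(self, udf_name):
        """Remove every entry of a dropped UDF, whatever its group."""
        for key in [k for k in self._entries if k[0] == udf_name]:
            _, size = self._entries.pop(key)
            self.total_size -= size


cache = InstanceCache(max_total_size=100)
cache.get("f", "sg1", lambda: ("inst_f", 60))
cache.get("g", "sg1", lambda: ("inst_g", 60))        # evicts "f"
reused = cache.get("g", "sg1", lambda: ("new", 60))  # cache hit
```

Comparing the number of calls with the miss counter, as the test commit does with queries and cache misses, shows whether instances are being reused.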