Commit Graph

1320 Commits

Kefu Chai
68327123ac utils/histogram: drop defaulted copy ctor and assignment operator
as one of the (indirected) member variables has a user-declared move
ctor, this prevents the compiler from generating the default copy ctor
or assignment operator for the classes containing `timer`.

```
/home/kefu/dev/scylladb/utils/histogram.hh:440:5: warning: explicitly defaulted copy constructor is implicitly deleted [-Wdefaulted-function-deleted]
    timed_rate_moving_average_and_histogram(const timed_rate_moving_average_and_histogram&) = default;
    ^
/home/kefu/dev/scylladb/utils/histogram.hh:437:31: note: copy constructor of 'timed_rate_moving_average_and_histogram' is implicitly deleted because field 'met' has a deleted copy constructor
    timed_rate_moving_average met;
                              ^
/home/kefu/dev/scylladb/utils/histogram.hh:298:17: note: copy constructor of 'timed_rate_moving_average' is implicitly deleted because field '_timer' has a deleted copy constructor
    meter_timer _timer;
                ^
/home/kefu/dev/scylladb/utils/histogram.hh:212:13: note: copy constructor of 'meter_timer' is implicitly deleted because field '_timer' has a deleted copy constructor
    timer<> _timer;
            ^
/home/kefu/dev/scylladb/seastar/include/seastar/core/timer.hh:111:5: note: copy constructor is implicitly deleted because 'timer<>' has a user-declared move constructor
    timer(timer&& t) noexcept : _sg(t._sg), _callback(std::move(t._callback)), _expiry(std::move(t._expiry)), _period(std::move(t._period)),
```
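A minimal reproduction of the warning above (type names invented, not the real histogram classes): a member type with a user-declared move constructor has its copy constructor implicitly deleted, so a containing class cannot usefully `= default` its own copy operations.

```cpp
#include <type_traits>

struct timer_like {
    timer_like() = default;
    timer_like(timer_like&&) noexcept = default;  // user-declared move ctor suppresses copy ctor
};

struct holder {
    holder() = default;
    // Implicitly *deleted*, since timer_like is not copyable; clang warns
    // with -Wdefaulted-function-deleted. The commit's fix is to drop such
    // defaulted copy operations instead of declaring them.
    holder(const holder&) = default;
    timer_like t;
};

static_assert(!std::is_copy_constructible_v<timer_like>);
static_assert(!std::is_copy_constructible_v<holder>);
```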

Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>
2023-02-14 19:22:19 +08:00
Kamil Braun
56c4d246ef Merge 'Introduce recent_entries_map datatype to track least recent visited entries.' from Andrii Patsula
Fixes: https://github.com/scylladb/scylladb/issues/12309

Closes #12720

* github.com:scylladb/scylladb:
  service/raft: raft_group_registry: use recent_entries_map to store rate_limits in pinger. Fixes #12309
  utils: introduce recent_entries_map datatype to track least recent visited entries.
2023-02-06 18:01:26 +01:00
Andrii Patsula
c95066a410 utils: introduce recent_entries_map datatype to track least recent visited entries. 2023-02-03 19:04:32 +01:00
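The log doesn't show recent_entries_map's actual interface; a toy sketch of the idea (all names here are invented) is a bounded map that evicts the least recently visited key when full, which is what makes it safe for rate-limiting per-server log messages in the pinger:

```cpp
#include <cstddef>
#include <list>
#include <unordered_map>
#include <utility>

template <typename Key, typename Value>
class recent_entries_map_sketch {
    std::list<Key> _order;  // front = most recently visited
    std::unordered_map<Key, std::pair<typename std::list<Key>::iterator, Value>> _map;
    std::size_t _max;
public:
    explicit recent_entries_map_sketch(std::size_t max) : _max(max) {}

    // Look up (or insert) a key, marking it as the most recently visited.
    Value& visit(const Key& k, Value v = {}) {
        auto it = _map.find(k);
        if (it != _map.end()) {
            _order.splice(_order.begin(), _order, it->second.first);
            return it->second.second;
        }
        if (_map.size() == _max) {       // evict the least recently visited entry
            _map.erase(_order.back());
            _order.pop_back();
        }
        _order.push_front(k);
        return _map.emplace(k, std::make_pair(_order.begin(), std::move(v))).first->second.second;
    }
    bool contains(const Key& k) const { return _map.count(k) != 0; }
    std::size_t size() const { return _map.size(); }
};
```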
Tomasz Grabiec
7bb975eb22 row_cache, lru: Introduce evict_shallow()
Will be used by MVCC tests which don't want (can't) deal with the
row_cache as the container but work with the partition_entry directly.

Currently, rows_entry::on_evicted() assumes that it's embedded in
row_cache and would segfault when trying to evict the containing
partition entry which is not embedded in row_cache. The solution is to
call evict_shallow() from mvcc_tests, which does not attempt to evict
the containing partition_entry.
2023-01-27 21:56:31 +01:00
Tomasz Grabiec
8ae78ffebd mutation_partition_v2: Accept arbitrary preemption source in apply_monotonically()
Will be useful in testing, to exhaustively test preemption scenarios.
2023-01-27 21:56:31 +01:00
Tomasz Grabiec
7e6056b3cc db: Introduce mutation_partition_v2
Intended to be used in memtable/cache, as opposed to the old
mutation_partition which will be intended to be used as temporary
object.

The two will have different trade-offs regarding memory efficiency and
algorithms.

In this commit there is no change in logic, the class is mostly
copied. Some methods which are not needed on the v2 model were removed
from the interface.

Logic changes will be introduced in later commits.
2023-01-27 19:15:39 +01:00
Tomasz Grabiec
27882ff19e db: cache_tracker: Introduce insert() variant which positions before existing entry in the LRU 2023-01-27 19:15:39 +01:00
Marcin Maliszkiewicz
6f055ca5f9 alternator: evaluate expressions as false for stored malformed binary
data

We'll try to distinguish the case when data comes from storage rather
than from a user request. Such an attribute can be used in expressions,
and when it can't be decoded it should make the expression evaluate as
false, to simply exclude the row during a filter query or scan.

Note that this change focuses on binary type, for other types we
may have some inconsistencies in the implementation.
2023-01-16 15:15:27 +01:00
Marcin Maliszkiewicz
86dc1bfdb1 utils: throw error on malformed input in base64 decode
We already fixed the case of missing padding, but there is also a
more generic one, where the input to the decode function contains
non-base64 characters.

This is mostly done for alternator's purposes: it should discard
a request containing such data and return a 400 HTTP error.

Additionally, a harmless integer overflow during integer casting
was fixed here. This was attempted to be fixed by 2d33a3f,
but since we also implicitly cast to uint8_t the problem persisted.
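A hedged sketch of the checks these two base64 commits describe (not ScyllaDB's actual decoder): reject input whose length is not a multiple of 4 (missing padding) or that contains characters outside the base64 alphabet, so the request can be answered with a 400 error instead of storing undecodable data.

```cpp
#include <cstddef>
#include <string>

bool base64_looks_valid(const std::string& in) {
    if (in.empty()) {
        return true;
    }
    if (in.size() % 4 != 0) {
        return false;                 // missing padding
    }
    auto is_b64 = [](char c) {
        return (c >= 'A' && c <= 'Z') || (c >= 'a' && c <= 'z') ||
               (c >= '0' && c <= '9') || c == '+' || c == '/';
    };
    std::size_t pad = 0;
    while (pad < 2 && in[in.size() - 1 - pad] == '=') {
        ++pad;                        // '=' is only legal as trailing padding
    }
    for (std::size_t i = 0; i < in.size() - pad; ++i) {
        if (!is_b64(in[i])) {
            return false;             // non-base64 character
        }
    }
    return true;
}
```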
2023-01-16 14:36:23 +01:00
Marcin Maliszkiewicz
f53c0fd0fc utils: throw error on missing padding in base64 decode
This is done to make alternator behavior more on a par with DynamoDB.
The decode function is used there when processing user requests containing
binary item values. We will now discard improperly formed user input with a 400 HTTP error.

It also makes it more consistent as some of our other base64 functions
may have assumed padding is present.

The patch should not break other usages of base64 functions as the only one is
in db/hints where the code already throws std::runtime_error.

Fixes #6487
2023-01-16 14:36:23 +01:00
Avi Kivity
cb2cb8a606 utils: small_vector: mark throw_out_of_range() const
It can be called from the const version of small_vector::at.

Closes #12493
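The point of the fix, sketched on an invented class shape (not the real small_vector): a helper called from a const member function must itself be const-qualified.

```cpp
#include <cstddef>
#include <stdexcept>

template <typename T, std::size_t N>
struct small_vector_sketch {
    T data[N] = {};
    std::size_t size = 0;

    // const, so that the const overload of at() below may call it
    [[noreturn]] void throw_out_of_range() const {
        throw std::out_of_range("small_vector::at");
    }

    const T& at(std::size_t i) const {
        if (i >= size) {
            throw_out_of_range();
        }
        return data[i];
    }
};
```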
2023-01-11 20:58:53 +02:00
Marcin Maliszkiewicz
61a9816bad utils/rjson: enable inlining in rapidjson library
Due to the lack of the NDEBUG macro, inlining was disabled. It's
important for parsing and printing performance.

Testing with perf_simple_query shows that it reduced around
7000 insns/op, thus increasing median tps by 4.2% for the alternator frontend.

Because inlined functions are called for every character
in the json, this scales with request/response size. When the
default write size is increased by around 7x (from ~180 to ~1255
bytes), the median tps increased by 12%.

Running:
./build/release/test/perf/perf_simple_query_g --smp 1 \
                                --alternator forbid --default-log-level error \
                                --random-seed=1235000092 --duration=60 --write

Results before the patch:

median 46011.50 tps (197.1 allocs/op,  12.1 tasks/op,  170989 insns/op,        0 errors)
median absolute deviation: 296.05
maximum: 46548.07
minimum: 42955.49

Results after the patch:

median 47974.79 tps (197.1 allocs/op,  12.1 tasks/op,  163723 insns/op,        0 errors)
median absolute deviation: 303.06
maximum: 48517.53
minimum: 44083.74

The change affects both json parsing and printing.

Closes #12440
2023-01-04 10:27:35 +02:00
Nadav Har'El
09a3c63345 cross-tree: allow std::source_location in clang 14
We recently (commit 6a5d9ff261) started
to use std::source_location instead of std::experimental::source_location.
However, this does not work on clang 14, because libc++ 12's
<source_location> only works if __builtin_source_location is
available, and that builtin is not available in clang 14.

clang 15 is just three months old, and several relatively-recent
distributions still carry clang 14 so it would be nice to support it
as well.

So this patch adds a trivial compatibility header file which, when
included and compiled with clang 14, aliases the functional
std::experimental::source_location to std::source_location.

It turns out it's enough to include the new header file from the three
headers that included <source_location> - I guess all other uses
of source_location depend on those header files directly or indirectly.
We may later need to include the compatibility header file in additional
places, but for now we don't.
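A sketch of such a compatibility shim, under the assumption that the standard header's availability can be keyed off the `__cpp_lib_source_location` feature-test macro. The real commit aliases into namespace std; this sketch uses a local alias to stay well-defined.

```cpp
#include <version>
#if defined(__cpp_lib_source_location)
#include <source_location>
using sloc = std::source_location;
#else
#include <experimental/source_location>
using sloc = std::experimental::source_location;
#endif

// Returns the source line of the caller, via the usual default-argument trick.
unsigned current_line(sloc where = sloc::current()) {
    return where.line();
}
```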

Refs #12259

Signed-off-by: Nadav Har'El <nyh@scylladb.com>

Closes #12265
2022-12-11 20:28:49 +02:00
Avi Kivity
3a6eafa8c6 utils: observer: qualify seastar::noncopyable_function
gcc checks name resolution eagerly, and can't find noncopyable_function
as this header doesn't include "seastarx.hh". Qualify the name
so it finds it.
2022-11-28 21:58:30 +02:00
Avi Kivity
a2d43bb851 logalloc: disambiguate types and non-type members
logalloc::tracker has some members with the same names as types from
namespace scope. gcc (rightfully) complains that this changes
the meaning of the name. Qualify the types to disambiguate.
2022-11-28 21:58:30 +02:00
Kefu Chai
af011aaba1 utils/variant_element: simplify is_variant_element with right fold
for better readability than the recursive approach.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>

Closes #12091
2022-11-27 16:34:34 +02:00
Piotr Dulikowski
22fbf2567c utils/abi: don't use the deprecated std::unexpected_handler
Recently, clang started complaining about std::unexpected_handler being
deprecated:

```
In file included from utils/exceptions.cc:18:
./utils/abi/eh_ia64.hh:26:10: warning: 'unexpected_handler' is deprecated [-Wdeprecated-declarations]
    std::unexpected_handler unexpectedHandler;
         ^
/usr/bin/../lib/gcc/x86_64-redhat-linux/12/../../../../include/c++/12/exception:84:18: note: 'unexpected_handler' has been explicitly marked deprecated here
  typedef void (*_GLIBCXX11_DEPRECATED unexpected_handler) ();
                 ^
/usr/bin/../lib/gcc/x86_64-redhat-linux/12/../../../../include/c++/12/x86_64-redhat-linux/bits/c++config.h:2343:32: note: expanded from macro '_GLIBCXX11_DEPRECATED'
                               ^
/usr/bin/../lib/gcc/x86_64-redhat-linux/12/../../../../include/c++/12/x86_64-redhat-linux/bits/c++config.h:2334:46: note: expanded from macro '_GLIBCXX_DEPRECATED'
                                             ^
1 warning generated.
```

According to cppreference.com, it was deprecated in C++11 and removed in
C++17 (!).

This commit gets rid of the warning by inlining the
std::unexpected_handler typedef, which is defined as a pointer to a
function taking no arguments and returning void.
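Sketched concretely (the struct name is invented; only the field name comes from the warning above): spell out the underlying function-pointer type instead of naming the deprecated typedef.

```cpp
// What std::unexpected_handler was: pointer to a function taking no
// arguments and returning void.
using handler_fn = void (*)();

struct cxa_exception_sketch {
    handler_fn unexpectedHandler = nullptr;
};

int g_handler_calls = 0;
void count_call() { ++g_handler_calls; }
```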

Fixes: #12022

Closes #12074
2022-11-27 12:25:20 +02:00
Botond Dénes
437fcdeeda Merge 'Make use of enum_set in directory lister' from Pavel Emelyanov
The lister accepts a filter of sorts -- which kinds of entries to list: regular files, directories, or both. It currently uses unordered_set, but enum_set is shorter and better describes the intent.

Closes #12017

* github.com:scylladb/scylladb:
  lister: Make lister::dir_entry_types an enum_set
  database: Avoid useless local variable
2022-11-18 12:15:26 +02:00
Pavel Emelyanov
bc62ca46d4 lister: Make lister::dir_entry_types an enum_set
This type is currently an unordered_set, but only consists of at most
two elements. Making it an enum_set renders it into a size_t variable
and better describes the intention.
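A toy illustration of why this is attractive (this is not seastar's utils/enum_set API): a set over a small enum fits into a single size_t used as a bitmask, instead of an unordered_set of enumerators.

```cpp
#include <cstddef>

enum class dir_entry_type { regular = 0, directory = 1 };

struct enum_set_sketch {
    std::size_t mask = 0;   // the whole set is one size_t
    void set(dir_entry_type t) {
        mask |= std::size_t(1) << static_cast<std::size_t>(t);
    }
    bool contains(dir_entry_type t) const {
        return mask & (std::size_t(1) << static_cast<std::size_t>(t));
    }
};
```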

Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
2022-11-17 19:01:45 +03:00
Botond Dénes
e925c41f02 utils/gs/barrett.hh: aarch64: s/brarett/barrett/
Fix a typo introduced by the recent patch fixing the spelling of
Barrett. That patch introduced a typo in the aarch64 version of the code,
which wasn't caught by promotion, as that only builds on x86_64.

Closes #12006
2022-11-17 11:09:59 +02:00
Avi Kivity
3497891cf9 utils: spell "barrett" correctly
As P. T. Barnum famously said, "write what you like but spell my name
correctly". Following that, we correct the spelling of Barrett's name
in the source tree.

Closes #11989
2022-11-16 16:30:38 +02:00
Nadav Har'El
8a4ab87e44 Merge 'utils: crc: generate crc barrett fold tables at compile time' from Avi Kivity
We use Barrett tables (misspelled in the code unfortunately) to fold
crc computations of multiple buffers into a single crc. This is important
because it turns out to be faster to compute crc of three different buffers
in parallel rather than compute the crc of one large buffer, since the crc
instruction has latency 3.

Currently, we have a separate code generation step to compute the
fold tables. The step generates a new C++ source files with the tables.
But modern C++ allows us to do this computation at compile time, avoiding
the code generation step. This simplifies the build.

This series does that. There is some complication in that the code uses
compiler intrinsics for the computation, and these are not constexpr friendly.
So we first introduce constexpr-friendly alternatives and use them.

To prove the transformation is correct, I compared the generated code from
before the series and from just before the last step (where we use constexpr
evaluation but still retain the generated file) and saw no difference in the values.

Note that constexpr is not strictly needed - we could have run the code in the
global variables' initializer. But that would cause a crash if we run on a pre-clmul
machine, and is not as fun.

Closes #11957

* github.com:scylladb/scylladb:
  test: crc: add unit tests for constexpr clmul and barrett fold
  utils: crc combine table: generate at compile time
  utils: barrett: inline functions in header
  utils: crc combine table: generate tables at compile time
  utils: crc combine table: extract table generation into a constexpr function
  utils: crc combine table: extract "pow table" code into constexpr function
  utils: crc combine table: store tables in std::array rather than a C array
  utils: barrett: make the barrett reduction constexpr friendly
  utils: clmul: add 64-bit constexpr clmul
  utils: barrett: extract barrett reduction constants
  utils: barrett: reorder functions
  utils: make clmul() constexpr
2022-11-15 14:21:48 +01:00
Avi Kivity
70217b5109 utils: crc combine table: generate at compile time
By now the crc combine tables are generated at compile time,
but still in a separate code generation step. We now eliminate
the code generation step and instead link the global variables
directly into the main executable. The global variables have
been conveniently named exactly as the code generation step
names them, so we don't need to touch any users.
2022-11-12 17:26:45 +02:00
Avi Kivity
164e991181 utils: barrett: inline functions in header
Avoid duplicate definitions if the same header is used from more than
one place, as it will soon be.
2022-11-12 17:26:08 +02:00
Avi Kivity
a4f06773da utils: crc combine table: generate tables at compile time
Move the tables into global constinit variables that are
generated at compile time. Note the code that creates
the generated crc32_combine_table.cc is still called; it
transorms compile-time generated tables into a C++ source
that contains the same values, as literals.

If we generate a diff between gen/utils/gz/crc_combine_table.cc
before this series and after this patch, we see the only change
in the file is the type of the variable (which changed to
std::array), proving our constexpr code is correct.
2022-11-12 17:16:59 +02:00
Avi Kivity
a229fdc41e utils: crc combine table: extract table generation into a constexpr function
Move the code to a constexpr function, so we can later generate the tables at
compile time. Note that although the function is constexpr, it is still
evaluated at runtime, since the calling function (main()) isn't constexpr
itself.
2022-11-12 17:13:52 +02:00
Avi Kivity
d42bec59bb utils: crc combine table: extract "pow table" code into constexpr function
A "pow table" is used to generate the Barrett fold tables. Extract its
code into a constexpr function so we can later generate the fold tables
at compile time.
2022-11-12 17:11:44 +02:00
Avi Kivity
6e34014b64 utils: crc combine table: store tables in std::array rather than a C array
C arrays cannot be returned from functions and therefore aren't suitable
for constexpr processing. std::array<> is a regular value and so is
constexpr friendly.
2022-11-12 17:09:02 +02:00
Avi Kivity
1e9252f79a utils: barrett: make the barrett reduction constexpr friendly
Dispatch to intrinsics or constexpr based on evaluation context.
2022-11-12 17:04:44 +02:00
Avi Kivity
0bd90b5465 utils: clmul: add 64-bit constexpr clmul
This is used when generating the Barrett reduction tables, and also when
applying the Barrett reduction at runtime, so we need it to be constexpr
friendly.
2022-11-12 17:04:05 +02:00
Avi Kivity
c376c539b8 utils: barrett: extract barrett reduction constants
The constants are repeated across x86_64 and aarch64, so extract
them into a common definition.
2022-11-12 17:00:17 +02:00
Avi Kivity
2fdf81af7b utils: barrett: reorder functions
Reorder functions in dependency order rather than forward
declaring them. This makes them more constexpr-friendly.
2022-11-12 16:52:41 +02:00
Avi Kivity
8aa59a897e utils: make clmul() constexpr
clmul() is a pure function and so should already be constexpr,
but it uses intrinsics that aren't defined as constexpr and
so the compiler can't really compute it at compile time.

Fix by defining a constexpr variant and dispatching based
on whether we're being constant-evaluated or not.

The implementation is simple, but in any case proof that it
is correct will be provided later on.
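A portable sketch of such a constexpr carry-less multiply (32x32 -> 64 bit, shift-and-xor over GF(2)). The commit dispatches between a loop like this and the hardware PCLMUL intrinsic via std::is_constant_evaluated(); only the portable branch is sketched here.

```cpp
#include <cstdint>

constexpr uint64_t clmul_u32(uint32_t a, uint32_t b) {
    uint64_t r = 0;
    for (int i = 0; i < 32; ++i) {
        if (b & (uint32_t(1) << i)) {
            r ^= uint64_t(a) << i;   // xor, not add: no carry propagation
        }
    }
    return r;
}

// (x + 1) * (x + 1) = x^2 + 1 over GF(2): 0b11 * 0b11 == 0b101
static_assert(clmul_u32(3, 3) == 5);
```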
2022-11-12 16:49:43 +02:00
Benny Halevy
1a183047c0 utils: config_src: add set_value_on_all_shards functions
Currently when we set a single value we need
to call broadcast_to_all_shards to let observers on all
shards get notified of the new value.

However, the latter broadcasts all values to all shards,
so it's terribly inefficient.

Instead, add async set_value_on_all_shards functions
to broadcast a value to all shards.

Use those in system_keyspace for db_config_table virtual table
and in task_manager_test to update the task_manager ttl.

Refs #7316

Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
2022-11-09 11:55:14 +02:00
Benny Halevy
e83f42ec70 utils: config_file: add config_source::API
For task_manager test api.

Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
2022-11-09 11:53:20 +02:00
Tomasz Grabiec
4ff204c028 Merge 'cache: make all removals of cache items explicit' from Michał Chojnowski
This series is a step towards non-LRU cache algorithms.

Our cache items are able to unlink themselves from the LRU list. (In other words, they can be unlinked solely via a pointer to the item, without access to the containing list head). Some places in the code make use of that, e.g. by relying on auto-unlink of items in their destructor.

However, to implement algorithms smarter than LRU, we might want to update some cache-wide metadata on item removal. But any cache-wide structures are unreachable through an item pointer, since items only have access to themselves and their immediate neighbours. Therefore, we don't want items to unlink themselves — we want `cache.remove(item)`, rather than `item.remove_self()`, because the former can update the metadata in `cache`.

This series inserts explicit item unlink calls in places that were previously relying on destructors, gets rid of other self-unlinks, and adds an assert which ensures that every item is explicitly unlinked before destruction.
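A toy illustration of the design point (not the real utils/lru code): removal goes through the cache object, which can update cache-wide bookkeeping, rather than the item unlinking itself.

```cpp
#include <cstddef>
#include <list>

struct lru_sketch {
    std::list<int> items;
    std::size_t removals = 0;   // cache-wide metadata an item cannot reach on its own

    // cache.remove(item) rather than item.remove_self(): only the cache
    // can keep its metadata consistent with the removal.
    void remove(std::list<int>::iterator it) {
        items.erase(it);
        ++removals;
    }
};
```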

Closes #11716

* github.com:scylladb/scylladb:
  utils: lru: assert that evictables are unlinked before destruction
  utils: lru: remove unlink_from_lru()
  cache: make all cache unlinks explicit
2022-10-17 12:47:02 +02:00
Michał Chojnowski
a96433d3a4 utils: lru: assert that evictables are unlinked before destruction
Previous patches introduce the assumption that evictables are manually unlinked
before destruction, to allow for correct bookkeeping within the cache.
This assert assures that this assumption is correct.

This is particularly important because the switch from automatic to explicit
unlinking had to be done manually. Destructor calls are invisible, so it's
possible that we have missed some automatic destruction site.
2022-10-17 12:07:27 +02:00
Michał Chojnowski
f340c9cca5 utils: lru: remove unlink_from_lru()
unlink_from_lru() allows for unlinking elements from cache without notifying
the cache. This messes up any potential cache bookkeeping.
Improve that by replacing all uses of unlink_from_lru() with calls to
lru::remove(), which does have access to the cache's metadata.
2022-10-17 12:07:27 +02:00
Michał Chojnowski
d785364375 cache: make all cache unlinks explicit
Our LSA cache is implemented as an auto_unlink Boost intrusive list, meaning
that elements of the list unlink themselves from the list automatically on
destruction. Some parts of the code rely on that, and don't unlink them
manually.

However, this precludes accurate bookkeeping about the cache. Elements only have
access to themselves and their neighbours, not to any bookkeeping context.
Therefore, a destructor cannot update the relevant metadata.

In this patch, we fix this by adding explicit unlink calls to places where it
would be done by a destructor. In a following patch, we will add an assert to
the destructor to check that every element is unlinked before destruction.
2022-10-17 12:07:27 +02:00
Avi Kivity
20bad62562 Merge 'Detect and record large collections' from Benny Halevy
This series adds support for detecting collections that have too many items
and recording them in `system.large_cells`.

A configuration variable was added to db/config: `compaction_collection_items_count_warning_threshold` set by default to 10000.
Collections that have more items than this threshold will be warned about and will be recorded as a large cell in the `system.large_cells` table.  Documentation has been updated respectively.

A new column was added to system.large_cells: `collection_items`.
Similar to the `rows` column in system.large_partitions, `collection_items` holds the number of items in a collection when the large cell is a collection, or 0 if it isn't. Note that the collection may be recorded in system.large_cells either due to its size, like any other cell, and/or due to the number of items in it, if it crosses the said threshold.

Note that #11449 called for a new system.large_collections table, but extending system.large_cells follows the logic of system.large_partitions and is a smaller change overall, hence it was preferred.

Since the system keyspace schema is hard coded, the schema version of system.large_cells was bumped, and since the change is not backward compatible, we added a cluster feature - `LARGE_COLLECTION_DETECTION` - to enable using it.
The large_data_handler large cell detection record function will populate the new column only when the new cluster feature is enabled.

In addition, unit tests were added in sstable_3_x_test for testing large cells detection by cell size, and large_collection detection by the number of items.

Closes #11449

Closes #11674

* github.com:scylladb/scylladb:
  sstables: mx/writer: optimize large data stats members order
  sstables: mx/writer: keep large data stats entry as members
  db: large_data_handler: dynamically update config thresholds
  utils/updateable_value: add transforming_value_updater
  db/large_data_handler: cql_table_large_data_handler: record large_collections
  db/large_data_handler: pass ref to feature_service to cql_table_large_data_handler
  db/large_data_handler: cql_table_large_data_handler: move ctor out of line
  docs: large-rows-large-cells-tables: fix typos
  db/system_keyspace: add collection_elements column to system.large_cells
  gms/feature_service: add large_collection_detection cluster feature
  test: sstable_3_x_test: add test_sstable_too_many_collection_elements
  test: lib: simple_schema: add support for optional collection column
  test: lib: simple_schema: build schema in ctor body
  test: lib: simple_schema: cql: define s1 as static only if built this way
  db/large_data_handler: maybe_record_large_cells: consider collection_elements
  db/large_data_handler: debug cql_table_large_data_handler::delete_large_data_entries
  sstables: mx/writer: pass collection_elements to writer::maybe_record_large_cells
  sstables: mx/writer: add large_data_type::elements_in_collection
  db/large_data_handler: get the collection_elements_count_threshold
  db/config: add compaction_collection_elements_count_warning_threshold
  test: sstable_3_x_test: add test_sstable_write_large_cell
  test: sstable_3_x_test: pass cell_threshold_bytes to large_data_handler
  test: sstable_3_x_test: large_data_handler: prepare callback for testing large_cells
  test: sstable_3_x_test: large_data tests: use BOOST_REQUIRE_[GL]T
  test: sstable_3_x_test: test_sstable_log_too_many_rows: use tests::random
2022-10-06 18:28:21 +03:00
Benny Halevy
6d582054c0 utils/updateable_value: add transforming_value_updater
Automatically updates a value from a utils::updateable_value,
where the two can be of different types.
An optional transform function can provide an additional transformation
when updating the value, for example multiplying it by a factor for
unit conversion.

To be used for auto-updating the large data thresholds
from the db::config.
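A toy sketch of the idea (not utils::updateable_value's actual API): keep a target of one type updated from a source value of another type through a transform function, e.g. unit conversion.

```cpp
#include <functional>

template <typename From, typename To>
struct transforming_updater_sketch {
    To* target;
    std::function<To(const From&)> transform;

    // Invoked whenever the observed source value changes.
    void operator()(const From& new_value) {
        *target = transform(new_value);
    }
};
```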

Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
2022-10-05 10:52:49 +03:00
Pavel Emelyanov
7ba1f551f3 exceptions: Mark storage_io_error::code() with noexcept
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
2022-10-03 18:50:06 +03:00
Michał Chojnowski
4563cbe595 logalloc: prevent false positives in reclaim_timer
reclaim_timer uses a coarse clock, but does not account for
the measurement error introduced by that -- it can falsely
report reclaims as stalls, even if they are shorter by a full
coarse clock tick from the requested threshold
(blocked-reactor-notify-ms).

Notably, if the stall threshold happens to be smaller or equal to coarse
clock resolution, Scylla's log gets spammed with false stall reports.
The resolution of coarse clocks in Linux is 1/CONFIG_HZ. This is
typically equal to 1 ms or 4 ms, and stall thresholds of this order
can occur in practice.

Eliminate false positives by requiring the measured reclaim duration to
be at least 1 clock tick longer than the configured threshold for it to
be considered a stall.
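The rule above can be sketched as a small predicate: with a coarse clock whose resolution is `tick`, a measured duration only proves a stall when it exceeds the threshold by at least one full tick, since the measurement can over-report by up to one tick.

```cpp
#include <chrono>

constexpr bool is_stall(std::chrono::milliseconds measured,
                        std::chrono::milliseconds threshold,
                        std::chrono::milliseconds tick) {
    // "at least 1 clock tick longer than the configured threshold"
    return measured >= threshold + tick;
}
```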

Fixes #10981

Closes #11680
2022-10-02 13:41:40 +03:00
Avi Kivity
2cec417426 Merge 'tools: use the standard allocator' from Botond Dénes
Tools want to be as little disrupting to the environment they run in as possible, because they might be run in a production environment, next to a running scylladb production server. As such, the usual behavior of seastar applications w.r.t. memory is an anti-pattern for tools: they don't want to reserve most of the system memory, in fact they don't want to reserve any amount, instead consuming as much as needed on-demand.
To achieve this, tools want to use the standard allocator. They therefore need a seastar option to instruct seastar *not* to configure and use the seastar allocator, and they need LSA to cooperate with the standard allocator.
The former is provided by https://github.com/scylladb/seastar/pull/1211.
The latter is solved by introducing the concept of a `segment_store_backend`, which abstracts away how the memory arena for segments is acquired and managed. We then refactor the existing segment store so that the seastar allocator specific parts are moved to an implementation of this backend concept, then we introduce another backend implementation appropriate to the standard allocator.
Finally, tools configure seastar with the newly introduced option to use the standard allocator and similarly configure LSA to use the standard allocator appropriate backend.

Refs: https://github.com/scylladb/scylladb/issues/9882
This is the last major code piece in scylla for making tools production ready.

Closes #11510

* github.com:scylladb/scylladb:
  test/boost: add alternative variant of logalloc test
  tools: use standard allocator
  utils/logalloc: add use_standard_allocator_segment_pool_backend()
  utils/logalloc: introduce segment store backend for standard allocator
  utils/logalloc: rebase release segment-store on segment-store-backend
  utils/logalloc: introduce segment_store_backend
  utils/logalloc: push segment alloc/dealloc to segment_store
  test/boost/logalloc_test: make test_compaction_with_multiple_regions exception-safe
2022-09-20 12:59:34 +03:00
Nadav Har'El
4c93a694b7 cql: validate bloom_filter_fp_chance up-front
Scylla's Bloom filter implementation has a minimal false-positive rate
that it can support (6.71e-5). When setting bloom_filter_fp_chance any
lower than that, the compute_bloom_spec() function, which writes the bloom
filter, throws an exception. However, this is too late - it only happens
while flushing the memtable to disk, and a failure at that point causes
Scylla to crash.

Instead, we should refuse the table creation with the unsupported
bloom_filter_fp_chance. This is also what Cassandra did six years ago -
see CASSANDRA-11920.

This patch also includes a regression test, which crashes Scylla before
this patch but passes after the patch (and also passes on Cassandra).
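A sketch of the up-front check (the function name is invented; the minimum 6.71e-5 is quoted from this commit message): refuse an unsupported bloom_filter_fp_chance at table-creation time rather than failing later during memtable flush.

```cpp
#include <stdexcept>

constexpr double min_supported_fp_chance = 6.71e-5;

void validate_bloom_filter_fp_chance(double fp_chance) {
    if (fp_chance < min_supported_fp_chance) {
        // refuse table creation up front instead of crashing at flush time
        throw std::invalid_argument("bloom_filter_fp_chance is too small");
    }
}
```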

Fixes #11524.

Signed-off-by: Nadav Har'El <nyh@scylladb.com>

Closes #11576
2022-09-20 06:18:51 +03:00
Botond Dénes
a55903c839 utils/logalloc: add use_standard_allocator_segment_pool_backend()
Creating a standard-memory-allocator backend for the segment store.
This is targeted towards tools, which want to configure LSA with a
segment store backend that is appropriate for the standard allocator
(which they want to use).
We want to be able to use this in both release and debug mode. The
former will be used by tools and the latter will be used to run the
logalloc tests with this new backend, making sure it works and doesn't
regress. For the latter, we have to allow the release and debug stores
to coexist in the same build and for the debug store to be able to
delegate to the release store when the standard allocator backend is
used.
2022-09-16 13:02:40 +03:00
Botond Dénes
c1c74005b7 utils/logalloc: introduce segment store backend for standard allocator
To be used by tools, this store backend is compatible with the standard
allocator as it acquires the memory arena for segments via mmap().
2022-09-16 12:16:57 +03:00
Botond Dénes
d2a7ebbe66 utils/logalloc: rebase release segment-store on segment-store-backend
Rebase the seastar allocator based segment store implementation on the
recently introduced segment store backend, which now abstracts away
how memory for segments is obtained.
This patch also introduces an explicit `segment_npos` to be used for
cases when a segment -> index mapping fails (segment doesn't belong to
the store). Currently the seastar allocator based store simply doesn't
handle this case, while the standard allocator based store uses 0 as the
implicit invalid index.
2022-09-16 12:16:57 +03:00
Botond Dénes
3717f7740d utils/logalloc: introduce segment_store_backend
We want to make it possible to select the segment-store to be used for
LSA -- the seastar allocator based one or the standard allocator based
one -- at runtime. Currently this choice is made at compile time via
preprocessor switches.
The current standard memory based store is specialized for the debug build;
we want something more similar to the seastar standard memory allocator
based one. So we introduce a segment store backend for the current
seastar allocator based store, which abstracts how the backing memory
for all segments is allocated/freed, while keeping the segment <-> index
mapping common. In the next patches we will rebase the current seastar
allocator based segment store on this backend and later introduce
another backend for standard allocator, targeted for release builds.
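A toy sketch of the split this commit introduces (class names invented, POSIX assumed): a backend interface abstracts how the backing memory for segments is obtained, and a standard-allocator-friendly backend can acquire its arena via mmap(), as described in the commits above.

```cpp
#include <cstddef>
#include <sys/mman.h>

struct segment_store_backend_sketch {
    virtual ~segment_store_backend_sketch() = default;
    virtual void* allocate(std::size_t bytes) = 0;
    virtual void deallocate(void* p, std::size_t bytes) = 0;
};

// Backend compatible with the standard allocator: memory comes from
// anonymous mmap() rather than from the seastar allocator's arena.
struct mmap_backend_sketch final : segment_store_backend_sketch {
    void* allocate(std::size_t bytes) override {
        void* p = ::mmap(nullptr, bytes, PROT_READ | PROT_WRITE,
                         MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
        return p == MAP_FAILED ? nullptr : p;
    }
    void deallocate(void* p, std::size_t bytes) override {
        ::munmap(p, bytes);
    }
};
```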
2022-09-16 12:16:57 +03:00
Botond Dénes
5ea4d7fb39 utils/logalloc: push segment alloc/dealloc to segment_store
Currently the actual alloc/dealloc of memory for segments is located
outside the segment stores. We want to abstract away how segments are
allocated, so we move this logic too into the segment store. For now
this results in duplicate code in the two segment store implementations,
but this will soon be gone.
2022-09-16 12:16:57 +03:00