scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-21 23:32:15 +00:00

Author	SHA1	Message	Date
Piotr Dulikowski	25fec0acce	gms/feature_service: introduce SECONDARY_INDEXES_ON_STATIC_COLUMNS cluster feature The new feature will prevent secondary indexes on static columns from being created unless the whole cluster is ready to support them.	2022-12-06 11:21:16 +01:00
Piotr Dulikowski	9f14f0ac09	create_index_statement: disallow creation of local indexes with static columns Local indexes on static columns don't make sense because there is only one static row per partition. It's always better to just run SELECT DISTINCT on the base table. Allowing for such an index would only make such queries slower (due to double lookup), would take unnecessary space and could pose potential consistency problems, so this commit explicitly forbids them.	2022-12-06 11:21:16 +01:00
Piotr Dulikowski	8c4cdfc2db	select_statement: prepare paging for indexes on static columns When performing a query on a table which is accelerated by a secondary index, the paging state returned along with the query contains a partition key and a clustering key of the secondary index table. The logic wasn't prepared to handle the case of secondary indexes on static columns - notably, it tried to put base table's clustering key columns into the paging state which caused problems in other places. This commit fixes the paging logic so that the PK and CK of a secondary index table is calculated correctly. However, this solution has a major drawback: because it is impossible to encode clustering key of the base table in the paging state, partitions returned by queries accelerated by secondary indexes on static columns will _not_ be split by paging. This can be problematic in case there are large partitions in the base table. The main advantage of this fix is that it is simple. Moreover, the problem described above is not unique to static column indexes, but also happens e.g. in case of some indexes on clustering columns (see case 2 of scylladb/scylla#7432). Fixing this issue will require a more sophisticated solution and may affect more than only secondary indexes on static columns, so this is left for a followup.	2022-12-06 11:21:16 +01:00
Piotr Dulikowski	ba390072c5	select_statement: do not attempt to fetch clustering columns from secondary index's table The previous commit made sure that the index table for secondary indexes on static tables don't have columns corresponding to clustering rows in the base table - therefore, we must make sure that we don't try to fetch them when querying the index table.	2022-12-06 11:21:16 +01:00
Piotr Dulikowski	983b440a81	secondary_index_manager: don't add clustering key columns to index table of static column index The implementation of secondary indexes on static columns relies on the fact that the index table only includes partition key columns of the base table, but not clustering key columns. A static column's value determines a set of full partitions, so including the clustering key would only be redundant. It would also generate more work as a single static column update would require a large portion of the index to be updated. This commit makes sure that clustering columns are not included in the index table for indexes based on a static column.	2022-12-06 11:21:16 +01:00
Piotr Dulikowski	6ab41d76e6	replica/table: adjust the view read-before-write to return static rows when needed Adjusts the read-before-write query issued in `table::do_push_view_replica_updates` so that, when needed, requests static columns and makes sure that the static row is present.	2022-12-06 11:21:16 +01:00
Piotr Dulikowski	18be90b1e6	db/view: process static rows in view_update_builder::on_results The `view_update_builder::on_results()` function is changed to react to static rows when comparing read-before-write results with the base table mutation.	2022-12-06 11:21:16 +01:00
Piotr Dulikowski	2dd95d76f1	db/view: adjust existing view update generation path to use clustering_or_static_row The view update path is modified to use `clustering_or_static_row` instead of just `clustering_row`.	2022-12-06 11:21:16 +01:00
Piotr Dulikowski	b0a31bb7a7	column_computation: adjust to use clustering_or_static_row Adjusts the column_computation interface so that it is able to accept both clustering and static rows through the common db::view::clustering_or_static_row interface.	2022-12-06 11:21:16 +01:00
Piotr Dulikowski	986ab6034c	db/view: add clustering_or_static_row Adds a `clustering_or_static_row`, which is a common, immutable representation of either a static or clustering row. It will allow to handle view update generation based on static or clustering rows in a uniform way.	2022-12-06 11:21:16 +01:00
Piotr Dulikowski	05d4328f02	deletable_row: add column_kind parameter to is_live While deletable_row is used to hold regular columns of a clustering row, its name or implementation doesn't suggest that it is a requirement. In fact, some of its methods already take a column_kind parameter which is used to interpret the kind of columns held in the row. This commit removes the assumption about the column kind from the `deletable_row::is_live` method.	2022-12-06 11:21:16 +01:00
Piotr Dulikowski	27c81432cd	view_info: adjust view_column to accept column_kind The `view_info::view_column()` and `view_column` in view.cc allow to get a view's column definition which corresponds to given base table's column. They currently assume that the given column id corresponds to a regular column. In preparation for secondary indexes based on static columns, this commit adjusts those functions so that they accept other kinds of columns, including static columns.	2022-12-06 11:21:16 +01:00
Piotr Dulikowski	f7b7724eaf	db/view: base_dependent_view_info: split non-pk columns into regular and static Currently, `base_dependent_view_info::_base_non_pk_columns_in_view_pk` field keeps a list of non-primary-key columns from the base table which are a part of the view's primary key. Because the current code does not allow indexes on static columns yet, the columns kept in the aforementioned field are always assumed to be regular columns of the base table and are kept as `column_id`s which do not contain information about the column kind. This commit splits the `_base_non_pk_columns_in_view_pk` field into two, one for regular columns and the other for static columns, so that it is possible to keep both kinds of columns in `base_dependent_view_info` and the structure can be used for secondary indexes on static columns.	2022-12-06 11:21:16 +01:00
Avi Kivity	fe4d7fbdf2	Update abseil submodule * abseil 7f3c0d78...4e5ff155 (125): > Add a compilation test for recursive hash map types > Add AbslStringify support for enum types in Substitute. > Use a c++14-style constexpr initialization if c++14 constexpr is available. > Move the vtable into a function to delay instantiation until the function is called. When the variable is a global the compiler is allowed to instantiate it more aggresively and it might happen before the types involved are complete. When it is inside a function the compiler can't instantiate it until after the functions are called. > Cosmetic reformatting in a test. > Reorder base64 unescape methods to be below the escaping methods. > Fixes many compilation issues that come from having no external CI coverage of the accelerated CRC implementation and some differences bewteen the internal and external implementation. > Remove static initializer from mutex.h. > Import of CCTZ from GitHub. > Remove unused iostream include from crc32c.h > Fix MSVC builds that reject C-style arrays of size 0 > Remove deprecated use of absl::ToCrc32c() > CRC: Make crc32c_t as a class for explicit control of operators > Convert the full parser into constexpr now that Abseil requires C++14, and use this parser for the static checker. This fixes some outstanding bugs where the static checker differed from the dynamic one. Also, fix `%v` to be accepted with POSIX syntax. > Write (more) directly into the structured buffer from StringifySink, including for (size_t, char) overload. > Avoid using the non-portable type __m128i_u. > Reduce flat_hash_{set,map} generated code size. > Use ABSL_HAVE_BUILTIN to fix -Wundef __has_builtin warning > Add a TODO for the deprecation of absl::aligned_storage_t > TSAN: Remove report_atomic_races=0 from CI now that it has been fixed > absl: fix Mutex TSan annotations > CMake: Remove trailing commas in `AbseilDll.cmake` > Fix AMD cpu detection. > CRC: Get CPU detection and hardware acceleration working on MSVC x86(_64) > Removing trailing period that can confuse a url in str_format.h. > Refactor btree iterator generation code into a base class rather than using ifdefs inside btree_iterator. > container.h: fix incorrect comments about the location of <numeric> algorithms. > Zero encoded_remaining when a string field doesn't fit, so that we don't leave partial data in the buffer (all decoders should ignore it anyway) and to be sure that we don't try to put any subsequent operands in either (there shouldn't be enough space). > Improve error messages when comparing btree iterators when generations are enabled. > Document the WebSafe* and WithPadding variants more concisely, as deltas from Base64Encode. > Drop outdated comment about LogEntry copyability. > Release structured logging. > Minor formatting changes in preparation for structured logging... > Update absl::make_unique to reflect the C++14 minimum > Update Condition to allocate 24 bytes for MSVC platform pointers to methods. > Add missing include > Refactor "RAW: " prefix formatting into FormatLogPrefix. > Minor formatting changes due to internal refactoring > Fix typos > Add a new API for `extract_and_get_next()` in b-tree that returns both the extracted node and an iterator to the next element in the container. > Use AnyInvocable in internal thread_pool > Remove absl/time/internal/zoneinfo.inc. It was used to guarantee availability of a few timezones for "time_test" and "time_benchmark", but (file-based) zoneinfo is now secured via existing Bazel data/env attributes, or new CMake environment settings. > Updated documentation on use of %v Also updated documentation around FormatSink and PutPaddedString > Use the correct Bazel copts in crc targets > Run the //absl/time timezone tests with a data dependency on, and a matching ${TZDIR} setting for, //absl/time/internal/cctz:zoneinfo. > Stop unnecessary clearing of fields in ~raw_hash_set. > Fix throw_delegate_test when using libc++ with shared libraries > CRC: Ensure SupportsArmCRC32PMULL() is defined > Improve error messages when comparing btree iterators. > Refactor the throw_delegate test into separate test cases > Replace std::atomic_flag with std::atomic<bool> to avoid the C++20 deprecation of ATOMIC_FLAG_INIT. > Add support for enum types with AbslStringify > Release the CRC library > Improve error messages when comparing swisstable iterators. > Auto increase inlined capacity whenever it does not affect class' size. > drop an unused dep > Factor out the internal helper AppendTruncated, which is used and redefined in a couple places, plus several more that have yet to be released. > Fix some invalid iterator bugs in btree_test.cc for multi{set,map} emplace{_hint} tests. > Force a conservative allocation for pointers to methods in Condition objects. > Fix a few lint findings in flags' usage.cc > Narrow some _MSC_VER checks to not catch clang-cl. > Small cleanups in logging test helpers > Import of CCTZ from GitHub. > Merge pull request abseil/abseil-cpp#1287 from GOGOYAO:patch-1 > Merge pull request abseil/abseil-cpp#1307 from KindDragon:patch-1 > Stop disabling some test warnings that have been fixed > Support logging of user-defined types that implement `AbslStringify()` > Eliminate span_internal::Min in favor of std::min, since Min conflicts with a macro in a third-party library. > Fix -Wimplicit-int-conversion. > Improve error messages when dereferencing invalid swisstable iterators. > Cord: Avoid leaking a node if SetExpectedChecksum() is called on an empty cord twice in a row. > Add a warning about extract invalidating iterators (not just the iterator of the element being extracted). > CMake: installed artifacts reflect the compiled ABI > Import of CCTZ from GitHub. > Import of CCTZ from GitHub. > Support empty Cords with an expected checksum > Move internal details from one source file to another more appropriate source file. > Removes `PutPaddedString()` function > Return uint8_t from CappedDamerauLevenshteinDistance. > Remove the unknown CMAKE_SYSTEM_PROCESSOR warning when configuring ABSL_RANDOM_RANDEN_COPTS > Enforce Visual Studio 2017 (MSVC++ 15.0) minumum > `absl::InlinedVector::swap` supports non-assignable types. > Improve b-tree error messages when dereferencing invalid iterators. > Mutex: Fix stall on single-core systems > Document Base64Unescape() padding > Fix sign conversion warnings in memory_test.cc. > Fix a sign conversion warning. > Fix a truncation warning on Windows 64-bit. > Use btree iterator subtraction instead of std::distance in erase_range() and count(). > Eliminate use of internal interfaces and make the test portable and expose it to OSS. > Fix various warnings for _WIN32. > Disables StderrKnobsDefault due to order dependency > Implement btree_iterator::operator-, which is faster than std::distance for btree iterators. > Merge pull request abseil/abseil-cpp#1298 from rpjohnst:mingw-cmake-build > Implement function to calculate Damerau-Levenshtein distance between two strings. > Change per_thread_sem_test from size medium to size large. > Support stringification of user-defined types in AbslStringify in absl::Substitute. > Fix "unsafe narrowing" warnings in absl, 12/12. > Revert change to internal 'Rep', this causes issues for gdb > Reorganize InlineData into an inner Rep structure. > Remove internal `VLOG_xxx` macros > Import of CCTZ from GitHub. > `absl::InlinedVector` supports move assignment with non-assignable types. > Change Cord internal layout, which reduces store-load penalties on ARM > Detects accidental multiple invocations of AnyInvocable<R(...)&&>::operator()&& by producing an error in debug mode, and clarifies that the behavior is undefined in the general case. > Fix a bug in StrFormat. This issue would have been caught by any compile-time checking but can happen for incorrect formats parsed via ParsedFormat::New. Specifically, if a user were to add length modifiers with 'v', for example the incorrect format string "%hv", the ParsedFormat would incorrectly be allowed. > Adds documentation for stringification extension > CMake: Remove check_target calls which can be problematic in case of dependency cycle > Changes mutex unlock profiling > Add static_cast<void> to the sources for trivial relocations to avoid spurious -Wdynamic-class-memaccess errors in the presence of other compilation errors. > Configure ABSL_CACHE_ALIGNED for clang-like and MSVC toolchains. > Fix "unsafe narrowing" warnings in absl, 11/n. > Eliminate use of internal interfaces > Merge pull request abseil/abseil-cpp#1289 from keith:ks/fix-more-clang-deprecated-builtins > Merge pull request abseil/abseil-cpp#1285 from jun-sheaf:patch-1 > Delete LogEntry's copy ctor and assignment operator. > Make sinks provided to `AbslStringify()` usable with `absl::Format()`. > Cast unused variable to void > No changes in OSS. > No changes in OSS > Replace the kPower10ExponentTable array with a formula. > CMake: Mark absl::cord_test_helpers and absl::spy_hash_state PUBLIC > Use trivial relocation for transfers in swisstable and b-tree. > Merge pull request abseil/abseil-cpp#1284 from t0ny-peng:chore/remove-unused-class-in-variant > Removes the legacy spellings of the thread annotation macros/functions by default. Closes #12201	2022-12-05 21:07:16 +02:00
Eliran Sinvani	5a5514d052	cql server: Only parallelize relevant cql requests The cql server uses an execution stage to process and execute queries, however, processing stage is best utilized when having a recurrent flow that needs to be called repeatedly since it better utilizes the instruction cache. Up until now, every request was sent through the processing stage, but most requests are not meant to be executed repeatedly with high volume. This change processes and executes the data queries asynchronously, through an execution stage, and all of the rest are processed one by one, only continuing once the request has been done end to end. Tests: Unit tests in dev and debug. Signed-off-by: Eliran Sinvani <eliransin@scylladb.com> Closes #12202	2022-12-05 21:06:58 +02:00
Takuya ASADA	b7851ab1ec	docker: fix locale on SSH shell `4ecc08c` broke locale settings on SSH shell, since we dropped "update-locale". To fix this without installing locales package, we need to manually specify LANG=C.UTF-8 in /etc/default/locale. see https://github.com/scylladb/scylla-cluster-tests/pull/5519 Closes #12197	2022-12-05 20:02:18 +02:00
Avi Kivity	6f2d060d12	Merge 'Make sstable_directory call sstable_manager for sstables' components' from Pavel Emelyanov This PR hits two goals for "object storage" effort 1. Sstables loader "knows" that sstables components are stored in a Linux directory and uses utils/lister to access it. This is not going to work with sstables over object storage, the loader should be abstracted from the underlying storage. 2. Currently class keyspace and class column_family carry "datadir" and "all_datadirs" on board which are path on local filesystem where sstable files are stored (those usually started with /var/lib/scylla/data). The paths include subsdirs like "snapshots", "staging", etc. This is not going to look nice for obejct storage, the /var/lib/ prefix is excessive and meaningless in this case. Instead, ks and cf should know their "location" and some other component should know the directory where in which the files are stored. Said that, this PR prepares distributed_loader and sstables_directly to stop using Linux paths explicitly by making both call sstables_manager to list and open sstables object. After it will be possible to teach manager to list sstables from object storage. Also this opens the way to removing paths from keyspace and column_family classes and replacing those with relative "location"s. Closes #12128 * github.com:scylladb/scylladb: sstable_directory: Get components lister from manager sstable_directory: Extract directory lister sstable_directory: Remove sstable creation callback sstable_directory: Call manager to make sstables sstable_directory: Keep error handler generator sstable_directory: Keep schema_ptr sstable_directory: Use directory semaphore from manager sstable_directory: Keep reference on manager tests: Use sstables creation helper in some cases sstables_manager: Keep directory semaphore reference sstables, code: Wrap directory semaphore with concurrency	2022-12-05 18:54:17 +02:00
Gleb Natapov	022a825b33	raft: introduce not_a_member error and return it when non member tries to do add/modify_config Currently if a node that is outside of the config tries to add an entry or modify config transient error is returned and this causes the node to retry. But the error is not transient. If a node tries to do one of the operations above it means it was part of the cluster at some point, but since a node with the same id should not be added back to a cluster if it is not in the cluster now it will never be. Return a new error not_a_member to a caller instead. Message-Id: <Y42mTOx8bNNrHqpd@scylladb.com>	2022-12-05 17:11:04 +01:00
Benny Halevy	c61083852c	storage_service: handle_state_normal: calculate candidates_for_removal when replacing tokens We currently try to detect a replaced node so to insert it to endpoints_to_remove when it has no owned tokens left. However, for each token we first generate a multimap using get_endpoint_to_token_map_for_reading(). There are 2 problems with that: 1. unless the replaced node owns a single token, this map will not be empty after erasing one token out of it, since the token metadata has not changed yet (this is done later with update_normal_tokens(owned_tokens, endpoint)). 2. generating this map for each token is inefficient, turning this algorithm complexity to quadratic in the number of tokens... This change copies the current token_to_endpoint map to temporary map and erases replaced tokens from it, while maintaining a set of candidates_for_removal. After traversing all replaced tokens, we check again the `token_to_endpoint_map` erasing from `candidates_for_removal` any endpoint that still owns tokens. The leftover candidates are endpoints the own no tokens and so they are added to `hosts_to_remove`. Fixes #12082 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #12141	2022-12-05 16:17:18 +01:00
Botond Dénes	3d620378d4	Merge 'view: coroutinize maybe_mark_view_as_built' from Avi Kivity Simplifying it a little. Closes #12171 * github.com:scylladb/scylladb: view: reindent maybe_mark_view_as_built view: coroutinize maybe_mark_view_as_built	2022-12-05 13:43:34 +02:00
Pavel Emelyanov	b5ede873f2	sstable_directory: Get components lister from manager For now this is almost a no-op because manager just calls sstables_directory code back to create the lister. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-05 12:03:19 +03:00
Pavel Emelyanov	3f9b8c855d	sstable_directory: Extract directory lister Currently the utils/lister.cc code is in use to list regular files in a directory. This patch wraps the lister into more abstract components lister class. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-05 12:03:19 +03:00
Pavel Emelyanov	abd3602b10	sstable_directory: Remove sstable creation callback It's no longer used. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-05 12:03:19 +03:00
Pavel Emelyanov	3d559391df	sstable_directory: Call manager to make sstables Now the directory code has everyhting it needs to create sstable object and can stop using the external lambda. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-05 12:03:19 +03:00
Pavel Emelyanov	db657a8d1c	sstable_directory: Keep error handler generator Yet another continuation to previous patch -- IO error handlers generator is also needed to create sstables. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-05 12:03:19 +03:00
Pavel Emelyanov	4281f4af42	sstable_directory: Keep schema_ptr Continuation of one-before-previous patch. In order to create sstable without external lambda the directory code needs schema. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-05 12:03:19 +03:00
Pavel Emelyanov	8df1bcb907	sstable_directory: Use directory semaphore from manager After previous patch sstables_directory code may no longer require for semaphore argument, because it can get one from manager. This makes the directory API shorter and simpler. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-05 12:03:19 +03:00
Pavel Emelyanov	4da941e159	sstable_directory: Keep reference on manager The sstables_directly accesses /var/lib/scylla/data in two ways -- lists files in it and opens sstables. The latter is abdtracted with the help of lambdas passed around, but the former (listing) is done by using directory liters from utils. Listing sstables components with directlry lister won't work for object storage, the directory code will need to call some abstraction layer instead. Opening sstables with the help of a lambda is a bit of overkill, having sstables manager at hand could make it much simpler. Said that, this patch makes sstables_directly reference sstables_manager on start. This change will also simplify directory semaphore usage (next patch). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-05 12:03:19 +03:00
Pavel Emelyanov	784d78810a	tests: Use sstables creation helper in some cases Several test cases push sstables creation lambda into with_sstables_directory helper. There's a ready to use helper class that does the same. Next patch will make additional use of that. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-05 12:03:19 +03:00
Pavel Emelyanov	5e13ce2619	sstables_manager: Keep directory semaphore reference Preparational patch. The semaphore will be used by sstables_directory in next patches. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-05 12:03:18 +03:00
Pavel Emelyanov	be8512d7cc	sstables, code: Wrap directory semaphore with concurrency Currently this is a sharded<semaphore> started/stopped in main and referenced by database in order to be fed into sstables code. This semaphore always comes with the "concurrency" parameter that limits the parallel_for_each parallelizm. This patch wraps both together into directory_semaphore class. This makes its usage simpler and will allow extending it in the future. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-05 11:59:30 +03:00
Asias He	c6087cf3a0	repair: Reduce repair reader eviction with diff shard count When repair master and followers have different shard count, the repair followers need to create multi-shard readers. Each multi-shard reader will create one local reader on each shard, N (smp::count) local readers in total. There is a hard limit on the number of readers who can work in parallel. When there are more readers than this limit. The readers will start to evict each other, causing buffers already read from disk to be dropped and recreating of readers, which is not very efficient. To optimize and reduce reader eviction overhead, a global reader permit is introduced which considers the multi-shard reader bloats. With this patch, at any point in time, the number of readers created by repair will not exceed the reader limit. Test Results: 1) with stream sem 10, repair global sem 10, 5 ranges in parallel, n1=2 shards, n2=8 shards, memory wanted =1 1.1) [asias@hjpc2 mycluster]$ time nodetool -p 7200 repair ks2 (repair on n2) [2022-11-23 17:45:24,770] Starting repair command #1, repairing 1 ranges for keyspace ks2 (parallelism=SEQUENTIAL, full=true) [2022-11-23 17:45:53,869] Repair session 1 [2022-11-23 17:45:53,869] Repair session 1 finished real 0m30.212s user 0m1.680s sys 0m0.222s 1.2) [asias@hjpc2 mycluster]$ time nodetool repair ks2 (repair on n1) [2022-11-23 17:46:07,507] Starting repair command #1, repairing 1 ranges for keyspace ks2 (parallelism=SEQUENTIAL, full=true) [2022-11-23 17:46:30,608] Repair session 1 [2022-11-23 17:46:30,608] Repair session 1 finished real 0m24.241s user 0m1.731s sys 0m0.213s 2) with stream sem 10, repair global sem no_limit, 5 ranges in parallel, n1=2 shards, n2=8 shards, memory wanted =1 2.1) [asias@hjpc2 mycluster]$ time nodetool -p 7200 repair ks2 (repair on n2) [2022-11-23 17:49:49,301] Starting repair command #1, repairing 1 ranges for keyspace ks2 (parallelism=SEQUENTIAL, full=true) [2022-11-23 17:52:01,414] Repair session 1 [2022-11-23 17:52:01,415] Repair session 1 finished real 2m13.227s user 0m1.752s sys 0m0.218s 2.2) [asias@hjpc2 mycluster]$ time nodetool repair ks2 (repair on n1) [2022-11-23 17:52:19,280] Starting repair command #1, repairing 1 ranges for keyspace ks2 (parallelism=SEQUENTIAL, full=true) [2022-11-23 17:52:42,387] Repair session 1 [2022-11-23 17:52:42,387] Repair session 1 finished real 0m24.196s user 0m1.689s sys 0m0.184s Comparing 1.1) and 2.1), it shows the eviction played a major role here. The patch gives 73s / 30s = 2.5X speed up in this setup. Comparing 1.1 and 1.2, it shows even if we limit the readers, starting on the lower shard is faster 30s / 24s = 1.25X (the total number of multishard readers is lower) Fixes #12157 Closes #12158	2022-12-05 10:47:36 +02:00
Botond Dénes	1e20095547	Update tools/java submodule * tools/java 1c06006447...ecab7cf7d6 (1): > Add VSCode files to gitignore	2022-12-05 09:54:51 +02:00
Botond Dénes	c4d72c8dd0	Merge 'cql3: select_statement: split and coroutinize process_results()' from Avi Kivity Split the simple (and common) case from the complex case, and coroutinize the latter. Hopefully this generates better code for the simple case, and it makes the complex case a little nicer. Closes #12194 * github.com:scylladb/scylladb: cql3: select_statement: reindent process_results_complex() cql3: select_statement: coroutinize process_results_complex() cql3: select_statement: split process_results() into fast path and complex path	2022-12-05 08:16:22 +02:00
Avi Kivity	a0a4711b74	snapshot: protect list operations against the lambda coroutine fiasco run_snapshot_list_operation() takes a continuation, so passing it a lambda coroutine without protection is dangerous. Protect the coroutine with coroutine::lambda so it doesn't lost its contents. Fixes #12192. Closes #12193	2022-12-05 08:14:39 +02:00
guy9	cb842b2729	Replacing the Docs top bar message from the LIVE event to the community forum announcement Closes #12189	2022-12-05 08:05:04 +02:00
Avi Kivity	0834bb0365	cql3: select_statement: reindent process_results_complex()	2022-12-04 21:36:17 +02:00
Avi Kivity	a63f98e3fc	cql3: select_statement: coroutinize process_results_complex() Not a huge gain, since it's just a do_with, but still a little better. Note the inner lambda is not a coroutine, so isn't susceptibe to the lambda coroutine fiasco.	2022-12-04 21:34:51 +02:00
Avi Kivity	7f29efa0ad	cql3: select_statement: split process_results() into fast path and complex path This will allow us to coroutinize the complex path without adding an allocation to the fast path.	2022-12-04 21:30:45 +02:00
Avi Kivity	02b66bb31a	Merge 'Mark sstable::<directory accessing methods> private' from Pavel Emelyanov One of the prerequisites to make sstables reside on object-storage is not to let the rest of the code "know" the filesystem path they are located on (because sometimes they will not be on any filesystem path). This patch makes the methods that can reveal this path back private so that later they can be abstracted out. Closes #12182 * github.com:scylladb/scylladb: sstable: Mark some methods private test: Don't get sstable dir when known test: Use move_to_quarantine() helper test: Use sstable::filename() overload without dir name sstables: Reimplement batch directory sync after move table, tests: Make use of move_to_new_dir() default arg sstables: Remove fsync_directory() helper table: Simplify take_snapshot()'s collecting sstables names	2022-12-04 17:45:37 +02:00
Kamil Braun	b551cd254c	test: test_raft_upgrade: fix test_recover_stuck_raft_upgrade flakiness The test enables an error injection inside the Raft upgrade procedure on one of the nodes which will cause the node to throw an exception before entering `synchronize` state. Then it restarts other nodes with Raft enabled, waits until they enter `synchronize` state, puts them in RECOVERY mode, removes the error-injected node and creates a new Raft group 0. As soon as the other nodes enter `synchronize`, the test disabled the error injection (the rest of the test was outside the `async with inject_error(...)` block). There was a small chance that we disabled the error injection before the node reached it. In that case the node also entered `synchronize` and the cluster managed to finish the upgrade procedure. We encountered this during next promotion. Eliminate this possibility by extending the scope of the `async with inject_error(...)` block, so that the RECOVERY mode steps on the other nodes are performed within that block. Closes #12162	2022-12-02 21:26:44 +01:00
Avi Kivity	94f18b5580	test: sstable_conforms_to_mutation_source: use do_with_async() where needed The test clearly needs a thread (it converts a reader to a mutation without waiting), so give it one. Closes #12178	2022-12-02 20:48:37 +01:00
Pavel Emelyanov	084522d9eb	sstable: Mark some methods private There are several class sstable methods that reveal internal directory path to caller. It's not object-storage-friendly. Fortunately, all the callers of those methods had been patched not to work with full paths, so these can be marked private. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-02 21:15:02 +03:00
Pavel Emelyanov	fb63850f2c	test: Don't get sstable dir when known The sstable_move_test creates sstables in its own temp directories and the requests these dirs' paths back from sstables. Test can come with the paths it has at hand, no need to call sstables for it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-02 21:13:58 +03:00
Pavel Emelyanov	4c742a658d	test: Use move_to_quarantine() helper Two places in tests move sstable to quarantine subdir by hand. There's the class sstable method that does the same, so use it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-02 21:13:19 +03:00
Pavel Emelyanov	d6244b7408	test: Use sstable::filename() overload without dir name The dir this place currently uses is the directory where the sstable was created, so dropping this argument would just render the same path. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-02 21:12:21 +03:00
Pavel Emelyanov	a702affd4d	sstables: Reimplement batch directory sync after move There's a table::move_sstables_from_staging() method that gets a bunch of sstables and moves them from staging subdit into table's root datadir. Not to flush the root dir for every sstable move, it asks the sstable::move_to_new_dir() not to flush, but collects staging dir names and flushes them and the root dir at the end altothether. In order to make it more friendly to object-storage and to remove one more caller of sstable::get_dir() the delayed_commit_changes struct is introduced. It collects _all_ the affected dir names in unordered_set, then allows flushing them. By default the move_to_new_dir() doesn't receive this object and flushes the directories instantly. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-02 21:08:47 +03:00
Pavel Emelyanov	1b42d5fce3	table, tests: Make use of move_to_new_dir() default arg The method in question accepts boolean bit whether or not it should sync directories at the end. It's always true but in one case, so there's the default value for it. Make use of it. Anticipating the suggestion to replace bool with bool_class -- next patch will replace it with something else. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-02 21:07:16 +03:00
Pavel Emelyanov	339feb4205	sstables: Remove fsync_directory() helper The one effectively wraps existing seastar sync_directory() helper into two io_check-s. It's simpler just to call the latter directly. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-02 21:05:43 +03:00
Pavel Emelyanov	80f5d7393f	table: Simplify take_snapshot()'s collecting sstables names The method in question "snapshots" all sstables it can find, then writes their Datafile names into the manifest file. To get the list of file names it iterates over sstables list again and does silly conversion of full file path to file name with the help of the directory path length. This all can be made much simpler if just collecting component names directly at the time sstable is hardlinked. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-02 21:02:37 +03:00

1 2 3 4 5 ...

34128 Commits