scylladb

Author	SHA1	Message	Date
Avi Kivity	94c21e5c05	Merge 'sstables: Reduce amount of I/O for clustering-key-bounded reads from large partitions' from Tomasz Grabiec Single-row reads from large partition issue 64 KiB reads to the data file, which is equal to the default span of the promoted index block in the data file. If users would want to increase selectivity of the index to speed up single-row reads, this won't be effective. The reason is that the reader uses promoted index to look up the start position in the data file of the read, but end position will in practice extend to the next partition, and amount of I/O will be determined by the underlying file input stream implementation and its read-ahead heuristics. By default, that results in at least 2 IOs 32KB each. There is already infrastructure to lookup end position based on upper bound of the read, in anticipation for sharing the promoted index cache, but it's not effective becasue it's a non-populating lookup and the upper bound cursor has its own private cached_promoted_index, which is cold when positions are computed. It's non-populating on purpose, to avoid extra index file IO to read upper bound. In case upper bound is far-enough from the lower bound, this will only increase the cost of the read. The solution employed here is to warm up the lower bound cursor's cache before positions are computed, and use that cursor for non-populating lookup of the upper bound. We use the lower bound cursor and the slice's lower bound so that we read the same blocks as later lower-bound slicing would, so that we don't incur extra IO for cases where looking up upper bound is not worth it, that is when upper bound is far from the lower bound. If upper bound is near lower bound, then warming up using lower bound will populate cached_promoted_index with blocks which will allow us to locate the upper bound block accurately. This is especially important for single-row reads, where the bounds are around the same key. In this case we want to read the data file range which belongs to a single promoted index block. It doesn't matter that the upper bound is not exactly the same. They both will likely lie in the same block, and if not, binary search will bring adjacent blocks into cache. Even if upper bound is not near, the binary search will populate the cache with blocks which can be used to narrow down the data file range somewhat. Fixes #10030. The change was tested with perf-fast-forward. I populated the data set with `column_index_size_in_kb` set to 1 scylla perf-fast-forward --populate --run-tests=large-partition-slicing --column-index-size-in-kb=1 Test run: build/release/scylla perf-fast-forward --run-tests=large-partition-select-few-rows -c1 --keep-cache-across-test-cases --test-case-duration=0 This test issues two reads of subsequent keys from the middle of a large partition (1M rows in total). The first read will miss in the index file page cache, the second read will hit. Notice that before the change, the second read issued 2 aio requests worth of 64KiB in total. After the change, the second read issued 1 aio worth of 2 KiB. That's because promoted index block is larger than 1 KiB. I verified using logging that the data file range matches a single promoted index block. Also, the first read which misses in cache is still faster after the change. Before: ``` running: large-partition-select-few-rows on dataset large-part-ds1 Testing selecting few rows from a large partition: stride rows time (s) iterations frags frag/s mad f/s max f/s min f/s avg aio aio (KiB) blocked dropped idx hit idx miss idx blk c hit c miss c blk allocs tasks insns/f cpu 500000 1 0.009802 1 1 102 0 102 102 21.0 21 196 2 1 0 1 1 0 0 0 568 269 4716050 53.4% 500001 1 0.000321 1 1 3113 0 3113 3113 2.0 2 64 1 0 1 0 0 0 0 0 116 26 555110 45.0% ``` After: ``` running: large-partition-select-few-rows on dataset large-part-ds1 Testing selecting few rows from a large partition: stride rows time (s) iterations frags frag/s mad f/s max f/s min f/s avg aio aio (KiB) blocked dropped idx hit idx miss idx blk c hit c miss c blk allocs tasks insns/f cpu 500000 1 0.009609 1 1 104 0 104 104 20.0 20 137 2 1 0 1 1 0 0 0 561 268 4633407 43.1% 500001 1 0.000217 1 1 4602 0 4602 4602 1.0 1 2 1 0 1 0 0 0 0 0 110 26 313882 64.1% ``` Backports: none, not a regression Closes scylladb/scylladb#20522 * github.com:scylladb/scylladb: perf: perf_fast_forward: Add test case for querying missing rows perf-fast-forward: Allow overriding promoted index block size perf-fast-forward: Test subsequent key reads from the middle in test_large_partition_select_few_rows perf-fast-forward: Allow adding key offset in test_large_partition_select_few_rows perf-fast-forward: Use single-partition reads in test_large_partition_select_few_rows sstables: bsearch_clustered_cursor: Add more tracing points sstables: reader: Log data file range sstables: bsearch_clustered_cursor: Unify skip_info logging sstables: bsearch_clustered_cursor: Narrow down range using "end" position of the block sstables: bsearch_clustered_cursor: Skip even to the first block test: sstables: sstable_3_x_test: Improve failure message sstables: mx: writer: Never include partition_end marker in promoted index block width sstables: Reduce amount of I/O for clustering-key-bounded reads from large partitions sstables: clustered_cursor: Track current block	2024-10-28 21:13:23 +02:00
Kefu Chai	24d14b601b	treewide: s/boost::adaptors::map_values/std::views::values/ now that we are allowed to use C++23. we now have the luxury of using `std::views::values`. in this change, we: - replace `boost::adaptors::map_values` with `std::views::values` - update affected code to work with `std::views::values` - the places where we use `boost::join()` are not changed, because we cannot use `std::views::concat` yet. this helper is only available in C++26. to reduce the dependency to boost for better maintainability, and leverage standard library features for better long-term support. this change is part of our ongoing effort to modernize our codebase and reduce external dependencies where possible. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#21265	2024-10-27 21:32:45 +02:00
Avi Kivity	3124711fc4	Merge 'Report rows_merged in compaction_history rest api and nodetool' from Łukasz Paszkowski Currently, running the `nodetool compactionhistory` command or using the rest api `curl -X GET --header "Accept: application/json" "http://localhost:10000/compaction_manager/compaction_history"` return compaction history without the `row_merged` field. The series computes rows merged during compaction and provides this information to users via both the nodetool command and the rest api. The `rows_merged` field contains information on merged clustering keys across multiple sstable files. For instance, compacting two sstables of a table consisting of 7 rows where two rows are part of the both sstables, the output would have the following format: {1: 5, 2: 2}. No backport is required. It extends the existing compaction history output. Fixes https://github.com/scylladb/scylladb/issues/666 Closes scylladb/scylladb#20481 * github.com:scylladb/scylladb: test/rest_api: Add tests for compactionhistory nodetool: Add rows merged stats into compactionhistory output compaction: Update compaction history with collected histogram compaction: Remove const qualifier from methods creating sstable readers sstable_set: Add optional statistics to make_local_shard_sstable_reader make_combined_reader: Add optional parameter, combined_reader_statistics reader_selector: Extend with maximum reader count mutation_fragment_merger: Create histogram while consuming mutation fragment batches	2024-10-27 21:26:11 +02:00
Pavel Emelyanov	7595ef7303	test: Squash test::change_generation_number() into test::store() No other usages of the former helper other than immediatelly followed by the latter, no point in keepint it around. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-10-24 11:29:17 +03:00
Pavel Emelyanov	e885b0e6cd	test: Squash test::change_dir() into test::store() No other usages of the former helper other than immediatelly followed by the latter, no point in keepint it around. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-10-24 11:28:39 +03:00
Łukasz Paszkowski	84912c3155	reader_selector: Extend with maximum reader count The maximum reader count allows to predict the number of readers that can be created with create_new_readers(). This helps to correctly allocate a vector size in the rows_merged statistics when a combiner reader is created via make_combined_reader.	2024-10-22 08:15:02 +02:00
Kefu Chai	ce0a86c585	build: cmake: correct some tests' KIND before this change, we build some tests as if they are Seastar tests. but after `415c83fa`, these tests failed to link. because the Seastar::seastar_testing does not expose `-DSEASTAR_TESTING_MAIN` in its cflags. the behavior of the Seastar::seastar_testing is expected. because a test linking against this library is not necessarily driven by the `main()` provided by `testing/seastar_test.hh`. so, in this change, we correct the `KIND` parameter of these tests, so that they use `KIND BOOST`, as these tests can be driven by the `main()` provided by Boost.Test's driver. also there are some tests driven by Boost.Test's `main()`, but in the meanwhile, they utilize seastar_testing, so let's add `Seastar::seastar_testing` to their `LIBRARIES`. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#21183	2024-10-22 07:10:47 +03:00
Kefu Chai	6ead5a4696	treewide: move log.hh into utils/log.hh the log.hh under the root of the tree was created keep the backward compatibility when seastar was extracted into a separate library. so log.hh should belong to `utils` directory, as it is based solely on seastar, and can be used all subsystems. in this change, we move log.hh into utils/log.hh to that it is more modularized. and this also improves the readability, when one see `#include "utils/log.hh"`, it is obvious that this source file needs the logging system, instead of its own log facility -- please note, we do have two other `log.hh` in the tree. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-10-22 06:54:46 +03:00
Kefu Chai	5cd619a60c	treewide: s/boost::adaptors::map_keys/std::views::keys/ now that we are allowed to use C++23. we now have the luxury of using `std::views::keys`. in this change, we: - replace `boost::adaptors::map_keys` with `std::views::keys` - update affected code to work with `std::views::keys` to reduce the dependency to boost for better maintainability, and leverage standard library features for better long-term support. this change is part of our ongoing effort to modernize our codebase and reduce external dependencies where possible. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#21198	2024-10-21 12:47:52 +03:00
Avi Kivity	b5a1173880	utils: small_vector: support from_range_t std::ranges::to<>() has a little protocol with containers to allow them to optimize their construction from ranges. Implement it for small_vector. It optimizes ranges that can have their size determined quickly, or that can be traversed twice to determine the size by reserving up front. Single-pass ranges (std::ranges::input_range) use the less efficient push_back method. A unit test (which fails without the new constructor) is added. Closes scylladb/scylladb#21094	2024-10-21 09:31:38 +03:00
Avi Kivity	c3be2489ce	treewide: drop includes of <boost/range/adaptors.hpp> This includes way too much, including <boost/regex.hpp>, which is huge. Drop includes of adaptors.hpp and replace by what is needed. Closes scylladb/scylladb#21187	2024-10-20 17:17:11 +03:00
Kefu Chai	85518463a9	test/boost: stop using ranges::to() now that we are able to use ranges library provided by the C++ standard library. there is no need to use the homebrew `ranges::to()`. in this change, we switch to `std::ranges::to()` in favor of `ranges::to()`. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-10-19 13:21:20 +08:00
Pavel Emelyanov	b11d50f591	Merge 'multishard reader: make it safe to create with admitted permits' from Botond Dénes Passing an admitted permit -- i.e. one with count resources on it -- to the multishard reader, will possibly result in a deadlock, because the permit of the multishard reader is destroyed after the permits of its child readers. Therefore its semaphore resources won't be automatically released until children acquire their own resources. This creates a dependency (an edge in the "resource allocation graph"), where the semaphore used by the multishard reader depends on the semaphores used by children. When such dependencies create a cycle, and permits are acquired by different reads in just the right order, a deadlock will happen. Users of the multishard reader have to be aware of this gotcha -- and of course they aren't. This is small wonder, considering that not even the documentation on the multishard reader mentions this problem. To work around this, the user has to call `reader_permit::release_base_resources()` on the permit, before passing it to the multishard reader. On multiple occasions, developers (including the very author of the multishard reader), forgot or didn't know about this and this resulted in deadlocks down the line. This is a design-flaw of the multishard reader, which is addressed in this PR, after which, it is safe to pass admitted or not admitted permits to the multishard reader, it will handle the call to `release_base_resources()` if needed. After fixing the problem in the multishard reader, the existing calls to `release_base_resources()` on permits passed to multishard readers are removed. A test is added which reproduces the problem and ensures we don't regress. Refs: https://github.com/scylladb/scylladb/issues/20885 (partial fix, there is another deadlock in that issue, which this PR doesn't fix) This fixes (indirectly) a regression introduced by `d98708013c` so it has to be backported to 6.2 Closes scylladb/scylladb#21058 * github.com:scylladb/scylladb: test/boost/mutation_test: add test for multishard permit safety test/lib/reader_lifecycle_policy: add semaphore factory to constructor test/lib/reader_lifecycle_policy: rename factory_function repair/row_level: drop now unneeded release_base_resource() calls readers/multishard: make multishard reader safe to create with admitted permits	2024-10-18 13:30:21 +03:00
Pavel Emelyanov	df6991edd3	test: Do not duplicate sstable twice The statistics_rewrite test case copies an sstable from resources two times: - first time -- explicitly by listing resource components and copying files to the test temp dir - second time -- implicitly, by calling create_links() linking copied files by new set in the staging/ subdirectory The 2nd step is not needed and the history of changes justifies that. The test itself appeared with `70b793e4d3` and it only contained the 2nd "copying" -- test linked files from resource directory and then worked in the newly created set. Later, commit `59c57861ae` added the first step and copied the files from resource into test temp dir. At this point linking copied files because pointless, but was preserved. Let's remove it now. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#21097	2024-10-18 08:31:08 +03:00
Botond Dénes	e1d8cddd09	test/boost/mutation_test: add test for multishard permit safety Add a test checking that the multishard reader will not deadlock, when created with an admitted permit, on a semaphore with a single count resource.	2024-10-17 08:47:50 -04:00
Botond Dénes	5a3fd69374	test/lib/reader_lifecycle_policy: add semaphore factory to constructor Allowing callers to specify how the semaphore is created and stopped, instead of doing so via boolean flags like it is done currently. This method doesn't scale, so use a factory instead.	2024-10-17 08:47:50 -04:00
Alexey Novikov	b965729f0a	replica: implement memtable_flush_period_in_ms schema option implement cassandra original schema option memtable_flush_period_in_ms: Milliseconds before memtables associated with the table are flushed. there are few things concerning this patch: * milliseconds look strange and scary for this option. Unlike Cassandra we use 60000ms (1min) minimum value for this option. * This is limitation of Cassandra but it is impossible to set this option for system tables. However sometimes it could be very useful to use automatic flushing for such a tables: some system tables have small traffic and as a result prevent tombstone garbage collection. Fixes #20270 Closes scylladb/scylladb#20999	2024-10-17 13:41:15 +03:00
Calle Wilund	f2ef75c3da	commitlog_test: Up timeout for large entry tests Fixes #21150 Apparently, on some CI, in debug, these tests can time out (large alloc) without actually failing what they do. Up the timeout (could consider removing as well, but...) so they hopefully pass. Closes scylladb/scylladb#21151	2024-10-16 18:13:04 +03:00
Patryk Jędrzejczak	18d3a6480d	test: test_read_required_hosts: run with the raft-based topology When we made the raft-based topology mandatory, all boost test tests started using it. Then, `test_read_required_hosts` started failing. We left investigating it for later and started running it with `force-gossip-topology-changes` to make it pass. Currently, the test doesn't fail with the raft-based topology anymore. Hence, we remove the FIXME and run the test with a normal config. We don't know when and why the test stopped failing. Investigating it wouldn't be easy, since we don't even know why it failed in the first place. We suspect that there was some bug that is now fixed. This patch only fixes a test, there is no need to backport it. Fixes scylladb/scylladb#18463 Closes scylladb/scylladb#20960	2024-10-11 17:01:20 +02:00
Kamil Braun	4d99cd2055	Merge 'raft: fast tombstone GC for group0-managed tables' from Emil Maskovsky Add the gossip state for broadcasting the nodes state_id. Implemented the Group0 state broadcaster (based on the gossip) that will broadcast the state id of each node and check the minimal state id for the tombstone GC. When there is a change in the tombstone GC minimal state id, the state broadcaster will update the tombstone GC time for the group0-managed tables. The main component of the change is the newly added `group0_state_id_handler` that keeps track, broadcasts and receives the last group0 state_ids across all nodes and sets the tombstone GC deletion time accordingly: * on each group0 change applied, the state_id handler broadcasts the state_id as a gossip state (only if the value has changed) * the handler checks for the node state ids every refresh period (configurable, 1h by default) * on every check, the handler figures out the lowest state_id (timeuuid), which is state_id that all of the nodes already have * the timestamp of this minimum state_id is then used to set the tombstone GC deletion time * the tombstone GC calculation then uses that deletion time to provide the GC time back to the callers, e.g. when doing the compaction * (as the time for tombstone GC calculation has the 1s granularity we actually deduce 1s from the determined timestamp, because it can happen that there were some newer mutations received in the same second that were not distributed across the nodes yet) This change introduces a new flag to the static schema descriptor (`is_group0_table`) that is being checked for this newly added mode in the tombstone GC. We also add a check (in non-release builds only) on every group0 modification that the table has this flag set. The group0 tombstone GC handling is similar to the "repair" tombstone GC mode in a sense (that the tombstone GC time is determined according to a reconciliation action), however it is not explicitly visible to (nor editable by) the user. And also the tombstone GC calculation is much simpler than the "repair" mode calculation - for example, we always use the whole range (as opposed to the "repair" mode that can have specific repair times set for specific ranges). We use the group0 configuration to determine the set of nodes (both current and previous in case of joint configuration) - we need to make sure that we account for all the group0 nodes (if any node didn't provide the state_id yet, the current check round will be skipped, i.e. no GC will be done until all known nodes provide their state_id timestamp value). Also note that the group0 state_id handling works on all nodes independently, i.e. each node might have its own (possibly different) state depending on the gossip application state propagation. This is however not a problem, as some nodes might be behind, but they will catch up eventually, and this solution has the benefit of being distributed (as opposed to having a central point to handle the state, like for example the topology coordinator that has been considered in the early stages of the design). Fixes: scylladb/scylla#15607 New feature, should not be backported. Closes scylladb/scylladb#20394 * github.com:scylladb/scylladb: raft: add the check for the group0 tables raft: fast tombstone GC for group0-managed tables tombstone_gc: refactor the repair map raft: flag the group0-managed tables gossip: broadcast the group0 state id raft/test: add test for the group0 tombstone GC treewide: code cleanup and refactoring	2024-10-11 11:52:27 +02:00
Botond Dénes	86fd9ce8fd	schema/schema: break circular dependency with replica::database The schema module (everything in schema/) is supposed to be towards the leafs in the ScyllaDB inter-module dependency graph. In other words, it should not depend on many other modules. On the other hand, almost the entire codebase depends on the schema module itself. Currently there is a circular dependency between schema and replica::database, as the latter is a required argument for schema::describe(). This is bad, not just because of the dependency mess it introduces, but also because now schema::describe() can only be used by code which has a reference to the database handy. This patch breaks this circular dependency, by introducing the schema_describe_helper interface and providing an implementation for it in database.hh. There is another circular dependency: schema <-> replica::table. This is not addressed by this patch. Closes scylladb/scylladb#20893	2024-10-10 10:07:26 +03:00
Benny Halevy	3a12ad96c7	sstables: scylla_metadata: add sstable identifier Keep a copy of the sstable uuid generation in a new scylla_metadata sstable_identifier attribute. If the SSTable happens to have a numerical generation just create a new time-uuid and log a message about that. Dump this new attribute in scylla sstable dump tool. And add a unit test to verify that the written (and then loaded) sstable identifier matches the sstable's generation. The motivatrion for this change stems from backup deduplication. In essence, an sstable may already have been backed up in a previous snapshot, and we don't want to abck it up again if it's already present on external storage. Today this is based on rclone that compares files checksums, but once scylla will backup the sstables using the native object-storage stack (#19890), we would like to use the sstable globally-unique identifier for deduplication. Although the uuid-generation is encoded in the sstable path, the latter may change, e.g. due to intra-node migration, so keep a copy of the original unique identifier in scylla-metadata, and that attribute would survive file-based or intra-node migrations. Fixes scylladb/scylladb#20459 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes scylladb/scylladb#21002	2024-10-10 08:52:46 +03:00
Avi Kivity	bb1867c7c7	Merge 'sstables: Add digest checking in the validation path of the sstable layer' from Nikos Dragazis This PR builds upon the PR for checksum validation (#20207) to further enhance scrub's corruption detection capabilities by validating digests as well. The digest (full checksum) is the checksum over the entire data, as opposed to per-chunk checksums which apply to individual chunks. Until now, digests were not examined on any code paths. This PR integrates digest checking into the compressed/checksummed data sources as an optional feature and enables it only through the validation path of the sstable layer (`sstable::validate()`). The validation path is used by the following tools: * scrub in validate mode * `sstable validate` All other reads, including normal user reads, are unaffected by this change. The PR consists of: * Extensions to the compressed and checksummed data sources to support digest checking. The data sources receive the expected digest as a parameter and calculate the actual digest incrementally across multiple get() calls. The check happens on the get() call that reaches EOF and results to an exception if the digest is invalid. A digest check requires reading the whole file range. Therefore, a partial read or skip() is treated as an internal error. * A new shareable digest component loaded on demand by the validation code. No lifecycle management. * Grouping of old scrub/validate tests for compressed and uncompressed SSTables to reduce code duplication. * scrub/validate tests for SSTables with valid checksums but invalid digests, and SSTables with no digests at all. * scrub/validate tests with 3.x Cassandra SSTables to ensure compatibility. Refs #19058. New feature, no backport is needed. Closes scylladb/scylladb#20720 * github.com:scylladb/scylladb: test: Test scrub/validate with SSTables from Cassandra compaction: Make quarantine optional for perform_sstable_scrub() test: Make random schema optional in scrub_test_framework test: Add tests for invalid digests test: Merge scrub/validate tests for compressed and uncompressed cases sstables: Verify digests on validation path sstables: Check if digest component exists sstables: Add digest in the SSTable components sstables: Add digest check in compressed data source sstables: Add digest check in checksummed data source	2024-10-09 21:33:08 +03:00
Botond Dénes	3e468608e7	Merge 'Collect sstables on boot from all datadirs (and don't collect from S3 twice)' from Pavel Emelyanov There's a long-pending issue in distributed loader. When it populates sstables on boot it loops over table.config.all_datadirs, but ignores the loop cursor (the datadir itslef), instead loading sstables from table.config.dir, which is 0th element of all_datadirs. There's a test for that, but it's also broken. Effectively collection happens from table.config.dir several times. For local sstables that's just wasted work and potentially lost sstables (but nobody seems to configure more than 1 datadir anyway). For S3 sstables it's also wasted work and incorrectness. The fix is for both -- populator and test. The former is to use all_datadirs to construct sstable_directory. To make it happen, creation of sstable_directory now depends on the storage options, the loop is moved into the branch that creates sstable_directory for local storage type. The test fix is to make sure that some sstables in non-default datadir before running population code. Closes scylladb/scylladb#20819 * github.com:scylladb/scylladb: test: Fix test_multiple_data_dirs distributed_loader: Indentation fix after previous patch distributed_loader: Use correct datadir to collect local sstable distributed_loader: Move all-datadirs loop to local storage collecting distributed_loader: Collect table subdirs based on its storage options distributed_loader: Indentation fix after previous patch distributed_loader: Squash loop of collect_subdir into one method distributed_loader: Convert map of directories into a vector distributed_loader: Make start_subdir() method work with directory distributed_loader: Drop local reference variable distributed_loader: Split start_subdir() distributed_loader: Remove allow-offstrategy argument distributed_loader: Make populate() method work with directory distributed_loader: Remove check for sstable_directory presense distributed_loader: Out-line table_populator() methods distributed_loader: Print storage options, not datadir distributed_loader: Print prepared message sstable_directory: Add sstable_state argument ot one of constructors sstable_directory: Add state() method	2024-10-09 14:43:34 +03:00
Pavel Emelyanov	17ec416178	Merge 'Make sure S3 upload completion parses possible error' from Ernest Zaslavsky fixes #20517 Adds `aws_error` which possibly can contain errors from the S3 response body. Adds to the multipart upload completion a check for possible error and issues a retry if the error is retryable Closes scylladb/scylladb#20518 * github.com:scylladb/scylladb: test: add complete_multipart_upload completion tests code: s3 client error handling code: add response parsing and error handling to the complete_multipart_upload code: Introduce AWS errors parsing	2024-10-09 12:01:27 +03:00
Emil Maskovsky	0c9308cf48	raft: add the check for the group0 tables Added the runtime check to ensure that all the tables that are used with the group0 commands are marked as group0 tables.	2024-10-08 21:08:11 +02:00
Pavel Emelyanov	8bfbc563cc	test: Remove sstable factory from test_min_max_clustering_key() The helper makes sstables from env directly. Callers may not create the factor after that. Less code the better. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#20983	2024-10-07 20:08:05 +03:00
Nikos Dragazis	7a1ec3aa41	test: Test scrub/validate with SSTables from Cassandra All current unit tests for scrub in validate mode generate random SSTables on the fly. Add some more tests with frozen Cassandra SSTables from the source tree to verify compatibility with Cassandra. Use some of the existing 3.x Cassandra SSTables to test the valid case, and use the same schema to generate some corrupted SSTables for the invalid case. Overall, the new tests cover the following scenarios: * valid compressed/uncompressed * compressed/uncompressed with invalid checksums * compressed/uncompressed with invalid digest For the compressed SSTable with invalid checksums, a small chunk length was used (4KiB) to have more chunks with less disk space. For uncompressed SSTables the chunk length is not configurable. Finally, since the SSTables live in the source tree, the quarantine mechanism was disabled. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2024-10-07 15:21:38 +03:00
Nikos Dragazis	5f2be2924e	test: Make random schema optional in scrub_test_framework The scrub_test_framework, which is the foundation for all scrub-related tests, always generates a random schema upon initialization and makes it available to the user. This is useful for running tests with ephemeral SSTables, but is redundant when the creation of the SSTable predates the test (e.g., it lives in the source tree). Turn scrub_test_framework into a template with a boolean parameter to optionally switch off the random schema generation. Also, add an overload for run() to support passing a ready-to-use SSTable instead of mutation fragments. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2024-10-07 15:21:38 +03:00
Nikos Dragazis	07ed0a48aa	test: Add tests for invalid digests In a previous patch we extended the validation path of the SSTable layer to validate the digests along with the checksums. Add two tests for compressed and uncompressed SSTables to test the validation API against SSTables with valid checksums but corrupted digests. Add two more tests to ensure that the absence of digest does not affect checksum validation. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2024-10-07 15:21:38 +03:00
Nikos Dragazis	39a74fb692	test: Merge scrub/validate tests for compressed and uncompressed cases Currently, every scrub/validate test is duplicated to cover both compressed and uncompressed SSTables. However, except for the compression type, the tests are identical. This leads to some code bloat. Introduce common functions parameterized by the compression type to reduce code duplication. Also, group together the compressed and uncompressed variants into one compression-agnostic test. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2024-10-07 15:21:38 +03:00
Pavel Emelyanov	1870873538	test: Fix test_multiple_data_dirs The one was broken from the very beginning. It only checked that after creating a table, its directory is created in all datadirs. But it didn't check that after restart populating happens from the all. That's because all directories by 0th were always empty, so not-populating from them didn't skip any data. Fix it by moving all sstables from datadirs[0] to datadirs[1] before restart. With that update not-populating data from datadirs[1] will be noticed instantly. Fortunately, previous patches fixed that, so the test still passes. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-10-07 12:04:23 +03:00
Botond Dénes	07094c3e44	Merge 'replica: Fix tombstone GC during tablet split preparation' from Raphael "Raph" Carvalho During split prepare phase, there will be more than 1 compaction group with overlapping token range for a given replica. Assume tablet 1 has sstable A containing deleted data, and sstable B containing a tombstone that shadows data in A. Then split starts: 1) sstable B is split first, and moved from main (unsplit) group to a split-ready group 2) now compaction runs in split-ready group before sstable A is split tombstone GC logic today only looks at underlying group, so compaction is step 2 will discard the deleted data in A, since it belongs to another group (the unsplit one), and so the tombstone can be purged incorrectly. To fix it, compaction will now work with all uncompacting sstables that belong to the same replica, since tombstone GC requires all sstables that possibly contain shadowed data to be available for correct decision to be made. Fixes https://github.com/scylladb/scylladb/issues/20044. Branches 6.0, 6.1 and 6.2 are vulnerable, so backport is needed. Closes scylladb/scylladb#20939 * github.com:scylladb/scylladb: replica: Fix tombstone GC during tablet split preparation service: Improve error handling for split	2024-10-04 10:29:42 +03:00
Nikos Dragazis	c893f06409	sstables: Add digest check in compressed data source Following the addition of digest check in the checksummed data source, add the same feature to the compressed data source as well. This ensures consistent behavior across any type of SSTable. This is added as an optional feature so that we can preserve the current behavior, that is verify only the per-chunk checksums during normal user reads. To ensure zero cost at runtime when disabled, we introduce the on/off switch as a template parameter. The digest calculation for compressed SSTables depends on the SSTable format, hence the new template argument for the checksum mode. This is consistent with the compressed data sink. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2024-10-03 18:09:01 +03:00
Tomasz Grabiec	1b82d5117a	sstables: bsearch_clustered_cursor: Narrow down range using "end" position of the block This is optimization. Example: block0: start=aaa, end=aaA block1: start=bbb, end=bbB block2: whatever Before the patch, advance_to("aAA") would skip to block0, and upper bound probe would skip to block1. This way, the reader would read the range of block0 from the data file. After the patch, "end" position is taken into account, so advance_to("aAA") will notice that block0 doesn't contain the position and will skip to block1. This is especially important for dense indexes, as it allows us to skip accessing data file if the search key is missing. It also solves the edge case problem related to the fact that single row reads are using a range which with positions which are not equal to the key, but are before(key) and after(key) for the lower bound and upper bound respectively. Before the patch, advance_to(before("bbb")) would skip to block0, before the position is before the block1's start. And upper bound probe for after("bbb") would point to block2. This way the read would scan block0 needlessly. After the patch, advance_to(before("bbb")) will skip to block1 because we notice based on "end" that block0 doesn't contain the position. This change also ensures that the start position of the upper bound entry of the after_key(pos), where pos is the last advance_to() position, is warm in cache. This is needed to optimize single-row reads with a dense index so that they always read exactly one promoted index block. For this to work, probe_upper_bound() for the after_key(row) always needs to find the upper bound block in cache.	2024-10-03 14:16:05 +02:00
Tomasz Grabiec	c905554121	test: sstables: sstable_3_x_test: Improve failure message	2024-10-03 14:16:05 +02:00
Tomasz Grabiec	7f077893ed	sstables: mx: writer: Never include partition_end marker in promoted index block width Currently, it may happen that the last promoted index block includes the partition_end marker. That's because we first write the partition end marker and then emit the unclosed block. This behavior matches Cassandra (checked in 3.x and 5.0.1). This is problematic for ruling out data file reads based on index. The width field is currently unused, but it will be used later where the width of the last block is used to compute the skip position past the last block for lookups which land after all keys in the partition. If width includes the marker then such a skip would land in the next partition, which is incorrect, as the reader context expects a cell element. Even if that was recognized, it's wrong - if this is not a single partition read (so upper bound is not at the next partition too), then we would read from the wrong (next) partition. We want to be able to make such skips in order to avoid unnecessary data file IO for reads of missing rows. Currently, we would always read the last block even if the key is past its "end" position. Another way to solve this would be to propagate the "past the last block" condition from the index cursor to the reader and let it deal with it, but the logic for that would be complicated. With this fix, there is no special logic required.	2024-10-03 14:09:57 +02:00
Kefu Chai	f9091066b7	treewide: replace boost::irange with std::views::iota where possible when building scylla with the standard library from GCC-14.2, shipped by fedora 41, we have following build failure: ``` /home/kefu/.local/bin/clang++ -DDEBUG -DDEBUG_LSA_SANITIZER -DFMT_SHARED -DSANITIZE -DSCYLLA_BUILD_MODE=debug -DSCYLLA_ENABLE_ERROR_INJECTION -DSEASTAR_API_LEVEL=7 -DSEASTAR_DEBUG -DSEASTAR_DEBUG_PROMISE -DSEASTAR_DEBUG_SHARED_PTR -DSEASTAR_DEFAULT_ALLOCATOR -DSEASTAR_LOGGER_COMPILE_TIME_FMT -DSEASTAR_LOGGER_TYPE_STDOUT -DSEASTAR_SCHEDULING_GROUPS_COUNT=16 -DSEASTAR_SHUFFLE_TASK_QUEUE -DSEASTAR_SSTRING -DSEASTAR_TYPE_ERASE_MORE -DXXH_PRIVATE_API -DCMAKE_INTDIR=\"Debug\" -I/home/kefu/dev/scylladb -I/home/kefu/dev/scylladb/build/gen -I/home/kefu/dev/scylladb/seastar/include -I/home/kefu/dev/scylladb/build/seastar/gen/include -I/home/kefu/dev/scylladb/build/seastar/gen/src -isystem /home/kefu/dev/scylladb/abseil -g -Og -g -gz -std=gnu++23 -fvisibility=hidden -Wall -Werror -Wextra -Wno-error=deprecated-declarations -Wimplicit-fallthrough -Wno-c++11-narrowing -Wno-deprecated-copy -Wno-mismatched-tags -Wno-missing-field-initializers -Wno-overloaded-virtual -Wno-unsupported-friend -Wno-unused-parameter -ffile-prefix-map=/home/kefu/dev/scylladb/build=. -march=x86-64-v3 -mpclmul -Xclang -fexperimental-assignment-tracking=disabled -Werror=unused-result -fstack-clash-protection -fsanitize=address -fsanitize=undefined -MD -MT CMakeFiles/scylla-main.dir/Debug/init.cc.o -MF CMakeFiles/scylla-main.dir/Debug/init.cc.o.d -o CMakeFiles/scylla-main.dir/Debug/init.cc.o -c /home/kefu/dev/scylladb/init.cc In file included from /home/kefu/dev/scylladb/init.cc:12: In file included from /home/kefu/dev/scylladb/db/config.hh:20: In file included from /home/kefu/dev/scylladb/locator/abstract_replication_strategy.hh:26: /home/kefu/dev/scylladb/locator/tablets.hh:410:30: error: unexpected type name 'size_t': expected expression 410 \| return boost::irange<size_t>(0, tablet_count()) \| boost::adaptors::transformed([] (size_t i) { \| ^ /home/kefu/dev/scylladb/locator/tablets.hh:410:23: error: no member named 'irange' in namespace 'boost' 410 \| return boost::irange<size_t>(0, tablet_count()) \| boost::adaptors::transformed([] (size_t i) { \| ~~~~~~~^ /home/kefu/dev/scylladb/locator/tablets.hh:410:38: error: left operand of comma operator has no effect [-Werror,-Wunused-value] 410 \| return boost::irange<size_t>(0, tablet_count()) \| boost::adaptors::transformed([] (size_t i) { \| ^ 3 errors generated. [16/782] Building CXX object CMakeFiles/scylla-main.dir/Debug/keys.cc.o [17/782] Building CXX object CMakeFiles/scylla-main.dir/Debug/counters.cc.o [18/782] Building CXX object CMakeFiles/scylla-main.dir/Debug/partition_slice_builder.cc.o [19/782] Building CXX object CMakeFiles/scylla-main.dir/Debug/mutation_query.cc.o FAILED: CMakeFiles/scylla-main.dir/Debug/mutation_query.cc.o /home/kefu/.local/bin/clang++ -DDEBUG -DDEBUG_LSA_SANITIZER -DFMT_SHARED -DSANITIZE -DSCYLLA_BUILD_MODE=debug -DSCYLLA_ENABLE_ERROR_INJECTION -DSEASTAR_API_LEVEL=7 -DSEASTAR_DEBUG -DSEASTAR_DEBUG_PROMISE -DSEASTAR_DEBUG_SHARED_PTR -DSEASTAR_DEFAULT_ALLOCATOR -DSEASTAR_LOGGER_COMPILE_TIME_FMT -DSEASTAR_LOGGER_TYPE_STDOUT -DSEASTAR_SCHEDULING_GROUPS_COUNT=16 -DSEASTAR_SHUFFLE_TASK_QUEUE -DSEASTAR_SSTRING -DSEASTAR_TYPE_ERASE_MORE -DXXH_PRIVATE_API -DCMAKE_INTDIR=\"Debug\" -I/home/kefu/dev/scylladb -I/home/kefu/dev/scylladb/build/gen -I/home/kefu/dev/scylladb/seastar/include -I/home/kefu/dev/scylladb/build/seastar/gen/include -I/home/kefu/dev/scylladb/build/seastar/gen/src -isystem /home/kefu/dev/scylladb/abseil -g -Og -g -gz -std=gnu++23 -fvisibility=hidden -Wall -Werror -Wextra -Wno-error=deprecated-declarations -Wimplicit-fallthrough -Wno-c++11-narrowing -Wno-deprecated-copy -Wno-mismatched-tags -Wno-missing-field-initializers -Wno-overloaded-virtual -Wno-unsupported-friend -Wno-unused-parameter -ffile-prefix-map=/home/kefu/dev/scylladb/build=. -march=x86-64-v3 -mpclmul -Xclang -fexperimental-assignment-tracking=disabled -Werror=unused-result -fstack-clash-protection -fsanitize=address -fsanitize=undefined -MD -MT CMakeFiles/scylla-main.dir/Debug/mutation_query.cc.o -MF CMakeFiles/scylla-main.dir/Debug/mutation_query.cc.o.d -o CMakeFiles/scylla-main.dir/Debug/mutation_query.cc.o -c /home/kefu/dev/scylladb/mutation_query.cc In file included from /home/kefu/dev/scylladb/mutation_query.cc:12: In file included from /home/kefu/dev/scylladb/schema/schema_registry.hh:17: In file included from /home/kefu/dev/scylladb/replica/database.hh:11: In file included from /home/kefu/dev/scylladb/locator/abstract_replication_strategy.hh:26: /home/kefu/dev/scylladb/locator/tablets.hh:410:30: error: unexpected type name 'size_t': expected expression 410 \| return boost::irange<size_t>(0, tablet_count()) \| boost::adaptors::transformed([] (size_t i) { \| ^ /home/kefu/dev/scylladb/locator/tablets.hh:410:23: error: no member named 'irange' in namespace 'boost' 410 \| return boost::irange<size_t>(0, tablet_count()) \| boost::adaptors::transformed([] (size_t i) { \| ~~~~~~~^ /home/kefu/dev/scylladb/locator/tablets.hh:410:38: error: left operand of comma operator has no effect [-Werror,-Wunused-value] 410 \| return boost::irange<size_t>(0, tablet_count()) \| boost::adaptors::transformed([] (size_t i) { \| ^ In file included from /home/kefu/dev/scylladb/mutation_query.cc:12: In file included from /home/kefu/dev/scylladb/schema/schema_registry.hh:17: In file included from /home/kefu/dev/scylladb/replica/database.hh:37: In file included from /home/kefu/dev/scylladb/db/snapshot-ctl.hh:20: /home/kefu/dev/scylladb/tasks/task_manager.hh:403:54: error: no member named 'irange' in namespace 'boost' 403 \| co_await coroutine::parallel_for_each(boost::irange(0u, smp::count), [&tm, id, &res, &func] (unsigned shard) -> future<> { \| ~~~~~~~^ 4 errors generated. ``` so let's take the opportunity to switch from `boost::irange` to `std::views::iota`. in this change, we: - switch from boost::irange to std::views::iota for better standard library compatibility - retain boost::irange where step parameter is used, as std::views::iota doesn't support it - this change partially modernizes our range usage while maintaining - existing functionality Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#20924	2024-10-03 10:33:33 +03:00
Raphael S. Carvalho	93815e0649	replica: Fix tombstone GC during tablet split preparation During split prepare phase, there will be more than 1 compaction group with overlapping token range for a given replica. Assume tablet 1 has sstable A containing deleted data, and sstable B containing a tombstone that shadows data in A. Then split starts: 1) sstable B is split first, and moved from main (unsplit) group to a split-ready group 2) now compaction runs in split-ready group before sstable A is split tombstone GC logic today only looks at underlying group, so compaction is step 2 will discard the deleted data in A, since it belongs to another group (the unsplit one), and so the tombstone can be purged incorrectly. To fix it, compaction will now work with all uncompacting sstables that belong to the same replica, since tombstone GC requires all sstables that possibly contain shadowed data to be available for correct decision to be made. Fixes #20044. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2024-10-02 11:26:13 -03:00
Benny Halevy	5a0f3889e0	treewide: use std::ranges sort functions rather than boost Using the standard library is preffered over boost. In cql3/expr/expression.cc to_sorted_vector got more of a face-list and was modernized to use also std::unique and while at it, to move its input range in the uniquely sorted result vector. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-10-01 14:19:05 +03:00
Avi Kivity	e9425e15b2	treewide: remove dependency on boost asio address_v4 It's not used. There's a comment mentioning it prevents some type conflict, but apparently that was fixed some time ago. Closes scylladb/scylladb#20883	2024-10-01 14:00:50 +03:00
Botond Dénes	e780a3f168	Merge 'fix regressions of building tests with cmake' from Laszlo Ersek Fix two recent regressions of the cmake build -- found this time in the test suite. We (presumably) don't build stable releases (and their tests) with CMake, so backporting these fixes appears unnecessary, even if the regressions have been ported to stable branches. @xemul @dawmd @tchaikov @tgrabiec @scylladb/scylla-maint Closes scylladb/scylladb#20854 * github.com:scylladb/scylladb: test/boost/bptree_test: fix the CMake build test/boost/auth_test: fix the CMake build	2024-10-01 11:14:19 +03:00
Ernest Zaslavsky	5a96549c86	test: add complete_multipart_upload completion tests A primitive python http server is processing s3 client requests and issues either success or error. A multipart uploader should fail or succeed (with or without retries) depending on aforementioned server response	2024-10-01 09:06:24 +03:00
Avi Kivity	fb8743b2d6	Merge 'sstables: Fix use-after-free on page cache buffer when parsing promoted index entries across pages' from Tomasz Grabiec This fixes a use-after-free bug when parsing clustering key across pages. Also includes a fix for allocating section retry, which is potentially not safe (not in practice yet). Details of the first problem: Clustering key index lookup is based on the index file page cache. We do a binary search within the index, which involves parsing index blocks touched by the algorithm. Index file pages are 4 KB chunks which are stored in LSA. To parse the first key of the block, we reuse clustering_parser, which is also used when parsing the data file. The parser is stateful and accepts consecutive chunks as temporary_buffers. The parser is supposed to keep its state across chunks. In `93482439`, the promoted index cursor was optimized to avoid fully page copy when parsing index blocks. Instead, parser is given a temporary_buffer which is a view on the page. A bit earlier, in `b1b5bda`, the parser was changed to keep shared fragments of the buffer passed to the parser in its internal state (across pages) rather than copy the fragments into a new buffer. This is problematic when buffers come from page cache because LSA buffers may be moved around or evicted. So the temporary_buffer which is a view on the LSA buffer is valid only around the duration of a single consume() call to the parser. If the blob which is parsed (e.g. variable-length clustering key component) spans pages, the fragments stored in the parser may be invalidated before the component is fully parsed. As a result, the parsed clustering key may have incorrect component values. This never causes parsing errors because the "length" field is always parsed from the current buffer, which is valid, and component parsing will end at the right place in the next (valid) buffer. The problematic path for clustering_key parsing is the one which calls primitive_consumer::read_bytes(), which is called for example for text components. Fixed-size components are not parsed like this, they store the intermediate state by copying data. This may cause incorrect clustering keys to be parsed when doing binary search in the index, diverting the search to an incorrect block. Details of the solution: We adapt page_view to a temporary_buffer-like API. For this, a new concept is introduced called ContiguousSharedBuffer. We also change parsers so that they can be templated on the type of the buffer they work with (page_view vs temporary_buffer). This way we don't introduce indirection to existing algorithms. We use page_view instead of temporary_buffer in the promoted index parser which works with page cache buffers. page_view can be safely shared via share() and stored across allocating sections. It keeps hold to the LSA buffer even across allocating sections by the means of cached_file::page_ptr. Fixes #20766 Closes scylladb/scylladb#20837 * github.com:scylladb/scylladb: sstables: bsearch_clustered_cursor: Add trace-level logging sstables: bsearch_clustered_cursor: Move definitions out of line test, sstables: Verify parsing stability when allocating section is retried test, sstables: Verify parsing stability when buffers cross page boundary sstables: bsearch_clustered_cursor: Switch parsers to work with page_view cached_file: Adapt page_view to ContiguousSharedBuffer cached_file: Change meaning of page_view::_size to be relative to _offset rather than page start sstables, utils: Allow parsers to work with different buffer types sstables: promoted_index_block_parser: Make reset() always bring parser to initial state sstables: bsearch_clustered_cursor: Switch read_block_offset() to use the read() method sstables: bsearch_clustered_cursor: Fix parsing when allocating section is retried	2024-10-01 00:02:55 +03:00
Calle Wilund	b5d167699c	commitlog: Fix buffer_list_bytes not updated correctly Fixes #20862 With the change in `60af2f3cb2` the bookkeep for buffer memory was changed subtly, the problem here that we would shrink buffer size before we after flush use said buffer's size to decrement the buffer_list_bytes value, previously inc:ed by the full, allocated size. I.e. we would slowly grow this value instead of adjusting properly to actual used bytes. Test included. Closes scylladb/scylladb#20886	2024-09-30 18:04:00 +03:00
Avi Kivity	5d68efe0bd	raft_group0_client: uninclude "db/system_keyspace.hh" It doesn't need it apart from a forward declaration. Files that lost necessary includes are adjusted, and some users of auth_version_t are redirected to the definition outside system_keyspace.	2024-09-28 16:31:53 +03:00
Laszlo Ersek	153279dbfa	test/boost/bptree_test: fix the CMake build Commit `4cf4b7d4ef` ("test: Move B+tree compactiont test from unit to boost", 2024-09-24) introduced the first SEASTAR_THREAD_TEST_CASE to "test/boost/bptree_test.cc" (alongside the prior BOOST_AUTO_TEST_CASEs), but missed changing the KIND of the test from BOOST to SEASTAR. Therefore we get a linker failure: > : && /usr/bin/clang++ -O2 -Xlinker --build-id=sha1 --ld-path=ld.lld > -dynamic-linker=/.../lib64/ld-linux-x86-64.so.2 > test/boost/CMakeFiles/bptree_test.dir/Dev/bptree_test.cc.o -o > test/boost/Dev/bptree_test -L$srcdir/idl/absl::headers > -Wl,-rpath,$srcdir/idl/absl::headers test/lib/Dev/libtest-lib.a > seastar/Dev/libseastar.a /usr/lib64/libxxhash.so > /usr/lib64/libboost_unit_test_framework.so.1.83.0 utils/Dev/libutils.a > -Xlinker --push-state -Xlinker --whole-archive auth/Dev/libscylla_auth.a > -Xlinker --pop-state /usr/lib64/libcrypt.so cdc/Dev/libcdc.a > compaction/Dev/libcompaction.a mutation_writer/Dev/libmutation_writer.a > -Xlinker --push-state -Xlinker --whole-archive dht/Dev/libscylla_dht.a > -Xlinker --pop-state types/Dev/libtypes.a index/Dev/libindex.a -Xlinker > --push-state -Xlinker --whole-archive locator/Dev/libscylla_locator.a > -Xlinker --pop-state message/Dev/libmessage.a gms/Dev/libgms.a > sstables/Dev/libsstables.a readers/Dev/libreaders.a > schema/Dev/libschema.a -Xlinker --push-state -Xlinker --whole-archive > tracing/Dev/libscylla_tracing.a -Xlinker --pop-state > Dev/libscylla-main.a -Xlinker --push-state -Xlinker --whole-archive > Dev/libscylla-zstd.a -Xlinker --pop-state /usr/lib64/libzstd.so > abseil/absl/strings/Dev/libabsl_cord.a > abseil/absl/strings/Dev/libabsl_cordz_info.a > abseil/absl/strings/Dev/libabsl_cord_internal.a > abseil/absl/strings/Dev/libabsl_cordz_functions.a > abseil/absl/strings/Dev/libabsl_cordz_handle.a > abseil/absl/crc/Dev/libabsl_crc_cord_state.a > abseil/absl/crc/Dev/libabsl_crc32c.a > abseil/absl/crc/Dev/libabsl_crc_internal.a > abseil/absl/crc/Dev/libabsl_crc_cpu_detect.a > abseil/absl/strings/Dev/libabsl_str_format_internal.a /usr/lib64/libz.so > service/Dev/libservice.a node_ops/Dev/libnode_ops.a > service/Dev/libservice.a node_ops/Dev/libnode_ops.a -lsystemd > raft/Dev/libraft.a repair/Dev/librepair.a streaming/Dev/libstreaming.a > replica/Dev/libreplica.a db/Dev/libdb.a mutation/Dev/libmutation.a > data_dictionary/Dev/libdata_dictionary.a cql3/Dev/libcql3.a > transport/Dev/libtransport.a cql3/Dev/libcql3.a > transport/Dev/libtransport.a lang/Dev/liblang.a > /usr/lib64/liblua-5.4.so -lm /usr/lib64/libsnappy.so.1.1.10 > abseil/absl/container/Dev/libabsl_raw_hash_set.a > abseil/absl/hash/Dev/libabsl_hash.a abseil/absl/hash/Dev/libabsl_city.a > abseil/absl/types/Dev/libabsl_bad_variant_access.a > abseil/absl/hash/Dev/libabsl_low_level_hash.a > abseil/absl/types/Dev/libabsl_bad_optional_access.a > abseil/absl/container/Dev/libabsl_hashtablez_sampler.a > abseil/absl/profiling/Dev/libabsl_exponential_biased.a > abseil/absl/synchronization/Dev/libabsl_synchronization.a > abseil/absl/debugging/Dev/libabsl_stacktrace.a > abseil/absl/synchronization/Dev/libabsl_graphcycles_internal.a > abseil/absl/synchronization/Dev/libabsl_kernel_timeout_internal.a > abseil/absl/debugging/Dev/libabsl_symbolize.a > abseil/absl/debugging/Dev/libabsl_debugging_internal.a > abseil/absl/base/Dev/libabsl_malloc_internal.a > abseil/absl/debugging/Dev/libabsl_demangle_internal.a > abseil/absl/time/Dev/libabsl_time.a > abseil/absl/strings/Dev/libabsl_strings.a > abseil/absl/strings/Dev/libabsl_strings_internal.a > abseil/absl/strings/Dev/libabsl_string_view.a > abseil/absl/base/Dev/libabsl_throw_delegate.a > abseil/absl/numeric/Dev/libabsl_int128.a > abseil/absl/base/Dev/libabsl_base.a > abseil/absl/base/Dev/libabsl_raw_logging_internal.a > abseil/absl/base/Dev/libabsl_log_severity.a > abseil/absl/base/Dev/libabsl_spinlock_wait.a -lrt > abseil/absl/time/Dev/libabsl_civil_time.a > abseil/absl/time/Dev/libabsl_time_zone.a rust/Dev/libwasmtime_bindings.a > rust/librust_combined.a utils/Dev/libutils.a seastar/Dev/libseastar.a > /usr/lib64/libboost_program_options.so /usr/lib64/libboost_thread.so > /usr/lib64/libboost_chrono.so /usr/lib64/libboost_atomic.so > /usr/lib64/libcares.so /usr/lib64/libfmt.so.10.2.1 /usr/lib64/liblz4.so > /usr/lib64/libgnutls.so -latomic /usr/lib64/libsctp.so > /usr/lib64/libprotobuf.so /usr/lib64/libyaml-cpp.so > /usr/lib64/libhwloc.so /usr/lib64/libnuma.so /usr/lib64/libxxhash.so > /usr/lib64/libcryptopp.so /usr/lib64/libdeflate.so > /usr/lib64/libboost_regex.so.1.83.0 /usr/lib64/libicui18n.so > /usr/lib64/libicuuc.so -ldl && : > ld.lld: error: undefined symbol: main > >>> referenced by > /usr/bin/../lib/gcc/x86_64-redhat-linux/14/../../../../lib64/crt1.o:(_start) > > ld.lld: error: undefined symbol: > seastar::testing::seastar_test::seastar_test(char const, char const, > int, boost::unit_test::decorator::collector_t&) > ooo referenced by bptree_test.cc > >>> > test/boost/CMakeFiles/bptree_test.dir/Dev/bptree_test.cc.o:(_GLOBAL__sub_I_bptree_test.cc) > clang++: error: linker command failed with exit code 1 (use -v to see invocation) Fix the KIND now. Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-09-27 12:21:17 +02:00
Laszlo Ersek	5fa87cb1c6	test/boost/auth_test: fix the CMake build Commit `78ab1ee8b7` ("test: Add tests for `CREATE ROLE WITH SALTED HASH`", 2024-09-20) made test/boost/auth_test dependent on cql3, but didn't encode the dependency in "CMakeLists.txt": > FAILED: > test/boost/CMakeFiles/auth_test.dir/RelWithDebInfo/auth_test.cc.o > /usr/bin/clang++ -DBOOST_ALL_DYN_LINK -DFMT_SHARED > -DSCYLLA_BUILD_MODE=release -DSEASTAR_API_LEVEL=7 > -DSEASTAR_LOGGER_COMPILE_TIME_FMT -DSEASTAR_LOGGER_TYPE_STDOUT > -DSEASTAR_SCHEDULING_GROUPS_COUNT=16 -DSEASTAR_SSTRING > -DSEASTAR_TESTING_MAIN -DXXH_PRIVATE_API > -DCMAKE_INTDIR=\"RelWithDebInfo\" -I$srcdir -I$srcdir/build/gen > -I$srcdir/seastar/include -I$srcdir/build/seastar/gen/include > -I$srcdir/build/seastar/gen/src -isystem $srcdir/abseil -isystem > $srcdir/build/rust -ffunction-sections -fdata-sections -O3 -g -gz > -std=gnu++23 -fvisibility=hidden -Wall -Werror -Wextra > -Wno-error=deprecated-declarations -Wimplicit-fallthrough > -Wno-c++11-narrowing -Wno-deprecated-copy -Wno-mismatched-tags > -Wno-missing-field-initializers -Wno-overloaded-virtual > -Wno-unsupported-friend -Wno-enum-constexpr-conversion > -Wno-unused-parameter -ffile-prefix-map=$srcdir/build=. -march=westmere > -Xclang -fexperimental-assignment-tracking=disabled -mllvm > -inline-threshold=2500 -fno-slp-vectorize -Werror=unused-result -MD -MT > test/boost/CMakeFiles/auth_test.dir/RelWithDebInfo/auth_test.cc.o -MF > test/boost/CMakeFiles/auth_test.dir/RelWithDebInfo/auth_test.cc.o.d -o > test/boost/CMakeFiles/auth_test.dir/RelWithDebInfo/auth_test.cc.o -c > $srcdir/test/boost/auth_test.cc > $srcdir/test/boost/auth_test.cc:22:10: fatal error: 'cql3/CqlParser.hpp' > file not found > 22 \| #include "cql3/CqlParser.hpp" > \| ^~~~~~~~~~~~~~~~~~~~ > 1 error generated. State the dependency now. Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-09-27 11:38:03 +02:00
Tomasz Grabiec	0279ac5faa	test, sstables: Verify parsing stability when allocating section is retried	2024-09-27 01:25:15 +02:00
Tomasz Grabiec	c09fa0cb98	test, sstables: Verify parsing stability when buffers cross page boundary	2024-09-27 01:25:15 +02:00

1 2 3 4 5 ...

3514 Commits