scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-28 10:41:12 +00:00

Author	SHA1	Message	Date
Nikos Dragazis	cc10a5f287	test: Check validation errors in scrub tests Scrub was extended in PR #11074 to report validation errors but the unit tests were not updated. Update the tests to check the validation errors reported by scrub. Validation errors must be zero for valid SSTables and non-zero for invalid SSTables. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2024-09-11 12:28:59 +03:00
Nikos Dragazis	719757fba9	sstables: Enable checksum validation for uncompressed SSTables Extend the `sstable::validate()` to validate the checksums of uncompressed SSTables. Given that this is already supported for compressed SSTables, this allows us to provide consistent behavior across any type of SSTable, be it either compressed or uncompressed. The most prominent use case for this is scrub/validate, which is now able to detect file-level corruption in uncompressed SSTables as well. Note that this change will not affect normal user reads which skip checksum validation altogether. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2024-09-11 12:28:59 +03:00
Nikos Dragazis	716fc487fd	sstables: Expose integrity option via crawling mutation readers Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2024-09-11 12:28:59 +03:00
Nikos Dragazis	1d2dc9f2e1	sstables: Expose integrity option via data_consume_rows() Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2024-09-11 12:28:59 +03:00
Nikos Dragazis	2feced32f7	sstables: Add option for integrity check in data streams Add a new boolean parameter in `sstable::data_stream()` to enable/disable integrity mechanisms in the underlying data streams. Currently, this only affects uncompressed SSTables and it allows to enable/disable checksum validation on each chunk. The validation happens transparently via the checksummed data source implementation. The reason we need this option is to allow differentiating the behavior between normal user reads and scrub/validate reads. We would like to enable scrub to verify checksums for uncompressed SSTables, while leaving normal user reads unchanged for performance reasons (read amplification due to round up of reads to chunk size and loading of the CRC component). Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2024-09-11 12:27:54 +03:00
Nikos Dragazis	d5bd40ad2c	sstables: Remove unused variable Remove unused stream variable from `sstable::data_stream()`. This was introduced in commit `47e07b787e` but never used. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2024-09-11 12:27:54 +03:00
Nikos Dragazis	2575d20f41	sstables: Add checksum in the SSTable components Uncompressed SSTables store their checksums in a separate CRC.db file. Add this in the list of SSTable components. Since this component is used only for validation, load the component on-demand for validation tasks and delete it when all validation tasks finish. In more detail: - Make the checksum component shareable and weakly referencable. Also, add a constructor since it is no longer an aggregate. - Use a weak pointer to store a non-owning reference in the components and a shared pointer to keep the object alive while validation runs. Once validation finishes, the component should be cleaned up automatically. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2024-09-11 12:27:38 +03:00
Nikos Dragazis	b7dfba4c18	sstables: Introduce checksummed file data source implementation Introduce a new data source implementation for uncompressed SSTables. This is just a thin wrapper for a raw data source that also performs checksum validation for each chunk. This way we can have consistent behavior for compressed and uncompressed SSTables. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2024-09-11 12:26:18 +03:00
Botond Dénes	0e5b444777	Merge 'database::get_all_tables_flushed_at: fix return value' from Lakshmi Narayanan Sreethar The `database::get_all_tables_flushed_at` method returns a variable without setting the computed all_tables_flushed_at value. This causes its caller, `maybe_flush_all_tables` to flush all the tables everytime regardless of when they were last flushed. Fix this by returning the computed value from `database::get_all_tables_flushed_at`. Fixes #20301 Requires a backport to 6.0 and 6.1 as they have the same issue. Closes scylladb/scylladb#20471 * github.com:scylladb/scylladb: cql-pytest: add test to verify compaction_flush_all_tables_before_major_seconds config database::get_all_tables_flushed_at: fix return value	2024-09-11 11:43:45 +03:00
Benny Halevy	4e8f3f4cdd	cql-pytest: add test_compaction_tombstone_gc Test tombstone garbage collection with: 1. conflicting live data in memtable (verifying there is no regression in this area) 2. deletion in memtable (reproducing scylladb/scylladb#20423) 3. materialized view update in memtable (reproducing scylladb/scylladb#20424) in materialized_views Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-09-10 19:06:23 +03:00
Benny Halevy	9270348c38	sstable_compaction_test: add mv_tombstone_purge_test Simulate view updates pattern and verify that they don't inhibit tombstone garbage collection. Verify fix for scylladb/scylladb#20424 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-09-10 19:06:23 +03:00
Benny Halevy	0407e50aa4	sstable_compaction_test: tombstone_purge_test: test that old deleted data do not inhibit tombstone garbage collection Tests fix for scylladb/scylladb#20423 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-09-10 19:06:06 +03:00
Benny Halevy	a7caa79df7	sstable_compaction_test: tombstone_purge_test: add testlog debugging Add some testlog debug printouts for the make_* helpers. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-09-10 19:05:58 +03:00
Benny Halevy	470d301fe3	sstable_compaction_test: tombstone_purge_test: make_expiring: use next_timestamp Rather than forging a timestamp from the gc_clock just use `next_timestamp` do it can be considered for tomebstone purging purposes. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-09-10 19:05:58 +03:00
Benny Halevy	5849ba83e0	sstable, compaction: add debug logging for extended min timestamp stats Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-09-10 19:05:57 +03:00
Benny Halevy	7d893a5ed9	compaction: get_max_purgeable_timestamp: use memtable and sstable extended timestamp stats When purging regular tombstone consult the min_live_timestamp, if available. For shadowable_tombstones, consult the min_memtable_live_row_marker_timestamp, if available, otherwise fallback to the min_live_timestamp. If both are missing, fallback to the legacy (and inaccurate) min_timestamp. Fixes scylladb/scylladb#20423 Fixes scylladb/scylladb#20424 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-09-10 19:05:57 +03:00
Benny Halevy	57e9e9c369	compaction: define max_purgeable_fn Before we add a new, is_shadowable, parameter to it. And define global `can_always_purge` and `can_never_purge` functions, a-la `always_gc` and `never_gc`. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-09-10 19:05:57 +03:00
Benny Halevy	b6fabd98c6	tombstone: can_gc_fn: move declaration to compaction_garbage_collector.hh And define `never_gc` globally, same as `always_gc` Before adding a new, is_shadowable parameter to it. Since it is used in the context of compaction it better fits compaction_garbage_collector header rather than tombstone.hh Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-09-10 19:05:57 +03:00
Benny Halevy	4de4af954f	sstables: scylla_metadata: add ext_timestamp_stats Store and retrieve the optional extended timestamp statistics (min_live_timestamp and min_live_row_marker_timestamp) in the scylla_metadata component. Note that there is no need for a cluster feature to store those attributes since the scylla_metadata on-disk format is extensible so that old sstables can be read by new versions, seeing the extra stats is missing, and new sstables can be read by old versions that ignore unknown scylla metadata section types. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-09-10 19:05:57 +03:00
Benny Halevy	6f202cf48b	compaction_group, storage_group, table_state: add extended timestamp stats getters To return the minimum live timestamp and live row-marker timestamp across a compaction_group, storage_group, or table_state. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-09-10 19:05:57 +03:00
Benny Halevy	14d86a3a12	sstables, memtable: track live timestamps When garbage collecting tombstones, we care only about shadowing of live data. However, currently we track min/max timestamp of both live and dead data, but there is no problem with purging tombstones that shadow dead data (expired or shdowed by other tombstones in the sstable/memtable). Also, for shadowable tombstones, we track live row marker timestamps separately since, if the live row marker timestamp is greater than a shadowable tombstone timestamp, then the row marker would shadow the shadowable tombstone thus exposing the cells in that row, even if their timestasmp may be smaller than the shadow tombstone's. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-09-10 19:05:49 +03:00
Abhi	9b09439065	raft: Add descriptions for requested abort errors Fixes: scylladb/scylladb#18902 Closes scylladb/scylladb#20291	2024-09-10 17:56:29 +02:00
Botond Dénes	de81388edb	Merge 'commitlog: Handle oversized entries' from Calle Wilund Refs #18161 Yet another approach to dealing with large commitlog submissions. We handle oversize single mutation by adding yet another entry typo: fragmented. In this case we only add a fragment (aha) of the data that needs storing into each entry, along with metadata to correlate and reconstruct the full entry on replay. Because these fragmented entries are spread over N segments, we also need to add references from the first segment in a chain to the subsequent ones. These are released once we clear the relevant cf_id count in the base. * This approach has the downside that due to how serialization etc works w.r.t. mutations, we need to create an intermediate buffer to hold the full serialized target entry. This is then incrementally written into entries of < max_mutation_size, successively requesting more segments. On replay, when encountering a fragment chain, the fragment is added to a "state", i.e. a mapping of currently processing frag chains. Once we've found all fragments and concatenated the buffers into a single fragmented one, we can issue a replay callback as usual. Note that a replay caller will need to create and provide such a state object. Old signature replay function remains for tests and such. This approach bumps the file format (docs to come). To ensure "atomicity" we both force synchronization, and should the whole op fail, we restore segment state (rewinding), thus discarding data all we wrote. Closes scylladb/scylladb#19472 * github.com:scylladb/scylladb: commitlog/database: Make some commitlog options updatable + add feature listener features/config: Add feature for fragmented commitlog entries docs: Add entry on commitlog file format v4 commitlog_test: Add more oversized cases commitlog_replayer: Replay segments in order created commitlog_replayer: Use replay state to support fragmented entries commitlog_replayer: coroutinize partly commitlog: Handle oversized entries	2024-09-10 17:15:46 +03:00
Benny Halevy	8d67357c42	memtable_encoding_stats_collector: update row_marker: do nothing if missing If the row_marker is missing then its timestamp is missing as well, so there's no point calling update_timestamp for it. Better return early. This should cause no functional change. The following patch will add more logic for tracking extended timestamp stats. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-09-10 16:46:34 +03:00
Pavel Emelyanov	b6f662417c	table: Remove unused database& argument from take_snapshot() method Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#20496	2024-09-10 14:53:06 +03:00
Gleb Natapov	af83c5e53e	group0: stop group0 before draining storage service during shutdown Currently storage service is drained while group0 is still active. The draining stops commitlogs, so after this point no more writes are possible, but if group0 is still active it may try to apply commands which will try to do writes and they will fail causing group0 state machine errors. This is benign since we are shutting down anyway, but better to fix shutdown order to keep logs clean. Fixes scylladb/scylladb#19665	2024-09-10 13:15:56 +02:00
Lakshmi Narayanan Sreethar	a0f4fe3fc4	cql-pytest: add test to verify compaction_flush_all_tables_before_major_seconds config Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2024-09-10 16:39:05 +05:30
Lakshmi Narayanan Sreethar	4ca720f0bd	database::get_all_tables_flushed_at: fix return value The `database::get_all_tables_flushed_at` method returns a variable without setting the computed all_tables_flushed_at value. This causes its caller, `maybe_flush_all_tables` to flush all the tables everytime regardless of when they were last flushed. Fix this by returning the computed value from `database::get_all_tables_flushed_at`. Fixes #20301 Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2024-09-10 16:35:47 +05:30
Yaniv Michael Kaul	a4ff0aae47	HACKIGN.md: clarify the use of dbuild when running test.py If you are using dbuild, that's where test.py needs to run. Also, replace 'Docker image' with the more generic 'container' term. Signed-off-by: Yaniv Kaul <yaniv.kaul@scylladb.com> Closes scylladb/scylladb#20336	2024-09-10 13:40:45 +03:00
Botond Dénes	08f109724b	docs/cql/ddl.rst: fix description of sstable_compression ScyllaDB doesn't support custom compressors. The available compressors are the only available ones, not the default ones. Adjust the text to reflect this. Closes scylladb/scylladb#20225	2024-09-10 13:39:24 +03:00
Pavel Emelyanov	cfa59ab73d	test: Use single temp dir for sharded<sstables::test_env> The test-env in question is mostly started in one-shard mode. Also there are several boost tests that start sharded<> environment. In that case instances on different shards live in different temp dirs. That's not critical yet, but better to have single directory for the whole test. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#20412	2024-09-10 11:25:04 +03:00
Artsiom Mishuta	f95c257a1e	[test.py]: Fail test teardown in case of task leakage In test.py every asyncio task spawned during the test must be finished before the next test, otherwise, tests might affect each other results. The developers are responsible for writing asyncio code in a way that doesn’t leave task objects unfinished. Test.py has a mechanism that helps test writers avoid such tasks. At the end of each test case, it verifies that the test did not produce/leave any tasks and sets an event object that fails the next test at the start if this is the case(issue https://github.com/scylladb/scylladb/issues/16472) The problem with this was that breaking the next test was counterintuitive, and the logging for this situation was insufficient and unobvious. notes: Task.cancel() is not an option to avoid task leakage 1) Calling cancel() Does Not Cancel The Task : the cancel() method just request that the target task cancel. 2) Calling cancel() Does Not Block Until The Task is Cancelled: If the caller needs to know the task is cancelled and done, it could await for the target 3) In particular PR, task.cancel() cancell task on client(ManagerClient) but not on http server(ScyllaManager). so "await" is needed. Closes scylladb/scylladb#20012	2024-09-10 10:51:45 +03:00
Pavel Emelyanov	ac2127a640	test: Call table::make_sstable() directly in compaction test The test in question generates a bunch of table_for_tests objects and creates sstables for each. For that it calls test_env::make_sstable(), but it can be made shorter, by calling table method directly. The hidden goal of this change is to remove the explicit caller of table::dir() method. The latter is going away. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#20451	2024-09-10 10:19:20 +03:00
Botond Dénes	76bb22664a	Merge 'Sanitize open_sstables() helper in compaction test' from Pavel Emelyanov This includes - coroutinization - elimination of unused overload Closes scylladb/scylladb#20456 * github.com:scylladb/scylladb: test: Squash two open_sstables() helper together test: Coroutinize open_sstables() helper	2024-09-10 10:18:33 +03:00
Botond Dénes	a4a4797e27	Merge 'Alternator: tests and other preparation towards allowing adding a GSI to an existing table' from Nadav Har'El This series prepares us for working on #11567 - allow adding a GSI to a pre-existing table. This will require changing the implementation of GSIs in Alternator to not use real columns in the schema for the materialized view, and instead of a computed column - a function which extracts the desired member from the `:attrs` map and de-serializes it. This series does not contain the GSI re-implementation itself. Rather it contains a few small cleanups and mostly - new regression tests that cover this area, of adding and removing a GSI, and using a GSI, in more details than the tests we already had. I developed most of these tests while working on buggy fixes for #11567; The bugs in those implementations were exposed by the tests added here - they exposed bugs both in the new feature of adding or removing a GSI, and also regressions to the ordinary operation of GSI. So these tests should be helpful for whoever ends up fixing #11567, be it me based on my buggy implementation (which is _not_ included in this patch series), or someone else. No backports needed - this is part of a new feature, which we don't usually backport. Closes scylladb/scylladb#20383 * github.com:scylladb/scylladb: test/alternator: more extensive tests for GSI with two new key attributes test/alternator: test invalid key types for GSI test/alternator: test combination of LSI and GSI test/alternator: expand another test to use different write operations test/alternator: test GSIs with different key types alternator: better error message in some cases of key type mismatch test/alternator: test for more elaborate GSI updates test/alternator: strengthen tests for empty attribute values test/alternator: fix typo in test_batch.py test/alternator: more checks for GSI-key attribute validation Alternator: drop unneeded "IS NOT NULL" clauses in MV of GSI/LSI test/alternator: add more checks for adding/deleting a GSI test/alternator: ensure table deletions in test_gsi.py	2024-09-10 10:13:52 +03:00
Pavel Emelyanov	42f8d06a17	test: Use correct schema in directory tests with created table There are some test cases in sstable_directory_test test actually create a table with CQL and then try to manipulate its sstables with the help of sstable_directory. Those tests use existing local helper that starts sharded<sstable_directory> and this helper passes test-local static schema to sstable_directory constructor. As a result -- the schema of a table that test case created and the schema that sstable_directory works with are different. They match in the columns layout, which helps the test cases pass, but otherwise are two different schema objects with different IDs. It's more correct to use table schema for those runs. The fix introduces another helper to start sharded<sstable_directory>, and the older wrapper around cql_test_env becomes unused. Drop it too not to encourage future tests use it and re-introduce schema mismatch again. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#20499	2024-09-10 09:56:26 +03:00
Benny Halevy	f47b5e60bc	sstable_directory: create_pending_deletion_log: place pending_delete log under the base directory To be able to atomically delete sstables both in base table directory and in its sub-directories, like `staging/`, use a shared pending_delete_dir under under the base directory. Note that this requires loading and processing the base directory first. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-09-10 09:28:13 +03:00
Benny Halevy	44bd183187	sstables: storage: keep base directory in base class so we can use the base (table) directory for e.g. pending_delete logs, in the next patch. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-09-10 09:28:13 +03:00
Benny Halevy	027e64876a	sstables: storage: define opened_directory in header file So it can be used outside the storage module in the following patches. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-09-10 09:28:13 +03:00
Benny Halevy	a7b92d7b6f	sstable_directory: use only dirlog Currently, there are leftover log messages using sstlog rather than dirlog, that was introduced in `aebd965f0e`, and that makes debugging harder. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-09-10 09:28:11 +03:00
Botond Dénes	fc690a60d8	Update tools/cqlsh submodule * tools/cqlsh 86a280a1...b09bc793 (6): > build(deps): bump actions/download-artifact in /.github/workflows > cqlshlib/test: Add test_formatting.py > cqlshlib/test: Use assertEqual instead of assertEquals > cqlsh.py: Send DESCRIBE statement to server before parsing > cqlsh.py: Fix indentation > cqlsh.py: change shebang to /usr/bin/env python3	2024-09-10 08:11:40 +03:00
Lakshmi Narayanan Sreethar	2148e33d37	compaction: remove unnecessary share bump for split, scrub, and upgrade When split, scrub, and upgrade compactions ran under the compaction group, they had to bump up their shares to a minimum of 200 to prevent slow progress as they neared completion, especially in workloads with inconsistent ingestion rates. Since commit `e86965c2` moved these compactions to the maintenance group, this share bump is no longer necessary. This patch removes the unnecessary share allocation. Fixes #20224 Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com> Closes scylladb/scylladb#20495	2024-09-09 22:03:38 +03:00
Avi Kivity	9448260b30	Merge 'major compaction: check only sstables being compacted for tombstone garbage collection' from Lakshmi Narayanan Sreethar Any expired tombstone can be garbage collected if it doesn't shadow data in the commit log, memtable, or uncompacting SSTables. This PR introduces a new mode to major compaction, enabled by the `consider_only_existing_data` flag that bypasses these checks. When enabled, memtables and old commitlog segments are cleared with a system-wide flush and all the sstables (after flush) are included in the compaction, so that it works with all data generated up to a given time point. This new mode works with the assumption that newly written data will not be shadowed by expired tombstones. So it ignores new sstables (and new data written to memtable) created after compaction started. Since there was a system wide flush, commitlog checks can also be skipped when garbage collecting tombstones. Introducing data shadowed by a tombstone during compaction can lead to undefined behavior, even without this PR, as the tombstone may or may not have already been garbage collected. Fixes #19728 Closes scylladb/scylladb#20031 * github.com:scylladb/scylladb: cql-pytest: add test to verify consider_only_existing_data compaction option tools/scylla-nodetool: add consider-only-existing-data option to compact command api: compaction: add `consider_only_existing_data` option compaction: consider gc_check_only_compacting_sstables when deducing max purgeable timestamp compaction: do not check commitlog if gc_check_only_compacting_sstables is enabled tombstone_gc_state: introduce with_commitlog_check_disabled() compaction: introduce new option to check only compacting sstables for gc compaction: rename maybe_flush_all_tables to maybe_flush_commitlog compaction: maybe_flush_all_tables: add new force_flush param	2024-09-09 20:45:41 +03:00
Avi Kivity	894b85ce95	Merge 'hints: send hints with CL=ALL if target is leaving' from Piotr Dulikowski Currently, when attempting to send a hint, we might choose its recipients in one of two ways: - If the original destination is a natural endpoint of the hint, we only send the hint to that node and none other, - Otherwise, we send the hint to all current replicas of the mutation. There is a problem when we decommission a node: while data is streamed away from that node, it is still considered to be a natural endpoint of the data that it used to own. Because of that, it might happen that a hint is sent directly to it but streaming will miss it, effectively resulting in the hint being discarded. As sending the hint _only_ to the leaving replica is a rather bad idea, send the hint to all replicas also in the case when the original destination of the hint is leaving. Note that this is a conservative fix written only with the decommission + vnode-based keyspaces combo in mind. In general, such "data loss" can occur in other situations where the replica set is changing and we go through a streaming phase, i.e. other topology operations in case of vnodes and tablet load balancing. However, the consistency guarantees of hinted handoff in the face of topology changes are not defined and it is not clear what they should be, if there should be any at all. The picture is further complicated by the fact that hints are used by materialized views, and sending view updates to more replicas than necessary can introduce inconsistencies in the form of "ghost rows". This fix was developed in response to a failing test which checked the hint replay + decommission scenario, and it makes it work again. Fixes scylladb/scylla-dtest#4582 Refs scylladb/scylladb#19835 Should be backported to 6.0 and 6.1; the dtest started failing due to topology on raft, which sped up execution of the test and exposed the preexisting problem. Closes scylladb/scylladb#20488 * github.com:scylladb/scylladb: test: topology_custom/test_hints: consistency test for decommission test: topology_custom/test_hints: move sync point helpers to top level test: topology/util: extract find_server_by_host_id hints: send hints with CL=ALL if target is leaving hints: inline do_send_one_mutation	2024-09-09 18:23:13 +03:00
Avi Kivity	c3e19425bd	Merge 'docs/dev/docker-hub.md: refresh aio-max-nr calculation' from Laszlo Ersek ~~~ What we have today in "docs/dev/docker-hub.md" on "aio-max-nr" dates back to scylla commit `f4412029f4` ("docs/docker-hub.md: add quickstart section with --smp 1", 2020-09-22). Problems with the current language: - The "65K" claim as default value on non-production systems is wrong; "fs/aio.c" in Linux initializes "aio_max_nr" to 0x10000, which is 64K. - The section in question uses equal signs (=) incorrectly. The intent was probably to say "which means the same as", but that's not what equality means. - In the same section, the relational operator "<" is bogus. The available AIO count must be at least as high (>=) as the requested AIO count. - Clearer names should be used; adjust_max_networking_aio_io_control_blocks() in "src/core/reactor.cc" sets a great example: - "reactor::max_aio" should be called "storage_iocbs", - "detect_aio_poll" should be called "preempt_iocbs", - "reactor_backend_aio::max_polls" should be called "network_iocbs". - The specific value 10000 for the last one ("network_iocbs") is not correct in scylla's context. It is correct as the Seastar default, but scylla has used 50000 since commit `2cfc517874` ("main, test: adjust number of networking iocbs", 2021-07-18). Rewrite the section to address these problems. See also: - https://github.com/scylladb/scylladb/issues/5981 - https://github.com/scylladb/seastar/pull/2396 - https://github.com/scylladb/scylladb/pull/19921 Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com> ~~~ No need for backporting; the documentation being refreshed targets developers as audience, not end-users. Closes scylladb/scylladb#20398 * github.com:scylladb/scylladb: docs/dev/docker-hub.md: refresh aio-max-nr calculation docs/dev/docker-hub.md: strip trailing whitespace	2024-09-09 15:04:38 +03:00
Botond Dénes	3e0bff161c	Merge 'Use yielding directory lister in sstable_directory' from Pavel Emelyanov The yielding lister is considered to be better replacement that scan_dir(lambda) one. Also, the sstable directory will be patched to scan the contents of S3 bucket and yielding lister fits better for generalization. Closes scylladb/scylladb#20114 * github.com:scylladb/scylladb: sstable_directory: Fix indentation after previous patches sstable_directory: Use yielding lister in .handle_sstables_pending_delete() sstable_directory: Use yielding lister in .cleanup_column_family_temp_sst_dirs() sstable_directory: Use yielding lister in .prepare() sstable_directory: Shorten lister loop sstable_directory: Use with_closeable() in .process() directory_lister: Add noexcept default move-constructor	2024-09-09 14:35:51 +03:00
Pavel Emelyanov	0f48847d02	test: Use shorter with_sstable_directory overload() In sstable directory test there are two of those -- one that works on path, state, env and callback, and the other one that just needs env and callback, getting path from env and assuming state is normal. Two test cases in this test can enjoy the shorter one. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#20395	2024-09-09 14:25:24 +03:00
Pavel Emelyanov	2bfbbaffac	test: Use sstables::test_env to make sstables for schema loader test This test calls manager directly, but it's shorter to ask test_env for that Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#20431	2024-09-09 14:22:58 +03:00
Takuya ASADA	e36c939505	dist: tune LimitNOFILES for large nodes On very large node, LimitNOFILES=80000 may not enough size, it can cause "Too many files" error. To avoid that, let's increase LimitNOFILES on scylla_setup stage, generate optimal value calurated from memory size and number of cpus. Closes scylladb/scylla-enterprise#4304 Closes scylladb/scylladb#20443	2024-09-09 14:13:49 +03:00
Piotr Smaron	60af48f5fd	cql: fix exception when validating KS in CREATE TABLE `c70f321c6f` added an extra check if KS exists. This check can throw `data_dictionary::no_such_keyspace` exception, which is supposed to be caught and a more user-friendly exception should be thrown instead. This commit fixes the above problem and adds a testcase to validate it doesn't appear ever again. Also, I moved the check for the keyspace outside of the `for` loop, as it doesn't need to be checked repeatedly. Fixes: scylladb/scylladb#20097 Closes scylladb/scylladb#20404	2024-09-09 13:30:57 +03:00

1 2 3 4 5 ...

44545 Commits