scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-31 03:56:42 +00:00

Author	SHA1	Message	Date
Takuya ASADA	6fcbf66bfb	scylla_sysconfig_setup: handle >=32CPUs correctly Seems like `59adf05` has a bug, the regex pattern only handles first 32CPUs cpuset pattern, and ignores rest. We should extend regex pattern to handle all CPUs. Fixes #10523 Closes #10524 (cherry picked from commit `a9dfe5a8f4`)	2022-05-30 14:27:27 +03:00
Takuya ASADA	e9a3dee234	scylla_sysconfig_setup: avoid parse error on perftune.py --get-cpu-mask Currently, we just pass the entire output of perftune.py when getting the CPU mask from the script, but this may cause a parse error since the script may also print a warning message. To avoid that, we need to extract the CPU mask from the output. Fixes #10082 Closes #10107 (cherry picked from commit `59adf05951`)	2022-05-30 14:25:21 +03:00
Avi Kivity	279cd44c7f	Update seastar submodule (xfs project attribute zeroed) * seastar 6745a43c10...7a430a0830 (1): > file: don't trample on xfs flags when setting xfs size hint Fixes #10667.	2022-05-29 17:43:43 +03:00
Avi Kivity	c99f768381	Merge 'Rework off strategy compaction locking for branch 5.0' from Raphael "Raph" Carvalho First patch removes incorrect usage of rwlock which should be restricted to minor and major compaction tasks. Second patch revives a semaphore, which was lost in `6737c88045`, as we want major to not wait on off-strategy completion before deciding whether or not it should proceed with execution. It wouldn't proceed with execution if user asked major to stop while waiting for a chance to run. For master, we're going to rely on abortable variant of get_units() to allow major to be quickly aborted. Fixes #10485. Closes #10582 * github.com:scylladb/scylla: compaction_manager: Revive custom job semaphore compaction_manager: Remove rwlock usage in run_custom_job()	2022-05-29 17:38:01 +03:00
Tomasz Grabiec	89a540d54a	sstable: partition_index_cache: Fix abort on bad_alloc during page loading When entry loading fails and there is another request blocked on the same page, attempt to erase the failed entry will abort because that would violate entry_ptr guarantees, which is supposed to keep the entry alive. The fix in `92727ac36c` was incomplete. It only helped for the case of a single loader. This patch makes a more general approach by relaxing the assert. The assert manifested like this: scylla: ./sstables/partition_index_cache.hh:71: sstables::partition_index_cache::entry::~entry(): Assertion `!is_referenced()' failed. Fixes #10617 Closes #10653 (cherry picked from commit `f87274f66a`)	2022-05-27 09:50:32 +03:00
Yaron Kaikov	338edcc02e	release: prepare for 5.0.rc6 scylla-5.0.rc6	2022-05-23 11:37:37 +03:00
Avi Kivity	a8eb5164b2	Update seastar submodule (io_queue delay metrics in 25ms granularity) * seastar 4a30c44c4c...6745a43c10 (1): > metrics: Report IO total times as real numbers Ref #10392	2022-05-19 18:20:15 +03:00
Raphael S. Carvalho	9accb44f9c	compaction_manager: Revive custom job semaphore In commit `6737c88045`, we started using a single semaphore for maintenance operations, which is a good change. However, after introduction of off-strategy, major cannot proceed until off-strategy is done reshaping all its input files. If user requests major to abort, the command will only return once off-strategy is done, and that can take lots of time. In master, we'll allow pending major to be quickly aborted, but that's not possible here as abortable variant of get_units() is not available yet. Here, we'll allow major to proceed in parallel to off-strategy, so major can decide whether or not it should run in parallel. Fixes #10485. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-05-16 20:46:31 -03:00
Raphael S. Carvalho	8878007106	compaction_manager: Remove rwlock usage in run_custom_job() The rwlock usage was introduced in 2017 commit `10eaa2339e`. Resharding was online back then and we want to serialize it with major. Rwlock usage should be restricted to major and minor, as clearly stated in the documentation, but we're still using it in run_custom_job(). It gains us nothing, it only prevents off-strategy and other custom jobs from running concurrently to major. Let's kill this as we want to allow off-strategy to not prevent a major from happening in parallel, as the former works only on the maintenance sstable set and won't interfere with the latter. Refs #10485. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2022-05-16 20:45:54 -03:00
Yaron Kaikov	9da666e778	release: prepare for 5.0.rc5 scylla-5.0.rc5	2022-05-15 22:09:16 +03:00
Benny Halevy	aca355dec1	table: clear: serialize with ongoing flush Get all flush permits to serialize with any ongoing flushes and preventing further flushes during table::clear, in particular calling discard_completed_segments for every table and clearing the memtables in clear_and_add. Fixes #10423 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `aae532a96b`)	2022-05-15 13:39:03 +03:00
Raphael S. Carvalho	efbb2efd3f	compaction: LCS: don't write to disengaged optional on compaction completion Dtest triggers the problem by: 1) creating table with LCS 2) disabling regular compaction 3) writing a few sstables 4) running maintenance compaction, e.g. cleanup Once the maintenance compaction completes, disengaged optional _last_compacted_keys triggers an exception in notify_completion(). _last_compacted_keys is used by regular for its round-robin file picking policy. It stores the last compacted key for each level. Meaning it's irrelevant for any other compaction type. Regular compaction is responsible for initializing it when it runs for the first time to pick files. But with it disabled, notify_completion() will find it uninitialized, therefore resulting in bad_optional_access. To fix this, the procedure is skipped if _last_compacted_keys is disengaged. Regular compaction, once re-enabled, will be able to fill _last_compacted_keys by looking at metadata of the files. compaction_test.py::TestCompaction::test_disable_autocompaction_doesnt_block_user_initiated_compactions[CLEANUP-LeveledCompactionStrategy] now passes. Fixes #10378. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #10508 (cherry picked from commit `8e99d3912e`)	2022-05-15 13:20:11 +03:00
Eliran Sinvani	44dc5c4a1d	Revert "table: disable_auto_compaction: stop ongoing compactions" This reverts commit `4affa801a5`. In issue #10146 a write throughput drop of ~50% was reported. After bisecting, it was found that the change that caused it was adding some code to table::disable_auto_compaction which stops ongoing compactions and returns a future that resolves once all the compaction tasks for a table, if any, were terminated. It turns out that this function is used only at startup (and in REST API calls which are not used in the test) in the distributed loader just before resharding and loading of the sstable data. It is then re-enabled after the resharding and loading is done. For a still unknown reason, adding the extra logic of stopping ongoing compactions made the write throughput drop to 50%. Strangely enough, this extra logic should (still unvalidated) not have any side effects since no compactions for a table are supposed to be running prior to loading it. This regains the performance but also undoes a change which eventually should get in once we find the actual culprit. Signed-off-by: Eliran Sinvani <eliransin@scylladb.com> Closes #10559 Reopens #9313. (cherry picked from commit `8e8dc2c930`)	2022-05-15 08:50:38 +03:00
Juliusz Stasiewicz	6b34ba3a4f	CQL: Replace assert by exception on invalid auth opcode One user observed this assertion fail, but it's an extremely rare event. The root cause - interlacing of processing STARTUP and OPTIONS messages - is still there, but now it's harmless enough to leave it as is. Fixes #10487 Closes #10503 (cherry picked from commit `603dd72f9e`)	2022-05-10 14:04:52 +02:00
Yaron Kaikov	f1e25cb4a6	release: prepare for 5.0.rc4	2022-05-10 07:35:53 +03:00
Benny Halevy	c9798746ae	compaction: time_window_compaction_strategy: reset estimated_remaining_tasks when running out of candidates _estimated_remaining_tasks gets updated via get_next_non_expired_sstables -> get_compaction_candidates, but otherwise if we return earlier from get_sstables_for_compaction, it does not get updated and may go out of sync. Refs #10418 (to be closed when the fix reaches branch-4.6) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #10419 (cherry picked from commit `01f41630a5`)	2022-05-09 09:35:53 +03:00
Eliran Sinvani	7f70ffc5ce	prepared_statements: Invalidate batch statement too It seems that batch prepared statements always return false for depends_on. This in turn renders the removal criteria from the prepared statements cache always false, which results in the queries not being evicted. Here we change the function to return the true state, meaning they will return true if one of the sub-queries is dependent upon the keyspace and/or column family. Fixes #10129 Signed-off-by: Eliran Sinvani <eliransin@scylladb.com> (cherry picked from commit `4eb0398457`)	2022-05-08 12:31:42 +03:00
Eliran Sinvani	551636ec89	cql3 statements: Change dependency test API to better express its purpose Cql statements used to have two API functions, depends_on_keyspace and depends_on_column_family. The latter took as a parameter only a table name, which makes no sense. There could be multiple tables with the same name, each in a different keyspace, and it doesn't make sense to generalize the test, i.e. to ask "Does a statement depend on any table named XXX?" In this change we unify the two calls into one, depends_on, which takes a keyspace name and optionally also a table name. That way every logical dependency test that makes sense is supported by a single API call. (cherry picked from commit `bf50dbd35b`) Ref #10129	2022-05-08 12:31:02 +03:00
Raphael S. Carvalho	e1130a01e7	table: Close reader if flush fails to peek into fragment An OOM failure while peeking into fragment, to determine if reader will produce any fragment, causes Scylla to abort as flat_mutation_reader expects reader to be closed before destroyed. Let's close it if peek() fails, to handle the scenario more gracefully. Fixes #10027. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20220204031553.124848-1-raphaelsc@scylladb.com> (cherry picked from commit `755cec1199`)	2022-05-08 12:16:15 +03:00
Calle Wilund	b0233cb7c5	cdc: Ensure columns removed from log table are registered as dropped If we are redefining the log table, we need to ensure any dropped columns are registered in "dropped_columns" table, otherwise clients will not be able to read data older than now. Includes unit test. Should probably be backported to all CDC enabled versions. Fixes #10473 Closes #10474 (cherry picked from commit `78350a7e1b`)	2022-05-05 11:38:18 +02:00
Avi Kivity	e480c5bf4d	Merge 'loading_cache: force minimum size of unprivileged ' from Piotr Grabowski This series enforces a minimum size of the unprivileged section when performing `shrink()` operation. When the cache is shrunk, we still drop entries first from unprivileged section (as before this commit), however, if this section is already small (smaller than `max_size / 2`), we will drop entries from the privileged section. This is necessary, as before this change the unprivileged section could be starved. For example if the cache could store at most 50 entries and there are 49 entries in privileged section, after adding 5 entries (that would go to unprivileged section) 4 of them would get evicted and only the 5th one would stay. This caused problems with BATCH statements where all prepared statements in the batch have to stay in cache at the same time for the batch to correctly execute. To correctly check if the unprivileged section might get too small after dropping an entry, `_current_size` variable, which tracked the overall size of cache, is changed to two variables: `_unprivileged_section_size` and `_privileged_section_size`, tracking section sizes separately. New tests are added to check this new behavior and bookkeeping of the section sizes. A test is added, that sets up a CQL environment with a very small prepared statement cache, reproduces issue in #10440 and stresses the cache. Fixes #10440. Closes #10456 * github.com:scylladb/scylla: loading_cache_test: test prepared stmts cache loading_cache: force minimum size of unprivileged loading_cache: extract dropping entries to lambdas loading_cache: separately track size of sections loading_cache: fix typo in 'privileged' (cherry picked from commit `5169ce40ef`)	2022-05-04 14:35:53 +03:00
Tomasz Grabiec	7d90f7e93f	loading_cache: Make invalidation take immediate effect There are two issues with current implementation of remove/remove_if: 1) If it happens concurrently with get_ptr(), the latter may still populate the cache using value obtained from before remove() was called. remove() is used to invalidate caches, e.g. the prepared statements cache, and the expected semantic is that values calculated from before remove() should not be present in the cache after invalidation. 2) As long as there is any active pointer to the cached value (obtained by get_ptr()), the old value from before remove() will be still accessible and returned by get_ptr(). This can make remove() have no effect indefinitely if there is persistent use of the cache. One of the user-perceived effects of this bug is that some prepared statements may not get invalidated after a schema change and still use the old schema (until next invalidation). If the schema change was modifying UDT, this can cause statement execution failures. CQL coordinator will try to interpret bound values using old set of fields. If the driver uses the new schema, the coordinator will fail to process the value with the following exception: User Defined Type value contained too many fields (expected 5, got 6) The patch fixes the problem by making remove()/remove_if() erase old entries from _loading_values immediately. The predicate-based remove_if() variant has to also invalidate values which are concurrently loading to be safe. The predicate cannot be evaluated on values which are not ready. This may invalidate some values unnecessarily, but I think it's fine. Fixes #10117 Message-Id: <20220309135902.261734-1-tgrabiec@scylladb.com> (cherry picked from commit `8fa704972f`)	2022-05-04 14:35:37 +03:00
Avi Kivity	3e6e8579c6	loading_cache: fix indentation of timestamped_val and two nested type aliases timestamped_val (and two other type aliases) are nested inside loading_cache, but indented as if they were top-level names. Adjust the indent to avoid confusion. Closes #10118 (cherry picked from commit `d1a394fd97`) Ref #10117 - backport prerequisite	2022-05-04 14:35:15 +03:00
Avi Kivity	3e98e17d18	Merge 'replica/database: drop_column_family(): properly cleanup stale querier cache entries' from Botond Dénes Said method has to evict all querier cache entries, belonging to the to-be-dropped table. This is already the case, but there was a window where new entries could sneak in, causing a stale reference to the table to be de-referenced later when they are evicted due to TTL. This window is now closed, the entries are evicted after the method has waited for all ongoing operations on said table to stop. Fixes: #10450 Closes #10451 * github.com:scylladb/scylla: replica/database: drop_column_family(): drop querier cache entries after waiting for ops replica/database: finish coroutinizing drop_column_family() replica/database: make remove(const column_family&) private (cherry picked from commit `7f1e368e92`)	2022-05-01 17:22:57 +03:00
Avi Kivity	a214f8cf6e	Update tools/java submodule (bad IPv6 addresses in nodetool) * tools/java b1e09c8b8f...2241a63bda (1): > CASSANDRA-17581 fix NodeProbe: Malformed IPv6 address at index Fixes #10442	2022-04-28 11:33:15 +03:00
Benny Halevy	e8b92fe34d	replica: distributed_database: populate_column_family: trigger offstrategy compaction only for the base directory In https://github.com/scylladb/scylla/issues/10218 we see off-strategy compaction happening on a table during the initial phases of `distributed_loader::populate_column_family`. It is caused by triggering offstrategy compaction too early, when sstables are populated from the staging directory in `a144d30162`. We need to trigger offstrategy compaction only for the base table directory, never the staging or quarantine dirs. Fixes #10218 Test: unit(dev) DTest: materialized_views_test.py::TestInterruptBuildProcess Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220316152812.3344634-1-bhalevy@scylladb.com> (cherry picked from commit `a1d0f089c8`)	2022-04-24 17:38:53 +03:00
Nadav Har'El	fa479c84ac	config: fix some types in system.config virtual table The system.config virtual table prints each configuration variable of type T based on the JSON printer specified in the config_type_for<T> in db/config.cc. For two variable types - experimental_features and tri_mode_restriction, the specified converter was wrong: We used value_to_json<string> or value_to_json<vector<string>> on something which was not a string. Unfortunately, value_to_json silently cast the given objects into strings, and the result was garbage: For example as noted in #10047, for experimental_features instead of printing a list of feature names, e.g., "raft", we got a bizarre list of one-byte strings with each feature's number (which isn't documented or even guaranteed to not change) as well as carriage-return characters (!?). So the solution is a new printable_to_json<T> which works on a type T that can be printed with operator<< - as in fact the above two types can - and the type is converted into a string or vector of strings using this operator<<, not a cast. Also added a cql-pytest test for reading system.config and in particular options of the above two types - checking that they contain sensible strings and not "garbage" like before this patch. Fixes #10047. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220209090421.298849-1-nyh@scylladb.com> (cherry picked from commit `fef7934a2d`)	2022-04-14 19:29:08 +03:00
Tomasz Grabiec	40c26dd2c5	utils/chunked_managed_vector: Fix sigsegv during reserve() Fixes the case of make_room() invoked with last_chunk_capacity_deficit but _size not in the last reserved chunk. Found during code review, no user impact. Fixes #10364. Message-Id: <20220411224741.644113-1-tgrabiec@scylladb.com> (cherry picked from commit `0c365818c3`)	2022-04-13 09:48:34 +03:00
Tomasz Grabiec	2c6f069fd1	utils/chunked_vector: Fix sigsegv during reserve() Fixes the case of make_room() invoked with last_chunk_capacity_deficit but _size not in the last reserved chunk. Found during code review, no known user impact. Fixes #10363. Message-Id: <20220411222605.641614-1-tgrabiec@scylladb.com> (cherry picked from commit `01eeb33c6e`)	2022-04-13 09:47:24 +03:00
Avi Kivity	e27dff0c50	transport: return correct error codes when downgrading v4 {WRITE,READ}_FAILURE to {WRITE,READ}_TIMEOUT Protocol v4 added WRITE_FAILURE and READ_FAILURE. When running under v3 we downgrade these exceptions to WRITE_TIMEOUT and READ_TIMEOUT (since the client won't understand the v4 errors), but we still send the new error codes. This causes the client to become confused. Fix by updating the error codes. A better fix is to move the error code from the constructor parameter list and hard-code it in the constructor, but that is left for a follow-up after this minimal fix. Fixes #5610. Closes #10362 (cherry picked from commit `987e6533d2`)	2022-04-13 09:47:24 +03:00
Tomasz Grabiec	3f03260ffb	utils/chunked_managed_vector: Fix corruption in case there is more than one chunk If reserve() allocates more than one chunk, push_back() should not work with the last chunk. This can result in items being pushed to the wrong chunk, breaking internal invariants. Also, pop_back() should not work with the last chunk. This breaks when there is more than one chunk. Currently, the container is only used in the sstable partition index cache. Manifests by crashes in sstable reader which touch sstables which have partition index pages with more than 1638 partition entries. Introduced in `78e5b9fd85` (4.6.0) Fixes #10290 Message-Id: <20220407174023.527059-1-tgrabiec@scylladb.com> (cherry picked from commit `41fe01ecff`)	2022-04-08 10:53:33 +03:00
Takuya ASADA	1315135fca	docker: enable --log-to-stdout which was mistakenly disabled Since our Docker image moved to Ubuntu, we mistakenly copy dist/docker/etc/sysconfig/scylla-server to /etc/sysconfig, which is not used in Ubuntu (it should be /etc/default). So /etc/default/scylla-server is just the default configuration of the scylla-server .deb package, and --log-to-stdout is 0, same as in a normal installation. We don't want to keep the duplicated configuration file anyway, so let's drop dist/docker/etc/sysconfig/scylla-server and configure /etc/default/scylla-server in build_docker.sh. Fixes #10270 Closes #10280 (cherry picked from commit `bdefea7c82`)	2022-04-07 12:13:19 +03:00
Yaron Kaikov	f92622e0de	release: prepare for 5.0.rc3 scylla-5.0.rc3	2022-04-06 14:31:03 +03:00
Takuya ASADA	3bca608db5	docker: run scylla as root Previous versions of the Docker image ran scylla as root, but `cb19048` accidentally changed it to the scylla user. To keep compatibility we need to revert this to root. Fixes #10261 Closes #10325 (cherry picked from commit `f95a531407`)	2022-04-05 12:46:25 +03:00
Takuya ASADA	a93b72d5dd	docker: revert scylla-server.conf service name change We changed supervisor service name at `cb19048`, but this breaks compatibility with scylla-operator. To fix the issue we need to revert the service name to previous one. Fixes #10269 Closes #10323 (cherry picked from commit `41edc045d9`)	2022-04-05 12:40:59 +03:00
Benny Halevy	d58ca2edbd	range_tombstone_list: insert_from: correct rev.update range_tombstone in not overlapping case 2nd std::move(start) looks like a typo in `fe2fa3f20d`. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220404124741.1775076-1-bhalevy@scylladb.com> (cherry picked from commit `2d80057617`) Fixes #10326	2022-04-05 12:39:13 +03:00
Alexey Kartashov	75740ace2a	dist/docker: fix incorrect locale value Docker build script contains an incorrect locale specification for LC_ALL setting, this commit fixes that. Fixes #10310 Closes #10321 (cherry picked from commit `d86c3a8061`)	2022-04-04 12:51:02 +03:00
Piotr Sarna	d7a1bf6331	cql3: fix qualifying restrictions with IN for indexing When a query contains IN restriction on its partition key, it's currently not eligible for indexing. It was however erroneously qualified as such, which lead to fetching incorrect results. This commit fixes the issue by not allowing such queries to undergo indexing, and comes with a regression test. Fixes #10300 Closes #10302 (cherry picked from commit `c0fd53a9d7`)	2022-04-03 11:20:49 +03:00
Avi Kivity	bbd7d657cc	Update seastar submodule (pidof command not installed) * seastar 1c0d622ba0...4a30c44c4c (1): > seastar-cpu-map.sh: switch from pidof to pgrep Fixes #10238.	2022-03-29 12:36:06 +03:00
Avi Kivity	f5bf4c81d1	Merge 'replica/database: truncate: temporarily disable compaction on table and views before flush' from Benny Halevy Flushing the base table triggers view building and corresponding compactions on the view tables. Temporarily disable compaction on both the base table and all its view before flush and snapshot since those flushed sstables are about to be truncated anyway right after the snapshot is taken. This should make truncate go faster. In the process, this series also embeds `database::truncate_views` into `truncate` and coroutinizes both Refs #6309 Test: unit(dev) Closes #10203 * github.com:scylladb/scylla: replica/database: truncate: fixup indentation replica/database: truncate: temporarily disable compaction on table and views before flush replica/database: truncate: coroutinize per-view logic replica/database: open-code truncate_view in truncate replica/database: truncate: coroutinize run_with_compaction_disabled lambda replica/database: coroutinize truncate compaction_manager: add disable_compaction method (cherry picked from commit `aab052c0d5`)	2022-03-28 15:40:40 +03:00
Benny Halevy	02e8336659	atomic_cell: compare_atomic_cell_for_merge: compare ttl if expiry is equal Following up on `a57c087c89`, compare_atomic_cell_for_merge should compare the ttl value in the reverse order since, when comparing two cells that are identical in all attributes but their ttl, we want to keep the cell with the smaller ttl value rather than the larger ttl, since it was written at a later (wall-clock) time, and so would remain longer after it expires, until purged after gc_grace seconds. Fixes #10173 Test: mutation_test.test_cell_ordering, unit(dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220302154328.2400717-1-bhalevy@scylladb.com> Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220306091913.106508-1-bhalevy@scylladb.com> (cherry picked from commit `a085ef74ff`)	2022-03-24 18:00:11 +02:00
Benny Halevy	601812e11b	atomic_cell: compare_atomic_cell_for_merge: compare ttl if expiry is equal Unlike atomic_cell_or_collection::equals, compare_atomic_cell_for_merge currently returns std::strong_ordering::equal if two cells are equal in every way except their ttls. The problem with that is that the cells' hashes are different and this will cause repair to keep trying to repair discrepancies caused by the ttl being different. This may be triggered by e.g. the spark migrator that computes the ttl based on the expiry time by subtracting the current time from the expiry time to produce a respective ttl. If the cell is migrated multiple times at different times, it will generate cells that have the same expiry (by design) but different ttl values. Fixes #10156 Test: mutation_test.test_cell_ordering, unit(dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220302154328.2400717-1-bhalevy@scylladb.com> (cherry picked from commit `a57c087c89`)	2022-03-24 18:00:11 +02:00
Benny Halevy	ea466320d2	atomic_cell: compare_atomic_cell_for_merge: fixup indentation Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220302113833.2308533-2-bhalevy@scylladb.com> (cherry picked from commit `d43da5d6dc`)	2022-03-24 18:00:11 +02:00
Benny Halevy	25ea831a15	atomic_cell: compare_atomic_cell_for_merge: simplify expiry/deletion_time comparison No need to first check that the cells' expiry is different or that deletion_time is different before comparing them with `<=>`. If they are the same the function returns std::strong_ordering::equal anyhow and that is the same as `<=>` comparing identical values. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220302113833.2308533-1-bhalevy@scylladb.com> (cherry picked from commit `be865a29b8`)	2022-03-24 18:00:11 +02:00
Benny Halevy	8648c79c9e	main: shutdown: do not abort on certain system errors Currently any unhandled error during deferred shutdown is rethrown in a noexcept context (in ~deferred_action), generating a core dump. The core dump is not helpful if the cause of the error is "environmental", i.e. in the system, rather than in scylla itself. This change detects several such errors and calls _Exit(255) to exit the process early, without leaving a coredump behind. Otherwise, call abort() explicitly, rather than letting terminate() be called implicitly by the destructor exception handling code. Fixes #9573 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220227101054.1294368-1-bhalevy@scylladb.com> (cherry picked from commit `132c9d5933`)	2022-03-24 14:48:52 +02:00
Nadav Har'El	7ae4d0e6f8	Seastar: backport Seastar fix for missing string escape in JSON output Backported Seastar fix: > Merge 'json/formatter: Escape strings' from Juliusz Stasiewicz Fixes #9061 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2022-03-23 20:29:50 +02:00
Piotr Sarna	f3564db941	expression: fix get_value for mismatched column definitions As observed in #10026, after schema changes it somehow happened that a column definition that does not match any of the base table columns was passed to expression verification code. The function that looks up the index of a column happens to return -1 when it doesn't find anything, so using this returned index without checking if it's nonnegative results in accessing invalid vector data, and a segfault or silent memory corruption. Therefore, an explicit check is added to see if the column was actually found. This serves two purposes: - avoiding segfaults/memory corruption - making it easier to investigate the root cause of #10026 Closes #10039 (cherry picked from commit 7b364fec9849e9a342af1c240e3a7185bf5401ef)	2022-03-21 10:37:48 +01:00
Pavel Emelyanov	97caf12836	Update seastar submodule (IO preemption overlap) * seastar 47573503...8ef87d48 (3): > io_queue: Don't let preemption overlap requests > io_queue: Pending needs to keep capacity instead of ticket > io_queue: Extend grab_capacity() return codes Fixes #10233	2022-03-17 11:26:38 +03:00
Yaron Kaikov	839d9ef41a	release: prepare for 5.0.rc2 scylla-5.0.rc2	2022-03-16 14:35:52 +02:00
Benny Halevy	782bd50f92	compaction_manager: rewrite_sstables: do not acquire table write lock Since regular compaction may run in parallel no lock is required per-table. We still acquire a read lock in this patch, for backporting purposes, in case the branch doesn't contain `6737c88045`. But it can be removed entirely in master in a follow-up patch. This should solve some of the slowness in cleanup compaction (and likely in upgrade sstables seen in #10060, and possibly #10166). Fixes #10175 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #10177 (cherry picked from commit `11ea2ffc3c`)	2022-03-14 13:13:48 +02:00
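
Note on the two `compare_atomic_cell_for_merge` commits above (`601812e11b` and `02e8336659`): the TTL tie-break they describe can be illustrated with a small sketch. This is not Scylla's code. The `Cell` fields, the `migrated_cell` helper, and the comparison steps before the TTL check are hypothetical simplifications. Only the last rule is taken from the commit messages: when two cells have the same expiry, the one with the smaller ttl wins, because it was written later.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cell:
    """Hypothetical, simplified stand-in for an expiring cell."""
    timestamp: int  # write timestamp
    expiry: int     # absolute expiry time, in seconds
    ttl: int        # time-to-live at write time, in seconds


def three_way(a: int, b: int) -> int:
    """Return -1, 0 or 1, like C++ `<=>` on integers."""
    return (a > b) - (a < b)


def compare_for_merge(left: Cell, right: Cell) -> int:
    """Positive result means `left` wins the merge (assumed convention)."""
    c = three_way(left.timestamp, right.timestamp)
    if c != 0:
        return c
    c = three_way(left.expiry, right.expiry)
    if c != 0:
        return c
    # Same expiry: compare ttl in reverse, so the smaller ttl wins.
    return three_way(right.ttl, left.ttl)


def migrated_cell(timestamp: int, expiry: int, now: int) -> Cell:
    """Mimics the migrator scenario: ttl is derived as expiry - now."""
    return Cell(timestamp=timestamp, expiry=expiry, ttl=expiry - now)


# The same logical cell migrated at two different wall-clock times.
first = migrated_cell(timestamp=1, expiry=1000, now=100)   # ttl 900
second = migrated_cell(timestamp=1, expiry=1000, now=400)  # ttl 600
winner = max(first, second, key=lambda c: (c.timestamp, c.expiry, -c.ttl))
```

In this sketch the two cells no longer compare as equal, so a merge settles deterministically on `second`, the copy written later, rather than leaving two cells that differ only in ttl.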


30036 Commits