scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-27 03:45:11 +00:00

Author	SHA1	Message	Date
Kamil Braun	784695e3ac	system_keyspace: use system memory for `system.raft` table `system.raft` was using the "user memory pool", i.e. the `dirty_memory_manager` for this table was set to `database::_dirty_memory_manager` (instead of `database::_system_dirty_memory_manager`). This meant that if a write workload caused memory pressure on the user memory pool, internal `system.raft` writes would have to wait for memtables of user tables to get flushed before the write would proceed. This was observed in SCT longevity tests which ran a heavy workload on the cluster and concurrently, schema changes (which underneath use the `system.raft` table). Raft would often get stuck waiting many seconds for user memtables to get flushed. More details in issue #15622. Experiments showed that moving Raft to system memory fixed this particular issue, bringing the waits to reasonable levels. Currently `system.raft` stores only one group, group 0, which is internally used for cluster metadata operations (schema and topology changes) -- so it makes sense to keep use system memory. In the future we'd like to have other groups, for strongly consistent tables. These groups should use the user memory pool. It means we won't be able to use `system.raft` for them -- we'll just have to use a separate table. Fixes: scylladb/scylladb#15622 Closes scylladb/scylladb#15972 (cherry picked from commit `f094e23d84`)	2024-01-25 17:59:49 +01:00
Avi Kivity	351d6d6531	Merge 'Invalidate prepared statements for views when their schema changes.' from Eliran Sinvani When a base table changes and altered, so does the views that might refer to the added column (which includes "SELECT " views and also views that might need to use this column for rows lifetime (virtual columns). However the query processor implementation for views change notification was an empty function. Since views are tables, the query processor needs to at least treat them as such (and maybe in the future, do also some MV specific stuff). This commit adds a call to `on_update_column_family` from within `on_update_view`. The side effect true to this date is that prepared statements for views which changed due to a base table change will be invalidated. Fixes https://github.com/scylladb/scylladb/issues/16392 This series also adds a test which fails without this fix and passes when the fix is applied. Closes scylladb/scylladb#16897 github.com:scylladb/scylladb: Add test for mv prepared statements invalidation on base alter query processor: treat view changes at least as table changes (cherry picked from commit `5810396ba1`)	2024-01-23 21:31:47 +02:00
Takuya ASADA	5a05ccc2f8	scylla_raid_setup: faillback to other paths when UUID not avialable On some environment such as VMware instance, /dev/disk/by-uuid/<UUID> is not available, scylla_raid_setup will fail while mounting volume. To avoid failing to mount /dev/disk/by-uuid/<UUID>, fetch all available paths to mount the disk and fallback to other paths like by-partuuid, by-id, by-path or just using real device path like /dev/md0. To get device path, and also to dumping device status when UUID is not available, this will introduce UdevInfo class which communicate udev using pyudev. Related #11359 Closes scylladb/scylladb#13803 (cherry picked from commit `58d94a54a3`) [syuu: renegerate tools/toolchain/image for new python3-pyudev package] Closes #16938	2024-01-23 16:05:28 +02:00
Botond Dénes	a1603bcb40	readers/multishard: evictable_reader::fast_forward_to(): close reader on exception When the reader is currently paused, it is resumed, fast-forwarded, then paused again. The fast forwarding part can throw and this will lead to destroying the reader without it being closed first. Add a try-catch surrounding this part in the code. Also mark `maybe_pause()` and `do_pause()` as noexcept, to make it clear why that part doesn't need to be in the try-catch. Fixes: #16606 Closes scylladb/scylladb#16630 (cherry picked from commit `204d3284fa`)	2024-01-16 16:57:28 +02:00
Michał Jadwiszczak	29da20b9e0	schema: add scylla specific options to schema description Add `paxos_grace_seconds`, `tombstone_gc`, `cdc` and `synchronous_updates` options to schema description. Fixes: #12389 Fixes: scylladb/scylla-enterprise#2979 Closes #16786	2024-01-16 09:56:08 +02:00
Botond Dénes	7c4ec8cf4b	Update tools/java submodule * tools/java 843096943e...a1eed2f381 (1): > Update JNA dependency to 5.14.0 Fixes: https://github.com/scylladb/scylla-tools-java/issues/371	2024-01-15 15:51:32 +02:00
Aleksandra Martyniuk	5def443cf0	tasks: keep task's children in list If std::vector is resized its iterators and references may get invalidated. While task_manager::task::impl::_children's iterators are avoided throughout the code, references to its elements are being used. Since children vector does not need random access to its elements, change its type to std::list<foreign_task_ptr>, which iterators and references aren't invalidated on element insertion. Fixes: #16380. Closes scylladb/scylladb#16381 (cherry picked from commit `9b9ea1193c`) Closes #16777	2024-01-15 15:38:00 +02:00
Anna Mikhlin	c0604a31fa	release: prepare for 5.2.14 scylla-5.2.14	2024-01-14 16:34:38 +02:00
Pavel Emelyanov	96bb602c62	Update seastar submodule (token bucket duration underflow) * seastar 43a1ce58...29badd99 (1): > shared_token_bucket: Fix duration_for() underflow Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-01-12 15:15:56 +03:00
Botond Dénes	d96440e8b6	Merge '[Backport 5.2] Validate compaction strategy options in prepare' from Aleksandra Martyniuk Table properties validation is performed on statement execution. Thus, when one attempts to create a table with invalid options, an incorrect command gets committed in Raft. But then its application fails, leading to a raft machine being stopped. Check table properties when create and alter statements are prepared. Fixes: https://github.com/scylladb/scylladb/issues/14710. Closes #16750 * github.com:scylladb/scylladb: cql3: statements: delete execute override cql3: statements: call check_restricted_table_properties in prepare cql3: statements: pass data_dictionary::database to check_restricted_table_properties	2024-01-12 10:56:54 +02:00
Aleksandra Martyniuk	ea41a811d6	cql3: statements: delete execute override Delete overriden create_table_statement::execute as it only calls its direct parent's (schema_altering_statement) execute method anyway. (cherry picked from commit `6c7eb7096e`)	2024-01-11 16:43:17 +01:00
Aleksandra Martyniuk	8b77fbc904	cql3: statements: call check_restricted_table_properties in prepare Table properties validation is performed on statement execution. Thus, when one attempts to create a table with invalid options, an incorrect command gets committed in Raft. But then its application fails, leading to a raft machine being stopped. Check table properties when create and alter statements are prepared. The error is no longer returned as an exceptional future, but it is thrown. Adjust the tests accordingly. (cherry picked from commit `60fdc44bce`)	2024-01-11 16:10:26 +01:00
Aleksandra Martyniuk	3ab3a2cc1b	cql3: statements: pass data_dictionary::database to check_restricted_table_properties Pass data_dictionary::database to check_restricted_table_properties as an arguemnt instead of query_processor as the method will be called from a context which does not have access to query processor. (cherry picked from commit `ec98b182c8`)	2024-01-11 16:10:26 +01:00
Botond Dénes	7e9107cc97	Update tools/java submodule * tools/java 79fa02d8a3...843096943e (1): > build.xml: update io.airlift to 0.9 Fixes: scylladb/scylla-tools-java#374	2024-01-11 11:03:29 +02:00
Botond Dénes	abb7ae4309	Update ./tools/jmx submodule * tools/jmx f21550e...50909d6 (1): > scylla-apiclient: drop hk2-locator dependency Fixes: scylladb/scylla-jmx#231	2024-01-10 14:22:14 +02:00
Botond Dénes	2820c63734	Update tools/java submodule * tools/java d7ec9bf45f...79fa02d8a3 (2): > build.xml: update scylla-driver-core to 3.11.5.1 > treewide: update "guava" package Fixes: scylla-tools-java#365 Fixes: scylla-tools-java#343 Closes #16693	2024-01-10 08:19:43 +02:00
Nadav Har'El	ac0056f4bc	Merge 'Fix partition estimation with TWCS tables during streaming' from Raphael "Raph" Carvalho TWCS tables require partition estimation adjustment as incoming streaming data can be segregated into the time windows. Turns out we had two problems in this area that leads to suboptimal bloom filters. 1) With off-strategy enabled, data segregation is postponed, but partition estimation was adjusted as if segregation wasn't postponed. Solved by not adjusting estimation if segregation is postponed. 2) With off-strategy disabled, data segregation is not postponed, but streaming didn't feed any metadata into partition estimation procedure, meaning it had to assume the max windows input data can be segregated into (100). Solved by using schema's default TTL for a precise estimation of window count. For the future, we want to dynamically size filters (see https://github.com/scylladb/scylladb/issues/2024), especially for TWCS that might have SSTables that are left uncompacted until they're fully expired, meaning that the system won't heal itself in a timely manner through compaction on a SSTable that had partition estimation really wrong. Fixes https://github.com/scylladb/scylladb/issues/15704. Closes scylladb/scylladb#15938 * github.com:scylladb/scylladb: streaming: Improve partition estimation with TWCS streaming: Don't adjust partition estimate if segregation is postponed (cherry picked from commit `64d1d5cf62`) Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #16672	2024-01-08 09:06:43 +02:00
Calle Wilund	aaa25e1a78	Commitlog replayer: Range-check skip call Fixes #15269 If segment being replayed is corrupted/truncated we can attempt skipping completely bogues byte amounts, which can cause assert (i.e. crash) in file_data_source_impl. This is not a crash-level error, so ensure we range check the distance in the reader. v2: Add to corrupt_size if trying to skip more than available. The amount added is "wrong", but at least will ensure we log the fact that things are broken Closes scylladb/scylladb#15270 (cherry picked from commit `6ffb482bf3`)	2024-01-05 09:19:45 +02:00
Beni Peled	c57a0a7a46	release: prepare for 5.2.13 scylla-5.2.13	2024-01-03 17:48:59 +02:00
Botond Dénes	740ba3ac2a	tools/schema_loader: read_schema_table_mutation(): close the reader The reader used to read the sstables was not closed. This could sometimes trigger an abort(), because the reader was destroyed, without it being closed first. Why only sometimes? This is due to two factors: * read_mutation_from_flat_mutation_reader() - the method used to extract a mutation from the reader, uses consume(), which does not trigger `set_close_is_required()` (#16520). Due to this, the top-level combined reader did not complain when destroyed without close. * The combined reader closes underlying readers who have no more data for the current range. If the circumstances are just right, all underlying readers are closed, before the combined reader is destoyed. Looks like this is what happens for the most time. This bug was discovered in SCT testing. After fixing #16520, all invokations of `scylla-sstable`, which use this code would trigger the abort, without this patch. So no further testing is required. Fixes: #16519 Closes scylladb/scylladb#16521 (cherry picked from commit `da033343b7`)	2023-12-31 18:13:10 +02:00
Gleb Natapov	76c3dda640	storage_service: register schema version observer before joining group0 and starting gossiper The schema version is updated by group0, so if group0 starts before schema version observer is registered some updates may be missed. Since the observer is used to update node's gossiper state the gossiper may contain wrong schema version. Fix by registering the observer before starting group0 and even before starting gossiper to avoid a theoretical case that something may pull schema after start of gossiping and before the observer is registered. Fixes: #15078 Message-Id: <ZOYZWhEh6Zyb+FaN@scylladb.com> (cherry picked from commit `d1654ccdda`)	2023-12-20 11:14:27 +01:00
Kamil Braun	287546923e	Merge 'db: hints: add checksum to sync_point encoding' from Patryk Jędrzejczak Fixes #9405 `sync_point` API provided with incorrect sync point id might allocate crazy amount of memory and fail with `std::bad_alloc`. To fix this, we can check if the encoded sync point has been modified before decoding. We can achieve this by calculating a checksum before encoding, appending it to the encoded sync point, and compering it with a checksum calculated in `db::hints::decode` before decoding. Closes #14534 * github.com:scylladb/scylladb: db: hints: add checksum to sync point encoding db: hints: add the version_size constant (cherry picked from commit `eb6202ef9c`) The only difference from the original merge commit is the include path of `xx_hasher.hh`. On branch 5.2, this file is in the root directory, not `utils`. Closes #16458	2023-12-19 17:39:50 +02:00
Botond Dénes	c0dab523f9	Update tools/java submodule * tools/java e2aad6e3a0...d7ec9bf45f (1): > Merge "build: take care of old libthrift" from Piotr Grabowski Fixes: scylladb/scylla-tools-java#352 Closes #16464	2023-12-19 17:37:27 +02:00
Michael Huang	5499f7b5a8	cdc: use chunked_vector for topology_description entries Lists can grow very big. Let's use a chunked vector to prevent large contiguous allocations. Fixes: #15302. Closes scylladb/scylladb#15428 (cherry picked from commit `62a8a31be7`)	2023-12-19 13:43:23 +01:00
Piotr Grabowski	7055ac45d1	test: use more frequent reconnection policy The default reconnection policy in Python Driver is an exponential backoff (with jitter) policy, which starts at 1 second reconnection interval and ramps up to 600 seconds. This is a problem in tests (refs #15104), especially in tests that restart or replace nodes. In such a scenario, a node can be unavailable for an extended period of time and the driver will try to reconnect to it multiple times, eventually reaching very long reconnection interval values, exceeding the timeout of a test. Fix the issue by using a exponential reconnection policy with a maximum interval of 4 seconds. A smaller value was not chosen, as each retry clutters the logs with reconnection exception stack trace. Fixes #15104 Closes #15112 (cherry picked from commit `17e3e367ca`)	2023-12-19 13:43:23 +01:00
Gleb Natapov	4ff29d1637	raft: drop assert in server_impl::apply_snapshot for a condition that may happen server_impl::apply_snapshot() assumes that it cannot receive a snapshots from the same host until the previous one is handled and usually this is true since a leader will not send another snapshot until it gets response to a previous one. But it may happens that snapshot sending RPC fails after the snapshot was sent, but before reply is received because of connection disconnect. In this case the leader may send another snapshot and there is no guaranty that the previous one was already handled, so the assumption may break. Drop the assert that verifies the assumption and return an error in this case instead. Fixes: #15222 Message-ID: <ZO9JoEiHg+nIdavS@scylladb.com> (cherry picked from commit `55f047f33f`)	2023-12-19 13:43:23 +01:00
Alexey Novikov	6bcf9e6631	When add duration field to UDT check whether this UDT is used in some clustering key Having values of the duration type is not allowed for clustering columns, because duration can't be ordered. This is correctly validated when creating a table but do not validated when we alter the type. Fixes #12913 Closes scylladb/scylladb#16022 (cherry picked from commit `bd73536b33`)	2023-12-19 06:58:41 -05:00
Takuya ASADA	74dd8f08e3	dist: fix local-fs.target dependency systemd man page says: systemd-fstab-generator(3) automatically adds dependencies of type Before= to all mount units that refer to local mount points for this target unit. So "Before=local-fs.taget" is the correct dependency for local mount points, but we currently specify "After=local-fs.target", it should be fixed. Also replaced "WantedBy=multi-user.target" with "WantedBy=local-fs.target", since .mount are not related with multi-user but depends local filesystems. Fixes #8761 Closes scylladb/scylladb#15647 (cherry picked from commit `a23278308f`)	2023-12-19 13:15:00 +02:00
Botond Dénes	68507ed4d9	Merge '[Backport 5.2] Shard of shard repair task impl' from Aleksandra Martyniuk Shard id is logged twice in repair (once explicitly, once added by logger). Redundant occurrence is deleted. shard_repair_task_impl::id (which contains global repair shard) is renamed to avoid further confusion. Fixes: https://github.com/scylladb/scylladb/issues/12955 Closes #16439 * github.com:scylladb/scylladb: repair: rename shard_repair_task_impl::id repair: delete redundant shard id from logs	2023-12-19 10:28:57 +02:00
Botond Dénes	46a29e9a02	Merge 'alternator: fix isolation of concurrent modifications to tags' from Nadav Har'El Alternator's implementation of TagResource, UntagResource and UpdateTimeToLive (the latter uses tags to store the TTL configuration) was unsafe for concurrent modifications - some of these modifications may be lost. This short series fixes the bug, and also adds (in the last patch) a test that reproduces the bug and verifies that it's fixed. The cause of the incorrect isolation was that we separately read the old tags and wrote the modified tags. In this series we introduce a new function, `modify_tags()` which can do both under one lock, so concurrent tag operations are serialized and therefore isolated as expected. Fixes #6389. Closes #13150 * github.com:scylladb/scylladb: test/alternator: test concurrent TagResource / UntagResource db/tags: drop unsafe update_tags() utility function alternator: isolate concurrent modification to tags db/tags: add safe modify_tags() utility functions migration_manager: expose access to storage_proxy (cherry picked from commit `dba1d36aa6`) Closes #16453	2023-12-19 10:19:31 +02:00
Botond Dénes	23fd6939eb	Merge '[Backport to 5.2] gossiper: mark_alive: use deferred_action to unmark pending' from Benny Halevy Backport the following patches to 5.2: - gossiper: mark_alive: enter background_msg gate (#14791) - gossiper: mark_alive: use deferred_action to unmark pending (#14839) Closes #16452 * github.com:scylladb/scylladb: gossiper: mark_alive: use deferred_action to unmark pending gossiper: mark_alive: enter background_msg gate	2023-12-19 09:06:37 +02:00
Botond Dénes	1cf499cfea	Update tools/java submodule * tools/java 80701efa8d...e2aad6e3a0 (2): > build: update logback dependency > build: update `netty` dependency Fixes: https://github.com/scylladb/scylla-tools-java/issues/363 Fixes: https://github.com/scylladb/scylla-tools-java/issues/364 Closes #16444	2023-12-18 18:19:20 +02:00
Nadav Har'El	91e05dc646	cql: fix SELECT toJson() or SELECT JSON of time column The implementation of "SELECT TOJSON(t)" or "SELECT JSON t" for a column of type "time" forgot to put the time string in quotes. The result was invalid JSON. This is patch is a one-liner fixing this bug. This patch also removes the "xfail" marker from one xfailing test for this issue which now starts to pass. We also add a second test for this issue - the existing test was for "SELECT TOJSON(t)", and the second test shows that "SELECT JSON t" had exactly the same bug - and both are fixed by the same patch. We also had a test translated from Cassandra which exposed this bug, but that test continues to fail because of other bugs, so we just need to update the xfail string. The patch also fixes one C++ test, test/boost/json_cql_query_test.cc, which enshrined the wrong behavior - JSON output that isn't even valid JSON - and had to be fixed. Unlike the Python tests, the C++ test can't be run against Cassandra, and doesn't even run a JSON parser on the output, which explains how it came to enshrine wrong output instead of helping to discover the bug. Fixes #7988 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#16121 (cherry picked from commit `8d040325ab`)	2023-12-18 18:19:20 +02:00
Benny Halevy	a2009c4a8c	gossiper: mark_alive: use deferred_action to unmark pending Make sure _pending_mark_alive_endpoints is unmarked in any case, including exceptions. Fixes #14839 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #14840 (cherry picked from commit `1e7e2eeaee`)	2023-12-18 14:44:22 +02:00
Benny Halevy	999a6bfaae	gossiper: mark_alive: enter background_msg gate The function dispatch a background operation that must be waited on in stop(). \Fixes scylladb/scylladb#14791 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `868e436901`)	2023-12-18 14:42:52 +02:00
Kefu Chai	faef786c88	reloc: strip.sh: always generate symbol list with posix format we compare the symbols lists of stripped ELF file ($orig.stripped) and that of the one including debugging symbols ($orig.debug) to get a an ELF file which includes only the necessary bits as the debuginfo ($orig.minidebug). but we generate the symbol list of stripped ELF file using the sysv format, while generate the one from the unstripped one using posix format. the former is always padded the symbol names with spaces so that their the length at least the same as the section name after we split the fields with "\|". that's why the diff includes the stuff we don't expect. and hence, we have tons of warnings like: ``` objcopy: build/node_exporter/node_exporter.keep_symbols:4910: Ignoring rubbish found on this line ``` when using objcopy to filter the ELF file to keep only the symbols we are interested in. so, in this change * use the same format when dumping the symbols from unstripped ELF file * include the symbols in the text area -- the code, by checking "T" and "t" in the dumped symbols. this was achieved by matching the lines with "FUNC" before this change. * include the the symbols in .init data section -- the global variables which are initialized at compile time. they could be also interesting when debugging an application. Fixes #15513 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#15514 (cherry picked from commit `50c937439b`)	2023-12-18 13:58:14 +02:00
Michał Chojnowski	7e9bdef8bb	row_cache: when the constructor fails, clear `_partitions` in the right allocator If the constructor of row_cache throws, `_partitions` is cleared in the wrong allocator, possibly causing allocator corruption. Fix that. Fixes #15632 Closes scylladb/scylladb#15633 (cherry picked from commit `330d221deb`)	2023-12-18 13:55:16 +02:00
Michael Huang	af38b255c8	cql3: Fix invalid JSON parsing for JSON objects with ASCII keys For JSON objects represented as map<ascii, int>, don't treat ASCII keys as a nested JSON string. We were doing that prior to the patch, which led to parsing errors. Included the error offset where JSON parsing failed for rjson::parse related functions to help identify parsing errors better. Fixes: #7949 Signed-off-by: Michael Huang <michaelhly@gmail.com> Closes scylladb/scylladb#15499 (cherry picked from commit `75109e9519`)	2023-12-18 13:45:57 +02:00
Kefu Chai	c4b699525a	sstables: throw at seeing invalid chunk_len before this change, when running into a zero chunk_len, scylla crashes with `assert(chunk_size != 0)`. but we can do better than printing a backtrace like: ``` scylla: sstables/compress.cc:158: void sstables::compression::segmented_offsets::init(uint32_t): Assertion `chunk_size != 0' failed. ``` so, in this change, a `malformed_sstable_exception` is throw in place of an `assert()`, which is supposed to verify the programming invariants, not for identifying corrupted data file. Fixes #15265 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #15264 (cherry picked from commit `1ed894170c`)	2023-12-18 13:29:02 +02:00
Nadav Har'El	3a24b8c435	sstables: stop warning when auto-snapshot leaves non-empty directory When a table is dropped, we delete its sstables, and finally try to delete the table's top-level directory with the rmdir system call. When the auto-snapshot feature is enabled (this is still Scylla's default), the snapshot will remain in that directory so it won't be empty and will cannot be removed. Today, this results in a long, ugly and scary warning in the log: ``` WARN 2023-07-06 20:48:04,995 [shard 0] sstable - Could not remove table directory "/tmp/scylla-test-198265/data/alternator_alternator_Test_1688665684546/alternator_Test_1688665684546-4238f2201c2511eeb15859c589d9be4d/snapshots": std::filesystem::__cxx11::filesystem_error (error system:39, filesystem error: remove failed: Directory not empty [/tmp/scylla-test-198265/data/alternator_alternator_Test_1688665684546/alternator_Test_1688665684546-4238f2201c2511eeb15859c589d9be4d/snapshots]). Ignored. ``` It is bad to log as a warning something which is completely normal - it happens every time a table is dropped with the perfectly valid (and even default) auto-snapshot mode. We should only log a warning if the deletion failed because of some unexpected reason. And in fact, this is exactly what the code tried to do - it does not log a warning if the rmdir failed with EEXIST. It even had a comment saying why it was doing this. But the problem is that in Linux, deleting a non-empty directory does not return EEXIST, it returns ENOTEMPTY... Posix actually allows both. So we need to check both, and this is the only change in this patch. To confirm this that this patch works, edit test/cql-pytest/run.py and change auto-snapshot from 0 to 1, run test/alternator/run (for example) and see many "Directory not empty" warnings as above. With this patch, none of these warnings appear. Fixes #13538 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #14557 (cherry picked from commit `edfb89ef65`)	2023-12-18 13:26:40 +02:00
Kefu Chai	9e9a488da3	streaming: cast the progress to a float before formatting it before this change, we format a `long` using `{:f}`. fmtlib would throw an exception when actually formatting it. so, let's make the percentage a float before formatting it. Fixes #14587 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #14588 (cherry picked from commit `1eb76d93b7`)	2023-12-18 13:16:58 +02:00
Aleksandra Martyniuk	614d15b9f6	repair: rename shard_repair_task_impl::id shard_repair_task_impl::id stores global repair id. To avoid confusion with the task id, the field is renamed to global_repair_id. (cherry picked from commit `d889a599e8`)	2023-12-18 12:08:00 +01:00
Aleksandra Martyniuk	fc2799096f	repair: delete redundant shard id from logs In repair shard id is logged twice. Delete repeated occurence. (cherry picked from commit `f7c88edec5`)	2023-12-18 12:03:26 +01:00
Petr Gusev	b9178bd853	hints: send_one_hint: extend the scope of file_send_gate holder The problem was that the holder in with_gate call was released too early. This happened before the possible call to on_hint_send_failure in then_wrapped. As a result, the effects of on_hint_send_failure (segment_replay_failed flag) were not visible in send_one_file after ctx_ptr->file_send_gate.close(), so we could decide that the segment was sent in full and delete it even if sending of some hints led to errors. Fixes #15110 (cherry picked from commit `9fd3df13a2`)	2023-12-18 13:03:23 +02:00
Kefu Chai	12aacea997	compound_compat: do not format an sstring with {:d} before this change, we format a sstring with "{:d}", fmtlib would throw `fmt::format_error` at runtime when formatting it. this is not expected. so, in this change, we just print the int8_t using `seastar::format()` in a single pass. and with the format specifier of `#02x` instead of adding the "0x" prefix manually. Fixes #14577 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #14578 (cherry picked from commit `27d6ff36df`)	2023-12-18 12:47:16 +02:00
Kefu Chai	df30f66bfa	tools/scylla-sstable: dump column_desc as an object before this change, `scylla sstable dump-statistics` prints the "regular_columns" as a list of strings, like: ``` "regular_columns": [ "name", "clustering_order", "type_name", "org.apache.cassandra.db.marshal.UTF8Type", "name", "column_name_bytes", "type_name", "org.apache.cassandra.db.marshal.BytesType", "name", "kind", "type_name", "org.apache.cassandra.db.marshal.UTF8Type", "name", "position", "type_name", "org.apache.cassandra.db.marshal.Int32Type", "name", "type", "type_name", "org.apache.cassandra.db.marshal.UTF8Type" ] ``` but according https://opensource.docs.scylladb.com/stable/operating-scylla/admin-tools/scylla-sstable.html#dump-statistics, > $SERIALIZATION_HEADER_METADATA := { > "min_timestamp_base": Uint64, > "min_local_deletion_time_base": Uint64, > "min_ttl_base": Uint64", > "pk_type_name": String, > "clustering_key_types_names": [String, ...], > "static_columns": [$COLUMN_DESC, ...], > "regular_columns": [$COLUMN_DESC, ...], > } > > $COLUMN_DESC := { > "name": String, > "type_name": String > } "regular_columns" is supposed to be a list of "$COLUMN_DESC". the same applies to "static_columnes". this schema makes sense, as each column should be considered as a single object which is composed of two properties. but we dump them like a list. so, in this change, we guard each visit() call of `json_dumper()` with `StartObject()` and `EndObject()` pair, so that each column is printed as an object. after the change, "regular_columns" are printed like: ``` "regular_columns": [ { "name": "clustering_order", "type_name": "org.apache.cassandra.db.marshal.UTF8Type" }, { "name": "column_name_bytes", "type_name": "org.apache.cassandra.db.marshal.BytesType" }, { "name": "kind", "type_name": "org.apache.cassandra.db.marshal.UTF8Type" }, { "name": "position", "type_name": "org.apache.cassandra.db.marshal.Int32Type" }, { "name": "type", "type_name": "org.apache.cassandra.db.marshal.UTF8Type" } ] ``` Fixes #15036 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #15037 (cherry picked from commit `c82f1d2f57`)	2023-12-18 12:26:36 +02:00
Michał Sala	2427bda737	forward_service: introduce shutdown checks This commit introduces a new boolean flag, `shutdown`, to the forward_service, along with a corresponding shutdown method. It also adds checks throughout the forward_service to verify the value of the shutdown flag before retrying or invoking functions that might use the messaging service under the hood. The flag is set before messaging service shutdown, by invoking forward_service::shutdown in main. By checking the flag before each call that potentially involves the messaging service, we can ensure that the messaging service is still operational. If the flag is false, indicating that the messaging service is still active, we can proceed with the call. In the event that the messaging service is shutdown during the call, appropriate exceptions should be thrown somewhere down in called functions, avoiding potential hangs. This fix should resolve the issue where forward_service retries could block the shutdown. Fixes #12604 Closes #13922 (cherry picked from commit `e0855b1de2`)	2023-12-18 12:25:25 +02:00
Petr Gusev	27adf340ef	storage_proxy: mutation:: make frozen_mutation [[ref]] We had a redundant copy in receive_mutation_handler forward_fn callback. This frozen_mutation is dynamically allocated and can be arbitrary large. Fixes: #12504 (cherry picked from commit `5adbb6cde2`)	2023-12-18 12:20:40 +02:00
Botond Dénes	5c33c9d6a6	Merge 'thrift: return address in listen_addresses() only after server is ready' from Marcin Maliszkiewicz This is used for readiness API: /storage_service/rpc_server and the fix prevents from returning 'true' prematurely. Some improvement for readiness was added in `a51529dd15` but thrift implementation wasn't fully done. Fixes https://github.com/scylladb/scylladb/issues/12376 Closes #13319 * github.com:scylladb/scylladb: thrift: return address in listen_addresses() only after server is ready thrift: simplify do_start_server() with seastar:async (cherry picked from commit `9a024f72c4`)	2023-12-18 12:20:40 +02:00
Kamil Braun	9aaaa66981	Merge 'cql3: fix a few misformatted printouts of column names in error messages' from Nadav Har'El Fix a few cases where instead of printing column names in error messages, we printed weird stuff like ASCII codes or the address of the name. Fixes #13657 Closes #13658 * github.com:scylladb/scylladb: cql3: fix printing of column_specification::name in some error messages cql3: fix printing of column_definition::name in some error messages (cherry picked from commit `a29b8cd02b`)	2023-12-18 09:55:37 +02:00

1 2 3 4 5 ...

35009 Commits