scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-31 03:56:42 +00:00

Author	SHA1	Message	Date
Jenkins Promoter	f126ccb2e9	Update ScyllaDB version to: 5.4.2	2024-01-04 23:02:23 +02:00
Benny Halevy	d8586fd101	compaction_manager: perform_cleanup: ignore condition_variable_timed_out The polling loop was intended to ignore `condition_variable_timed_out` and check for progress using a longer `max_idle_duration` timeout in the loop. Fixes #15669 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes scylladb/scylladb#15671 (cherry picked from commit `68a7bbe582`)	2024-01-04 12:16:56 +02:00
Pavel Emelyanov	a228d09017	Merge ' tools/utils: tool_app_template: handle the case of no args ' from Botond Dénes Currently, `tool_app_template::run_async()` crashes when invoked with empty argv (with just `argv[0]` populated). This can happen if the tool app is invoked without any further args, e.g. just invoking `scylla nodetool`. The crash happens because unconditional dereferencing of `argv[1]` to get the current operation. To fix, add an early-exit for this case, just printing a usage message and exiting with exit code 2. Fixes: #16451 Closes scylladb/scylladb#16456 * github.com:scylladb/scylladb: test: add regression tests for invoking tools with no args tools/utils: tool_app_template: handle the case of no args tools/utils: tool_app_template: remove "scylla-" prefix from app name (cherry picked from commit `5866d265c3`)	2024-01-04 10:49:26 +02:00
Botond Dénes	c5f2095f6e	tools/schema_loader: read_schema_table_mutation(): close the reader The reader used to read the sstables was not closed. This could sometimes trigger an abort(), because the reader was destroyed, without it being closed first. Why only sometimes? This is due to two factors: * read_mutation_from_flat_mutation_reader() - the method used to extract a mutation from the reader, uses consume(), which does not trigger `set_close_is_required()` (#16520). Due to this, the top-level combined reader did not complain when destroyed without close. * The combined reader closes underlying readers who have no more data for the current range. If the circumstances are just right, all underlying readers are closed, before the combined reader is destoyed. Looks like this is what happens for the most time. This bug was discovered in SCT testing. After fixing #16520, all invokations of `scylla-sstable`, which use this code would trigger the abort, without this patch. So no further testing is required. Fixes: #16519 Closes scylladb/scylladb#16521 (cherry picked from commit `da033343b7`)	2023-12-31 18:12:33 +02:00
Nadav Har'El	3d22f42cf9	Merge 'select statement: verify EXECUTE permissions only for non native functions' from Eliran Sinvani Commit `62458b8e4f` introduced the enforcement of EXECUTE permissions of functions in cql select. However, according to the reference in #12869, the permissions should be enforced only on UDFs and UDAs. The code does not distinguish between the two so the permissions are also unintenionally enforced also on native function. This commit introduce the distinction and only enforces the permissions on non native functions. Fixes #16526 Manually verified (before and after change) with the reproducer supplied in #16526 and also with some the `min` and `max` native functions. Also added test that checks for regression on native functions execution and verified that it fails on authorization before the fix and passes after the fix. Closes scylladb/scylladb#16556 * github.com:scylladb/scylladb: test.py: Add test for native functions permissions select statement: verify EXECUTE permissions only for non native functions (cherry picked from commit `fc71c34597`) scylla-5.4.1	2023-12-27 14:30:52 +02:00
Avi Kivity	8ca5794756	Merge 'cql: fix regression in SELECT * GROUP BY' from Nadav Har'El This short series fixes a regression from Scylla 5.2 to Scylla 5.4 in "SELECT * GROUP BY" - this query was supposed to return just a single row from each partition (the first one in clustering order), but after the expression rewrite started to wrongly return all rows. The series also includes a regression test that verifies that this query works doesn't work correctly before this series, but works with this patch - and also works as expected in Scylla 5.2 and in Cassadra. Fixes #16531. Closes scylladb/scylladb#16559 * github.com:scylladb/scylladb: test/cql-pytest: check that most aggregators don't take "" cql-pytest: add reproducer for GROUP BY regression cql: fix regression in SELECT GROUP BY (cherry picked from commit `3968fc11bf`)	2023-12-26 10:47:06 +02:00
Anna Stuchlik	abeeefb427	doc: add Raft verification to 5.4 upgrade This commit adds the Raft verification step to the 5.2-to-5.4 upgrade guide. It is V2 of https://github.com/scylladb/scylladb/pull/16347. Closes scylladb/scylladb#16481	2023-12-20 11:43:01 +01:00
Botond Dénes	9c482ff262	Update tools/java submodule * tools/java 9387ac10...fcfe7b7c (1): > Merge "build: take care of old libthrift" from Piotr Grabowski Fixes: scylladb/scylla-tools-java#352 Closes scylladb/scylladb#16463	2023-12-19 17:38:11 +02:00
Takuya ASADA	bfc98d1909	dist: fix local-fs.target dependency systemd man page says: systemd-fstab-generator(3) automatically adds dependencies of type Before= to all mount units that refer to local mount points for this target unit. So "Before=local-fs.taget" is the correct dependency for local mount points, but we currently specify "After=local-fs.target", it should be fixed. Also replaced "WantedBy=multi-user.target" with "WantedBy=local-fs.target", since .mount are not related with multi-user but depends local filesystems. Fixes #8761 Closes scylladb/scylladb#15647 (cherry picked from commit `a23278308f`)	2023-12-19 13:14:22 +02:00
Botond Dénes	2cef52aeaa	Update tools/java submodule * tools/java f9cce789...9387ac10 (2): > build: update logback dependency > build: update `netty` dependency Fixes: https://github.com/scylladb/scylla-tools-java/issues/363 Fixes: https://github.com/scylladb/scylla-tools-java/issues/364 Closes scylladb/scylladb#16442	2023-12-18 17:13:05 +02:00
Alexey Novikov	a55561fc64	When add duration field to UDT check whether this UDT is used in some clustering key Having values of the duration type is not allowed for clustering columns, because duration can't be ordered. This is correctly validated when creating a table but do not validated when we alter the type. Fixes #12913 Closes scylladb/scylladb#16022 (cherry picked from commit `bd73536b33`)	2023-12-18 14:22:25 +02:00
Raphael S. Carvalho	7288bdfe09	sstables: Fix update of tombstone GC settings to have immediate effect After "repair: Get rid of the gc_grace_seconds", the sstable's schema (mode, gc period if applicable, etc) is used to estimate the amount of droppable data (or determine full expiration = max_deletion_time < gc_before). It could happen that the user switched from timeout to repair mode, but sstables will still use the old mode, despite the user asked for a new one. Another example is when you play with value of grace period, to prevent data resurrection if repair won't be able to run in a timely manner. The problem persists until all sstables using old GC settings are recompacted or node is restarted. To fix this, we have to feed latest schema into sstable procedures used for expiration purposes. Fixes #15643. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes scylladb/scylladb#15746 (cherry picked from commit `fded314e46`)	2023-12-18 14:14:02 +02:00
Eliran Sinvani	ac7ed6857a	use_statement: Covert an exception to a future exception The use statement execution code can throw if the keyspace is doesn't exist, this can be a problem for code that will use execute in a fiber since the exception will break the fiber even if `then_wrapped` is used. Fixes #14449 Signed-off-by: Eliran Sinvani <eliransin@scylladb.com> Closes scylladb/scylladb#14394 (cherry picked from commit `c5956957f3`)	2023-12-15 13:03:05 +02:00
Nadav Har'El	bc8ff68cf6	cql: fix SELECT toJson() or SELECT JSON of time column The implementation of "SELECT TOJSON(t)" or "SELECT JSON t" for a column of type "time" forgot to put the time string in quotes. The result was invalid JSON. This is patch is a one-liner fixing this bug. This patch also removes the "xfail" marker from one xfailing test for this issue which now starts to pass. We also add a second test for this issue - the existing test was for "SELECT TOJSON(t)", and the second test shows that "SELECT JSON t" had exactly the same bug - and both are fixed by the same patch. We also had a test translated from Cassandra which exposed this bug, but that test continues to fail because of other bugs, so we just need to update the xfail string. The patch also fixes one C++ test, test/boost/json_cql_query_test.cc, which enshrined the wrong behavior - JSON output that isn't even valid JSON - and had to be fixed. Unlike the Python tests, the C++ test can't be run against Cassandra, and doesn't even run a JSON parser on the output, which explains how it came to enshrine wrong output instead of helping to discover the bug. Fixes #7988 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#16121 (cherry picked from commit `8d040325ab`)	2023-12-15 11:41:47 +02:00
Israel Fruchter	0974ef893e	docker: put cqlsh configuration in correct place since always we were putting cqlsh configuration into `~/.cqlshrc` acording to commit from 8 years ago [1], this path is deprecated. until this commit [2], actully remove this path from cqlsh code as part of moving to scylla-cqlsh, we got [2], and didn't notice until the first release with it. this change write the configuration into `~/.casssndra/cqlshrc` as this is the default place cqlsh is looking. [1]: `13ea8a6669/bin/cqlsh.py (L264)` [2]: `2024ea4796` Fixes: scylladb/scylladb#16329 Closes scylladb/scylladb#16340 (cherry picked from commit `514ef48d75`)	2023-12-14 14:15:40 +02:00
Botond Dénes	9fc4c265a5	Merge 'mutation_query: properly send range tombstones in reverse queries' from Michał Chojnowski reconcilable_result_builder passes range tombstone changes to _rt_assembler using table schema, not query schema. This means that a tombstone with bounds (a; b), where a < b in query schema but a > b in table schema, will not be emitted from mutation_query. This is a very serious bug, because it means that such tombstones in reverse queries are not reconciled with data from other replicas. If any queried replica has a row, but not the range tombstone which deleted the row, the reconciled result will contain the deleted row. In particular, range deletes performed while a replica is down will not later be visible to reverse queries which select this replica, regardless of the consistency level. As far as I can see, this doesn't result in any persistent data loss. Only in that some data might appear resurrected to reverse queries, until the relevant range tombstone is fully repaired. This series fixes the bug and adds a minimal reproducer test. Fixes #10598 Closes scylladb/scylladb#16003 * github.com:scylladb/scylladb: mutation_query_test: test that range tombstones are sent in reverse queries mutation_query: properly send range tombstones in reverse queries (cherry picked from commit `65e42e4166`)	2023-12-14 12:52:51 +02:00
Botond Dénes	0518e47daf	Update tools/java submodule * tools/java 3764ae94...f9cce789 (1): > Merge "print more informative error when fail to parse sstable generation" from Kefu Chai Fixes: scylladb/scylla-tools-java#360	2023-12-12 09:49:35 +02:00
Yaron Kaikov	1e8eb6172a	build_docker.sh: Upgrade package during creation and remove sshd service When scanning our latest docker image using `trivy` (command: `trivy image docker.io/scylladb/scylla-nightly:latest`), it shows we have OS packages which are out of date. Also removing `openssh-server` and `openssh-client` since we don't use it for our docker images Fixes: https://github.com/scylladb/scylladb/issues/16222 Closes scylladb/scylladb#16224 (cherry picked from commit `7ce6962141`) Closes scylladb/scylladb#16359	2023-12-11 10:56:46 +02:00
Paweł Zakrzewski	14814c972e	auth: fix error message when consistency level is not met Propagate `exceptions::unavailable_exception` error message to the client such as cqlsh. Fixes #2339 (cherry picked from commit `400aa2e932`)	2023-12-07 14:49:20 +02:00
Jenkins Promoter	7a67db594a	Update ScyllaDB version to: 5.4.1	2023-12-06 16:58:35 +02:00
Anna Stuchlik	5434fcb5a8	doc: replace the OSS-only link on the Raft page This commit replaces the link to the OSS-only page (the 5.2-to-5.4 upgrade guide not present in the Enterprise docs) on the Raft page. While providing the link to the specific upgrade guide is more user-friendly, it causes build failures of the Enterprise documentation. I've replaced it with the link to the general Upgrade section. The ".. only:: opensource" directive used to wrap the OSS-only content correctly excludes the content form the Enterprise docs - but it doesn't prevent build warnings. This commit must be backported to branch-5.4 to prevent errors in all versions. Closes scylladb/scylladb#16176 (cherry picked from commit `24d5dbd66f`)	2023-12-06 13:19:03 +02:00
Nadav Har'El	b4ef2248cc	Backport fixes for nodetool commands with Alternator GSI in the database Fixes #16153 * jmx 166599f...f45067f (3): > ColumnFamilyStore: only quote table names if necessary > APIBuilder: allow quoted scope names > ColumnFamilyStore: don't fail if there is a table with ":" in its name * java dfbf3726ee...3764ae94db (1): > NodeProbe: allow addressing table name with colon in it Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#16294	2023-12-06 10:46:54 +02:00
Anna Stuchlik	21996e12ae	doc: enabling experimental Raft-managed topology This commit adds a short paragraph to the Raft page to explain how to enable consistent topology updates with Raft - an experimental feature in version 5.4. The paragraph should satisfy the requirements for version 5.4. The Raft page will be rewritten in the next release when consistent topology changes with Raft will be GA. Fixes https://github.com/scylladb/scylladb/issues/15080 Requires backport to branch-5.4. Closes scylladb/scylladb#16273 (cherry picked from commit `409e20e5ab`)	2023-12-06 08:57:46 +02:00
Anna Stuchlik	df7b96a092	doc: add metric upgrade info to the 5.4 upgrade This commit adds the information about metrics update to the 5.2-to-5.4 upgrade guide. Fixes https://github.com/scylladb/scylladb/issues/15966 Closes scylladb/scylladb#16161 (cherry picked from commit `97244eb68e`)	2023-12-05 15:17:28 +02:00
Anna Stuchlik	5df85094d9	doc: fix rollback in the 4.6-to-5.0 upgrade guide This commit fixes the rollback procedure in the 4.6-to-5.0 upgrade guide: - The "Restore system tables" step is removed. - The "Restore the configuration file" command is fixed. - The "Gracefully shutdown ScyllaDB" command is fixed. In addition, there are the following updates to be in sync with the tests: - The "Backup the configuration file" step is extended to include a command to backup the packages. - The Rollback procedure is extended to restore the backup packages. - The Reinstallation section is fixed for RHEL. Refs https://github.com/scylladb/scylladb/issues/11907 This commit must be backported to branch-5.4, branch-5.2, and branch-5.1 Closes scylladb/scylladb#16155 (cherry picked from commit `1e80bdb440`)	2023-12-05 15:09:59 +02:00
Anna Stuchlik	a0ca8900e1	doc: fix rollback in the 5.0-to-5.1 upgrade guide This commit fixes the rollback procedure in the 5.0-to-5.1 upgrade guide: - The "Restore system tables" step is removed. - The "Restore the configuration file" command is fixed. - The "Gracefully shutdown ScyllaDB" command is fixed. In addition, there are the following updates to be in sync with the tests: - The "Backup the configuration file" step is extended to include a command to backup the packages. - The Rollback procedure is extended to restore the backup packages. - The Reinstallation section is fixed for RHEL. Also, I've the section removed the rollback section for images, as it's not correct or relevant. Refs https://github.com/scylladb/scylladb/issues/11907 This commit must be backported to branch-5.4, branch-5.2, and branch-5.1 Closes scylladb/scylladb#16154 (cherry picked from commit `7ad0b92559`)	2023-12-05 15:07:58 +02:00
Anna Stuchlik	98bd287177	doc: fix rollback in the 5.1-to-5.2 upgrade guide This commit fixes the rollback procedure in the 5.1-to-5.2 upgrade guide: - The "Restore system tables" step is removed. - The "Restore the configuration file" command is fixed. - The "Gracefully shutdown ScyllaDB" command is fixed. In addition, there are the following updates to be in sync with the tests: - The "Backup the configuration file" step is extended to include a command to backup the packages. - The Rollback procedure is extended to restore the backup packages. - The Reinstallation section is fixed for RHEL. Also, I've the section removed the rollback section for images, as it's not correct or relevant. Refs https://github.com/scylladb/scylladb/issues/11907 This commit must be backported to branch-5.4 and branch-5.2. Closes scylladb/scylladb#16152 (cherry picked from commit `91cddb606f`)	2023-12-05 14:57:24 +02:00
Anna Stuchlik	c4a249022f	doc: fix rollback for RHEL (install) in 5.4 This commit fixes the installation command in the Rollback section for RHEL/Centos in the 5.2-5.4 upgrade guide. It's a follow-up to https://github.com/scylladb/scylladb/pull/16114 where the command was not updated. Refs https://github.com/scylladb/scylladb/issues/11907 This commit must be backported to branch-5.4. Closes scylladb/scylladb#16156 (cherry picked from commit `52c2698978`)	2023-12-05 14:56:20 +02:00
Jenkins Promoter	58a89e7a42	Update ScyllaDB version to: 5.4.0 scylla-5.4.0	2023-12-04 15:00:40 +02:00
Botond Dénes	1a0424db01	Update ./tools/jmx submodule * ./tools/jmx 9a03d4fa...166599f0 (1): > StorageService: Normalize endpoint inetaddress strings to java form Fixes: scylladb/scylladb#16039	2023-12-04 12:53:13 +02:00
Botond Dénes	6d7919041b	Merge 'row_cache: abort on exteral_updater::execute errors' from Benny Halevy Currently the cache updaters aren't exception safe yet they are intended to be. Instead of allowing exceptions from `external_updater::execute` escape `row_cache::update`, abort using `on_fatal_internal_error`. Future changes should harden all `execute` implementations to effectively make them `noexcept`, then the pure virtual definition can be made `noexcept` to cement that. \Fixes scylladb/scylladb#15576 \Closes scylladb/scylladb#15577 * github.com:scylladb/scylladb: row_cache: abort on exteral_updater::execute errors row_cache: do_update: simplify _prev_snapshot_pos setup (cherry picked from commit `4a0f16474f`) Closes scylladb/scylladb#16256	2023-12-03 20:52:03 +02:00
Jenkins Promoter	17c15f6222	Update ScyllaDB version to: 5.4.0-rc4	2023-12-03 20:28:53 +02:00
Michał Chojnowski	91d1c9153b	position_in_partition: make operator= exception-safe The copy assignment operator of _ck can throw after _type and _bound_weight have already been changed. This leaves position_in_partition in an inconsistent state, potentially leading to various weird symptoms. The problem was witnessed by test_exception_safety_of_reads. Specifically: in cache_flat_mutation_reader::add_to_buffer, which requires the assignment to _lower_bound to be exception-safe. The easy fix is to perform the only potentially-throwing step first. Fixes #15822 Closes scylladb/scylladb#15864 (cherry picked from commit `93ea3d41d8`)	2023-11-30 15:00:39 +02:00
Avi Kivity	95364e2454	Update seastar submodule (spins on epoll) * seastar bab1625cf3...95a38bb0c6 (1): > epoll: Avoid spinning on aborted connections Fixes #12774 Fixes #7753 Fixes #13337	2023-11-30 14:07:17 +02:00
Avi Kivity	6d779f58a9	Update seastar submodule to scylla-seastar.git This lets us backport seastar patches into branch-5.4.	2023-11-30 14:07:17 +02:00
Jenkins Promoter	b956646ba2	Update ScyllaDB version to: 5.4.0-rc3 scylla-5.4.0-rc3	2023-11-29 14:29:54 +02:00
Anna Stuchlik	62b93018ac	doc: add experimental support for object storage This commit adds information on how to enable object storage for a keyspace. The "Keyspace storage options" section already existed in the doc, but it was not valid as the support was only added in version 5.4 The scope of this commit: - Update the "Keyspace storage options" section. - Add the information about object storage support to the Data Definition> CREATE KEYSPACE section * Marked as "Experimental". * Excluded from the Enterprise docs with the .. only:: opensource directive. This commit must be backported to branch-5.4, as support for object storage was added in version 5.4. Closes scylladb/scylladb#16081 (cherry picked from commit `bfe19c0ed2`)	2023-11-29 08:39:12 +02:00
Piotr Grabowski	b0410c9391	install-dependencies.sh: update node_exporter to 1.7.0 Update node_exporter to 1.7.0. The previous version (1.6.1) was flagged by security scanners (such as Trivy) with HIGH-severity CVE-2023-39325. 1.7.0 release fixed that problem. [Botond: regenerate frozen toolchain] Fixes #16085 Closes scylladb/scylladb#16086 Closes scylladb/scylladb#16090 (cherry picked from commit `321459ec51`) [avi: regenerate frozen toolchain]	2023-11-27 16:48:30 +00:00
Botond Dénes	6f073dfa54	Update ./tools/jmx and ./tools/java submodules * ./tools/jmx 8d15342e...9a03d4fa (1): > Merge "scylla-apiclient: update several Java dependencies" from Piotr Grabowski * ./tools/java 3c09ab97...dfbf3726 (1): > Merge 'build: update several dependencies' from Piotr Grabowski Update build dependencies which were flagged by security scanners. Refs: scylladb/scylla-jmx#220 Refs: scylladb/scylla-tools-java#351 Closes scylladb/scylladb#16149	2023-11-23 18:34:24 +02:00
Anna Stuchlik	a24b53e6bb	doc: fix rollback in the 5.2-to-5.4 upgrade guide This commit fixes the rollback procedure in the 5.2-to-5.4 upgrade guide: - The "Restore system tables" step is removed. - The "Restore the configuration file" command is fixed. - The "Gracefully shutdown ScyllaDB" command is fixed. In addition, there are the following updates to be in sync with the tests: - The "Backup the configuration file" step is extended to include a command to backup the packages. - The Rollback procedure is extended to restore the backup packages. - The Reinstallation section is fixed for RHEL. Also, I've removed the optional step to enable consistent schema management from the list of steps - the appropriate section has already been removed, but it remained in the procedure description, which was misleading. Refs https://github.com/scylladb/scylladb/issues/11907 This commit must be backported to branch-5.4 Closes scylladb/scylladb#16114 (cherry picked from commit `3751acce42`)	2023-11-23 10:10:20 +02:00
Anna Mikhlin	219adcea71	release: prepare for 5.4.0-rc2 scylla-5.4.0-rc2	2023-11-21 13:48:38 +00:00
Nadav Har'El	6d01d01deb	Merge 'Materialize_views: don't construct `global_schema_ptr` from views schemas that lacks base information' from Eliran Sinvani This miniset addresses two potential conversions to `global_schema_ptr` of incomplete materialized views schemas. One of them was completely unnecessary and also is a "chicken and an egg" problem where on the sync schema procedure itself a view schema was converted to `global_schema_ptr` solely for the purposes of logging. This can create a "hickup" in the materialized views updates if they are comming from a node with a different mv schema. The reason why sometimes a synced schema can have no base info is because of deactivision and reactivision of the schema inside the `schema_registry` which doesn't restore the base information due to lack of context. When a schema is synced the problem becomes easy since we can just use the latest base information from the database. Fixes #14011 Closes scylladb/scylladb#14861 * github.com:scylladb/scylladb: migration manager: fix incomplete mv schemas returned from get_schema_for_write migration_manager: do not globalize potentially incomplete schema (cherry picked from commit `5752dc875b`)	2023-11-21 09:53:24 +02:00
Botond Dénes	b1f54efc2d	gms,service: add a feature to protect the usage of allow_mutation_read_page_without_live_row allow_mutation_read_page_without_live_row is a new option in the partition_slice::option option set. In a mixed clusters, old nodes possibly don't know this new option, so its usage must be protected by a cluster feature. This patch does just that. Fixes: #15795 Closes scylladb/scylladb#15890 (cherry picked from commit `f53961248d`)	2023-11-21 09:49:04 +02:00
Botond Dénes	bc1202aab2	api/storage_service: start/stop native transport in the statement sg Currently, it is started/stopped in the streaming/maintenance sg, which is what the API itself runs in. Starting the native transport in the streaming sg, will lead to severely degraded performance, as the streaming sg has significantly less CPU/disk shares and reader concurrency semaphore resources. Furthermore, it will lead to multi-paged reads possibly switching between scheduling groups mid-way, triggering an internal error. To fix, use `with_scheduling_group()` for both starting and stopping native transport. Technically, it is only strictly necessary for starting, but I added it for stop as well for consistency. Also apply the same treatment to RPC (Thrift). Although no one uses it, best to fix it, just to be on the safe side. I think we need a more systematic approach for solving this once and for all, like passing the scheduling group to the protocol server and have it switch to it internally. This allows the server to always run on the correct scheduling group, not depending on the caller to remember using it. However, I think this is best done in a follow-up, to keep this critical patch small and easily backportable. Fixes: #15485 Closes scylladb/scylladb#16019 (cherry picked from commit `dfd7981fa7`)	2023-11-20 19:47:49 +02:00
Takuya ASADA	2cb709461c	scylla_post_install.sh: detect RHEL correctly $ID_LIKE = "rhel" works only on RHEL compatible OSes, not for RHEL itself. To detect RHEL correctly, we also need to check $ID = "rhel". Fixes #16040 Closes scylladb/scylladb#16041 (cherry picked from commit `338a9492c9`)	2023-11-20 19:36:00 +02:00
Tomasz Grabiec	44c72f6e56	Merge 'Multishard mutation query test fix misses expectations' from Botond Dénes There are two tests, test_read_all and test_read_with_partition_row_limits, which asserts on every page as well as at the end that there are no misses whatsoever. This is incorrect, because it is possible that on a given page, not all shards participate and thus there won't be a saved reader on every shard. On the subsequent page, a shard without a reader may produce a miss. This is fine. Refine the asserts, to check that we have only as much misses, as many shards we have without readers on them. Fixes: https://github.com/scylladb/scylladb/issues/14087 Closes scylladb/scylladb#15806 * github.com:scylladb/scylladb: test/boost/multishard_mutation_query_test: fix querier cache misses expectations test/lib/test_utils: add require_* variants for all comparators (cherry picked from commit `457d170078`)	2023-11-19 19:34:44 +02:00
Marcin Maliszkiewicz	6943447c6a	db: view: run local materialized view mutations on a separate smp service group When base write triggers mv write and it needs to be send to another shard it used the same service group and we could end up with a deadlock. This fix affects also alternator's secondary indexes. Testing was done using (yet) not committed framework for easy alternator performance testing: https://github.com/scylladb/scylladb/pull/13121. I've changed hardcoded max_nonlocal_requests config in scylla from 5000 to 500 and then ran: ./build/release/scylla perf-alternator-workloads --workdir /tmp/scylla-workdir/ --smp 2 \ --developer-mode 1 --alternator-port 8000 --alternator-write-isolation forbid --workload write_gsi \ --duration 60 --ring-delay-ms 0 --skip-wait-for-gossip-to-settle 0 --continue-after-error true --concurrency 2000 Without the patch when scylla is overloaded (i.e. number of scheduled futures being close to max_nonlocal_requests) after couple seconds scylla hangs, cpu usage drops to zero, no progress is made. We can confirm we're hitting this issue by seeing under gdb: p seastar::get_smp_service_groups_semaphore(2,0)._count $1 = 0 With the patch I wasn't able to observe the problem, even with 2x concurrency. I was able to make the process hang with 10x concurrency but I think it's hitting different limit as there wasn't any depleted smp service group semaphore and it was happening also on non mv loads. Fixes https://github.com/scylladb/scylladb/issues/15844 Closes scylladb/scylladb#15845 (cherry picked from commit `020a9c931b`)	2023-11-19 18:47:11 +02:00
Anna Stuchlik	b259bb43c6	doc: mark the link to upgrade guide as OSS-only This commit adds the .. only:: opensource directive to the Raft page to exclude the link to the 5.2-to-5.4 upgrade guide from the Enterprise documentation. The Raft page belongs to both OSS and Enterprise documentation sets, while the upgrade guide is OSS-only. This causes documentation build issues in the Enterprise repository, for example, https://github.com/scylladb/scylla-enterprise/pull/3242. As a rule, all OSS-only links should be provided by using the .. only:: opensource directive. This commit must be backported to branch-5.4 to prevent errors in the documentation for ScyllaDB Enterprise 2024.1 (backport) Closes scylladb/scylladb#16064 (cherry picked from commit `ca22de4843`)	2023-11-17 11:00:04 +02:00
Botond Dénes	88e96def63	migration_manager: also reload schema on enabling digest_insensitive_to_expiry Currently, when said feature is enabled, we recalcuate the schema digest. But this feature also influences how table versions are calculated, so it has to trigger a recalculation of all table versions, so that we can guarantee correct versions. Before, this used to happen by happy accident. Another feature -- table_digest_insensitive_to_expiry -- used to take care of this, by triggering a table version recalulation. However this feature only takes effect if digest_insensitive_to_expiry is also enabled. This used to be the case incidently, by the time the reload triggered by table_digest_insensitive_to_expiry ran, digest_insensitive_to_expiry was already enabled. But this was not guaranteed whatsoever and as we've recently seen, any change to the feature list, which changes the order in which features are enabled, can cause this intricate balance to break. This patch makes digest_insensitive_to_expiry also kick off a schema reload, to eliminate our dependence on (unguaranteed) feature order, and to guarantee that table schemas have a correct version after all features are enabled. In fact, all schema feature notification handlers now kick off a full schema reload, to ensure bugs like this don't creep in, in the future. Fixes: #16004 Closes scylladb/scylladb#16013 (cherry picked from commit `22381441b0`)	2023-11-16 17:46:53 +02:00
Kamil Braun	187e275147	system_keyspace: use system memory for `system.raft` table `system.raft` was using the "user memory pool", i.e. the `dirty_memory_manager` for this table was set to `database::_dirty_memory_manager` (instead of `database::_system_dirty_memory_manager`). This meant that if a write workload caused memory pressure on the user memory pool, internal `system.raft` writes would have to wait for memtables of user tables to get flushed before the write would proceed. This was observed in SCT longevity tests which ran a heavy workload on the cluster and concurrently, schema changes (which underneath use the `system.raft` table). Raft would often get stuck waiting many seconds for user memtables to get flushed. More details in issue #15622. Experiments showed that moving Raft to system memory fixed this particular issue, bringing the waits to reasonable levels. Currently `system.raft` stores only one group, group 0, which is internally used for cluster metadata operations (schema and topology changes) -- so it makes sense to keep use system memory. In the future we'd like to have other groups, for strongly consistent tables. These groups should use the user memory pool. It means we won't be able to use `system.raft` for them -- we'll just have to use a separate table. Fixes: scylladb/scylladb#15622 Closes scylladb/scylladb#15972 (cherry picked from commit `f094e23d84`)	2023-11-16 12:51:03 +01:00

1 2 3 4 5 ...

39421 Commits