scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-08 16:03:20 +00:00

Author	SHA1	Message	Date
Benny Halevy	ecba37dbfd	effective_replication_map: enable_lw_shared_from_this So a effective_replication_map_ptr can be generated using a raw pointer by effective_replication_map_factory. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-11-19 10:46:51 +02:00
Benny Halevy	f4f41e2908	effective_replication_map: define factory_key To be used to locate the effective_replication_map in the to-be-introduced effective_replication_map_factory. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-11-19 10:46:51 +02:00
Benny Halevy	5947de7674	keyspace: get a reference to the erm_factory To be used for creating effective_replication_map. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-11-19 10:46:51 +02:00
Benny Halevy	1d7556d099	main: pass erm_factory to storage_service To be used for creating effective_replication_map when token_metadata changes, and update all keyspaces with it. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-11-19 10:46:51 +02:00
Benny Halevy	242043368e	main: pass erm_factory to storage_proxy To be used for creating the effective_replication_map per keyspace. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-11-19 10:46:51 +02:00
Benny Halevy	3fed73e7c2	locator: add effective_replication_map_factory It will be used further to create shared copies of effective_replication_map based on replication_strategy type and config options. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-11-19 10:46:51 +02:00
Raphael S. Carvalho	4b1bb26d5a	compaction: Make maybe_replace_exhausted_sstables_by_sst() more robust Make it more robust by tracking both partial and sealed sstables. This way, maybe_r__e__s__by_sst() won't pick partial sstables as part of incremental compaction. It works today because interposer consumer isn't enabled with incremental compaction, so there's a single consumer which will have sealed the sstable before the function for early replacement is called, but the story is different if both is enabled. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20211117135817.16274-1-raphaelsc@scylladb.com>	2021-11-17 17:21:53 +02:00
Avi Kivity	bc75e2c1d1	treewide: wrap runtime formats with fmt::runtime for fmt 8 fmt 8 checks format strings at compile time, and requires that non-compile-time format strings be wrapped with fmt::runtime(). Do that, and to allow coexistence with fmt 7, supply our own do-nothing version of fmt::runtime() if needed. Strictly speaking we shouldn't be introducing names into the fmt namespace, but this is transitional only. Closes #9640	2021-11-17 15:21:36 +02:00
Gleb Natapov	6744b466e4	cql3: co-routinize alter_type_statement::announce_migration Message-Id: <YZUAlx3fHdVRSlqX@scylladb.com>	2021-11-17 15:20:37 +02:00
Gavin Howell	28f8c3987e	docs/alternator: copyedit alternator.md Line 41. Grammar correction needed. Unclear meaning in sentence. word "message" added after "error". Comma added after "message". Closes #9648	2021-11-17 15:06:21 +02:00
Gavin Howell	7b0a5cdeb2	docs/alternator: typo in compatibility.md Line 170. "PoinInTime" changed to "PointInTime" Closes #9650	2021-11-17 15:03:40 +02:00
Calle Wilund	a8bb4dcd28	tls: Add certficate_revocation_list option for client/server encryption options Fixes #9630 Adds support for importing a CRL certificate reovcation list. This will be monitored and reloaded like certs/keys. Allows blacklisting individual certs. Closes #9655	2021-11-17 14:24:22 +02:00
Nadav Har'El	82bcc2cbd2	Merge: redis: get controller in line Merged patch series from Botond Dénes: Redis's controller, unlike all other protocol's controllers is called service and is not even in the redis namespace. This is made even worse by the redis directory also having a server.{hh,cc}, making one always second guessing on which is what. This series applies to the redis controller the convention used by (almost) all other service controller classes: * They are called controller * They are in a file called ${protocol}/controller.{hh,cc} * They are in a namespace ${protocol} (Thrift is not perfectly following this either). Botond Dénes (3): redis: redis_service: move in redis namespace redis: redis::service -> redis::controller redis: mv service.* -> controller.* configure.py \| 2 +- main.cc \| 10 ++++----- redis/{service.cc => controller.cc} \| 32 ++++++++++++++++------------- redis/{service.hh => controller.hh} \| 10 ++++----- 4 files changed, 29 insertions(+), 25 deletions(-) rename redis/{service.cc => controller.cc} (87%) rename redis/{service.hh => controller.hh} (93%)	2021-11-17 14:19:36 +02:00
Botond Dénes	d4d4c0ace7	redis: mv service.* -> controller.*	2021-11-17 13:58:49 +02:00
Botond Dénes	618adeddd8	redis: redis::service -> redis::controller Follow the naming scheme for the controller class/instance used by all other protocol controllers: * rename class: service -> controller; * rename variable in main.cc: redis -> redis_ctl;	2021-11-17 13:47:44 +02:00
Botond Dénes	95510c6f92	redis: redis_service: move in redis namespace	2021-11-17 13:44:41 +02:00
Piotr Jastrzebski	033a75ff96	cdc: Don't support "on" and "off" values for preimage any more This is an undocumented feature that causes confusion so let's get rid of it. tests: unit(dev) Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Closes #9639	2021-11-17 11:54:11 +01:00
Tzach Livyatan	0b6c49b03e	docs website: update latest branch to 4.5 Closes #9638	2021-11-17 12:33:22 +02:00
Avi Kivity	12d29b28ab	raft: generator: correct constraints on members A member variable is a reference, not a pure value, so std::same_as<> needs to be given a reference (and clanf 13 insists). However, clang 12 doesn't accept the correct constraint, so use std::convertible_to<> as a compromise. Closes #9642	2021-11-17 11:27:52 +02:00
Benny Halevy	9548220b70	compaction_manager: submit_offstrategy: remove task in finally clause Now, when the offstrategy task is stopped, it exits the repeat loop if (!can_proceed(task)) without going through _tasks.remove(task) - causing the assert in compaction_manger::remove to trip, as stop_ongoing_compactions will be resolved while the task is still listed in _tasks. Fixes #9634 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-11-17 09:53:59 +02:00
Avi Kivity	720e9521f0	utils: build_id: correct fmt include fmt::print(std::ostream&) is in <fmt/ostream.h> Closes #9641	2021-11-17 09:02:57 +02:00
Avi Kivity	edcdbc16d3	db: heat weighted load balancing: remove unused variable total_deficit The variable is write-only. Closes #9647	2021-11-17 09:02:23 +02:00
Avi Kivity	e51fcc22f3	sstable_loader: add missing include <cfloat> Needed for FLT_EPSILON Closes #9646	2021-11-17 09:01:49 +02:00
Avi Kivity	2c1e30a12a	test: network_topology_strategy_test: remove unused variable total_rf It is write-only. Closes #9645	2021-11-17 09:01:24 +02:00
Avi Kivity	cba07a3145	test: perf: fix format string for scheduling_latency_measurer Need a colon to introduce the format after the default argument specifier. Found by fmt 8. Closes #9644	2021-11-17 09:00:56 +02:00
Avi Kivity	6ece375fc8	repair: add missing include <cfloat> Needed for FLT_EPSILON Closes #9643	2021-11-17 09:00:11 +02:00
Amos Kong	32e62252e1	debian/build_offline_installer.sh: config apt to keep downloaded packages The downloaded packages might be deleted autotically after installation, then we will provide an incomplete installer to user. This patch changed to config apt to keep the downloaded packages before installation. Signed-off-by: Amos Kong <kongjianjun@gmail.com> Closes #9592	2021-11-16 17:47:01 +02:00
Avi Kivity	e2c27ee743	Merge 'commitlog: recalculate disk footprint on delete_segment exceptions' from Calle Wilund If we get errors/exceptions in delete_segments we can (and probably will) loose track of disk footprint counters. This can in turn, if using hard limits, cause us to block indefinitely on segment allocation since we might think we have larger footprint than we actually do. Of course, if we actually fail deleting a segment, it is 100% true that we still technically hold this disk footprint (now unreachable), but for cases where for example outside forces (or wacky tests) delete a file behind our backs, this might not be true. One could also argue that our footprint is the segments and file names we keep track of, and the rest is exterior sludge. In any case, if we have any exceptions in delete_segments, we should recalculate disk footprint based on current state, and restart all new_segment paths etc. Fixes #9348 (Note: this is based on previous PR #9344 - so shows these commits as well. Actual changes are only the latter two). Closes #9349 * github.com:scylladb/scylla: commitlog: Recalculate footprint on delete_segment exceptions commitlog_test: Add test for exception in alloc w. deleted underlying file commitlog: Ensure failed-to-create-segment is re-deleted commitlog::allocate_segment_ex: Don't re-throw out of function	2021-11-16 17:44:56 +02:00
Tomasz Grabiec	bf6898a5a0	lsa: Add sanity checks around lsa_buffer operations We've been observing hard to explain crashes recently around lsa_buffer destruction, where the containing segment is absent in _segment_descs which causes log_heap::adjust_up to abort. Add more checks to catch certain impossible senarios which can lead to this sooner. Refs #9192. Message-Id: <20211116122346.814437-1-tgrabiec@scylladb.com>	2021-11-16 14:25:02 +02:00
Tomasz Grabiec	4d627affc3	lsa: Mark compact_segment_locked() as noexcept We cannot recover from a failure in this method. The implementation makes sure it never happens. Invariants will be broken if this throws. Detect violations early by marking as noexcept. We could make it exception safe and try to leave the data structures in a consistent state but the reclaimer cannot make progress if this throws, so it's pointless. Refs #9192 Message-Id: <20211116122019.813418-1-tgrabiec@scylladb.com>	2021-11-16 14:23:10 +02:00
Pavel Emelyanov	a62631d441	config: Enable developer-mode by default in dev/debug modes Other than looking sane, this change continues the founded by the --workdir option tradition of freeing the developer form annoying necessity to type too many options when scylla is started by hand for devel purposes. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20211116104815.31822-1-xemul@scylladb.com>	2021-11-16 12:53:33 +02:00
Pavel Emelyanov	ba16318457	generic_server: Keep server alive during conn background processing There's at least one tiny race in generic_server code. The trailing .handle_exception after the conn->process() captures this, but since the whole continuation chain happens in the background, that this can be released thus causing the whole lambda to execute on freed generic_server instance. This, in turn, is not nice because captured this is used to get a _logger from. The fix is based on the observation that all connections pin the server in memory until all of them (connections) are destructed. Said that, to keep the server alive in the aforementioned lambda it's enough to make sure the conn variable (it's lw_shared_ptr on the connection) is alive in it. Not to generate a bunch of tiny continuations with identical set of captures -- tail the single .then_wrapped() one and do whatever is needed to wrap up the connection processing in it. tests: unit(dev) fixes: #9316 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20211115105818.11348-1-xemul@scylladb.com>	2021-11-16 11:10:39 +02:00
Pavel Emelyanov	6131aea203	scylla-gdb: Handle new fair_queue::priority_class_data representation In the full-duplex capable scheduler the _handles list contains direct pointers on pclass data, not lw_shared_ptr's. Most of the time this container is empty so this bug is not triggerable right at once. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20211116084250.21399-1-xemul@scylladb.com>	2021-11-16 11:07:21 +02:00
Botond Dénes	b136746040	mutation_partition: deletable_row::apply(shadowable_tombstone): remove redundant maybe_shadow() Shadowing is already checked by the underlying row_tombstone::apply(). This redundant check was introduced by a previous fix to #9483 (`6a76e12768`). The rest of that patch is good. Refs: #9483 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20211115091513.181233-1-bdenes@scylladb.com>	2021-11-15 17:50:41 +01:00
Botond Dénes	64bb48855c	flat_mutation_reader: revamp flat_mutation_reader_from_mutations() Add schema parameter so that: * Caller has better control over schema -- especially relevant for reverse reads where it is not possible to follow the convention of passing the query schema which is reversed compared to that of the mutations. * Now that we don't depend on the mutations for the schema, we can lift the restriction on mutations not being empty: this leads to safer code. When the mutations parameter is empty, an empty reader is created. Add "make_" prefix to follow convention of similar reader factory functions. Tests: unit(dev) Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20211115155614.363663-1-bdenes@scylladb.com>	2021-11-15 17:58:46 +02:00
Nadav Har'El	6e1344eb4f	alternator: better error handling for wrongly-encoded numbers In the DynamoDB API, a number is encoded in JSON requests as something like: {"N": "123"} - the type is "N" and the value "123". Note that the value of the number is encoded as a string, because the floating-point range and accuracy of DynamoDB differs from what various JSON libraries may support. We have a function unwrap_number() which supported the value of the number being encoded as an actual number, not a string. But we should NOT support this case - DynamoDB doesn't. In this patch we add a test that confirms that DynamoDB doesn't, and remove the unnecessary case from unwrap_number(). The unnecessary case also had a FIXME, so it's a good opportunity to get rid of a FIXME. When writing the test, I noticed that the error which DynamoDB returns in this case is SerializionException instead of the more usual ValidationException. I don't know why, but let's also change the error type in this patch. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20211115125738.197099-1-nyh@scylladb.com>	2021-11-15 14:47:49 +01:00
Botond Dénes	802e3642a0	Update tools/java submodule * tools/java cb6c1d07a7...8fae618f7f (2): > removeNode: support ignoreNodes options > build: replace yum with dnf	2021-11-15 15:41:27 +02:00
Botond Dénes	f313706d80	Update tools/jmx submodule * tools/jmx d6225c5...2c43d99 (2): > removeNode: support ignoreNodes options > build: replace yum with dnf	2021-11-15 15:41:27 +02:00
Avi Kivity	a19d00ef9b	dist: scylla_raid_setup: mount XFS with online discard Online discard asks the disk to erase flash memory cells as soon as files are deleted. This gives the disk more freedom to choose where to place new files, so it improves performance. On older kernel versions, and on really bad disks, this can reduce performance so we add an option to disable it. Since fstrim is pointless when online discard is enabled, we don't configure it if online discard is selected. I tested it on an AWS i3.large instance, the flag showd up in `mount` after configuration. Closes #9608	2021-11-15 14:16:08 +02:00
Avi Kivity	c17101604f	Merge 'Revert "scylla_util.py: return bool value on systemd_unit.is_active()"' from Takuya ASADA On scylla_unit.py, we provide `systemd_unit.is_active()` to return `systemctl is-active` output. When we introduced systemd_unit class, we just returned `systemctl is-active` output as string, but we changed the return value to bool after that (`2545d7fd43`). This was because `if unit.is_active():` always becomes True even it returns "failed" or "inactive", to avoid such scripting bug. However, probably this was mistake. Because systemd unit state is not 2 state, like "start" / "stop", there are many state. And we already using multiple unit state ("activating", "failed", "inactive", "active") in our Cloud image login prompt: https://github.com/scylladb/scylla-machine-image/blob/next/common/scylla_login#L135 After we merged `2545d7fd43`, the login prompt is broken, because it does not return string as script expected (https://github.com/scylladb/scylla-machine-image/issues/241). I think we should revert `2545d7fd43`, it should return exactly same value as `systemctl is-active` says. Fixes #9627 Fixes scylladb/scylla-machine-image#241 Closes #9628 * github.com:scylladb/scylla: scylla_ntp_setup: use string in systemd_unit.is_active() Revert "scylla_util.py: return bool value on systemd_unit.is_active()"	2021-11-15 13:56:28 +02:00
Takuya ASADA	279fabe9b4	scylla_ntp_setup: use string in systemd_unit.is_active() Since we reverted `2545d7fd43`, we need to use string instead of bool value.	2021-11-15 19:50:31 +09:00
Takuya ASADA	d646673705	Revert "scylla_util.py: return bool value on systemd_unit.is_active()" This reverts commit `2545d7fd43`. Fixes #9627 Fixes scylladb/scylla-machine-image#241	2021-11-15 19:50:31 +09:00
Pavel Emelyanov	4e86936850	redis: Remove stop_server deferred action from main Commit `3f56c49a9e` put redis into protocol_servers list of storage service. Since then there's no need in explicit stop_server call on shutdown -- the protocol_servers thing will do it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20211109154259.1196-1-xemul@scylladb.com>	2021-11-15 11:58:44 +02:00
Avi Kivity	4d7a013e94	sstables: mx: writer: make large partition stats accounting branch-free It is bad form to introduce branches just for statistics, since branches can be expensive (even when perfectly predictable, they consume branch history resources). Switch to simple addition instead; this should be not cause any cache misses since we already touch other statistics earlier. The inputs are already boolean, but cast them to boolean just so it is clear we're adding 0/1, not a count. Closes #9626	2021-11-15 11:28:48 +02:00
Benny Halevy	9d4262e264	protocol_server: add per-protocol is_server_running method Change `b0a2a9771f` broke the generic api implementation of is_native_transport_running that relied on the addresses list being empty agter the server is stopped. To fix that, this change introduces a pure virtual method: protocol_server::is_server_running that can be implemented by each derived class. Test: unit(dev) DTest: nodetool_additional_test.py:TestNodetool.binary_test Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20211114135248.588798-1-bhalevy@scylladb.com>	2021-11-14 16:01:31 +02:00
Avi Kivity	c9b8b84411	build: replace yum with dnf dnf has replaced yum on Fedora and CentOS. On modern versions of Fedora, you have to install an extra package to get the old name working, so avoid that inconvenience and use dnf directly. Closes #9622	2021-11-14 14:41:47 +02:00
Michael Livshin	a7511cf600	system keyspace: record partitions with too many rows Add "rows" field to system.large_partitions. Add partitions to the table when they are too large or have too many rows. Fixes #9506 Signed-off-by: Michael Livshin <michael.livshin@scylladb.com> Closes #9577	2021-11-14 14:25:18 +02:00
Avi Kivity	98ec98ba36	Update seastar submodule * seastar 04c6787b35...f8a038a0a2 (1): > http: disable Nagle's algorithm for the http server Fixes #9619.	2021-11-14 13:21:06 +02:00
Avi Kivity	6cb3caaf39	Update seastar submodule * seastar a189cdc45...04c6787b3 (12): > Convert std::result_of to std::invoke_result > Merge "IO queue full-duplex mode" from Pavel E > Merge "Report bytes/ops for R and W separately" from Pavel E > websocket: override std::exception::what() correctly > tests: websocket_test: remove unused lambda capture > Merge "Improve IO classes preemption" from Pavel E > Revert "Merge "Improve IO classes preemption" from Pavel E" > Merge "Add skeleton implementation of a WebSocket server" from Piotr S > Merge "Improve IO classes preemption" from Pavel E > io_queue: Add starvation time metrics (per-class) > Revert "Merge "Add skeleton implementation of a WebSocket server" from Piotr S" > Merge "Add skeleton implementation of a WebSocket server" from Piotr S	2021-11-13 11:56:28 +02:00
Piotr Sarna	cc544ba117	service: coroutinize client_state.cc No functional changes, but makes the code shorter and gets rid of a few allocations. Coroutinizing has_column_family_access is deliberately skipped and commented, since some callers expect this function to throw instead of returning an exceptional future. Message-Id: <958848a1eeeef490b162d2d2b805c8a14fc9082b.1636704996.git.sarna@scylladb.com>	2021-11-12 21:52:29 +02:00

1 2 3 4 5 ...

28959 Commits