For this, de-static run_snapshot_*_operation (because we no longer have
the static global to get the lock from) and make snapshot_ctl a
peering_sharded_service so it can call invoke_on.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
This includes
- rename namespace in snapshot-ctl.[cc|hh]
- move methods from storage_service to snapshot_ctl
- move snapshot_details struct
- temporarily make storage_service._snapshot_lock and ._snapshot_ops public
- replace two get_local_storage_service() occurrences with this._db
The latter is not 100% clear as the code that does this references "this"
from another shard, but the _db in question is the distributed object, so
they are all the same on all instances.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
This is a plain move; no other modifications are made, even the
"service" namespace is kept. Only a few broken-indentation fixes.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
A placeholder for snapshotting code that will be moved into it
from the storage_service.
Also -- pass it through the API for future use.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Now, with seastar httpd routes' unset() at hand, we can shut
down individual API endpoints. Do this for the snapshot calls;
this will make stopping the snapshot controller safe.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
The lambda calls the core snapshot method deep inside the
json marshalling callback. This will cause problems with
stopping the snapshot controller in the next patches.
To prepare for this, call .get_snapshot_details()
first, then keep the result in the do_with() context. This
change doesn't affect the issue the lambda in question is
meant to solve, as the whole result set is kept in
memory anyway while being streamed out.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Before Scylla 3.0, we used to send streaming mutations using
individual RPC requests and flush them together using dedicated
streaming memtables. This mechanism is no longer in use and all
versions that use it have long reached end-of-life.
Remove this code.
This makes the code a bit easier to read, as there are no discarded
futures and no references to keeping a subscription alive, which we
no longer need with current seastar.
Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>
Message-Id: <20200527013120.179763-1-espindola@scylladb.com>
"
Row level repair, when using a local reader, is prone to deadlocking on
the streaming reader concurrency semaphore. This has been observed to
happen with at least two participating nodes, running more concurrent
repairs than the maximum allowed amount of reads by the concurrency
semaphore. In this situation, it is possible that two repair instances,
competing for the last available permits on both nodes, get a permit on
one of the nodes and get queued on the other one, respectively. As
neither will let go of the permit it already acquired, nor give up
waiting on the permit it failed to acquire, a deadlock happens.
To prevent this, we make the local repair reader evictable. For this we
reuse the already existing evictable reader mechanism of the multishard
combining reader. This patchset refactors this evictable reader
mechanism into a standalone flat mutation reader, then exposes it to the
outside world.
The repair reader is paused after the repair buffer is filled (currently
32MB), so the cost of a possible reader recreation is amortized
over a 32MB read.
The repair reader is said to be local when it can use the shard-local
partitioner. This is the case if the participating nodes are homogeneous
(their shard configuration is identical), that is, the repair instance
has to read from just one shard. A non-local reader uses the multishard
reader, which already makes its shard readers evictable and is hence not
prone to the deadlock described here.
Fixes: #6272
Tests: unit(dev, release, debug)
"
* 'repair-row-level-evictable-local-reader/v3' of https://github.com/denesb/scylla:
repair: row_level: destroy reader on EOS or error
repair: row_level: use evictable_reader for local reads
mutation_reader: expose evictable_reader
mutation_reader: evictable_reader: add auto_pause flag
mutation_reader: make evictable_reader a flat_mutation_reader
mutation_reader: s/inactive_shard_read/inactive_evictable_reader/
mutation_reader: move inactive_shard_reader code up
mutation_reader: fix indentation
mutation_reader: shard_reader: extract remote_reader as evictable_reader
mutation_reader: reader_lifecycle_policy: make semaphore() available early
The database has a mechanism of performing internal CQL queries,
mainly to edit its own local tables. Unfortunately, it's easy
to use the interface incorrectly - e.g. issuing an `ALTER TABLE`
statement on a non-local table will result in not propagating
the schema change to other nodes, which in turn leads to
inconsistencies. In order to avoid such mistakes (one of them
was a root cause of #6513), when an attempt to alter a distributed
table via a local interface is performed, it results in an error.
Tests: unit(dev)
Fixes #6700
Message-Id: <61be3defb57be79f486e6067ceff4f4c965e34cb.1592990796.git.sarna@scylladb.com>
The function that determines whether a level L, where L > 0, is disjoint
is returning false when the level is disjoint.
That's because it incorrectly counts an overlapping SSTable in
the level as a disjoint SSTable. So we need to invert the logic.
The side effect is that boot will always try to reshape levels
greater than 0 because reshape procedure incorrectly thinks that
levels are overlapping when they're actually disjoint.
Fixes#6695.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Message-Id: <20200623180221.229695-1-raphaelsc@scylladb.com>
Currently the message only mentions the endpoint and the error message
returned from the replica. Add the keyspace and table to this message to
provide more context. This should help investigations of such errors
greatly, as in the case of tests where there is usually a single table,
we can already guess what exactly is timing out based on this.
We should add even more context, like the kind of query (single
partition or range scan), but this information is not readily available
in the surrounding scope, so this patch defers it.
Refs: #6548
Signed-off-by: Botond Dénes <bdenes@scylladb.com>
Message-Id: <20200624054647.413256-1-bdenes@scylladb.com>
Manual translation from JSON to string_view is replaced
with rjson::to_string_view helper function. In one place,
a redundant string_view intermediary is removed
in favor of creating the string straight from JSON.
Message-Id: <2aa9d9fedd73f14b7640870d14db4f2f0bd7bd8a.1592936139.git.sarna@scylladb.com>
Updating tags was erroneously done locally, which means that
the schema change was not propagated to other nodes.
The new code announces new schema globally.
Fixes #6513
Branches: 4.0,4.1
Tests: unit(dev)
dtest(alternator_tests.AlternatorTest.test_update_condition_expression_and_write_isolation)
Message-Id: <3a816c4ecc33c03af4f36e51b11f195c231e7ce1.1592935039.git.sarna@scylladb.com>
To avoid having to make it an optional, with all the additional checks,
we just replace it with an empty reader instead. This also achieves
the desired effect of releasing the read permit and all the associated
resources early.
Row level repair, when using a local reader, is prone to deadlocking on
the streaming reader concurrency semaphore. This has been observed to
happen with at least two participating nodes, running more concurrent
repairs than the maximum allowed amount of reads by the concurrency
semaphore. In this situation, it is possible that two repair instances,
competing for the last available permits on both nodes, get a permit on
one of the nodes and get queued on the other one, respectively. As
neither will let go of the permit it already acquired, nor give up
waiting on the permit it failed to acquire, a deadlock happens.
To prevent this, we make the local repair reader evictable. For this we
reuse the newly exposed evictable reader.
The repair reader is paused after the repair buffer is filled (currently
32MB), so the cost of a possible reader recreation is amortized
over a 32MB read.
The repair reader is said to be local when it can use the shard-local
partitioner. This is the case if the participating nodes are homogeneous
(their shard configuration is identical), that is, the repair instance
has to read from just one shard. A non-local reader uses the multishard
reader, which already makes its shard readers evictable and is hence not
prone to the deadlock described here.
Expose functions for the outside world to create evictable readers. We
expose two functions, which create an evictable reader with
`auto_pause::yes` and `auto_pause::no` respectively. The function
creating the latter also returns a handle in addition to the reader,
which can be used to pause the reader.
Currently the evictable reader unconditionally pauses the underlying
reader after each use (`fill_buffer()` or `fast_forward_to()` call).
This is fine for current users (the multishard reader), but the future
user we are doing all this refactoring for -- repair -- will want to
control when the underlying reader is paused "manually". Both these
behaviours can easily be supported in a single implementation, so we
add an `auto_pause` flag to allow the creator of the evictable reader
to control this.
The `evictable_reader` class is almost a proper flat mutation reader
already; it offers roughly the same interface. This patch makes this
formal: the class now inherits from `flat_mutation_reader::impl`
and implements all virtual methods. This also entails a departure from
using the lifecycle policy to pause/resume and create readers, instead
using more general building blocks like the reader concurrency semaphore
and a mutation source.
Unlike refresh on the upload dir, column family population shouldn't mutate
the level of SSTables to level 0. Otherwise, LCS will have to regenerate all
levels by rewriting the data multiple times, badly hurting write
amplification and consequently node performance. It also affects
the time for a node to boot, because reshape may be triggered as a
result.
Refs #6695.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Message-Id: <20200622192502.187532-2-raphaelsc@scylladb.com>
The thrift compiler (since 0.13 at least) complains that
the csharp target is deprecated and recommends replacing it
with netstd. Since we don't use either, humor it.
I suspect that this warning caused some spurious rebuilds,
but have not proven it.
Pager belongs to a different layer than CQL and thus should not be
coupled with CQL stats - if any different frontends want to use paging,
they shouldn't be forced to instantiate CQL stats at all.
The same goes for CQL restrictions, but that will require a much bigger
refactoring, so it is left for later.
Message-Id: <5585eb470949e3457334ffd6dba80742abf3a631.1592902295.git.sarna@scylladb.com>
In the section explaining how to build a docker image for a self-built
Scylla executable, we have a warning that even if you already built
Scylla, build_reloc.sh will re-run configure.py and rebuild the executable
with slightly different options.
The re-run of configure.py and ninja still happens (see issue #6547) but
we no longer pass *different* options to configure.py, so the rebuild
usually doesn't do anything and finishes in seconds, and the paragraph
warning about the rebuild is no longer relevant.
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Message-Id: <20200621093049.975044-1-nyh@scylladb.com>
* seastar a6c8105443...7664f991b9 (13):
> gate: add try_enter and try_with_gate
> Merge "Manage reference counts in the file API" from Rafael
> cmake: Refactor a bit of duplicated code
> stream: Delete _sub
> future: Add a rethrow_exception to future_state_base
> future: Use a new seastar::nested_exception in finally
> cmake: only apply C++ compile options to C++ language
> testing: Enable fail-on-abandoned-failed-futures by default
> future: Correct a few hypercorrect uses of std::forward
> futures_test: Test using future::then with functions
> Merge "io-queue: A set of cleanups collected so far" from Pavel E
> tmp_file: Replace futurize_apply with futurize_invoke
> future: Replace promise::set_coroutine with forward_state_and_schedule
Contains update to tests from Rafael:
tests: Update for fail-on-abandoned-failed-futures's new default
This depends on the corresponding change in seastar.
Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>
Rename `inactive_shard_read` to `inactive_evictable_reader` to reflect
the fact that the evictable reader is going to be of general use,
not specific to the multishard reader.
We want to make the evictable reader mechanism used in the multishard
reader pipeline available for general (re)use, as a standalone
flat mutation reader implementation. The first step is extracting
`shard_reader::remote_reader`, the class implementing this logic, into a
top-level class, also renamed to `evictable_reader`.
Currently all reader lifecycle policy implementations assume that
`semaphore()` will only be called after at least one call to
`make_reader()`. This assumption will soon not hold, so make sure
`semaphore()` can be called at any time, including before any calls are
made to `make_reader()`.
On Ubuntu 18.04 and earlier & Debian 10 and earlier, the /usr merge is not
done, so /usr/bin/systemd-escape and /bin/systemd-escape are different
places; we call /usr/bin/systemd-escape, but Debian variants try to install
the command in /bin.
Drop the full path; just call the command name and resolve it via the
default PATH.
Fixes: #6650
The LCS reshape job may pick a wrong level because we iterate through
levels from index 1 and stop the iteration as soon as the current
level is NOT disjoint, so it happens that we never reach the upper
levels, meaning the first NOT disjoint level is used,
and not the actual maximum filled level. That's fixed by doing
the iteration in the reverse order.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Message-Id: <20200618154112.8335-1-raphaelsc@scylladb.com>
Streaming is handled by just one group for CPU scheduling, so
separating it into read and write classes for I/O is artificial, and
inflates the resources we allow for streaming if both reads and writes
happen at the same time.
Merge both classes into one class ("streaming") and adjust callers. The
merged class has 200 shares, so it reduces streaming bandwidth if both
directions are active at the same time (which is rare; I think it only
happens in view building).
Retrying the operation of fetching generations does not always make
sense. In this patch, only the lightest exceptions (timeout and
unavailable) trigger a retry, while the heavy, unrecoverable ones
abort the operation and get logged at ERROR level.
Fixes #6557