scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-06 06:53:12 +00:00

Author	SHA1	Message	Date
Raphael S. Carvalho	68a4739a42	table: remove sstable::shared() condition from backlog tracker add/remove functions Now that table no longer accept shared SSTables, those two functions can be simplified by removing the shared condition. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2020-06-29 14:21:34 -03:00
Raphael S. Carvalho	343efe797d	table: No longer accept a shared SSTable With off-strategy work on reshard on boot and refresh, table no longer needs to work with Shared SSTables. That will unlock a host of cleanups. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2020-06-29 14:21:04 -03:00
Nadav Har'El	23ce6864a3	alternator test: ProjectionExpression test for BatchGetItem The tests in test_projection_expression.py test that ProjectionExpression works - including attribute paths - for the GetItem, Query and Scan operations. There is a fourth read operation - BatchGetItem, and it supports ProjectionExpression too. We tested BatchGetItem + ProjectionExpression in test_batch.py, but this only tests the basic feature, with top-level attributes, and we were missing a test for nested document paths. This patch adds such a test. It is still xfailing on Alternator (and passing on DynamoDB), because attribute paths are still not supported (this is issue #5024). Refs #5024. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200629063244.287571-1-nyh@scylladb.com>	2020-06-29 08:51:05 +02:00
Nadav Har'El	b6fdd956bd	alternator test: ProjectionExpression tests for document paths This patch adds three more tests for the ProjectionExpression parameter of GetItem. They are tests for nested document paths like a.b[2].c. We don't support nested paths in Alternator yet (this is issue #5024), so the new tests all xfail (and pass on DynamoDB). We already had similar tests for UpdateExpression, which also needs to support document paths, but the tests were missing for ProjectionExpression. I am planning to start the implementation of document paths with ProjectionExpression (which is the simplest use of document paths), so I want the tests for this expression to be as complete as possible. Refs #5024. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200628213208.275050-1-nyh@scylladb.com>	2020-06-29 08:50:55 +02:00
Avi Kivity	509442b128	Merge "Move snapshot code from storage_service into independent component" from Pavel E " The snapshotting code is already well isolated from the rest of the storage_service, so it's relatively easy to move it into independent component, thus de-bloating the storage_service. As a side effect this allows painless removal of calls to global get_storage_service() from schema::describe code. Test: unit(debug), dtest.snapshot_test(dev), manual start-stop " * 'br-snapshot-controller-4' of https://github.com/xemul/scylla: snap: Get rid of storage_service reference in schema.cc main: Stop http server snapshot: Make check_snapshot_not_exist a method snapshots: Move ops gate from storage_service snapshot: Move lock from storage_service snapshot: Move all code into db::snapshot_ctl class storage_service: Move all snapshot code into snapshot-ctl.cc snapshots: Initial skeleton snapshots: Properly shutdown API endpoints api: Rewrap set_server_snapshot lambda	2020-06-28 13:17:32 +03:00
Takuya ASADA	a77882f075	scylla_setup: don't print prompt message multiple times when disk list passed When comma sepalated disk list passed to RAID prompt, it show up prompt message multiple times. It should always just one time. Fixes #6724	2020-06-28 12:19:22 +03:00
Takuya ASADA	835e76fdfc	scylla_setup: don't add same disk device twice We shouldn't accept adding same disk twice for RAID prompt. Fixes #6711	2020-06-28 12:19:22 +03:00
Botond Dénes	e31f7316c0	mutation_reader: evictable_reader: add assert against pause handle leak We are currently investigating a segmentation fault, which is suspected to be caused by a leaked pause handle. Although according to the latest theory the handle leak is not the root cause of the issue, just a symptom, its better to catch any bugs that would cause a handle leaking at the act, and not later when some side-effect causes a segfault. Refs: #6613 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200625153729.522811-1-bdenes@scylladb.com>	2020-06-28 12:08:25 +03:00
Avi Kivity	3e2eeec83a	Merge "Fix handling of decimals with negative scales" from Rafael " Before this series scylla would effectively infinite loop when, for example, casting a decimal with a negative scale to float. Fixes #6720 " * 'espindola/fix-decimal-issue' of https://github.com/espindola/scylla: big_decimal: Add a test for a corner case big_decimal: Correctly handle negative scales big_decimal: Add a as_rational member function big_decimal: Move constructors out of line	2020-06-28 12:06:35 +03:00
Dejan Mircevski	2d91e5f6a0	configure.py: Drop unused var cassandra_interface The variable doesn't appear to be used anywhere. Tests: manually run configure.py Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-06-27 21:20:05 +03:00
Dejan Mircevski	65030f1406	configure.py: Update gcc version check As HACKING.md suggests, we now require gcc version >= 10. Set the minimum at 10.1.1, as that is the first official 10 release: https://gcc.gnu.org/releases.html Tests: manually run configure.py and ensure it passes/fails appropriately. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-06-27 21:19:00 +03:00
Dejan Mircevski	a12bbef980	README.md: dedupe "offers offers" The word "offers" was inadvertently repeated in a sentence. Signed-off-by: Dejan Mircevski <github@mircevski.com>	2020-06-27 21:17:33 +03:00
Pavel Emelyanov	f045cec586	snap: Get rid of storage_service reference in schema.cc Now when the snapshot stopping is correctly handled, we may pull the database reference all the way down to the schema::describe(). One tricky place is in table::napshot() -- the local db reference is pulled through an smp::submit_to call, but thanks to the shard checks in the place where it is needed the db is still "local" Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-06-26 20:28:25 +03:00
Pavel Emelyanov	8d2e05778c	main: Stop http server Currently it's not stopped at all, so calling a REST request shutdown-time may crash things at random places. Fixes: #5702 But it's not the end of the story. Since the server stays up while we are shutting things down, each subsystem should carefully handle the cases when it's half-down, but a request comes. A better solution is to unregister rest verbs eventually, but httpd's rules cannot do it now. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-06-26 20:27:28 +03:00
Pavel Emelyanov	9211df2cdf	snapshot: Make check_snapshot_not_exist a method Sanitation. It now can access the this->_db pointer. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-06-26 20:26:15 +03:00
Pavel Emelyanov	ba47ef0397	snapshots: Move ops gate from storage_service Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-06-26 20:17:21 +03:00
Pavel Emelyanov	e439873319	snapshot: Move lock from storage_service For this de-static run_snapshot_*_operation (because we no longer have the static global to get the lock from) and make the snapshot_ctl be peering_sharded_service to call invoke_on. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-06-26 20:17:19 +03:00
Pavel Emelyanov	d674baacef	snapshot: Move all code into db::snapshot_ctl class This includes - rename namespace in snapshot-ctl.[cc\|hh] - move methods from storage_service to snapshot_ctl - move snapshot_details struct - temporarily make storage_service._snapshot_lock and ._snapshot_ops public - replace two get_local_storage_service() occurrences with this._db The latter is not 100% clear as the code that does this references "this" from another shard, but the _db in question is the distributed object, so they are all the same on all instances. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-06-26 19:59:53 +03:00
Pavel Emelyanov	8d36607044	storage_service: Move all snapshot code into snapshot-ctl.cc This is plain move, no other modifications are made, even the "service" namespace is kept, only few broken indentation fixes. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-06-26 19:54:15 +03:00
Pavel Emelyanov	d989d9c1c7	snapshots: Initial skeleton A placeholder for snapshotting code that will be moved into it from the storage_service. Also -- pass it through the API for future use. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-06-26 19:54:14 +03:00
Pavel Emelyanov	9a8a1635b7	snapshots: Properly shutdown API endpoints Now with the seastar httpd routes unset() at hands we can shut down individual API endpoints. Do this for snapshot calls, this will make snapshot controller stop safe. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-06-26 17:27:45 +03:00
Pavel Emelyanov	b608652622	api: Rewrap set_server_snapshot lambda The lambda calls the core snapshot method deep inside the json marshalling callback. This will bring problems with stopping the snapshot controller in the next patches. To prepare for this -- call the .get_snapshot_details() first, then keep the result in do_with() context. This change doesn't affect the issue the lambde in question is about to solve as the whole result set is anyway kept in memory while being streamed outside. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-06-26 17:27:45 +03:00
Rafael Ávila de Espíndola	85bb7ff743	big_decimal: Add a test for a corner case This behavior is different from cassandra, but without arithmetic operations it doesn't seem possible to notice the difference from CQL. Using avg produces the same results, since we use an initial value of 0 (scale = 0). Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-06-25 15:37:23 -07:00
Rafael Ávila de Espíndola	684f32c862	big_decimal: Correctly handle negative scales A negative scale was being passed an a positive value to boost::multiprecision::pow, which would never finish. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-06-25 15:34:10 -07:00
Rafael Ávila de Espíndola	bac0f3a9ee	big_decimal: Add a as_rational member function This just refactors some duplicated code so that it can be fixed in one place. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-06-25 15:33:31 -07:00
Rafael Ávila de Espíndola	77725ce1a4	big_decimal: Move constructors out of line Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-06-25 15:33:01 -07:00
Benny Halevy	a843945115	comapction: restore % in compaction completion message The % sign fell off in `c4841fa735` Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20200625151352.736561-1-bhalevy@scylladb.com>	2020-06-25 18:11:59 +02:00
Avi Kivity	e5be3352cf	database, streaming, messaging: drop streaming memtables Before Scylla 3.0, we used to send streaming mutations using individual RPC requests and flush them together using dedicated streaming memtables. This mechanism is no longer in use and all versions that use it have long reached end-of-life. Remove this code.	2020-06-25 15:25:54 +02:00
Raphael S. Carvalho	b17d20b5f4	reshape: LCS: avoid unnecessary work on level 0 No need to sort level 0 as we only check if levels > 0 are disjoint. Also taking the opportunity to avoid copies when sorting. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20200624151921.20160-1-raphaelsc@scylladb.com>	2020-06-24 18:27:22 +03:00
Rafael Ávila de Espíndola	67c22c8697	commitlog::read_log_file: Don't discard a future This makes the code a bit easier to read as there are no discarded futures and no references to having to keep a subscription alive, which we don't with current seastar. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200527013120.179763-1-espindola@scylladb.com>	2020-06-24 17:22:29 +03:00
Botond Dénes	5ff6ac52b2	scylla-gdb.py: collection element func: accept references and pointers to collections Add support to references (both lvalue and rvalue) and pointers to collections as well, in addition to plain values. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200624101305.428925-1-bdenes@scylladb.com>	2020-06-24 13:31:18 +03:00
Avi Kivity	a9c7a1a86c	Merge "repair: row_level: prevent deadlocks when repairing homogenous nodes" from Botond " Row level repair, when using a local reader, is prone to deadlocking on the streaming reader concurrency semaphore. This has been observed to happen with at least two participating nodes, running more concurrent repairs than the maximum allowed amount of reads by the concurrency semaphore. In this situation, it is possible that two repair instances, competing for the last available permits on both nodes, get a permit on one of the nodes and get queued on the other one respectively. As neither will let go of the permit it already acquired, nor give up waiting on the failed-to-acquired permit, a deadlock happens. To prevent this, we make the local repair reader evictable. For this we reuse the already existing evictable reader mechanism of the multishard combining reader. This patchset refactors this evictable reader mechanism into a standalone flat mutation reader, then exposes it to the outside world. The repair reader is paused after the repair buffer is filled, which is currently 32MB, so the cost of a possible reader recreation is amortized over 32MB read. The repair reader is said to be local, when it can use the shard-local partitioner. This is the case if the participating nodes are homogenous (their shard configuration is identical), that is the repair instance has to read just from one shard. A non-local reader uses the multishard reader, which already makes its shard readers evictable and hence is not prone to the deadlock described here. Fixes: #6272 Tests: unit(dev, release, debug) " * 'repair-row-level-evictable-local-reader/v3' of https://github.com/denesb/scylla: repair: row_level: destroy reader on EOS or error repair: row_level: use evictable_reader for local reads mutation_reader: expose evictable_reader mutation_reader: evictable_reader: add auto_pause flag mutation_reader: make evictable_reader a flat_mutation_reader mutation_reader: s/inactive_shard_read/inactive_evictable_reader/ mutation_reader: move inactive_shard_reader code up mutation_reader: fix indentation mutation_reader: shard_reader: extract remote_reader as evictable_reader mutation_reader: reader_lifecycle_policy: make semaphore() available early	2020-06-24 12:55:34 +03:00
Piotr Sarna	c2939c67b2	test: add a case for local altering of distributed tables Local altering, which does not propagate the change to other nodes, should not be allowed for a non-local table. Refs #6700 Message-Id: <34a2b191c0e827f296e6d720dc31bf8bda0fd160.1592990796.git.sarna@scylladb.com>	2020-06-24 12:51:41 +03:00
Piotr Sarna	835734c99d	cql3: disallow altering non-local tables with local queries The database has a mechanism of performing internal CQL queries, mainly to edit its own local tables. Unfortunately, it's easy to use the interface incorrectly - e.g. issuing an `ALTER TABLE` statement on a non-local table will result in not propagating the schema change to other nodes, which in turn leads to inconsistencies. In order to avoid such mistakes (one of them was a root cause of #6513), when an attempt to alter a distributed table via a local interface is performed, it results in an error. Tests: unit(dev) Fixes #6700 Message-Id: <61be3defb57be79f486e6067ceff4f4c965e34cb.1592990796.git.sarna@scylladb.com>	2020-06-24 12:51:40 +03:00
Raphael S. Carvalho	864eb20002	reshape: Fix reshaping procedure for LCS The function that determines if a level L, where L > 0, is disjoint, is returning false if level is disjoint. That's because it incorrectly accounts an overlapping SSTable in the level as a disjoint SSTable. So we need to inverse the logic. The side effect is that boot will always try to reshape levels greater than 0 because reshape procedure incorrectly thinks that levels are overlapping when they're actually disjoint. Fixes #6695. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20200623180221.229695-1-raphaelsc@scylladb.com>	2020-06-24 12:50:19 +03:00
Avi Kivity	1398628e8a	Update seastar submodule cql3/functions/error_injection_fcts.cc adjusted for smp::invoke_on_all() now requiring nothrow move constructible functions. * seastar 7664f991b9...11e86172ba (4): > Merge "smp: make submit_to noexcept" from Benny > memory: Fix clang build > Fix a debug build with SEASTAR_TASK_BACKTRACE > manual_clock: Add missing includes	2020-06-24 12:49:50 +03:00
Botond Dénes	be452b1f91	service: storage_proxy: log exception returned from replica with more context Currently the message only mentions the endpoint and the error message returned from the replica. Add the keyspace and table to this message to provide more context. This should help investigations of such errors greatly, as in the case of tests where there is usually a single table, we can already guess what exactly is timing out based on this. We should add even more context, like the kind of the query (single partition or range scan) but this information is not readily available in the surrounding scope so this patch defers it. Refs: #6548 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200624054647.413256-1-bdenes@scylladb.com>	2020-06-24 11:30:37 +03:00
Piotr Sarna	df91e9a4c7	alternator: clean up string view conversions Manual translation from JSON to string_view is replaced with rjson::to_string_view helper function. In one place, a redundant string_view intermediary is removed in favor of creating the string straight from JSON. Message-Id: <2aa9d9fedd73f14b7640870d14db4f2f0bd7bd8a.1592936139.git.sarna@scylladb.com>	2020-06-23 21:45:27 +03:00
Piotr Sarna	4558401aee	alternator: drop using global migration manager As part of "war on globals", the unneeded usage of global migration manager instance is dropped. Message-Id: <c9b2fab57e62185daa2441458f9a3a5e7e0a3908.1592936139.git.sarna@scylladb.com>	2020-06-23 21:43:57 +03:00
Piotr Sarna	f4e8cfe03b	alternator: fix propagating tags Updating tags was erroneously done locally, which means that the schema change was not propagated to other nodes. The new code announces new schema globally. Fixes #6513 Branches: 4.0,4.1 Tests: unit(dev) dtest(alternator_tests.AlternatorTest.test_update_condition_expression_and_write_isolation) Message-Id: <3a816c4ecc33c03af4f36e51b11f195c231e7ce1.1592935039.git.sarna@scylladb.com>	2020-06-23 21:27:55 +03:00
Botond Dénes	fbbc86e18c	repair: row_level: destroy reader on EOS or error To avoid having to make it an optional with all the additional checks, we just replace it with an empty reader instead, this also also achieves the desired effect of releasing the read permit and all the associated resources early.	2020-06-23 21:08:21 +03:00
Botond Dénes	080f00b99a	repair: row_level: use evictable_reader for local reads Row level repair, when using a local reader, is prone to deadlocking on the streaming reader concurrency semaphore. This has been observed to happen with at least two participating nodes, running more concurrent repairs than the maximum allowed amount of reads by the concurrency semaphore. In this situation, it is possible that two repair instances, competing for the last available permits on both nodes, get a permit on one of the nodes and get queued on the other one respectively. As neither will let go of the permit it already acquired, nor give up waiting on the failed-to-acquired permit, a deadlock happens. To prevent this, we make the local repair reader evictable. For this we reuse the newly exposed evictable reader. The repair reader is paused after the repair buffer is filled, which is currently 32MB, so the cost of a possible reader recreation is amortized over 32MB read. The repair reader is said to be local, when it can use the shard-local partitioner. This is the case if the participating nodes are homogenous (their shard configuration is identical), that is the repair instance has to read just from one shard. A non-local reader uses the multishard reader, which already makes its shard readers evictable and hence is not prone to the deadlock described here.	2020-06-23 21:08:21 +03:00
Botond Dénes	542d9c3711	mutation_reader: expose evictable_reader Expose functions for the outside world to create evictable readers. We expose two functions, which create an evictable reader with `auto_pause::yes` and `auto_pause::no` respectively. The function creating the latter also returns a handle in addition to the reader, which can be used to pause the reader.	2020-06-23 21:08:21 +03:00
Botond Dénes	1cc31deff9	mutation_reader: evictable_reader: add auto_pause flag Currently the evictable reader unconditionally pauses the underlying reader after each use (`fill_buffer()` or `fast_forward_to()` call). This is fine for current users (the multishard reader), but the future user we are doing all this refactoring for -- repair -- will want to control when the underlying reader is paused "manually". Both these behaviours can easily be supported in a single implementation, so we add an `auto_pause` flag to allow the creator of the evictable reader to control this.	2020-06-23 21:08:21 +03:00
Botond Dénes	af9e1c23e1	mutation_reader: make evictable_reader a flat_mutation_reader The `evictable_reader` class is almost a proper flat mutation reader already, it roughly offers the same interface. This patch makes this formal: changing the class to inherit from `flat_mutation_reader::impl`, and implement all virtual methods. This also entails a departure from using the lifecycle policy to pause/resume and create readers, instead using more general building blocks like the reader concurrency semaphore and a mutation source.	2020-06-23 21:08:21 +03:00
Rafael Ávila de Espíndola	64c8164e6c	everywhere: Update to seastar api v4 (when_all_succeed returning a tuple) We now just need to replace a few calls to then with then_unpack. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200618172100.111147-1-espindola@scylladb.com>	2020-06-23 19:40:18 +03:00
Raphael S. Carvalho	47f63d021a	sstables/sstable_directory: improve log message in reshape() We were blind about the table which needed reshape and its compaction strategy, so let's improve log message. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20200622192502.187532-4-raphaelsc@scylladb.com>	2020-06-23 19:40:18 +03:00
Raphael S. Carvalho	39f96a5572	distributed_loader: Don't mutate levels to zero when populating column family Unlike refresh on upload dir, column family population shouldn't mutate level of SSTables to level 0. Otherwise, LCS will have to regenerate all levels by rewriting the data multiple times, hurting a lot the write amplification and consequently the node performance. That's also affecting the time for a node to boot because reshape may be triggered as a result of this. Refs #6695. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20200622192502.187532-2-raphaelsc@scylladb.com>	2020-06-23 19:40:18 +03:00
Benny Halevy	2d7c39de88	storage_service: set_tables_autocompaction: fix not-initialized-yet logic Typo introduced in `bb07678346`, set_tables_autocompaction should reject too-early requests if !_initialized rather than if _initialized. Fixes a bunch of compaction dtests. For example: https://jenkins.scylladb.com/view/master/job/scylla-master/job/dtest-release/530/testReport/compaction_test/TestCompaction_with_DateTieredCompactionStrategy/disable_autocompaction_twice_test/ ``` True is not false : Expected to have autocompaction disabled but got it is enabled ``` Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Tests: - unit(dev), - compaction_test:TestCompaction_with_DateTieredCompactionStrategy.disable_autocompaction_twice_test(dev) Message-Id: <20200623151418.439534-1-bhalevy@scylladb.com>	2020-06-23 19:40:18 +03:00
Avi Kivity	c72365d862	thrift: switch csharp backend to netstd The thrift compiler (since 0.13 at least) complains that the csharp target is deprecated and recommend replacing it with netstd. Since we don't use either, humor it. I suspect that this warning caused some spurious rebuilds, but have not proven it.	2020-06-23 19:40:18 +03:00

1 2 3 4 5 ...

22570 Commits