scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-03 13:37:04 +00:00

Author	SHA1	Message	Date
Avi Kivity	756b14f309	Merge 'cql3: Drop unneeded filtering when continuous clustering-key is selected' from Dejan Mircevski I noticed that we require filtering for continuous clustering key, which is not necessary. I dropped the requirement and made sure the correct data is read from the storage proxy. The corresponding dtest PR: https://github.com/scylladb/scylla-dtest/pull/1727 Tests: unit (dev,debug), dtest (next-gating, cqlpy) Closes #7460 github.com:scylladb/scylla: cql3: Delete some newlines cql3: Drop superfluous ALLOW FILTERING cql3: Drop unneeded filtering for continuous CK	2020-11-10 17:41:00 +02:00
Botond Dénes	7f07b95dd3	utils/chunked_vector: reserve_partial(): better explain how to properly use Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20201110130953.435123-1-bdenes@scylladb.com>	2020-11-10 15:45:01 +02:00
Eliran Sinvani	8380ac93c5	build: Make artifacts product aware This commit changes the build file generation and the package creation scripts to be product aware. This will change the relocatable package archives to be named after the product, this commit deals with two main things: 1. Creating the actual Scylla server relocatable with a product prefixed name - which is independent of any other change 2. Expect all other packages to create product prefixed archive - which is dependant uppon the actual submodules creating product prefixed archives. If the support is not introduced in the submodules first this will break the package build. Tests: Scylla full build with the original product and a different product name. Closes #7581	2020-11-10 14:38:10 +02:00
Takuya ASADA	f8c7d899b4	dist/debian: fix typo for scylla-server.service filename Currently debian_files_gen.py mistakenly renames scylla-server.service to "scylla-server." on non-standard product name environment such as scylla-enterprise, it should be fix to correct filename. Fixes #7423	2020-11-10 10:38:41 +02:00
Pavel Solodovnikov	2997f6bd2e	cmake: redesign scylla's `CMakeLists.txt` to finally allow full-fledged building This patch introduces many changes to the Scylla `CMakeLists.txt` to enable building Scylla without resorting to pre-building with a previous configure.py build, i.e. cmake script can now be used as a standalone solution to build and execute scylla. Submodules, such as Seastar and Abseil, are also dealt with by importing their CMake scripts directly via `add_subdirectory` calls. Other submodules, such as `libdeflate` now have a custom command to build the library at runtime. There are still a lot of things that are incomplete, though: * Missing auxiliary packaging targets * Unit-tests are not built (First priority to address in the following patches) * Compile and link flags are mostly hardcoded to the values appropriate for the most recent Fedora 33 installation. System libraries should be found via built-in `Find` scripts, compiler and linker flags should be observed and tested by executing feature tests. The current build is aimed to be built by GCC, need to support Clang since we are moving to it. * Utility cmake functions should be moved to a separate "cmake" directory. The script is updated to use the most recent CMake version available in Fedora 33, which is 3.18. Right now this is more of a PoC rather that a full-fledged solution but as far as it's not used widely, we are free to evolve it in a relaxed manner, improving it step by step to achieve feature parity with `configure.py` solution. The value in this patch is that now we are able to use any C++ IDE capable of dealing with CMake solutions and take advantage of their built-in capabilities, such as: * Building a code model to efficiently navigate code. * Find references to symbols. * Use pretty-printers, beautifiers and other tools conveniently. * Run scylla and debug it right from the IDE. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Message-Id: <20201103221619.612294-1-pa.solodovnikov@scylladb.com>	2020-11-10 10:34:27 +02:00
Nadav Har'El	78c598e08e	alternator: add missing TableId field to DescribeTable response DescribeTable should return a UUID "TableId" in its reponse. We alread had it for CreateTable, and now this patch adds it to DescribeTable. The test for this feature is no longer xfail. Moreover, I improved the test to not only check that the TableId field is present - it should also match the documented regular expression (the standard representation of a UUID). Refs #5026 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20201104114234.363046-1-nyh@scylladb.com>	2020-11-09 20:21:47 +01:00
Benny Halevy	8bcdf39a18	hints/manager: scan_for_hints_dirs: fix use-after-move This use-after move was apprently exposed after switching to clang in commit `eb861e68e9`. The directory_entry is required for std::stoi(de.name.c_str()) and later in the catch{} clause. This shows in the node logs as a "Ignore invalid directory" debug log message with an empty name, and caused the hintedhandoff_rebalance_test to fail when hints files aren't rebalanced. Test: unit(dev) DTest: hintedhandoff_additional_test.py:TestHintedHandoff.hintedhandoff_rebalance_test (dev, debug) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20201106172017.823577-1-bhalevy@scylladb.com>	2020-11-09 16:32:54 +01:00
Takuya ASADA	4410934829	install.sh: show warning nonroot mode when systemd does not support user mode On older distribution such as CentOS7, it does not support systemd user mode. On such distribution nonroot mode does not work, show warning message and skip running systemctl --user. Fixes #7071	2020-11-09 12:16:35 +02:00
Piotr Wojtczak	72c7f25a29	db: add TransitionalAuthorizer and TransitionalAuthenticator... ... to config descriptions We allow setting the transitional auth as one of the options in scylla.yaml, but don't mention it at all in the field's description. Let's change that. Closes #7565	2020-11-09 10:51:54 +01:00
Gleb Natapov	a01dd636ea	suppress ubsan error in boost::deque::clear() The function is used by raft and fails with ubsan and clang. The ub is harmless. Lets wait for it to be fixed in boost. Message-Id: <20201109090353.GZ3722852@scylladb.com>	2020-11-09 11:25:19 +02:00
Bentsi Magidovich	956b97b2a8	scylla_util.py: fix exception handling in curl Retry mechanism didn't work when URLError happend. For example: urllib.error.URLError: <urlopen error [Errno 101] Network is unreachable> Let's catch URLError instead of HTTP since URLError is a base exception for all exceptions in the urllib module. Fixes: #7569 Closes #7567	2020-11-09 10:20:35 +02:00
Benny Halevy	02f5659f21	sstables mx/writer: clustering_blocks_input_range::next: warn on potentially bad key If _offset falls beyond compound_type->types().size() ignore the extra components instead of accessing out of the types vector range. FIXME: we should validate the thrift key against the schema and reject it in the thrift handler layer. Refs #7568 Test: unit(dev) DTest: cql_tests.py:MiscellaneousCQLTester.cql3_insert_thrift_test (dev, debug) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20201108175738.1006817-1-bhalevy@scylladb.com>	2020-11-08 20:53:14 +02:00
Avi Kivity	6b4a7fa515	Revert "Revert "config: Do not enable repair based node operations by default"" This reverts commit `71d0d58f8c`. Repair based node operations are still not ready and will be re-enabled after more testing and fixes.	2020-11-08 14:09:50 +02:00
Michał Chojnowski	1eb19976b9	database: make changes to durable_writes effective immediately Users can change `durable_writes` anytime with ALTER KEYSPACE. Cassandra reads the value of `durable_writes` every time when applying a mutation, so changes to that setting take effect immediately. That is, mutations are added to the commitlog only when `durable_writes` is `true` at the moment of their application. Scylla reads the value of `durable_writes` only at `keyspace` construction time, so changes to that setting take effect only after Scylla is restarted. This patch fixes the inconsistency. Fixes #3034 Closes #7533	2020-11-06 17:53:22 +01:00
Tomasz Grabiec	894abfa6fc	Merge "raft: miscellaneous fixes" from Kostja This series provides assorted fixes which are a pre-requisite for the joint consensus implementation series which follows. * scylla-dev/raft-misc: raft: fix raft_fsm_test flakiness raft: drop a waiter of snapshoted entry raft: use correct type for node info in add_server() raft: overload operator<< for debugging	2020-11-06 15:34:16 +01:00
Konstantin Osipov	c4bbbac975	raft: fix raft_fsm_test flakiness When election_threshold expires, the current node can become a candidate, in which case it won't switch back to follower state upon vote_request.	2020-11-06 17:06:07 +03:00
Gleb Natapov	552745d3d3	raft: drop a waiter of snapshoted entry An index that is waited can be included in an installed snapshot in which case there is no way to know if the entry was committed or not. Abort such waiters with an appropriate error.	2020-11-06 17:06:07 +03:00
Gleb Natapov	8bab38c6fa	raft: use correct type for node info in add_server()	2020-11-06 17:06:07 +03:00
Alejo Sanchez	2e4977b24c	raft: overload operator<< for debugging Overload operator<< for ostream and print relevant state for server, fsm, log, and typed_uint64 types. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2020-11-06 17:06:07 +03:00
Tomasz Grabiec	3591e7dffd	Merge "Remove unused args from range_tombstone methods" from Pavel Emelyanov * https://github.com/xemul/scylla/tree/br-range-tombstone-unused-args-2: range_tombstone: Remove unused trim-front arg from .apply() range_tombstone: Undefault argument in .apply range_tombstone: Remove unused schema arg from .set_start	2020-11-06 15:04:15 +01:00
Tomasz Grabiec	6d0d55aa72	Merge "Unglobal query processor instance" from Pavel Emelyanov The query processor is present in the global namespace and is widely accessed with global get(_local)?_query_processor(). There's a long-term task to get rid of this globality and make services and componenets reference each-other and, for and due-to this, start and stop in specific order. This set makes this for the query processor. The remaining users of it are -- alternator, controllers for client services, schema_tables and sys_dist_ks. All of them except for the schema_tables are fixed just by passing the reference on query processor with small patches. The schema tables accessing qp sit deep inside the paxos code, but can be "fixed" with the qctx thing until the qctx itself is de-globalized. * https://github.com/xemul/scylla/tree/br-rip-global-query-processor: code: RIP global query processor instance cql test env: Keep query processor reference on board system distributed keyspace: Start sharded service erarlier schema_tables: Use qctx to make internal requests transport: Keep sharded query processor reference on controller thrift: Keep sharded query processor reference on controller alternator: Use local query processor reference to get keys alternator: Keep local query processor reference in server	2020-11-06 14:24:41 +01:00
Pavel Emelyanov	bbd7463960	range_tombstone: Remove unused trim-front arg from .apply() The only caller of this method always passes true to it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-11-06 15:13:05 +03:00
Pavel Emelyanov	787a496caf	range_tombstone: Undefault argument in .apply The only purpose of this change is to compile (git-bisect safety) and thus prove that the next patch is correct. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-11-06 15:13:05 +03:00
Pavel Emelyanov	3da3d448c8	range_tombstone: Remove unused schema arg from .set_start Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-11-06 15:13:05 +03:00
Piotr Sarna	b61d4bc8d0	db: degrade view building progress loading error to warning When the view builder cannot read view building progress from an internal CQL table it produces an error message, but that only confuses the user and the test suite -- this situation is entirely recoverable, because the builder simply assumes that there is no progress and the view building should start from scratch. Fixes #7527 Closes #7558	2020-11-06 10:19:11 +02:00
Avi Kivity	512daa75a6	Merge 'repair: Use single writer for all followers' from Asias He repair: Use single writer for all followers Currently, repair master create one writer for each follower to write rows from follower to sstables. That are RF - 1 writers in total. Each writer creates 1 sstable for the range repaired, usually a vnode range. Those sstables for a given vnode range are disjoint. To reduce the compaction work, we can create one writer for all the followers. This reduces the number of sstables generated by repair significantly to one per vnode range from RF - 1 per vnode range. Fixes #7525 Closes #7528 * github.com:scylladb/scylla: repair: No more vector for _writer_done and friends repair: Use single writer for all followers	2020-11-05 18:45:07 +01:00
Gleb Natapov	e1442282d1	raft: test: do not store data in initializer_list Lifetime rules for initializer_list is weird. Use vector instead. Message-Id: <20201105111309.GT3722852@scylladb.com>	2020-11-05 18:44:50 +01:00
Michał Chojnowski	f6c33f5775	dbuild: export $HOME seen by dbuild, not by $tool The default of DBUILD_TOOL=docker requires passwordless access to docker by the user of dbuild. This is insecure, as any user with unconstrained access to docker is root equivalent. Therefore, users might prefer to run docker as root (e.g. by setting DBUILD_TOOL="sudo docker"). However, `$tool -e HOME` exports HOME as seen by $tool. This breaks dbuild when `$tool` runs docker as a another user. `$tool -e HOME="$HOME"` exports HOME as seen by dbuild, which is the intended behaviour. Closes #7555	2020-11-05 18:44:50 +01:00
Michał Chojnowski	8f74c7e162	dbuild: Replace stray use of `docker` with `$tool` Instead of invoking `$tool`, as is done everywhere else in dbuild, kill_it() invoked `docker` explicitly. This was slightly breaking the script for DBUILD_TOOL other than `docker`. Closes #7554	2020-11-05 18:44:49 +01:00
Tomasz Grabiec	fb9b5cae05	sstables: ka/la: Fix abort when next_partition() is called with certain reader state Cleanup compaction is using consume_pausable_in_thread() to skip over disowned partitions, which uses flat_mutation_reader::next_partition(). The implementation of next_partition() for the sstable reader has a bug which may cause the following assertion failure: scylla: sstables/mp_row_consumer.hh:422: row_consumer::proceed sstables::mp_row_consumer_k_l::flush(): Assertion `!_ready' failed. This happens when the sstable reader's buffer gets full when we reach the partition end. The last fragment of the partition won't be pushed into the buffer but will stay in the _ready variable. When next_partition() is called in this state, _ready will not be cleared and the fragment will be carried over to the next partition. This will cause assertion failure when the reader attempts to emit the first fragment of the next partition. The fix is to clear _ready when entering a partition, just like we clear _range_tombstones there. Fixes #7553. Message-Id: <1604534702-12777-1-git-send-email-tgrabiec@scylladb.com>	2020-11-05 18:44:49 +01:00
Nadav Har'El	7ff72b0ba5	Merge 'secondary_index: fix returned rows token ordering' from Piotr Grabowski Fixes returned rows ordering to proper signed token ordering. Before this change, rows were sorted by token, but using unsigned comparison, meaning that negative tokens appeared after positive tokens. Rename `token_column_computation` to `legacy_token_column_computation` and add some comments describing this computation. Added (new) `token_column_computation` which returns token as `long_type`, which is sorted using signed comparison - the correct ordering of tokens. Add new `correct_idx_token_in_secondary_index` feature, which flags that the whole cluster is able to use new `token_column_computation`. Switch token computation in secondary indexes to (new) `token_column_computation`, which fixes the ordering. This column computation type is only set if cluster supports `correct_idx_token_in_secondary_index` feature to make sure that all nodes will be able to compute new `token_column_computation`. Also old indexes will need to be rebuilt to take advantage of this fix, as new token column computation type is only set for new indexes. Fix tests according to new token ordering and add one new test to validate this aspect explicitly. Fixes #7443 Tested manually a scenario when someone created an index on old version of Scylla and then migrated to new Scylla. Old index continued to work properly (but returning in wrong order). Upon dropping and re-creating the index, it still returned the same data, but now in correct order. Closes #7534 * github.com:scylladb/scylla: tests: add token ordering test of indexed selects tests: fix tests according to new token ordering secondary_index: use new token_column_computation feature: add correct_idx_token_in_secondary_index column_computation: add token_column_computation token_column_computation: rename as legacy	2020-11-05 18:44:49 +01:00
Benny Halevy	f93fb55726	repair: repair_writer: do not capture lw_shared_ptr cross-shard The shared_from_this lw_shared_ptr must not be accessed across shards. Capturing it in the lambda passed to mutation_writer::distribute_reader_and_consume_on_shards causes exactly that since the captured lw_shared_ptr is copied on other shards, and ends up in memory corruption as seen in #7535 (probably due to lw_shared_ptr._count going out-of-sync when incremented/decremented in parallel on other shards with no synchronization. This was introduced in `289a08072a`. The writer is not needed in the body of this lambda anyways so it doesn't need to capture it. It is already held by the continuations until the end of the chain. Fixes #7535 Test: repair_additional_test:RepairAdditionalTest.repair_disjoint_row_3nodes_diff_shard_count_test (dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20201104142216.125249-1-bhalevy@scylladb.com>	2020-11-05 18:44:49 +01:00
Tomasz Grabiec	dccd47eec6	Merge "make raft clang compatible" from Gleb " Since we are switching to clang due to raft make it actually compile with clang. " tgrabiec: Dropped the patch "raft: compile raft by default" because the replication_test still fails in debug mode: /usr/include/boost/container/deque.hpp:1802:63: runtime error: applying non-zero offset 8 to null pointer * 'raft-clang-v2' of github.com:scylladb/scylla-dev: raft: Use different type to create type dependent statement for static assertion raft: drop use of <ranges> for clang raft: make test compile with clang raft: drop -fcoroutines support from configure.py	2020-11-05 18:42:31 +01:00
Asias He	db28efb28a	repair: No more vector for _writer_done and friends Now that both repair followers and repair master use a single writer. We can get rid of the vector associated with _writer_done and friends. Fixes #7525	2020-11-05 13:28:40 +08:00
Asias He	998b153f86	repair: Use single writer for all followers Currently, repair master create one writer for each follower to write rows from follower to sstables. That are RF - 1 writers in total. Each writer creates 1 sstable for the range repaired, usually a vnode range. Those sstables for a given vnode range are disjoint. To reduce the compaction work, we can create one writer for all the followers. This reduces the number of sstables generated by repair significantly to one per vnode range from RF - 1 per vnode range. Fixes #7525	2020-11-05 13:28:40 +08:00
Pekka Enberg	edf04cd348	Update tools/python3 submodule * tools/python3 cfa27b3...1763a1a (1): > Relocatable Package: create product prefixed relocatable archive	2020-11-04 14:24:20 +02:00
Pekka Enberg	5519ce2f0e	Update tools/jmx submodule * tools/jmx c51906e...6174a47 (2): > Relocatable Package: create product prefixed relocatable archive > build(deps-dev): bump junit from 4.8.2 to 4.13.1	2020-11-04 14:24:15 +02:00
Avi Kivity	193d1942f2	build: silence gcc ABI interoperability warning on arm A gcc bug [1] caused objects built by different versions of gcc not to interoperate. Gcc helpfully warns when it encounters code that could be affected. Since we build everything with one version, and as that versions is far newer than the last version generating incorrect code, we can silence that warning without issue. [1] https://gcc.gnu.org/bugzilla/show_bug.cgi?id=77728 Closes #7495	2020-11-04 13:29:51 +02:00
Tomasz Grabiec	a7837a9a3b	Merge "Enable raft tests" from Kostja Do not run tests which are not built. For that, pass the test list from configure.py to test.py via ninja unit_test_list target. Minor cleanups. * scylla-dev.git/test.py-list: test: enable raft tests test.py: do not run tests which are not built configure.py: add a ninja command to print unit test list test.py: handle ninja mode_list failure configure.py: don't pass modes_list unless it's used	2020-11-04 12:25:04 +01:00
Piotr Grabowski	491987016c	tests: add token ordering test of indexed selects Add new test validating that rows returned from both non-indexed selects and indexed selects return rows sorted in token order (making sure that both positive and negative tokens are present to test if signed comparison order is maintained).	2020-11-04 12:02:42 +01:00
Piotr Grabowski	2bd23fbfa9	tests: fix tests according to new token ordering Fix tests to adhere to new (correct) token ordering of rows when querying tables with secondary indexes.	2020-11-04 12:02:42 +01:00
Piotr Grabowski	2342b386f4	secondary_index: use new token_column_computation Switches token column computation to (new) token_column_computation, which fixes #7443, because new token column will be compared using signed comparisons, not the previous unsigned comparison of CQL bytes type. This column computation type is only set if cluster supports correct_idx_token_in_secondary_index feature to make sure that all nodes will be able to compute (new) token_column_computation. Also old indexes will need to be rebuilt to take advantage of this fix, as new token column computation type is only set for new indexes.	2020-11-04 12:02:42 +01:00
Piotr Grabowski	6624d933c9	feature: add correct_idx_token_in_secondary_index Add new correct_idx_token_in_secondary_index feature, which will be used to determine if all nodes in the cluster support new token_column_computation. This column computation will replace legacy_token_column_computation in secondary indexes, which was incorrect as this column computation produced values that when compared with unsigned comparison (CQL type bytes comparison) resulted in different ordering than token signed comparison. See issue: https://github.com/scylladb/scylla/issues/7443	2020-11-04 12:02:42 +01:00
Piotr Grabowski	9fc2dc59b8	column_computation: add token_column_computation Introduce new token_column_computation class which is intended to replace legacy_token_column_computation. The new column computation returns token as long_type, which means that it will be ordered according to signed comparison (not unsigned comparison of bytes), which is the correct ordering of tokens.	2020-11-04 12:02:42 +01:00
Piotr Grabowski	b1350af951	token_column_computation: rename as legacy Raname token_column_computation to legacy_token_column_computation, as it will be replaced with new column_computation. The reason is that this computation returns bytes, but all tokens in Scylla can now be represented by int64_t. Moreover, returning bytes causes invalid token ordering as bytes comparison is done in unsigned way (not signed as int64_t). See issue: https://github.com/scylladb/scylla/issues/7443	2020-11-04 12:00:18 +01:00
Eliran Sinvani	4c434f3fa4	moving avarage rate: Keep computed rates in zero until they are meaningful When computing moving average rates too early after startup, the rate can be infinite, this is simply because the sample interval since the system started is too small to generate meaningful results. Here we check for this situation and keep the rate at 0 if it happens to signal that there are still no meaningful results. This incident is unlikely to happen since it can happen only during a very small time window after restart, so we add a hint to the compiler to optimize for that in order to have a minimum impact on the normal usecase. Fixes #4469	2020-11-04 11:13:59 +02:00
Avi Kivity	8aa842614a	test: gossip_test: configure database memory allocation correctly The memory configuration for the database object was left at zero. This can cause the following chain of failures: - the test is a little slow due to the machine being overloaded, and debug mode - this causes the memtable flush_controller timer to fire before the test completes - the backlog computation callback is called - this calculates the backlog as dirty_memory / total_memory; this is 0.0/0.0, which resolves to NaN - eventually this gets converted to an integer - UBSAN dooesn't like the convertion from NaN to integer, and complains Fix by initializing dbcfg.available_memory. Test: gossip_test(debug), 1000 repetitions with concurrency 6 Closes #7544	2020-11-04 09:26:08 +02:00
Calle Wilund	1db9da2353	alternator::streams: Workaround fix for apparent code gen bug in seq_number Fixes #7325 When building with clang on fedora32, calling the string_view constructor of bignum generates broken ID:s (i.e. parsing borks). Creating a temp std::string fixes it. Closes #7542	2020-11-04 09:26:08 +02:00
Benny Halevy	1d199c31f8	storage_service: check_for_endpoint_collision: copy gossip state across preemeption point Since `11a8912093`, get_gossip_status returns a std::string_view rather than a sstring. As seen in dtest we may print garbage to the log if we print the string_view after preemption (calling _gossiper.reset_endpoint_state_map().get()) Test: update_cluster_layout_tests:TestUpdateClusterLayout.simple_add_two_nodes_in_parallel_test (dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20201103132720.559168-1-bhalevy@scylladb.com>	2020-11-04 09:26:08 +02:00
Konstantin Osipov	507ca98748	test: enable raft tests It's safe to do this since now the tests are only run if they are configured.	2020-11-03 21:30:11 +03:00

1 2 3 4 5 ...

24185 Commits