scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-29 20:57:00 +00:00

Author	SHA1	Message	Date
Pavel Emelyanov	3da3d448c8	range_tombstone: Remove unused schema arg from .set_start Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-11-06 15:13:05 +03:00
Piotr Sarna	b61d4bc8d0	db: degrade view building progress loading error to warning When the view builder cannot read view building progress from an internal CQL table it produces an error message, but that only confuses the user and the test suite -- this situation is entirely recoverable, because the builder simply assumes that there is no progress and the view building should start from scratch. Fixes #7527 Closes #7558	2020-11-06 10:19:11 +02:00
Avi Kivity	512daa75a6	Merge 'repair: Use single writer for all followers' from Asias He repair: Use single writer for all followers Currently, repair master create one writer for each follower to write rows from follower to sstables. That are RF - 1 writers in total. Each writer creates 1 sstable for the range repaired, usually a vnode range. Those sstables for a given vnode range are disjoint. To reduce the compaction work, we can create one writer for all the followers. This reduces the number of sstables generated by repair significantly to one per vnode range from RF - 1 per vnode range. Fixes #7525 Closes #7528 * github.com:scylladb/scylla: repair: No more vector for _writer_done and friends repair: Use single writer for all followers	2020-11-05 18:45:07 +01:00
Gleb Natapov	e1442282d1	raft: test: do not store data in initializer_list Lifetime rules for initializer_list is weird. Use vector instead. Message-Id: <20201105111309.GT3722852@scylladb.com>	2020-11-05 18:44:50 +01:00
Michał Chojnowski	f6c33f5775	dbuild: export $HOME seen by dbuild, not by $tool The default of DBUILD_TOOL=docker requires passwordless access to docker by the user of dbuild. This is insecure, as any user with unconstrained access to docker is root equivalent. Therefore, users might prefer to run docker as root (e.g. by setting DBUILD_TOOL="sudo docker"). However, `$tool -e HOME` exports HOME as seen by $tool. This breaks dbuild when `$tool` runs docker as a another user. `$tool -e HOME="$HOME"` exports HOME as seen by dbuild, which is the intended behaviour. Closes #7555	2020-11-05 18:44:50 +01:00
Michał Chojnowski	8f74c7e162	dbuild: Replace stray use of `docker` with `$tool` Instead of invoking `$tool`, as is done everywhere else in dbuild, kill_it() invoked `docker` explicitly. This was slightly breaking the script for DBUILD_TOOL other than `docker`. Closes #7554	2020-11-05 18:44:49 +01:00
Tomasz Grabiec	fb9b5cae05	sstables: ka/la: Fix abort when next_partition() is called with certain reader state Cleanup compaction is using consume_pausable_in_thread() to skip over disowned partitions, which uses flat_mutation_reader::next_partition(). The implementation of next_partition() for the sstable reader has a bug which may cause the following assertion failure: scylla: sstables/mp_row_consumer.hh:422: row_consumer::proceed sstables::mp_row_consumer_k_l::flush(): Assertion `!_ready' failed. This happens when the sstable reader's buffer gets full when we reach the partition end. The last fragment of the partition won't be pushed into the buffer but will stay in the _ready variable. When next_partition() is called in this state, _ready will not be cleared and the fragment will be carried over to the next partition. This will cause assertion failure when the reader attempts to emit the first fragment of the next partition. The fix is to clear _ready when entering a partition, just like we clear _range_tombstones there. Fixes #7553. Message-Id: <1604534702-12777-1-git-send-email-tgrabiec@scylladb.com>	2020-11-05 18:44:49 +01:00
Nadav Har'El	7ff72b0ba5	Merge 'secondary_index: fix returned rows token ordering' from Piotr Grabowski Fixes returned rows ordering to proper signed token ordering. Before this change, rows were sorted by token, but using unsigned comparison, meaning that negative tokens appeared after positive tokens. Rename `token_column_computation` to `legacy_token_column_computation` and add some comments describing this computation. Added (new) `token_column_computation` which returns token as `long_type`, which is sorted using signed comparison - the correct ordering of tokens. Add new `correct_idx_token_in_secondary_index` feature, which flags that the whole cluster is able to use new `token_column_computation`. Switch token computation in secondary indexes to (new) `token_column_computation`, which fixes the ordering. This column computation type is only set if cluster supports `correct_idx_token_in_secondary_index` feature to make sure that all nodes will be able to compute new `token_column_computation`. Also old indexes will need to be rebuilt to take advantage of this fix, as new token column computation type is only set for new indexes. Fix tests according to new token ordering and add one new test to validate this aspect explicitly. Fixes #7443 Tested manually a scenario when someone created an index on old version of Scylla and then migrated to new Scylla. Old index continued to work properly (but returning in wrong order). Upon dropping and re-creating the index, it still returned the same data, but now in correct order. Closes #7534 * github.com:scylladb/scylla: tests: add token ordering test of indexed selects tests: fix tests according to new token ordering secondary_index: use new token_column_computation feature: add correct_idx_token_in_secondary_index column_computation: add token_column_computation token_column_computation: rename as legacy	2020-11-05 18:44:49 +01:00
Benny Halevy	f93fb55726	repair: repair_writer: do not capture lw_shared_ptr cross-shard The shared_from_this lw_shared_ptr must not be accessed across shards. Capturing it in the lambda passed to mutation_writer::distribute_reader_and_consume_on_shards causes exactly that since the captured lw_shared_ptr is copied on other shards, and ends up in memory corruption as seen in #7535 (probably due to lw_shared_ptr._count going out-of-sync when incremented/decremented in parallel on other shards with no synchronization. This was introduced in `289a08072a`. The writer is not needed in the body of this lambda anyways so it doesn't need to capture it. It is already held by the continuations until the end of the chain. Fixes #7535 Test: repair_additional_test:RepairAdditionalTest.repair_disjoint_row_3nodes_diff_shard_count_test (dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20201104142216.125249-1-bhalevy@scylladb.com>	2020-11-05 18:44:49 +01:00
Tomasz Grabiec	dccd47eec6	Merge "make raft clang compatible" from Gleb " Since we are switching to clang due to raft make it actually compile with clang. " tgrabiec: Dropped the patch "raft: compile raft by default" because the replication_test still fails in debug mode: /usr/include/boost/container/deque.hpp:1802:63: runtime error: applying non-zero offset 8 to null pointer * 'raft-clang-v2' of github.com:scylladb/scylla-dev: raft: Use different type to create type dependent statement for static assertion raft: drop use of <ranges> for clang raft: make test compile with clang raft: drop -fcoroutines support from configure.py	2020-11-05 18:42:31 +01:00
Asias He	db28efb28a	repair: No more vector for _writer_done and friends Now that both repair followers and repair master use a single writer. We can get rid of the vector associated with _writer_done and friends. Fixes #7525	2020-11-05 13:28:40 +08:00
Asias He	998b153f86	repair: Use single writer for all followers Currently, repair master create one writer for each follower to write rows from follower to sstables. That are RF - 1 writers in total. Each writer creates 1 sstable for the range repaired, usually a vnode range. Those sstables for a given vnode range are disjoint. To reduce the compaction work, we can create one writer for all the followers. This reduces the number of sstables generated by repair significantly to one per vnode range from RF - 1 per vnode range. Fixes #7525	2020-11-05 13:28:40 +08:00
Pekka Enberg	edf04cd348	Update tools/python3 submodule * tools/python3 cfa27b3...1763a1a (1): > Relocatable Package: create product prefixed relocatable archive	2020-11-04 14:24:20 +02:00
Pekka Enberg	5519ce2f0e	Update tools/jmx submodule * tools/jmx c51906e...6174a47 (2): > Relocatable Package: create product prefixed relocatable archive > build(deps-dev): bump junit from 4.8.2 to 4.13.1	2020-11-04 14:24:15 +02:00
Avi Kivity	193d1942f2	build: silence gcc ABI interoperability warning on arm A gcc bug [1] caused objects built by different versions of gcc not to interoperate. Gcc helpfully warns when it encounters code that could be affected. Since we build everything with one version, and as that versions is far newer than the last version generating incorrect code, we can silence that warning without issue. [1] https://gcc.gnu.org/bugzilla/show_bug.cgi?id=77728 Closes #7495	2020-11-04 13:29:51 +02:00
Tomasz Grabiec	a7837a9a3b	Merge "Enable raft tests" from Kostja Do not run tests which are not built. For that, pass the test list from configure.py to test.py via ninja unit_test_list target. Minor cleanups. * scylla-dev.git/test.py-list: test: enable raft tests test.py: do not run tests which are not built configure.py: add a ninja command to print unit test list test.py: handle ninja mode_list failure configure.py: don't pass modes_list unless it's used	2020-11-04 12:25:04 +01:00
Piotr Grabowski	491987016c	tests: add token ordering test of indexed selects Add new test validating that rows returned from both non-indexed selects and indexed selects return rows sorted in token order (making sure that both positive and negative tokens are present to test if signed comparison order is maintained).	2020-11-04 12:02:42 +01:00
Piotr Grabowski	2bd23fbfa9	tests: fix tests according to new token ordering Fix tests to adhere to new (correct) token ordering of rows when querying tables with secondary indexes.	2020-11-04 12:02:42 +01:00
Piotr Grabowski	2342b386f4	secondary_index: use new token_column_computation Switches token column computation to (new) token_column_computation, which fixes #7443, because new token column will be compared using signed comparisons, not the previous unsigned comparison of CQL bytes type. This column computation type is only set if cluster supports correct_idx_token_in_secondary_index feature to make sure that all nodes will be able to compute (new) token_column_computation. Also old indexes will need to be rebuilt to take advantage of this fix, as new token column computation type is only set for new indexes.	2020-11-04 12:02:42 +01:00
Piotr Grabowski	6624d933c9	feature: add correct_idx_token_in_secondary_index Add new correct_idx_token_in_secondary_index feature, which will be used to determine if all nodes in the cluster support new token_column_computation. This column computation will replace legacy_token_column_computation in secondary indexes, which was incorrect as this column computation produced values that when compared with unsigned comparison (CQL type bytes comparison) resulted in different ordering than token signed comparison. See issue: https://github.com/scylladb/scylla/issues/7443	2020-11-04 12:02:42 +01:00
Piotr Grabowski	9fc2dc59b8	column_computation: add token_column_computation Introduce new token_column_computation class which is intended to replace legacy_token_column_computation. The new column computation returns token as long_type, which means that it will be ordered according to signed comparison (not unsigned comparison of bytes), which is the correct ordering of tokens.	2020-11-04 12:02:42 +01:00
Piotr Grabowski	b1350af951	token_column_computation: rename as legacy Raname token_column_computation to legacy_token_column_computation, as it will be replaced with new column_computation. The reason is that this computation returns bytes, but all tokens in Scylla can now be represented by int64_t. Moreover, returning bytes causes invalid token ordering as bytes comparison is done in unsigned way (not signed as int64_t). See issue: https://github.com/scylladb/scylla/issues/7443	2020-11-04 12:00:18 +01:00
Eliran Sinvani	4c434f3fa4	moving avarage rate: Keep computed rates in zero until they are meaningful When computing moving average rates too early after startup, the rate can be infinite, this is simply because the sample interval since the system started is too small to generate meaningful results. Here we check for this situation and keep the rate at 0 if it happens to signal that there are still no meaningful results. This incident is unlikely to happen since it can happen only during a very small time window after restart, so we add a hint to the compiler to optimize for that in order to have a minimum impact on the normal usecase. Fixes #4469	2020-11-04 11:13:59 +02:00
Avi Kivity	8aa842614a	test: gossip_test: configure database memory allocation correctly The memory configuration for the database object was left at zero. This can cause the following chain of failures: - the test is a little slow due to the machine being overloaded, and debug mode - this causes the memtable flush_controller timer to fire before the test completes - the backlog computation callback is called - this calculates the backlog as dirty_memory / total_memory; this is 0.0/0.0, which resolves to NaN - eventually this gets converted to an integer - UBSAN dooesn't like the convertion from NaN to integer, and complains Fix by initializing dbcfg.available_memory. Test: gossip_test(debug), 1000 repetitions with concurrency 6 Closes #7544	2020-11-04 09:26:08 +02:00
Calle Wilund	1db9da2353	alternator::streams: Workaround fix for apparent code gen bug in seq_number Fixes #7325 When building with clang on fedora32, calling the string_view constructor of bignum generates broken ID:s (i.e. parsing borks). Creating a temp std::string fixes it. Closes #7542	2020-11-04 09:26:08 +02:00
Benny Halevy	1d199c31f8	storage_service: check_for_endpoint_collision: copy gossip state across preemeption point Since `11a8912093`, get_gossip_status returns a std::string_view rather than a sstring. As seen in dtest we may print garbage to the log if we print the string_view after preemption (calling _gossiper.reset_endpoint_state_map().get()) Test: update_cluster_layout_tests:TestUpdateClusterLayout.simple_add_two_nodes_in_parallel_test (dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20201103132720.559168-1-bhalevy@scylladb.com>	2020-11-04 09:26:08 +02:00
Konstantin Osipov	507ca98748	test: enable raft tests It's safe to do this since now the tests are only run if they are configured.	2020-11-03 21:30:11 +03:00
Konstantin Osipov	5f90582362	test.py: do not run tests which are not built Use ninja unit_test_list to find out the list of configured tests. If a test is not configured by configure.py, do not try to run it.	2020-11-03 21:30:08 +03:00
Konstantin Osipov	9198e38311	configure.py: add a ninja command to print unit test list test.py needs this list to avoid running tests which are not configured, and hence not built.	2020-11-03 21:27:45 +03:00
Konstantin Osipov	ef9c63a6d9	test.py: handle ninja mode_list failure Print an error message if the subcommand fails. Use a regular expression to match output.	2020-11-03 21:06:17 +03:00
Konstantin Osipov	7fa08496b0	configure.py: don't pass modes_list unless it's used Don't redefine modes_list if it's not used by the ninja file formatter.	2020-11-03 21:02:55 +03:00
Benny Halevy	9d91d38502	SCYLLA-VERSION-GEN: change master version to 4.4.dev Now that scylla-ccm and scylla-dtest conform to PEP-440 version comparison (See https://www.python.org/dev/peps/pep-0440/) we can safely change scylla version on master to be the development branch for the next release. The version order logic is: 4.3.dev is followed by 4.3.rc[i] followed by 4.3.[n] Note that also according to https://blog.jasonantman.com/2014/07/how-yum-and-rpm-compare-versions/ 4.3.dev < 4.3.rc[i] < 4.3.[n] as "dev" < "rc" by alphabetical order and both "dev" and "rc*" < any number, based on the general rule that alphabetical strings compare as less than numbers. Refs scylladb/scylla-machine-image#79 Test: unit Dtest: gating Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20201015151153.726637-1-bhalevy@scylladb.com>	2020-11-03 13:42:54 +02:00
Avi Kivity	25e6a9e493	Merge "utils/large_bitset: reserve memory for _storage gently" from Botond " Introduce a gentle (yielding) implementation of reserve for chunked vector and use it when reserving the backing storage vector for large bitset. Large bitset is used by bloom filters, which can be quite large and have been observed to cause stalls when allocating memory for the storage. Fixes: #6974 Tests: unit(dev) " * 'gentle-reserve/v1' of https://github.com/denesb/scylla: utils/large_bitset: use reserve_partial() to reserve _storage utils/chunked_vector: add reserve_partial()	2020-11-03 13:42:54 +02:00
Tomasz Grabiec	5abddc8568	Merge "Testing performance of different collections" from Pavel Emelyanov There's a perf_bptree test that compares B+ tree collection with std::set and std::map ones. There will come more, also the "patterns" to compare are not just "fill with keys" and "drain to empty", so here's the perf_collection test, that measures timings of - fill with keys - drain key by key - empty with .clear() call - full scan with iterator - insert-and-remove of a single element for currently used collections - std::set - std::map - intrusive_set_external_comparator - bplus::tree * https://github.com/xemul/scylla/tree/br-perf-collection-test: test: Generalize perf_bptree into perf_collection perf_collection: Clear collection between itartions perf_collection: Add intrusive_set_external_comparator perf_collection: Add test for single element insertion perf_collection: Add test for destruction with .clear() perf_collection: Add test for full scan time	2020-11-03 13:42:54 +02:00
Gleb Natapov	88a1274583	raft: Use different type to create type dependent statement for static assertion For some reason the one that woks for gcc does not work for clang.	2020-11-03 08:49:54 +02:00
Gleb Natapov	b6b51bf17e	raft: drop use of <ranges> for clang	2020-11-03 08:49:54 +02:00
Gleb Natapov	847400ee96	raft: make test compile with clang clang does not allow to return a future<> with co_return and it is more strict with type conversion.	2020-11-03 08:49:54 +02:00
Gleb Natapov	ff18072de8	raft: drop -fcoroutines support from configure.py We switched to clang and it does not have this flag.	2020-11-03 08:49:54 +02:00
Botond Dénes	a08b640fa7	utils/large_bitset: use reserve_partial() to reserve _storage To avoid stalls when reserving memory for a large bloom filter. The filter creation already has a yielding loop for initialization, this patch extends it to reservation of memory too.	2020-11-02 18:03:19 +02:00
Botond Dénes	bb908b1750	utils/chunked_vector: add reserve_partial() A variant of reserve() which allows gentle reserving of memory. This variant will allocate just one chunk at a time. To drive it to completion, one should call it repeatedly with the return value of the previous call, until it returns 0. This variant will be used in the next patch by the large bitset creation code, to avoid stalls when allocating large bloom filters (which are backed by large bitset).	2020-11-02 18:02:01 +02:00
Piotr Wojtczak	caa3c471c0	Validate ascii values when creating from CQL Although the code for it existed already, the validation function hasn't been invoked properly. This change fixes that, adding a validating check when converting from text to specific value type and throwing a marshal exception if some characters are not ASCII. Fixes #5421 Closes #7532	2020-11-02 16:47:32 +02:00
Pavel Emelyanov	364ddab148	test: Do not dump test log onto terminal When unit tests fail the test.py dump their output on the screen. This is impossible to read this output from the terminal, all the more so the logs are anyway saved in the testlog/ directory. At the same time the names of the failed tests are all left _before_ these logs, and if the terminal history is not large enough, it becomes quite annoying to find the names out. The proposal is not to spoil the terminal with raw logs -- just names and summaries. Logs themselves are at testlog/$mode/$name_of_the_test.log Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20201031154518.22257-1-xemul@scylladb.com>	2020-11-02 15:42:34 +02:00
Tomasz Grabiec	ba42e7fcc5	multishard_mutation_query: Propagate mutation_reader::forwarding flag Otherwise all readers will be created with the default forwarding::yes. This inhibits some optimizations (e.g. results in more sstable read-ahead). It will also be problematic when we introduce mutation sources which don't support forwarding::yes in the future. Message-Id: <1604065206-3034-1-git-send-email-tgrabiec@scylladb.com>	2020-11-02 15:24:36 +02:00
Avi Kivity	eb861e68e9	build: switch to clang as the default compiler Clang brings us working support for coroutines, which are needed for Raft and for code simplification. perf_simple_query as well as full system tests show no significant performance regression. Test: unit(dev, release, debug) Closes #7531	2020-11-02 14:18:13 +02:00
Nadav Har'El	ffbd487c86	Merge 'alternator::streams: Use end-of-record info in get_records' from Calle Wilund Fixes #7496 Since cdc log now has an end-of-batch/record marker that tells us explicitly that we've read the last row of a change, we can use this instead of timestamp checks + limit extra to ensure we have complete records. Note that this does not try to fulfill user query limit exact. To do this we would need to add a loop and potentially re-query if quried rows are not enough. But that is a separate exercise, and superbly suited for coroutines! Closes #7498 * github.com:scylladb/scylla: alternator::streams: Reduce the query limit depending on cdc opts alternator::streams: Use end-of-record info in get_records	2020-11-02 13:34:00 +02:00
Tomasz Grabiec	2dfc5f1ee5	Merge "Cleanup gossiper endpoint interface" from Benny This series cleans up the gossiper endpoint_state interface marking methods const and const noexcept where possible. To achieve that, endpoint_state::get_status was changed to return a string_view rather than a sstring so it won't need to allocate memory. Also, the get_cluster_name and get_partitioner_name were changes to return a const sstring& rather than sstring so they won't need to allocate memory. The motivation for the series stems from #7339 where an exception in get_host_id within a storage_service notification handler, called from seastar::defer crashed the server. With this series, get_host_id may still throw exceptions on logical error, but not from calling get_application_state_ptr. Refs #7339 Test: unit(dev) * tag 'gossiper-endpoint-noexcept-v2': gossiper: mark trivial methods noexcept gossiper: get_cluster_name, get_partitioner_name: make noexcept gossiper: get_gossip_status: return string_view and make noexcept gms/endpoint_state: mark methods using get_status noexcept gms/endpoint_state: get_status: return string_view and make noexcept gms/endpoint_state: mark get_application_state_ptr and is_cql_ready noexcept gms/endpoint_state: mark trivial methods noexcept gms/heart_beat_state: mark methods noexcept gms/versioned_value: mark trivial methods noexcept gms/version_generator: mark get_next_version noexcept fb_utilities.hh: mark methods noexcept messaging: msg_addr: mark methods noexcept gms/inet_address: mark methods noexcept	2020-11-02 12:30:30 +01:00
Avi Kivity	7a3376907e	Merge 'improvements for GCE image' from Bentsi when logging in to the GCE instance that is created from the GCE image it takes 10 seconds to understand that we are not running on AWS. Also, some unnecessary debug logging messages are printed: ``` bentsi@bentsi-G3-3590:~/devel/scylladb$ ssh -i ~/.ssh/scylla-qa-ec2 bentsi@35.196.8.86 Warning: Permanently added '35.196.8.86' (ECDSA) to the list of known hosts. Last login: Sun Nov 1 22:14:57 2020 from 108.128.125.4 _____ _ _ _____ ____ / ____\| \| \| \| \| __ \\| _ \ \| (___ ___ _ _\| \| \| __ _\| \| \| \| \|_) \| \___ \ / __\| \| \| \| \| \|/ _` \| \| \| \| _ < ____) \| (__\| \|_\| \| \| \| (_\| \| \|__\| \| \|_) \| \|_____/ \___\|\__, \|_\|_\|\__,_\|_____/\|____/ __/ \| \|___/ Version: 666.development-0.20201101.6be9f4938 Nodetool: nodetool help CQL Shell: cqlsh More documentation available at: http://www.scylladb.com/doc/ By default, Scylla sends certain information about this node to a data collection server. For information, see http://www.scylladb.com/privacy/ WARNING:root:Failed to grab http://169.254.169.254/latest/... WARNING:root:Failed to grab http://169.254.169.254/latest/... Initial image configuration failed! To see status, run 'systemctl status scylla-image-setup' [bentsi@artifacts-gce-image-jenkins-db-node-aa57409d-0-1 ~]$ ``` this PR fixes this Closes #7523 * github.com:scylladb/scylla: scylla_util.py: remove unnecessary logging scylla_util.py: make is_aws_instance faster scylla_util.py: added ability to control sleep time between retries in curl()	2020-11-02 12:32:25 +02:00
Piotr Sarna	b66c285f94	schema_tables: fix fixing old secondary index schemas Old secondary index schemas did not have their idx_token column marked as computed, and there already exists code which updates them. Unfortunately, the fix itself contains an error and doesn't fire if computed columns are not yet supported by the whole cluster, which is a very common situation during upgrades. Fixes #7515 Closes #7516	2020-11-02 12:30:20 +02:00
Takuya ASADA	100127bc02	install.sh: allow --packaging with nonroot mode Since scylla-ccm wants to skip systemctl, we need to support --packaging in nonroot mode too. Related: #7187	2020-11-02 12:07:14 +02:00
Calle Wilund	7c8f457bab	alternator::streams: Reduce the query limit depending on cdc opts Avoid querying much more than needed. Since we have exact row markers now, this is more safe to do.	2020-11-02 08:37:27 +00:00

1 2 3 4 5 ...

24151 Commits