scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-26 19:35:12 +00:00

Author	SHA1	Message	Date
Vlad Zolotarov	c65a110839	main: remove the "experimental" tag from the hinted handoff feature Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2018-07-06 19:19:40 -04:00
Vlad Zolotarov	83ba6d84a1	db::hints::manager: implement rebalance() method Rebalance hints segments that need to be sent among all present shards. Ensure that after rebalancing the difference between the number of segments of any two shards is not greater than 1. Try to minimize the amount of "file rename" operations in order to achieve the needed result. Note: "Resharding" is a particular case of rebalancing. Tests: dtest: hintedhandoff_additional_test.py:TestHintedHandoff.hintedhandoff_rebalance_test Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2018-07-06 19:18:46 -04:00
Duarte Nunes	0ec3ff0611	Merge 'Add ALLOW FILTERING metrics' from Piotr " This series addresses issue #3575 by adding 3 ALLOW FILTERING related metrics to help profile queries: * number of read request that required filtering * total number of rows read that required filtering * number of rows read that required filtering and matched Tests: unit (release) " * 'allow_filtering_metrics_4' of https://github.com/psarna/scylla: cql3: publish ALLOW FILTERING metrics cql3: add updating ALLOW FILTERING metrics cql3: define ALLOW FILTERING metrics	2018-07-06 11:19:37 +01:00
Piotr Sarna	4a435e6f66	cql3: publish ALLOW FILTERING metrics ALLOW FILTERING related metrics are registered and published. Fixes #3575	2018-07-06 12:00:37 +02:00
Piotr Sarna	03f2f8633b	cql3: add updating ALLOW FILTERING metrics Metrics related to ALLOW FILTERING queries are now properly updated on read requests.	2018-07-06 12:00:29 +02:00
Piotr Sarna	8cb242ab0b	cql3: define ALLOW FILTERING metrics The following metrics are defined for ALLOW FILTERING: * number of read request that required filtering * total number of rows read that required filtering * number of rows read that required filtering and matched	2018-07-06 10:43:18 +02:00
Raphael S. Carvalho	dfd1e1229e	sstables/compaction_manager: fix typo in function name to reevaluate postponed compaction Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20180702185343.26682-1-raphaelsc@scylladb.com>	2018-07-05 18:54:14 +03:00
Takuya ASADA	4df982fe07	dist/common/scripts/scylla_sysconfig_setup: fix typo Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20180705133313.16934-1-syuu@scylladb.com>	2018-07-05 16:38:14 +03:00
Avi Kivity	7a1bcd9ad3	Merge "Improve mutation printing in GDB" from Tomasz " This is a series of patches which make it possible for a human to examine contents of cache or memtables from GDB. " * 'tgrabiec/gdb-cache-printers' of github.com:tgrabiec/scylla: gdb: Add pretty printer for managed_vector gdb: Add pretty printer for rows gdb: Add mutation_partition pretty printer gdb: Add pretty printer for partition_entry gdb: Add pretty printer for managed_bytes gdb: Add iteration wrapper for intrusive_set_external_comparator gdb: Add iteration wrapper for boost intrusive set	2018-07-05 14:08:58 +03:00
Avi Kivity	f55a2fe3a7	main: improve reporting of dns resolution errors A report that C-Ares returned some errors tells the user nothing. Improve the error message by including the name of the configuration variable and its value. Message-Id: <20180705084959.10872-1-avi@scylladb.com>	2018-07-05 10:24:41 +01:00
Duarte Nunes	c126b00793	Merge 'ALLOW FILTERING support' from Piotr " The main idea of this series is to provide a filtering_visitor as a specialised result_set_builder::visitor implementation that keeps restriction info and applies it on query results. Also, since allow_filtering checking is not correct now (e.g. #2025) on select_statement level, this series tries to fix any issues related to it. Still in TODO: * handling CONTAINS relation in single column restriction filtering * handling multi-column restrictions - especially EQ, which can be split into multiple single-column restrictions * more tests - it's never enough; especially esoteric cases like filtering queries which also use secondary indexes, paging tests, etc. Tests: unit (release) " * 'allow_filtering_6' of https://github.com/psarna/scylla: tests: add allow_filtering tests to cql_query_test cql3: enable ALLOW FILTERING service: add filtering_pager cql3: optimize filtering partition keys and static rows cql3: add filtering visitor cql3: move result_set_builder functions to header cql3: amend need_filtering() cql3: add single column primary key restrictions getters cql3: expose single column primary key restrictions cql3: add needs_filtering to primary key restrictions cql3: add simpler single_column_restriction::is_satisfied_by	2018-07-05 10:18:08 +01:00
Piotr Sarna	a7dd02309f	tests: add allow_filtering tests to cql_query_test Test cases for ALLOW FILTERING are added to cql_query_test suite.	2018-07-05 10:50:43 +02:00
Piotr Sarna	27bf20aa3f	cql3: enable ALLOW FILTERING Enables 'ALLOW FILTERING' queries by transfering control to result_set_builder::filtering_visitor. Both regular and primary key columns are allowed, but some things are left unimplemented: - multi-column restrictions - CONTAINS queries Fixes #2025	2018-07-05 10:50:43 +02:00
Piotr Sarna	7b018f6fd6	service: add filtering_pager For paged results of an 'ALLOW FILTERING' query, a filtering pager is provided. It's based on a filtering_visitor for result_builder.	2018-07-05 10:50:43 +02:00
Piotr Sarna	a08fba19e3	cql3: optimize filtering partition keys and static rows If any restriction on partition key or static row part fails, it will be so for every row that belongs to a partition. Hence, full check of the rest of the rows is skipped.	2018-07-05 10:50:43 +02:00
Piotr Sarna	2a0b720102	cql3: add filtering visitor In order to filter results of an 'ALLOW FILTERING' query, a visitor that can take optional filter for result_builder is provided. It defaults to nop_filter, which accepts all rows.	2018-07-05 10:50:43 +02:00
Piotr Sarna	1cf5653f89	cql3: move result_set_builder functions to header Moving function definitions to header is a preparation step before turning result_set_builder into a template.	2018-07-05 10:50:43 +02:00
Piotr Sarna	4d3d32f465	cql3: amend need_filtering() Previous implementation of need_filtering() was too eager to assume that index query should be used, whereas sometimes a query should just be filtered.	2018-07-05 10:50:39 +02:00
Avi Kivity	dd083122f9	Update scylla-ami submodule * dist/ami/files/scylla-ami 0fd9d23...67293ba (1): > scylla_install_ami: fix broken argument parser Fixes #3578.	2018-07-05 09:48:06 +03:00
Avi Kivity	f4caa418ff	Merge "Fix the "LCS data-loss bug"" from Botond " This series fixes the "LCS data-loss bug" where full scans (and everything that uses them) would miss some small percentage (> 0.001%) of the keys. This could easily lead to permanent data-loss as compaction and decomission both use full scans. `aeffbb673` worked around this bug by disabling the incremental reader selectors (the class identified as the source of the bug) altogether. This series fixes the underlying issue and reverts `aeffbb673`. The root cause of the bug is that the `incremental_reader_selector` uses the current read position to poll for new readers using `sstable_set::incremental_selector::select()`. This means that when the currently open sstables contain no partitions that would intersect with some of the yet unselected sstables, those sstables would be ignored. Solve the problem by not calling `select()` with the current read position and always pass the `next_position` returned in the previous call. This means that the traversal of the sstable-set happens at a pace defined by the sstable-set itself and this guarantees that no sstable will be jumped over. When asked for new readers the `incremental_reader_selector` will now iteratively call `select()` using the `next_position` from the previous `select()` call until it either receives some new, yet unselected sstables, or `next_position` surpasses the read position (in which case `select()` will be tried again later). The `sstable_set::incremental_selector` was not suitable in its present state to support calling `select()` with the `next_position` from a previous call as in some cases it could not make progress due to inclusiveness related ambiguities. So in preparation to the above fix `sstable_set` was updated to work in terms of ring-position instead of tokens. Ring-position can express positions in a much more fine-grained way then token, including positions after/before tokens and keys. This allows for a clear expression of `next_position` such that calling `select()` with it guarantees forward progress in the token-space. Tests: unit(release, debug) Refs: #3513 " * 'leveled-missing-keys/v4' of https://github.com/denesb/scylla: tests/mutation_reader_test: combined_mutation_reader_test: use SEASTAR_THREAD_TEST_CASE tests/mutation_reader_test: refactor combined_mutation_reader_test tests/mutation_reader_test: fix reader_selector related tests Revert "database: stop using incremental selectors" incremental_reader_selector: don't jump over sstables mutation_reader: reader_selector: use ring_position instead of token sstables_set::incremental_selector: use ring_position instead of token compatible_ring_position: refactor to compatible_ring_position_view dht::ring_position_view: use token_bound from ring_position i_partitioner: add free function ring-position tri comparator mutation_reader_merger::maybe_add_readers(): remove early return mutation_reader_merger: get rid of _key	2018-07-05 09:33:12 +03:00
Takuya ASADA	3bcc123000	dist/ami: hardcode target for scylla_current_repo since we don't have --target option anymore We break build_ami.sh since we dropped Ubuntu support, scylla_current_repo command does not finishes because of less argument ('--target' with no distribution name, since $TARGET is always blank now). It need to hardcoded as centos. Fixes #3577 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20180705035251.29160-1-syuu@scylladb.com>	2018-07-05 09:31:43 +03:00
Paweł Dziepak	07a429e837	test.py: do not disable human-readable format with --jenkins flag When test.py is run with --jenkins flag Boost UTF is asked to generate an XML file with the test results. This automatically disables the human-readable output printed to stdout. There is no real reason to do so and it is actually less confusing when the Boost UTF messages are in the test output together with Scylla logger messages. Message-Id: <20180704172913.23462-1-pdziepak@scylladb.com>	2018-07-05 09:31:15 +03:00
Raphael S. Carvalho	7d6af5da3a	sstables/compaction_manager: properly reevaluate postponed compactions for leveled strategy Function to reevaluate postponed compaction was called too early for strategies that don't allow parallel compaction (only leveled strategy (LCS) at this moment). Such strategies must first have the ongoing compaction deregistered before reevaluating the postponed ones. Manager uses task list of ongoing compaction to decides if there's ongoing compaction for a given column family. So compaction could stop making progress at all if and only if we stop flushing new data. So it could happen that a column family would be left with lots of pending compaction, leading the user to think all compacting is done, but after reboot, there will be lots of compaction activity. We'll both improve method to detect parallel compaction here and also add a call to reevaluate postponed compaction after compaction is done. Fixes #3534. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20180702185327.26615-1-raphaelsc@scylladb.com>	2018-07-04 16:30:21 +01:00
Botond Dénes	b32f94d31e	tests/mutation_reader_test: combined_mutation_reader_test: use SEASTAR_THREAD_TEST_CASE	2018-07-04 17:42:37 +03:00
Botond Dénes	77ad085393	tests/mutation_reader_test: refactor combined_mutation_reader_test Make combined_mutation_reader_test more interesting: * Set the levels on the sstables * Arrange the sstables so that they test for the "jump over sstables" bug. * Arrange the sstables so that they test for the "gap between sstables". While at it also make the code more compact.	2018-07-04 17:42:37 +03:00
Botond Dénes	4b57fc9aea	tests/mutation_reader_test: fix reader_selector related tests Don't assume the partition keys use lexical ordering. Add some additional checks.	2018-07-04 17:42:37 +03:00
Botond Dénes	a9c465d7d2	Revert "database: stop using incremental selectors" The data-loss bug is fixed, the incremental selector can be used again. This reverts commit `aeffbb6732`.	2018-07-04 17:42:37 +03:00
Botond Dénes	c37aff419e	incremental_reader_selector: don't jump over sstables Passing the current read position to the `incremental_selector::select()` can lead to "jumping" through sstables. This can happen when the currently open sstables have no partition that intersects with a yet unselected sstable that has an intersecting range nevertheless, in other words there is a gap in the selected sstables that this unselected one completely fits into. In this case the unselected sstable will be completely omitted from the read. The solution is to not to avoid calling `select()` with a position that is larger than the `next_position` returned from the previous `select()` call. Instead, call `select()` repeatedly with the `next_position` from the previous call, until either at least one new sstable is selected or the current read position is surpassed. This guarantess that no sstables will be jumped over. In other words, advance the incremental selector in a pace defined by itself thus guaranteeing that no sstable will be jumped over.	2018-07-04 17:42:37 +03:00
Botond Dénes	81a03db955	mutation_reader: reader_selector: use ring_position instead of token sstable_set::incremental selector was migrated to ring position, follow suit and migrate the reader_selector to use ring_position as well. Above correctness this also improves efficiency in case of dense tables, avoiding prematurely selecting sstables that share the token but start at different keys, altough one could argue that this is a niche case.	2018-07-04 17:42:37 +03:00
Botond Dénes	a8e795a16e	sstables_set::incremental_selector: use ring_position instead of token Currently `sstable_set::incremental_selector` works in terms of tokens. Sstables can be selected with tokens and internally the token-space is partitioned (in `partitioned_sstable_set`, used for LCS) with tokens as well. This is problematic for severeal reasons. The sub-range sstables cover from the token-space is defined in terms of decorated keys. It is even possible that multiple sstables cover multiple non-overlapping sub-ranges of a single token. The current system is unable to model this and will at best result in selecting unnecessary sstables. The usage of token for providing the next position where the intersecting sstables change [1] causes further problems. Attempting to walk over the token-space by repeatedly calling `select()` with the `next_position` returned from the previous call will quite possibly lead to an infinite loop as a token cannot express inclusiveness/exclusiveness and thus the incremental selector will not be able to make progress when the upper and lower bounds of two neighbouring intervals share the same token with different inclusiveness e.g. [t1, t2](t2, t3]. To solve these problems update incremental_selector to work in terms of ring position. This makes it possible to partition the token-space amoing sstables at decorated key granularity. It also makes it possible for select() to return a next_position that is guaranteed to make progress. partitioned_sstable_set now builds the internal interval map using the decorated key of the sstables, not just the tokens. incremental_selector::select() now uses `dht::ring_position_view` as both the selector and the next_position. ring_position_view can express positions between keys so it can also include information about inclusiveness/exclusiveness of the next interval guaranteeing forward progress. [1] `sstable_set::incremental_selector::selection::next_position`	2018-07-04 17:42:33 +03:00
Duarte Nunes	33d7de0805	Merge 'Expose sharding information to connections' from Avi " In the same way that drivers can route requests to a coordinator that is also a replica of the data used by the request, we can allow drivers to route requests directly to the shard. This patchset adds and documents a way for drivers to know which shard a connection is connected to, and how to perform this routing. " * tag 'shard-info-alt/v1' of https://github.com/avikivity/scylla: doc: documented protocol extension for exposing sharding transport: expose more information about sharding via the OPTIONS/SUPPORTED messages dht: add i_partitioner::sharding_ignore_msb()	2018-07-04 13:01:21 +01:00
Botond Dénes	8084ce3a8e	query_pager: use query::is_single_partition() to check for singular range Use query::is_single_partition() to check whether the queried ranges are singular or not. The current method of using `dht::partition_range::is_singular()` is incorrect, as it is possible to build a singular range that doesn't represent a single partition. `query::is_single_partition()` correctly checks for this so use it instead. Found during code-review. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <f671f107e8069910a2f84b14c8d22638333d571c.1530675889.git.bdenes@scylladb.com>	2018-07-04 10:04:50 +01:00
Takuya ASADA	3cb7ddaf68	dist/debian/build_deb.sh: make build_deb.sh more simplified Use is_debian()/is_ubuntu() to detect target distribution, also install pystache by path since package name is different between Fedora and CentOS. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20180703193224.4773-1-syuu@scylladb.com>	2018-07-04 11:12:26 +03:00
Takuya ASADA	ed1d0b6839	dist/ami/files/.bash_profile: drop Ubuntu support Drop Ubuntu support on login prompt, too. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20180703192813.4589-1-syuu@scylladb.com>	2018-07-04 11:12:26 +03:00
Piotr Sarna	f42eaff75e	cql3: add single column primary key restrictions getters Getters for single column partition/clustering key restrictions are added to statement_restrictions.	2018-07-04 09:48:32 +02:00
Piotr Sarna	a99acbc376	cql3: expose single column primary key restrictions Underlying single_column_restrictions are exposed for single_column_primary_key_restrictions via a const method.	2018-07-04 09:48:32 +02:00
Piotr Sarna	f7a2f15935	cql3: add needs_filtering to primary key restrictions Primary key restrictions sometimes require filtering. These functions return true if ALLOW FILTERING needs to be enabled in order to satisfy these restrictions.	2018-07-04 09:48:32 +02:00
Piotr Sarna	6aec9e711f	cql3: add simpler single_column_restriction::is_satisfied_by Currently restriction::is_satisfied_by() accepts only keys and rows as arguments. In this commit, a version that only takes bytes of data is provided. This simpler version applies to single_column_restriction only, because it compares raw bytes underneath anyway. For other restriction types, simplified is_satisfied_by is not defined.	2018-07-04 09:48:32 +02:00
Botond Dénes	bf2645c616	compatible_ring_position: refactor to compatible_ring_position_view compatible_ring_position's sole purpose is to allow creating boost::icl::interval_map with dht::ring_position as the key and list of sstables as the value. This function is served equally well if compatible_ring_position wraps a `dht::ring_position_view` instead of a `dht::ring_position` with the added benefit of not having to copy the possibly heavy `dht::decorated_key` around. It also makes it possible to do lookups with `dht::ring_position_view` which is much more versatile and allows avoiding copies just to make lookups. The only downside is that `dht::ring_position_view` requires the lifetime of the "viewed" object to be taken care of. This is not a concern however, as so long as an interval is present in the map the represented sstable is guaranteed to be alive to, as the interval map participates in the ownership of the stored sstables. Rename compatible_ring_position to compatible_ring_position_view to reflect the changes. While at it upgrade the std::experimental::optional to std::optional.	2018-07-04 08:19:39 +03:00
Botond Dénes	48b07ba5d3	dht::ring_position_view: use token_bound from ring_position Currently dht::ring_position_view's dht::token constructor takes the token bound in the form of a raw `uint8_t`. This allows for passing a weight of "0" which is illegal as single token does not represent a single ring position but an interval as arbitrary number of keys can have the same token. dht::ring_position uses an enum in its dht::token constructor. Import that same enum into the dht::ring_position_view scope and take a `token_bound` instead of `uint8_t`. This is especially important as in later patches the internal weight of the ring_position_view will be exposed and illegal values can cause all sorts of problems.	2018-07-04 08:19:34 +03:00
Alexys Jacob	8c03c1e2ce	Support Gentoo Linux on node_health_check script. Gentoo Linux was not supported by the node_health_check script which resulted in the following error message displayed: "This s a Non-Supported OS, Please Review the Support Matrix" This patch adds support for Gentoo Linux while adding a TODO note to add support for authenticated clusters which the script does not support yet. Signed-off-by: Alexys Jacob <ultrabug@gentoo.org> Message-Id: <20180703124458.3788-1-ultrabug@gentoo.org>	2018-07-03 20:18:13 +03:00
Tomasz Grabiec	2ffb621271	Merge "Fix atomic_cell_or_collection::external_memory_usage()" from Paweł After the transition to the new in-memory representation in `aab6b0ee27` 'Merge "Introduce new in-memory representation for cells" from Paweł' atomic_cell_or_collection::external_memory_usage() stopped accounting for the externally stored data. Since, it wasn't covered by the unit tests the bug remained unnotices until now. This series fixes the memory usage calculation and adds proper unit tests. * https://github.com/pdziepak/scylla.git fix-external-memory-usage/v1: tests/mutation: properly mark atomic_cells that are collection members imr::utils::object: expose size overhead data::cell: expose size overhead of external chunks atomic_cell: add external chunks and overheads to external_memory_usage() tests/mutation: test external_memory_usage()	2018-07-03 14:58:10 +02:00
Botond Dénes	c236a96d7d	tests/cql_query_tess: add unit test for querying empty ranges test A bug was found recently (#3564) in the paging logic, where the code assumed the queried ranges list is non-empty. This assumption is incorrect as there can be valid (if rare) queries that can result in the ranges list to be empty. Add a unit test that executes such a query with paging enabled to detect any future bugs related to assumptions about the ranges list being non-empty. Refs: #3564 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <f5ba308c4014c24bb392060a7e72e7521ff021fa.1530618836.git.bdenes@scylladb.com>	2018-07-03 13:43:17 +01:00
Botond Dénes	59a30f0684	query_pager: be prepared to _ranges being empty do_fetch_page() checks in the beginning whether there is a saved query state already, meaning this is not the first page. If there is not it checks whether the query is for a singulular partitions or a range scan to decide whether to enable the stateful queries or not. This check assumed that there is at least one range in _ranges which will not hold under some circumstances. Add a check for _ranges being empty. Fixes: #3564 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <cbe64473f8013967a93ef7b2104c7ca0507afac9.1530610709.git.bdenes@scylladb.com>	2018-07-03 11:05:01 +01:00
Avi Kivity	eafd16266d	tests: reduce multishard_mutation_test runtime in debug mode Debug mode is so slow that generating 1000 mutations is too much for it. High memory use can also confuse the santitizers that track each allocation. Reduce mutation count from 1000 to 10 in debug mode.	2018-07-03 12:01:44 +03:00
Avi Kivity	a36b1f1967	Merge "more scylla_setup fixes" from Takuya " Added NIC / Disk existance check, --force-raid mode on scylla_raid_setup. " * 'scylla_setup_fix4' of https://github.com/syuu1228/scylla: dist/common/scripts/scylla_raid_setup: verify specified disks are unused dist/common/scripts/scylla_raid_setup: add --force-raid to construct raid even only one disk is specified dist/common/scripts/scylla_setup: don't accept disk path if it's not block device dist/common/scripts/scylla_raid_setup: verify specified disk paths are block device dist/common/scripts/scylla_sysconfig_setup: verify NIC existance	2018-07-03 11:03:08 +03:00
Takuya ASADA	d0f39ea31d	dist/common/scripts/scylla_raid_setup: verify specified disks are unused Currently only scylla_setup interactive mode verifies selected disks are unused, on non-interactive mode we get mdadm/mkfs.xfs program error and python backtrace when disks are busy. So we should verify disks are unused also on scylla_raid_setup, print out simpler error message.	2018-07-03 14:50:34 +09:00
Takuya ASADA	3289642223	dist/common/scripts/scylla_raid_setup: add --force-raid to construct raid even only one disk is specified User may want to start RAID volume with only one disk, add an option to force constructing RAID even only one disk specified.	2018-07-03 14:50:34 +09:00
Takuya ASADA	e0c16c4585	dist/common/scripts/scylla_setup: don't accept disk path if it's not block device Need to ignore input when specified path is not block device.	2018-07-03 14:50:34 +09:00
Takuya ASADA	24ca2d85c6	dist/common/scripts/scylla_raid_setup: verify specified disk paths are block device Verify disk paths are block device, exit with error if not.	2018-07-03 14:50:34 +09:00

1 2 3 4 5 ...

16019 Commits