scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-05 14:33:08 +00:00

Author	SHA1	Message	Date
Paweł Dziepak	00a63663d6	bytes_ostream: increase max chunk size to 128 kB 128 kB is the size of the LSA segment and therefore the default size of any kind of chunks, fragments and buffers. Message-Id: <20180709155615.22500-1-pdziepak@scylladb.com>	2018-07-09 19:59:51 +03:00
Tomasz Grabiec	1336744a05	mutation_fragment: Fix clustering_row::equal() using incorrect column kind Incorrect column_kind was passed, which may cause wrong type to be used for comparison if schema contains static columns. Affects only tests. Spotted during code review. Message-Id: <1531144991-2658-1-git-send-email-tgrabiec@scylladb.com>	2018-07-09 15:25:17 +01:00
Avi Kivity	ed7855a8a6	Update seastar submodule * seastar 216d499...aac6cf1 (5): > reactor: pollable_fd: limit fragment count to IOV_MAX > tests: silence more "-Werror=sign-compare" warnings > reactor: include <boost/next_prior.hpp> > Use `#pragma once` everywhere > .gitignore: adds __pycache__ directory	2018-07-09 17:01:44 +03:00
Gleb Natapov	617666efb0	storage_proxy: use logger's exception printer to report read failure Use existing exception pretty printer since it handles nested exceptions. Message-Id: <20180709122826.GT28899@scylladb.com>	2018-07-09 15:31:14 +03:00
Duarte Nunes	156817e00e	db/size_estimates_virtual_reader: Use left-exclusive token ranges We were considering the token ranges in the size_estimates system table to be inclusive, which is incorrect and incompatible with Cassandra. While we ignore the inclusiveness of the partition_range bounds when selecting sstables, we do take it into account in estimated_keys_for_range(). We would thus select the correct sstables, but could over-estimate the range size nonetheless. Tests: virtual_reader_test(release) Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20180709115919.5106-1-duarte@scylladb.com>	2018-07-09 15:26:32 +03:00
Takuya ASADA	1a5a40e5f6	dist/common/scripts/scylla_util.py: use os.open(O_EXCL) to verify disk is unused To simplify is_unused_disk(), just try to open the disk instead of checking multiple block subsystems. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20180709102729.30066-1-syuu@scylladb.com>	2018-07-09 13:29:15 +03:00
Avi Kivity	7d0df2a06d	Update scylla-ami submodule * dist/ami/files/scylla-ami 67293ba...5200f3f (1): > Add custom script options to AMI user-data	2018-07-09 13:21:30 +03:00
Gleb Natapov	ac27d1c93b	storage_proxy: fix rpc connection failure handling by read operation Currently rpc::closed_error is not counted towards replica failure during read and thus read operation waits for timeout even if one of the nodes dies. Fix this by counting rpc::closed_error towards failed attempts. Fixes #3590. Message-Id: <20180708123522.GC28899@scylladb.com>	2018-07-09 10:05:31 +03:00
Avi Kivity	2f8537b178	database: demote "Setting compaction strategy" log message to debug level It's not very helpful in normal operation, and generates much noise, especially when there are many tables. Message-Id: <20180708070051.8508-1-avi@scylladb.com>	2018-07-08 10:27:03 +01:00
Avi Kivity	512baf536f	storage_proxy: implement write timeouts Require a timeout parameter for storage_proxy::mutate_begin() and all its callers (all the way to thrift and cql modification_statement and batch_statement). This should fix spurious debug-mode test failures, where overcommit and general debug slowness result in the default timeouts being exceeded. Since the tests use infinite timeouts, they should not time out any more. Tests: unit (release), with an extra patch that aborts when a non-infinite timeout is detected. Message-Id: <20180707204424.17116-1-avi@scylladb.com>	2018-07-08 10:27:03 +01:00
Takuya ASADA	929ba016ed	dist/common/scripts/scylla_util.py: strip double quote from sysconfig parameter Current sysconfig_parser.get() returns parameter including double quote, it will cause problem by append text using sysconfig_parser.set(). Fixes #3587 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20180706172219.16859-1-syuu@scylladb.com>	2018-07-08 10:47:41 +03:00
Duarte Nunes	1beed0ca16	Merge 'hinted handoff: add rebalancing and unmark as experimental' from Vlad " This series adds the last missing part of the HH feature list (as in the design doc) - rebalancing; and finally removes the "experimental" tag from the HH. " * 'hinted_handoff_rebalance-v3' of https://github.com/vladzcloudius/scylla: main: remove the "experimental" tag from the hinted handoff feature db::hints::manager: implement rebalance() method	2018-07-07 20:38:07 +01:00
Vlad Zolotarov	7495c8e56d	dist: scylla_lib.sh: get_mode_cpu_set: split the declaration and ssignment to the local variable In bash local variable declaration is a separate operation with its own exit status (always 0) therefore constructs like local var=`cmd` will always result in the 0 exit status ($? value) regardless of the actual result of "cmd" invocation. To overcome this we should split the declaration and the assignment to be like this: local var var=`cmd` Fixes #3508 Signed-off-by: Vlad Zolotarov <vladz@scylladb.com> Message-Id: <1529702903-24909-3-git-send-email-vladz@scylladb.com>	2018-07-07 18:04:19 +03:00
Vlad Zolotarov	f3ca17b1a1	dist: scylla_lib.sh: get_mode_cpu_set: don't let the error messages out References #3508 Signed-off-by: Vlad Zolotarov <vladz@scylladb.com> Message-Id: <1529702903-24909-2-git-send-email-vladz@scylladb.com>	2018-07-07 18:04:18 +03:00
Avi Kivity	e79fccdf7b	Update seastar submodule * seastar d7f35d7...216d499 (10): > temporary_buffer: Add clone method() > temporary_buffer: Make move-assignment operator noexcept. > deleter: Make move-assignment operator noexcept. > reactor: don't become inefficient when max_task_backlog is exceeded > reactor: switch cumulative time metrics resolution from nanoseconds to milliseconds > preempt: annotate for branch prediction > tests: silence "-Werror=sign-compare" warnings > Merge "Support one I/O Scheduler per device" from Glauber > rpc: make rpc server scheduling aware > Add SEASTAR_USER_CFLAGS and SEASTAR_ENABLE_WERROR	2018-07-07 17:48:25 +03:00
Vlad Zolotarov	c65a110839	main: remove the "experimental" tag from the hinted handoff feature Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2018-07-06 19:19:40 -04:00
Vlad Zolotarov	83ba6d84a1	db::hints::manager: implement rebalance() method Rebalance hints segments that need to be sent among all present shards. Ensure that after rebalancing the difference between the number of segments of any two shards is not greater than 1. Try to minimize the amount of "file rename" operations in order to achieve the needed result. Note: "Resharding" is a particular case of rebalancing. Tests: dtest: hintedhandoff_additional_test.py:TestHintedHandoff.hintedhandoff_rebalance_test Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2018-07-06 19:18:46 -04:00
Piotr Sarna	77aa97f62a	cql3: fix ALLOW FILTERING iterator In original series cell iterator for regular cells was erroneously taken by copy instead of by reference, which will result in iterating over the first value indefinitely. Also, the same iterator was not updated for collections, which is fixed too. Message-Id: <83297adf8121de4fd37257c87f250d61ea9ec80b.1530892191.git.sarna@scylladb.com>	2018-07-06 17:23:12 +01:00
Duarte Nunes	0ec3ff0611	Merge 'Add ALLOW FILTERING metrics' from Piotr " This series addresses issue #3575 by adding 3 ALLOW FILTERING related metrics to help profile queries: * number of read request that required filtering * total number of rows read that required filtering * number of rows read that required filtering and matched Tests: unit (release) " * 'allow_filtering_metrics_4' of https://github.com/psarna/scylla: cql3: publish ALLOW FILTERING metrics cql3: add updating ALLOW FILTERING metrics cql3: define ALLOW FILTERING metrics	2018-07-06 11:19:37 +01:00
Piotr Sarna	4a435e6f66	cql3: publish ALLOW FILTERING metrics ALLOW FILTERING related metrics are registered and published. Fixes #3575	2018-07-06 12:00:37 +02:00
Piotr Sarna	03f2f8633b	cql3: add updating ALLOW FILTERING metrics Metrics related to ALLOW FILTERING queries are now properly updated on read requests.	2018-07-06 12:00:29 +02:00
Piotr Sarna	8cb242ab0b	cql3: define ALLOW FILTERING metrics The following metrics are defined for ALLOW FILTERING: * number of read request that required filtering * total number of rows read that required filtering * number of rows read that required filtering and matched	2018-07-06 10:43:18 +02:00
Raphael S. Carvalho	dfd1e1229e	sstables/compaction_manager: fix typo in function name to reevaluate postponed compaction Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20180702185343.26682-1-raphaelsc@scylladb.com>	2018-07-05 18:54:14 +03:00
Takuya ASADA	4df982fe07	dist/common/scripts/scylla_sysconfig_setup: fix typo Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20180705133313.16934-1-syuu@scylladb.com>	2018-07-05 16:38:14 +03:00
Avi Kivity	7a1bcd9ad3	Merge "Improve mutation printing in GDB" from Tomasz " This is a series of patches which make it possible for a human to examine contents of cache or memtables from GDB. " * 'tgrabiec/gdb-cache-printers' of github.com:tgrabiec/scylla: gdb: Add pretty printer for managed_vector gdb: Add pretty printer for rows gdb: Add mutation_partition pretty printer gdb: Add pretty printer for partition_entry gdb: Add pretty printer for managed_bytes gdb: Add iteration wrapper for intrusive_set_external_comparator gdb: Add iteration wrapper for boost intrusive set	2018-07-05 14:08:58 +03:00
Avi Kivity	f55a2fe3a7	main: improve reporting of dns resolution errors A report that C-Ares returned some errors tells the user nothing. Improve the error message by including the name of the configuration variable and its value. Message-Id: <20180705084959.10872-1-avi@scylladb.com>	2018-07-05 10:24:41 +01:00
Duarte Nunes	c126b00793	Merge 'ALLOW FILTERING support' from Piotr " The main idea of this series is to provide a filtering_visitor as a specialised result_set_builder::visitor implementation that keeps restriction info and applies it on query results. Also, since allow_filtering checking is not correct now (e.g. #2025) on select_statement level, this series tries to fix any issues related to it. Still in TODO: * handling CONTAINS relation in single column restriction filtering * handling multi-column restrictions - especially EQ, which can be split into multiple single-column restrictions * more tests - it's never enough; especially esoteric cases like filtering queries which also use secondary indexes, paging tests, etc. Tests: unit (release) " * 'allow_filtering_6' of https://github.com/psarna/scylla: tests: add allow_filtering tests to cql_query_test cql3: enable ALLOW FILTERING service: add filtering_pager cql3: optimize filtering partition keys and static rows cql3: add filtering visitor cql3: move result_set_builder functions to header cql3: amend need_filtering() cql3: add single column primary key restrictions getters cql3: expose single column primary key restrictions cql3: add needs_filtering to primary key restrictions cql3: add simpler single_column_restriction::is_satisfied_by	2018-07-05 10:18:08 +01:00
Piotr Sarna	a7dd02309f	tests: add allow_filtering tests to cql_query_test Test cases for ALLOW FILTERING are added to cql_query_test suite.	2018-07-05 10:50:43 +02:00
Piotr Sarna	27bf20aa3f	cql3: enable ALLOW FILTERING Enables 'ALLOW FILTERING' queries by transfering control to result_set_builder::filtering_visitor. Both regular and primary key columns are allowed, but some things are left unimplemented: - multi-column restrictions - CONTAINS queries Fixes #2025	2018-07-05 10:50:43 +02:00
Piotr Sarna	7b018f6fd6	service: add filtering_pager For paged results of an 'ALLOW FILTERING' query, a filtering pager is provided. It's based on a filtering_visitor for result_builder.	2018-07-05 10:50:43 +02:00
Piotr Sarna	a08fba19e3	cql3: optimize filtering partition keys and static rows If any restriction on partition key or static row part fails, it will be so for every row that belongs to a partition. Hence, full check of the rest of the rows is skipped.	2018-07-05 10:50:43 +02:00
Piotr Sarna	2a0b720102	cql3: add filtering visitor In order to filter results of an 'ALLOW FILTERING' query, a visitor that can take optional filter for result_builder is provided. It defaults to nop_filter, which accepts all rows.	2018-07-05 10:50:43 +02:00
Piotr Sarna	1cf5653f89	cql3: move result_set_builder functions to header Moving function definitions to header is a preparation step before turning result_set_builder into a template.	2018-07-05 10:50:43 +02:00
Piotr Sarna	4d3d32f465	cql3: amend need_filtering() Previous implementation of need_filtering() was too eager to assume that index query should be used, whereas sometimes a query should just be filtered.	2018-07-05 10:50:39 +02:00
Avi Kivity	dd083122f9	Update scylla-ami submodule * dist/ami/files/scylla-ami 0fd9d23...67293ba (1): > scylla_install_ami: fix broken argument parser Fixes #3578.	2018-07-05 09:48:06 +03:00
Avi Kivity	f4caa418ff	Merge "Fix the "LCS data-loss bug"" from Botond " This series fixes the "LCS data-loss bug" where full scans (and everything that uses them) would miss some small percentage (> 0.001%) of the keys. This could easily lead to permanent data-loss as compaction and decomission both use full scans. `aeffbb673` worked around this bug by disabling the incremental reader selectors (the class identified as the source of the bug) altogether. This series fixes the underlying issue and reverts `aeffbb673`. The root cause of the bug is that the `incremental_reader_selector` uses the current read position to poll for new readers using `sstable_set::incremental_selector::select()`. This means that when the currently open sstables contain no partitions that would intersect with some of the yet unselected sstables, those sstables would be ignored. Solve the problem by not calling `select()` with the current read position and always pass the `next_position` returned in the previous call. This means that the traversal of the sstable-set happens at a pace defined by the sstable-set itself and this guarantees that no sstable will be jumped over. When asked for new readers the `incremental_reader_selector` will now iteratively call `select()` using the `next_position` from the previous `select()` call until it either receives some new, yet unselected sstables, or `next_position` surpasses the read position (in which case `select()` will be tried again later). The `sstable_set::incremental_selector` was not suitable in its present state to support calling `select()` with the `next_position` from a previous call as in some cases it could not make progress due to inclusiveness related ambiguities. So in preparation to the above fix `sstable_set` was updated to work in terms of ring-position instead of tokens. Ring-position can express positions in a much more fine-grained way then token, including positions after/before tokens and keys. This allows for a clear expression of `next_position` such that calling `select()` with it guarantees forward progress in the token-space. Tests: unit(release, debug) Refs: #3513 " * 'leveled-missing-keys/v4' of https://github.com/denesb/scylla: tests/mutation_reader_test: combined_mutation_reader_test: use SEASTAR_THREAD_TEST_CASE tests/mutation_reader_test: refactor combined_mutation_reader_test tests/mutation_reader_test: fix reader_selector related tests Revert "database: stop using incremental selectors" incremental_reader_selector: don't jump over sstables mutation_reader: reader_selector: use ring_position instead of token sstables_set::incremental_selector: use ring_position instead of token compatible_ring_position: refactor to compatible_ring_position_view dht::ring_position_view: use token_bound from ring_position i_partitioner: add free function ring-position tri comparator mutation_reader_merger::maybe_add_readers(): remove early return mutation_reader_merger: get rid of _key	2018-07-05 09:33:12 +03:00
Takuya ASADA	3bcc123000	dist/ami: hardcode target for scylla_current_repo since we don't have --target option anymore We break build_ami.sh since we dropped Ubuntu support, scylla_current_repo command does not finishes because of less argument ('--target' with no distribution name, since $TARGET is always blank now). It need to hardcoded as centos. Fixes #3577 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20180705035251.29160-1-syuu@scylladb.com>	2018-07-05 09:31:43 +03:00
Paweł Dziepak	07a429e837	test.py: do not disable human-readable format with --jenkins flag When test.py is run with --jenkins flag Boost UTF is asked to generate an XML file with the test results. This automatically disables the human-readable output printed to stdout. There is no real reason to do so and it is actually less confusing when the Boost UTF messages are in the test output together with Scylla logger messages. Message-Id: <20180704172913.23462-1-pdziepak@scylladb.com>	2018-07-05 09:31:15 +03:00
Raphael S. Carvalho	7d6af5da3a	sstables/compaction_manager: properly reevaluate postponed compactions for leveled strategy Function to reevaluate postponed compaction was called too early for strategies that don't allow parallel compaction (only leveled strategy (LCS) at this moment). Such strategies must first have the ongoing compaction deregistered before reevaluating the postponed ones. Manager uses task list of ongoing compaction to decides if there's ongoing compaction for a given column family. So compaction could stop making progress at all if and only if we stop flushing new data. So it could happen that a column family would be left with lots of pending compaction, leading the user to think all compacting is done, but after reboot, there will be lots of compaction activity. We'll both improve method to detect parallel compaction here and also add a call to reevaluate postponed compaction after compaction is done. Fixes #3534. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20180702185327.26615-1-raphaelsc@scylladb.com>	2018-07-04 16:30:21 +01:00
Botond Dénes	b32f94d31e	tests/mutation_reader_test: combined_mutation_reader_test: use SEASTAR_THREAD_TEST_CASE	2018-07-04 17:42:37 +03:00
Botond Dénes	77ad085393	tests/mutation_reader_test: refactor combined_mutation_reader_test Make combined_mutation_reader_test more interesting: * Set the levels on the sstables * Arrange the sstables so that they test for the "jump over sstables" bug. * Arrange the sstables so that they test for the "gap between sstables". While at it also make the code more compact.	2018-07-04 17:42:37 +03:00
Botond Dénes	4b57fc9aea	tests/mutation_reader_test: fix reader_selector related tests Don't assume the partition keys use lexical ordering. Add some additional checks.	2018-07-04 17:42:37 +03:00
Botond Dénes	a9c465d7d2	Revert "database: stop using incremental selectors" The data-loss bug is fixed, the incremental selector can be used again. This reverts commit `aeffbb6732`.	2018-07-04 17:42:37 +03:00
Botond Dénes	c37aff419e	incremental_reader_selector: don't jump over sstables Passing the current read position to the `incremental_selector::select()` can lead to "jumping" through sstables. This can happen when the currently open sstables have no partition that intersects with a yet unselected sstable that has an intersecting range nevertheless, in other words there is a gap in the selected sstables that this unselected one completely fits into. In this case the unselected sstable will be completely omitted from the read. The solution is to not to avoid calling `select()` with a position that is larger than the `next_position` returned from the previous `select()` call. Instead, call `select()` repeatedly with the `next_position` from the previous call, until either at least one new sstable is selected or the current read position is surpassed. This guarantess that no sstables will be jumped over. In other words, advance the incremental selector in a pace defined by itself thus guaranteeing that no sstable will be jumped over.	2018-07-04 17:42:37 +03:00
Botond Dénes	81a03db955	mutation_reader: reader_selector: use ring_position instead of token sstable_set::incremental selector was migrated to ring position, follow suit and migrate the reader_selector to use ring_position as well. Above correctness this also improves efficiency in case of dense tables, avoiding prematurely selecting sstables that share the token but start at different keys, altough one could argue that this is a niche case.	2018-07-04 17:42:37 +03:00
Botond Dénes	a8e795a16e	sstables_set::incremental_selector: use ring_position instead of token Currently `sstable_set::incremental_selector` works in terms of tokens. Sstables can be selected with tokens and internally the token-space is partitioned (in `partitioned_sstable_set`, used for LCS) with tokens as well. This is problematic for severeal reasons. The sub-range sstables cover from the token-space is defined in terms of decorated keys. It is even possible that multiple sstables cover multiple non-overlapping sub-ranges of a single token. The current system is unable to model this and will at best result in selecting unnecessary sstables. The usage of token for providing the next position where the intersecting sstables change [1] causes further problems. Attempting to walk over the token-space by repeatedly calling `select()` with the `next_position` returned from the previous call will quite possibly lead to an infinite loop as a token cannot express inclusiveness/exclusiveness and thus the incremental selector will not be able to make progress when the upper and lower bounds of two neighbouring intervals share the same token with different inclusiveness e.g. [t1, t2](t2, t3]. To solve these problems update incremental_selector to work in terms of ring position. This makes it possible to partition the token-space amoing sstables at decorated key granularity. It also makes it possible for select() to return a next_position that is guaranteed to make progress. partitioned_sstable_set now builds the internal interval map using the decorated key of the sstables, not just the tokens. incremental_selector::select() now uses `dht::ring_position_view` as both the selector and the next_position. ring_position_view can express positions between keys so it can also include information about inclusiveness/exclusiveness of the next interval guaranteeing forward progress. [1] `sstable_set::incremental_selector::selection::next_position`	2018-07-04 17:42:33 +03:00
Duarte Nunes	33d7de0805	Merge 'Expose sharding information to connections' from Avi " In the same way that drivers can route requests to a coordinator that is also a replica of the data used by the request, we can allow drivers to route requests directly to the shard. This patchset adds and documents a way for drivers to know which shard a connection is connected to, and how to perform this routing. " * tag 'shard-info-alt/v1' of https://github.com/avikivity/scylla: doc: documented protocol extension for exposing sharding transport: expose more information about sharding via the OPTIONS/SUPPORTED messages dht: add i_partitioner::sharding_ignore_msb()	2018-07-04 13:01:21 +01:00
Botond Dénes	8084ce3a8e	query_pager: use query::is_single_partition() to check for singular range Use query::is_single_partition() to check whether the queried ranges are singular or not. The current method of using `dht::partition_range::is_singular()` is incorrect, as it is possible to build a singular range that doesn't represent a single partition. `query::is_single_partition()` correctly checks for this so use it instead. Found during code-review. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <f671f107e8069910a2f84b14c8d22638333d571c.1530675889.git.bdenes@scylladb.com>	2018-07-04 10:04:50 +01:00
Takuya ASADA	3cb7ddaf68	dist/debian/build_deb.sh: make build_deb.sh more simplified Use is_debian()/is_ubuntu() to detect target distribution, also install pystache by path since package name is different between Fedora and CentOS. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20180703193224.4773-1-syuu@scylladb.com>	2018-07-04 11:12:26 +03:00
Takuya ASADA	ed1d0b6839	dist/ami/files/.bash_profile: drop Ubuntu support Drop Ubuntu support on login prompt, too. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20180703192813.4589-1-syuu@scylladb.com>	2018-07-04 11:12:26 +03:00

1 2 3 4 5 ...

16035 Commits