scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-03 13:37:04 +00:00

Author	SHA1	Message	Date
Raphael S. Carvalho	311cd6403c	cql3/statements: verify that counter column cannot be added into non-counter table A check, to validate that counter column cannot be added into non-counter table, is missing for alter table statement. Validation is performed when building new schema, but it's limited to checking that a schema will not contain both counter and non-counter columns. Due to lack of validation, the added counter column could be incorrectly persisted to the schema, but this results in a crash when setting the new schema to its table. On restart, it can be confirmed that the schema change was indeed persisted when describing the table. This problem is fixed by doing proper validation for the alter table statement, which consists of making sure a new counter column cannot be added to a non-counter table. The test cdc_disallow_cdc_for_counters_test is adjusted because one of its tests was built on the assumption that counter column can be added into a non-counter table. Fixes #7065. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20200824155709.34743-1-raphaelsc@scylladb.com> (cherry picked from commit `1c29f0a43d`)	2020-08-25 18:45:30 +03:00
Avi Kivity	59aa1834a7	Merge "repair: row_level: prevent deadlocks when repairing homogenous nodes" from Botond " This series backports the series "repair: row_level: prevent deadlocks when repairing homogenous nodes" (merged as `a9c7a1a86`) to branch-4.1. " Fixes #6272 * 'repair-row-level-evictable-local-reader/branch-4.1' of https://github.com/denesb/scylla: repair: row_level: destroy reader on EOS or error repair: row_level: use evictable_reader for local reads mutation_reader: expose evictable_reader mutation_reader: evictable_reader: add auto_pause flag mutation_reader: make evictable_reader a flat_mutation_reader mutation_reader: s/inactive_shard_read/inactive_evictable_reader/ mutation_reader: move inactive_shard_reader code up mutation_reader: fix indentation mutation_reader: shard_reader: extract remote_reader as evictable_reader mutation_reader: reader_lifecycle_policy: make semaphore() available early	2020-08-23 18:06:12 +03:00
Botond Dénes	436b305286	view_update_generator: fix race between registering and processing sstables `fea83f6` introduced a race between processing (and hence removing) sstables from `_sstables_with_tables` and registering new ones. This manifested in sstables that were added concurrently with processing a batch for the same sstables being dropped and the semaphore units associated with them not returned. This resulted in repairs being blocked indefinitely as the units of the semaphore were effectively leaked. This patch fixes this by moving the contents of `_sstables_with_tables` to a local variable before starting the processing. A unit test reproducing the problem is also added. Fixes: #6892 Tests: unit(dev) Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200817160913.2296444-1-bdenes@scylladb.com> (cherry picked from commit `22a6493716`)	2020-08-23 18:04:29 +03:00
Botond Dénes	eece444547	mutation_reader: expose evictable_reader Expose functions for the outside world to create evictable readers. We expose two functions, which create an evictable reader with `auto_pause::yes` and `auto_pause::no` respectively. The function creating the latter also returns a handle in addition to the reader, which can be used to pause the reader. (cherry picked from commit `542d9c3711`)	2020-08-20 16:10:16 +03:00
Nadav Har'El	d5e5a6fe48	alternator: fix Expected's "NULL" operator with missing AttributeValueList The "NULL" operator in Expected (old-style conditional operations) doesn't have any parameters, so we insisted that the AttributeValueList be empty. However, we forgot to allow it to also be missing - a possibility which DynamoDB allows. This patch adds a test to reproduce this case (the test passes on DynamoDB, fails on Alternator before this patch, and succeeds after this patch), and a fix. Fixes #6816. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200709161254.618755-1-nyh@scylladb.com> (cherry picked from commit `f549d147ea`)	2020-08-03 20:42:15 +03:00
Botond Dénes	5d6a7272e7	sstables: clamp estimated_partitions to [1, +inf) in writers In some cases estimated number of partitions can be 0, which is albeit a legit estimation result, breaks many low-level sstable writer code, so some of these have assertions to ensure estimated partitions is > 0. To avoid hitting this assert all users of the sstable writers do the clamping, to ensure estimated partitions is at least 1. However leaving this to the callers is error prone as #6913 has shown it. As this clamping is standard practice, it is better to do it in the writers themselves, avoiding this problem altogether. This is exactly what this patch does. It also adds two unit tests, one that reproduces the crash in #6913, and another one that ensures all sstable writers are fine with estimated partitions being 0 now. Call sites previously doing the clamping are changed to not do it, it is unnecessary now as the writer does it itself. Fixes #6913 Tests: unit(dev) Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200724120227.267184-1-bdenes@scylladb.com> [avi: adjust sstable_datafile_test's use of compaction_descriptor and make_permit] (cherry picked from commit `fe127a2155`)	2020-07-28 09:55:34 +03:00
Dejan Mircevski	db286c5ca4	cql/restrictions: Handle `WHERE a>0 AND a<0` WHERE clauses with start point above the end point were handled incorrectly. When the slice bounds are transformed to interval bounds, the resulting interval is interpreted as wrap-around (because start > end), so it contains all values above 0 and all values below 0. This is clearly incorrect, as the user's intent was to filter out all possible values of a. Fix it by explicitly short-circuiting to false when start > end. Add a test case. Fixes #5799. Tests: unit (dev) Signed-off-by: Dejan Mircevski <dejan@scylladb.com> (cherry picked from commit `921dbd0978`)	2020-07-08 13:21:00 +03:00
Botond Dénes	519fcd4729	db/view: view_update_generator: re-balance wait/signal on the register semaphore The view update generator has a semaphore to limit concurrency. This semaphore is waited on in `register_staging_sstable()` and later the unit is returned after the sstable is processed in the loop inside `start()`. This was broken by `4e64002`, which changed the loop inside `start()` to process sstables in per table batches, however didn't change the `signal()` call to return the amount of units according to the number of sstables processed. This can cause the semaphore units to dry up, as the loop can process multiple sstables per table but return just a single unit. This can also block callers of `register_staging_sstable()` indefinitely as some waiters will never be released as under the right circumstances the units on the semaphore can permanently go below 0. In addition to this, `4e64002` introduced another bug: table entries from the `_sstables_with_tables` are never removed, so they are processed every turn. If the sstable list is empty, there won't be any update generated but due to the unconditional `signal()` described above, this can cause the units on the semaphore to grow to infinity, allowing future staging sstables producers to register a huge amount of sstables, causing memory problems due to the amount of sstable readers that have to be opened (#6603, #6707). Both outcomes are equally bad. This patch fixes both issues and modifies the `test_view_update_generator` unit test to reproduce them and hence to verify that this doesn't happen in the future. Fixes: #6774 Refs: #6707 Refs: #6603 Tests: unit(dev) Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200706135108.116134-1-bdenes@scylladb.com> (cherry picked from commit `5ebe2c28d1`)	2020-07-08 12:00:12 +03:00
Juliusz Stasiewicz	d396a298d6	cdc: Fix segfault when stream ID key is too short When a token is calculated for stream_id, we check that the key is exactly 16 bytes long. If it's not - `minimum_token` is returned and client receives empty result. This used to be the expected behavior for empty keys; now it's extended to keys of any incorrect length. Fixes #6570 (cherry picked from commit `8628ede009`)	2020-07-05 15:09:44 +03:00
Avi Kivity	c5e2fad1c8	Merge "Fix handling of decimals with negative scales" from Rafael " Before this series scylla would effectively infinite loop when, for example, casting a decimal with a negative scale to float. Fixes #6720 " * 'espindola/fix-decimal-issue' of https://github.com/espindola/scylla: big_decimal: Add a test for a corner case big_decimal: Correctly handle negative scales big_decimal: Add a as_rational member function big_decimal: Move constructors out of line (cherry picked from commit `3e2eeec83a`)	2020-06-29 12:05:39 +03:00
Alejo Sanchez	194ff1d226	lwt: validate before constructing metadata LWT batches conditions can't span multiple tables. This was detected in batch_statement::validate() called in ::prepare(). But ::cas_result_set_metadata() was built in the constructor, causing a bitset assert/crash in a reported scenario. This patch moves validate() to the constructor before building metadata. Closes #6332 Tested with https://github.com/scylladb/scylla-dtest/pull/1465 [avi: adjust spelling of exception message to 4.1 spelling] Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com> (cherry picked from commit `d1521e6721`)	2020-06-21 18:20:41 +03:00
Piotr Sarna	3f8345f1b8	alternator: fix the return type of PutItem Even if there are no attributes to return from PutItem requests, we should return a valid JSON object, not an empty string. Fixes #6568 Tests: unit(dev) (cherry picked from commit `8fc3ca855e`)	2020-06-21 12:21:19 +03:00
Piotr Sarna	891a3fa243	alternator: fix returning UnprocessedKeys unconditionally Client libraries (e.g. PynamoDB) expect the UnprocessedKeys and UnprocessedItems attributes to appear in the response unconditionally - it's hereby added, along with a simple test case. Fixes #6569 Tests: unit(dev) (cherry picked from commit `3aff52f56e`)	2020-06-21 12:19:18 +03:00
Kamil Braun	81dc8eeec7	cdc: rename CDC description tables Commit `968177da04` has changed the schema of cdc_topology_description and cdc_description tables in the system_distributed keyspace. Unfortunately this was a backwards-incompatible change: these tables would always be created, irrespective of whether or not "experimental" was enabled. They just wouldn't be populated with experimental=off. If the user now tries to upgrade Scylla from a version before this change to a version after this change, it will work as long as CDC is protected by the experimental flag and the flag is off. However, if we drop the flag, or if the user turns experimental on, weird things will happen, such as nodes refusing to start because they try to populate cdc_topology_description while assuming a different schema for this table. The simplest fix for this problem is to rename the tables. This fix must get merged in before CDC goes out of experimental. If the user upgrades his cluster from a pre-rename version, he will simply have two garbage tables that he is free to delete after upgrading. sstables and digests need to be regenerated for schema_digest_test since this commit effectively adds new tables to the system_distributed keyspace. This doesn't result in schema disagreement because the table is announced to all nodes through the migration manager. (cherry picked from commit `d89b7a0548`) Fixes #6537.	2020-06-14 09:15:36 +03:00
Raphael S. Carvalho	2d72f7d8e5	compaction: Disable garbage collected writer if interposer consumer is used GC writer, used for incremental compaction, cannot be currently used if interposer consumer is used. That's because compaction assumes that GC writer will be operated only by a single compaction writer at a given point in time. With interposer consumer, multiple writers will concurrently operate on the same GC writer, leading to race condition which potentially result in use-after-free. Let's disable GC writer if interposer consumer is enabled. We're not losing anything because GC writer is currently only needed on strategies which don't implement an interposer consumer. Resharding will always disable GC writer, which is the expected behavior because it doesn't support incremental compaction yet. The proper fix, which allows GC writer and interposer consumer to work together, will require more time to implement and test, and for that reason, I am postponing it as #6472 is a showstopper for the current release. Fixes #6472. tests: mode(dev). [Raphael: Fixed compilation failure in unit test test_bug_6472 for backport] Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Reviewed-by: Glauber Costa <glauber@scylladb.com> (cherry picked from commit `097a5e9e07`) Message-Id: <20200610203928.86717-1-raphaelsc@scylladb.com>	2020-06-11 13:21:56 +03:00
Nadav Har'El	bf509c3b16	alternator: add mandatory configurable write isolation mode Alternator supports four ways in which write operations can use quorum writes or LWT or both, which we called "write isolation policies". Until this patch, Alternator defaulted to the most generally safe policy, "always_use_lwt". This default could have been overridden for each table separately, but there was no way to change this default for all tables. This patch adds a "--alternator-write-isolation" configuration option which allows changing the default. Moreover, @dorlaor asked that users must explicitly choose this default mode, and not get "always_use_lwt" without noticing. The previous default, "always_use_lwt" supports any workload correctly but because it uses LWT for all writes it may be disappointingly slow for users who run write-only workloads (including most benchmarks) - such users might find the slow writes so disappointing that they will drop Scylla. Conversely, a default of "forbid_rmw" will be faster and still correct, but will fail on workloads which need read-modify-write operations - and surprise users that need these operations. So Dor asked that none of the write modes be made the default, and users must make an informed choice between the different write modes, rather than being disappointed by a default choice they weren't aware of. So after this patch, Scylla refuses to boot if Alternator is enabled but a "--alternator-write-isolation" option is missing. The patch also modifies the relevant documentation, adds the same option to our docker image, and modifies the test-running script test/alternator/run to run Scylla with the old default mode (always_use_lwt), which we need because we want to test RMW operations as well. Fixes #6452 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200524160338.108417-1-nyh@scylladb.com> (cherry picked from commit `c3da9f2bd4`)	2020-05-31 13:42:11 +03:00
Avi Kivity	8a026b8b14	Revert "compaction_manager: allow early aborts through abort sources." This reverts commit `e8213fb5c3`. It results in an assertion failure in remove_index_file_test. Fixes #6413. (cherry picked from commit `5b971397aa`)	2020-05-13 18:26:34 +03:00
Tomasz Grabiec	2078016f84	test: memory_footprint: Avoid invalid identifiers as columnnames Column name should not start with a digit, as can be the case with random_string(). Message-Id: <1588860648-15796-1-git-send-email-tgrabiec@scylladb.com>	2020-05-07 17:33:34 +03:00
Pavel Emelyanov	ef181fb2d0	test: Add option to flush memtables for perf_simple_query The test in question measures the speed of memtables, not the row_cache. With this option it can do both. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20200507140603.12350-1-xemul@scylladb.com>	2020-05-07 16:09:40 +02:00
Ivan Prisyazhnyy	84e25e8ba4	api: support table auto compaction control The patch implements: - /storage_service/auto_compaction API endpoint - /column_family/autocompaction/{name} API endpoint Those APIs allow to control and request the status of background compaction jobs for the existing tables. The implementation introduces the table::_compaction_disabled_by_user. Then the CompactionManager checks if it can push the background compaction job for the corresponding table. New members === table::enable_auto_compaction(); table::disable_auto_compaction(); bool table::is_auto_compaction_disabled_by_user() const Test === Tests: unit(sstable_datafile_test autocompaction_control_test), manual $ ninja build/dev/test/boost/sstable_datafile_test $ ./build/dev/test/boost/sstable_datafile_test --run_test=autocompaction_control_test -- -c1 -m2G --overprovisioned --unsafe-bypass-fsync 1 --blocked-reactor-notify-ms 2000000 The test tries to submit a compaction job after playing with autocompaction control table switch. However, there is no reliable way to hook pending compaction task. The code assumed that with_scheduling_group() closure will never preempt execution of the stats check. Revert === Reverts commit `c8247ac`. In previous version the execution sometimes resulted into the following error: test/boost/sstable_datafile_test.cc(1076): fatal error: in "autocompaction_control_test": critical check cm->get_stats().pending_tasks == 1 \|\| cm->get_stats().active_tasks == 1 has failed This version adds a few sstables to the cf, starts the compaction and awaits until it is finished. API change === - `/column_family/autocompaction/` always returned `true` while answering to the question: if the autocompaction disabled (see https://github.com/scylladb/scylla-jmx/blob/master/src/main/java/org/apache/cassandra/db/ColumnFamilyStore.java#L321). now it answers to the question: if the autocompaction for specific table is enabled. The question logic is inverted. 
The patch to the JMX is required. However, the change is decent because all old values were invalid (it always reported all compactions are disabled). - `/column_family/autocompaction/` got support for POST/DELETE per table Fixes === Fixes #1488 Fixes #1808 Fixes #440 Signed-off-by: Ivan Prisyazhnyy <ivan@scylladb.com> Reviewed-by: Glauber Costa <glauber@scylladb.com>	2020-05-07 16:23:38 +03:00
Nadav Har'El	f12989ff73	alternator/test: minor cleanup in test_key_condition_expression.py Some minor cleanups, mostly comments, in test_key_condition_expression.py Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200506212849.16207-1-nyh@scylladb.com>	2020-05-07 13:58:44 +02:00
Botond Dénes	791acc7f38	sstables: sstable_reader: fix read range upper bound calculation for reverse slices The single-key sstable reader uses the clustering ranges from the slice to determine the upper bound of the disk read-range using the index. For this it simply uses the end bound of the last clustering range. For reverse reads however the clustering ranges in the slice are in reverse order, so this will in fact be the upper bound of the smallest range. Depending on whether the distance between the clustering ranges is big enough for the sstable reader to use the index to skip between them, this will lead to either reading too little data or an assert failure. This patch fixes the problematic function `get_slice_upper_bound()` to consider reverse reads as well. Initially I thought there will be more mishandling of reverse slices, but actually `mutation_fragment_filter`, the component doing the actual slicing of rows, is already reverse-slice aware. A unit test which reproduces the assert failure is also added. Fixes: #6171 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200507114956.271799-1-bdenes@scylladb.com>	2020-05-07 14:52:04 +03:00
Avi Kivity	bef8e5e930	Merge "Don't invalidate row cache when adding GC SStable to SSTable Set" from Raphael " Garbage collected SSTables, created by incremental compaction process, are being added to the SSTable set using a function that invalidates row cache using the range of the SSTable itself. That's incorrect because data in GC SSTables come from preexisting SSTables in set, meaning the state of data isn't changed and so no need for invalidation at all. Incorrect invalidation like this is a source of read performance issues. This problem is fixed by including GC SSTables to the descriptor which is used to specify changes to the SSTable set, which is the correct thing to do given that a midway failure could leave the set in an incorrect state. Fixes #5956. Fixes #6275. tests: unit(dev) " * 'fix_issue_5956_v4' of github.com:raphaelsc/scylla: sstables/compaction: Don't invalidate row cache when adding GC SSTable to SSTable set sstables/compaction: Change meaning of compaction_completion_desc input and output fields sstables/compaction: Clean up code around garbage_collected_sstable_writer	2020-05-07 14:10:49 +03:00
Glauber Costa	e8213fb5c3	compaction_manager: allow early aborts through abort sources. The shutdown process of compaction manager starts with an explicit call from the database object. However that can only happen once everything is already initialized. This works well today, but I am soon to change the resharding process to operate before the node is fully ready. One can still stop the database in this case, but reshardings will have to finish before the abort signal is processed. This patch passes the existing abort source to the construction of the compaction_manager and subscribes to it. If the abort source is triggered, the compaction manager will react to it firing and all compactions it manages will be stopped. We still want the database object to be able to wait for the compaction manager, since the database is the object that owns the lifetime of the compaction manager. To make that possible we'll use a future that is returned from stop(): no matter what triggered the abort, either an early abort during initial resharding or a database-level event like drain, everything will shut down in the right order. The abort source is passed to the database, which is responsible for constructing the compaction manager. Tests: unit (dev), manual start+stop, manual drain + stop Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20200506184749.98288-1-glauber@scylladb.com>	2020-05-07 13:24:47 +03:00
Avi Kivity	fbf2194b31	Merge 'cql3: Fix detection of bound variables in tuples' from Juliusz This is unrelated to counters, but happens to fix #4209 `tuple::delayed_value::contains_bind_marker` used to check that ALL terms are bound (not that ANY of them is bound). As a result, scylla would crash in prepare codepath for collections of tuples. After this fix `invalid_request_exception` is thrown instead. * jul-stas-4209-crash-on-counter-shards-set: boost/tests: test for bound variable in a list of tuple literals cql3: fix detection of bound variables in tuples	2020-05-07 13:13:51 +03:00
Juliusz Stasiewicz	7b48d8c33c	boost/tests: test for bound variable in a list of tuple literals This test checks that the list literals of tuples with some (but not all!) bind markers are rejected.	2020-05-07 11:03:53 +02:00
Pavel Solodovnikov	55d89d2cbe	lwt: add cql tests to test delete+insert behavior on the same row in one batch Add a couple of cql tests regarding conditional batches: 1. Verify that "delete" takes priority over "insert" when applied to the same row within the same batch. 2. Test that a workaround for the issue works as expected (i.e. delete only individual cells instead of the full record). Tests: unit(dev) Fixes: #6273 Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Message-Id: <20200506201200.176590-1-pa.solodovnikov@scylladb.com>	2020-05-07 10:53:22 +02:00
Avi Kivity	2b0c317dec	test: lib: exception_utils: fix crash with fmt-6.2.0 fmt, the formatting library we use, detects types with conversion to std::string_view (and formats them as strings) and types that support operator<<(std::ostream, const T&) (and performs custom formatting on them). However, if <fmt/ostream.h> is not included, the latter is not done. The problem happens with seastar::sstring, which implements both, and debug mode, which disables inlining. Some translation units do include <fmt/ostream.h>, and so generate code to do custom formatting. exception_utils.cc doesn't, and so generates code to format via string_view conversion. At link time, the compiler picks one of the generated functions and includes it in the final binary; it happened to pick one generated outside exception_utils.cc, using custom formatting. However, there is also code in fmt to encode which path fmt chose - string_view or custom. This code is constexpr and so is evaluated in exception_utils.cc. The result is that the function to perform formatting of seastar::sstring uses custom formatting, while the descriptor containing the method used says it is formatting via string_view. This is enough to cause a crash. The problem is limited to debug mode, since in other modes all this code is inlined, and so is consistent within the translation unit. We need a more general fix (hopefully in fmt), but for now a simple fix is to add the missing include. Ref https://github.com/fmtlib/fmt/issues/1662	2020-05-07 08:59:02 +03:00
Avi Kivity	6f1a8cfeea	Merge 'Use special partitioner for CDC Log' from Piotr " CDC has to create CDC streams that are co-located with corresponding BaseTable data. This is not always easy. Especially for small vnodes. This PR introduces new partitioner which allows us to easily find such stream ids that the stream belongs to a given vnode and shard. The idea is that a partitioner accepts only keys that are a blob composed of two int64 numbers. The first number is the token of the key. Tests: unit(dev), dtests(CDC) " * haaawk-cdc_partitioner: cdc:use CDCPartitioner for CDC Log dht: Add find_first_token_for_shard dht: use long_token in token::to_int64 cdc: add CDCPartitioner stream_id: add token_from_bytes static function i_partitioner: Stop distinguishing whether keys order is preserved	2020-05-06 20:29:27 +03:00
Nadav Har'El	ddb483461a	test/alternator: xfailing tests for FilterExpression feature This patch adds a comprehensive, hopefully complete, test for the yet-unimplemented FilterExpression feature. FilterExpression is the modern syntax which allows filtering the results of Query and Scan requests. The patch includes 50 tests spanning more than 700 lines of code, testing (hopefully) all the various FilterExpression features, sub-cases, syntax peculiarities, and so on. As usual, all included tests pass when run against DynamoDB ("pytest --aws") and xfail when run against Scylla. This test should be helpful to understand how to implement FilterExpression correctly, as well as test the future implementation. Refs #5038. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200503165639.15320-1-nyh@scylladb.com>	2020-05-06 12:56:20 +03:00
Raphael S. Carvalho	a214ccdf89	sstables/compaction: Don't invalidate row cache when adding GC SSTable to SSTable set Garbage collected SSTable is incorrectly added to SSTable set with a function that invalidates row cache. This problem is fixed by adding GC SStable to set using mechanism which replaces old sstables with new sstables. Also, adding GC SSTable to set in a separate call is not correct. We should make sure that GC SSTable reaches the SSTable set at the same time its respective old (input) SSTable is removed from the set, and that's done using a single request call to table. Fixes #5956. Fixes #6275. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2020-05-05 12:03:19 -03:00
Raphael S. Carvalho	8f4458f1d5	sstables/compaction: Change meaning of compaction_completion_desc input and output fields input_sstables is renamed to old_sstables and is about old SSTables that should be deleted and removed from the SSTable set. output_sstables is renamed to new_sstables and is about new SSTable that should be added to the SSTable set, replacing the old ones. This will allow us, for example, to add auxiliary SSTables to SSTable set using the same call which replaces output SSTables by input SSTables in compaction. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2020-05-05 12:03:08 -03:00
Benny Halevy	580d397d2e	test: database_test: do_with_some_data: retain tmpdir for test duration Currently, the test seems to use the tmpdir class in a wrong way, just to get a path to a temporary directory. It should keep the tmpdir object around for the duration of the test so the temporary directory will be automatically removed when the test completes. Refs #6344 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20200504153810.202218-1-bhalevy@scylladb.com>	2020-05-05 11:37:18 +03:00
Piotr Sarna	1c4e8f5030	alternator: fix checking max item depth Maximum item depth accepted by DynamoDB is 32, and alternator chose 39 as its arbitrary value in order to provide 7 shining new levels absolutely free of charge. Unfortunately, our code which checks the nesting level in rapidjson parsing bumps the counter by 2 for every object, which is due to rapidjson's internal implementation. In order to actually support at least 32 levels, the threshold is simply doubled. This commit comes with a test case which ensures that 32-nested items are accepted both by alternator and DynamoDB. The test case failed for alternator before the fix. Fixes #6366 Tests: unit(dev), alternator(local, remote)	2020-05-04 23:46:20 +03:00
Glauber Costa	c5cdd77f8e	gossip_test: start the compaction manager explicitly Right now the compaction_manager needs to be started explicitly. We may change it in the future, but right now that's how it is. Everything works now even without it, because compaction_manager::stop happens to work even if it was not started. But it is technically illegal. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20200504143048.17201-1-glauber@scylladb.com>	2020-05-04 17:40:32 +03:00
Avi Kivity	f3bcd4d205	Merge 'Support SSL Certificate Hot Reloading' from Calle " Fixes #6067 Makes the scylla endpoint initializations that support TLS use reloadable certificate stores, watching used cert + key files for changes, and reload iff modified. Tests in separate dtest set. " * elcallio-calle/reloadable-tls: transport: Use reloadable tls certificates redis: Use reloadable tls certificates alternator: Use reloadable tls certificates messaging_service: Use reloadable TLS certificates	2020-05-04 15:11:16 +03:00
Piotr Sarna	bec95a0605	treewide: use thread-safe variant of localtime In order to ensure thread-safety, all usages of localtime() are replaced with localtime_r(), which may accept a local buffer. Tests: unit(dev) Fixes #6364 Message-Id: <ad4a0c0e1707f0318325718715a3a647e3ebfdfe.1588592156.git.sarna@scylladb.com>	2020-05-04 14:46:08 +03:00
Calle Wilund	08d069f78d	messaging_service: Use reloadable TLS certificates Changes messaging service rpc to use reloadable tls certificates iff tls is enabled- Note that this means that the service cannot start listening at construction time if TLS is active, and user need to call start_listen_ex to initialize and actually start the service. Since "normal" messaging service is actually started from gms, this route too is made a continuation.	2020-05-04 11:32:21 +00:00
Glauber Costa	55f5ca39a9	sstable_test: rework test to use a thread The compaction_manager test lives inside a thread and it is not taking advantage of it, with continuations all over. One of the side effects of it is that the test is calling stop() twice on the compaction_manager. While this works today, it is not good practice. A change I am making is just about to break it. This patch converts the test to fully use .get() instead of chained continuations and in doing so also guarantees that the compaction manager will be RAII-stopped just once, from a defer object. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20200503161420.8346-2-glauber@scylladb.com>	2020-05-03 19:54:04 +03:00
Glauber Costa	70e5252a5d	table: no longer accept online loading of SSTable files in the main directory Loading SSTables from the main directory is possible, to be compatible with Cassandra, but extremely dangerous and not recommended. From the beginning, we recommend using a separate upload/ directory. In all this time, perhaps due to how the feature's usefulness is reduced in Cassandra due to the possible races, I have never seen anyone coming from Cassandra doing procedures involving refresh at all. Loading SSTables from the main directory forces us to disable writes to the table temporarily until the SSTables are sorted out. If we get rid of this, we can get rid of the disabling of the writes as well. We can't do it now because if we want to be nice to the odd user that may be using refresh through the main directory without our knowledge we should at least error out. This patch, then, does that: it errors out if SSTables are found in the main directory. It will not proceed with the refresh, and direct the user to the upload directory. The main loop in reshuffle_sstables is left in place structurally for now, but most of it is gone. The test for it is deleted. After a period of deprecation we can start ignoring these SSTables and get rid of the lock. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20200429144511.13681-1-glauber@scylladb.com>	2020-05-03 08:40:38 +03:00
Avi Kivity	8925e00e96	Merge 'Fix hang in multishard_writer' from Asias " This series fixes a hang in multishard_writer when an error happens. It contains - multishard_writer: Abort the queue attached to consumers when producer fails - repair: Fix hang when the writer is dead Fixes #6241 Refs: #6248 " * asias-stream_fix_multishard_writer_hang: repair: Fix hang when the writer is dead mutation_writer_test: Add test_multishard_writer_producer_aborts multishard_writer: Abort the queue attached to consumers when producer fails	2020-04-30 12:27:55 +03:00
Nadav Har'El	ff5615d59d	alternator test: drastically reduce time to boot Scylla The alternator test, test/alternator/run, runs Scylla and runs the various tests against it. Before this patch, just booting Scylla took about 26 seconds (for a dev build, on my laptop). This patch reduces this delay to less than one second! It turns out that almost the entire delay was artificial, two periods of 12 seconds "waiting for the gossip to settle", which are completely unnecessary in the one-node cluster used in the Alternator test. So a simple "--skip-wait-for-gossip-to-settle 0" parameter eliminates these long delays completely. Amusingly, the Scylla boot is now so fast, that I had to change a "sleep 2" in the test script to "sleep 1", because 2 seconds is now much more than it takes to boot Scylla :-) Fixes #6310. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200428145035.22894-1-nyh@scylladb.com>	2020-04-29 07:55:03 +02:00
Piotr Sarna	09e4f3b917	alternator: implement ScanIndexForward The ScanIndexForward parameter is now fully implemented and can accept ScanIndexForward=false in order to query the partitions in reverse clustering order. Note that reading partition slices in reverse order is less efficient than forward scans and may put a strain on memory usage, especially for large partitions, since the whole partition is currently fetched in order to be reversed. Fixes #5153	2020-04-28 11:44:46 +03:00
Piotr Sarna	be5d3f4733	Merge 'A bunch of refactors in versioned_value and gossiper' from Kamil 1. Remove the `versioned_value::factory` class, it didn't add any value. It just forced us to create an object for making `versioned_value`s, for no sensible reason. 2. Move some `versioned_value` deserialization code (string -> internal data structures) into the versioned_value module. Previously, it was scattered all around the place. 3. Make `gossiper::get_seeds` const and return a const reference. I needed these refactors for a PR I was preparing to fix an issue with CDC. The attempt of fixing the issue failed (I'm trying something different now), but the refactors might be useful anyway. * kbr--vv-refactor: gossiper: make `get_seeds` method const and return a const ref versioned_value: remove versioned_value::factory class gms: move TOKENS string deserialization code into versioned_value	2020-04-28 10:27:45 +02:00
Raphael S. Carvalho	5ac0d31323	test: perf_simple_query: fix test with smp count > 1 that code doesn't run under a thread, so let's futurize it. the code worked with single cpu because get() returns right away due to no deferring point. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20200427155303.82763-1-raphaelsc@scylladb.com>	2020-04-27 18:58:25 +03:00
Piotr Sarna	c32faee657	Merge 'counters: Fix filtering of counters' from Juliusz Queries with `ALLOW FILTERING` and constraints on counter values used to be rejected as "unimplemented". The reason was a missing tri-comparator, which is added in this patch. Fixes #5635 * jul-stas-5635-filtering-on-counters: cql/tests: Added test for filtering on counter columns counters: add comparator and remove `unimplemented` from restrictions	2020-04-27 13:53:34 +02:00
Juliusz Stasiewicz	afee590ed7	cql/tests: Added test for filtering on counter columns Tested predicates: IN, EQ, GE, GT, LE, LT. Untouched counters are expected to evaluate as 0. Deleted counters are expected not to appear at all.	2020-04-27 13:36:16 +02:00
Rafael Ávila de Espíndola	0d89bbd57f	row_cache_alloc_stress_test: Make sure GCC can't delete a new We want to test that a std::bad_alloc is thrown, but GCC 10 has a new optimization (-fallocation-dce) that removes dead allocations. This patch assigns the value returned by new to a global so that GCC cannot delete it. With this all tests in a dev build pass with GCC 10. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200424201531.225807-1-espindola@scylladb.com>	2020-04-26 15:22:04 +03:00
Rafael Ávila de Espíndola	543a9ebd9b	tests: Wait for a few futures GCC 10 now warns on these. This fixes the dev build with gcc 10. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200424161006.17857-1-espindola@scylladb.com>	2020-04-26 15:20:40 +03:00
Nadav Har'El	92e36c5df5	test/alternator: increase timeout on Scylla boot The Alternator test boots Scylla to test against it. We set an arbitrary timeout for this boot to succeed: 100 seconds. This 100 seconds is significantly more than the 25 seconds it takes on my laptop, and I thought we'd never reach it. But it turns out that in some setups - running the very slow debug build on slow and overcommitted nodes - 100 seconds is not enough. So this patch doubles the timeout to 200 seconds. Note that this "200 seconds" is just a timeout, and doesn't affect normal runs: Both a successful boot and a failed boot are recognized as soon as they happen, and we never unnecessarily wait the entire 200 seconds. Fixes #6271. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200422193920.17079-1-nyh@scylladb.com>	2020-04-23 07:47:21 +02:00

453 Commits