scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-05 14:33:08 +00:00

Author	SHA1	Message	Date
Tomasz Grabiec	c95dd67d11	utils: Introduce cached_file It is a read-through cache of a file. Will be used to cache contents of the promoted index area from the index file. Currently, cached pages are evicted manually using the invalidate_*() method family, or when the object is destroyed. The cached_file represents a subset of the file. The reason for this is to satisfy two requirements. One is that we have a page-aligned caching, where pages are aligned relative to the start of the underlying file. This matches requirements of the seastar I/O engine on I/O requests. Another requirement is to have an effective way to populate the cache using an unaligned buffer which starts in the middle of the file when we know that we won't need to access bytes located before the buffer's position. See populate_front(). If we couldn't assume that, we wouldn't be able to insert an unaligned buffer into the cache.	2020-06-16 16:15:23 +02:00
Kamil Braun	a1e235b1a4	CDC: Don't split collection tombstone away from base update Overwriting a collection cell using timestamp T is a process with the following steps: 1. inserting a row marker (if applicable) with timestamp T; 2. writing a collection tombstone with timestamp T-1; 3. writing the new collection value with timestamp T. Since CDC clusters operations by timestamp, this would result in 3 separate calls to `transform` in the case of INSERT (or 2 in the case of UPDATE), which seems excessive, especially when pre-/postimage is enabled. This patch treats collection tombstones as if they had the same TS as the base write, so they are processed in one call to `transform` (as long as TTLs are not used). Also, `cdc_test` had to be updated in places that relied on the former splitting strategy. Fixes #6084	2020-06-07 17:09:05 +03:00
Raphael S. Carvalho	8e47f61df7	compaction: Enable tombstone expiration based on the presence of the sstable set For tombstone expiration to proceed correctly without the risk of resurrecting data, the sstable set must be present. Regular compaction and derivatives provide the sstable set, so they're able to expire tombstones with no resurrection risk. Resharding, on the other hand, can run on any shard, not necessarily on the same shard that one of the input sstables belongs to, so it currently cannot provide an sstable set for tombstone expiration to proceed safely. That being said, let's only do expiration based on the presence of the set. This makes room for the sstable set to be fed to compaction via descriptor, allowing even resharding to do expiration. Currently, compaction thinks that the sstable set can only come from the table, and that also needs to be changed for further flexibility. It's theoretically possible that a given resharding job will resurrect data if a fully expired SSTable is resharded at a shard which it doesn't belong to. Resharding will have no way to tell that expiring all that data will lead to resurrection because the relevant SSTables are at different shards. This is fixed by checking for fully expired sstables only in the presence of the sstable set. Fixes #6600. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20200605200954.24696-1-raphaelsc@scylladb.com>	2020-06-07 11:46:48 +03:00
Kamil Braun	1b7f1806ac	test: improve comments on test_schema_digest_does_not_change This test tends to cause a lot of discussion resulting from not understanding what is actually being tested. Closes https://github.com/scylladb/scylla/issues/6582.	2020-06-05 14:30:02 +02:00
Kamil Braun	d89b7a0548	cdc: rename CDC description tables Commit `968177da04` has changed the schema of cdc_topology_description and cdc_description tables in the system_distributed keyspace. Unfortunately this was a backwards-incompatible change: these tables would always be created, irrespective of whether or not "experimental" was enabled. They just wouldn't be populated with experimental=off. If the user now tries to upgrade Scylla from a version before this change to a version after this change, it will work as long as CDC is protected by the experimental flag and the flag is off. However, if we drop the flag, or if the user turns experimental on, weird things will happen, such as nodes refusing to start because they try to populate cdc_topology_description while assuming a different schema for this table. The simplest fix for this problem is to rename the tables. This fix must get merged in before CDC goes out of experimental. If the user upgrades his cluster from a pre-rename version, he will simply have two garbage tables that he is free to delete after upgrading. sstables and digests need to be regenerated for schema_digest_test since this commit effectively adds new tables to the system_distributed keyspace. This doesn't result in schema disagreement because the table is announced to all nodes through the migration manager.	2020-06-05 09:59:16 +02:00
Avi Kivity	0c34e114e2	Merge "Upgrade to seastar api version 3" (make_file_output_stream returns future) from Rafael " The new seastar api changes make_file_output_stream and make_file_data_sink to return futures. This series includes a few refactoring patches and the actual transition. " * 'espindola/api-v3-v3' of https://github.com/espindola/scylla: table: Fix indentation everywhere: Move to seastar api level 3 sstables: Pass an output_stream to make_compressed_file_.*_format_output_stream sstables: Pass a data_sink to checksummed_file_writer's constructor sstables: Convert a file_writer constructor to a static make sstables: Move file_writer constructor out of line	2020-06-03 23:09:49 +03:00
Rafael Ávila de Espíndola	e5876f6696	everywhere: Move to seastar api level 3 Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-06-03 10:32:46 -07:00
Rafael Ávila de Espíndola	13282b3d4c	sstables: Pass an output_stream to make_compressed_file_.*_format_output_stream This is a bit simpler as we don't have to pass in the options and moves the calls to make_file_output_stream to places where we can handle futures. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-06-03 10:32:46 -07:00
Raphael S. Carvalho	fb6976f1b9	Make sure SSTables created by streaming are added to backlog tracker New SSTables are only added to backlog tracker if set_unshared() was called on their behalf. SSTables created for streaming are not being added to the tracker because neither make_streaming_sstable_for_write() nor its caller calls set_unshared(). As a result, the backlog does not account for their existence, which means backlog will be much lower than expected. This problem could be fixed by adding a set_unshared() call but it turns out we don't even need set_unshared() anymore. It was introduced when Scylla metadata didn't exist; now an SSTable has built-in knowledge of whether or not it's shared. Relying on every SSTable creator calling set_unshared() is bug prone. Let's get rid of it and let the SSTable itself say whether or not it's shared. If an imported SSTable has no Scylla metadata, Scylla will still be able to compute shards using token range metadata. Refs #6021. Refs #6227. Fixes #6441. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20200512220226.134481-1-raphaelsc@scylladb.com>	2020-06-03 17:35:22 +03:00
Tomasz Grabiec	087fa42c1d	Merge "utils: inject errors around paxos stages" from Alejo Add Paxos error injections before/after save promise, proposal, decision, paxos_response_handler, delete decision. Adds a method to inject an error by providing a lambda, without adding a continuation when the error injection is disabled. For this, provide an error exception and enter() to allow flow control (i.e. return) on simple error injections without lambdas. Also includes Pavel's patch for CQL API for error injections, updated to current error injection API and added one_shot support. Also added some basic CQL API boost tests. For CQL API there's a limitation of the current grammar not supporting f(<terminal>) so values have to be inserted in a table until this is resolved. See #5411 * https://github.com/alecco/scylla/tree/error_injection_v11: paxos: fix indentation paxos: add error injections utils: add timeout error injection with lambda utils: error injection add enter() for control flow utils: error injections provide error exceptions failure_injector: implement CQL API for failure injector class lwt: fix disabled error injection templates	2020-06-03 15:42:10 +02:00
Alejo Sanchez	a8b14b0227	utils: add timeout error injection with lambda Even though calling then() on a ready future does not allocate a continuation, calling then() on the result of it will allocate. This error injection only adds a continuation in the dependency chain if error injections are enabled at compile time and this particular error injection is enabled. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2020-06-03 14:44:00 +02:00
Alejo Sanchez	0321172677	utils: error injection add enter() for control flow For control flow (i.e. return) and simplicity add enter() method. For disabled injections, this method is const returning false, therefore it has no overhead. Add boost test. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2020-06-03 14:42:48 +02:00
Piotr Sarna	ecc4a87a24	test: add test cases to big_decimal_test Test cases for big decimals were quite complete, but since the implementation was recently changed, some corner cases are added: - incorrect strings - numbers not fitting into uint64_t - numbers less than uint64_t::max themselves, but with the unscaled value exceeding the maximum	2020-06-01 16:11:49 +02:00
Botond Dénes	7c56e79355	test/multishard_mutation_query_test: eliminate another unsafely used boost test macro Boost test macros are not thread safe, using them from multiple threads results in garbled XML test report output. `3f1823a4f0` replaced most of the thread-unsafe boost test macros in multishard_mutation_query_test, but one still managed to slip through the cracks. This patch removes that as well. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200529130706.149603-3-bdenes@scylladb.com>	2020-05-31 16:08:02 +03:00
Botond Dénes	c5b0e8a45a	test: move thread-safe test macro alternatives to lib/test_utils.hh Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200529130706.149603-2-bdenes@scylladb.com>	2020-05-31 16:08:02 +03:00
Botond Dénes	7ea64b1838	test: mutation_reader_test: use <ranges> Replace all the ranges stuff we use from boost with the std equivalents. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200529141407.158960-3-bdenes@scylladb.com>	2020-05-31 12:58:59 +03:00
Avi Kivity	0c6bbc84cd	Merge "Classify queries based on their initiator, rather than their target" from Botond " Currently we classify queries as "system" or "user" based on the table they target. The class of a query determines how the query is treated, currently: timeout, limits for reverse queries and the concurrency semaphore. The catch is that users are also allowed to query system tables and when doing so they will bypass the limits intended for user queries. This has caused performance problems in the past, yet the reason we decided to finally address this is that we want to introduce a memory limit for unpaged queries. Internal (system) queries are all unpaged and we don't want to impose the same limit on them. This series uses scheduling groups to distinguish user and system workloads, based on the assumption that user workloads will run in the statement scheduling group, while system workloads will run in the main (or default) scheduling group, or perhaps something else, but in any case not in the statement one. Currently the scheduling group of reads and writes is lost when going through the messaging service, so to be able to use scheduling groups to distinguish user and system reads this series refactors the messaging service to retain this distinction across verb calls. Furthermore, we execute some system reads/writes as part of user reads/writes, such as auth and schema sync. These processes are tagged to run in the main group. This series also centralises query classification on the replica and moves it to a higher level. More specifically, queries are now classified -- the scheduling group they run in is translated to the appropriate query class specific configuration -- on the database level and the configuration is propagated down to the lower layers. Currently this query class specific configuration consists of the reader concurrency semaphore and the max memory limit for otherwise unlimited queries. 
A corollary of the semaphore being selected on the database level is that the read permit is now created before the read starts. A valid permit is now available during all stages of the read, enabling tracking the memory consumption of e.g. the memtable and cache readers. This change aligns nicely with the needs of more accurate reader memory tracking, which also wants a valid permit that is available in every layer. The series can be divided roughly into the following distinct patch groups: * 01-02: Give system read concurrency a boost during startup. * 03-06: Introduce user/system statement isolation to messaging service. * 07-13: Various infrastructure changes to prepare for using read permits in all stages of reads. * 14-19: Propagate the semaphore and the permit from database to the various table methods that currently create the permit. * 20-23: Migrate away from using the reader concurrency semaphore for waiting for admission, use the permit instead. * 24: Introduce `database::make_query_config()` and switch the database methods needing such a config to use it. * 25-31: Get rid of all uses of `no_reader_permit()`. * 32-33: Ban empty permits for good. * 34: querier_cache: use the queriers' permits to obtain the semaphore. Fixes: #5919 Tests: unit(dev, release, debug), dtest(bootstrap_test.py:TestBootstrap.start_stop_test_node), manual testing with a 2 node mixed cluster with extra logging. 
" * 'query-class/v6' of https://github.com/denesb/scylla: (34 commits) querier_cache: get semaphore from querier reader_permit: forbid empty permits reader_permit: fix reader_resources::operator bool treewide: remove all uses of no_reader_permit() database: make_multishard_streaming_reader: pass valid permit to multi range reader sstables: pass valid permits to all internal reads compaction: pass a valid permit to sstable reads database: add compaction read concurrency semaphore view: use valid permits for reads from the base table database: use valid permit for counter read-before-write database: introduce make_query_class_config() reader_concurrency_semaphore: remove wait_admission and consume_resources() test: move away from reader_concurrency_semaphore::wait_admission() reader_permit: resource_units: introduce add() mutation_reader: restricted_reader: work in terms of reader_permit row_cache: pass a valid permit to underlying read memtable: pass a valid permit to the delegate reader table: require a valid permit to be passed to most read methods multishard_mutation_query: pass a valid permit to shard mutation sources querier: add reader_permit parameter and forward it to the mutation_source ...	2020-05-29 10:11:44 +03:00
Raphael S. Carvalho	097a5e9e07	compaction: Disable garbage collected writer if interposer consumer is used GC writer, used for incremental compaction, cannot be currently used if interposer consumer is used. That's because compaction assumes that GC writer will be operated only by a single compaction writer at a given point in time. With interposer consumer, multiple writers will concurrently operate on the same GC writer, leading to a race condition which can potentially result in use-after-free. Let's disable GC writer if interposer consumer is enabled. We're not losing anything because GC writer is currently only needed on strategies which don't implement an interposer consumer. Resharding will always disable GC writer, which is the expected behavior because it doesn't support incremental compaction yet. The proper fix, which allows GC writer and interposer consumer to work together, will require more time to implement and test, and for that reason, I am postponing it as #6472 is a showstopper for the current release. Fixes #6472. tests: mode(dev). Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Reviewed-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20200526195428.230472-1-raphaelsc@scylladb.com>	2020-05-29 08:26:43 +02:00
Alejo Sanchez	bb08b5ad5a	utils: error injections provide error exceptions Provide non-timeout error exception to facilitate control flow in injected errors. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2020-05-28 11:13:55 +02:00
Pavel Solodovnikov	014883d560	failure_injector: implement CQL API for failure injector class The following UDFs are defined to control failure injector API usage: * enable_injection(name, args) * disable_injection(name) All arguments have string type. As currently function(terminal) is not supported by the parser, the arguments must come from selected rows. Added boost test for CQL API. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2020-05-28 11:13:55 +02:00
Alejo Sanchez	2c7e01a3b6	lwt: fix disabled error injection templates Fix disabled injection templates to match enabled ones. Fix corresponding test to not be a continuation. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2020-05-28 11:13:55 +02:00
Botond Dénes	e678f06a5e	querier_cache: get semaphore from querier Currently the `querier_cache` is passed a semaphore during its construction and it uses this semaphore to do all the inactive reader registering/unregistering. This is inaccurate as in theory cached reads could belong to different semaphores (although currently this is not yet the case). As all queriers store a valid permit now, use this permit to obtain the semaphore the querier is associated with, and register the inactive read with this semaphore.	2020-05-28 11:34:35 +03:00
Botond Dénes	d68ac8bf18	treewide: remove all uses of no_reader_permit()	2020-05-28 11:34:35 +03:00
Botond Dénes	e4c591aa67	database: introduce make_query_class_config() And use it to obtain any query-class specific configuration that was obtained from `table::config` before, such as the read concurrency semaphore and the max memory limit for unlimited queries. As all users of these items get these from the query class config now, we can remove them from `table::config`.	2020-05-28 11:34:35 +03:00
Botond Dénes	a08467da29	test: move away from reader_concurrency_semaphore::wait_admission() And use the reader_permit for this instead. This refactoring has revealed a pre-existing bug in the `test_lifecycle_policy`, which is also addressed in this patch. The bug is that said policy executes reader destructions in the background, and these are not waited for. For some reason, the semaphore -> permit transition pushes these races over the edge and we start seeing some of these destruction fibers still being unfinished when test scopes are exited, causing all sorts of trouble. The solution is to introduce a special gate that tests can use to wait for all background work to finish, before the test scope is exited.	2020-05-28 11:34:35 +03:00
Botond Dénes	4409579352	mutation_reader: restricted_reader: work in terms of reader_permit We want to refactor all read resource tracking code to work through the reader_permit, so refactor the restricted reader to also do so.	2020-05-28 11:34:35 +03:00
Botond Dénes	fe024cecdc	row_cache: pass a valid permit to underlying read All readers are soon going to require a valid permit, so make sure we have a valid permit which we can pass to the underlying reader when creating it. This means `row_cache::make_reader()` now also requires a permit to be passed to it.	2020-05-28 11:34:35 +03:00
Botond Dénes	9ede82ebf8	memtable: pass a valid permit to the delegate reader All readers are soon going to require a valid permit, so make sure we have a valid permit which we can pass to the delegate reader when creating it. This means `memtable::make_flat_reader()` now also requires a permit to be passed to it. Internally the permit is stored in `scanning_reader`, which is used both for flushes and normal reads. In the former case a permit is not required.	2020-05-28 11:34:35 +03:00
Botond Dénes	cc5137ffe3	table: require a valid permit to be passed to most read methods Now that the most prevalent users (range scan and single partition reads) all pass valid permits, we require all users to do so and propagate the permit down towards `make_sstable_reader()`. The plan is to use this permit for restricting the sstable readers, instead of the semaphore the table is configured with. The various `make_streaming_*reader()` overloads keep using the internal semaphores as before, but they also create the permit before the read starts and pass it to `make_sstable_reader()`.	2020-05-28 11:34:35 +03:00
Botond Dénes	d5ebd763ff	multishard_mutation_query: pass a valid permit to shard mutation sources In preparation of a valid permit being required to be passed to all mutation sources, create a permit before creating the shard readers and pass it to the mutation source when doing so. The permit is also persisted in the `shard_mutation_querier` object when saving the reader, which is another forward looking change, to allow the querier-cache to use it to obtain the semaphore the read is actually registered with.	2020-05-28 11:34:35 +03:00
Botond Dénes	bad53c4245	querier: add reader_permit parameter and forward it to the mutation_source In preparation of a valid permit being required to be passed to all mutation sources, also add a permit to the querier object, which is then passed to the source when it is used to create a reader.	2020-05-28 11:34:35 +03:00
Botond Dénes	14743c4412	data_query, mutation_query: use query_class_config We want to move away from the current practice of selecting the relevant read concurrency semaphore inside `table` and instead want to pass it down from `database` so that we can pass down a semaphore that is appropriate for the class of the query. Use the recently created `query_class_config` struct for this. This is added as a parameter to `data_query`, `mutation_query` and propagated down to the point where we create the `querier` to execute the read. We are already propagating down a parameter down the same route -- max_memory_reverse_query -- which also happens to be part of `query_class_config`, so simply replace this parameter with a `query_class_config` one. As the lower layers are not prepared for a semaphore passed from above, make sure this semaphore is the same that is selected inside `table`. After the lower layers are prepared for a semaphore arriving from above, we will switch it to be the appropriate one for the class of the query.	2020-05-28 11:34:35 +03:00
Botond Dénes	0b4ec62332	flat_mutation_reader: flat_multi_range_reader: add reader_permit parameter Mutation sources will soon require a valid permit so make sure we have one and pass it to the mutation sources when creating the underlying readers. For now, pass no_reader_permit() on call sites, deferring the obtaining of a valid permit to later patches.	2020-05-28 11:34:35 +03:00
Avi Kivity	829e2508d0	logalloc: fix entropy depletion in test_compaction_with_multiple_regions() test_compaction_with_multiple_regions() has two calls to std::shuffle(), one using std::default_random_engine() as the PRNG, but the other, later on, using the std::random_device directly. This can cause failures due to entropy pool exhaustion. Fix by making the `random` variable refer to the PRNG, not the random_device, and adjust the first std::shuffle() call. This hides the random_device so it can't be used more than once. Message-Id: <20200527124247.2187364-1-avi@scylladb.com>	2020-05-27 15:51:16 +03:00
Botond Dénes	3f1823a4f0	multishard_mutation_query_test: don't use boost test macros in multiple shards Boost test macros are not safe to use in multiple shards (threads). Doing so will result in their output being interwoven, making it unreadable and generating invalid XML test reports. There was a lot of back-and-forth on how to solve this, including introducing thread-safe wrappers of the boost test macros that use locks. This patch does something much simpler: it defines a bunch of replacement utility functions for the used macros. These functions use the thread-safe seastar logger to log messages and throw exceptions when the test has to be failed, which is pretty much what boost test does too. With this the previously seen complaint about invalid XML is gone. Example log messages from the utility functions: DEBUG 2020-05-27 13:32:54,248 [shard 1] testlog - check_equal(): OK @ validate_result() test/boost/multishard_mutation_query_test.cc:863: ckp{0004fe57c8d2} == ckp{0004fe57c8d2} DEBUG 2020-05-27 13:32:54,248 [shard 1] testlog - require(): OK @ validate_result() test/boost/multishard_mutation_query_test.cc:855 Fixes: #4774 Tests: unit(dev) Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200527104426.176342-1-bdenes@scylladb.com>	2020-05-27 15:50:05 +03:00
Pekka Enberg	8721534dfb	Merge "tests: avoid exhausting random_device entropy" from Avi " In several tests we were calling random_device::operator() in a tight loop. This is a slow operation, and in gcc 10 can fail if called too frequently due to a bug [1]. Change to use a random_engine instead, seeded once from the random_device. Tests: unit (dev) [1] https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94087 " * 'entropy' of git://github.com/avikivity/scylla: tests: lsa_sync_eviction_test: don't exhaust random number entropy tests: querier_cache_test: don't exhaust random number entropy tests: loading_cache_test: don't exhaust random number entropy tests: dynamic_bitset_test: don't exhaust random number entropy	2020-05-27 08:40:06 +03:00
Kamil Braun	7a98db2ab3	cdc: set ttl column in log rows which update only collections	2020-05-27 08:40:05 +03:00
Avi Kivity	8d27e1b4a9	Merge 'Propagate tracing to materialized view update path' from Piotr S In order to improve materialized views' debuggability, tracing points are added to view update generation path. Example trace: ``` ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+----------------------------+-----------+----------------+----------- Execute CQL3 query \| 2020-04-27 13:13:46.834000 \| 127.0.0.1 \| 0 \| 127.0.0.1 Parsing a statement [shard 0] \| 2020-04-27 13:13:46.834346 \| 127.0.0.1 \| 1 \| 127.0.0.1 Processing a statement [shard 0] \| 2020-04-27 13:13:46.834426 \| 127.0.0.1 \| 80 \| 127.0.0.1 Creating write handler for token: -3248873570005575792 natural: {127.0.0.1, 127.0.0.3} pending: {} [shard 0] \| 2020-04-27 13:13:46.834494 \| 127.0.0.1 \| 148 \| 127.0.0.1 Creating write handler with live: {127.0.0.3, 127.0.0.1} dead: {} [shard 0] \| 2020-04-27 13:13:46.834507 \| 127.0.0.1 \| 161 \| 127.0.0.1 Sending a mutation to /127.0.0.3 [shard 0] \| 2020-04-27 13:13:46.834519 \| 127.0.0.1 \| 173 \| 127.0.0.1 Executing a mutation locally [shard 0] \| 2020-04-27 13:13:46.834532 \| 127.0.0.1 \| 186 \| 127.0.0.1 View updates for ks.t require read-before-write - base table reader is created [shard 0] \| 2020-04-27 13:13:46.834570 \| 127.0.0.1 \| 224 \| 127.0.0.1 Reading key {{-3248873570005575792, pk{000400000002}}} from sstable /home/sarna/.ccm/scylla-1/node1/data/ks/t-162ef290887811eaa4bf000000000000/mc-1-big-Data.db [shard 0] \| 2020-04-27 13:13:46.834608 \| 127.0.0.1 \| 262 \| 127.0.0.1 /home/sarna/.ccm/scylla-1/node1/data/ks/t-162ef290887811eaa4bf000000000000/mc-1-big-Index.db: scheduling bulk DMA read of size 8 at offset 0 [shard 0] \| 2020-04-27 13:13:46.834635 \| 127.0.0.1 \| 289 \| 127.0.0.1 /home/sarna/.ccm/scylla-1/node1/data/ks/t-162ef290887811eaa4bf000000000000/mc-1-big-Index.db: finished bulk DMA read of size 8 at offset 0, 
successfully read 8 bytes [shard 0] \| 2020-04-27 13:13:46.834975 \| 127.0.0.1 \| 629 \| 127.0.0.1 Message received from /127.0.0.1 [shard 0] \| 2020-04-27 13:13:46.834988 \| 127.0.0.3 \| 11 \| 127.0.0.1 /home/sarna/.ccm/scylla-1/node1/data/ks/t-162ef290887811eaa4bf000000000000/mc-1-big-Data.db: scheduling bulk DMA read of size 41 at offset 0 [shard 0] \| 2020-04-27 13:13:46.835015 \| 127.0.0.1 \| 669 \| 127.0.0.1 View updates for ks.t require read-before-write - base table reader is created [shard 0] \| 2020-04-27 13:13:46.835020 \| 127.0.0.3 \| 44 \| 127.0.0.1 Generated 1 view update mutations [shard 0] \| 2020-04-27 13:13:46.835080 \| 127.0.0.3 \| 104 \| 127.0.0.1 Sending view update for ks.t_v2_idx_index to 127.0.0.2, with pending endpoints = {}; base token = -3248873570005575792; view token = 3728482343045213994 [shard 0] \| 2020-04-27 13:13:46.835095 \| 127.0.0.3 \| 119 \| 127.0.0.1 Sending a mutation to /127.0.0.2 [shard 0] \| 2020-04-27 13:13:46.835105 \| 127.0.0.3 \| 129 \| 127.0.0.1 View updates for ks.t were generated and propagated [shard 0] \| 2020-04-27 13:13:46.835117 \| 127.0.0.3 \| 141 \| 127.0.0.1 /home/sarna/.ccm/scylla-1/node1/data/ks/t-162ef290887811eaa4bf000000000000/mc-1-big-Data.db: finished bulk DMA read of size 41 at offset 0, successfully read 41 bytes [shard 0] \| 2020-04-27 13:13:46.835160 \| 127.0.0.1 \| 813 \| 127.0.0.1 Sending mutation_done to /127.0.0.1 [shard 0] \| 2020-04-27 13:13:46.835164 \| 127.0.0.3 \| 188 \| 127.0.0.1 Mutation handling is done [shard 0] \| 2020-04-27 13:13:46.835177 \| 127.0.0.3 \| 201 \| 127.0.0.1 Generated 1 view update mutations [shard 0] \| 2020-04-27 13:13:46.835215 \| 127.0.0.1 \| 869 \| 127.0.0.1 Locally applying view update for ks.t_v2_idx_index; base token = -3248873570005575792; view token = 3728482343045213994 [shard 0] \| 2020-04-27 13:13:46.835226 \| 127.0.0.1 \| 880 \| 127.0.0.1 Successfully applied local view update for 127.0.0.1 and 0 remote endpoints [shard 0] \| 2020-04-27 13:13:46.835253 \| 
127.0.0.1 \| 907 \| 127.0.0.1 View updates for ks.t were generated and propagated [shard 0] \| 2020-04-27 13:13:46.835256 \| 127.0.0.1 \| 910 \| 127.0.0.1 Got a response from /127.0.0.1 [shard 0] \| 2020-04-27 13:13:46.835274 \| 127.0.0.1 \| 928 \| 127.0.0.1 Delay decision due to throttling: do not delay, resuming now [shard 0] \| 2020-04-27 13:13:46.835276 \| 127.0.0.1 \| 930 \| 127.0.0.1 Mutation successfully completed [shard 0] \| 2020-04-27 13:13:46.835279 \| 127.0.0.1 \| 933 \| 127.0.0.1 Done processing - preparing a result [shard 0] \| 2020-04-27 13:13:46.835286 \| 127.0.0.1 \| 941 \| 127.0.0.1 Message received from /127.0.0.3 [shard 0] \| 2020-04-27 13:13:46.835331 \| 127.0.0.2 \| 14 \| 127.0.0.1 Sending mutation_done to /127.0.0.3 [shard 0] \| 2020-04-27 13:13:46.835399 \| 127.0.0.2 \| 82 \| 127.0.0.1 Mutation handling is done [shard 0] \| 2020-04-27 13:13:46.835413 \| 127.0.0.2 \| 96 \| 127.0.0.1 Got a response from /127.0.0.2 [shard 0] \| 2020-04-27 13:13:46.835639 \| 127.0.0.3 \| 662 \| 127.0.0.1 Delay decision due to throttling: do not delay, resuming now [shard 0] \| 2020-04-27 13:13:46.835640 \| 127.0.0.3 \| 664 \| 127.0.0.1 Successfully applied view update for 127.0.0.2 and 1 remote endpoints [shard 0] \| 2020-04-27 13:13:46.835649 \| 127.0.0.3 \| 673 \| 127.0.0.1 Got a response from /127.0.0.3 [shard 0] \| 2020-04-27 13:13:46.835841 \| 127.0.0.1 \| 1495 \| 127.0.0.1 Request complete \| 2020-04-27 13:13:46.834944 \| 127.0.0.1 \| 944 \| 127.0.0.1 ``` Fixes #6175 Tests: unit(dev), manual * psarna-propagate_tracing_to_more_write_paths: db,view: add tracing to view update generation path treewide: propagate trace state to write path	2020-05-27 08:40:05 +03:00
Avi Kivity	11698aafc1	tests: querier_cache_test: don't exhaust random number entropy rand_int() re-creates a random device each time it is called. Change it to use a static random_device, and get random numbers from a random_engine instead of from the device directly. This avoids exhausting entropy, see [1] for details. [1] https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94087	2020-05-26 20:51:16 +03:00
Avi Kivity	e2f4c689b1	tests: loading_cache_test: don't exhaust random number entropy rand_int() re-creates a random device each time it is called. Change it to use a static random_device, and get random numbers from a random_engine instead of from the device directly. This avoids exhausting entropy, see [1] for details. [1] https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94087	2020-05-26 20:49:58 +03:00
Avi Kivity	85da266cf4	tests: dynamic_bitset_test: don't exhaust random number entropy tests_random_ops() extracts a real random number from a random_device. Change it to use a random number engine. This avoids exhausting entropy, see [1] for details. [1] https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94087	2020-05-26 20:46:45 +03:00
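The three test fixes above share one pattern: construct the random device once and draw numbers from an engine rather than from the device itself. A minimal sketch of that pattern follows, assuming a helper shaped like `rand_int(lo, hi)`; the signature, the `thread_local` storage, and the choice of `std::default_random_engine` are illustrative assumptions, not the code from these commits.

```cpp
#include <cassert>
#include <random>

// Hypothetical helper (not the actual test code): the device is constructed
// once and used only to seed the engine; every later call draws from the
// engine, so no further entropy is requested from the system.
inline int rand_int(int lo, int hi) {
    static thread_local std::random_device device;                      // created once per thread
    static thread_local std::default_random_engine engine(device());   // seeded once per thread
    std::uniform_int_distribution<int> dist(lo, hi);                    // inclusive range [lo, hi]
    return dist(engine);
}
```

Calling `rand_int(1, 6)` repeatedly reuses the same engine; only the first call on each thread touches the device.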
Piotr Sarna	032a531ea6	test: add unit tests for alternator base64 conversions The test cases verify that base64 operations encode and decode their data properly. Tests: unit(dev)	2020-05-21 18:26:59 +03:00
Piotr Sarna	92aadb94e5	treewide: propagate trace state to write path In order to add tracing to places where it can be useful, e.g. materialized view updates and hinted handoff, tracing state is propagated to all applicable call sites.	2020-05-18 16:05:23 +02:00
Avi Kivity	beaeda5234	database: remove variadic future from query() and query_mutations() Variadic futures are deprecated; replace with future<std::tuple<...>>. Tests: unit (dev)	2020-05-17 18:45:38 +02:00
Dejan Mircevski	8db7e4cc96	cql: Add test for invalid unbounded DELETE In `add40d4e59`, we relaxed the prohibition of unbounded DELETE and stopped testing the failure message. But there are still scenarios when unbounded DELETE is prohibited, so add a test to ensure we continue to catch it where appropriate. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-05-17 12:28:36 +03:00
Avi Kivity	b155eef726	Merge "allow early aborts through abort sources." from Glauber " The shutdown process of compaction manager starts with an explicit call from the database object. However that can only happen once everything is already initialized. This works well today, but I am soon to change the resharding process to operate before the node is fully ready. One can still stop the database in this case, but reshardings will have to finish before the abort signal is processed. This patch passes the existing abort source to the construction of the compaction_manager and subscribes to it. If the abort source is triggered, the compaction manager will react to it firing and all compactions it manages will be stopped. We still want the database object to be able to wait for the compaction manager, since the database is the object that owns the lifetime of the compaction manager. To make that possible we'll use a future that is returned from stop(): no matter what triggered the abort, either an early abort during initial resharding or a database-level event like drain, everything will shut down in the right order. The abort source is passed to the database, which is responsible for constructing the compaction manager. Tests: unit (debug), manual start+stop, manual drain + stop, previously failing dtests. "	2020-05-17 11:49:00 +03:00
Avi Kivity	777d5e88c3	types: support altering fixed-size integer types to varint Fixed-size integer types are legal varints - both are serialized as two's complement in network byte order. So tinyint, shortint, int, and bigint can all be interpreted as varints. Change is_compatible_with() to reflect that. Message-Id: <20200516115143.28690-2-avi@scylladb.com>	2020-05-17 11:31:00 +03:00
Avi Kivity	ff57e4d9a5	types: make short and byte types value-compatible with varint The short and byte types are two's complement network byte order, just like varint (except fixed size) and so varint can read them just fine. Mark them as value compatible like int32_type and long_type. A unit test is added. Message-Id: <20200516115143.28690-1-avi@scylladb.com>	2020-05-17 11:31:00 +03:00
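The two types commits above rest on one observation: a fixed-size integer written as big-endian two's complement is already a valid varint encoding of the same value. A minimal sketch of that claim follows. The names `serialize_fixed` and `decode_varint` are hypothetical, and the decoder is limited to 8 bytes for brevity; a real varint is arbitrary-length.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <type_traits>
#include <vector>

// Hypothetical helper: write a fixed-size signed integer as two's complement
// in network (big-endian) byte order.
template <typename T>
std::vector<uint8_t> serialize_fixed(T value) {
    auto u = static_cast<std::make_unsigned_t<T>>(value);
    std::vector<uint8_t> out(sizeof(T));
    for (std::size_t i = 0; i < sizeof(T); ++i) {
        out[sizeof(T) - 1 - i] = static_cast<uint8_t>(u >> (8 * i));
    }
    return out;
}

// Hypothetical decoder: read a big-endian two's complement integer of any
// length from 1 to 8 bytes, which is how a small varint is laid out.
// Empty input is treated as 0 here purely to keep the sketch total.
inline int64_t decode_varint(const std::vector<uint8_t>& bytes) {
    if (bytes.empty()) {
        return 0;
    }
    // Pre-fill with the sign bit so that short inputs are sign-extended.
    uint64_t acc = (bytes[0] & 0x80) ? ~uint64_t(0) : uint64_t(0);
    for (uint8_t b : bytes) {
        acc = (acc << 8) | b;
    }
    // Two's complement reinterpretation (guaranteed since C++20).
    return static_cast<int64_t>(acc);
}
```

Because the decoder does not care whether the input is 1, 2, 4, or 8 bytes long, `decode_varint(serialize_fixed<int32_t>(-1234))` yields the same value that was serialized, which is the sense in which the fixed-size types are value-compatible with varint.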
Glauber Costa	7423ccc318	compaction_manager: allow early aborts through abort sources. The shutdown process of compaction manager starts with an explicit call from the database object. However that can only happen once everything is already initialized. This works well today, but I am soon to change the resharding process to operate before the node is fully ready. One can still stop the database in this case, but reshardings will have to finish before the abort signal is processed. This patch passes the existing abort source to the construction of the compaction_manager and subscribes to it. If the abort source is triggered, the compaction manager will react to it firing and all compactions it manages will be stopped. We still want the database object to be able to wait for the compaction manager, since the database is the object that owns the lifetime of the compaction manager. To make that possible we'll use a future that is returned from stop(): no matter what triggered the abort, either an early abort during initial resharding or a database-level event like drain, everything will shut down in the right order. The abort source is passed to the database, which is responsible for constructing the compaction manager. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2020-05-13 16:51:25 -04:00
Glauber Costa	e29701ca1c	compaction_manager: expand state to be able to differentiate between enabled and stopped We are having many issues with the stop code in the compaction_manager. Part of the reason is that the "stopped" state has its meaning overloaded to indicate both "compaction manager is not accepting compactions" and "compaction manager is not ready or destructed". In a later step we could default to enabled-at-start, but right now we maintain current behavior to minimize noise. It is only possible to stop the compaction manager once. It is possible to enable / disable the compaction manager many times. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2020-05-13 16:51:25 -04:00
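The state split described in the commit above can be pictured as a small state machine: enabling and disabling may alternate freely, while stopping is terminal and happens at most once. The sketch below is an assumption-laden illustration only. The class name, the enumerator names, the choice to start disabled, and the behaviour of a second `stop()` call are hypothetical and not taken from the compaction_manager source.

```cpp
#include <cassert>
#include <stdexcept>

// Hypothetical sketch of separating "not accepting compactions" (disabled)
// from "shut down for good" (stopped).
class compaction_state_sketch {
public:
    enum class state { disabled, enabled, stopped };

    // May be called any number of times until the object is stopped.
    void enable() {
        require_not_stopped();
        _state = state::enabled;
    }

    void disable() {
        require_not_stopped();
        _state = state::disabled;
    }

    // Terminal transition. Returns true only for the call that actually
    // performed the stop; later calls are no-ops (an assumption of this sketch).
    bool stop() {
        if (_state == state::stopped) {
            return false;
        }
        _state = state::stopped;
        return true;
    }

    bool enabled() const { return _state == state::enabled; }
    bool stopped() const { return _state == state::stopped; }

private:
    void require_not_stopped() const {
        if (_state == state::stopped) {
            throw std::logic_error("state change after stop");
        }
    }

    // Starts disabled, mirroring the commit's note that enabled-at-start is
    // left for a later step.
    state _state = state::disabled;
};
```

With distinct states, a caller can tell a temporarily disabled manager, which may be enabled again, from one that has been stopped.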

390 Commits