scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-25 11:00:35 +00:00

Author	SHA1	Message	Date
Avi Kivity	8747c684e0	Merge 'Move timeouts to client state' from Piotr Sarna This series is extracted from #7913 as it may prove useful to other series as well, and #7913 might take a while until its merged, given that it also depends on other unmerged pull requests. The idea of this series is to move timeouts to the client state, which will allow changing them independently for each session - e.g. by setting per-service-level timeouts and initializing the values from attached service levels (see #7867). Closes #8140 * github.com:scylladb/scylla: treewide: remove timeout config from query options cql3: use timeout config from client state instead of query options cql3: use timeout config from client state instead of query options cql3: use timeout config from client state instead of query options service: add timeout config to client state	2021-03-01 20:34:35 +02:00
Piotr Sarna	c5214eb096	treewide: remove timeout config from query options Timeout config is now stored in each connection, so there's no point in tracking it inside each query as well. This patch removes timeout_config from query_options and follows by removing now unnecessary parameters of many functions and constructors.	2021-02-25 17:20:27 +01:00
Kamil Braun	d4937daaea	cdc: introduce cdc::generation_service This commit introduces a new service crafted to handle CDC generation management: listening and reacting to generation changes in the cluster. The implementation is a stub for now, the service reacts to generation changes by simply logging the event. The commit plugs the service in, initializing it in main and test code, passing a reference to storage_service and having storage_service start the service (using the `after_join` method): the service only starts doing its job after the node joins the token ring (either on bootstrap or restart).	2021-02-22 12:45:43 +01:00
Botond Dénes	ba7a9d2ac3	imr: switch back to open-coded description of structures Commit `aab6b0ee27` introduced the controversial new IMR format, which relied on a very template-heavy infrastructure to generate serialization and deserialization code via template meta-programming. The promise was that this new format, beyond solving the problems the previous open-coded representation had (working on linearized buffers), will speed up migrating other components to this IMR format, as the IMR infrastructure reduces code bloat, makes the code more readable via declarative type descriptions as well as safer. However, the results were almost the opposite. The template meta-programming used by the IMR infrastructure proved very hard to understand. Developers don't want to read or modify it. Maintainers don't want to see it being used anywhere else. In short, nobody wants to touch it. This commit does a conceptual revert of `aab6b0ee27`. A verbatim revert is not possible because related code evolved a lot since the merge. Also, going back to the previous code would mean we regress as we'd revert the move to fragmented buffers. So this revert is only conceptual, it changes the underlying infrastructure back to the previous open-coded one, but keeps the fragmented buffers, as well as the interface of the related components (to the extent possible). Fixes: #5578	2021-02-16 23:43:07 +01:00
Benny Halevy	29002e3b48	flat_mutation_reader: return future from next_partition To allow it to asynchronously close underlying readers on next_partition(). Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-01-13 17:35:07 +02:00
Raphael S. Carvalho	198b87503f	row_cache: allow external updater to decouple preparation from execution External updater may do some preparatory work like constructing a new sstable list, and at the end atomically replace the old list by the new one. Decoupling the preparation from execution will give us the following benefits: - the preparation step can now yield if needed to avoid reactor stalls, as it's been futurized. - the execution step will now be able to provide strong exception guarantees, as it's now decoupled from the preparation step which can be non-exception-safe. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2020-12-28 13:17:45 -03:00
Benny Halevy	29ed59f8c4	main: start a shared_token_metadata And use it to get a token_metadata& compatible with current usage, until the services are converted to use token_metadata_ptr. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-11-11 14:20:23 +02:00
Benny Halevy	fcca64b4f6	test: imr_test should run automatically Unclear why it was places in test/manual in commit `1c8736f998` Test: boost/imr_test Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20201022093826.12009-1-bhalevy@scylladb.com>	2020-10-22 12:40:30 +03:00
Avi Kivity	d5e94ab224	test: partition_data_test: don't capture structured bindings in lambdas Clang does not yet implement p1091r3, which allows lambdas to capture structured bindings. To accomodate it, don't use structured bindings for variables that are later captured.	2020-10-16 15:24:45 +03:00
Botond Dénes	ff623e70b3	reader_concurrency_semaphore: name permits Require a schema and an operation name to be given to each permit when created. The schema is of the table the read is executed against, and the operation name, which is some name identifying the operation the permit is part of. Ideally this should be different for each site the permit is created at, to be able to discern not only different kind of reads, but different code paths the read took. As not all read can be associated with one schema, the schema is allowed to be null. The name will be used for debugging purposes, both for coredump debugging and runtime logging of permit-related diagnostics.	2020-10-13 12:32:13 +03:00
Botond Dénes	2ee026f26f	test/manual/sstable_scan_footprint_test: run test body in statement sched group So that queries are processed in said scheduling group and thus they use the user read concurrency semaphore.	2020-09-28 11:27:49 +03:00
Botond Dénes	272a54b81c	test/manual/sstable_scan_footprint_test: move test main code into separate function	2020-09-28 11:27:49 +03:00
Botond Dénes	29861b068e	test/manual/sstable_scan_footprint_test: sprinkle some thread::maybe_yield():s To avoid stalls.	2020-09-28 11:27:49 +03:00
Botond Dénes	daa9fa72f1	test/manual/sstable_scan_footprint_test: make clustering row size configurable So that large-row workloads can be simulated too.	2020-09-28 11:27:49 +03:00
Botond Dénes	2ff326a41a	test/manual/sstable_scan_footprint_test: document sstable related command line arguments	2020-09-28 11:27:49 +03:00
Botond Dénes	6ca0464af5	mutation_fragment: add schema and permit We want to start tracking the memory consumption of mutation fragments. For this we need schema and permit during construction, and on each modification, so the memory consumption can be recalculated and pass to the permit. In this patch we just add the new parameters and go through the insane churn of updating all call sites. They will be used in the next patch.	2020-09-28 11:27:23 +03:00
Botond Dénes	3fab83b3a1	flat_mutation_reader: impl: add reader_permit parameter Not used yet, this patch does all the churn of propagating a permit to each impl. In the next patch we will use it to track to track the memory consumption of `_buffer`.	2020-09-28 10:53:48 +03:00
Pavel Emelyanov	ee41645a1a	tests: Use own instances of messaging_service The global one is going away, no core code uses it, so all tests can be safely switched to use their own instances. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 20:50:53 +03:00
Pavel Emelyanov	4ea3c2797c	storage_service: Keep reference on sharded messaging service It is a bit step backward in the storage-service decompsition campaign, but... Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 20:50:53 +03:00
Pavel Emelyanov	65bd54604d	gossiper: Use messaging service by reference Gossiper needs messaging service, the messaging is started before the gossiper, so we can push the former reference into it. Gossiper is not stopped for real, neither the messaging service is, so the memory usage is still safe. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 20:50:52 +03:00
Pavel Emelyanov	dc0918e255	tests: Keep local reference on global messaging Some tests directly reference the global messaging service. For the sake of simpler patching wrap this global reference with a local one. Once the global messaging service goes away tests will get their own instances. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 13:08:12 +03:00
Piotr Jastrzebski	c001374636	codebase wide: replace count with contains C++20 introduced `contains` member functions for maps and sets for checking whether an element is present in the collection. Previously `count` function was often used in various ways. `contains` does not only express the intend of the code better but also does it in more unified way. This commit replaces all the occurences of the `count` with the `contains`. Tests: unit(dev) Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <b4ef3b4bc24f49abe04a2aba0ddd946009c9fcb2.1597314640.git.piotr@scylladb.com>	2020-08-15 20:26:02 +03:00
Benny Halevy	e2340d0684	config: enable_sstables_md_format by default Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-08-10 19:19:32 +03:00
Benny Halevy	65239a6e50	config: add enable_sstables_md_format MD format is disabled by default at this point. The option extends enable_sstables_mc_format so that both are needed to be set for supporting the md format. The MD_FORMAT cluster feature will be added in a following patch. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-08-10 18:53:04 +03:00
Wojciech Mitros	4863e8a11f	tests: add large paging state tests Add a unit test checking if the top 32 bits of the number of remaining rows in paging state is used correctly and a manual test checking if it's possible to select over 2^32 rows from a table and a virtual reader for this table.	2020-08-04 18:44:29 +02:00
Botond Dénes	9eab5bca27	query_*(): use the coordinator specified memory limit for unlimited queries It is important that all replicas participating in a read use the same memory limits to avoid artificial differences due to different amount of results. The coordinator now passes down its own memory limit for reads, in the form of max_result_size (or max_size). For unpaged or reverse queries this has to be used now instead of the locally set max_memory_unlimited_query configuration item. To avoid the replicas accidentally using the local limit contained in the `query_class_config` returned from `database::make_query_class_config()`, we refactor the latter into `database::get_reader_concurrency_semaphore()`. Most of its callers were only interested in the semaphore only anyway and those that were interested in the limit as well should get it from the coordinator instead, so this refactoring is a win-win.	2020-07-28 18:00:29 +03:00
Piotr Sarna	446b89f408	test: move json tests from manual/ to boost/ Manual tests are, as the name suggests, not run automatically, which makes them more prone to regressions. JSON tests are fast and correct, so there's no reason for them to be marked as manual. Message-Id: <dea75b0a0d1c238d12382a28840978884ac6ec2c.1594023481.git.sarna@scylladb.com>	2020-07-06 11:24:12 +03:00
Piotr Sarna	83ab41c76d	test: add json test for parsing from map Our JSON legacy helper functions for parsing documents to/from string maps are indirectly tested by several unit tests, e.g. caching_options_test.cc. They however lacked one corner case detected only by dtest - parsing an empty map from a null JSON document. This case is hereby added in order to prevent future regressions. Message-Id: <df8243bd083b2ba198df665aeb944c8710834736.1594020411.git.sarna@scylladb.com>	2020-07-06 10:28:55 +03:00
Piotr Sarna	1b37517aab	rjson: move quote_json_string to rjson This utility function is used for type serialization, but it also has a dedicated unit test, so it needs to be globally reachable.	2020-07-03 10:27:23 +02:00
Pavel Emelyanov	60e283b23e	auth: Move away from storage_service Now after the auth start/stop is standalone, we can remove reference from storage service to it. This frees some tests from the need to carry the auth service around for nothing. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-06-12 22:14:33 +03:00
Rafael Ávila de Espíndola	e5876f6696	everywhere: Move to seastar api level 3 Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-06-03 10:32:46 -07:00
Botond Dénes	e4c591aa67	database: introduce make_query_class_config() And use it to obtain any query-class specific configuration that was obtained from `table::config` before, such as the read concurrency semaphore and the max memory limit for unlimited queries. As all users of these items get these from the query class config now, we can remove them from `table::config`.	2020-05-28 11:34:35 +03:00
Botond Dénes	a08467da29	test: move away from reader_concurrency_semaphore::wait_admission() And use the reader_permit for this instead. This refactoring has revealed a pre-existing bug in the `test_lifecycle_policy`, which is also addressed in this patch. The bug is that said policy executes reader destructions in the background, and these are not waited for. For some reason, the semaphore -> permit transition pushes these races over the edge and we start seeing some of these destruction fibers still being unfinished when test scopes are exited, causing all sorts of trouble. The solution is to introduce a special gate that tests can use to wait for all background work to finish, before the test scope is exited.	2020-05-28 11:34:35 +03:00
Piotr Sarna	92aadb94e5	treewide: propagate trace state to write path In order to add tracing to places where it can be useful, e.g. materialized view updates and hinted handoff, tracing state is propagated to all applicable call sites.	2020-05-18 16:05:23 +02:00
Kamil Braun	1f7290a0ff	versioned_value: remove versioned_value::factory class If there was a Most Useless Abstraction award, this would be a good candidate.	2020-04-20 12:57:16 +02:00
Piotr Sarna	8fea5075f2	test: fix manual gossip test When trying to get rid of a large stack warning for gossip test, I found out that it actually does not run at all for multiple reasons: 1. It segfaults due to wrong initialization order 2. After fixing that, it segfaults on use-after-free (due to capturing a shared pointer by reference instead of by copy) 3. After that, cleanups are in order: * seastar thread does not need to be spawned inside another thread; * default captures are harmful, so they're made explicit instead; * db::config is moved to heap, to finally get rid of the warning. Tests: manual(gossip) Message-Id: <feaca415d0d29a16c541f9987645365310663630.1585128338.git.sarna@scylladb.com>	2020-04-06 11:07:10 +02:00
Avi Kivity	88ade3110f	treewide: replace calls to engine().some_api() with some_api() This removes the need to include reactor.hh, a source of compile time bloat. In some places, the call is qualified with seastar:: in order to resolve ambiguities with a local name. Includes are adjusted to make everything compile. We end up having 14 translation units including reactor.hh, primarily for deprecated things like reactor::at_exit(). Ref #1	2020-04-05 12:46:04 +03:00
Rafael Ávila de Espíndola	eca0ac5772	everywhere: Update for deprecated apply functions Now apply is only for tuples, for varargs use invoke. This depends on the seastar changes adding invoke. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200324163809.93648-1-espindola@scylladb.com>	2020-03-25 08:49:53 +02:00
Rafael Ávila de Espíndola	69874f4330	feature_service: Remove default constructor This makes user that feature_config_from_db_config is used for both tests and main.cc. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200312153453.37282-2-espindola@scylladb.com>	2020-03-16 11:01:15 +02:00
Pavel Emelyanov	0298a6270e	storage_service: Forget cql_config It needs the config purely to feed one into thrift/transport server, since the latter two no longer needs one, neither does the former. As a nice side effect -- some tests no longer have to carry the cql_config on board. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-03-09 11:58:06 +03:00
Konstantin Osipov	ac0717fb64	test: consistently use a global testlog object in all tests Use test/lib/log.hh in all tests now that we have it.	2020-03-05 13:34:24 +03:00
Botond Dénes	1073094f04	database: database::query(), database::apply(): remove default timeouts	2020-02-27 19:14:12 +02:00
Botond Dénes	f6013a39ec	reader_concurrency_semaphore: wait_admission(): remove default timeout	2020-02-27 18:43:12 +02:00
Piotr Jastrzebski	ca4a89d239	dht: add dht::decorate_key and replace all dht::global_partitioner().decorate_key with dht::decorate_key It is an improvement because dht::decorate_key takes schema and uses it to obtain partitioner instead of using global partitioner as it was before. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-02-17 10:59:06 +01:00
Pavel Emelyanov	2f3490dc8d	gossiper: Use own token_metadata Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-10 20:54:32 +03:00
Pavel Emelyanov	c5997b573c	tokens: Switch into standalone sharded instance Way too many places in code needs storage_service just for token_metadata. These references increase the amount of get(_local)?_storage_service() calls and create loops in components dependencies. Keep the token_metadata separately from storage_service and pass instances' references where needed (for now -- only into the storage_service itself). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-10 20:54:32 +03:00
Pavel Emelyanov	d1775dd701	utils: Move disk-error-handler into it The disk-error-handler is purely auxiliary thing that helps propagating IO errors to the rest of the code. It well deserves not sitting in the root namespace. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20200207112443.18475-1-xemul@scylladb.com>	2020-02-09 17:26:52 +02:00
Rafael Ávila de Espíndola	bd93a0af52	types: Return bytes_opt from data_value::serialize Since a data_value can contain a null value, returning bytes from serialize() was losing information as it was mapping null to empty. This also introduces a serialize_nonnull that still returns bytes, but results in an internal error if called with a null value. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-01-29 14:04:59 -08:00
Botond Dénes	e4616f92fe	test/manual/sstable_scan_footprint_test: improve memory consumption diagnostics This test is all about tracking measured memory consumption vs. real memory consumption. To make this easier add additional diagnostics: * enable seastar heap profiler for the duration of the reads (seastar has to be compiled with `-DSEASTAR_HEAPPROF`). * Add a stats collector, which periodically collects stats such as non-LSA free/used memory, LSA free/used memory and memory tracked by the reader concurrency semaphore. These stats are written to a `.csv` file, allowing importing them into a spreadsheet and processing them.	2020-01-28 10:15:55 +02:00
Botond Dénes	9e9c59d125	tests/manual/sstable_scan_footprint_test: use the semaphore to determine read rate Currently the test fires the configured amount of reads at once. This is somewhat restricting in the number of testable scenarios. For example, it doesn't allow one to see if the semaphore correctly tracks the memory consumption of existing reads, by firing new reads after a while. Replace this algorithm by one which fires reads with a configured concurrency, then waits for the semaphore's queue (if any) to drain, before firing new reads. The test can now be configured with the total amount of reads to fire, and with the read-concurrency, i.e. the number of reads to fire at once in each iteration. This allows for much greater flexibility in the different test scenarios. The previous behaviour can still be achieved by configuring a concurrency of 100. This patch also adds better error handling. Reads are aborted on the first error and errors are caught and not allowed to bubble up past the test's main function and are logged instead. Extensive logging is also added to be able to monitor the system while the test is running.	2020-01-28 10:15:53 +02:00

1 2

56 Commits