Both streaming and repair invoke the distributed sstables writing code with
nearly identical lambdas, each ~30 lines of code. The only difference between
them is that repair might request off-strategy compaction for the new sstables.
Generalizing these two pieces saves lines of code and speeds up compilation of
release/repair/row_level.o by half a minute (out of twelve).
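Roughly, the common piece now looks like this (an illustrative sketch only; the
real Scylla types, names and the ~30-line body differ):

```cpp
#include <seastar/core/future.hh>

namespace sketch {

struct table;                 // placeholder for the real table type
struct flat_mutation_reader;  // placeholder for the real reader type

// The ~30 lines previously duplicated by the streaming and repair lambdas
// live here once; repair is the only caller that passes offstrategy=true.
seastar::future<> write_incoming_sstables(table& t, flat_mutation_reader& rd,
                                          bool offstrategy) {
    // ... create sstable writers, consume rd, seal the new sstables ...
    // if (offstrategy) { /* register the new sstables for off-strategy compaction */ }
    return seastar::make_ready_future<>();
}

} // namespace sketch
```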
tests: unit(dev)
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Message-Id: <20210531133113.23003-1-xemul@scylladb.com>
Instead of attempting to universally set the proper environment
necessary for tests to generate profiling data such that coverage.py can
process it, allow each Test subclass to set up the environment as needed
by the specific Test variant.
With this we now have support for all current test types, including cql,
cql-pytest and alternator tests.
Yet another modifier for `--run`, allowing running the same executable
multiple times and then generating a coverage report across all runs.
This will also be used by test.py for those test suites (cql test) which
run the same executable multiple times, with different inputs.
Another modifier for `--run`, allowing overriding of the test executable
path. This is useful when the real test is run through a run-script,
as in the case of cql-pytest.
In this patch, we port validation/entities/secondary_index_test.java,
resulting in 41 tests for various aspects of secondary indexes.
Some of the original Java tests required direct access to the Cassandra
internals not available through CQL, so those tests were omitted.
In porting these tests, I uncovered 9 previously-unknown bugs in Scylla:
Refs #8600: IndexInfo system table lists MV name instead of index name
Refs #8627: Cleanly reject updates with indexed values where value > 64k
Refs #8708: Secondary index is missing partitions with only a static row
Refs #8711: Finding or filtering with an empty string with a secondary
index seems to be broken
Refs #8714: Improve error message on unsupported restriction on partition
key
Refs #8717: Recent fix accidentally broke CREATE INDEX IF NOT EXISTS
Refs #8724: Wrong error message when attempting index of UDT column with
a duration
Refs #8744: Index-creation error message wrongly refers to "map" - it can
be any collection
Refs #8745: Secondary index CREATE INDEX syntax is missing the "values"
option
These tests also provide additional reproducers for already known issues:
Refs #2203: Add support for SASI
Refs #2962: Collection column indexing
Refs #2963: Static column indexing
Refs #4244: Add support for mixing token, multi- and single-column
restrictions
Due to these bugs, 15 out of the 41 tests here currently xfail. We actually
had more failing tests, but we fixed a few of the above issues before this
patch went in, so their tests are passing at the time of this submission.
All 41 tests pass when running against Cassandra.
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Message-Id: <20210531112354.970028-1-nyh@scylladb.com>
This patch is not backward compatible with the original one, but that is
considered fine, since the original workload types were not yet part of any
release.
The changes include:
- instead of using 'unspecified' for declaring that there's no workload
type for a particular service level, NULL is used for that purpose;
NULL is the standard way of representing lack of data
- introducing a delete marker, which accompanies NULL and makes it
possible to distinguish between wanting to forcibly reset a workload
type to unspecified and not wanting to change the previous value
- updating the tests accordingly
These changes come in as a single patch, because they're intertwined
with each other and the tests for workload types are already in place;
an attempt to split them proved to be more complicated than it's worth.
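The delete-marker semantics boil down to a tri-state update value; roughly (an
illustrative sketch, not the actual qos code):

```cpp
#include <optional>

namespace sketch {

enum class workload_type { interactive, batch };

// outer nullopt -> keep the previously stored value
// inner nullopt -> delete marker: forcibly reset the stored value to NULL/unspecified
// inner value   -> set this concrete workload type
using workload_type_update = std::optional<std::optional<workload_type>>;

} // namespace sketch
```

A plain NULL stored in the table then simply means "unspecified".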
Tests: unit(release)
Closes #8763
When the user requests that repair be forcefully aborted, the `_abort_all_as`
abort source could be modified from multiple shards in parallel by the
`tracker::abort_all_repairs()` function, which can lead to undefined
behavior and to a crash. This commit makes sure that `_abort_all_as` is
used only from shard 0 when repair is aborted.
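Roughly, the fix amounts to hopping to shard 0 before touching the abort
source (an illustrative sketch, not the actual repair/tracker code):

```cpp
#include <seastar/core/abort_source.hh>
#include <seastar/core/future.hh>
#include <seastar/core/smp.hh>

namespace sketch {

struct tracker {
    seastar::abort_source _abort_all_as;   // only ever touched on shard 0
};

// Whatever shard the abort request arrives on, hop to shard 0 before touching
// the abort source, so it is never used concurrently from multiple shards.
seastar::future<> abort_all_repairs(tracker& shard0_tracker) {
    return seastar::smp::submit_to(0, [&shard0_tracker] {
        shard0_tracker._abort_all_as.request_abort();
    });
}

} // namespace sketch
```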
Fixes #8693
Closes #8734
The new process has the following differences from the Dockerfile
based image:
- Using buildah commands instead of a Dockerfile. This is more flexible
since we don't need to pack everything into a "build context" and
transfer it to the container; instead we interact with the container
as we build it.
- Using packages instead of a remote yum repository. This makes it
easy to create an image in one step (no need to create a repository,
promote it, then download the packages back via yum). It means that
the image cannot be upgraded via yum, but container images are
usually just replaced with a new version.
- Build output is an OCI archive (i.e. a tarball), not a docker image
in a local repository. This means the build process can later be
integrated into ninja, since the artifact is just a file. The file
can be uploaded into a repository or made available locally with
skopeo.
- Any build mode is supported, not just release. This can be used
for quick(er) testing with dev mode.
I plan to integrate it further into the build system, but currently
this is blocked on a buildah bug [1].
[1] https://github.com/containers/buildah/issues/3262
Closes #8730
The value of a frozen collection may only be indexed (using a secondary
index) in full - it is not allowed to index only the keys, for example:
"CREATE INDEX idx ON table (keys(v))" is not allowed.
The error message referred to a frozen<map>, but the problem can happen
on any frozen collection (e.g., a frozen set), not just a frozen map,
so it can be confusing to a user who used a frozen set and got an
error about a frozen map.
So this patch fixes the error message to refer to a "frozen collection".
Note that the Cassandra error message in this case is different - it
reads: "Frozen collections are immutable and must be fully indexed".
Fixes #8744.
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Message-Id: <20210529094056.825117-1-nyh@scylladb.com>
The compaction manager can start tons of compactions of fully expired sstables
in parallel, which may consume a significant amount of resources.
This problem is caused by the weight being released too early in compaction:
after all data is compacted but before the table is called to update its state
(replacing sstables and so on).
Fully expired sstables aren't actually compacted, so the following can happen:
- compaction 1 starts for expired sst A with weight W, but there's nothing to
be compacted, so weight W is released, then calls table to update state.
- compaction 2 starts for expired sst B with weight W, but there's nothing to
be compacted, so weight W is released, then calls table to update state.
- compaction 3 starts for expired sst C with weight W, but there's nothing to
be compacted, so weight W is released, then calls table to update state.
- compaction 1 is done updating table state, so it finally completes and
releases all the resources.
- compaction 2 is done updating table state, so it finally completes and
releases all the resources.
- compaction 3 is done updating table state, so it finally completes and
releases all the resources.
This happens because, with expired sstables, compaction will release the weight
faster than it will update the table state, as there's nothing to be compacted.
With my reproducer, it's very easy to reach 50 parallel compactions on a single
shard, but that number can easily be worse depending on the number of sstables
with fully expired data, across all tables. This high parallelism can happen
even with only a couple of tables, if there are many time windows with expired
data, as they can all be compacted in parallel.
Prior to 55a8b6e3c9, the weight was released earlier in compaction, before the
last sstable was sealed, but right now there's no need to release the weight
early. The weight can be released in a much simpler way, after the compaction
is actually done, so such compactions will be serialized from now on.
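The fix, schematically (an illustrative sketch, not the actual
compaction_manager code):

```cpp
#include <seastar/core/future.hh>
#include <utility>

namespace sketch {

// RAII handle for the weight taken by a compaction.
struct weight_registration {
    weight_registration() = default;
    weight_registration(weight_registration&&) = default;
    ~weight_registration() { /* return the weight to the compaction manager */ }
};

template <typename Compact, typename UpdateTableState>
seastar::future<> run_compaction(weight_registration w, Compact compact,
                                 UpdateTableState update_table_state) {
    return compact().then(std::move(update_table_state)).finally([w = std::move(w)] {
        // The weight is released only here, after the table state was updated,
        // so back-to-back "compactions" of fully expired sstables serialize.
    });
}

} // namespace sketch
```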
Fixes #8710.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Message-Id: <20210527165443.165198-1-raphaelsc@scylladb.com>
[avi: drop now unneeded storage_service_for_tests]
It is currently possible that _memtables->add_memtable()
will throw after _memtables->clear(), leaving the memtables
list completely empty. However, we do rely on always
having at least one memtable allocated in the memtables list,
as active_memtable() references a lw_shared_ptr<memtable>
at the back of the memtables vector, and it is expected
to always be allocated via add_memtable() upon construction
and after clear().
This change moves the implementation of this convention
to memtable_list::clear() and makes the latter exception safe
by first allocating the to-be-added empty memtable and
only then clearing the vector.
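Schematically, the new ordering is (an illustrative sketch; the real code uses
lw_shared_ptr and the actual memtable type):

```cpp
#include <memory>
#include <vector>

namespace sketch {

struct memtable {};

struct memtable_list {
    std::vector<std::shared_ptr<memtable>> _memtables;   // never empty, by convention

    void clear() {
        auto mt = std::make_shared<memtable>();  // may throw -- the list is still intact
        _memtables.clear();                      // keeps capacity, doesn't throw
        _memtables.push_back(std::move(mt));     // reuses existing capacity (>= 1), doesn't throw
    }
};

} // namespace sketch
```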
Refs #8749
Test: unit(dev)
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Message-Id: <20210530100232.2104051-1-bhalevy@scylladb.com>
user_defined_function_test fails sporadically in debug mode
due to a Lua timeout. Raise the timeout to avoid the failure, but
not so much that the test that expects a timeout becomes too slow.
Fixes #8746.
Closes #8747
This is another boring patch.
One of the schema constructors has been deprecated for many years now but
was used in several places anyway. Usage of this constructor could
lead to data corruption when using MX sstables, because this constructor
does not set the schema version, and the MX reading/writing code depends
on the schema version.
This patch replaces all uses of the deprecated constructor
with the schema_builder equivalent. The schema_builder sets the schema
version correctly.
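Roughly, the replacement pattern (approximate; see the actual patch for the
exact columns and options used at each call site):

```cpp
#include "schema_builder.hh"

// Deprecated-constructor call sites become schema_builder chains; build()
// computes and sets the schema version, which the MX code relies on.
schema_ptr make_example_schema() {
    return schema_builder("ks", "cf")
            .with_column("pk", utf8_type, column_kind::partition_key)
            .with_column("v", int32_type)
            .build();
}
```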
Fixes #8507
Test: unit(dev)
Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>
Message-Id: <4beabc8c942ebf2c1f9b09cfab7668777ce5b384.1622357125.git.piotr@scylladb.com>
Sometimes the cql-pytest tests run extremely slowly. This can be
a combination of running the debug build (which is naturally slow)
and a test machine which is overcommitted, or experiencing some
transient swap storm or similar event. We don't want tests, which
we expect to be 100% reliable, to fail just because they run into
timeouts in Scylla when they run very slowly.
We already noticed this problem in the past, and increased the CQL client
timeout in conftest.py from the default of 10 seconds to 120 seconds -
the old default of 10 seconds was not enough for some long operations
(such as creating a table with multiple views) when the test ran very
slowly.
However, this only fixed the client-side timeout. We also have a bunch
of server-side timeouts, configured to all sorts of arbitrary (and
fairly small) numbers. For example, the server has a "write request
timeout" option, which defaults to just 2 seconds. We recently saw
this timeout exceeded in a slow run which tried to do a very large
write.
So this patch configures all the configurable server-side timeouts we
have to default to 300 seconds. This should be more than enough for even
the slowest runs (famous last words...). This default is not a good idea
on real multi-node clusters which are expected to deal with node loss,
but this is not the case in cql-pytest.
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Message-Id: <20210529213648.856503-1-nyh@scylladb.com>
"
Right now the storage service is used as a "provider" of other
services -- database, feature service and tokens. This set
unexports the first pair. This drops a bunch of calls to the
global storage service instance from the places that don't
really need it.
tests: unit(dev), start-stop
"
* 'br-pupate-storage-service' of https://github.com/xemul/scylla:
storage-service: Don't export features
api: Get features from proxy
storage-service: Don't export database
storage-service: Turn some global helpers into methods
storage-service: Open-code simple config getters
view: Get database from storage_proxy
main: Use local database instance
api: Use database from http_ctx
Now the storage service uses the feature service instance internally
and doesn't need to provide a getter for it.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
The reset_local_schema call needs the proxy and the feature service to do its
job. Right now the features are retrieved from the global storage service,
but they are present on the proxy as well.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Now the storage service uses the database instance internally and
doesn't need to provide a getter for it.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
There are two static helpers used by the storage service that grab
the global storage service. To simplify things, turn both into
storage service methods and use 'this' inside.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
There are two db::config getters in storage_service.cc that
are used only once. Both call the global storage service, but
since they are called from the storage service itself, it's simpler
to break this loop and make the storage service get the needed
config options directly.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
The db::view code already uses the proxy rather actively, so instead of
depending on the storage service being at hand, it's better to make
db::view require the proxy. For now -- via the global instance.
There's one dependency on the storage service left after this patch --
getting the tokens. This piece is to be fixed later.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
All start-stop code in main has the sharded<database> at hand; there's
no need to get it from the global storage service.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Instead of getting the database from the global storage service, it's simpler
and better to grab it from the http context at hand.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Consider the following procedure:
- n1, n2, n3
- n3 is network partitioned from the cluster
- n4 replaces n3
- n3 has the network partition fixed
- n1 learns that n3 has NORMAL status and calls
storage_service::handle_state_normal, which in turn calls
update_peer_info; all columns except the tokens column in system.peers are
written
- n1 restarts before figuring out that n4 is the new owner and before
deleting the entry for n3 in system.peers
- n3 is removed from gossip by all the nodes in the cluster
automatically, because they detect the collision and remove n3
- n1 restarts, leaving the entry in system.peers for n3 forever
To fix, we update the peer tables only if the node is part of the ring.
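The guard, schematically (an illustrative sketch, not the actual
storage_service code):

```cpp
#include <string>
#include <unordered_set>

namespace sketch {

using inet_address = std::string;   // placeholder for gms::inet_address

// Only persist gossip-derived peer info for endpoints that actually own
// tokens; a replaced/removed node like n3 above is skipped, so no stale
// system.peers entry is left behind.
void maybe_update_peer_info(const std::unordered_set<inet_address>& token_owners,
                            const inet_address& endpoint) {
    if (token_owners.count(endpoint) == 0) {
        return;
    }
    // ... write the endpoint's columns to system.peers ...
}

} // namespace sketch
```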
Fixes #8729
Closes #8742
> Merge "memory: optimize thread-local initialization" from Avi
> Merge "Move priority classes manipulations from io-queue" from Pavel E
> gate: add default move assignment operator
Starting from seastar commit 5dae0cf3c48159990f51e5d38495af5ae224c2f8,
all the registered classes' info was moved into the
io_priority_class::_infos array.
tests: scylla-gdb(release, old and new seastars)
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Message-Id: <20210528083941.27990-1-xemul@scylladb.com>
Clang warns when "return std::move(x)" is needed to elide a copy,
but the call to std::move() is missing. We disabled the warning during
the migration to clang. This patch re-enables the warning and fixes
the places it points out, usually by adding std::move() and in one
place by converting the returned variable from a reference to a local,
so normal copy elision can take place.
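Made-up examples of the two kinds of fixes (illustrative only; not code from
the tree):

```cpp
#include <string>
#include <utility>

// Adding the missing std::move(): under the implicit-move rules clang applied
// here, returning the named rvalue-reference parameter would copy.
std::string take(std::string&& s) {
    return std::move(s);            // was: return s;
}

// Converting the returned variable from a reference to a local object, so the
// usual copy elision / implicit move applies without an explicit std::move().
std::string first_of(std::pair<std::string, int> p) {
    auto v = std::move(p.first);    // was: auto& v = p.first;
    return v;
}
```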
Closes #8739
When destroying a perf_sstable_test_env, an assert in the sstables_manager
destructor fails, because it hasn't been closed.
Fix by removing all references to sstables from perf_sstable_test_env,
and then closing the test_env (as well as the sstables_manager).
Fixes #8736
Signed-off-by: Wojciech Mitros <wojciech.mitros@scylladb.com>
Closes #8737
Reopening #8286 since the token metadata fix that allows `Everywhere` strategy tables to work with RBO (#8536) has been merged.
---
Currently when a node wants to create and broadcast a new CDC generation
it performs the following steps:
1. choose the generation's stream IDs and mapping (how this is done is
irrelevant for the current discussion)
2. choose the generation's timestamp by taking the current time
(according to its local clock) and adding 2 * ring_delay
3. insert the generation's data (mapping and stream IDs) into
system_distributed.cdc_generation_descriptions, using the
generation's timestamp as the partition key (we call this table
the "old internal table" below)
4. insert the generation's timestamp into the "CDC_STREAMS_TIMESTAMP"
application state.
The timestamp spreads epidemically through the gossip protocol. When
nodes see the timestamp, they retrieve the generation data from the
old internal table.
Unfortunately, due to the schema of the old internal table, where
the entire generation data is stored in a single cell, step 3 may fail for
sufficiently large generations (there is a size threshold for which step
3 will always fail - retrying the operation won't help). Also the old
internal table lies in the system_distributed keyspace that uses
SimpleStrategy with replication factor 3, which is also problematic; for
example, when nodes restart, they must reach at least 2 out of these 3
specific replicas in order to retrieve the current generation (we write
and read the generation data with QUORUM, unless we're a single-node
cluster, where we use ONE). Until this happens, a restarting
node can't coordinate writes to CDC-enabled tables. It would be better
if the node could access the last known generation locally.
The commit introduces a new table for broadcasting generation data with
the following properties:
- it uses a better schema that stores the data in multiple rows, each
of manageable size
- it resides in a new keyspace that uses EverywhereStrategy so the
data will be written to every node in the cluster that has a token in
the token ring
- the data will be written using CL=ALL and read using CL=ONE; thanks
to this, a restarting node won't have to communicate with other nodes
to retrieve the data of the last known generation. Note that writing
with CL=ALL does not reduce availability: creating a new generation
*requires* all nodes to be available anyway, because they must learn
about the generation before their clocks go past the generation's
timestamp; if they don't, partitions won't be mapped to stream IDs
consistently across the cluster
- the partition key is no longer the generation's timestamp. Because it
was that way in the old internal table, it forced the algorithm to
choose the timestamp *before* the generation data was inserted into
the table. What if the inserting took a long time? It increased the
chance that nodes would learn about the generation too late (after
their clocks moved past its timestamp). With the new schema we will
first insert the generation data using a randomly generated UUID as
the partition key, *then* choose the timestamp, then gossip both the
timestamp and the UUID.
Observe that after a node learns (through gossip) about a generation
broadcast using this new method, it will retrieve the generation's data
very quickly, since it is one of the replicas and can use CL=ONE because
the data was written using CL=ALL.
The generation's timestamp and the UUID mentioned in the last point form
a "generation identifier" for this new generation. For passing these new
identifiers around, we introduce the cdc::generation_id_v2 type.
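The shape of the new identifier, roughly (an illustrative sketch; the real
cdc::generation_id_v2 uses Scylla's own clock and UUID types):

```cpp
#include <chrono>
#include <cstdint>

namespace sketch {

struct uuid { std::uint64_t msb = 0, lsb = 0; };   // placeholder for utils::UUID

// A generation is now identified by its timestamp *and* the random UUID that
// serves as the partition key of its data in the new internal table.
struct generation_id_v2 {
    std::chrono::system_clock::time_point ts;
    uuid id;
};

} // namespace sketch
```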
Fixes #7961.
---
For optimal review experience it is best to first read the updated design notes (you can read them rendered here: https://github.com/kbr-/scylla/blob/cdc-gen-table/docs/design-notes/cdc.md), specifically the ["Generation switching"](https://github.com/kbr-/scylla/blob/cdc-gen-table/docs/design-notes/cdc.md#generation-switching) section followed by the ["Internal generation descriptions table V1 and upgrade procedure"](https://github.com/kbr-/scylla/blob/cdc-gen-table/docs/design-notes/cdc.md#internal-generation-descriptions-table-v1-and-upgrade-procedure) section, then read the commits in topological order.
dtest gating run (dev): https://jenkins.scylladb.com/job/scylla-master/job/byo/job/byo_build_tests_dtest/1160/
unit tests (dev) passed locally
Closes #8643
* github.com:scylladb/scylla:
docs: update cdc.md with info about the new internal table
sys_dist_ks: don't create old CDC generations table on service initialization
sys_dist_ks: rename all_tables() to ensured_tables()
cdc: when creating new generations, use format v2 if possible
main: pass feature_service to cdc::generation_service
gms: introduce CDC_GENERATIONS_V2 feature
cdc: introduce retrieve_generation_data
test: cdc: include new generations table in permissions test
sys_dist_ks: increase timeout for create_cdc_desc
sys_dist_ks: new table for exchanging CDC generations
tree-wide: introduce cdc::generation_id_v2
This draft extends and obsoletes #8123 by introducing a way of determining the workload type from service level parameters, and then using this context to qualify requests for shedding.
The rough idea is that when requests start hitting the admission queue in the CQL server, it might make more sense to start shedding surplus requests instead of accumulating them on the semaphore. The assumption is that interactive workloads are more interested in having as many requests as possible succeed, and hanging on a semaphore reduces the chances for a request to succeed. Thus, it may make sense to shed some requests to reduce the load on this coordinator and let the existing requests finish.
It's a draft, because I only performed local guided tests. #8123 was followed by some experiments on a multinode cluster which I want to rerun first.
Closes #8680
* github.com:scylladb/scylla:
test: add a case for conflicting workload types
cql-pytest: add basic tests for service level workload types
docs: describe workload types for service levels
sys_dist_ks: fix redundant parsing in get_service_level
sys_dist_ks: make get_service_level exception-safe
transport: start shedding requests during potential overload
client_state: hook workload type from service levels
cql3: add listing service level workload type
cql3: add persisting service level workload type
qos: add workload_type service level parameter
It is forbidden to create a secondary index on a column which includes in
any way the "duration" type. This includes a UDT which includes a duration.
In this case our code attempted to print the message "Secondary indexes
are not supported on UDTs containing durations" - but because we tested
for tuples first, and UDTs are also tuples, we got the message about
tuples.
By changing the order of the tests, we get the most specific (and
useful) error message.
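Schematically, the reordering (an illustrative sketch, not the actual
validation code; messages approximate):

```cpp
namespace sketch {

struct column_type {
    bool is_tuple = false;       // note: UDTs are also tuples in the type hierarchy
    bool is_user_type = false;
    bool contains_duration = false;
};

const char* index_rejection_message(const column_type& t) {
    if (!t.contains_duration) {
        return nullptr;   // no duration involved, nothing to reject here
    }
    // Test the more specific case (UDT) before the general one (tuple), so a
    // UDT containing a duration gets the UDT-specific message.
    if (t.is_user_type) {
        return "Secondary indexes are not supported on UDTs containing durations";
    }
    if (t.is_tuple) {
        return "Secondary indexes are not supported on tuples containing durations";
    }
    return "Secondary indexes are not supported on duration columns";
}

} // namespace sketch
```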
Fixes #8724.
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Message-Id: <20210526201042.642550-1-nyh@scylladb.com>
The routine used for getting service level information already
operates on the service level name, but the same information is
also parsed once more from a row of an internal table.
This parsing is redundant, so it's hereby removed.
The purpose of the class in question is to start the sharded storage
service to make its global instance alive. I don't know when exactly
it happened, but no code that instantiates this wrapper really needs
the global storage service.
Ref: #2795
tests: unit(dev), perf_sstable(dev)
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Message-Id: <20210526170454.15795-1-xemul@scylladb.com>
This commit implements the following overload prevention heuristic:
if the admission queue becomes full, a timer is armed for 50ms.
If any of the ongoing requests finishes, the timer is disarmed,
but if that doesn't happen, the server goes into shedding mode,
which means that it reads new requests from the socket and immediately
drops them until one of the ongoing requests finishes.
This heuristic is not recommended for OLAP workloads,
so it is applied only if the session declared itself as
interactive (via the service level's workload_type parameter).
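Schematically (an illustrative sketch, not the actual cql_server code):

```cpp
#include <chrono>
#include <seastar/core/timer.hh>

namespace sketch {

class overload_shedder {
    seastar::timer<> _timer;
    bool _shedding = false;
public:
    overload_shedder() {
        _timer.set_callback([this] { _shedding = true; });
    }
    // Called when the admission queue becomes full.
    void on_queue_full() {
        if (!_timer.armed()) {
            _timer.arm(std::chrono::milliseconds(50));
        }
    }
    // Called whenever any ongoing request finishes.
    void on_request_finished() {
        _timer.cancel();
        _shedding = false;
    }
    // Consulted for every new request read from the socket; only honored for
    // sessions whose service level declares an interactive workload type.
    bool should_shed() const { return _shedding; }
};

} // namespace sketch
```

Dropping already-read requests (rather than queueing them) keeps the
coordinator's memory and latency bounded while the overload lasts.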