scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-18 22:02:08 +00:00

Author	SHA1	Message	Date
Mikołaj Grzebieluch	8b1f5ba293	api/error_injection: add message_injection endpoint Add an endpoint for sending empty messages to the injected code.	2023-07-06 12:34:53 +02:00
Gleb Natapov	05aa07835d	storage_service: delete code that handled REMOVING_TOKENS state The state is never advertised so the code is never used.	2023-05-25 14:48:09 +03:00
Kefu Chai	b112a3b78a	api: storage_service: use string for generation in this change, the type of the "generation" field of "sstable" in the return value of RESTful API entry point at "/storage_service/sstable_info" is changed from "long" to "string". this change depends on the corresponding change on tools/jmx submodule, so we have to include the submodule change in this very commit. this API is used by our JMX exporter, which in turn exposes the SSTable information via the "StorageService.getSSTableInfo" mBean operation, which returns the retrieved SSTable info as a list of CompositeData. and "generation" is a field of an element in the CompositeData. in general, the scylla JMX exporter is consumed by the nodetool, which prints out returned SSTable info list with a pretty formatted table, see tools/java/src/java/org/apache/cassandra/tools/nodetool/SSTableInfo.java. the nodetool's formatter is not aware of the schema or type of the SSTables to be printed, neither does it enforce the type -- it just tries it best to pretty print them as a tabular. But the fields in CompositeData is typed, when the scylla JMX exporter translates the returned SSTables from the RESTful API, it sets the typed fields of every `SSTableInfo` when constructing `PerTableSSTableInfo`. So, we should be consistent on the type of "generation" field on both the JMX and the RESTful API sides. because we package the same version of scylla-jmx and nodetool in the same precompiled tarball, and enforce the dependencies on exactly same version when shipping deb and rpm packages, we should be safe when it comes to interoperability of scylla-jmx and scylla. also, as explained above, nodetool does not care about the typing, so it is not a problem on nodetool's front. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13834	2023-05-15 20:33:48 +03:00
Raphael S. Carvalho	abc1eae1c2	Add API to disable tombstone GC in compaction Adding new APIs /column_family/tombstone_gc and /storage_service/tombstone_gc. Mimics existing APIs /column_family/autocompaction and /storage_service/autocompaction. The column_family variant must specify a single table only, following existing convention, whereas the storage_service one can specify an entire keyspace, or a subset of tables in a keyspace. column_family API usage ----- The table name must be in keyspace:name format Get status: curl -s -X GET "http://127.0.0.1:10000/column_family/tombstone_gc/ks:cf" Enable GC: curl -s -X POST "http://127.0.0.1:10000/column_family/tombstone_gc/ks:cf" Disable GC: curl -s -X DELETE "http://127.0.0.1:10000/column_family/tombstone_gc/ks:cf" storage_service API usage ----- Tables can be specified using a comma-separated list. Enable GC on keyspace: curl -s -X POST "http://127.0.0.1:10000/storage_service/tombstone_gc/ks" Disable GC on keyspace: curl -s -X DELETE "http://127.0.0.1:10000/storage_service/tombstone_gc/ks" Enable GC on a subset of tables: curl -s -X POST "http://127.0.0.1:10000/storage_service/tombstone_gc/ks?cf=table1,table2" Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-05-12 10:34:38 -03:00
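The curl calls in commit abc1eae1c2 follow a regular pattern: the HTTP method selects the action and the path selects the scope. A minimal Python sketch of that mapping is given here. The helper names `column_family_request` and `storage_service_request` are hypothetical and are not part of Scylla; the base URL is the one used in the commit's own examples. The sketch only builds the method and URL, it does not contact a server.

```python
BASE_URL = "http://127.0.0.1:10000"  # address taken from the commit's curl examples


def column_family_request(action, keyspace, table, base_url=BASE_URL):
    """Hypothetical helper: (method, url) for the single-table tombstone_gc API."""
    # The commit maps GET/POST/DELETE to status/enable/disable.
    methods = {"status": "GET", "enable": "POST", "disable": "DELETE"}
    # The table name must be in keyspace:name format.
    return methods[action], f"{base_url}/column_family/tombstone_gc/{keyspace}:{table}"


def storage_service_request(action, keyspace, tables=(), base_url=BASE_URL):
    """Hypothetical helper: (method, url) for the keyspace-level tombstone_gc API."""
    methods = {"enable": "POST", "disable": "DELETE"}
    url = f"{base_url}/storage_service/tombstone_gc/{keyspace}"
    if tables:
        # A subset of tables is passed as a comma-separated "cf" query parameter.
        url += "?cf=" + ",".join(tables)
    return methods[action], url


if __name__ == "__main__":
    print(column_family_request("status", "ks", "cf"))
    print(storage_service_request("enable", "ks", ["table1", "table2"]))
```

The returned pair can be handed to any HTTP client; keeping URL construction separate from the request makes the mapping easy to check without a running node.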
Aleksandra Martyniuk	5d826f13e7	api: move get_and_update_ttl to task manager api Task ttl can be set with task manager test api, which is disabled in release mode. Move get_and_update_ttl from task manager test api to task manager api, so that it can be used in release mode. Closes #12894	2023-02-17 10:19:06 +02:00
Aleksandra Martyniuk	6b79c92cb7	api: get task statuses recursively Sometimes to debug some task manager module, we may want to inspect the whole tree of descendants of some task. To make it easier, an api call getting a list of statuses of the requested task and all its descendants in BFS order is added.	2023-01-11 12:34:06 +01:00
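Commit 6b79c92cb7 describes returning the statuses of a task and all its descendants in BFS order. The traversal itself can be sketched in Python as follows. This is an illustration only, not the Scylla implementation: the `children` mapping (task id to list of child ids) and the function name are assumptions made for the example.

```python
from collections import deque


def task_ids_in_bfs_order(root_id, children):
    """Return root_id and all its descendants in breadth-first order.

    `children` is a hypothetical mapping from a task id to the ids of its
    direct child tasks; tasks without children may be absent from it.
    """
    order = []
    queue = deque([root_id])
    while queue:
        task_id = queue.popleft()
        order.append(task_id)
        # Children are enqueued behind everything already waiting, so a whole
        # level is visited before the next level starts.
        queue.extend(children.get(task_id, []))
    return order


if __name__ == "__main__":
    tree = {"a": ["b", "c"], "b": ["d"], "c": ["e"]}
    print(task_ids_in_bfs_order("a", tree))
```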
Aleksandra Martyniuk	ee13a5dde8	api: extend status in task manager api Status of tasks returned with get_task_status and wait_task is extended with the list of ids of child tasks.	2022-12-21 10:54:56 +01:00
Aleksandra Martyniuk	697af4ccf2	api: extend get_tasks in task manager api Each task stats in a list returned from tm::get_task api call is extended with info about: task type, keyspace, table, entity, and sequence number.	2022-12-21 10:54:50 +01:00
Aleksandra Martyniuk	f0b2b00a15	api: delete unused type parameter from task_manager_test api	2022-12-15 10:50:30 +01:00
Botond Dénes	139fbb466e	Merge 'Task manager extension' from Aleksandra Martyniuk The PR adds changes to task manager that allow more convenient integration with modules. Introduced changes: - adds internal flag in task::impl that allows user to filter too specific tasks - renames `parent_data` to more appropriate name `task_info` - creates `tasks/types.hh` which allows using some types connected with task manager without the necessity to include whole task manager - adds more flexible version of `make_task` method Closes #11821 * github.com:scylladb/scylladb: tasks: add alternative make_task method tasks: rename parent_data to task_info and move it tasks: move task_id to tasks/types.hh tasks: add internal flag for task_manager::task::impl	2022-10-31 09:57:10 +02:00
Benny Halevy	335a8cc362	api: doc: remove_node: improve summary The current summary of the operation is obscure. It refers to a token in the ring and the endpoint associated with it, while the operation uses a host_id to identify a whole node. Instead, clarify the summary to refer to a node in the cluster, consistent with the description for the host_id parameter. Also, describe the effect the call has on the data the removed node logically owned. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-10-28 07:52:37 +03:00
Benny Halevy	9ef2631ec2	api, service: storage_service: removenode: allow passing ignore_nodes as uuid:s Currently the api is inconsistent: requiring a uuid for the host_id of the node to be removed, while the ignored nodes list is given as comma-separated ip addresses. Instead, support identifying the ignored_nodes either by their host_id (uuid) or ip address. Also, require all ignore_nodes to be of the same kind: either UUIDs or ip addresses, as a mix of the 2 is likely indicating a user error. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-10-28 07:49:03 +03:00
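The rule in commit 9ef2631ec2, that ignore_nodes entries may be host IDs or IP addresses but not a mix of both, can be illustrated with a short Python sketch. The function names are hypothetical and the parsing relies on Python's standard `uuid` and `ipaddress` modules; Scylla's own validation may differ in detail.

```python
import ipaddress
import uuid


def classify_node(node):
    """Return "host_id" for a UUID string or "ip" for an IP address."""
    try:
        uuid.UUID(node)
        return "host_id"
    except ValueError:
        pass
    try:
        ipaddress.ip_address(node)
        return "ip"
    except ValueError:
        raise ValueError(f"not a UUID or IP address: {node!r}")


def parse_ignore_nodes(param):
    """Split a comma-separated ignore_nodes value and reject mixed kinds."""
    nodes = [n.strip() for n in param.split(",") if n.strip()]
    kinds = {classify_node(n) for n in nodes}
    if len(kinds) > 1:
        # A mix of UUIDs and IP addresses most likely indicates a user error.
        raise ValueError("ignore_nodes must be all host IDs or all IP addresses")
    return nodes


if __name__ == "__main__":
    print(parse_ignore_nodes("127.0.0.3,127.0.0.5"))
```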
Aleksandra Martyniuk	e2e8a286cc	tasks: add internal flag for task_manager::task::impl It is convenient to create many different tasks implementations representing more and more specific parts of the operation in a module. Presenting all of them through the api makes it cumbersome for user to navigate and track, though. Flag internal is added to task_manager::task::impl so that the tasks could be filtered before they are sent to user.	2022-10-26 14:01:05 +02:00
Piotr Sarna	481240b8b4	Merge 'Alternator: Run more TTL tests by default (and add a test for metrics)' from Nadav Har'El We had quite a few tests for Alternator TTL in test/alternator, but most of them did not run as part of the usual Jenkins test suite, because they were considered "very slow" (and require a special "--runveryslow" flag to run). In this series we enable six tests which run quickly enough to run by default, without an additional flag. We also make them even quicker - the six tests now take around 2.5 seconds. I also noticed that we don't have a test for the Alternator TTL metrics - and added one. Fixes #11374. Refs https://github.com/scylladb/scylla-monitoring/issues/1783 Closes #11384 * github.com:scylladb/scylladb: test/alternator: insert test names into Scylla logs rest api: add a new /system/log operation alternator ttl: log warning if scan took too long. alternator,ttl: allow sub-second TTL scanning period, for tests test/alternator: skip fewer Alternator TTL tests test/alternator: test Alternator TTL metrics	2022-09-22 09:47:50 +02:00
Nadav Har'El	a81310e23d	rest api: add a new /system/log operation Add a new REST API operation, taking a log level and a message, and printing it into the Scylla log. This can be useful when a test wants to mark certain positions in the log (e.g., to see which other log messages we get between the two positions). An alternative way to achieve this could have been for the test to write directly into the log file - but an on-disk log file is only one of the logging options that Scylla supports, and the approach in this patch allows adding log messages regardless of how Scylla keeps the logs. The motivation for this feature is that in the following patch the test/alternator framework will add log messages when starting and ending tests, which can help debug test failures. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2022-09-12 10:32:56 +03:00
Aleksandra Martyniuk	42f36db55b	task_manager: test api layer A test api that helps with testing the task manager api. It can be used to simulate the operations that can happen on modules and their tasks. Through the api the user can: register and unregister the test module and the tasks belonging to the module, and finish the tasks with success or a custom error.	2022-09-09 14:29:28 +02:00
Aleksandra Martyniuk	07043cee68	task_manager: api layer The task manager api layer. It can be used to list the modules registered in task_manager, list tasks belonging to the given module, abort, wait for or retrieve a status of the given task.	2022-09-09 14:29:28 +02:00
Igor Ribeiro Barbosa Duarte	a23c3d6338	api: Add API for resetting authorization cache For cases where we have very high values set to permissions_cache validity and update interval (E.g.: 1 day), whenever a change to permissions is made it's necessary to update scylla config and decrease these values, since waiting for all this time to pass wouldn't be viable. This patch adds an API for resetting the authorization cache so that changing the config won't be mandatory for these cases. Usage: $ curl -X POST http://localhost:10000/authorization_cache/reset Signed-off-by: Igor Ribeiro Barbosa Duarte <igor.duarte@scylladb.com>	2022-06-28 19:58:06 -03:00
Michael Livshin	28d44ce6db	api-doc: correct spelling Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2022-06-15 11:30:58 +03:00
Benny Halevy	10b86ee5bd	api: storage_service: take_snapshot: improve api help messages Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-05-10 10:02:47 +03:00
Michael Livshin	c96708d262	add support for the ME sstable format The ME format has been introduced in Cassandra 3.11.11: `11952fae77/src/java/org/apache/cassandra/io/sstable/format/big/BigFormat.java (L123)` `d84c6e9810` It adds originating host id to sstable metadata in support of fixing loss of commit log data when moving sstables between nodes: https://issues.apache.org/jira/browse/CASSANDRA-16619 In Scylla: * The supported way to ingest sstables is via upload/, where stored commit log replay position should be disregarded (but see https://github.com/scylladb/scylla/issues/10080). * A later commit in this series implements originating host id validation for native ME sstables. Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2022-02-16 18:21:24 +02:00
Benny Halevy	f6431824a7	api: add keyspace_offstrategy_compaction Perform offstrategy compaction via the REST API with a new `keyspace_offstrategy_compaction` option. This is useful for performing offstrategy compaction post repair, after repairing all token ranges. Otherwise, offstrategy compaction will only be auto-triggered after a 5 minutes idle timeout. Like major compaction, the api call returns the offstrategy compaction task future, so it's waited on. The `long` result counts the number of tables that required offstrategy compaction. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-01-30 20:40:39 +02:00
Benny Halevy	6805ce5bd9	api: compaction_manager: add stop_keyspace_compaction Allow stopping compaction by type on a given keyspace and list of tables. Add respective rest_api test. Fixes #9700 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-12-09 14:40:13 +02:00
Benny Halevy	71c95faeee	api: compaction_manager: stop_compaction: fix type description List only the compaction types we support stopping. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-12-09 14:17:38 +02:00
Benny Halevy	cc122984d6	compaction: scrub: add quarantine_mode option Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-12-05 18:29:04 +02:00
Alejo Sanchez	0a63e72fa4	api: (minor) fix typo bool instead of boolean In definition for /column_family/major_compaction/{name} there is an incorrect use of "bool" instead of "boolean". Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com> Closes #9516	2021-10-27 12:25:59 +02:00
Avi Kivity	a7ef826c2b	Merge "Fold validation compaction into scrub" from Botond " Validation compaction -- although I still maintain that it is a good descriptive name -- was an unfortunate choice for the underlying functionality because Origin has burned the name already as it uses it for a compaction type used during repair. This opens the door for confusion for users coming from Cassandra who will associate Validation compaction with the purpose it is used for in Origin. Additionally, since Origin's validation compaction was not user initiated, it didn't have a corresponding `nodetool` command to start it. Adding such a command would create an operational difference between us and Origin. To avoid all this we fold validation compaction into scrub compaction, under a new "validation" mode. I decided against using the also suggested `--dry-mode` flag as I feel that a new mode is a more natural choice, we don't have to define how it interacts with all the other modes, unlike with a `--dry-mode` flag. Fixes: #7736 Tests: unit(dev), manual(REST API) " * 'scrub-validation-mode/v2' of https://github.com/denesb/scylla: compaction/compaction_descriptor: add comment to Validation compaction type compaction/compaction_descriptor: compaction_options: remove validate api: storage_service: validate_keyspace -> scrub_keyspace (validate mode) compaction/compaction_manager: hide perform_sstable_validation() compaction: validation compaction -> scrub compaction (validate mode) compaction/compaction_descriptor: compaction_options: add options() accessor compaction/compaction_descriptor: compaction_options::scrub::mode: add validate	2021-08-10 12:18:35 +03:00
Piotr Dulikowski	7e3966c03e	api: add HTTP API for hint sync points Adds HTTP endpoints for manipulating hint sync points: - /hinted_handoff/sync_point (POST) - creates a new sync point for hints towards nodes listed in the `target_hosts` parameter - /hinted_handoff/sync_point (GET) - checks the status of the sync point. If a non-zero `timeout` parameter is given, it waits until the sync point is reached or the timeout expires.	2021-08-09 09:24:36 +02:00
Botond Dénes	c1203618eb	api: storage_service: validate_keyspace -> scrub_keyspace (validate mode) Fold validate keyspace into scrub keyspace (validate mode).	2021-08-05 07:36:45 +03:00
Botond Dénes	b0ef57c833	api: storage_service: expose validation compaction	2021-07-12 10:25:15 +03:00
Benny Halevy	4169f56407	api: storage_service/snapshots: add sf (skip_flush) option Note: I tried adding the option and calling it "skip_flush" but I couldn't make it work with scylla-jmx, hence it's called by the abbreviated name - "sf". Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-06-02 17:20:19 +03:00
Botond Dénes	550a1cd036	api: storage_service/keyspace_scrub: expose new segregate mode Allow invoking scrub with the newly added segregate mode as well.	2021-05-05 14:35:04 +03:00
Botond Dénes	34643ac997	api: /storage_service/keyspace_scrub: add scrub mode param Add direct support to the newly added scrub mode enum. Instead of the legacy `skip_corrupted` flag, one can now select the desired mode from the mode enum. `skip_corrupted` is still supported for backwards compatibility but it is ignored when the mode enum is set.	2021-05-05 12:03:42 +03:00
Avi Kivity	0af7a22c21	repair: remove partition_checksum and related code `80ebedd242` made row-level repair mandatory, so there remain no callers to partition_checksum. Remove it. Closes #8537	2021-04-22 18:56:53 +03:00
Ivan Prisyazhnyy	778d9217f3	tracing: api: fast mode doc improvement Signed-off-by: Ivan Prisyazhnyy <ivan@scylladb.com>	2021-03-30 16:22:56 +02:00
Piotr Wojtczak	c1daf2bb24	column_family: Make toppartitions queries more generic Right now toppartitions can only be invoked on one column family at a time. This change introduces a natural extension to this functionality, allowing to specify a list of families. We provide three ways for filtering in the query parameter "name_list": 1. A specific column family to include in the form "ks:cf" 2. A keyspace, telling the server to include all column families in it. Specified by omitting the cf name, i.e. "ks:" 3. All column families, which is represented by an empty list The list can include any amount of one or both of the 1. and 2. option. Fixes #4520 Closes #7864	2021-03-24 17:54:05 +02:00
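The three filtering forms that commit c1daf2bb24 lists for "name_list" ("ks:cf", "ks:", and the empty list) can be sketched as a small matcher in Python. The function names and the tuple representation are assumptions for illustration; only the three forms themselves come from the commit message.

```python
def parse_name_list(name_list):
    """Turn entries such as "ks:cf" or "ks:" into (keyspace, table-or-None) pairs."""
    filters = []
    for entry in name_list:
        keyspace, sep, table = entry.partition(":")
        if not sep or not keyspace:
            raise ValueError(f"expected 'ks:cf' or 'ks:', got {entry!r}")
        # An omitted table name ("ks:") selects every table in the keyspace.
        filters.append((keyspace, table or None))
    return filters


def is_selected(keyspace, table, filters):
    """Return True if the table is covered by the parsed filters."""
    if not filters:
        # An empty list stands for all column families.
        return True
    return any(ks == keyspace and (cf is None or cf == table) for ks, cf in filters)


if __name__ == "__main__":
    filters = parse_name_list(["ks1:cf1", "ks2:"])
    print(is_selected("ks1", "cf1", filters), is_selected("ks1", "cf2", filters))
```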
Ivan Prisyazhnyy	7cbe2aa9c6	tracing: rest api for lightweight slow query tracing The patch adds REST API support for the lightweight slow query tracing (fast) mode that is implemented by omitting all of the trace events during the tracing. $ curl -v http://localhost:10000/storage_service/slow_query $ curl -v --request POST http://localhost:10000/storage_service/slow_query\?fast=true\&enable=true Signed-off-by: Ivan Prisyazhnyy <ivan@scylladb.com>	2021-03-18 15:05:05 +02:00
Asias He	61ac8d03b9	repair: Add ignore_nodes option In some cases, user may want to repair the cluster, ignoring the node that is down. For example, run repair before run removenode operation to remove a dead node. Currently, repair will ignore the dead node and keep running repair without the dead node but report the repair is partial and report the repair is failed. It is hard to tell if the repair is failed only due to the dead node is not present or some other errors. In order to exclude the dead node, one can use the hosts option. But it is hard to understand and use, because one needs to list all the "good" hosts including the node itself. It will be much simpler, if one can just specify the node to exclude explicitly. In addition, we support ignore nodes option in other node operations like removenode. This change makes the interface to ignore a node explicitly more consistent. Refs: #7806 Closes #8233	2021-03-09 16:03:13 +01:00
Tomasz Grabiec	761f89e55e	api: Introduce system/drop_sstable_caches RESTful API Evicts objects from caches which reflect sstable content, like the row cache. In the future, it will also drop the page cache and sstable index caches. Unlike lsa/compact, doesn't cause reactor stalls. The old lsa/compact call invokes memory reclamation, which is non-preemptible. It also compacts LSA segments, so does more work. Some use cases don't need to compact LSA segments, just want the row cache to be wiped. Message-Id: <20210301120211.36195-1-tgrabiec@scylladb.com>	2021-03-01 16:13:04 +02:00
Piotr Sarna	d395305ddd	api: fix retrieving replied RPC messages The API call referred to a nonexistent callback, which is now renamed to better match the API path and actually implemented. Message-Id: <3d0dbb42f67e1584999a58da9aa9cc722487fda1.1612279443.git.sarna@scylladb.com>	2021-02-03 09:42:17 +02:00
Asias He	4d32d03172	storage_service: Introduce load_and_stream === Introduction === This feature extends the nodetool refresh to allow loading arbitrary sstables that do not belong to a node into the cluster. It loads the sstables from disk and calculates the owning nodes of the data and streams to the owners automatically. For example, say the old cluster has 6 nodes and the new cluster has 3 nodes. We can copy the sstables from the old cluster to any of the new nodes and trigger the load and stream process. This can make restores and migrations much easier. === Performance === I managed to get 40MB/s per shard on my build machine. CPU: AMD Ryzen 7 1800X Eight-Core Processor DISK: Samsung SSD 970 PRO 512GB Assume 1TB sstables per node, each shard can do 40MB/s, each node has 32 shards, we can finish the load and stream 1TB of data in 13 mins on each node. 1TB / 40 MB per shard * 32 shard / 60 s = 13 mins === Tests === backup_restore_tests.py:TestBackupRestore.load_and_stream_to_new_cluster_test which creates a cluster with 4 nodes and inserts data, then uses load_and_stream to restore to a 2 nodes cluster. === Usage === curl -X POST "http://{ip}:10000/storage_service/sstables/{keyspace}?cf={table}&load_and_stream=true" === Notes === Btw, with the old nodetool refresh, the node will not pick up the data that does not belong to this node but it will not delete it either. One has to run nodetool cleanup to remove those data manually which is a surprise to me and probably to users as well. With load and stream, the process will delete the sstables once it finishes stream, so no nodetool cleanup is needed. The name of this feature load and stream follows load and store in CPU world. Fixes #7831	2021-01-18 16:32:33 +08:00
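The 13-minute estimate in commit 4d32d03172 reads most naturally as 1 TB divided by the aggregate node throughput (40 MB/s per shard times 32 shards), converted to minutes. The Python sketch below reproduces that arithmetic, assuming decimal units (1 MB = 10^6 bytes, 1 TB = 10^12 bytes); the commit does not say which unit convention was meant, and the function name is hypothetical.

```python
MB = 10**6   # assumption: decimal megabyte
TB = 10**12  # assumption: decimal terabyte


def load_and_stream_minutes(data_bytes, per_shard_bytes_per_sec, shards):
    """Estimate minutes needed to stream data_bytes when all shards work in parallel."""
    node_bytes_per_sec = per_shard_bytes_per_sec * shards
    seconds = data_bytes / node_bytes_per_sec
    return seconds / 60


# Figures from the commit message: 1 TB per node, 40 MB/s per shard, 32 shards.
minutes = load_and_stream_minutes(1 * TB, 40 * MB, 32)

if __name__ == "__main__":
    print(f"{minutes:.1f} minutes")  # about 13 minutes, matching the commit
```

With binary units the result would be somewhat higher, so the figure is best treated as a rough estimate.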
Asias He	829b4c1438	repair: Make removenode safe by default Currently removenode works like below: - The coordinator node advertises the node to be removed in REMOVING_TOKEN status in gossip - Existing nodes learn the node in REMOVING_TOKEN status - Existing nodes sync data for the range it owns - Existing nodes send notification to the coordinator - The coordinator node waits for notification and announce the node in REMOVED_TOKEN Current problems: - Existing nodes do not tell the coordinator if the data sync is ok or failed. - The coordinator can not abort the removenode operation in case of error - Failed removenode operation will make the node to be removed in REMOVING_TOKEN forever. - The removenode runs in best effort mode which may cause data consistency issues. It means if a node that owns the range after the removenode operation is down during the operation, the removenode node operation will continue to succeed without requiring that node to perform data syncing. This can cause data consistency issues. For example, Five nodes in the cluster, RF = 3, for a range, n1, n2, n3 is the old replicas, n2 is being removed, after the removenode operation, the new replicas are n1, n5, n3. If n3 is down during the removenode operation, only n1 will be used to sync data with the new owner n5. This will break QUORUM read consistency if n1 happens to miss some writes. Improvements in this patch: - This patch makes the removenode safe by default. We require all nodes in the cluster to participate in the removenode operation and sync data if needed. We fail the removenode operation if any of them is down or fails. If the user want the removenode operation to succeed even if some of the nodes are not available, the user has to explicitly pass a list of nodes that can be skipped for the operation. 
$ nodetool removenode --ignore-dead-nodes <list_of_dead_nodes_to_ignore> <host_id> Example restful api: $ curl -X POST "http://127.0.0.1:10000/storage_service/remove_node/?host_id=7bd303e9-4c7b-4915-84f6-343d0dbd9a49&ignore_nodes=127.0.0.3,127.0.0.5" - The coordinator can abort data sync on existing nodes For example, if one of the nodes fails to sync data. It makes no sense for other nodes to continue to sync data because the whole operation will fail anyway. - The coordinator can decide which nodes to ignore and pass the decision to other nodes Previously, there is no way for the coordinator to tell existing nodes to run in strict mode or best effort mode. Users will have to modify config file or run a restful api cmd on all the nodes to select strict or best effort mode. With this patch, the cluster wide configuration is eliminated. Fixes #7359 Closes #7626	2020-12-10 10:14:39 +02:00
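The "safe by default" rule in commit 829b4c1438 says that every remaining node must take part in the removenode operation unless it is explicitly listed as ignored. A rough Python sketch of that rule follows. The function name, the node labels, and the use of plain sets are assumptions for illustration; the actual implementation lives in Scylla's C++ code and is not shown in this log.

```python
def nodes_required_for_removenode(cluster_nodes, node_to_remove, live_nodes, ignore_nodes=()):
    """Return the nodes that must sync data, or raise if one of them is down.

    Hypothetical model of the rule: all nodes except the one being removed
    and those explicitly ignored must be alive for the operation to proceed.
    """
    required = set(cluster_nodes) - {node_to_remove} - set(ignore_nodes)
    unavailable = sorted(required - set(live_nodes))
    if unavailable:
        raise RuntimeError(
            "removenode refused; nodes down and not ignored: " + ", ".join(unavailable)
        )
    return sorted(required)


if __name__ == "__main__":
    cluster = ["n1", "n2", "n3", "n4", "n5"]
    live = ["n1", "n4", "n5"]  # n3 is down, n2 is the node being removed
    print(nodes_required_for_removenode(cluster, "n2", live, ignore_nodes=["n3"]))
```

Without `ignore_nodes=["n3"]` the same call raises, which mirrors the commit's point that skipping a dead node has to be an explicit decision by the user.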
Piotr Wojtczak	c09ab3b869	api: Add cardinality to toppartitions results This change enhances the toppartitions api to also return the cardinality of the read and write sample sets. It now uses the size() method of space_saving_top_k class, counting the unique operations in the sampled set for up to the given capacity. Fixes #4089 Closes #7766	2020-12-08 09:38:59 +01:00
Asias He	0a3a2a82e1	api: Add force_remove_endpoint for gossip It is used to force remove a node from gossip membership if something goes wrong. Note: run the force_remove_endpoint api at the same time on _all_ the nodes in the cluster in order to prevent the removed nodes from coming back, because nodes that have not run the force_remove_endpoint api cmd can gossip the removed node's information to other nodes within 2 * ring_delay (2 * 30 seconds by default). For instance, in a 3-node cluster where node 3 is decommissioned, to remove node 3 from gossip membership prior to the auto removal (3 days by default), run the api cmd on both node 1 and node 2 at the same time. $ curl -X POST --header "Accept: application/json" "http://127.0.0.1:10000/gossiper/force_remove_endpoint/127.0.0.3" $ curl -X POST --header "Accept: application/json" "http://127.0.0.2:10000/gossiper/force_remove_endpoint/127.0.0.3" Then run 'nodetool gossipinfo' on all the nodes to check that the removed nodes are not present. Fixes #2134 Closes #5436	2020-11-29 13:58:46 +02:00
Piotr Dulikowski	6465dd160b	storage_proxy: fix wrong return type in swagger The GET `hinted_handoff_enabled_by_dc` endpoint had an incorrect return type specified. Although it does not have an implementation, yet, it was supposed to return a list of strings with DC names for which generating hints is enabled - not a list of string pairs. Such return type is expected by the JMX.	2020-11-17 10:24:43 +01:00
Pekka Enberg	a37eaaa022	sstables: Add support for the "md" format enum value Add the sstable_version_types::md enum value and logically extend sstable_version_types comparisons to cover also the > sstable_version_types::mc cases. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-08-10 18:53:04 +03:00
Asias He	271fac56a3	repair: Add synchronous API to query repair status This new api blocks until the repair job has finished, failed, or timed out. E.g., - Without timeout curl -X GET "http://127.0.0.1:10000/storage_service/repair_status/?id=123" - With timeout curl -X GET "http://127.0.0.1:10000/storage_service/repair_status/?id=123&timeout=5" The timeout is in seconds. The current asynchronous api returns immediately even if the repair is in progress. E.g., curl -X GET "http://127.0.0.1:10000/storage_service/repair_async/ks?id=123" Users can use the new synchronous API to avoid repeatedly polling to check whether the repair job has finished. Fixes #6445	2020-07-14 11:20:15 +03:00
Avi Kivity	e5be3352cf	database, streaming, messaging: drop streaming memtables Before Scylla 3.0, we used to send streaming mutations using individual RPC requests and flush them together using dedicated streaming memtables. This mechanism is no longer in use and all versions that use it have long reached end-of-life. Remove this code.	2020-06-25 15:25:54 +02:00
Juliusz Stasiewicz	aadd2ffa6a	api: Added command `/storage_service/cdc_streams_check_and_repair` This commit introduces a placeholder for HTTP POST request at `/storage_service/cdc_streams_check_and_repair`.	2020-05-29 12:23:08 +02:00
Ivan Prisyazhnyy	84e25e8ba4	api: support table auto compaction control The patch implements: - /storage_service/auto_compaction API endpoint - /column_family/autocompaction/{name} API endpoint Those APIs allow to control and request the status of background compaction jobs for the existing tables. The implementation introduces the table::_compaction_disabled_by_user. Then the CompactionManager checks if it can push the background compaction job for the corresponding table. New members === table::enable_auto_compaction(); table::disable_auto_compaction(); bool table::is_auto_compaction_disabled_by_user() const Test === Tests: unit(sstable_datafile_test autocompaction_control_test), manual $ ninja build/dev/test/boost/sstable_datafile_test $ ./build/dev/test/boost/sstable_datafile_test --run_test=autocompaction_control_test -- -c1 -m2G --overprovisioned --unsafe-bypass-fsync 1 --blocked-reactor-notify-ms 2000000 The test tries to submit a compaction job after playing with autocompaction control table switch. However, there is no reliable way to hook pending compaction task. The code assumed that with_scheduling_group() closure will never preempt execution of the stats check. Revert === Reverts commit `c8247ac`. In previous version the execution sometimes resulted into the following error: test/boost/sstable_datafile_test.cc(1076): fatal error: in "autocompaction_control_test": critical check cm->get_stats().pending_tasks == 1 \|\| cm->get_stats().active_tasks == 1 has failed This version adds a few sstables to the cf, starts the compaction and awaits until it is finished. API change === - `/column_family/autocompaction/` always returned `true` while answering to the question: if the autocompaction disabled (see https://github.com/scylladb/scylla-jmx/blob/master/src/main/java/org/apache/cassandra/db/ColumnFamilyStore.java#L321). now it answers to the question: if the autocompaction for specific table is enabled. The question logic is inverted. 
The patch to the JMX is required. However, the change is decent because all old values were invalid (it always reported all compactions are disabled). - `/column_family/autocompaction/` got support for POST/DELETE per table Fixes === Fixes #1488 Fixes #1808 Fixes #440 Signed-off-by: Ivan Prisyazhnyy <ivan@scylladb.com> Reviewed-by: Glauber Costa <glauber@scylladb.com>	2020-05-07 16:23:38 +03:00


157 Commits