scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-31 03:56:42 +00:00

Author	SHA1	Message	Date
Asias He	829b4c1438	repair: Make removenode safe by default Currently removenode works like below: - The coordinator node advertises the node to be removed in REMOVING_TOKEN status in gossip - Existing nodes learn the node in REMOVING_TOKEN status - Existing nodes sync data for the range it owns - Existing nodes send notification to the coordinator - The coordinator node waits for notification and announce the node in REMOVED_TOKEN Current problems: - Existing nodes do not tell the coordinator if the data sync is ok or failed. - The coordinator can not abort the removenode operation in case of error - Failed removenode operation will make the node to be removed in REMOVING_TOKEN forever. - The removenode runs in best effort mode which may cause data consistency issues. It means if a node that owns the range after the removenode operation is down during the operation, the removenode node operation will continue to succeed without requiring that node to perform data syncing. This can cause data consistency issues. For example, Five nodes in the cluster, RF = 3, for a range, n1, n2, n3 is the old replicas, n2 is being removed, after the removenode operation, the new replicas are n1, n5, n3. If n3 is down during the removenode operation, only n1 will be used to sync data with the new owner n5. This will break QUORUM read consistency if n1 happens to miss some writes. Improvements in this patch: - This patch makes the removenode safe by default. We require all nodes in the cluster to participate in the removenode operation and sync data if needed. We fail the removenode operation if any of them is down or fails. If the user want the removenode operation to succeed even if some of the nodes are not available, the user has to explicitly pass a list of nodes that can be skipped for the operation. $ nodetool removenode --ignore-dead-nodes <list_of_dead_nodes_to_ignore> <host_id> Example restful api: $ curl -X POST "http://127.0.0.1:10000/storage_service/remove_node/?host_id=7bd303e9-4c7b-4915-84f6-343d0dbd9a49&ignore_nodes=127.0.0.3,127.0.0.5" - The coordinator can abort data sync on existing nodes For example, if one of the nodes fails to sync data. It makes no sense for other nodes to continue to sync data because the whole operation will fail anyway. - The coordinator can decide which nodes to ignore and pass the decision to other nodes Previously, there is no way for the coordinator to tell existing nodes to run in strict mode or best effort mode. Users will have to modify config file or run a restful api cmd on all the nodes to select strict or best effort mode. With this patch, the cluster wide configuration is eliminated. Fixes #7359 Closes #7626	2020-12-10 10:14:39 +02:00
Piotr Wojtczak	c09ab3b869	api: Add cardinality to toppartitions results This change enhances the toppartitions api to also return the cardinality of the read and write sample sets. It now uses the size() method of space_saving_top_k class, counting the unique operations in the sampled set for up to the given capacity. Fixes #4089 Closes #7766	2020-12-08 09:38:59 +01:00
Asias He	0a3a2a82e1	api: Add force_remove_endpoint for gossip It is used to force remove a node from gossip membership if something goes wrong. Note: run the force_remove_endpoint api at the same time on _all_ the nodes in the cluster in order to prevent the removed nodes come back. Becasue nodes without running the force_remove_endpoint api cmd can gossip around the removed node information to other nodes in 2 * ring_delay (2 * 30 seconds by default) time. For instance, in a 3 nodes cluster, node 3 is decommissioned, to remove node 3 from gossip membership prior the auto removal (3 days by default), run the api cmd on both node 1 and node 2 at the same time. $ curl -X POST --header "Accept: application/json" "http://127.0.0.1:10000/gossiper/force_remove_endpoint/127.0.0.3" $ curl -X POST --header "Accept: application/json" "http://127.0.0.2:10000/gossiper/force_remove_endpoint/127.0.0.3" Then run 'nodetool gossipinfo' on all the nodes to check the removed nodes are not present. Fixes #2134 Closes #5436	2020-11-29 13:58:46 +02:00
Piotr Dulikowski	6465dd160b	storage_proxy: fix wrong return type in swagger The GET `hinted_handoff_enabled_by_dc` endpoint had an incorrect return type specified. Although it does not have an implementation, yet, it was supposed to return a list of strings with DC names for which generating hints is enabled - not a list of string pairs. Such return type is expected by the JMX.	2020-11-17 10:24:43 +01:00
Pekka Enberg	a37eaaa022	sstables: Add support for the "md" format enum value Add the sstable_version_types::md enum value and logically extend sstable_version_types comparisons to cover also the > sstable_version_types::mc cases. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-08-10 18:53:04 +03:00
Asias He	271fac56a3	repair: Add synchronous API to query repair status This new api blocks until the repair job is either finished or failed or timeout. E.g., - Without timeout curl -X GET http://127.0.0.1:10000/storage_service/repair_status/?id=123 - With timeout curl -X GET http://127.0.0.1:10000/storage_service/repair_status/?id=123&timeout=5 The timeout is in second. The current asynchronous api returns immediately even if the repair is in progress. E.g., curl -X GET http://127.0.0.1:10000/storage_service/repair_async/ks?id=123 User can use the new synchronous API to avoid keep sending the query to poll if the repair job is finished. Fixes #6445	2020-07-14 11:20:15 +03:00
Avi Kivity	e5be3352cf	database, streaming, messaging: drop streaming memtables Before Scylla 3.0, we used to send streaming mutations using individual RPC requests and flush them together using dedicated streaming memtables. This mechanism is no longer in use and all versions that use it have long reached end-of-life. Remove this code.	2020-06-25 15:25:54 +02:00
Juliusz Stasiewicz	aadd2ffa6a	api: Added command `/storage_service/cdc_streams_check_and_repair` This commit introduces a placeholder for HTTP POST request at `/storage_service/cdc_streams_check_and_repair`.	2020-05-29 12:23:08 +02:00
Ivan Prisyazhnyy	84e25e8ba4	api: support table auto compaction control The patch implements: - /storage_service/auto_compaction API endpoint - /column_family/autocompaction/{name} API endpoint Those APIs allow to control and request the status of background compaction jobs for the existing tables. The implementation introduces the table::_compaction_disabled_by_user. Then the CompactionManager checks if it can push the background compaction job for the corresponding table. New members === table::enable_auto_compaction(); table::disable_auto_compaction(); bool table::is_auto_compaction_disabled_by_user() const Test === Tests: unit(sstable_datafile_test autocompaction_control_test), manual $ ninja build/dev/test/boost/sstable_datafile_test $ ./build/dev/test/boost/sstable_datafile_test --run_test=autocompaction_control_test -- -c1 -m2G --overprovisioned --unsafe-bypass-fsync 1 --blocked-reactor-notify-ms 2000000 The test tries to submit a compaction job after playing with autocompaction control table switch. However, there is no reliable way to hook pending compaction task. The code assumed that with_scheduling_group() closure will never preempt execution of the stats check. Revert === Reverts commit `c8247ac`. In previous version the execution sometimes resulted into the following error: test/boost/sstable_datafile_test.cc(1076): fatal error: in "autocompaction_control_test": critical check cm->get_stats().pending_tasks == 1 \|\| cm->get_stats().active_tasks == 1 has failed This version adds a few sstables to the cf, starts the compaction and awaits until it is finished. API change === - `/column_family/autocompaction/` always returned `true` while answering to the question: if the autocompaction disabled (see https://github.com/scylladb/scylla-jmx/blob/master/src/main/java/org/apache/cassandra/db/ColumnFamilyStore.java#L321). now it answers to the question: if the autocompaction for specific table is enabled. The question logic is inverted. The patch to the JMX is required. However, the change is decent because all old values were invalid (it always reported all compactions are disabled). - `/column_family/autocompaction/` got support for POST/DELETE per table Fixes === Fixes #1488 Fixes #1808 Fixes #440 Signed-off-by: Ivan Prisyazhnyy <ivan@scylladb.com> Reviewed-by: Glauber Costa <glauber@scylladb.com>	2020-05-07 16:23:38 +03:00
Pekka Enberg	c8247aced6	Revert "api: support table auto compaction control" This reverts commit `1c444b7e1e`. The test it adds sometimes fails as follows: test/boost/sstable_datafile_test.cc(1076): fatal error: in "autocompaction_control_test": critical check cm->get_stats().pending_tasks == 1 \|\| cm->get_stats().active_tasks == 1 has failed Ivan is working on a fix, but let's revert this commit to avoid blocking next promotion failing from time to time.	2020-04-11 17:56:02 +03:00
Ivan Prisyazhnyy	1c444b7e1e	api: support table auto compaction control This patch adds API endpoint /column_family/autocompaction/{name} that listen to GET and POST requests to pick and control table background compactions. To implement that the patch introduces "_compaction_disabled_by_user" flag that affects if CompactionManager is allowed to push background compactions jobs into the work. It introduces table::enable_auto_compaction(); table::disable_auto_compaction(); bool table::is_auto_compaction_disabled_by_user() const to control auto compaction state. Fixes #1488 Fixes #1808 Fixes #440 Tests: unit(sstable_datafile_test autocompaction_control_test), manual	2020-04-08 21:18:38 +03:00
Ivan Prisyazhnyy	5ec7e77b2e	api: /column_family/major_compaction/{keyspace:table} implementation This implements support for triggering major compations through the REST API. Please note that "split_output" is not supported and Glauber Costa confirmed this this is fine: "We don't support splits, nor do I think we should." Signed-off-by: Ivan Prisyazhnyy <ivan@scylladb.com>	2020-03-23 13:48:29 +02:00
Piotr Sarna	331ddf41e5	api: add error injection to REST API Simple REST API for error injection is implemented. The API allow the following operations: * injecting an error at given injection name * listing injections * disabling an injection * disabling all injections Currently the API enables/disables on all shards. Closes #3295 Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2020-03-20 20:49:03 +01:00
Konstantin Osipov	94ee511f6a	lwt: implement cas_failed_read_round_optimization metric Presently lightweight transactions piggy back the old row value on prepare round response. If one of the participants did not provide the old value or the values from peers don't match, we perform a full read round which will repair the Paxos table and the base table, if necessary, at all participants. Capture the fact that read optimization has failed in a metric. Message-Id: <20200304192955.84208-2-kostja@scylladb.com>	2020-03-05 12:20:45 +01:00
Amnon Heiman	6b020e67ce	api/storage_service: Support specifying a table when deleting a snapshot This patch adds an optional parameter to DELETE /storage_service/snapshots After this patch the following will be supported: If a keyspace called keyspace1 and a table called standard1 exists. curl -X POST 'http://localhost:10000/storage_service/snapshots?tag=am1&kn=keyspace1' curl -X DELETE --header 'Accept: application/json' 'http://localhost:10000/storage_service/snapshots?tag=am1&kn=keyspace1&cf=standard1' Fixes #5658 Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2020-02-18 16:34:10 +02:00
Amnon Heiman	f43285f39a	api: replace swagger definition to use long instead of int (#5380 ) In swagger 1.2 int is defined as int32. We originally used int following the jmx definition, in practice internally we use uint and int64 in many places. While the API format the type correctly, an external system that uses swagger-based code generator can face a type issue problem. This patch replace all use of int in a return type with long that is defined as int64. Changing the return type, have no impact on the system, but it does help external systems that use code generator from swagger. Fixes #5347 Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2019-12-11 12:48:29 +02:00
Glauber Costa	73aff1fc95	api: export system uptime via REST This will be useful for tools like nodetool that want to query the uptime of the system. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20190619110850.14206-1-glauber@scylladb.com>	2019-11-20 16:44:11 +02:00
Vladimir Davydov	e8bcb34ed4	api: drop /storage_proxy/metrics/cas_read/condition_not_met There's no such metric in Cassandra (although Cassadra's docs mistakenly say it exists). Having it would make no sense anyway so let's drop it. Message-Id: <b4f7a6ad278235c443cb8ea740bfa6399f8e4ee1.1570434332.git.vdavydov@scylladb.com>	2019-10-07 16:54:39 +03:00
Calle Wilund	298da3fc4b	api/storage_service: Add "sstable_info" command Assembles information and attributes of sstables in one or more column families. v2: * Use (not really legal) nested "type" in json * Rename "table" param to "cf" for consistency * Some comments on data sizes * Stream result to avoid huge string allocations on final json	2019-08-06 08:14:15 +00:00
Amnon Heiman	1c6dec139f	API: compaction_manager add get pending tasks by table The pending tasks by table name API return an array of pending tasks by keyspace/table names. After this patch the following command would work: curl -X GET 'http://localhost:10000/compaction_manager/metrics/pending_tasks_by_table' Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2019-07-12 19:21:26 +03:00
Amnon Heiman	f3b6c5fe2f	API: storage_proxy add CAS and View endpoints Some nodetool command in 3.0 uses the CAS and View metrics. CAS is not implemented and we don't have all the metrics for View but we still don't want those nodetool commands to fail. After this patch the following would work and will return empty: curl -X GET --header 'Accept: application/json' 'http://localhost:10000/storage_proxy/metrics/cas_read/moving_average_histogram' curl -X GET --header 'Accept: application/json' 'http://localhost:10000/storage_proxy/metrics/view_write/moving_average_histogram' curl -X GET --header 'Accept: application/json' 'http://localhost:10000/storage_proxy/metrics/cas_write/moving_average_histogram' This patch is needed for #4416 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <20190521141235.20856-1-amnon@scylladb.com>	2019-05-22 14:25:17 +03:00
Rafi Einstein	197f38d4ee	nodetool toppartitions: Toppartitions query REST API A HTTP GET operation starts the query (with args: ks/cf name and duration in ms). It executes synchroneously, results are returned as JSON: $ curl -s -X GET http://localhost:10000/column_family/toppartitions/ks:cf1?duration=10000 \| jq { "read": [ { "count": "15", "error": "0", "partition": "4b504d39354f37353131" }, { "count": "15", "error": "0", "partition": "3738313134394d353530" } ], "write": [ { "count": "15", "error": "0", "partition": "4b504d39354f37353131" }, { "count": "15", "error": "0", "partition": "3738313134394d353530" } ] } Signed-off-by: Rafi Einstein <rafie@scylladb.com>	2018-12-28 16:45:57 +02:00
Glauber Costa	98332de268	api: use longs instead of ints for snapshot sizes Int types in json will be serialized to int types in C++. They will then only be able to handle 4GB, and we tend to store more data than that. Without this patch, listsnapshots is broken in all versions. Fixes: #3845 Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20181012155902.7573-1-glauber@scylladb.com>	2018-10-12 21:17:24 +03:00
Amnon Heiman	cc5601d000	api: column_family.json make the get_sstables_for_key doc clearer This patch makes it clearer that the key that get_sstables_for_key refers to, is a partition key.	2018-06-10 16:13:01 +03:00
Avi Kivity	4419e60207	Merge "Add a confiugration API" from Amnon " The configuration API is part of scylla v2 configuration. It uses the new definition capabilities of the API to dynamically create the swagger definition for the configuration. This mean that the swagger will contain an entry with description and type for each of the config value. To get the v2 of the swager file: http://localhost:10000/v2 If using with swagger ui, change http://localhost:10000/api-doc to http://localhost:10000/v2 It takes longer to load because the file is much bigger now. " * 'amnon/config_api_v5' of github.com:scylladb/seastar-dev: Explanation about the API V2 API: add the config API as part of the v2 API. Defining the config api	2018-03-28 12:45:17 +03:00
Amnon Heiman	6d907e43e0	Defining the config api The config API is created dynamically from the config. This mean that the swagger definition file will contain the description and types based on the configuration. The config.json file is used by the code generator to define a path that is used to register the handler function. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2018-03-28 12:41:55 +03:00
Duarte Nunes	ff15068a41	service/storage_service: Allow querying the view build status This patch adds support for the nodetool viewbuildstatus command, which shows the progress of a materialized view build across the cluster. A view can be absent from the result, successfully built, or currently being built. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-03-27 01:20:10 +01:00
Amnon Heiman	827723cec8	API: Add get active repair api This patch adds an API to return an array of the ids of current active repairs. After this patch a call to: curl http://localhost:10000/storage_service/active_repair/ Will return the active repairs ids Fixes #3193 Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2018-02-14 11:43:41 +02:00
Amnon Heiman	4ccf76c62b	Adding the header part of the swagger2.0 API In Swagger 2.0 all the API is exported as a single file. The header part of the file, contains general information. It is stored as an external file so it will be easy to modify when needed. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2018-01-21 14:00:27 +02:00
Asias He	6dc62c6215	api: Add force_terminate_repair API The api /storage_service/force_terminate is supposed to be /storage_service/force_terminate_repair. scylla-jmx uses /storage_service/force_terminate api. So instead of renaming it, it is better to add a new name for it.	2017-08-30 15:19:51 +08:00
Calle Wilund	0181fc8159	api::cache_service: Add (dummy) calls for key&counter metrics	2016-11-08 12:22:04 +00:00
Calle Wilund	5eb54f9bc4	api::storage_service: c3 compat - make query keyspaces a trinary choice all, user or non-local strategy ones.	2016-11-08 12:22:04 +00:00
Calle Wilund	3b7a7dd383	api::failure_detector: c3 compat - add endpoint phi value query	2016-11-08 12:22:04 +00:00
Calle Wilund	f9836cd23b	api::endpoint_snitch: c3 compat - allow dc/rack query for broadcast	2016-11-08 12:22:04 +00:00
Calle Wilund	54ba06a8bf	api::column_family: Add calls/parameters for c3 compatibility	2016-11-08 12:22:04 +00:00
Amnon Heiman	c8082ccadb	API: fix a type in storage_proxy This patch fixes a typo in the URL definition, causing the metric in the jmx not to find it. Fixes #1821 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <1478563869-20504-1-git-send-email-amnon@scylladb.com>	2016-11-08 11:09:21 +02:00
Amnon Heiman	ed1d02b1a3	API: Add slow query API definition This adds the GET and POST api for slow query logging. The GET return an object with the enable, ttl and threshold and the POST lets you configure each of them. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-09-03 01:15:15 +03:00
Amnon Heiman	56ea8c943e	API: add scylla release version API This adds a definition to the scylla release version. The API already return the compatibility version (ie. the compatible origin version) This definition returns the scylla version, a call to the API should return the same result as running scylla --version. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-07-03 16:26:21 +03:00
Amnon Heiman	4d7837af40	API Definition: collectd to support enable disable This adds to the definition of the collectd API the ability to turn on and off specific collectd metrics. For the GET end point a POST option was added that allow to enable or disable a metric. The general GET endpoint now returns the enable flag that indicates if the metric is enable. Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <1466932139-19264-2-git-send-email-amnon@scylladb.com>	2016-06-26 12:26:48 +03:00
Amnon Heiman	b33ed48527	API Definition: change cache_service, column_family and storage_proxy to use rate objects This patch replaces the latency histogram to rate_moving_avrage_and_histogram and the counters to rate_moving_average. The old endpoints where left unchagned but marked as depricated when needed.	2016-05-17 11:55:06 +03:00
Amnon Heiman	7e07d97e4b	API utils: Adding rate moving avrage rate_moving_average and rate_moving_average_and_histogram are type that are used by the JMX. They are based on the yammer meter and timer and are used to collect derivative information. Specificlly: rate_moving_average calculate rates and rate_moving_average_and_histogram collect rates and histogram. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-05-16 11:40:19 +03:00
Nadav Har'El	f9ee74f56f	repair: options for repairing only a subrange To implement nodetool's "--start-token"/"--end-token" feature, we need to be able to repair only part of the ranges held by this node. Our REST API already had a "ranges" option where the tool can list the specific ranges to repair, but using this interface in the JMX implementation is inconvenient, because it requires the Java code to be able to intersect the given start/end token range with the actual ranges held by the repaired node. A more reasonable approach, which this patch uses, is to add new "startToken"/"endToken" options to the repair's REST API. What these options do is is to find the node's token ranges as usual, and only then intersect them with the user-specified token range. The JMX implementation becomes much simpler (in a separate patch for scylla-jmx) and the real work is done in the C++ code, where it belongs, not in Java code. With the additional scylla-jmx patch to use the new REST API options provided here, this fixes #917. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <1455807739-25581-1-git-send-email-nyh@scylladb.com>	2016-02-18 17:13:56 +02:00
Asias He	77684a5d4c	messaging_service: Drop STREAM_INIT_MESSAGE The verb is not used anymore. Message-Id: <1453719054-29584-1-git-send-email-asias@scylladb.com>	2016-01-25 12:53:08 +02:00
Asias He	53c6cd7808	gossip: Rename echo verb to gossip_echo It is used by gossip only. I really could not allow myself to get along this inconsistence. Change before we still can. Message-Id: <1453719054-29584-2-git-send-email-asias@scylladb.com>	2016-01-25 12:53:07 +02:00
Raphael S. Carvalho	5cceb7d249	api: fix paramType of parameter of stop_compaction Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-01-19 23:15:18 -02:00
Amnon Heiman	9be42bfd7b	API: Add version to application state in failure_detection The upstream of origin adds the version to the application_state in the get_endpoints in the failure detector. In our implementation we return an object to the jmx proxy and the proxy do the string formatting. This patch adds the version to the return object which is both useful as an API and will allow the jmx proxy to add it to its output when we move forward with the jmx version. Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <1448962889-19611-1-git-send-email-amnon@scylladb.com>	2016-01-19 10:23:56 +02:00
Pekka Enberg	422cff5e00	api/messaging_service: Fix heap-buffer-overflows in set_messaging_service() Fix various issues in set_messaging_service() that caused heap-buffer-overflows when JMX proxy connects to Scylla API: - Off-by-one error in 'num_verb' definition - Call to initializer list std::vector constructor variant that caused the vector to be two elements long. - Missing verb definitions from the Swagger definition that caused response vector to be too small. Spotted by ASan. Message-Id: <1453125439-16703-1-git-send-email-penberg@scylladb.com>	2016-01-18 15:43:29 +01:00
Amnon Heiman	6942b41693	API: rename the map of string, double to map_string_double This replaces the confusing name to a more meaningful name. Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <1451466952-405-1-git-send-email-amnon@scylladb.com>	2016-01-03 19:10:49 +02:00
Gleb Natapov	2bcfe02ee6	messaging: remove unused verbs	2015-12-30 15:06:35 +01:00
Pekka Enberg	0aa105c9cf	Merge "load report a negative value" from Amnon "This series solve an issue with the load broadcaster that reports negative values due to an integer wrap around. While fixing this issue an additional change was made so that the load_map would return doubles and not formatted string. This is a better API, safer and better documented."	2015-12-30 10:21:55 +02:00

1 2 3

116 Commits