scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-21 17:10:35 +00:00

Author	SHA1	Message	Date
Pavel Emelyanov	b6e1e6df64	misc_services: Introduce load_meter There's a lonely get_load_map() call on storage_service that needs only load broadcaster, always runs on shard 0 and that's it. Next patch will move this whole stuff into its own helper no-shard container and this is preparation for this. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-13 13:53:08 +03:00
Pavel Emelyanov	998f51579a	storage_service: Rip join_ring config option The option in question apparently does not work: several sharded objects are start()-ed (and thus instantiated) in join_token_ring, while instances themselves of these objects are used during init of other stuff. This leads to broken seastar local_is_initialized assertion on sys_dist_ks, but reading the code shows more examples, e.g. the auth_service is started on join, but is used for thrift and cql servers initialization. The suggestion is to remove the option instead of fixing. The is_joined logic is kept since on-start joining still can take some time and it's safer to report real status from the API. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20191203140717.14521-1-xemul@scylladb.com>	2019-12-18 12:45:13 +02:00
Piotr Dulikowski	48f7b2e4fb	table: move out table::stats to table_stats This change was done in order to be able to forward-declare the table::stats structure.	2019-11-12 13:35:41 +01:00
Asias He	f876580740	storage_service: Reject nodetool cleanup when there is pending ranges From Shlomi: 4 node cluster Node A, B, C, D (Node A: seed) cassandra-stress write n=10000000 -pop seq=1..10000000 -node <seed-node> cassandra-stress read duration=10h -pop seq=1..10000000 -node <seed-node> while read is progressing Node D: nodetool decommission Node A: nodetool status node - wait for UL Node A: nodetool cleanup (while decommission progresses) I get the error on c-s once decommission ends java.io.IOException: Operation x0 on key(s) [383633374d31504b5030]: Data returned was not validated The problem is when a node gets new ranges, e.g, the bootstrapping node, the existing nodes after a node is removed or decommissioned, nodetool cleanup will remove data within the new ranges which the node just gets from other nodes. To fix, we should reject the nodetool cleanup when there is pending ranges on that node. Note, rejecting nodetool cleanup is not a full protection because new ranges can be assigned to the node while cleanup is still in progress. However, it is a good start to reject until we have full protection solution. Refs: #5045	2019-10-23 19:20:36 +08:00
Calle Wilund	298da3fc4b	api/storage_service: Add "sstable_info" command Assembles information and attributes of sstables in one or more column families. v2: * Use (not really legal) nested "type" in json * Rename "table" param to "cf" for consistency * Some comments on data sizes * Stream result to avoid huge string allocations on final json	2019-08-06 08:14:15 +00:00
Benny Halevy	3749148339	storage_service: fix handling of load_new_sstables exception ignore_ready_future in load_new_ss_tables broke migration_test:TestMigration_with_*.migrate_sstable_with_counter_test_expect_fail dtests. The java.io.NotSerializableException in nodetool was caused by exceptions that were too long. This fix prints the problematic file names onto the node system log and includes the cause in the resulting exception so as to provide the user with information about the nature of the error. Fixes #4375 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190331154006.12808-1-bhalevy@scylladb.com>	2019-04-02 11:46:19 +03:00
Benny Halevy	956cb2e61c	storage_service: handle load_new_sstables exception Refs #3117 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-03-28 14:54:56 +02:00
Calle Wilund	ef1bdebd0a	api::storage_service: Implement "scrub"	2019-03-06 13:13:21 +00:00
Calle Wilund	23f4c982ea	api/storage_service: Implement "upgradesstables" Fixes #4245 Implemented as a compaction barrier (forcing previous compactions to finish) + parameterized "cleanup", with sstable list based on parameters.	2019-03-06 13:13:21 +00:00
Calle Wilund	3b5588dddd	api::storage_service: Add keyspace + tables helper To avoid repeating code to get keyspace + tables	2019-03-06 13:13:21 +00:00
Avi Kivity	775b7e41f4	Update seastar submodule * seastar d59fcef...b924495 (2): > build: Fix protobuf generation rules > Merge "Restructure files" from Jesse Includes fixup patch from Jesse: " Update Seastar `#include`s to reflect restructure All Seastar header files are now prefixed with "seastar" and the configure script reflects the new locations of files. Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com> Message-Id: <5d22d964a7735696fb6bb7606ed88f35dde31413.1542731639.git.jhaberku@scylladb.com> "	2018-11-21 00:01:44 +02:00
Avi Kivity	da17c29bd3	api: convert sprint() to format() sprint() recently became more strict, throwing on sprint("%s", 5). Replace with the more modern format(). Mechanically converted with https://github.com/avikivity/unsprint.	2018-11-01 13:16:17 +00:00
Amnon Heiman	ab207356a5	API: storage_service stream endpoints This patch changes how the list of tokens is returned from the storage_service API. Instead of creating a vector and constructing a json object from it, it uses the streaming capabilities of the http layer. This is important for large clusters and prevents large allocations. Fixes #3701 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <20180820195631.26792-1-amnon@scylladb.com>	2018-08-22 11:24:38 +03:00
Asias He	4a0b561376	storage_service: Get rid of moving operation The moving operation changes a node's token to a new token. It is supported only when a node has one token. The legacy moving operation was useful in the early days before vnodes were introduced, when a node had only one token. I don't think it is useful anymore. In the future, we might support adjusting the number of vnodes to rebalance the token range each node owns. Removing it simplifies the cluster operation logic and code. Fixes #3475 Message-Id: <144d3bea4140eda550770b866ec30e961933401d.1533111227.git.asias@scylladb.com>	2018-08-01 11:18:17 +03:00
Duarte Nunes	ff15068a41	service/storage_service: Allow querying the view build status This patch adds support for the nodetool viewbuildstatus command, which shows the progress of a materialized view build across the cluster. A view can be absent from the result, successfully built, or currently being built. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-03-27 01:20:10 +01:00
Pekka Enberg	bd365a10d3	Merge "Add an API to get all active repairs" from Amnon "This series adds an API to return the active repairs by their IDs. After this series a call to: curl -X GET --header "Accept: application/json" "http://localhost:10000/storage_service/active_repair/" Will return an array with the ids of the active repairs. Fixes #3193" * 'amnon/get_active_repairs_v3' of github.com:scylladb/seastar-dev: API: Add get active repair api repair: Add a get_active_repairs function to return the active repair	2018-02-19 15:32:17 +02:00
Amnon Heiman	827723cec8	API: Add get active repair api This patch adds an API to return an array of the ids of current active repairs. After this patch a call to: curl http://localhost:10000/storage_service/active_repair/ Will return the active repairs ids Fixes #3193 Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2018-02-14 11:43:41 +02:00
Amnon Heiman	449f9af0db	API: Use stream_range_as_array to return token endpoints The token_to_endpoint map can get so big that trying to convert it to a vector causes a large allocation warning. This patch replaces the implementation, so the returned json array is created directly from the map by using the stream_range_as_array helper function. Fixes #3185 Message-Id: <20180207153306.30921-1-amnon@scylladb.com>	2018-02-12 15:24:07 +02:00
Amnon Heiman	3ec84a0b1d	API tokens_endpoint: use streams Returning token_endpoints when there are many tokens and end points can take a long time. This patch uses output stream to return the result. Instead of returning a vector, it uses the streaming functionality in json layer. Fixes #2476 Message-Id: <20180103081907.5175-1-amnon@scylladb.com>	2018-01-03 11:11:49 +02:00
Amnon Heiman	8d668a9dc0	API: storage_service repair_async_status to return proper error code This patch changes the implementation of storage_service repair_async_status to throw an exception, so that a 400 return code is returned. Fixes #2794 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <20170917080533.6612-1-amnon@scylladb.com>	2017-09-18 09:08:26 +03:00
Avi Kivity	9b540eccb0	database: remove dependency on compaction.hh and compaction_manager.hh	2017-09-11 20:09:45 +03:00
Asias He	471e8b341f	repair: Support termination of repair jobs This patch implements the missing API to terminate all repairs. For example: $ curl -X POST --header "Accept: application/json" "http://127.0.0.2:10000/storage_service/force_terminate_repair" With the new stream_plan::abort() api we can now abort the stream session associated with the repair as well. Fixes #2105	2017-08-30 15:19:52 +08:00
Asias He	6dc62c6215	api: Add force_terminate_repair API The api /storage_service/force_terminate is supposed to be /storage_service/force_terminate_repair. scylla-jmx uses /storage_service/force_terminate api. So instead of renaming it, it is better to add a new name for it.	2017-08-30 15:19:51 +08:00
Amnon Heiman	6c1858b275	API:storage_service should support metrics load Following the C* API there are two APIs for getting the load from storage_service: /storage_service/metrics/load /storage_service/load This patch adds the implementation for /storage_service/metrics/load The alternative is to drop one of the APIs and modify the JMX implementation to use the same API. Fixes #2245 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <20170401181520.19506-1-amnon@scylladb.com>	2017-04-05 18:14:19 +03:00
Nadav Har'El	d49aa7abd2	storage_service: make is_joined() an immediate function Commit `d41cd48a` made the is_joined() method a future<bool> because only cpu 0 knows its real value. This makes this function inconvenient to use. So this patch reverts commit `d41cd48a`, and instead sets this flag's value on all shards, so each shard can read its value locally (and immediately). Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20161228160450.5831-1-nyh@scylladb.com>	2016-12-28 18:37:22 +02:00
Vlad Zolotarov	9606db2f08	api::set_tracing_probability: prevent a server from returning 500 for a bad probability value - Change an exception type thrown by a tracing::tracing::set_trace_probability() to make it different from the one thrown by an std::stod() when it fails to parse a given string. - Catch the std::out_of_range exception thrown by a tracing::tracing::set_trace_probability() and wrap the exception string into the httpd::bad_param_exception() object. - Throw a httpd::bad_param_exception() with a "Bad format in a probability value: <a user given probability string value>" message if std::invalid_argument is caught. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com> Message-Id: <1465300738-1557-1-git-send-email-vladz@cloudius-systems.com>	2016-12-27 12:07:09 +02:00
Calle Wilund	5eb54f9bc4	api::storage_service: c3 compat - make query keyspaces a trinary choice all, user or non-local strategy ones.	2016-11-08 12:22:04 +00:00
Tomasz Grabiec	c1a7e2090e	Revert "database: change find_column_families signature so it returns a lw_shared_ptr" This reverts commit `f3528ede65`.	2016-11-04 10:48:21 +01:00
Glauber Costa	f3528ede65	database: change find_column_families signature so it returns a lw_shared_ptr There are places in which we need to use the column family object many times, with deferring points in between. Because the column family may have been destroyed in the deferring point, we need to go and find it again. If we use lw_shared_ptr, however, we'll be able to at least guarantee that the object will be alive. Some users will still need to check, if they want to guarantee that the column family wasn't removed. But others that only need to make sure we don't access an invalid object will be able to avoid the cost of re-finding it just fine. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <722bf49e158da77ff509372c2034e5707706e5bf.1478111467.git.glauber@scylladb.com>	2016-11-03 13:27:31 +01:00
Vlad Zolotarov	006999f46c	api::storage_service::slow_query: don't use duration_cast in GET The slow_query_record_ttl() and slow_query_threshold() return the duration of the appropriate type already - no need for an additional cast. In addition there was a mistake in a cast of ttl. Fixes #1734 Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com> Message-Id: <1475669400-5925-1-git-send-email-vladz@cloudius-systems.com>	2016-10-09 18:09:13 +03:00
Amnon Heiman	11c687dd93	API: Add slow query logging implementation This adds the implementation for the slow query logging API. After this patch the following will be available: curl -X GET "http://localhost:10000/storage_service/slow_query" curl -X POST "http://localhost:10000/storage_service/slow_query?enable=true&ttl=10&threshold=6000" Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-09-03 01:15:22 +03:00
Amnon Heiman	e66a1cd705	API: Add implementation for the scylla release version This adds the implementation to the scylla release version API. After this patch a call to: curl -X GET "http://localhost:10000/storage_service/scylla_release_version" Will return the current scylla release version. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-07-03 16:29:09 +03:00
Vlad Zolotarov	0611417c76	api::storage_service: add set_trace_probability/get_trace_probability Trace probability defines a probability for the next CQL command to be traced. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-06-06 15:44:28 +03:00
Asias He	1e84699a64	api: Wire up storage_service removal_status and force_remove_completion They are used by nodetool removenode: $ nodetool removenode force $ nodetool removenode status For example: $ nodetool removenode status RemovalStatus: Removing token (-8969872965815280276). Waiting for replication confirmation from [127.0.0.3,127.0.0.1]. $ nodetool removenode force RemovalStatus: No token removals in process. Tested with: 1) - start 3 nodes - inject data with cassandra-stress write no-warmup cl=TWO n=2000000 -schema 'replication(factor=2)' - kill -9 node2 - wait for node2 to be in DOWN state - run nodetool removenode host2_host_id on node1 2) - start 3 nodes - inject data with cassandra-stress write no-warmup cl=TWO n=2000000 -schema 'replication(factor=2)' - kill -9 node2 - wait for node2 to be in DOWN state - run nodetool removenode host2_host_id on node1 - kill -9 node3 - nodetool removenode will wait forever since node3 is gone, node3 will never send the replication confirmation to node1 - run nodetool removenode force on node1 nodetool removenode completes with the following error: $ nodetool removenode 31690b82-ebb0-4594-8bcf-1ce82b6e0f6e nodetool: Scylla API server HTTP POST to URL '/storage_service/remove_node' failed: nodetool removenode force is called by user nodetool removenode force completes successfully $ nodetool removenode force RemovalStatus: Removing token (-9171569494049085776). Waiting for replication confirmation from [127.0.0.3,127.0.0.1]. Fixes 1135.	2016-04-13 14:53:28 +08:00
Asias He	891e947314	storage_service: Rename remove_node to removenode nodetool uses removenode command to remove a node. Rename the implementation in storage_service to match the command.	2016-04-13 14:53:28 +08:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Vlad Zolotarov	2cd836a02e	api::set_storage_service(): fix the 'nodetool enablebackup' API 'nodetool enable/disablebackup' callback was modifying only the existing keyspaces and column families configurations. However new keyspaces/column families were using the original 'incremental_backups' configuration value which could be different from the value configured by 'nodetool enable/disablebackup' user command. This patch updates the database::_enable_incremental_backups per-shard value in addition to updating the existing keyspaces and column families configurations. Fixes #845 Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-03-06 17:26:31 +02:00
Nadav Har'El	f9ee74f56f	repair: options for repairing only a subrange To implement nodetool's "--start-token"/"--end-token" feature, we need to be able to repair only part of the ranges held by this node. Our REST API already had a "ranges" option where the tool can list the specific ranges to repair, but using this interface in the JMX implementation is inconvenient, because it requires the Java code to be able to intersect the given start/end token range with the actual ranges held by the repaired node. A more reasonable approach, which this patch uses, is to add new "startToken"/"endToken" options to the repair's REST API. What these options do is to find the node's token ranges as usual, and only then intersect them with the user-specified token range. The JMX implementation becomes much simpler (in a separate patch for scylla-jmx) and the real work is done in the C++ code, where it belongs, not in Java code. With the additional scylla-jmx patch to use the new REST API options provided here, this fixes #917. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <1455807739-25581-1-git-send-email-nyh@scylladb.com>	2016-02-18 17:13:56 +02:00
Raphael S. Carvalho	a53cfc8127	compaction manager: add support to wait for termination of cleanup 'nodetool cleanup' must wait for termination of cleanup, however, cleanup is handled asynchronously. To solve that, a mechanism is added here to wait for termination of a cleanup. This mechanism uses a promise to notify the waiter of cleanup completion. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <6dc0a39170f3f51487fb8858eb443573548d8bce.1455655016.git.raphaelsc@scylladb.com>	2016-02-18 17:01:18 +02:00
Amnon Heiman	e33710d2ca	API: storage_service get_logging_level This patch adds the get_logging_level command that returns a map between the log name and its level. To test the API do: curl -X GET "http://localhost:10000/storage_service/logging_level" This enables the `nodetool getlogginglevels` command. Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <1453365106-27294-3-git-send-email-amnon@scylladb.com>	2016-01-21 11:58:54 +02:00
Raphael S. Carvalho	fc6a1934b0	api: implement force_keyspace_cleanup This will add support for a user to clean up an entire keyspace or some of its column families. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-01-12 03:53:22 -02:00
Pekka Enberg	67ccd05bbe	api/storage_service: Wire up 'compaction_throughput_mb_per_sec' The API is needed by nodetool compactionstats command.	2016-01-05 13:01:05 +02:00
Amnon Heiman	6942b41693	API: rename the map of string, double to map_string_double This replaces the confusing name to a more meaningful name. Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <1451466952-405-1-git-send-email-amnon@scylladb.com>	2016-01-03 19:10:49 +02:00
Pekka Enberg	0aa105c9cf	Merge "load report a negative value" from Amnon "This series solves an issue with the load broadcaster that reports negative values due to an integer wrap around. While fixing this issue an additional change was made so that the load_map would return doubles and not a formatted string. This is a better API, safer and better documented."	2015-12-30 10:21:55 +02:00
Amnon Heiman	ec379649ea	API: repair to use documented params The repair API used to have an undocumented parameter list similar to origin. This patch changes the way repair is getting its parameters. Instead of one undocumented string it now lists all the different optional parameters in the swagger file and accepts them explicitly. Reviewed-by: Nadav Har'El <nyh@scylladb.com> Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2015-12-29 15:38:44 +02:00
Amnon Heiman	71905081b1	API: report the load map as an unformatted double In origin the storage_service reports the load map as a formatted string. As an API a better option is to report the load map as a double and let the JMX proxy do the formatting. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2015-12-29 11:55:34 +02:00
Paweł Dziepak	39a65e6294	api: enable storage_service::drain() Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-12-17 14:06:41 +01:00
Paweł Dziepak	8ee1a44720	storage_service: implement get_drain_progress() Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-12-17 14:06:40 +01:00
Asias He	e9a4d93d1b	storage_service: Fix added node not showing up in nodetool in status joining The get_token_endpoint API should return a map of tokens to endpoints, including the bootstrapping ones. Use get_local_storage_service().get_token_to_endpoint_map() for it. $ nodetool -p 7100 status Status=Up/Down \|/ State=Normal/Leaving/Joining/Moving -- Address Load Tokens Owns Host ID Rack UN 127.0.0.1 12645 256 ? eac5b6cf-5fda-4447-8104-a7bf3b773aba rack1 UN 127.0.0.2 12635 256 ? 2ad1b7df-c8ad-4cbc-b1f1-059121d2f0c7 rack1 UN 127.0.0.3 12624 256 ? 61f82ea7-637d-4083-acc9-567e0c01b490 rack1 UJ 127.0.0.4 ? 256 ? ced2725e-a5a4-4ac3-86de-e1c66cecfb8d rack1 Fixes #617	2015-12-09 10:43:51 +08:00
Amnon Heiman	cda79c9a31	API: add get_natural_endpoints to storage_service This adds the get_natural_endpoints implementation to the storage_service API. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2015-11-12 14:55:19 +02:00

127 Commits