scylladb

Author	SHA1	Message	Date
Amnon Heiman	a4be7afbb0	API: cache_capacity should use uint for summing Using integer as a type for the map_reduce causes number over overflow. Fixes #1801 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <1479299425-782-1-git-send-email-amnon@scylladb.com>	2016-11-16 13:55:46 +01:00
Calle Wilund	473326d49a	api/column_family: Make mean row size return integral As (at least) per C3, these metrics are integral in origin. Adapt. (Other option would be to translate in jmx).	2016-11-08 12:22:04 +00:00
Calle Wilund	0181fc8159	api::cache_service: Add (dummy) calls for key&counter metrics	2016-11-08 12:22:04 +00:00
Calle Wilund	5eb54f9bc4	api::storage_service: c3 compat - make query keyspaces a trinary choice all, user or non-local strategy ones.	2016-11-08 12:22:04 +00:00
Calle Wilund	3b7a7dd383	api::failure_detector: c3 compat - add endpoint phi value query	2016-11-08 12:22:04 +00:00
Calle Wilund	f9836cd23b	api::endpoint_snitch: c3 compat - allow dc/rack query for broadcast	2016-11-08 12:22:04 +00:00
Calle Wilund	54ba06a8bf	api::column_family: Add calls/parameters for c3 compatibility	2016-11-08 12:22:04 +00:00
Amnon Heiman	c8082ccadb	API: fix a type in storage_proxy This patch fixes a typo in the URL definition, causing the metric in the jmx not to find it. Fixes #1821 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <1478563869-20504-1-git-send-email-amnon@scylladb.com>	2016-11-08 11:09:21 +02:00
Tomasz Grabiec	c1a7e2090e	Revert "database: change find_column_families signature so it returns a lw_shared_ptr" This reverts commit `f3528ede65`.	2016-11-04 10:48:21 +01:00
Glauber Costa	f3528ede65	database: change find_column_families signature so it returns a lw_shared_ptr There are places in which we need to use the column family object many times, with deferring points in between. Because the column family may have been destroyed in the deferring point, we need to go and find it again. If we use lw_shared_ptr, however, we'll be able to at least guarantee that the object will be alive. Some users will still need to check, if they want to guarantee that the column family wasn't removed. But others that only need to make sure we don't access an invalid object will be able to avoid the cost of re-finding it just fine. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <722bf49e158da77ff509372c2034e5707706e5bf.1478111467.git.glauber@scylladb.com>	2016-11-03 13:27:31 +01:00
Vlad Zolotarov	006999f46c	api::storage_service::slow_query: don't use duration_cast in GET The slow_query_record_ttl() and slow_query_threshold() return the duration of the appropriate type already - no need for an additional cast. In addition there was a mistake in a cast of ttl. Fixes #1734 Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com> Message-Id: <1475669400-5925-1-git-send-email-vladz@cloudius-systems.com>	2016-10-09 18:09:13 +03:00
Raphael S. Carvalho	67343798cf	api: implement api to return sstable count per level 'nodetool cfstats' wasn't showing per-level sstable count because the API wasn't implemented. Fixes #1119. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <0dcdf9196eaec1692003fcc8ef18c77d0834b2c6.1474410770.git.raphaelsc@scylladb.com>	2016-09-21 09:13:40 +03:00
Paweł Dziepak	6373289532	Merge "Adding slow query API" from Amnon "This series adds an API for the slow query recording. After this series it will be possible to set the/get the slow query recording parameters."	2016-09-07 11:06:09 -07:00
Amnon Heiman	11c687dd93	API: Add slow query logging implementation This adds the implementation for the slow query logging API. After this patch the following will be available: curl -X GET "http://localhost:10000/storage_service/slow_query" curl -X POST "http://localhost:10000/storage_service/slow_query?enable=true&ttl=10&threshold=6000" Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-09-03 01:15:22 +03:00
Amnon Heiman	ed1d02b1a3	API: Add slow query API definition This adds the GET and POST api for slow query logging. The GET return an object with the enable, ttl and threshold and the POST lets you configure each of them. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-09-03 01:15:15 +03:00
Glauber Costa	4310635bae	move estimated histogram to utils Nothing sstable-specific in it, really. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2016-08-31 15:13:23 -04:00
Glauber Costa	ffc2131c51	decouple estimated_histogram from sstables There is nothing really that fundamentally ties the estimated histogram to sstables. This patch gets rid of the few incidental ties. They are: - the namespace name, which is now moved to utils. Users inside sstables/ now need to add a namespace prefix, while the ones outside have to change it to the right one - sstables::merge, which has a very non-descriptive name to begin with, is changed to a more descriptive name that can live inside utils/ - the disk_types.hh include has to be removed - but it had no reason to be here in the first place. Todo, is to actually move the file outside sstables/. That is done in a separate step for clarity. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2016-08-31 15:13:23 -04:00
Raphael S. Carvalho	d8be32d93a	api: use estimation of pending tasks in compaction manager too We have API for getting pending compaction tasks both in column family and compaction manager. Column family is already returning pending tasks properly. Compaction manager's one is used by 'nodetool compactionstats', and was returning a value which doesn't reflect pending compaction. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <a20b88938ad39e95f98bfd7f93e4d1666d1c6f95.1471641211.git.raphaelsc@scylladb.com>	2016-08-24 14:00:23 +03:00
Amnon Heiman	0706db9387	API: use the estimated sum when converting histogram to json The function that convert histogram to the json histogram object need to use the estimated_sum to get the actual sum and not the sampled sum. Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <1467547341-30438-3-git-send-email-amnon@scylladb.com>	2016-08-16 11:06:51 +03:00
Gleb Natapov	3531dd8d71	api: fix use after free in sum_sstable get_sstables_including_compacted_undeleted() may return temporary shared ptr which will be destroyed before the loop if not stored locally. Fixes #1514 Message-Id: <20160728100504.GD2502@scylladb.com>	2016-07-28 14:25:40 +03:00
Raphael S. Carvalho	e688fc9550	api: provide estimation of pending compaction Use compaction_strategy::estimated_pending_compaction() to provide user with an estimation of number of compaction for strategy to be fully satisfied. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <39b7d91f2525ca38fb2ce9d8885d0c2e727de7ed.1467667054.git.raphaelsc@scylladb.com>	2016-07-05 19:03:12 +03:00
Avi Kivity	e22517bafc	Merge "Optimize reads from leveled sstables" In a leveled column family, there can be many thousands of sstables, since each sstable is limited to a relatively small size (160M by default). With the current approach of reading from all sstables in parallel, cpu quickly becomes a bottleneck as we need to check the bloom filter for each of these sstables. This patch addresses the problem by introducing a compaction-strategy-specific data structure for holding sstables. This data structure has a method to obtain the sstables used for a read. For leveled compaction strategy, this data structure is an interval map, which can be efficiently used to select the right sstables.	2016-07-04 16:00:35 +03:00
Amnon Heiman	e66a1cd705	API: Add implementation for the scylla release version This adds the implementation to the scylla release version API. After this patch a call to: curl -X GET "http://localhost:10000/storage_service/scylla_release_version" Will return the current scylla release version. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-07-03 16:29:09 +03:00
Amnon Heiman	56ea8c943e	API: add scylla release version API This adds a definition to the scylla release version. The API already return the compatibility version (ie. the compatible origin version) This definition returns the scylla version, a call to the API should return the same result as running scylla --version. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-07-03 16:26:21 +03:00
Avi Kivity	2a46410f4a	Change sstable_list from a map to a set sstable_list is now a map<generation, sstable>; change it to a set in preparation for replacing it with sstable_set. The change simplifies a lot of code; the only casualty is the code that computes the highest generation number.	2016-07-03 10:26:57 +03:00
Amnon Heiman	a439a6b8d3	API: Add the collectd enable/disable implementation This adds the implementation to the enable and disable of the collectd metrics. An example for disabling all collectd metrics that has write in their type_instance part: curl -X POST --header "Content-Type: application/json" --header "Accept: application/json" "http://localhost:10000/collectd/.?instance=.&type=.&type_instance=.write.&enable=false" After that a call to: curl -X GET "http://localhost:10000/collectd/" Would return those metrics with the enable set to "false" An example to enable all the metrics in cache that their type starts with byt: curl -X POST --header "Content-Type: application/json" --header "Accept: application/json" "http://localhost:10000/collectd/cache?type=byt.&enable=true" Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <1466932139-19264-3-git-send-email-amnon@scylladb.com>	2016-06-26 12:26:50 +03:00
Amnon Heiman	4d7837af40	API Definition: collectd to support enable disable This adds to the definition of the collectd API the ability to turn on and off specific collectd metrics. For the GET end point a POST option was added that allow to enable or disable a metric. The general GET endpoint now returns the enable flag that indicates if the metric is enable. Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <1466932139-19264-2-git-send-email-amnon@scylladb.com>	2016-06-26 12:26:48 +03:00
Vlad Zolotarov	0611417c76	api::storage_service: add set_trace_probability/get_trace_probability Trace probability defines a probability for the next CQL command to be traced. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-06-06 15:44:28 +03:00
Amnon Heiman	d4123ba613	API: column_family count sstable space used correctly The space calculation counters in column family had two problem: 1. The total bytes is an ever growing counter, which is meaningless for the API. 2. Trying to simply sum the size on all shards, ignores the fact that the same sstable file can be referenced by multiple shards, this is especially noticeable during migration time. To solve this, the implementation was modified so instead of collecting the sizes, the API would collect a map of file name to size and then would do the summing. This removes the duplications and fixes the total bytes calculation Calling cfstats before the change with load after a compaction happend: $ nodetool cfstats keyspace1 Keyspace: keyspace1 Verify write latency 1068253.0 76435 Read Count: 75915 Read Latency: 0.5953986037015082 ms. Write Count: 76435 Write Latency: 0.013975966507490025 ms. Pending Flushes: 0 Table: standard1 SSTable count: 5 Space used (live): 44261215 Space used (total): 219724478 After the fix: $ nodetool cfstats keyspace1 Keyspace: keyspace1 Verify write latency 1863206.0 124219 Read Count: 125401 Read Latency: 0.9381053978835895 ms. Write Count: 124219 Write Latency: 0.01499936402643718 ms. Pending Flushes: 0 Table: standard1 SSTable count: 6 Space used (live): 50402904 Space used (total): 50402904 Space used by snapshots (total): 0 Fixes: #1042 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <1464518757-14666-2-git-send-email-amnon@scylladb.com>	2016-05-29 14:11:03 +03:00
Avi Kivity	9637c2232c	Merge "Move the JMX timer polling logic to Scylla" from Amnon	2016-05-24 13:07:52 +03:00
Pekka Enberg	94e7e61cd0	api: Register snitch API earlier Currently, we register snitch API in set_server_gossip_settle() which waits until a node has joined the cluster. This makes 'nodetool status' not properly show the status of a joining node. Fix the issue by registering snitch API earlier. Fixes #1269. Message-Id: <1463576381-15484-1-git-send-email-penberg@scylladb.com>	2016-05-20 14:24:14 +03:00
Amnon Heiman	ad34f80e6f	API: change cache_service, column_family and storage_proxy to rate object The API would expose now the rate_moving_average and rate_moving_average_and_histogram. The old end points remains for the transition period, but marked as depricated. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-05-17 11:56:52 +03:00
Amnon Heiman	b33ed48527	API Definition: change cache_service, column_family and storage_proxy to use rate objects This patch replaces the latency histogram to rate_moving_avrage_and_histogram and the counters to rate_moving_average. The old endpoints where left unchagned but marked as depricated when needed.	2016-05-17 11:55:06 +03:00
Amnon Heiman	20a48b0f20	API: column family stats break the map_reduce functionality This patch replaces the helper function for column family with two function, one that collect the relevant column family from all shareds and another one that do the translation to json object. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-05-17 11:53:15 +03:00
Amnon Heiman	f6a5a4e3da	API: Add helper function for the rate objects This patch adds the helper function that are used to sum the rate_moving_average and rate_moving_average_and_histogram. The current sum functionality for histogram was modified to support rate and histogram but return a histogram. This way current endpoints would continue to behave the same. It also cleans the histogram related method by using the plus operator in the histogram. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-05-17 11:49:34 +03:00
Amnon Heiman	7e07d97e4b	API utils: Adding rate moving avrage rate_moving_average and rate_moving_average_and_histogram are type that are used by the JMX. They are based on the yammer meter and timer and are used to collect derivative information. Specificlly: rate_moving_average calculate rates and rate_moving_average_and_histogram collect rates and histogram. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-05-16 11:40:19 +03:00
Tomasz Grabiec	88bb5fcb53	api: Fix error message Keyspace and table names are separated by a single colon. Message-Id: <1461600269-4070-1-git-send-email-tgrabiec@scylladb.com>	2016-04-26 08:40:28 +03:00
Asias He	1e84699a64	api: Wire up storage_service removal_status and force_remove_completion They are used by nodetool removenode: $ nodetool removenode force $ nodetool removenode status For example: $ nodetool removenode status RemovalStatus: Removing token (-8969872965815280276). Waiting for replication confirmation from [127.0.0.3,127.0.0.1]. $ nodetool removenode force RemovalStatus: No token removals in process. Tested with: 1) - start 3 nodes - inject data with cassandra-stress write no-warmup cl=TWO n=2000000 -schema 'replication(factor=2)' - kill -9 node2 - wait for node2 to be in DOWN state - run nodetool removenode host2_host_id on node1 2) - start 3 nodes - inject data with cassandra-stress write no-warmup cl=TWO n=2000000 -schema 'replication(factor=2)' - kill -9 node2 - wait for node2 to be in DOWN state - run nodetool removenode host2_host_id on node1 - kill -9 node3 - nodetool removenode will wait forever since node3 is gonne, node3 will never send the replication confirmation to node1 - run nodetool removenode force on node1 nodetool removenode completes with the following error: $ nodetool removenode 31690b82-ebb0-4594-8bcf-1ce82b6e0f6e nodetool: Scylla API server HTTP POST to URL '/storage_service/remove_node' failed: nodetool removenode force is called by user nodetool removenode force completes sucessfully $ nodetool removenode force RemovalStatus: Removing token (-9171569494049085776). Waiting for replication confirmation from [127.0.0.3,127.0.0.1]. Fixes 1135.	2016-04-13 14:53:28 +08:00
Asias He	891e947314	storage_service: Rename remove_node to removenode nodetool uses removenode command to remove a node. Rename the implementation in storage_service to match the command.	2016-04-13 14:53:28 +08:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Raphael S. Carvalho	e15ce5eb4d	api: Add support to get column family compression ratio After this change, user can query compression ratio on a per column family basis with 'nodetool cfstats'. look at 'nodetool cfstats' output: ./bin/nodetool cfstats ks.test5 Keyspace: ks Read Count: 0 Read Latency: NaN ms. Write Count: 0 Write Latency: NaN ms. Pending Flushes: 0 Table: test5 SSTable count: 1 Space used (live): 4774 Space used (total): 4774 Space used by snapshots (total): 0 Off heap memory used (total): 131384 SSTable Compression Ratio: 0.833333 ... Fixes #636. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <a1bee5a23fe63787df3e387a88f2d216ba4a4134.1459802771.git.raphaelsc@scylladb.com>	2016-04-05 12:46:40 +03:00
Amnon Heiman	725231a7a0	api: set the api_doc before registering any api This is a left over from the re ordering of the API init. The api_doc should be set first, so later API registration will enable their relevent swagger doc. Currently, the swagger documentation of the system API is not available. Fixes #1160 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <1459750490-15996-1-git-send-email-amnon@scylladb.com>	2016-04-04 11:37:59 +03:00
Vlad Zolotarov	2cd836a02e	api::set_storage_service(): fix the 'nodetool enablebackup' API 'nodetool enable/disablebackup' callback was modifying only the existing keyspaces and column families configurations. However new keyspaces/column families were using the original 'incremental_backups' configuration value which could be different from the value configured by 'nodetool enable/disablebackup' user command. This patch updates the database::_enable_incremental_backups per-shard value in addition to updating the existing keyspaces and column families configurations. Fixes #845 Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-03-06 17:26:31 +02:00
Paweł Dziepak	5396042f06	api: use proper describe_schema_versions implementation Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-03-02 12:49:55 +00:00
Asias He	fd5f3cff47	streaming: Fix stream_manager progress api For each stream_session, we pretend we are sending/receiving one file, to make it compatible with nodetool. For receiving_files, the file name is "rxnofile". For sending_files, the file name is "txnofile". stream_manager::update_all_progress_info is introduced to update the progress info of all the stream_sessions in the node. We need this because streaming mutations are received on all the cores, but the stream_session object is only on one of the cores. It adds overhead if we update progress info in stream_session object whenever we receive a streaming mutation. So, what we do now is when we really need the progress info, we update the progress info in stream_session object. With http://127.0.0.$i:10000/stream_manager/, it looks like below when decommission node 3 in a 3 nodes cluster. =========== GET NODE 1 [{"plan_id": "935a2cc0-dc6b-11e5-bdbf-000000000000", "description": "Unbootstrap", "sessions": [{"receiving_files": [{"value": {"direction": "IN", "file_name": "rxnofile", "session_index": 0, "total_bytes": 16876296, "peer": "127.0.0.3", "current_bytes": 16876296}, "key": "rxnofile"}], "receiving_summaries": [{"files": 1, "total_size": 0, "cf_id": "869d8630-dc6b-11e5-bdbf-000000000000"}], "session_index": 0, "state": "PREPARING", "connecting": "127.0.0.3", "peer": "127.0.0.3"}]}] =========== GET NODE 2 [{"plan_id": "935a2cc0-dc6b-11e5-bdbf-000000000000", "description": "Unbootstrap", "sessions": [{"receiving_files": [{"value": {"direction": "IN", "file_name": "rxnofile", "session_index": 0, "total_bytes": 16755552, "peer": "127.0.0.3", "current_bytes": 16755552}, "key": "rxnofile"}], "receiving_summaries": [{"files": 1, "total_size": 0, "cf_id": "869d8630-dc6b-11e5-bdbf-000000000000"}], "session_index": 0, "state": "PREPARING", "connecting": "127.0.0.3", "peer": "127.0.0.3"}]}] =========== GET NODE 3 [{"plan_id": "935a2cc0-dc6b-11e5-bdbf-000000000000", "description": "Unbootstrap", "sessions": [{"sending_files": [{"value": {"direction": "OUT", "file_name": "txnofile", "session_index": 0, "total_bytes": 16876296, "peer": "127.0.0.1", "current_bytes": 16876296}, "key": "txnofile"}], "sending_summaries": [{"files": 1, "total_size": 0, "cf_id": "869d8630-dc6b-11e5-bdbf-000000000000"}], "session_index": 0, "state": "PREPARING", "connecting": "127.0.0.1", "peer": "127.0.0.1"},{"sending_files": [{"value": {"direction": "OUT", "file_name": "txnofile", "session_index": 0, "total_bytes": 16755552, "peer": "127.0.0.2", "current_bytes": 16755552}, "key": "txnofile"}], "sending_summaries": [{"files": 1, "total_size": 0, "cf_id": "869d8630-dc6b-11e5-bdbf-000000000000"}], "session_index": 0, "state": "PREPARING", "connecting": "127.0.0.2", "peer": "127.0.0.2"}]}]	2016-02-26 17:38:37 +08:00
Nadav Har'El	f9ee74f56f	repair: options for repairing only a subrange To implement nodetool's "--start-token"/"--end-token" feature, we need to be able to repair only part of the ranges held by this node. Our REST API already had a "ranges" option where the tool can list the specific ranges to repair, but using this interface in the JMX implementation is inconvenient, because it requires the Java code to be able to intersect the given start/end token range with the actual ranges held by the repaired node. A more reasonable approach, which this patch uses, is to add new "startToken"/"endToken" options to the repair's REST API. What these options do is is to find the node's token ranges as usual, and only then intersect them with the user-specified token range. The JMX implementation becomes much simpler (in a separate patch for scylla-jmx) and the real work is done in the C++ code, where it belongs, not in Java code. With the additional scylla-jmx patch to use the new REST API options provided here, this fixes #917. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <1455807739-25581-1-git-send-email-nyh@scylladb.com>	2016-02-18 17:13:56 +02:00
Raphael S. Carvalho	a53cfc8127	compaction manager: add support to wait for termination of cleanup 'nodetool cleanup' must wait for termination of cleanup, however, cleanup is handled asynchronously. To solve that, a mechanism is added here to wait for termination of a cleanup. This mechanism is about using promise to notificate waiter of cleanup completion. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <6dc0a39170f3f51487fb8858eb443573548d8bce.1455655016.git.raphaelsc@scylladb.com>	2016-02-18 17:01:18 +02:00
Asias He	d146045bc5	Revert "Revert "streaming: Send mutations on all shards"" This brings back streaming on all shards. The bug in locator/abstract_replication_strategy is now fixed. This reverts commit `9f3061ade8`. Message-Id: <a79ce9cdd6f4af1c6088b89e1911b4b2ed1c10ae.1455589460.git.asias@scylladb.com>	2016-02-16 11:16:51 +02:00
Avi Kivity	9f3061ade8	Revert "streaming: Send mutations on all shards" This reverts commit `31d439213c`. Fixes #894. Conflicts: streaming/stream_manager.cc (may have undone part of `63a5aa6122`)	2016-02-09 18:26:14 +02:00
Asias He	31d439213c	streaming: Send mutations on all shards Currently, only the shard where the stream_plan is created on will send streaing mutations. To utilize all the available cores, we can make each shard send mutations which it is responsbile for. On the receiver side, we do not forward the mutations to the shard where the stream_session is created, so that we can avoid unnecessary forwarding. Note: the downside is that it is now harder to: 1) to track number of bytes sent and received 2) to update the keep alive timer upon receive of the STREAM_MUTATION To fix, we now store the sent/recieved bytes info on all shards. When the keep alive timer expires, we check if any progress has been made. Hopefully, this patch will make the streaming much faster and in turn make the repair/decommission/adding a node faster. Refs: https://github.com/scylladb/scylla/issues/849 Tested with decommission/repair dtest. Message-Id: <96b419ab11b736a297edd54a0b455ffdc2511ac5.1454645370.git.asias@scylladb.com>	2016-02-07 10:57:51 +02:00

1 2 3 4 5 ...

336 Commits