scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-01 04:26:48 +00:00

Author	SHA1	Message	Date
Pavel Emelyanov	998f51579a	storage_service: Rip join_ring config option The option in question apparently does not work, several sharded objects are start()-ed (and thus instanciated) in join_roken_ring, while instances themselves of these objects are used during init of other stuff. This leads to broken seastar local_is_initialized assertion on sys_dist_ks, but reading the code shows more examples, e.g. the auth_service is started on join, but is used for thrift and cql servers initialization. The suggestion is to remove the option instead of fixing. The is_joined logic is kept since on-start joining still can take some time and it's safer to report real status from the API. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20191203140717.14521-1-xemul@scylladb.com>	2019-12-18 12:45:13 +02:00
Amnon Heiman	f43285f39a	api: replace swagger definition to use long instead of int (#5380 ) In swagger 1.2 int is defined as int32. We originally used int following the jmx definition, in practice internally we use uint and int64 in many places. While the API format the type correctly, an external system that uses swagger-based code generator can face a type issue problem. This patch replace all use of int in a return type with long that is defined as int64. Changing the return type, have no impact on the system, but it does help external systems that use code generator from swagger. Fixes #5347 Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2019-12-11 12:48:29 +02:00
Glauber Costa	73aff1fc95	api: export system uptime via REST This will be useful for tools like nodetool that want to query the uptime of the system. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20190619110850.14206-1-glauber@scylladb.com>	2019-11-20 16:44:11 +02:00
Piotr Dulikowski	48f7b2e4fb	table: move out table::stats to table_stats This change was done in order to be able to forward-declare the table::stats structure.	2019-11-12 13:35:41 +01:00
Vladimir Davydov	e510288b6f	api: wire up column_family cas-related statistics	2019-10-29 19:26:18 +03:00
Vladimir Davydov	21c3c98e5b	api: wire up storage_proxy cas-related statistics	2019-10-29 19:26:18 +03:00
Asias He	f876580740	storage_service: Reject nodetool cleanup when there is pending ranges From Shlomi: 4 node cluster Node A, B, C, D (Node A: seed) cassandra-stress write n=10000000 -pop seq=1..10000000 -node <seed-node> cassandra-stress read duration=10h -pop seq=1..10000000 -node <seed-node> while read is progressing Node D: nodetool decommission Node A: nodetool status node - wait for UL Node A: nodetool cleanup (while decommission progresses) I get the error on c-s once decommission ends java.io.IOException: Operation x0 on key(s) [383633374d31504b5030]: Data returned was not validated The problem is when a node gets new ranges, e.g, the bootstrapping node, the existing nodes after a node is removed or decommissioned, nodetool cleanup will remove data within the new ranges which the node just gets from other nodes. To fix, we should reject the nodetool cleanup when there is pending ranges on that node. Note, rejecting nodetool cleanup is not a full protection because new ranges can be assigned to the node while cleanup is still in progress. However, it is a good start to reject until we have full protection solution. Refs: #5045	2019-10-23 19:20:36 +08:00
Vladimir Davydov	e8bcb34ed4	api: drop /storage_proxy/metrics/cas_read/condition_not_met There's no such metric in Cassandra (although Cassadra's docs mistakenly say it exists). Having it would make no sense anyway so let's drop it. Message-Id: <b4f7a6ad278235c443cb8ea740bfa6399f8e4ee1.1570434332.git.vdavydov@scylladb.com>	2019-10-07 16:54:39 +03:00
Nadav Har'El	6c4ad93296	api/compaction_manager: do not hold map on the stack Merged patch series by Amnon Heiman: This patch fixes a bug that a map is held on the stack and then is used by a future. Instead, the map is now moved to the relevant lambda function. Fixes #4824	2019-09-01 13:16:34 +03:00
Amnon Heiman	2d3185fa7d	column_family.cc: remove unhandle future The sum_ratio struct is a helper struct that is used when calculating ratio over multiple shards. Originally it was created thinking that it may need to use future, in practice it was never used and the future was ignore. This patch remove the future from the implementation and reduce an unhandle future warning from the compilation. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2019-08-25 16:51:14 +03:00
Amnon Heiman	21dee3d8ef	API:column_family.cc Add get_build_index implmentation This Patch adds an implementation of the get build index API and remove a FIXME. The API returns the list of the built secondary indexes belongs to a column family. Example: CREATE KEYSPACE scylla_demo WITH replication = {'class': 'SimpleStrategy', 'replication_factor': '1'}; CREATE TABLE scylla_demo.mytableID ( uid uuid, text text, time timeuuid, PRIMARY KEY (uid, time) ); CREATE index on scylla_demo.mytableID (time); $ curl -X GET 'http://localhost:10000/column_family/built_indexes/scylla_demo%3Amytableid' ["mytableid_time_idx"] Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2019-08-25 16:46:49 +03:00
Pekka Enberg	d0eecbf3bb	api/storage_proxy: Wire up hinted-handoff status to API We support hinted-handoff now, so let's return it's status via the API. Message-Id: <20190819080006.18070-1-penberg@scylladb.com>	2019-08-20 00:24:50 +02:00
Amnon Heiman	6a0490c419	api/compaction_manager: indentation	2019-08-12 14:04:40 +03:00
Amnon Heiman	8181601f0e	api/compaction_manager: do not hold map on the stack This patch fixes a bug that a map is held on the stack and then is used by a future. Instead, the map is now wrapped with do_with. Fixes #4824 Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2019-08-12 14:04:00 +03:00
Calle Wilund	298da3fc4b	api/storage_service: Add "sstable_info" command Assembles information and attributes of sstables in one or more column families. v2: * Use (not really legal) nested "type" in json * Rename "table" param to "cf" for consistency * Some comments on data sizes * Stream result to avoid huge string allocations on final json	2019-08-06 08:14:15 +00:00
Amnon Heiman	1c6dec139f	API: compaction_manager add get pending tasks by table The pending tasks by table name API return an array of pending tasks by keyspace/table names. After this patch the following command would work: curl -X GET 'http://localhost:10000/compaction_manager/metrics/pending_tasks_by_table' Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2019-07-12 19:21:26 +03:00
Calle Wilund	4ef940169f	Replace use of "ipv4_addr" with socket_address Allows the various sockets to use ipv6 address binding if so configured.	2019-07-08 14:13:09 +00:00
Paweł Dziepak	8a13d96203	api/column_family: work around gcc9 bug in seastar::future<std::any> There is a gcc9 bug https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90415 that makes it impossible to pass std::any through a seastar::future<T>. Fortunately, there is only one user of seastar::future<std::any> in Scylla and it is not performance-critical. This patch avoids the gcc9 bug by using seastar::future<std::unique_ptr<std::any>>.	2019-06-17 13:06:28 +01:00
Calle Wilund	26702612f3	api.hh: Fix bool parsing in req_param Fixes #4525 req_param uses boost::lexical cast to convert text->var. However, lexical_cast does not handle textual booleans, thus param=true causes not only wrong values, but exceptions. Message-Id: <20190610140511.15478-1-calle@scylladb.com>	2019-06-10 17:11:47 +03:00
Amnon Heiman	f3b6c5fe2f	API: storage_proxy add CAS and View endpoints Some nodetool command in 3.0 uses the CAS and View metrics. CAS is not implemented and we don't have all the metrics for View but we still don't want those nodetool commands to fail. After this patch the following would work and will return empty: curl -X GET --header 'Accept: application/json' 'http://localhost:10000/storage_proxy/metrics/cas_read/moving_average_histogram' curl -X GET --header 'Accept: application/json' 'http://localhost:10000/storage_proxy/metrics/view_write/moving_average_histogram' curl -X GET --header 'Accept: application/json' 'http://localhost:10000/storage_proxy/metrics/cas_write/moving_average_histogram' This patch is needed for #4416 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <20190521141235.20856-1-amnon@scylladb.com>	2019-05-22 14:25:17 +03:00
Avi Kivity	b663cd1765	api: config: stop using _make_config_values Now that named_value::value_as_json() exists, make use of it to report the current value of a configuration variable via the REST API, instead of _make_config_values().	2019-04-23 16:29:03 +03:00
Benny Halevy	3749148339	storage_service: fix handling of load_new_sstables exception ignore_ready_future in load_new_ss_tables broke migration_test:TestMigration_with_*.migrate_sstable_with_counter_test_expect_fail dtests. The java.io.NotSerializableException in nodetool was caused by exceptions that were too long. This fix prints the problematic file names onto the node system log and includes the casue in the resulting exception so to provide the user with information about the nature of the error. Fixes #4375 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190331154006.12808-1-bhalevy@scylladb.com>	2019-04-02 11:46:19 +03:00
Benny Halevy	956cb2e61c	storage_service: handle load_new_sstables exception Refs #3117 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-03-28 14:54:56 +02:00
Calle Wilund	ef1bdebd0a	api::storage_service: Implement "scrub"	2019-03-06 13:13:21 +00:00
Calle Wilund	23f4c982ea	api/storage_service: Implement "upgradesstables" Fixes #4245 Implemented as a compation barrier (forcing previous compactions to finish) + parameterized "cleanup", with sstable list based on parameters.	2019-03-06 13:13:21 +00:00
Calle Wilund	3b5588dddd	api::storage_service: Add keyspace + tables helper To avoid repeating code to get keyspace + tables	2019-03-06 13:13:21 +00:00
Amnon Heiman	6c7742d616	system_keyspace, api: stream get_compaction_history get_compaciton_history can return big chunk of data. To prevent large memory allocation, the get_compaction_history now read each compaction_history record and use the http stream to send it. Fixes #4152 Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2019-02-05 11:14:53 +02:00
Avi Kivity	32e79fc23b	api/commitlog: de-template acquire_cl_metric() Use std::function instead of a template parameter. Likely doesn't gain anyting, because the template was always instantiated with the same type (the result of std::bind() with the same signatures), but still good practice. std::function was used instead of noncopyable_function because sharded::map_reduce0() copies the input function.	2019-01-20 11:58:39 +02:00
Avi Kivity	6e6372e8d2	Revert "Merge "Type-eaese gratuitous templates with functions" from Avi" This reverts commit `31c6a794e9`, reversing changes made to `4537ec7426`. It causes bad_function_calls in some situations: INFO 2019-01-20 01:41:12,164 [shard 0] database - Keyspace system: Reading CF sstable_activity id=5a1ff267-ace0-3f12-8563-cfae6103c65e version=d69820df-9d03-3cd0-91b0-c078c030b708 INFO 2019-01-20 01:41:13,952 [shard 0] legacy_schema_migrator - Moving 0 keyspaces from legacy schema tables to the new schema keyspace (system_schema) INFO 2019-01-20 01:41:13,958 [shard 0] legacy_schema_migrator - Dropping legacy schema tables INFO 2019-01-20 01:41:14,702 [shard 0] legacy_schema_migrator - Completed migration of legacy schema tables ERROR 2019-01-20 01:41:14,999 [shard 0] seastar - Exiting on unhandled exception: std::bad_function_call (bad_function_call)	2019-01-20 11:32:14 +02:00
Avi Kivity	08bd28942b	api/commitlog: de-template acquire_cl_metric() Use noncopyable_function instead of a template parameter. Likely doesn't gain anyting, because the template was always instantiated with the same type (the result of std::bind() with the same signatures), but still good practice.	2019-01-17 18:45:14 +02:00
Rafael Ávila de Espíndola	26ac2c23ef	Change _row_ names that refer to partitions This renames some variables and functions to make it clear that they refer to partitions and not rows. Old versions of sstablemetadata used to refer to a row histogram, but current versions now mention a partition histogram instead. This patch doesn't change the exposed API names. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20181229223311.4184-2-espindola@scylladb.com>	2019-01-09 14:53:42 +02:00
Avi Kivity	f02c64cadf	streaming: stream_session: remove include of db/view/view_update_from_staging_generator.hh This header, which is easily replaced with a forward declaration, introduces a dependency on database.hh everywhere. Remove it and scatter includes of database.hh in source files that really need it.	2019-01-05 17:33:25 +02:00
Tomasz Grabiec	7747f2dde3	Merge "nodetool toppartitions" from Rafi & Avi Implementation of nodetool toppartiotion query, which samples most frequest PKs in read/write operation over a period of time. Content: - data_listener classes: mechanism that interfaces with mutation readers in database and table classes, - toppartition_query and toppartition_data_listener classes to implement toppartition-specific query (this interfaces with data_listeners and the REST api), - REST api for toppartitions query. Uses Top-k structure for handling stream summary statistics (based on implementation in C, see #2811). What's still missing: - JMX interface to nodetool (interface customization may be required), - Querying #rows and #bytes (currently, only #partitions is supported). Fixes #2811 https://github.com/avikivity/scylla rafie_toppartitions_v7.1: top_k: whitespace and minor fixes top_k: map template arguments top_k: std::list -> chunked_vector top_k: support for appending top_k results nodetool toppartitions: refactor table::config constructor nodetool toppartitions: data listeners nodetool toppartitions: add data_listeners to database/table nodetool toppartitions: fully_qualified_cf_name nodetool toppartitions: Toppartitions query implementation nodetool toppartitions: Toppartitions query REST API nodetool toppartitions: nodetool-toppartitions script	2018-12-28 16:31:24 +01:00
Rafi Einstein	197f38d4ee	nodetool toppartitions: Toppartitions query REST API A HTTP GET operation starts the query (with args: ks/cf name and duration in ms). It executes synchroneously, results are returned as JSON: $ curl -s -X GET http://localhost:10000/column_family/toppartitions/ks:cf1?duration=10000 \| jq { "read": [ { "count": "15", "error": "0", "partition": "4b504d39354f37353131" }, { "count": "15", "error": "0", "partition": "3738313134394d353530" } ], "write": [ { "count": "15", "error": "0", "partition": "4b504d39354f37353131" }, { "count": "15", "error": "0", "partition": "3738313134394d353530" } ] } Signed-off-by: Rafi Einstein <rafie@scylladb.com>	2018-12-28 16:45:57 +02:00
Rafi Einstein	404f75def5	nodetool toppartitions: fully_qualified_cf_name Encapsulate keyspace:column_family REST API argument parsing into fully_qualified_cf_name class. Signed-off-by: Rafi Einstein <rafie@scylladb.com>	2018-12-28 16:45:57 +02:00
Botond Dénes	1865e5da41	treewide: remove include database.hh from headers where possible Many headers don't really need to include database.hh, the include can be replaced by forward declarations and/or including the actually needed headers directly. Some headers don't need this include at all. Each header was verified to be compilable on its own after the change, by including it into an empty `.cc` file and compiling it. `.cc` files that used to get `database.hh` through headers that no longer include it were changed to include it themselves.	2018-12-14 08:03:57 +02:00
Raphael S. Carvalho	953fdcc867	sstables: store cf pointer in compaction_info motivation is that we need a more efficient way to find compactions that belong to a given column family in compaction list. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2018-11-24 18:53:28 -02:00
Avi Kivity	775b7e41f4	Update seastar submodule * seastar d59fcef...b924495 (2): > build: Fix protobuf generation rules > Merge "Restructure files" from Jesse Includes fixup patch from Jesse: " Update Seastar `#include`s to reflect restructure All Seastar header files are now prefixed with "seastar" and the configure script reflects the new locations of files. Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com> Message-Id: <5d22d964a7735696fb6bb7606ed88f35dde31413.1542731639.git.jhaberku@scylladb.com> "	2018-11-21 00:01:44 +02:00
Amnon Heiman	25378916bc	API: colummn_family.hh yield in map_reduce_column_families_locally map_reduce_column_families_locally iterate over all tables (column family) in a shard. If the number of tables is big it can cause latency spikes. This patch replaces the current loop with a do_for_each allowing preepmtion inside the loop. Fixes #3886 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <20181115154825.23430-1-amnon@scylladb.com>	2018-11-15 18:58:23 +02:00
Avi Kivity	da17c29bd3	api: convert sprint() to format() sprint() recently became more strict, throwing on sprint("%s", 5). Replace with the more modern format(). Mechanically converted with https://github.com/avikivity/unsprint.	2018-11-01 13:16:17 +00:00
Glauber Costa	98332de268	api: use longs instead of ints for snapshot sizes Int types in json will be serialized to int types in C++. They will then only be able to handle 4GB, and we tend to store more data than that. Without this patch, listsnapshots is broken in all versions. Fixes: #3845 Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20181012155902.7573-1-glauber@scylladb.com>	2018-10-12 21:17:24 +03:00
Amnon Heiman	ab207356a5	API: storage_service stream endpoints This patch changes how list of tokens returned from the storage_service API. Instead of create a vector and construct a json object of it, use the streaming capabilities of the http. This is important for large cluster and prevent large allocations. Fixes #3701 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <20180820195631.26792-1-amnon@scylladb.com>	2018-08-22 11:24:38 +03:00
Asias He	4a0b561376	storage_service: Get rid of moving operation The moving operation changes a node's token to a new token. It is supported only when a node has one token. The legacy moving operation is useful in the early days before the vnode is introduced where a node has only one token. I don't think it is useful anymore. In the future, we might support adjusting the number of vnodes to reblance the token range each node owns. Removing it simplifies the cluster operation logic and code. Fixes #3475 Message-Id: <144d3bea4140eda550770b866ec30e961933401d.1533111227.git.asias@scylladb.com>	2018-08-01 11:18:17 +03:00
Amnon Heiman	8fbc6a22fb	Add the API implementation to get_sstables_by_key Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2018-06-10 16:13:01 +03:00
Amnon Heiman	cc5601d000	api: column_family.json make the get_sstables_for_key doc clearer This patch makes it clearer that the key that get_sstables_for_key refers to, is a partition key.	2018-06-10 16:13:01 +03:00
Vladimir Krivopalov	3e471116b4	Separate statistics for count of cells, columns and rows in column_stats. SSTables 3.0 format makes a distinction between count of cells and count of columns. In that sense, a column of a collection type counts as one column but every atomic cell in it counts as a separate cell. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-05-03 17:05:06 -07:00
Avi Kivity	6c35db2c44	api: type-erase all-column_family map_reduce variant Encapsulate the map_reduce parameters in type-erased std::function, as well as the iterator-on-all-column-families logic. Reduces binary size by 18%.	2018-04-03 13:08:22 +03:00
Avi Kivity	0ade558999	api: simplify 6-argument map_reduce_cf() variant The 6-argument map_reduce_cf function is identical to the 5-argument version, except that it applies performs an extra cast (by calling the 6th argument's operator=()). Simplify the code by calling the 5-argument version from the 6-argument version. Reduces binary size by ~10%.	2018-04-03 12:22:14 +03:00
Avi Kivity	cadd983856	api: type-erase map_reduce_cf() map_reduce_cf() is called with varying template parameters which each have to be compiled separately. Unifying the internals to use types based on std::any reduced the object size by 15% (115MB->99MB) with presumably a commensurate decrease in compile time. A version that used "I" instead of "std::any" (and thus merged the internals only for callers that used the same result type) delivered a 10% decrease in object size. While std::any is less safe, in this case it is completely encapsulated. Message-Id: <20180402213732.432-1-avi@scylladb.com>	2018-04-03 09:31:04 +01:00
Avi Kivity	4419e60207	Merge "Add a confiugration API" from Amnon " The configuration API is part of scylla v2 configuration. It uses the new definition capabilities of the API to dynamically create the swagger definition for the configuration. This mean that the swagger will contain an entry with description and type for each of the config value. To get the v2 of the swager file: http://localhost:10000/v2 If using with swagger ui, change http://localhost:10000/api-doc to http://localhost:10000/v2 It takes longer to load because the file is much bigger now. " * 'amnon/config_api_v5' of github.com:scylladb/seastar-dev: Explanation about the API V2 API: add the config API as part of the v2 API. Defining the config api	2018-03-28 12:45:17 +03:00

1 2 3 4 5 ...

410 Commits