scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-28 04:06:59 +00:00

Author	SHA1	Message	Date
Nadav Har'El	c9345d8a0e	alternator: automatically choose RF: 1 or 3 In CQL, before a user can create a table, they must create a keyspace to contain this table and, among other things, specify this keyspace's RF. But in the DynamoDB API, there is no "create keyspace" operation - the user just creates a table, and there is no way, and no opportunity, to specify the requested RF. Presumably, Amazon always uses the same RF for all tables, most likely 3, although this is not officially documented anywhere. The existing code creates the keyspace during Scylla boot, with RF=1. This RF=1 always works, and is a good choice for a one-node test run, but was a really bad choice for a real cluster with multiple nodes, so this patch fixes this choice: With this patch, the keyspace creation is delayed - it doesn't happen when the first node of the cluster boots, but only when the user creates the first table. Presumably, at that time, the cluster is already up, so at that point we can make the obvious choice automatically: a one-node cluster will get RF=1, a >=3 node cluster will get RF=3. The choice of RF is logged - and the choice of RF=1 is considered a warning. Note that with this patch, keyspace creation is still automatic as it was before. The user may manually create the keyspace via CQL, to override this automatic choice. In the future we may also add additional keyspace configuration options via configuration flags or new REST requests, and the keyspace management code will also likely change as we start to support clusters with multiple regions and global tables. But for now, I think the automatic method is easiest for users who want to test-drive Alternator without reading lengthy instructions on how to set up the keyspace. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190820180610.5341-1-nyh@scylladb.com>	2019-08-20 21:24:01 +03:00
Piotr Sarna	587b38cd69	alternator-test: add a test for wrong BEGINS_WITH target type The test ensures that passing a non-compatible type to BEGINS WITH, e.g. a number, results in a validation error. Tested both locally and remotely. Message-Id: <894a10d3da710d97633dd12b6ac54edccc18be82.1566291989.git.sarna@scylladb.com>	2019-08-20 14:52:22 +03:00
Piotr Sarna	7d68d5030d	alternator: replace is_byte_order_compatible in BEGINS WITH Checking if the type is byte-order compatible is more than enough for BEGINS WITH operator - actually, we just need to check if the type is compatible with a string. Message-Id: <27a867cc1fa907ff87e011914e4acbb4f7db0181.1566225556.git.sarna@scylladb.com>	2019-08-19 17:43:12 +03:00
Nadav Har'El	c49e009e3e	alternator: use empty_service_permit() In the new code, write and read queries take a "service permit" which they hold for the duration of the query, to help limit the load on the machine. Alternator doesn't yet participate in this feature, so for now let's just use empty_service_permit() meaning the queries don't hold on to any permit. We can fix this later to use real permits. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2019-08-19 17:12:08 +03:00
Nadav Har'El	eebb2f0a0f	alternator: add to CreateTable verification of BillingMode setting We allow BillingMode to be set to either PAY_PER_REQUEST (the default) or PROVISIONED, although neither mode is fully implemented: In the former case the payment isn't accounted, and in the latter case the throughput limits are not enforced. But other settings for BillingMode are now refused, and we add a new test to verify that. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190818122919.8431-1-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Nadav Har'El	fd10eee1ae	alternator-test: require a new-enough boto library The alternator tests want to exercise many of the DynamoDB API features, so they need a recent enough version of the client libraries, boto3 and botocore. In particular, only in botocore 1.12.54, released a year ago, was support for BillingMode added - and we rely on this to create pay-per-request tables for our tests. Instead of letting the user run with an old version of this library and get dozens of mysterious errors, in this patch we add a test to conftest.py which cleanly aborts the test if the libraries aren't new enough, and recommends a "pip" command to upgrade these libraries. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190819121831.26101-1-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Nadav Har'El	28b3819c23	alternator-test: exhaustive tests for DescribeTable operation The DescribeTable operation is currently implemented to return the minimal information that libraries and applications usually need from it, namely verifying that some table exists. However, this operation is actually supposed to return a lot more information fields (e.g., the size of the table, its creation date, and more) which we currently don't return. This patch adds a new test file, test_describe_table.py, testing all these additional attributes that DescribeTable is supposed to return. Several of the tests are marked xfail (expected to fail) because we did not implement these attributes yet. The test is exhaustive except for attributes that have to do with four major features which will be tested together with these features: GSI, LSI, streams (CDC), and backup/restore. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190816132546.2764-1-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Nadav Har'El	656f62722b	alternator: enable timeouts on requests Currently Alternator starts all Scylla requests (including both reads and writes) without any timeout set. Because of bugs and/or network problems, requests can theoretically hang and waste Scylla resources for hours, long after the client has given up on them and closed their connection. The DynamoDB protocol doesn't let a user specify which timeout to use, so we should just use something "reasonable", in this patch 10 seconds. Remember that all DynamoDB read and write requests are small (even scans just scan a small piece), so 10 seconds should be above and beyond anything we actually expect to see in practice. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190812105132.18651-1-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Nadav Har'El	ecb571d7e3	alternator: add "--alternator-address" configuration parameter So far we had the "--alternator-port" option for configuring the port on which the Alternator server listens, but the server always listened on every address. It is important to also be able to configure the listen address - it is useful in tests running several instances of Scylla on the same machine, and useful in multi-homed machines with several interfaces. So this patch adds the "--alternator-address" option, defaulting to 0.0.0.0 (to listen on all interfaces). It works like the many other "--*-address" options that Scylla already has. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190808204641.28648-1-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Nadav Har'El	dd4638d499	alternator: docs/alternator.md more about filtering support Give more details about what is, and what isn't, currently supported in filtering of Scan (and Query) results. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190811094425.30951-1-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Piotr Sarna	aaf559c4f9	alternator: fix indentation It turns out that recent rjson patches introduced some buggy tabs instead of spaces due to bad IDE configuration. The indentation is restored to spaces.	2019-08-19 15:49:52 +03:00
Piotr Sarna	f7d0ca3c92	alternator-test: add QueryFilter validation cases QueryFilter validation was lately supplemented with non-key column checks, which is hereby tested.	2019-08-19 15:49:52 +03:00
Piotr Sarna	8394225741	alternator-test: add scan case for key equality filtering With key equality filtering enabled, a test case for scanning is provided.	2019-08-19 15:49:52 +03:00
Piotr Sarna	091b1b40c2	alternator: add filtering for key equality Until now, filtering in alternator was possible only for non-key column equality relations. This commit adds support for equality relations for key columns.	2019-08-19 15:49:52 +03:00
Piotr Sarna	b914ba11fa	alternator: add validation to QueryFilter QueryFilter, according to docs, can only contain non-key attributes.	2019-08-19 15:49:52 +03:00
Piotr Sarna	1b2b2c7009	alternator: add computing key bounds from filtering Alternator allows passing hash and sort key restrictions as filters - it is, however, better to incorporate these restrictions directly into partition and clustering ranges, if possible. It's also necessary, as optimizations inside restrictions_filter assume that it will not be fed unneeded rows - e.g. if filtering is not needed on partition key restrictions, they will not be checked.	2019-08-19 15:49:52 +03:00
Piotr Sarna	188c6a552a	alternator: extract getting key value subfunction Currently the only utility function for getting key bytes from JSON was to parse a document with the following format: "key_column_name" : { "key_column_type" : VALUE }. However, it's also useful to parse only the inner document, i.e.: { "key_column_type" : VALUE }.	2019-08-19 15:49:52 +03:00
Piotr Sarna	b8964ab0ba	alternator: make make_map_element_restriction static The function has no outside users and thus does not need to be exposed.	2019-08-19 15:49:52 +03:00
Piotr Sarna	c4fd846dbb	alternator: register filtering metrics Three metrics related to filtering are added to alternator: - total rows read during filtering operations - rows read and matched by filtering - rows read and dropped by filtering	2019-08-19 15:49:52 +03:00
Piotr Sarna	338b7e9e67	alternator: add bumping filtering stats When filtering is used in querying or scanning, the number of total filtered rows is added to stats.	2019-08-19 15:49:52 +03:00
Piotr Sarna	154d1649c6	alternator: add cql_stats to alternator stats Some underlying operations (e.g. paging) make use of cql_stats structure from CQL3. As such, cql_stats structure is added to alternator stats in order to gather and use these statistics.	2019-08-19 15:49:52 +03:00
Piotr Sarna	5620a46024	alternator: fix a comment typo s/Miscellenous/Miscellaneous/g	2019-08-19 15:49:52 +03:00
Piotr Sarna	fc9744791c	alternator: register read-before-write stats Read-before-write stat counters were already introduced, but the metrics need to be added to a metric group as well in order to be available for users.	2019-08-19 15:49:52 +03:00
Nadav Har'El	e0b01a0233	alternator: initial support for GSI This patch adds partial support for GSI (Global Secondary Index) in Alternator, implemented using a materialized view in Scylla. This initial version only supports the specific cases of the index indexing a column which was already part of the base table's key - e.g., indexing what used to be a sort key (clustering key) in the base table. Indexing of non-key attributes (which today live in a map) is not yet supported in this version. Creation of a table with GSIs is supported, and so is deleting the table. UpdateTable which adds a GSI to an existing table is not yet supported. Query and Scan operations on the index are supported. DescribeTable does not yet list the GSIs as it should. Seven previously-failing tests now pass, so their "xfail" tag is removed. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190808090256.12374-1-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Piotr Sarna	b3bf2fab2e	alternator: add stats for read-before-write A simple metric counting how many read-before-writes were executed is added. Message-Id: <d8cc1e9d77e832bbdeff8202a9f792ceb4f1e274.1565274797.git.sarna@scylladb.com>	2019-08-19 15:49:52 +03:00
Piotr Sarna	05b895ca84	alternator: complement rjson.hh comments Some comments in rjson.hh header file were not clear and are hereby amended. Message-Id: <7fa4e2cf39b95c176af31fe66f404a6a51a25bec.1565275276.git.sarna@scylladb.com>	2019-08-19 15:49:52 +03:00
Piotr Sarna	6b145b59d3	alternator: remove missing key FIXME The case for missing key in update_item was already properly fixed along with migrating from libjsoncpp to rapidjson, but one FIXME remained in the code by mistake. Message-Id: <94b3cf53652aa932a661153c27aa2cb1207268c7.1565271432.git.sarna@scylladb.com>	2019-08-19 15:49:52 +03:00
Piotr Sarna	76bc30a82d	alternator: remove decimal_type FIXME Decimal precision problems were already solved by commit d5a1854d93c9448b1d22c2d02eb1c46a286c5404, but one FIXME remained in the code by mistake. Message-Id: <381619e26f8362a8681b83e6920052919acf1142.1565271198.git.sarna@scylladb.com>	2019-08-19 15:49:52 +03:00
Piotr Sarna	972474a215	alternator: add comments to rjson The rapidjson library needs to be used with caution in order to provide maximum performance and avoid undefined behavior. Comments added to rjson.hh describe provided methods and potential pitfalls to avoid. Message-Id: <ba94eda81c8dd2f772e1d336b36cae62d39ed7e1.1565270214.git.sarna@scylladb.com>	2019-08-19 15:49:52 +03:00
Piotr Sarna	3342ebff22	alternator: stop discarding futures in alternator server By mistake, some futures were discarded instead of being chained in alternator server initialization.	2019-08-19 15:49:52 +03:00
Piotr Sarna	cd2c581c7c	alternator: remove a pointer-based workaround for future<json> With libjsoncpp we were forced to work around the problem of non-noexcept constructors by using an intermediate unique pointer. Objects provided by rapidjson have correct noexcept specifiers, so the workaround can be dropped.	2019-08-19 15:49:52 +03:00
Piotr Sarna	e19a7f908e	alternator: migrate to rapidjson library Profiling alternator implied that JSON parsing takes up a fair amount of CPU, and as such should be optimized. libjsoncpp is a standard library for handling JSON objects, but it also proves slower than rapidjson, which is hereby used instead. The results indicated that libjsoncpp used roughly 30% of CPU for a single-shard alternator instance under stress, while rapidjson dropped that usage to 18% without optimizations. Future optimizations should include eliding object copying, string copying and perhaps experimenting with different JSON allocators.	2019-08-19 15:49:52 +03:00
Piotr Sarna	e9f1540de1	alternator: add handling rapidjson errors in the server If a JSON parsing error is encountered, it is transformed to a validation exception and returned to the user in JSON form.	2019-08-19 15:49:52 +03:00
Piotr Sarna	eb678ed63a	alternator: add rapidjson helper functions Migrating from libjsoncpp to rapidjson proved to be beneficial for parsing performance. As a first step, a set of helper functions is provided to ease the migration process.	2019-08-19 15:49:52 +03:00
Piotr Sarna	ebdd4022cf	alternator: add missing namespaces to status_type error.hh file implicitly assumed that seastar:: namespace is available when it's included, which is not always the case. To remedy that, seastar::httpd namespace is used explicitly.	2019-08-19 15:49:52 +03:00
Nadav Har'El	d6a8626e90	alternator: correctly catch table-already-exists exception Our CreateTable handler assumed that the function migration_manager::announce_new_column_family() returns a failed future if the table already exists. But in some of our code branches, this is not the case - the function itself throws instead of returning a failed future. The solution is to use seastar::futurize_apply() to handle both possibilities (direct exception or future holding an exception). This fixes a failure of the test_table.py::test_create_table_already_exists test case. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Nadav Har'El	62858c8466	alternator: add docs/alternator.md This adds a new document, docs/alternator.md, about Alternator. The scope of this document should be expanded in the future. We begin here by introducing Alternator and its current compatibility level with Amazon DynamoDB, but it should later grow to explain the design of Alternator and how it maps the DynamoDB data model onto Scylla's. Whether this document should remain a short high-level overview, or a long and detailed design document, remains an open question. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190805085340.17543-1-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Piotr Sarna	3716d23ce4	dependencies: add rapidjson The rapidjson fast JSON parsing library is used instead of libjsoncpp in the Alternator subproject. Message-Id: <a48104dec97c190e3762f927973a08a74fb0c773.1564995712.git.sarna@scylladb.com>	2019-08-19 15:49:52 +03:00
Nadav Har'El	1611b5dd4f	alternator: wait for sharded service to start We start()ed Alternator's sharded service, but forgot to wait for the future it returns! So on a multi-shard run which is slow enough (e.g., debug build), we sometimes get to invoke_on_all() before start() had completed, and fail to initialize the Alternator server. The fix is to just wait for the future returned by start() - just as similar code in main.cc does. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190804172006.14888-1-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Nadav Har'El	105533c046	alternator: fix sharing of a seastar::shared_ptr between threads The function attrs_type() returns a supposed singleton, but because it is a seastar::shared_ptr we can't use the same one for multiple threads, and need to use a separate one per thread. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190804163933.13772-1-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Nadav Har'El	7c23e23e7d	alternator: fix cross-shard use of CQL type objects The CQL type singletons like utf8_type et al. are separate for separate shards and cannot be used across shards. So whatever hash tables we use to find them also need to be per-shard. If we fail to do this, we get errors running the debug build with multiple shards. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190804165904.14204-1-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Nadav Har'El	a5057a3b6e	alternator-test: some more GSI tests Expand the GSI test suite. The most important new test is test_gsi_key_not_in_index(), where the index's key includes just one of the base table's key columns, but not a second one. In this case, the Scylla implementation will nevertheless need to add the second key column to the view (as a clustering key), even though it isn't considered a key column by the DynamoDB API. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190718085606.7763-1-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Nadav Har'El	8d8baccdc4	alternator: ListTables should not list materialized views Our ListTables implementation uses get_column_families(), which lists both base tables and materialized views. We will use materialized views to implement DynamoDB's secondary indexes, and those should not be listed in the results of ListTables. The patch also includes a test for this. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190717133103.26321-2-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Nadav Har'El	f8c7a2e0b8	alternator-test: move list_tables to util.py The list_tables() utility function was used only in test_table.py but I want to use it elsewhere too (in GSI test) so let's move it to util.py. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190717133103.26321-1-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Piotr Sarna	99fd032b1f	alternator: make set_sum exception more user-friendly As in case of set_diff, an exception message in set_sum should include the user-provided request (ADD) rather than our internal helper function set_sum.	2019-08-19 15:49:52 +03:00
Piotr Sarna	7984050054	alternator-tests: enable DELETE case for sets UpdateExpression's case for DELETE operation for sets is enabled.	2019-08-19 15:49:52 +03:00
Piotr Sarna	d7f75b405b	alternator: implement set DELETE UpdateExpression's DELETE operation for set is implemented on top of set_diff helper function.	2019-08-19 15:49:52 +03:00
Piotr Sarna	1d19934bc6	alternator: add set difference helper function A function for computing set difference of two sets represented as JSON is added.	2019-08-19 15:49:52 +03:00
Nadav Har'El	493890c6f6	alternator: fail attempt to create table with GSI Although we do not support GSI yet, until now we silently ignored CreateTable's GSI parameter, and the user wouldn't know the table wasn't created as intended. In this patch, GSI is still unsupported, but now CreateTable will fail with an error message that GSI is not supported. We need to change some of the tests which test the error path, and expect an error - but should not consider a table creation error as the expected error. After this patch, test_gsi.py still fails all the tests on Alternator, but much more quickly :-) Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190711161420.18547-1-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Piotr Sarna	e550a666e3	alternator-test: add stub case for set add duplication The test case for adding two sets with common values is added. This case is a stub, because boto3 transforms the result into a Python set, which removes duplicates on its own. A proper TODO is left in order to migrate this case to a lower-level API and check the returned JSON directly for lack of duplicates.	2019-08-19 15:49:52 +03:00
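
The replication-factor rule described in commit c9345d8a0e (a one-node cluster gets RF=1, a cluster of three or more nodes gets RF=3, and choosing RF=1 is logged as a warning) can be sketched roughly as follows. This is an illustrative Python sketch, not the code from the patch: the function name, the logger name, and the way the node count is passed in are all hypothetical, and the commit message does not say what happens with exactly two nodes, so the fallback to RF=1 in that case is an assumption.

```python
import logging

# Hypothetical logger name, used only for this sketch.
logger = logging.getLogger("alternator_rf_sketch")


def choose_replication_factor(node_count: int) -> int:
    """Pick a keyspace RF from the number of nodes in the cluster.

    Mirrors the rule stated in the commit message: RF=3 for clusters
    with at least three nodes, RF=1 for a one-node cluster.
    """
    if node_count < 1:
        raise ValueError("a cluster needs at least one node")
    if node_count >= 3:
        rf = 3
        logger.info("Creating keyspace with RF=%d", rf)
    else:
        # Assumption: a two-node cluster also falls back to RF=1.
        # The commit message only covers one node and >=3 nodes.
        rf = 1
        logger.warning(
            "Creating keyspace with RF=%d because the cluster has only "
            "%d node(s)", rf, node_count)
    return rf
```

Because the patch delays keyspace creation until the first table is created, a check of this kind would run at CreateTable time rather than at boot, when the cluster is presumably already up.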


19475 Commits