scylladb

Author	SHA1	Message	Date
Botond Dénes	3fab83b3a1	flat_mutation_reader: impl: add reader_permit parameter Not used yet, this patch does all the churn of propagating a permit to each impl. In the next patch we will use it to track to track the memory consumption of `_buffer`.	2020-09-28 10:53:48 +03:00
Avi Kivity	ecb2bdad54	Merge 'Replace operator_type with an enum' from Dejan " operator_type is awkward because it's not copyable or assignable. Replace it with a new enum class. Tests: unit(dev) " * dekimir-operator-type: cql3: Drop operator_type entirely cql3: Drop operator_type from the parser cql3/expr: Replace operator_type with an enum	2020-08-18 13:45:20 +03:00
Dejan Mircevski	1aa326c93b	cql3: Drop operator_type entirely Since no live code uses it anymore, it can be safely removed. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-08-18 12:27:01 +02:00
Dejan Mircevski	71c921111d	cql3/expr: Replace operator_type with an enum operator_type is awkward because it's not copyable or assignable. Replace it in expression representation with a new enum class, oper_t. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-08-18 12:27:00 +02:00
Piotr Jastrzebski	c001374636	codebase wide: replace count with contains C++20 introduced `contains` member functions for maps and sets for checking whether an element is present in the collection. Previously `count` function was often used in various ways. `contains` does not only express the intend of the code better but also does it in more unified way. This commit replaces all the occurences of the `count` with the `contains`. Tests: unit(dev) Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <b4ef3b4bc24f49abe04a2aba0ddd946009c9fcb2.1597314640.git.piotr@scylladb.com>	2020-08-15 20:26:02 +03:00
Piotr Sarna	4cb79f04b0	treewide: replace libjsoncpp usage with rjson In order to eventually switch to a single JSON library, most of the libjsoncpp usage is dropped in favor of rjson. Unfortunately, one usage still remains: test/utils/test_repl utility heavily depends on the exact textual format of its output JSON files, so replacing a library results in all tests failing because of differences in formatting. It is possible to force rjson to print its documents in the exact matching format, but that's left for later, since the issue is not critical. It would be nice though if our test suite compared JSON documents with a real JSON parser, since there are more differences - e.g. libjsoncpp keeps children of the object sorted, while rapidjson uses an unordered data structure. This change should cause no change in semantics, it strives just to replace all usage of libjsoncpp with rjson.	2020-07-03 10:27:23 +02:00
Avi Kivity	0c6bbc84cd	Merge "Classify queries based on their initiator, rather than their target" from Botond " Currently we classify queries as "system" or "user" based on the table they target. The class of a query determines how the query is treated, currently: timeout, limits for reverse queries and the concurrency semaphore. The catch is that users are also allowed to query system tables and when doing so they will bypass the limits intended for user queries. This has caused performance problems in the past, yet the reason we decided to finally address this is that we want to introduce a memory limit for unpaged queries. Internal (system) queries are all unpaged and we don't want to impose the same limit on them. This series uses scheduling groups to distinguish user and system workloads, based on the assumption that user workloads will run in the statement scheduling group, while system workloads will run in the main (or default) scheduling group, or perhaps something else, but in any case not in the statement one. Currently the scheduling group of reads and writes is lost when going through the messaging service, so to be able to use scheduling groups to distinguish user and system reads this series refactors the messaging service to retain this distinction across verb calls. Furthermore, we execute some system reads/writes as part of user reads/writes, such as auth and schema sync. These processes are tagged to run in the main group. This series also centralises query classification on the replica and moves it to a higher level. More specifically, queries are now classified -- the scheduling group they run in is translated to the appropriate query class specific configuration -- on the database level and the configuration is propagated down to the lower layers. Currently this query class specific configuration consists of the reader concurrency semaphore and the max memory limit for otherwise unlimited queries. A corollary of the semaphore begin selected on the database level is that the read permit is now created before the read starts. A valid permit is now available during all stages of the read, enabling tracking the memory consumption of e.g. the memtable and cache readers. This change aligns nicely with the needs of more accurate reader memory tracking, which also wants a valid permit that is available in every layer. The series can be divided roughly into the following distinct patch groups: * 01-02: Give system read concurrency a boost during startup. * 03-06: Introduce user/system statement isolation to messaging service. * 07-13: Various infrastructure changes to prepare for using read permits in all stages of reads. * 14-19: Propagate the semaphore and the permit from database to the various table methods that currently create the permit. * 20-23: Migrate away from using the reader concurrency semaphore for waiting for admission, use the permit instead. * 24: Introduce `database::make_query_config()` and switch the database methods needing such a config to use it. * 25-31: Get rid of all uses of `no_reader_permit()`. * 32-33: Ban empty permits for good. * 34: querier_cache: use the queriers' permits to obtain the semaphore. Fixes: #5919 Tests: unit(dev, release, debug), dtest(bootstrap_test.py:TestBootstrap.start_stop_test_node), manual testing with a 2 node mixed cluster with extra logging. " * 'query-class/v6' of https://github.com/denesb/scylla: (34 commits) querier_cache: get semaphore from querier reader_permit: forbid empty permits reader_permit: fix reader_resources::operator bool treewide: remove all uses of no_reader_permit() database: make_multishard_streaming_reader: pass valid permit to multi range reader sstables: pass valid permits to all internal reads compaction: pass a valid permit to sstable reads database: add compaction read concurrency semaphore view: use valid permits for reads from the base table database: use valid permit for counter read-before-write database: introduce make_query_class_config() reader_concurrency_semaphore: remove wait_admission and consume_resources() test: move away from reader_concurrency_semaphore::wait_admission() reader_permit: resource_units: introduce add() mutation_reader: restricted_reader: work in terms of reader_permit row_cache: pass a valid permit to underlying read memtable: pass a valid permit to the delegate reader table: require a valid permit to be passed to most read methods multishard_mutation_query: pass a valid permit to shard mutation sources querier: add reader_permit parameter and forward it to the mutation_source ...	2020-05-29 10:11:44 +03:00
Glauber Costa	44a0e40cb2	compaction: move compaction_strategy_type to its own header I just hit a circularity in header inclusion that I traced back to the fact that schema.hh includes compaction_strategy.hh. schema.hh is in turn included in lots of places, so a circularity is not hard to come by. The schema header really only needs to know about the compaction_type, so it can inform schema users about it. Following the trend in header clenups, I am moving that to a separate header which will both break the circularity and make sure we are included less stuff that is not needed. With this change, Scylla fails to compile due to a new missing forward declaration at index/secondary_index_manager.hh, so this is fixed. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20200527172203.915936-1-glauber@scylladb.com>	2020-05-29 08:14:27 +03:00
Botond Dénes	cc5137ffe3	table: require a valid permit to be passed to most read methods Now that the most prevalent users (range scan and single partition reads) all pass valid permits we require all users to do so and propagate the permit down towards `make_sstable_reader()`. The plan is to use this permit for restricting the sstable readers, instead of the semaphore the table is configured with. The various `make_streaming_*reader()` overloads keep using the internal semaphores as but they also create the permit before the read starts and pass it to `make_sstable_reader()`.	2020-05-28 11:34:35 +03:00
Nadav Har'El	7922b9eb8f	materialized views: reduce recompilation when db/view/view.hh changes. Before this patch, when db/view/view.hh was modified, 89 source files had to be recompiled. After this patch, this number is down to 5. Most of the irrelevant source files got view.hh by including database.hh, which included view.hh just for the definition of statistics. So in this patch we split the view statistics to a separate header file, view_stats.hh, and database.hh only includes that. A few source files which included only database.hh and also needed view.hh (for materialized-view related functions) now need to include view.hh explicitly. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200319121031.540-1-nyh@scylladb.com>	2020-03-19 15:46:14 +02:00
Pavel Emelyanov	4fa12f2fb8	header: De-bloat schema.hh The header sits in many other headers, but there's a handy schema_fwd.hh that's tiny and contains needed declarations for other headers. So replace shema.hh with schema_fwd.hh in most of the headers (and remove completely from some). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20200303102050.18462-1-xemul@scylladb.com>	2020-03-03 11:34:00 +01:00
Botond Dénes	dfc8b2fc45	treewide: replace reader_resource_tracer with reader_permit The former was never really more than a reader_permit with one additional method. Currently using it doesn't even save one from any includes. Now that readers will be using reader_permit we would have to pass down both to mutation_source. Instead get rid of reader_resource_tracker and just use reader_permit. Instead of making it a last and optional parameter that is easy to ignore, make it a first class parameter, right after schema, to signify that permits are now a prominent part of the reader API. This -- mostly mechanical -- patch essentially refactors mutation_source to ask for the reader_permit instead of reader_resource_tracking and updates all usage sites.	2020-01-28 08:13:16 +02:00
Amnon Heiman	6f58d51c83	secondary_index_manager: add the index_name_from_table_name function index_name_from_table_name is a reverse of index_table_name, it gets a table name that was generated for an index and return the name of the index that generated that table. Relates to #4192 Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2020-01-15 15:06:00 +02:00
Piotr Sarna	2ee8c6f595	index: add is_global_index() utility The helper function is useful for determining if given schema represents a global index.	2019-10-14 17:13:32 +02:00
Piotr Sarna	a8f7d64a08	index: mark token column as 'computed' when creating mv Secondary indexes use a computed token column to preserve proper query ordering. This column is now marked as 'computed'.	2019-07-19 11:58:42 +02:00
Piotr Sarna	757419b524	index: add serialization function for index targets Since target_parser is responsible for deserializing target strings, the function that serializes them belongs in the same class.	2019-03-20 10:51:26 +01:00
Piotr Sarna	074ed2c8a5	index: use proper local index target when adding index With global indexes, target column name is always the same as the string kept in 'options[target]' field. It's not the case for local indexes, and so a proper extracting function is used to get the value.	2019-03-20 10:20:24 +01:00
Piotr Sarna	2fcae3d0ec	index: add parsing target column name from local index targets When (re)creating a local index, the target string needs to be used to parse out the actual indexed column: "(base_pk_part1,base_pk_part2,base_pk_part3),actual_indexed_column". This column is later used to deterine if an index should be applied to a SELECT statement.	2019-03-20 10:20:24 +01:00
Piotr Sarna	de5e5ee1a5	index: add checking if serialized target implies local index This utility enables checking if the specified target indicated having a local index, even before base table schema is known.	2019-03-20 10:20:24 +01:00
Piotr Sarna	5672edc149	index: enable parsing multi-key targets Parsing index targets that consist of partition key columns followed by clustering key columns is enabled.	2019-03-20 10:20:24 +01:00
Piotr Sarna	9782381dd4	index: move target parser code to .cc file It will be useful later when expanding the implementation.	2019-03-20 10:20:24 +01:00
Piotr Sarna	9c984f9da9	index: fix indentation	2019-03-20 09:51:46 +01:00
Piotr Sarna	3b908b7b5d	index: add base partition keys to local index schema When the index is local, its partition key in underlying materialized view is the the same as base's, and the indexed column is a first clustering key. This implementation ensures that view and base rows will reside on the same partition, while querying the indexed column will be possible by putting it as a first clustering key part.	2019-03-20 09:51:46 +01:00
Piotr Sarna	cb20fc2e4f	index: make non-pointer overload of is_index function Previous interface enforced passing a shared pointer, which might result in calling unneeded shared_from_this().	2019-02-20 12:52:32 +01:00
Piotr Sarna	94db098d39	index: avoid copying when checking for is_index Previously is_index implementation used list_indexes() helper function, which copies data.	2019-02-20 12:52:32 +01:00
Duarte Nunes	aa476cd6c9	index/secondary_index_manager: Add virtual columns to MV Virtual columns are MV-specific columns that contribute to the liveness of view rows. However, we were not adding those columns when creating an index's underlying MV, causing indexes to miss base rows. Fixes #4144 Branches: master, branch-3.0 Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2019-01-27 22:30:12 +00:00
Duarte Nunes	fa2b0384d2	Replace std::experimental types with C++17 std version. Replace stdx::optional and stdx::string_view with the C++ std counterparts. Some instances of boost::variant were also replaced with std::variant, namely those that called seastar::visit. Scylla now requires GCC 8 to compile. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20190108111141.5369-1-duarte@scylladb.com>	2019-01-08 13:16:36 +02:00
Botond Dénes	1865e5da41	treewide: remove include database.hh from headers where possible Many headers don't really need to include database.hh, the include can be replaced by forward declarations and/or including the actually needed headers directly. Some headers don't need this include at all. Each header was verified to be compilable on its own after the change, by including it into an empty `.cc` file and compiling it. `.cc` files that used to get `database.hh` through headers that no longer include it were changed to include it themselves.	2018-12-14 08:03:57 +02:00
Avi Kivity	775b7e41f4	Update seastar submodule * seastar d59fcef...b924495 (2): > build: Fix protobuf generation rules > Merge "Restructure files" from Jesse Includes fixup patch from Jesse: " Update Seastar `#include`s to reflect restructure All Seastar header files are now prefixed with "seastar" and the configure script reflects the new locations of files. Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com> Message-Id: <5d22d964a7735696fb6bb7606ed88f35dde31413.1542731639.git.jhaberku@scylladb.com> "	2018-11-21 00:01:44 +02:00
Avi Kivity	7ae23d8f9b	index: convert sprint() to format() sprint() recently became more strict, throwing on sprint("%s", 5). Replace with the more modern format(). Mechanically converted with https://github.com/avikivity/unsprint.	2018-11-01 13:16:17 +00:00
Piotr Sarna	372644c909	index: add target_column getter to index Target column for an index is later needed to find matching restrictions.	2018-07-11 18:06:21 +02:00
Avi Kivity	6f23403137	Merge "Virtualize IndexInfo system table" from Duarte " The IndexInfo table tracks the secondary indexes that have already been populated. Since our secondary index implementation is backed by materialized views, we can virtualize that table so queries are actually answered by built_views. Fixes #3483 " * 'built-indexes-virtual-reader/v2' of github.com:duarten/scylla: tests/virtual_reader_test: Add test for built indexes virtual reader db/system_keysace: Add virtual reader for IndexInfo table db/system_keyspace: Explain that table_name is the keyspace in IndexInfo index/secondary_index_manager: Expose index_table_name() db/legacy_schema_migrator: Don't migrate indexes	2018-06-06 17:35:51 +03:00
Piotr Sarna	4a9bf7ed5b	index, tests: add token column to secondary index schema Additional token column is now present in every view schema that backs a secondary index. This column is always a first part of the clustering key, so it forces token order on queries. Column's name is ideally idx_token, but can be postfixed with a number to ensure its uniqueness. It also updates tests to make them acknowledge the new token order. Fixes #3423	2018-06-06 09:02:33 +02:00
Duarte Nunes	3e39985c7a	db/system_keysace: Add virtual reader for IndexInfo table The IndexInfo table tracks the secondary indexes that have already been populated. Since our secondary index implementation is backed by materialized views, we can virtualize that table so queries are actually answered by built_views. Fixes #3483 Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-06-04 11:14:17 +01:00
Duarte Nunes	bc4db67524	index/secondary_index_manager: Expose index_table_name() Expose secondary_index::index_table_name() so knowledge on how to built an index name can remain centralized. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-06-04 11:14:17 +01:00
Nadav Har'El	79c6bb642f	secondary index: fix indexing of partition-key column Indexing an only partition key component is not allowed (because it would be redundant), but it should be allowed to index one of several partition key components. We had a bug in that case: the underlying materialized view we created had the same column as both a partition key and a clustering key, which resulted in an assertion failure. This patch fixes that. Fixes #3404. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180501121544.22869-1-nyh@scylladb.com>	2018-05-02 08:06:38 +03:00
Nadav Har'El	21d7507b74	secondary index: move stuff out of db/index directory The db/index directory contains just a few lines of code that exists there for historical reasons. It's confusing that we have both db/index and index/ directory related to secondary-indexing. This patch moves what little is still in db/index/ to index/. In the future we should probably get rid of the "secondary_index" class we had there, but for now, let's at least not have a whole new directory for it. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180501101246.21143-1-nyh@scylladb.com>	2018-05-01 13:21:24 +03:00
Nadav Har'El	46d4f6f352	secondary index: fix yet another case sensitivity bug When the secondary index code builds a "%s IS NOT NULL" clause for a CQL statement, it needs to quote the column name if it needs to be (not only lowercase, digits and _). Fixes #3401. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180429221857.6248-7-nyh@scylladb.com>	2018-04-30 00:27:23 +02:00
Nadav Har'El	ecc85297a4	secondary index: clean up dead unquoting code In commit `d674b6f672`, I fixed a case- sensitive column name bug by avoiding CQL quoting of a column name in create_index_statement.cc when building a "targets" option string. However, there is also matching code in target_parser.hh to unquote that option string. So this unquoting code is no longer necessary, and should be dropped. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180429221857.6248-1-nyh@scylladb.com>	2018-04-30 00:27:23 +02:00
Duarte Nunes	9146de3118	service/migration_manager: Don't drop index-backing MV Unless dropped by the index itself, forbid dropping an index-backing MV using `drop materialized view`. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20180424140745.7144-1-duarte@scylladb.com>	2018-04-24 17:01:59 +01:00
Pekka Enberg	3150962cb7	index: Fix index view schema when primary key component is indexed This fixes index view schema to exclude indexed column when a primary key component like clustering key is indexed. This fixes a server crash when CREATE INDEX statement is executed on a clustering key column.	2017-11-03 10:12:58 +02:00
Pekka Enberg	678a6f6e2f	index: Implement index::supports_expression() for EQ operator	2017-11-03 09:10:43 +02:00
Pekka Enberg	1ae9343f68	index: Fix index::supports_expression() operator parameter type The cql3::operator_type is supposed to be passed around as const reference, not by value; otherwise equality won't work.	2017-11-03 09:10:43 +02:00
Pekka Enberg	ed4c96c025	index: Add secondary_index_manager::create_view_for_index() This patch adds a create_view_for_index() function, which creates a view_ptr for index_metadata.	2017-10-05 10:07:44 +03:00
Pekka Enberg	a809ea902e	index: Add target_parser::parse() helper	2017-10-05 10:07:44 +03:00
Pekka Enberg	50943ce592	index: Add secondary_index_manager::get_dependent_indices()	2017-10-05 10:07:44 +03:00
Pekka Enberg	9ebd8be82b	index: Add secondary_index_manager::reload() This patch adds a reload() function, which updates the secondary index manager index map to match underlying column family indices.	2017-09-18 14:31:35 +03:00
Pekka Enberg	2ae6b141e5	index: Add secondary_index_manager::list_indexes()	2017-09-18 14:27:35 +03:00
Pekka Enberg	870de26e35	index: Add index class Add a simple index class, which represents an instantiated index.	2017-08-24 14:00:02 +03:00
Pekka Enberg	d63a650b3f	index: Pass column_family to secondary_index_manager constructor We need column family for various secondary index manager operations.	2017-08-24 14:00:02 +03:00

1 2

52 Commits