scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-20 00:20:47 +00:00

Author	SHA1	Message	Date
Avi Kivity	bbad8f4677	replica: move ::database, ::keyspace, and ::table to replica namespace Move replica-oriented classes to the replica namespace. The main classes moved are ::database, ::keyspace, and ::table, but a few ancillary classes are also moved. There are certainly classes that should be moved but aren't (like distributed_loader) but we have to start somewhere. References are adjusted treewide. In many cases, it is obvious that a call site should not access the replica (but the data_dictionary instead), but that is left for separate work. scylla-gdb.py is adjusted to look for both the new and old names.	2022-01-07 12:04:38 +02:00
Avi Kivity	ae3a360725	database: Move database, keyspace, table classes to replica/ directory The database, keyspace, and table classes represent the replica-only part of the objects after which they are named. Reading from a table doesn't give you the full data, just the replica's view, and it is not consistent since reconciliation is applied on the coordinator. As a first step in acknowledging this, move the related files to a replica/ subdirectory.	2022-01-06 17:07:30 +02:00
Piotr Sarna	f49c20aa24	thrift: drop obtaining incorrect permits The thrift layer started partially having admission control after commit `ef1de114f0`, but code inspection suggests that it might cause use-after-free in a few cases, when a permit is obtained more than once per handling - due to the fact that some functions tail-called other functions, which also obtain a permit. These extraneous permits are not taken anyore. Tests: "please trust me" + cassandra-stress in thrift mode Message-Id: <ac5d711288b22c5fed566937722cceeabc234e16.1639394937.git.sarna@scylladb.com>	2021-12-17 09:35:24 +02:00
Gleb Natapov	38e1f85959	migration_manager: drop view_ptr array from announce_column_family_update() No users pass it any longer.	2021-12-11 12:31:07 +02:00
Pavel Emelyanov	d86b35f474	thrift: Use local proxy reference in do_paged_slice This place need some more care than simple replacement Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-12-03 17:56:04 +03:00
Pavel Emelyanov	35c35602ae	thrift: Use local proxy reference in handler methods Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-12-03 17:56:04 +03:00
Pavel Emelyanov	2d8272dc03	thrift: Keep sharded proxy reference on thrift_handler Carried via main -> controller -> server -> factory -> handler Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-12-03 17:48:19 +03:00
Nadav Har'El	666017f2f0	Merge 'Convert last uses of sprint() to fmt::format()' from Avi Kivity sprint() uses the printf-style formatting language while most of our code uses the Python-derived format language from fmt::format(). The last mass conversion of sprint() to fmt (in `1129134a4a`) missed some callers (principally those that were on multiple lines, and so the automatic converter missed them). Convert the remainder to fmt::format(), and some sprintf() and printf() calls, so we have just one format language in the code base. Seastar::sprint() ought to be deprecated and removed. Test: unit (dev) Closes #9529 * github.com:scylladb/scylla: utils: logalloc: convert debug printf to fmt::print() utils: convert fmt::fprintf() to fmt::print() main: convert fprint() to fmt::print() compress: convert fmt::sprintf() to fmt::format() tracing: replace seastar::sprint() with fmt::format() thrift: replace seastar::sprint() with fmt::format() test: replace seastar::sprint() with fmt::format() streaming: replace seastar::sprint() with fmt::format() storage_service: replace seastar::sprint() with fmt::format() repair: replace seastar::sprint() with fmt::format() redis: replace seastar::sprint() with fmt::format() locator: replace seastar::sprint() with fmt::format() db: replace seastar::sprint() with fmt::format() cql3: replace seastar::sprint() with fmt::format() cdc: replace seastar::sprint() with fmt::format() auth: replace seastar::sprint() with fmt::format()	2021-10-28 22:33:23 +03:00
Benny Halevy	a2fc3345bd	storage_service: futurize storage_service::describe_ring Convert storage_service::describe_ring to a coroutine to prevent reactor stalls as seen in #9280. Fixes #9280 Closes #9282 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #9282	2021-10-28 16:51:57 +03:00
Avi Kivity	d1616a7643	thrift: replace seastar::sprint() with fmt::format() sprint() is obsolete. Note InvalidRequestException used sprint() with runtime format, so both it and its callers were updated.	2021-10-27 17:02:00 +03:00
Avi Kivity	2d25705db0	cql3: deinline non-trivial methods in selection.hh This allows us to forward-declare raw_selector, which in turn reduces indirect inclusions of expression.hh from 147 to 58, reducing rebuilds when anything in that area changes. Includes that were lost due to the change are restored in individual translation units. Closes #9434	2021-10-05 12:58:55 +02:00
Pavel Emelyanov	de7761985c	client_state: Add database argument to has_schema_access The only caller is thrift that has database reference on board Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-08-27 14:07:26 +03:00
Pavel Emelyanov	36a4c1ddc1	client_state: Add database argument to has_keyspace_access() Callers are cql3, that has database via proxy, and thrift that has one by reference. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-08-27 14:07:18 +03:00
Pavel Emelyanov	5a13031ce8	thrift: Use local storage service in handlers Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-29 05:12:36 +03:00
Pavel Emelyanov	f2992f4e32	thrift: Carry sharded<storage_service>& down to handler The thrift_handler class' methods need storage service. This patch makes sure this class has sharded storage service reference on board. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-29 05:12:36 +03:00
Juliusz Stasiewicz	a8b741efe2	endpoint_details: store `_host` as `gms::inet_address` In an upcoming commit I will add "system.describe_ring" table which uses endpoint's inet address as a part of CK and, therefore, needs to keep them sorted with `inet_addr_type::less`.	2021-07-20 14:00:54 +02:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Pavel Solodovnikov	e0749d6264	treewide: some random header cleanups Eliminate not used includes and replace some more includes with forward declarations where appropriate. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2021-06-06 19:18:49 +03:00
Piotr Sarna	d1f2e8b469	treewide: propagate service level to client state ... since it's going to be used to set up per-service-level timeouts.	2021-05-10 11:48:14 +02:00
Piotr Sarna	7e6beabf27	migration_manager: allow table updates with timestamp In order to avoid needless schema disagreements, a way of announcing a schema change with fixed timestamp is added. That way, when nodes update schemas of their internal tables (e.g. during updates), it's possible for all nodes to use an identical timestamp for this operation, which in turn makes their digests identical.	2021-05-10 10:10:38 +02:00
Avi Kivity	daeddda7cc	treewide: remove inclusions of storage_proxy.hh from headers storage_proxy.hh is huge and includes many headers itself, so remove its inclusions from headers and re-add smaller headers where needed (and storage_proxy.hh itself in source files that need it). Ref #1.	2021-04-20 21:23:00 +03:00
Piotr Sarna	ef1de114f0	thrift: partially add admission control This commit adds admission control in the form of passing service permits to the Thrift server. The support is partial, because Thrift also supports running CQL queries, and for that purpose a query_state object is kept in the Thrift handler. However, the handler is generally created once per connection, not once per query, and the query_state object is supposed to keep the state of a single query only. In order to keep this series simpler, the CQL-on-top-of-Thrift layer is not touched and is left as TODO. Moreover, the Thrift layer does not make it easy to pass custom per-query context (like service_permit), so the implementation uses a trick: the service permit is created on the server and then passed as reference to its connections and their respective Thrift handlers. Then, each time a query is read from the socket, this service permit is overwritten and then read back from the Thrift handler. This mechanism heavily relies on the fact that there are zero preemption points between overwriting the service permit and reading it back by the handler. Otherwise, races may occur. This assumption was verified by code inspection + empirical tests, but if somebody is aware that it may not always hold, please speak up.	2021-03-29 13:05:16 +02:00
Pavel Emelyanov	fb49550943	thrift: Use query_processor::get_migration_manager() Thrift needs migration manager to call announce_<something> on it and currently it grabs blobak migration manager instance. Since thrift handler has query processor rerefence onboard and the query processor can provide the migration manager reference, it's time to remove few more globals from thrift code. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-03-15 19:35:59 +03:00
Piotr Sarna	c5214eb096	treewide: remove timeout config from query options Timeout config is now stored in each connection, so there's no point in tracking it inside each query as well. This patch removes timeout_config from query_options and follows by removing now unnecessary parameters of many functions and constructors.	2021-02-25 17:20:27 +01:00
Piotr Sarna	7ceafda70a	service: add timeout config to client state Future patches will use this per-connection timeout config to allow setting different timeouts for each session, based on roles.	2021-02-25 17:20:26 +01:00
Gleb Natapov	382ee066bf	database: drop duplicated function The database lass have to duplicated functions keyspaces() and get_keyspaces(). Drop the former since it is used in one place only. Message-Id: <20210201135333.GA1403508@scylladb.com>	2021-02-01 18:52:04 +02:00
Avi Kivity	60f5ec3644	Merge 'managed_bytes: switch to explicit linearization' from Michał Chojnowski This is a revival of #7490. Quoting #7490: The managed_bytes class now uses implicit linearization: outside LSA, data is never fragmented, and within LSA, data is linearized on-demand, as long as the code is running within with_linearized_managed_bytes() scope. We would like to stop linearizing managed_bytes and keep it fragmented at all times, since linearization can require large contiguous chunks. Large contiguous allocations are hard to satisfy and cause latency spikes. As a first step towards that, we remove all implicitly linearizing accessors and replace them with an explicit linearization accessor, with_linearized(). Some of the linearization happens long before use, by creating a bytes_view of the managed_bytes object and passing it onwards, perhaps storing it for later use. This does not work with with_linearized(), which creates a temporary linearized view, and does not work towards the longer term goal of never linearizing. As a substitute a managed_bytes_view class is introduced that acts as a view for managed_bytes (for interoperability it can also be a view for bytes and is compatible with bytes_view). By the end of the series, all linearizations are temporary, within the scope of a with_linearized() call and can be converted to fragmented consumption of the data at leisure. This has limited practical value directly, as current uses of managed_bytes are limited to keys (which are limited to 64k). However, it enables converting the atomic_cell layer back to managed_bytes (so we can remove IMR) and the CQL layer to managed_bytes/managed_bytes_view, removing contiguous allocations from the coordinator. Closes #7820 * github.com:scylladb/scylla: test: add hashers_test memtable: fix accounting of managed_bytes in partition_snapshot_accounter test: add managed_bytes_test utils: fragment_range: add a fragment iterator for FragmentedView keys: update comments after changes and remove an unused method mutation_test: use the correct preferred_max_contiguous_allocation in measuring_allocator row_cache: more indentation fixes utils: remove unused linearization facilities in `managed_bytes` class misc: fix indentation treewide: remove remaining `with_linearized_managed_bytes` uses memtable, row_cache: remove `with_linearized_managed_bytes` uses utils: managed_bytes: remove linearizing accessors keys, compound: switch from bytes_view to managed_bytes_view sstables: writer: add write_* helpers for managed_bytes_view compound_compat: transition legacy_compound_view from bytes_view to managed_bytes_view types: change equal() to accept managed_bytes_view types: add parallel interfaces for managed_bytes_view types: add to_managed_bytes(const sstring&) serializer_impl: handle managed_bytes without linearizing utils: managed_bytes: add managed_bytes_view::operator[] utils: managed_bytes: introduce managed_bytes_view utils: fragment_range: add serialization helpers for FragmentedMutableView bytes: implement std::hash using appending_hash utils: mutable_view: add substr() utils: fragment_range: add compare_unsigned utils: managed_bytes: make the constructors from bytes and bytes_view explicit utils: managed_bytes: introduce with_linearized() utils: managed_bytes: constrain with_linearized_managed_bytes() utils: managed_bytes: avoid internal uses of managed_bytes::data() utils: managed_bytes: extract do_linearize_pure() thrift: do not depend on implicit conversion of keys to bytes_view clustering_bounds_comparator: do not depend on implicit conversion of keys to bytes_view cql3: expression: linearize get_value_from_mutation() eariler bytes: add to_bytes(bytes) cql3: expression: mark do_get_value() as static	2021-01-18 11:01:28 +02:00
Gleb Natapov	d3aa17591c	migration_manager: drop announce_locally flag It looks like the history of the flag begins in Cassandra's https://issues.apache.org/jira/browse/CASSANDRA-7327 where it is introduced to speedup tests by not needing to start the gossiper. The thing is we always start gossiper in our cql tests, so the flag only introduce noise. And, of course, since we want to move schema to use raft it goes against the nature of the raft to be able to apply modification only locally, so we better get rid of the capability ASAP. Tests: units(dev, debug) Message-Id: <20201230111101.4037543-2-gleb@scylladb.com>	2021-01-03 13:58:09 +02:00
Avi Kivity	4b3f0fd7c0	thrift: do not depend on implicit conversion of keys to bytes_view This implicit conversion will soon be gone, as it is dangerous. Ask for the representation explicitly.	2020-12-20 15:14:44 +01:00
Pavel Emelyanov	b0c4a9087d	client_state: Add database& arg to has_column_family_access It is called from cql3/statements' check_access methods and from thrift handlers. The former have proxy argument from which they can get the database. The latter already have the database itself on board. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-12-11 18:49:16 +03:00
Tomasz Grabiec	0c5d23d274	thrift: Validate cell names when constructing clustering keys Currently, if the user provides a cell name with too many components, we will accept it and construct an invalid clusterin key. This may result in undefined behavior down the stream. It was caught by ASAN in a debug build when executing dtest cql_tests.py:MiscellaneousCQLTester.cql3_insert_thrift_test with nodetool flush manually added after the write. Triggered during sstable writing to an MC-format sstable: seastar::shared_ptr<abstract_type const>::operator*() const at ././seastar/include/seastar/core/shared_ptr.hh:577 sstables::mc::clustering_blocks_input_range::next() const at ./sstables/mx/writer.cc:180 To prevent corrupting the state in this way, we should fail early. This patch addds validation which will fail thrift requests which attempt to create invalid clustering keys. Fixes #7568. Example error: Internal server error: Cell name of ks.test has too many components, expected 1 got 2 in 0x0004000000040000017600 Message-Id: <1605550477-24810-1-git-send-email-tgrabiec@scylladb.com>	2020-12-01 15:12:08 +02:00
Dejan Mircevski	fb6c011b52	everywhere: Insert space after `switch` Quoth @avikivity: "switch is not a function, and we celebrate that by putting a space after it like other control-flow keywords." https://github.com/scylladb/scylla/pull/7052#discussion_r471932710 Tests: unit (dev) Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-08-18 14:31:04 +03:00
Tomasz Grabiec	bfd129cffe	thrift: Fix crash on unsorted column names in SlicePredicate The column names in SlicePredicate can be passed in arbitrary order. We converted them to clustering ranges in read_command preserving the original order. As a result, the clustering ranges in read command may appear out of order. This violates storage engine's assumptions and lead to undefined behavior. It was seen manifesting as a SIGSEGV or an abort in sstable reader when executing a get_slice() thrift verb: scylla: sstables/consumer.hh:476: seastar::future<> data_consumer::continuous_data_consumer<StateProcessor>::fast_forward_to(size_t, size_t) [with StateProcessor = sstables::data_consume_rows_context_m; size_t = long unsigned int]: Assertion `end >= _stream_position.position' failed. Fixes #6486. Tests: - added a new dtest to thrift_tests.py which reproduces the problem Message-Id: <1596725657-15802-1-git-send-email-tgrabiec@scylladb.com>	2020-08-06 19:13:22 +03:00
Wojciech Mitros	45215746fe	increase the maximum size of query results to 2^64 Currently, we cannot select more than 2^32 rows from a table because we are limited by types of variables containing the numbers of rows. This patch changes these types and sets new limits. The new limits take effect while selecting all rows from a table - custom limits of rows in a result stay the same (2^32-1). In classes which are being serialized and used in messaging, in order to be able to process queries originating from older nodes, the top 32 bits of new integers are optional and stay at the end of the class - if they're absent we assume they equal 0. The backward compatibility was tested by querying an older node for a paged selection, using the received paging_state with the same select statement on an upgraded node, and comparing the returned rows with the result generated for the same query by the older node, additionally checking if the paging_state returned by the upgraded node contained new fields with correct values. Also verified if the older node simply ignores the top 32 bits of the remaining rows number when handling a query with a paging_state originating from an upgraded node by generating and sending such a query to an older node and checking the paging_state in the reply(using python driver). Fixes #5101.	2020-08-03 17:32:49 +02:00
Botond Dénes	92a7b16cba	query: read_command: add max_result_size This field will replace max size which is currently passed once per established rpc connection via the CLIENT_ID verb and stored as an auxiliary value on the client_info. For now it is unused, but we update all sites creating a read command to pass the correct value to it. In the next patch we will phase out the old max size and use this field to pass max size on each verb instead.	2020-07-28 18:00:29 +03:00
Botond Dénes	8992bcd1f8	query: read_command: use tagged ints for limit ctor params The convenience constructor of read_command now has two integer parameter next to each other. In the next patch we intend to add another one. This is recipe for disaster, so to avoid mistakes this patch converts these parameters to tagged integers. This makes sure callers pass what they meant to pass. As a matter of fact, while fixing up call-sites, I already found several ones passing `query::max_partitions` to the `row_limit` parameter. No harm done yet, as `query::max_partitions` == `query::max_rows` but this shows just how easy it is to mix up parameters with the same type.	2020-07-28 18:00:29 +03:00
Botond Dénes	2ca118b2d5	query: read_command: add separate convenience constructor query::read_command currently has a single constructor, which serves both as an idl constructor (order of parameters is fixed) and a convenience one (most parameters have default values). This makes it very error prone to add new parameters, that everyone should fill. The new parameter has to be added as last, with a default value, as the previous ones have a default value as well. This means the compiler's help cannot be enlisted to make sure all usages are updated. This patch adds a separate convenience constructor to be used by normal code. The idl constructor looses all default parameters. New parameters can be added to any position in the convenience constructor (to force users to fill in a meaningful value) while the removed default parameters from the idl constructor means code cannot accidentally use it without noticing.	2020-07-28 18:00:29 +03:00
Gleb Natapov	d5b0cf975a	cql transport: get rid of unneeded shared_ptr There is no point to hold prepared_metadata in result_message::prepared as a shared_ptr since their lifetime match. Message-Id: <20200610113217.GF335449@scylladb.com>	2020-06-10 15:48:40 +03:00
Avi Kivity	a4c44cab88	treewide: update concepts language from the Concepts TS to C++20 Seastar recently lost support for the experimental Concepts Technical Specification (TS) and gained support for C++20 concepts. Re-enable concepts in Scylla by updating our use of concepts to the C++20 standard. This change: - peels off uses of the GCC6_CONCEPT macro - removes inclusions of <seastar/gcc6-concepts.hh> - replaces function-style concepts (no longer supported) with equation-style concepts - semicolons added and removed as needed - deprecated std::is_pod replaced by recommended replacement - updates return type constraints to use concepts instead of type names (either std::same_as or std::convertible_to, with std::same_as chosen when possible) No attempt is made to improve the concepts; this is a specification update only. Message-Id: <20200531110254.2555854-1-avi@scylladb.com>	2020-06-02 09:12:21 +03:00
Avi Kivity	2d933c62ec	thrift: capture "this" explicitly in lambda C++20 deprecates capturing this in default-copy lambdas ([=]), with good reason. Move to explicit captures to avoid any ambiguity and reduce warning spew. Message-Id: <20200517151023.754906-1-avi@scylladb.com>	2020-05-18 10:24:00 +03:00
Piotr Jastrzebski	ae1f14095f	i_partitioner: Stop distinguishing whether keys order is preserved Scylla inherited a concept of partitioners that preserve order of keys from the origin but it is not used for anything. Moreover, none of the existing partitioners preserves keys order. The only partitioner that did this in the past was ByteOrderedPartitioner and Scylla does not support it any more. For a partitioner to preserve an order of the keys means that if there are two keys A and B such that A < B then token(A) < token(B) where token(X) isa token the partitioner assignes to key X. This patch removes dht::i_partitioner::preserves_order with all its overrides. The only place that was using this member function was a check in thrift server and it is safe to remove the check because the check was only done to differentiate the error message for partitioners that do and do not preserve the order of the keys. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-04-21 15:50:22 +02:00
Rafael Ávila de Espíndola	8da235e440	everywhere: Use futurize_invoke instead of futurize<T>::invoke No functionality change, just simpler. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200330165308.52383-1-espindola@scylladb.com>	2020-04-03 15:53:35 +02:00
Rafael Ávila de Espíndola	eca0ac5772	everywhere: Update for deprecated apply functions Now apply is only for tuples, for varargs use invoke. This depends on the seastar changes adding invoke. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200324163809.93648-1-espindola@scylladb.com>	2020-03-25 08:49:53 +02:00
Rafael Ávila de Espíndola	01ac4aef3a	everywhere: Use futurize_apply instead of futurize<void>::apply No functionality change, just simpler. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200318234149.283090-1-espindola@scylladb.com>	2020-03-21 19:51:38 +02:00
Pavel Emelyanov	d551f0323a	thrift: Forget cql_config The thrift handlers already mess with query_processor which has the config in question. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-03-09 11:57:30 +03:00
Rafael Ávila de Espíndola	94d07fba07	Pass string_view to the keyspace_metadata constructor This avoids a few sstring copies when constructing keyspace_metadata. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-02-28 17:04:12 -08:00
Piotr Jastrzebski	7c6f415647	thrift: reduce global_partitioner() calls Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-02-17 10:59:15 +01:00
Piotr Jastrzebski	ca4a89d239	dht: add dht::decorate_key and replace all dht::global_partitioner().decorate_key with dht::decorate_key It is an improvement because dht::decorate_key takes schema and uses it to obtain partitioner instead of using global partitioner as it was before. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-02-17 10:59:06 +01:00
Konstantin Osipov	2e07c76153	query_processor: rename process_statement_prepared Rename process_statement_prepared to execute_prepared for consistency with the rest of query_processor API.	2020-02-12 16:37:08 +03:00
Konstantin Osipov	1a53458239	query_processor: rename one overload of process() Rename an overloaded function process() to execute_direct(). Execute direct is a common term for executing a statement that was not previously prepared. See, for example SQLExecuteDirect in ODBC/SQL CLI specification, mysql_stmt_execute_direct() in MySQL C API or EXECUTE DIRECT in Postgres XC.	2020-02-12 16:36:56 +03:00

1 2 3 4 5 ...

292 Commits