scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-21 00:50:35 +00:00

Author	SHA1	Message	Date
Gleb Natapov	00b3f28199	db/system_keyspace: use user memory limits for local.paxos table Treat writes to local.paxos as user memory, as the number of writes is dependent on the amount of user data written with LWT. Fixes #5682 Message-Id: <20200130150048.GW26048@scylladb.com> (cherry picked from commit `b08679e1d3`)	2020-02-02 17:36:52 +02:00
Piotr Sarna	c0253d9221	db,view: fix checking for secondary index special columns A mistake in handling legacy checks for special 'idx_token' column resulted in not recognizing materialized views backing secondary indexes properly. The mistake is really a typo, but with bad consequences - instead of checking the view schema for being an index, we asked for the base schema, which is definitely not an index of itself. Branches 3.1,3.2 (asap) Fixes #5621 Fixes #4744 (cherry picked from commit `9b379e3d63`)	2020-01-21 23:32:11 +02:00
Tomasz Grabiec	36d90e637e	Merge "Relax migration manager dependencies" from Pavel Emalyanov The set make dependencies between mm and other services cleaner, in particular, after the set: - the query processor no longer needs migration manager (which doesn't need query processor either) - the database no longer needs migration manager, thus the mutual dependency between these two is dropped, only migration manager -> database is left - the migration manager -> storage_service dependency is relaxed, one more patchset will be needed to remove it, thus dropping one more mutual dependency between them, only the storage_service -> migration manager will be left - the migration manager is stopped on drain, but several more services need it on stop, thus causing use after free problems, in particular there's a caught bug when view builder crashes when unregistering from notifier list on stop. Fixed. Tests: unit(dev) Fixes: #5404	2020-01-16 12:12:25 +01:00
Piotr Jastrzębski	0c8c1ec014	config: fix description of enable_deprecated_partitioners Murmur3 is the default partitioner. ByteOrder and Random are the deprecated ones and should be mentioned in the description. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-01-16 12:05:50 +02:00
Pavel Emelyanov	5cf365d7e7	database: Explicitly pass migration_manager through init_non_system_keyspace This is the last place where database code needs the migration_manager instance to be alive, so now the mutual dependency between these two is gone, only the migration_manager needs the database, but not the vice-versa. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:29:21 +03:00
Pavel Emelyanov	28f1250b8b	view_builder: Use migration notifier The migration manager itself is still needed on start to wait for schema agreement, but there's no longer the need for the life-time reference on it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:28:21 +03:00
Pavel Emelyanov	e327feb77f	database: Prepare to use on-database migration_notifier Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:28:21 +03:00
Pavel Emelyanov	f240d5760c	migration_manager: Split notifier from main class The _listeners list on migration_manager class and the corresponding notify_xxx helpers have nothing to do with the its instances, they are just transport for notification delivery. At the same time some services need the migration manager to be alive at their stop time to unregister from it, while the manager itself may need them for its needs. The proposal is to move the migration notifier into a complete separate sharded "service". This service doesn't need anything, so it's started first and stopped last. While it's not effectively a "migration" notifier, we inherited the name from Cassandra and renaming it will "scramble neurons in the old-timers' brains but will make it easier for newcomers" as Avi says. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:28:19 +03:00
Gleb Natapov	51672e5990	paxos: immediately sync commitlog entries for writes made by paxos learn stage	2020-01-15 12:15:42 +02:00
Gleb Natapov	0fc48515d8	paxos: mark paxos table schema as "always sync" We want all writes to paxos table to be persisted on a storage before declared completed.	2020-01-15 12:15:42 +02:00
Gleb Natapov	e0bc4aa098	commitlog: add sync method to entry_writer If the method returns true commitlog should sync to file immediately after writing the entry and wait for flush to complete before returning.	2020-01-15 12:15:42 +02:00
Piotr Sarna	36ec43a262	Merge "add table with connected cql clients" from Juliusz This change introduces system.clients table, which provides information about CQL clients connected. PK is the client's IP address, CK consists of outgoing port number and client_type (which will be extended in future to thrift/alternator/redis). Table supplies also shard_id and username. Other columns, like connection_stage, driver_name, driver_version..., are currently empty but exist for C* compatibility and future use. This is an ordinary table (i.e. non-virtual) and it's updated upon accepting connections. This is also why C*'s column request_count was not introduced. In case of abrupt DB stop, the table should not persist, so it's being truncated on startup. Resolves #4820	2020-01-14 10:01:07 +02:00
Nadav Har'El	1511b945f8	merge: Handle multiple regular base columns in view pk Merged patch series from Piotr Sarna: "Previous assumption was that there can only be one regular base column in the view key. The assumption is still correct for tables created via CQL, but it's internally possible to create a view with multiple such columns - the new assumption is that if there are multiple columns, they share their liveness. This series is vital for indexing to work properly on alternator, so it would be best to solve the issue upstream. I strived to leave the existing semantics intact as long as only up to one regular column is part of the materialized view primary key, which is the case for Scylla's materialized views. For alternator it may not be true, but all regular columns in alternator share liveness info (since alternator does not support per-column TTL), which is sufficient to compute view updates in a consistent way. Fixes #5006 Tests: unit(dev), alternator(test_gsi_update_second_regular_base_column, tic-tac-toe demo)" Piotr Sarna (3): db,view: fix checking if partition key is empty view: handle multiple regular base columns in view pk test: add a case for multiple base regular columns in view key alternator-test/test_gsi.py \| 1 - view_info.hh \| 5 +- cql3/statements/alter_table_statement.cc \| 2 +- db/view/view.cc \| 77 ++++++++++++++---------- mutation_partition.cc \| 2 +- test/boost/cql_query_test.cc \| 58 ++++++++++++++++++ 6 files changed, 109 insertions(+), 36 deletions(-)	2020-01-14 10:01:00 +02:00
Avi Kivity	6d454d13ac	db/schema_tables: make gratuitous generic lambdas in do_merge_schema() concrete Those gratuitous lambdas make life harder for IDE users by hiding the actual types from the IDEs. Message-Id: <20200107154746.1918648-1-avi@scylladb.com>	2020-01-08 17:43:18 +01:00
Piotr Sarna	155a47cc55	view: handle multiple regular base columns in view pk Previous assumption was that there can only be one regular base column in the view key. The assumption is still correct for tables created via CQL, but it's internally possible to create a view with multiple such columns - the new assumption is that if there are multiple columns, they share their liveness. This patch is vital for indexing to work properly on alternator, so it would be best to solve the issue upstream. I strived to leave the existing semantics intact as long as only up to one regular column is part of the materialized view primary key, which is the case for Scylla's materialized views. For alternator it may not be true, but all regular columns in alternator share liveness info (since alternator does not support per-column TTL), which is sufficient to compute view updates in a consistent way. Fixes #5006 Tests: unit(dev), alternator(test_gsi_update_second_regular_base_column, tic-tac-toe demo) Message-Id: <c9dec243ce903d3a922ce077dc274f988bcf5d57.1567604945.git.sarna@scylladb.com>	2020-01-07 12:18:39 +01:00
Piotr Sarna	54315f89cd	db,view: fix checking if partition key is empty Previous implementation did not take into account that a column in a partition key might exist in a mutation, but in a DEAD state - if it's deleted. There are no regressions for CQL, while for alternator and its capability of having two regular base columns in a view key, this additional check must be performed.	2020-01-07 12:05:36 +01:00
Avi Kivity	3a3c20d337	schema_tables: de-templatize diff_table_or_view() This reduces code bloat and makes the code friendlier for IDEs, as the IDE now understands the type of create_schema. Message-Id: <20191231134803.591190-1-avi@scylladb.com>	2020-01-07 11:56:54 +01:00
Avi Kivity	8f7f56d6a0	schema_tables: make gratuitous generic lambda in create_tables_from_partitions() concrete The generic lambda made IDE searches for create_table_from_table_row() fail. Message-Id: <20191231135210.591972-1-avi@scylladb.com>	2020-01-07 11:49:10 +01:00
Avi Kivity	92fd83d3af	schema_tables: make gratuitoous generic lambda in create_table_from_name() concrete The lambda made IDE searches for read_table_mutations fail. Message-Id: <20191231135103.591741-1-avi@scylladb.com>	2020-01-07 11:48:56 +01:00
Avi Kivity	dd6dd97df9	schema_tables: make gratuitous generic lambda in merge_tables_and_views() concrete The generic lambda made IDE searches for create_table_from_mutations fail. Message-Id: <20191231135059.591681-1-avi@scylladb.com>	2020-01-07 11:48:39 +01:00
Nadav Har'El	f0d8dd4094	merge: CDC rolling upgrade Merged pull request https://github.com/scylladb/scylla/pull/5538 from Avi Kivity and Piotr Jastrzębski. This series prepares CDC for rolling upgrade. This consists of reducing the footprint of cdc, when disabled, on the schema, adding a cluster feature, and redacting the cdc column when transferring it to other nodes. The latter is needed because we'll want to backport this to 3.2, which doesn't have canonical_mutations yet.	2020-01-05 17:13:12 +02:00
Gleb Natapov	720c0aa285	commitlog: update last sync timestamp when cycle a buffer If in memory buffer has not enough space for incoming mutation it is written into a file, but the code missed updating timestamp of a last sync, so we may sync to often. Message-Id: <20200102155049.21291-9-gleb@scylladb.com>	2020-01-05 16:13:59 +02:00
Gleb Natapov	14746e4218	commitlog: drop segment gate The code that enters the gate never defers before leaving, so the gate behaves like a flag. Lets use existing flag to prohibit adding data to a closed segment. Message-Id: <20200102155049.21291-8-gleb@scylladb.com>	2020-01-05 16:13:59 +02:00
Gleb Natapov	680330ae70	commitlog: introduce segment::close() function. Currently segment closing code is spread over several functions and activated based on the _closed flag. Make segment closing explicit by moving all the code into close() function and call it where _closed flag is set. Message-Id: <20200102155049.21291-6-gleb@scylladb.com>	2020-01-05 16:13:55 +02:00
Gleb Natapov	a1ae08bb63	commitlog: remove unused segment::flush() parameter Message-Id: <20200102155049.21291-5-gleb@scylladb.com>	2020-01-05 16:13:55 +02:00
Gleb Natapov	1e15e1ef44	commitlog: cleanup segment sync() Call cycle() only once. Message-Id: <20200102155049.21291-4-gleb@scylladb.com>	2020-01-05 16:13:54 +02:00
Gleb Natapov	3d3d2c572e	commitlog: move segment shutdown code from sync() Currently sync() does two completely different things based on the shutdown parameter. Separate code into two different function. Message-Id: <20200102155049.21291-3-gleb@scylladb.com>	2020-01-05 16:13:54 +02:00
Gleb Natapov	89afb92b28	commitlog: drop superfluous this Message-Id: <20200102155049.21291-2-gleb@scylladb.com>	2020-01-05 16:13:53 +02:00
Piotr Jastrzebski	c08e6985cd	cdc: allow cluster rolling upgrade Addition of cdc column in scylla_tables changes how schema digests are calculated, and affect the ABI of schema update messages (adding a column changes other columns' indexes in frozen_mutation). To fix this, extend the schema_tables mechanism with support for the cdc column, and adjust schemas and mutations to remove that column when sending schemas during upgrade. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-01-05 14:39:23 +02:00
Piotr Jastrzebski	4639989964	cdc: Add CDC_OPTIONS schema_feature Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-01-05 14:39:23 +02:00
Avi Kivity	c150f2e5d7	schema_tables, cdc: don't store empty cdc columns in scylla_tables An empty cdc column in scylla_tables is hashed differently from a missing column. This causes schema mismatch when a schema is propagated to another node, because the other node will redact the schema column completely if the cluster feature isn't enabled, and an empty value is hashed differently from a missing value. Store a tombstone instead. Tombstones are removed before digesting, so they don't affect the outcome. This change also undoes the changes in `386221da84` ("schema_tables: handle 'cdc' options") to schema_change_test test_merging_does_not_alter_tables_which_didnt_change. That change enshrined the breakage into the test, instead of fixing the root cause, which was that we added an an extra mutation to the schema (for cdc options, which were disabled).	2020-01-05 14:36:18 +02:00
Gleb Natapov	bae5cb9f37	commitlog: remove unused argument during segment creation Since `99a5a77234` all segments are created equal and "active" argument is never true, so drop it. Message-Id: <20191231150639.GR9084@scylladb.com>	2019-12-31 17:14:03 +02:00
Gleb Natapov	60a851d3a5	commitlog: always flush segments atomically with writing db::commitlog::segment::batch_cycle() assumes that after a write for a certain position completes (as reported by _pending_ops.wait_for_pending()) it will also be flushed, but this is true only if writing and flushing are atomic wrt _pending_ops lock. It usually is unless flush_after is set to false when cycle() is called. In this case only writing is done under the lock. This is exactly what happens when a segment is closed. Flush is skipped because zero header is added after the last entry and then flushed, but this optimization breaks batch_cycle() assumption. Fix it by flushing after the write atomically even if a segment is being closed. Fixes #5496 Message-Id: <20191224115814.GA6398@scylladb.com>	2019-12-24 14:52:23 +02:00
Pavel Emelyanov	d361894b9d	batchlog_manager: Speed up token_metadata endpoints counting a bit In this place we only need to know the number of endpoints, while current code additionally shuffles them before counting. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-12-23 14:22:45 +02:00
Tomasz Grabiec	28dec80342	db/schema_tables: Add trace-level logging of schema digesting This greatly helps to narrow down the source of schema digest mismatch between nodes. Intented use is to enable this logger on disagreeing nodes and trigger schema digest recalculation and observe which mutations differ in digest and then examine their content. Message-Id: <1574872791-27634-1-git-send-email-tgrabiec@scylladb.com>	2019-12-23 12:28:22 +02:00
Pavel Emelyanov	998f51579a	storage_service: Rip join_ring config option The option in question apparently does not work, several sharded objects are start()-ed (and thus instanciated) in join_roken_ring, while instances themselves of these objects are used during init of other stuff. This leads to broken seastar local_is_initialized assertion on sys_dist_ks, but reading the code shows more examples, e.g. the auth_service is started on join, but is used for thrift and cql servers initialization. The suggestion is to remove the option instead of fixing. The is_joined logic is kept since on-start joining still can take some time and it's safer to report real status from the API. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20191203140717.14521-1-xemul@scylladb.com>	2019-12-18 12:45:13 +02:00
Nadav Har'El	aa1de5a171	merge: Synchronize snapshot and staging sstable deletion using sem Merged pull request https://github.com/scylladb/scylla/pull/5343 from Benny Halevy. Fixes #5340 Hold the sstable_deletion_sem table::move_sstables_from_subdirs to serialize access to the staging directory. It now synchronizes snapshot, compaction deletion of sstables, and view_update_generator moving of sstables from staging. Tests: unit (dev) [expect test_user_function_timestamp_return that fails for me locally, but also on master] snapshot_test.py (dev)	2019-12-17 14:06:02 +02:00
Juliusz Stasiewicz	7fdc8563bf	system_keyspace: Added infrastructure for table `system.clients' I used the following as a reference: https://github.com/apache/cassandra/blob/trunk/src/java/org/apache/cassandra/db/virtual/ClientsTable.java At this moment there is only info about IP, clients outgoing port, client 'type' (i.e. CQL/thrift/alternator), shard ID and username. Column `request_count' is NOT present and CK consists of (`port', `client_type'), contrary to what C's has: (`port'). Code that notifies `system.clients` about new connections goes to top-level files `connection_notifier.`. Currently only CQL clients are observed, but enum `client_type` can be used in future to notify about connections with other protocols.	2019-12-17 11:31:28 +01:00
Benny Halevy	4b3243f5b9	table: move_sstables_from_staging_in_thread with _sstable_deletion_sem Hold the _sstable_deletion_sem while moving sstables from the staging directory so not to move them under the feet of table::snapshot. Fixes #5340 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-12-17 12:20:20 +02:00
Benny Halevy	0446ce712a	view_update_generator::start: use variable binding Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-12-17 12:20:20 +02:00
Benny Halevy	5d7c80c148	view_update_generator::start: fix indentation Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-12-17 12:20:20 +02:00
Benny Halevy	02784f46b9	view_update_generator: handle errors when processing sstable Consumer may throw, in this case, break from the loop and retry. move_sstable_from_staging_in_thread may theoretically throw too, ignore the error in this case since the sstable was already processed, individual move failures are already ignored and moving from staging will be retried upon restart. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-12-17 12:20:20 +02:00
Benny Halevy	0d2a7111b2	view_update_generator: sstable_with_table: std::move constructor args Just a small optimization. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-12-17 12:19:55 +02:00
Avi Kivity	3b7aca3406	Merge "db: Don't create a reference to nullptr" from Rafael " Only the first patch is needed to fix the undefined behavior, but the followup ones simplify the memory management around user types. " * 'espindola/fix-5193-v2' of ssh://github.com/espindola/scylla: db: Don't use lw_shared_ptr for user_types_metadata user_types_metadata: don't implement enable_lw_shared_from_this cql3: pass a const user_types_metadata& to prepare_internal db: drop special case for top level UDTs db: simplify db::cql_type_parser::parse db: Don't create a reference to nullptr Add test for loading a schema with a non native type	2019-12-16 17:10:58 +02:00
Benny Halevy	d1bcb39e7f	hinted handoff: log message after removing hints directory (#5372 ) To be used by dtest as an indicator that endpoint's hints were drained and hints directory is removed. Refs #5354 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-12-12 01:16:19 +02:00
Rafael Ávila de Espíndola	3b61cf3f0b	db: Don't use lw_shared_ptr for user_types_metadata The user_types_metadata can simply be owned by the keyspace. This simplifies the code since we never have to worry about nulls and the ownership is now explicit. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-12-11 10:44:40 -08:00
Rafael Ávila de Espíndola	99cb8965be	db: drop special case for top level UDTs This was originally done in `7f64a6ec4b`, but that commit was reverted in reverted in `8517eecc28`. The revert was done because the original change would call parse_raw for non UDT types. Unlike the old patch, this one doesn't change the behavior of non UDT types. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-12-11 10:44:40 -08:00
Rafael Ávila de Espíndola	7ae9955c5f	db: simplify db::cql_type_parser::parse The variant of db::cql_type_parser::parse that has a user_types_metadata argument was only used from the variant that didn't. This inlines one in the other. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-12-11 10:44:40 -08:00
Rafael Ávila de Espíndola	2092e1ef6f	db: Don't create a reference to nullptr The user_types variable can be null during db startup since we have to create types before reading the system table defining user types. This avoids undefined behavior, but is unlikely that it was causing more serious problems since the variable is only used when creating user types and we don't create any until after all system tables are read, in which case the user_types variable is not null. Fixes #5193 Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-12-11 10:44:40 -08:00
Avi Kivity	63474a3380	Merge "Add `experimental_features` option" from Dejan " Add --experimental-features -- a vector of features to unlock. Make corresponding changes in the YAML parser. Fixes #5338 " * 'vecexper' of https://github.com/dekimir/scylla: config: Add `experimental_features` option utils: Add enum_option	2019-12-11 14:23:08 +02:00

1 2 3 4 5 ...

1540 Commits