scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-05 14:33:08 +00:00

Author	SHA1	Message	Date
Piotr Jastrzebski	05e0451b27	token: change _data to int64_t Previously _data was stored as array of 8 bytes in network byte order. After this change it stores the same value in int64_t in host byte order. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-02-05 09:31:32 +01:00
Piotr Jastrzebski	b569d127a0	token: change data to array<uint8_t, 8> It is save to do such change because we support only Murmur3Partitioner which uses only tokens that are 8 bytes long. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-02-05 09:30:46 +01:00
Rafael Ávila de Espíndola	5d4671526c	db: Replace large_data_handler::_stopped with _running This is not just a direct flip to a variable with the negated Boolean value. When created, a large_data_handler is not considered to be running, the user has to call start() before it can be used. The advantaged of doing this is that if initialization fails and a database is destructed before the large_data_handler is started, the assert database::stop() { assert(!_large_data_handler->running()); is not triggered. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-02-04 21:15:44 -08:00
Rafael Ávila de Espíndola	33dfe34f78	db: Move nop_large_data_handler constructor out-of-line Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-02-04 21:12:01 -08:00
Rafael Ávila de Espíndola	e99a225f25	db: Move large_data_handler::stop out-of-line Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-02-04 21:11:49 -08:00
Rafael Ávila de Espíndola	9eae0b57a3	test: Enable all experimental features in the cql_repl The cql repl will hopefully be used to write most new tests, so it should have all experimental features enabled. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200204173448.95892-1-espindola@scylladb.com>	2020-02-04 19:36:37 +02:00
Nadav Har'El	3de09042bb	CDC topology change support Merged pull request https://github.com/scylladb/scylla/pull/5485 by Kamil Braun: This series introduces the notion of CDC generations: sets of CDC streams used by the cluster to choose partition keys for CDC log writes. Each CDC generation begins operating at a specific time point, called the generation's timestamp (cdc_streams_timestamp in the code). It continues being used by all nodes in the cluster to generate log writes until superseded by a new generation. Generations are chosen so that CDC log writes are colocated with their corresponding base table writes, i.e. their partition keys (which are CDC stream identifiers picked from the generation operating at time of making the write) fall into the same vnode and shard as the corresponding base table write partition keys. Currently this is probabilistic and not 100% of log writes will be colocated - this will change in future commits, after per-table partitioners are implemented. CDC generations are a global property of the cluster -- they don't depend on any particular table's configuration. Therefore the old "CDC stream description tables", which were specific to each CDC-enabled table, were removed and replaced by a new, global description table inside the system_distributed keyspace. A new generation is introduced and supersedes the previous one whenever we insert new tokens into the token ring, which breaks the colocation property of the previous generation. The new generation is chosen to account for the new tokens and restore colocation. This happens when a new node joins the cluster. The joining node is responsible for creating and informing other nodes about the new CDC generation. It does that by serializing it and inserting into an internal distributed table ("CDC topology description table"). If it fails the insert, it fails the joining process. It then announces the generation to other nodes through gossip using the generation's timestamp, which is the partition key of the inserted distributed table entry. Nodes that learn about the new generation through gossip attempt to retrieve it from the distributed table. This might fail - for example, if the node is partitioned away from all replicas that hold this generation's table entry. In that case the node might stop accepting writes, since it knows that it should send log entries to a new generation of streams, but it doesn't know what the generation is. The node will keep trying to retrieve the data in the background until it succeeds or sees that it is no longer necessary (e.g., because yet another generation superseded this one). So we give up some availability to achieve safety. However, this solution is not completely safe (might break consistency properties): if a node learns about a new generation too late (if gossip doesn't reach this node in time), the node might send writes to the wrong (old) generation. In the future we will introduce a transaction-based approach where we will always make sure that all nodes receive the new generation before any of them starts using it (and if it's impossible e.g. due to a network partition, we will fail the bootstrap attempt). In practice, if the admin makes sure that the cluster works correctly before bootstrapping a new node, and a network partition doesn't start in the few seconds window where a new generation is announced, everything will work as it should. After the learning node retrieves the generation, it inserts it into an in-memory data structure called "CDC metadata". This structure is then used when performing writes to the CDC log -- given the timestamp of the written mutation, the data structure will return the CDC generation operating at this time point. CDC metadata might reject the query for two reasons: if the timestamp belongs to an earlier generation, which most probably doesn't have the colocation property anymore, or if it is picked too far away into the future, where we don't know if the current generation won't be superseded by a different one (so we don't yet know the set of streams that this log write should be sent to). If the client uses server-generated timestamps, the query will never be rejected. Clients can also use client-generated timestamps, but they must make sure that their clocks are not too desynchronized with the database -- otherwise some or all of their writes to CDC-enabled tables will be rejected. In the case of rolling upgrade, where we restart nodes that were previously running without CDC, we act a bit differently - there is no naturally selected joining node which must propose a new generation. We have to select such a node using other means. For this we use a bully approach: every node compares its host id with host ids of other nodes and if it finds that it has the greatest host id, it becomes responsible for creating the first generation. This change also fixes the way of choosing values of the "time" column of CDC log writes: the timeuuid is chosen in a way which preserves ordering of corresponding base table mutations (the timestamp of this timeuuid is equal to the base table mutation timestamp). Warning: if you were running a previous CDC version (without topology change support), make sure to disable CDC on all tables before performing the upgrade. This will drop the log data -- backup it if needed. TODO in future patchset: expire CDC generations. Currently, each inserted CDC generation will stay in the distributed tables forever (until manually removed by the administrator). When a generation is superseded, it should become "expired", and 24 hours after expiration, it should be removed. The distributed tables (cdc_topology_description and cdc_description) both have an "expired" column which can be used for this purpose. Unit tests: dev, debug, release dtests (dev): https://jenkins.scylladb.com/job/scylla-master/job/byo/job/byo_build_tests_dtest/907/	2020-02-04 10:20:29 +02:00
Pavel Emelyanov	abe588888d	database: Use feature service Keep local feature_service reference on database. This relaxes the circular storage_service <-> database reference, but not removes it completely. This needs some args tossing in apply_to_builder, but it's rather straightforward, so comes in the same patch. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-03 15:16:23 +03:00
Pavel Emelyanov	4f5b70dcb1	migration_manager: Use feature service This unties migration_manager from storage_service thus breaking the circular dependency between these two. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-03 15:16:23 +03:00
Pavel Emelyanov	74fd3466b5	start: Pass needed feature as argument into migrate_truncation_records As a nice side-effect this stops using global storage service instance by this function. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-03 15:16:23 +03:00
Avi Kivity	541893e69a	Merge "Fix conversion of lua nil to cql null" from Rafael " The fix itself is fairly simple, but looking at the code I found that our code base was not cleanly distinguishing null and empty values and was treating null and missing values differently, but that distinction was dead since a null is represented as a dead cell. " * 'espindola/lua-fix-null-v6' of https://github.com/espindola/scylla: lua: Handle nil returns correctly types: Return bytes_opt from data_value::serialize query-result-set: Assert that we don't have null values types: Fix comparison of empty and null data_values Revert "tests: Handle null and not present values differently" query-result-set: Avoid a copy during construction types: Move operator== for data_value out-of-line	2020-02-02 15:43:24 +02:00
Avi Kivity	c8890eb124	Merge "Simplify usage of stream subscriptions" from Rafael " In a few places, the only use we had for a subscription was calling done(). With this series we now call done() early and store the future<> instead. " * 'espindola/stream-cleanup' of https://github.com/espindola/scylla: sstable_test: Store a future<> instead of a subscription commitlog: Store a future instead of a subscription in db::commitlog::segment_manager::list_descriptors::helper lister: Store a future<> instead of a subscription	2020-02-02 14:49:00 +02:00
Rafael Ávila de Espíndola	da984f1f33	commitlog: Store a future instead of a subscription in db::commitlog::segment_manager::list_descriptors::helper The only use we had for the subscription was calling done, may as well call it early and store the future<>. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-01-30 08:31:28 -08:00
Gleb Natapov	b08679e1d3	db/system_keyspace: use user memory limits for local.paxos table Treat writes to local.paxos as user memory, as the number of writes is dependent on the amount of user data written with LWT. Fixes #5682 Message-Id: <20200130150048.GW26048@scylladb.com>	2020-01-30 17:07:27 +02:00
Eliran Sinvani	8cfc2aad57	internalize storage proxy statistics metric registration The storage proxy statistics structure did not contain a method for registering the statistics for metric groups, instead, each user had to register some of the metrics by itself. There is no real reason for separating the metrics registration from the statistics data. There is even less justification for doing this only for part of the stats as is the case for those statistics. This commit internalize the metrics registration in the storage_proxy stats structures. Signed-off-by: Eliran Sinvani <eliransin@scylladb.com>	2020-01-30 15:01:40 +01:00
Kamil Braun	bd42b10df1	cdc: rename cdc/cdc.{hh,cc} to cdc/log.{hh,cc} To increase modularity, making it easier to find what is where and maintain. The 'log' module (cdc/log.{hh,cc}) is responsible for updating CDC log tables when base table writes are performed. The 'generation' module (cdc/generation.{hh,cc}) handles stream generation changes in response to topology change events. cdc/metadata.{hh,cc} contains a helper class which holds the currently used generation of streams. It is used by both aforementioned modules: 'log' queries it, while 'generation' updates it.	2020-01-30 11:10:39 +01:00
Kamil Braun	7fa30f6f34	db: add a system.cdc_local table with CDC generation timestamp This will be used to persist CDC streams generation timestamp proposed by a joining node in case the node crashes or restarts, similarly to the way tokens are persisted. The get_saved_cdc_streams_timestamp method retrieves the generation timestamp from the system table. It will be used by a restarting node. The update_cdc_streams_timestamp method saves CDC stream generation timestamp of the calling node in the system table. A joining node will persist the timestamp before it proposes it to other nodes.	2020-01-30 11:10:08 +01:00
Piotr Jastrzebski	04fe18de0f	system_distributed_keyspace: add cdc-related tables The cdc_topology_description table will be used internally by nodes to send new CDC stream generations to other nodes. The cdc_description table is a user-facing table, used to inform users about new sets of CDC streams. Regenerate sstables and digests for schema_change_test. We don't need to protect this change by a schema feature: when a node creates these tables, it announces them to all other nodes. If schema agreement happens before this migration, all nodes will use a digest calculated without these tables. If it happens after, then all nodes will eventually know about these tables and use a digest calculated with these tables.	2020-01-30 11:10:08 +01:00
Rafael Ávila de Espíndola	e4b8f52237	commitlog: Simplify the return of read_log_file This function really just wants to signal it is done, so return a future<>. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200128172847.31513-1-espindola@scylladb.com>	2020-01-30 12:00:29 +02:00
Rafael Ávila de Espíndola	bd93a0af52	types: Return bytes_opt from data_value::serialize Since a data_value can contain a null value, returning bytes from serialize() was losing information as it was mapping null to empty. This also introduces a serialize_nonnull that still returns bytes, but results in an internal error if called with a null value. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-01-29 14:04:59 -08:00
Gleb Natapov	c654ffe34b	commitlog: fix flushing an entry marked as "sync" in periodic mode After `546556b71b` we can have mixed writes into commitlog, some do flush immediately some do not. If non flushing write races with flushing one and becomes responsible for writing back its buffer into a file flush will be skipped which will cause assert in batch_cycle() to trigger since flush position will not be advanced. Fix that by checking that flush was skipped and in this case flush explicitly our file position. Fixes #5670 Message-Id: <20200128145103.GI26048@scylladb.com>	2020-01-29 12:58:25 +02:00
Avi Kivity	ec1687e4fe	Merge "Remove deprecated partitioners #5636 " from Piotr " This PR makes named_value respect allowed_values and then use it to transition away from old deprecated RandomPartitioner and ByteOrderedPartitioner. Then it removes the code that's no longer used. We want to remove deprecated partitioners because, on one hand, they lead to performance problems and hot nodes. Moreover, we're planning to unify the token representation which would allow per table partitioner support. That, in turn, is a feature helpful in multiple efforts like CDC, materialized views, secondary indexes and multi-tenancy. tests: unit(dev) " * 'remove_deprecated_partitioners' of https://github.com/haaawk/scylla: partitioners: remove random_partitioner partitioners: Make it impossible to use RandomPartitioner partitioners: remove byte_ordered_partitioner partitioners: Make it impossible to use ByteOrderedPartitioner partitioners: Remove leftovers of OrderPreservingPartitioner i_partitioner.cc: stop including byte_ordered_partitioner.hh i_partitioner.cc: stop including random_partitioner.hh config: use allowed_values to verify named_value input config: add operator<< for seed_provider_type	2020-01-29 00:11:17 +02:00
Avi Kivity	17eaf552f0	Merge "Improve the accuracy of reader memory tracking" from Botond " Grab the lowest hanging fruits. This patch-set makes three important changes: * Consume the memory for I/O operations on tracked files, before they are forwarded to the underlying file. * Track memory consumed by buffers created for parsing in `continuous_data_consumer`. As this is the basis for the data, index and promoted index parsers, all three are covered now in this regard. * Track the index file. The remaining, not-so-low handing fruits in order of gain/cost(performance) ratio: * Track in-memory index lists. * Track in-memory promoted index blocks. * Track reader buffer memory. Note that this ordering might change based on the workload and other environmental factors. Also included in this series is an infrastructure refactoring to make tracking memory easier and involve including lighter headers, as well as a manual test designed to allow testing and experimenting with the effects of changes to the accuracy of the tracking of reader memory consumption. Refs: #4176 Refs: #2778 Tests: unit(dev), manual(sstable_scan_footprint_test) The latter was run as: build/dev/test/manual/sstable_scan_footprint_test -c1 -m2G --reads=4000 --read-concurrency=1 --logger-log-level test=trace --collect-stats --stats-period-ms=20 This will trickle reads until the semaphore blocks, then wait until the wait queue drains before sending new reads. This way we are not testing the effectiveness of the pre-admission estimation (which is terribly optimistic) and instead check that with slowly ramping up read load the semaphore will block on memory preventing OOM. This now runs to completion without a single `std::bad_alloc`. The read concurrency semaphore allows between 15-30 reads, and is always blocked on memory. " * 'more-accurate-reader-resource-tracking/v1' of ssh://github.com/denesb/scylla: test/manual/sstable_scan_footprint_test: improve memory consumption diagnostics tests/manual/sstable_scan_footprint_test: use the semaphore to determine read rate tests/manual: Add test measuring memory demand of concurrent sstable reads index_reader: make the index file tracked sstables/continuous_data_consumer: track buffers used for parsing reader_concurrency_semaphore: tracking_file_impl: consume memory speculatively reader_concurrency_semaphore: bye reader_resource_tracker treewide: replace reader_resource_tracer with reader_permit reader_permit: expose make_tracked_temporary_buffer() reader_permit: introduce make_tracked_file() reader_permit: introduce memory_units reader_concurrency_semaphore: mv reader_resources and reader_permit to reader_permit.hh reader_concurrency_semaphore: reader_permit: make it a value type reader_concurrency_semaphore: s/resources/reader_resources/ reader_concurrency_semaphore::reader_permit: move methods out-of-line	2020-01-29 00:11:17 +02:00
Gleb Natapov	8dc37277df	commitlog: remove unused variable Message-Id: <20200128132118.GH26048@scylladb.com>	2020-01-29 00:11:17 +02:00
Botond Dénes	dfc8b2fc45	treewide: replace reader_resource_tracer with reader_permit The former was never really more than a reader_permit with one additional method. Currently using it doesn't even save one from any includes. Now that readers will be using reader_permit we would have to pass down both to mutation_source. Instead get rid of reader_resource_tracker and just use reader_permit. Instead of making it a last and optional parameter that is easy to ignore, make it a first class parameter, right after schema, to signify that permits are now a prominent part of the reader API. This -- mostly mechanical -- patch essentially refactors mutation_source to ask for the reader_permit instead of reader_resource_tracking and updates all usage sites.	2020-01-28 08:13:16 +02:00
Dejan Mircevski	90b54c8c42	view_info: Drop partition_ranges() The method view_info::partition_ranges() is unused. Also drop the now-dead _partition_ranges data member. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-01-26 12:02:32 +02:00
Piotr Jastrzebski	d80ac4c2d0	partitioners: Make it impossible to use RandomPartitioner RandomPartitioner has been deprecated for 2.5 year. Now we drop the support for it. There are two reasons for this. First, this partitioner can lead to uneven distribution of partitions among the nodes in the cluster which leads to hot nodes. Second, we're planning to unify the representation of tokens and fix it as int64_t. RandomPartitioner does not comply with this. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-01-24 09:09:13 +01:00
Piotr Jastrzebski	130eb91636	partitioners: Make it impossible to use ByteOrderedPartitioner ByteOrderedPartitioner has been deprecated for 2.5 year. Now we drop the support for it. There are two reasons for this. First, this partitioner can lead to uneven distribution of partitions among the nodes in the cluster which leads to hot nodes. Second, we're planning to unify the representation of tokens and fix it as int64_t. ByteOrderPartitioner does not comply with this. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-01-24 09:09:13 +01:00
Piotr Jastrzebski	4088be2056	partitioners: Remove leftovers of OrderPreservingPartitioner OrderPreservingPartitioner seems to be long gone and not supported so remove all the places it's still mentioned. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-01-24 09:09:13 +01:00
Piotr Jastrzebski	6a2cd64b5c	config: use allowed_values to verify named_value input Even though we configure the set of accepted values for some config flags, named_value ignore them. This patch implements the checks that verify flag is not set to the value that's not on the list. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-01-24 09:08:59 +01:00
Piotr Jastrzebski	df1b7d2805	config: add operator<< for seed_provider_type Following patch will start checking allowed_values in named_value and print errors for wrong values. This will require all the types used with named_value to have operator<< implemented. seed_provider_type is one such type. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-01-23 10:28:58 +01:00
Piotr Sarna	9b379e3d63	db,view: fix checking for secondary index special columns A mistake in handling legacy checks for special 'idx_token' column resulted in not recognizing materialized views backing secondary indexes properly. The mistake is really a typo, but with bad consequences - instead of checking the view schema for being an index, we asked for the base schema, which is definitely not an index of itself. Branches 3.1,3.2 (asap) Fixes #5621 Fixes #4744	2020-01-21 22:32:04 +02:00
Rafael Ávila de Espíndola	27bd3fe203	service: Add a lock around migration_notifier::_listeners Before this patch the iterations over migration_notifier::_listeners could race with listeners being added and removed. The addition side is not modified, since it is common to add a listener during construction and it would require a fairly big refactoring. Instead, the iteration is modified to use indexes instead of iterators so that it is still valid if another listener is added concurrently. For removal we use a rw lock, since removing an element invalidates indexes too. There are only a few places that needed refactoring to handle unregister_listener returning a future<>, so this is probably OK. Fixes #5541. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200120192819.136305-1-espindola@scylladb.com>	2020-01-20 22:14:02 +02:00
Asias He	343986a70b	gossiper: Introduce gossip STATUS_UNKNOWN When a node does not have gossip STATUS application_state, we currently use an empty string to present such state in get_gossip_status. It is better to use an explicit "UNKNOWN" to present it. It makes the log easier to understand when the status is unknown. Before: 'gossip - InetAddress n2 is now UP, status =' After: 'gossip - InetAddress n2 is now UP, status = UNKNOWN' This patch is safe because the STATUS_UNKNOWN is never sent over the cluster. So the presentation is only internal to the node. Fixes #5520	2020-01-20 10:59:14 +02:00
Tomasz Grabiec	36d90e637e	Merge "Relax migration manager dependencies" from Pavel Emalyanov The set make dependencies between mm and other services cleaner, in particular, after the set: - the query processor no longer needs migration manager (which doesn't need query processor either) - the database no longer needs migration manager, thus the mutual dependency between these two is dropped, only migration manager -> database is left - the migration manager -> storage_service dependency is relaxed, one more patchset will be needed to remove it, thus dropping one more mutual dependency between them, only the storage_service -> migration manager will be left - the migration manager is stopped on drain, but several more services need it on stop, thus causing use after free problems, in particular there's a caught bug when view builder crashes when unregistering from notifier list on stop. Fixed. Tests: unit(dev) Fixes: #5404	2020-01-16 12:12:25 +01:00
Piotr Jastrzębski	0c8c1ec014	config: fix description of enable_deprecated_partitioners Murmur3 is the default partitioner. ByteOrder and Random are the deprecated ones and should be mentioned in the description. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-01-16 12:05:50 +02:00
Pavel Emelyanov	5cf365d7e7	database: Explicitly pass migration_manager through init_non_system_keyspace This is the last place where database code needs the migration_manager instance to be alive, so now the mutual dependency between these two is gone, only the migration_manager needs the database, but not the vice-versa. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:29:21 +03:00
Pavel Emelyanov	28f1250b8b	view_builder: Use migration notifier The migration manager itself is still needed on start to wait for schema agreement, but there's no longer the need for the life-time reference on it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:28:21 +03:00
Pavel Emelyanov	e327feb77f	database: Prepare to use on-database migration_notifier Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:28:21 +03:00
Pavel Emelyanov	f240d5760c	migration_manager: Split notifier from main class The _listeners list on migration_manager class and the corresponding notify_xxx helpers have nothing to do with the its instances, they are just transport for notification delivery. At the same time some services need the migration manager to be alive at their stop time to unregister from it, while the manager itself may need them for its needs. The proposal is to move the migration notifier into a complete separate sharded "service". This service doesn't need anything, so it's started first and stopped last. While it's not effectively a "migration" notifier, we inherited the name from Cassandra and renaming it will "scramble neurons in the old-timers' brains but will make it easier for newcomers" as Avi says. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:28:19 +03:00
Gleb Natapov	51672e5990	paxos: immediately sync commitlog entries for writes made by paxos learn stage	2020-01-15 12:15:42 +02:00
Gleb Natapov	0fc48515d8	paxos: mark paxos table schema as "always sync" We want all writes to paxos table to be persisted on a storage before declared completed.	2020-01-15 12:15:42 +02:00
Gleb Natapov	e0bc4aa098	commitlog: add sync method to entry_writer If the method returns true commitlog should sync to file immediately after writing the entry and wait for flush to complete before returning.	2020-01-15 12:15:42 +02:00
Piotr Sarna	36ec43a262	Merge "add table with connected cql clients" from Juliusz This change introduces system.clients table, which provides information about CQL clients connected. PK is the client's IP address, CK consists of outgoing port number and client_type (which will be extended in future to thrift/alternator/redis). Table supplies also shard_id and username. Other columns, like connection_stage, driver_name, driver_version..., are currently empty but exist for C* compatibility and future use. This is an ordinary table (i.e. non-virtual) and it's updated upon accepting connections. This is also why C*'s column request_count was not introduced. In case of abrupt DB stop, the table should not persist, so it's being truncated on startup. Resolves #4820	2020-01-14 10:01:07 +02:00
Nadav Har'El	1511b945f8	merge: Handle multiple regular base columns in view pk Merged patch series from Piotr Sarna: "Previous assumption was that there can only be one regular base column in the view key. The assumption is still correct for tables created via CQL, but it's internally possible to create a view with multiple such columns - the new assumption is that if there are multiple columns, they share their liveness. This series is vital for indexing to work properly on alternator, so it would be best to solve the issue upstream. I strived to leave the existing semantics intact as long as only up to one regular column is part of the materialized view primary key, which is the case for Scylla's materialized views. For alternator it may not be true, but all regular columns in alternator share liveness info (since alternator does not support per-column TTL), which is sufficient to compute view updates in a consistent way. Fixes #5006 Tests: unit(dev), alternator(test_gsi_update_second_regular_base_column, tic-tac-toe demo)" Piotr Sarna (3): db,view: fix checking if partition key is empty view: handle multiple regular base columns in view pk test: add a case for multiple base regular columns in view key alternator-test/test_gsi.py \| 1 - view_info.hh \| 5 +- cql3/statements/alter_table_statement.cc \| 2 +- db/view/view.cc \| 77 ++++++++++++++---------- mutation_partition.cc \| 2 +- test/boost/cql_query_test.cc \| 58 ++++++++++++++++++ 6 files changed, 109 insertions(+), 36 deletions(-)	2020-01-14 10:01:00 +02:00
Avi Kivity	6d454d13ac	db/schema_tables: make gratuitous generic lambdas in do_merge_schema() concrete Those gratuitous lambdas make life harder for IDE users by hiding the actual types from the IDEs. Message-Id: <20200107154746.1918648-1-avi@scylladb.com>	2020-01-08 17:43:18 +01:00
Piotr Sarna	155a47cc55	view: handle multiple regular base columns in view pk Previous assumption was that there can only be one regular base column in the view key. The assumption is still correct for tables created via CQL, but it's internally possible to create a view with multiple such columns - the new assumption is that if there are multiple columns, they share their liveness. This patch is vital for indexing to work properly on alternator, so it would be best to solve the issue upstream. I strived to leave the existing semantics intact as long as only up to one regular column is part of the materialized view primary key, which is the case for Scylla's materialized views. For alternator it may not be true, but all regular columns in alternator share liveness info (since alternator does not support per-column TTL), which is sufficient to compute view updates in a consistent way. Fixes #5006 Tests: unit(dev), alternator(test_gsi_update_second_regular_base_column, tic-tac-toe demo) Message-Id: <c9dec243ce903d3a922ce077dc274f988bcf5d57.1567604945.git.sarna@scylladb.com>	2020-01-07 12:18:39 +01:00
Piotr Sarna	54315f89cd	db,view: fix checking if partition key is empty Previous implementation did not take into account that a column in a partition key might exist in a mutation, but in a DEAD state - if it's deleted. There are no regressions for CQL, while for alternator and its capability of having two regular base columns in a view key, this additional check must be performed.	2020-01-07 12:05:36 +01:00
Avi Kivity	3a3c20d337	schema_tables: de-templatize diff_table_or_view() This reduces code bloat and makes the code friendlier for IDEs, as the IDE now understands the type of create_schema. Message-Id: <20191231134803.591190-1-avi@scylladb.com>	2020-01-07 11:56:54 +01:00
Avi Kivity	8f7f56d6a0	schema_tables: make gratuitous generic lambda in create_tables_from_partitions() concrete The generic lambda made IDE searches for create_table_from_table_row() fail. Message-Id: <20191231135210.591972-1-avi@scylladb.com>	2020-01-07 11:49:10 +01:00

... 67 68 69 70 71 ...

4972 Commits