"Our domain objects have schema version dependent format, for efficiency
reasons. The data structures which map between columns and values rely on
column ids, which are consecutive integers. For example, we store cells in a
vector where index into the vector is an implicit column id identifying table
column of the cell. When columns are added or removed the column ids may
shift. So, to access mutations or query results one needs to know the version
of the schema corresponding to it.
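For illustration, a minimal sketch of that layout (the type names below are
illustrative stand-ins, not the actual row representation):

    #include <cstdint>
    #include <optional>
    #include <string>
    #include <vector>

    using column_id = std::uint32_t;                // consecutive, per schema version
    using cell_value = std::optional<std::string>;  // placeholder for a real cell

    struct row_cells {
        // The vector index is the implicit column id; adding or dropping a
        // column in the schema may shift which table column an index means.
        std::vector<cell_value> cells;

        const cell_value& cell_at(column_id id) const { return cells.at(id); }
    };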
In the case of query results, the schema version to which they conform will
always be the version which was used to construct the query request. So there
is no change in the way query result consumers operate to handle schema
changes. The querying interfaces needed to be extended to accept a schema
version and do the conversions if necessary.
Shard-local interfaces work with a full definition of the schema version,
represented by the schema type (usually passed as schema_ptr). Schema versions
are identified across shards and nodes with a UUID (the table_schema_version
type). We maintain a schema version registry (schema_registry) to avoid
fetching definitions we already know about. When we get a request using an
unknown schema, we need to fetch the definition from the source, which must
know it, to obtain a shard-local schema_ptr for it.
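A rough sketch of that lookup flow (hypothetical names, synchronous and
simplified; the real registry is asynchronous and works with serialized
schema definitions):

    #include <functional>
    #include <map>
    #include <memory>
    #include <string>

    struct schema {};                                  // stand-in for the full definition
    using schema_ptr = std::shared_ptr<const schema>;
    using table_schema_version = std::string;          // a UUID in the real code

    class schema_registry {
        std::map<table_schema_version, schema_ptr> _known;
    public:
        // fetch_from_source stands in for asking the node which sent us
        // the request; that node must know the definition.
        schema_ptr get_or_fetch(const table_schema_version& v,
                                std::function<schema_ptr()> fetch_from_source) {
            auto it = _known.find(v);
            if (it != _known.end()) {
                return it->second;          // definition already known locally
            }
            auto s = fetch_from_source();
            _known.emplace(v, s);
            return s;
        }
    };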
Because the mutation representation is schema version dependent, mutations of
different versions don't necessarily commute. When a column is dropped from
the schema, the dropped column is no longer representable in the new schema.
It is generally fine not to hold data for dropped columns; the intent behind
dropping a column is to lose the data in that column. However, when merging an
incoming mutation with an existing mutation, both of which have different
schema versions, we'd have to choose which schema should be considered
"latest" in order not to lose data. Schema changes can be made concurrently
in the cluster and initiated on different nodes, so there is not always a
single notion of the latest schema. However, schema changes are commutative,
and by merging changes nodes eventually agree on the version. For example,
adding column A (version X) on one node and adding column B (version Y) on
another eventually results in a schema version with both A and B (version Z).
We cannot tell which of versions X and Y is newer, but we can tell that
version Z is newer than both X and Y. So the solution to the problem of
merging conflicting mutations could be to ensure that such a merge is
performed using a schema which is superior to the schemas of both mutations.
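Conceptually (a toy model only, not Scylla's mutation code), the merge rule
is: upgrade both sides to the superior schema, then merge them in a single
representation. The cell merge below is a simplistic last-write-wins stand-in
for the real reconciliation rules:

    #include <cstddef>
    #include <optional>
    #include <string>
    #include <vector>

    using toy_schema = std::vector<std::string>;                // ordered column names
    using toy_cells = std::vector<std::optional<std::string>>;  // indexed by column id

    // Re-map cells from one schema's column order to another's.
    toy_cells upgrade(const toy_cells& cells, const toy_schema& from, const toy_schema& to) {
        toy_cells out(to.size());
        for (std::size_t i = 0; i < to.size(); ++i) {
            for (std::size_t j = 0; j < from.size() && j < cells.size(); ++j) {
                if (from[j] == to[i]) {
                    out[i] = cells[j];      // same column, possibly a new index
                }
            }
        }
        return out;
    }

    toy_cells merge(toy_cells a, const toy_schema& sa,
                    toy_cells b, const toy_schema& sb,
                    const toy_schema& superior) {
        a = upgrade(a, sa, superior);       // superior has all columns of sa and sb
        b = upgrade(b, sb, superior);
        for (std::size_t i = 0; i < a.size(); ++i) {
            if (b[i]) {
                a[i] = b[i];
            }
        }
        return a;
    }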
The approach taken in the series for ensuring this is as follows. When a node
receives a mutation of an unknown schema version, it first performs a schema
merge with the source of that mutation. The schema merge makes sure that the
current node's version is superior to the schema of the incoming mutation.
Once a version is synced with, it is remembered as such and won't be synced
with again on later mutations. Because of this bookkeeping, schema versions
must be monotonic; we don't want altering a table to ever result in an
earlier version, because that would cause nodes to avoid syncing with it. The
version is a cryptographically-secure hash of the schema mutations, which
should fulfill this purpose in practice.
TODO: It's possible that the node is already performing a sync triggered by
broadcasted schema mutations. To avoid triggering a second sync needlessly, the
schema merging should mark incoming versions as being synced with.
Each table shard keeps track of its current schema version, which is
considered superior to the versions of all mutations that are going to be
applied to it. All data sources for a given column family within a shard have
the same notion of the current schema version. Individual entries in the
cache and memtables may be at earlier versions, but this is hidden behind the
interface: the entries are upgraded to the current version lazily on access.
Sstables are immutable, so they don't need to track the current version.
Like any other data source, they can be queried with any schema version.
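Schematically, the lazy upgrade on access could look like this (an
illustrative sketch; these are not the actual memtable/cache entry types):

    #include <memory>

    struct schema {};                                  // stand-in for the full definition
    using schema_ptr = std::shared_ptr<const schema>;

    struct partition_data {
        // Stand-in: re-maps cells from the old column ids to the new ones.
        void upgrade(const schema& from, const schema& to) { (void)from; (void)to; }
    };

    struct cache_entry {
        schema_ptr _schema;      // version this entry is currently stored in
        partition_data _data;    // data in that version's representation

        // Called on access; a no-op when the entry is already at the
        // current version, so unread entries are never touched.
        void maybe_upgrade(const schema_ptr& current) {
            if (_schema != current) {
                _data.upgrade(*_schema, *current);
                _schema = current;
            }
        }
    };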
Note, the series triggered a bug in the demangler:
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=68700
The intent is to make data returned by queries always conform to a
single schema version, which is requested by the client. For CQL
queries, for example, we want to use the same schema which was used to
compile the query. The other node expects to receive data conforming
to the requested schema.
The shard-level interface accepts schema_ptr; across nodes we use the
table_schema_version UUID. To transfer a schema_ptr across shards, we use
global_schema_ptr.
Because a schema is identified by a UUID across nodes, requestors must be
prepared to be queried for the definition of that schema. They must hold a
live schema_ptr for the duration of the request. This guarantees that the
schema_registry will always know about the requested version. This is not an
issue, because for queries the requestor needs to hold on to the schema
anyway to be able to interpret the results. But care must be taken to always
use the same schema version for making the request and parsing the results.
Schema requesting across nodes is currently stubbed out (it throws a runtime
exception).
Schema is tracked per-entry in memtables and the cache. Entries are upgraded
lazily on access. Incoming mutations are upgraded to the table's current
schema on the given shard.
Mutating nodes need to keep the schema_ptr alive in case the schema version
is requested by the target node.
This patch fixes a regression introduced by
commit ca935bf ("tests: Fix gossip_test").
The database service initializes a replication_strategy
object, and a replication_strategy requires a snitch
service to be initialized.
A snitch service requires a broadcast address to be
set.
If any of the above is not initialized, we are going
to hit the corresponding assert().
Set the snitch to SimpleSnitch and the broadcast
address to 127.0.0.1.
Fixes issue #770
Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>
Message-Id: <1452421748-9605-1-git-send-email-vladz@cloudius-systems.com>
The goal is to provide various test cases with a way of iterating over
many combinations of mutations. It's good to have this in one place, to
avoid duplication and to increase coverage.
do_compact() wasn't removing an empty row that is covered by a
tombstone. As a result, an empty partition could be written to an
sstable. To solve this problem, let's make trim_rows remove a
row that is considered to be empty. A row is empty if it has no
tombstone, no marker and no cells.
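The emptiness condition can be illustrated roughly like this (illustrative
stand-in types, not the actual row or compaction code):

    #include <optional>
    #include <string>
    #include <vector>

    struct row_sketch {
        std::optional<long> tombstone;    // row tombstone, if any
        std::optional<long> marker;       // row marker, if any
        std::vector<std::string> cells;   // cells, if any

        // A row is empty if it has no tombstone, no marker and no cells;
        // trim_rows can then drop it, so an empty partition is never
        // written to an sstable.
        bool empty() const {
            return !tombstone && !marker && cells.empty();
        }
    };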
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
We have an API that wraps open_file_dma which we use in some places, but in
many other places we call the reactor version directly.
This patch changes the latter to match the former. It will have the added
benefit of making it easier to change these interfaces later if needed.
Signed-off-by: Glauber Costa <glauber@scylladb.com>
Message-Id: <29296e4ec6f5e84361992028fe3f27adc569f139.1451950408.git.glauber@scylladb.com>
compatible: can be cast, keeps sort order
value-compatible: can be cast, may change sort order
frozen: values participate in sort order
unfrozen: only sort keys participate in sort order
Fixes #740.
Current service initialization is a total mess in cql_test_env. Start
the services in the same order as in main.cc.
Fixes #715, #716
'./test.py --mode release' passes.
Use steady_clock instead of high_resolution_clock where a monotonic
clock is required. high_resolution_clock is essentially a
system_clock (Wall Clock) and therefore must not be assumed to be
monotonic, since the Wall Clock may move backwards due to time/date
adjustments.
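For example, measuring elapsed time with a monotonic clock (a minimal
standard-library sketch):

    #include <chrono>

    // steady_clock is monotonic: it is unaffected by wall-clock (time/date)
    // adjustments, so elapsed-time measurements can never go backwards.
    template <typename Func>
    std::chrono::steady_clock::duration time_it(Func&& fn) {
        auto start = std::chrono::steady_clock::now();
        fn();
        return std::chrono::steady_clock::now() - start;
    }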
Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>
The midpoint() algorithm to find a token between two tokens doesn't
work correctly in the case of wraparound. The code tried to handle this
case, but did it wrong. So this patch fixes the midpoint() algorithm,
and adds clearer comments about why the fixed algorithm is correct.
This patch also modifies two midpoint() tests in partitioner_test,
which were incorrect - they verified that midpoint() returns some expected
values, but the expected values were wrong!
We also add to the test a more fundamental test of midpoint() correctness,
which doesn't check the midpoint against a known value (which is easy to
get wrong, as indeed happened); rather, we simply check that the midpoint
is really inside the range (according to the token ordering operator).
This simple test failed with the old implementation of midpoint() and
passes with the new one.
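A property-style check along those lines (a sketch using plain 64-bit
values as hypothetical tokens; the real test uses the partitioner's token
type and ordering operator):

    #include <cassert>
    #include <cstdint>

    // Hypothetical stand-in: tokens as signed 64-bit values, where
    // start > end means the range wraps around the token ring.
    static bool in_range(std::int64_t start, std::int64_t end, std::int64_t t) {
        return start <= end ? (start <= t && t <= end)
                            : (t >= start || t <= end);   // wraparound case
    }

    static void check_midpoint(std::int64_t start, std::int64_t end, std::int64_t mid) {
        // Don't compare against a precomputed "expected" value (easy to get
        // wrong); just require the midpoint to lie inside the range.
        assert(in_range(start, end, mid));
    }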
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
This patch introduces a test for reading keys from a single sstable with
the range beginning and end being keys present in the index summary.
Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>
The underlying data source for the cache should not be the same memtable
from which the cache is later updated. This fixes the following
assertion failure:
row_cache_test_g: utils/logalloc.hh:289: decltype(auto) logalloc::allocating_section::operator()(logalloc::region&, Func&&) [with Func = memtable::make_reader(schema_ptr, const partition_range&)::<lambda()>]: Assertion `r.reclaiming_enabled()' failed.
The problem is that when a memtable is merged into the cache their regions
are also merged, so locking the cache's region locks the memtable's region
as well.
The test was failing because _qp (distributed<cql3::query_processor>) was
stopped before _db (distributed<database>).
The compaction manager is a member of the database, and when the database is
stopped, the compaction manager is also stopped. After a2fb0ec9a, compaction
updates the compaction history system table, and that requires a working
query context.
We cannot simply move _qp->stop() to after _db->stop(), because the former
relies on migration_manager and storage_proxy. So the most obvious fix is to
clear the global variable that stores the query context after _qp has been
stopped.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Replace db_clock::now_in_usec() and db_clock::now() * 1000 accesses,
where the intent is to create a new auto-generated cell timestamp, with
a call to new_timestamp(). Now the knowledge of how to create timestamps
is in a single place.
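Schematically, such a helper centralizes something like this (an
illustrative sketch of the idea, not necessarily the actual
implementation):

    #include <chrono>
    #include <cstdint>

    using api_timestamp_type = std::int64_t;

    // Sketch: an auto-generated cell timestamp is microseconds since the
    // epoch, produced in exactly one place instead of via ad-hoc db_clock
    // arithmetic scattered around the code.
    api_timestamp_type new_timestamp() {
        using namespace std::chrono;
        return duration_cast<microseconds>(
                system_clock::now().time_since_epoch()).count();
    }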