scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-28 10:41:12 +00:00

Author	SHA1	Message	Date
Piotr Sarna	c35160457b	Merge 'Clean up stream_id representation' from Piotr With #5950 we changed the representation of stream_id in CDC Log from two int columns to a single blob column. This PR cleans up stream_id representation internally. Now stream_id is stored as blob both in-memory and in internal CDC tables. Tests: unit(dev) * hawk/stream_id_representation: cdc: store stream_ids as blobs in internal tables cdc: improve do_update_streams_description cdc: Fix generate_topology_description cdc: add stream_id::operator< cdc: change stream_id representation	2020-03-05 14:14:29 +01:00
Kamil Braun	3200d415da	cdc: use a single timeuuid value for a batch of changes If a batch update is performed with a sequence of changes with a single timestamp, they will now show up in CDC with a single timeuuid in the `time` column, distinguished by different `batch_seq_no` values. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-05 12:32:57 +01:00
Kamil Braun	292eba9da0	cdc: replace `split` with `for_each_change` `for_each_change` is like `split` but it doesn't return a vector of mutations representing each change; instead, it takes as a parameter a function which gets called on each mutation. This reduced the memory usage and allows to preserve common context when handling each change (will be useful in next commits). Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-05 12:05:08 +01:00
Piotr Jastrzebski	57cfe6d0e1	cdc: store stream_ids as blobs in internal tables In new CDC Log format stream_id is represented by a single blob column so it makes sense to store it in the same form everywhere - including internal CDC tables. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-05 11:31:22 +01:00
Piotr Jastrzebski	b2acdc9307	cdc: improve do_update_streams_description Use std::set::insert that takes range instead of looping through elements and adding them one by one. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-05 11:31:22 +01:00
Piotr Jastrzebski	446722d6ed	cdc: Fix generate_topology_description In new CDC Log format we store only a single stream_id column. This means generate_topology_description has to use appropriate schema for generating tokens for stream_ids. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-05 11:31:22 +01:00
Piotr Jastrzebski	9a212dcaef	cdc: add stream_id::operator< Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-05 11:31:21 +01:00
Piotr Jastrzebski	f317a659d9	cdc: change stream_id representation New CDC Log format stores stream ids as blobs. It makes sense to keep them internally in the same form. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-05 11:30:10 +01:00
Piotr Sarna	f21bd57058	Merge "cdc: log static rows correctly" from Piotr Currently, writes to a static row in a base table are not reflected at all in the corresponding cdc log. This patch causes such writes to be properly logged. Fixes: #5744 Tests: unit(dev) * piodul/5744-handle-static-row-correctly-in-cdc: cdc_test: add tests for handling static row cdc: fix indentation in transformer::transform cdc: handle static rows separately in transformer::transform cdc: move process_cells higher (and fix captured variables) cdc: reduce dependencies on captured variables in process_cells cdc: fix preimage query for static rows	2020-03-05 10:42:15 +01:00
Piotr Dulikowski	204e204586	cdc: do not attempt to log empty mutations It is possible to produce an empty mutation using CQL. For example, the following query: DELETE FROM ks.tbl WHERE pk = 0 AND ck < 1 AND ck > 2; will attempt to delete from an empty range of rows. This is translated to the following mutation: {ks.tbl {key: pk{000400000000}, token:-3485513579396041028} {mutation_partition: static: cont=1 {row: }, clustered: {}}} Such mutation does not contain any timestamp, therefore it is difficult to determine what timestamp was used while making the query. This is problematic for CDC, because an entry in CDC log should be written with the same timestamp as a part of the mutation. Because an empty mutation does not modify the table in any way, we can safely skip logging such mutations in CDC and still preserve the ability to reconstruct the current state of the base table from full CDC log. Tests: unit(dev)	2020-03-05 08:32:54 +01:00
Piotr Dulikowski	39519ce923	cdc: fix indentation in transformer::transform	2020-03-05 00:16:17 +01:00
Piotr Dulikowski	0d05b17881	cdc: handle static rows separately in transformer::transform Before this patch, `transform` did not generate any log rows about static row change. This commit fixes that - now, a log row is created if a static row is changed, and this row is separate from the rows that describe changes to the clustering rows.	2020-03-05 00:16:17 +01:00
Piotr Dulikowski	6a0b0b5786	cdc: move process_cells higher (and fix captured variables) The `process_cells` lambda is moved outside the loop, because it will be used by other code in subsequent commits.	2020-03-05 00:15:57 +01:00
Piotr Dulikowski	f136f6e02c	cdc: reduce dependencies on captured variables in process_cells This is a preparation for moving the lambda outside the for loop. - `log_ck`, `pikey`, `pirow` are now passed as arguments, - `value` is now a variable local to the lambda, - `ttl` is now a variable local to the lambda that is returned.	2020-03-05 00:14:05 +01:00
Piotr Dulikowski	a7f51449c3	cdc: fix preimage query for static rows For static rows, we need to fetch at least one row from its partition in order to compute its preimage.	2020-03-04 18:43:55 +01:00
Piotr Jastrzebski	354e3c34c8	cdc log: merge stream_id columns into a single column Previously we had stream_id_1 and stream_id_2 columns of type long each. They were forming a partition key. In a new format we want a single stream_id column that forms a partition key. To be able to still store two longs, the new column will have type blob and its value will be concatenated bytes of two longs that partition key is composed of. We still want partition key to logically be two longs because those two values will be used by a custom partitioner later once we implement it. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-03-04 13:27:48 +02:00
Kamil Braun	5c4a237c12	cdc: split the mutation before passing it into `transform` If the mutation contains separate logical changes (e.g. with different timestamps and/or ttls), it will be split into multiple mutations, each passed into transform.	2020-03-03 13:17:51 +01:00
Kamil Braun	9924e3aa34	cdc: reduce code duplication in augment_mutation_call Now there's only one call to `transform`.	2020-03-03 13:17:51 +01:00
Kamil Braun	24a32a13b5	cdc: retrieve preimage anytime there are static/clustered row updates Previously we wouldn't retrieve the preimage if the mutation contained something different than static/clustered row updates, e.g. if it contained a partition deletion. However, there are mutations created from batch statements which can contain both a partition deletion and a set of row updates with a later timestamp. We want to retrieve the preimage too in this case.	2020-03-03 13:17:51 +01:00
Kamil Braun	529d30ef66	cdc: add `split` function This function takes a mutation and returns a set of mutations, each representing a separate change with a single timestamp and ttl.	2020-03-03 13:17:51 +01:00
Kamil Braun	132ea89c32	cdc: add `extract_changes` function This commit introduces a bunch of new structs describing a change made to a table, and an `extract_changes` function which takes a mutation and returns the set of changes contained in this mutation, separated by timestamp and ttl.	2020-03-03 13:17:51 +01:00
Kamil Braun	b5c944370e	cdc: add `should_split` function The function checks if there are multiple timestamps and/or ttls inside a mutation, which means separate changes should be created for this mutation in CDC.	2020-03-03 13:17:50 +01:00
Calle Wilund	ed0d1c5fe2	cdc: Break up data column tuple According to "new" spec: Data column is now pure frozen original type. If column is deleted (set to null), a metadata column cdc$deleted_<name> is set to true, to distinguish null column == not involved in row operation For non-atomic collections, a cdc$deleted_elements_<name> column is added, and when removing elements from collection this is where they are shown. For non-atomic assign, the "cdc$deleted_<name>" is true, and <name> is set to new value. column_op removed.	2020-03-03 08:52:20 +00:00
Calle Wilund	1085860c62	cdc: Rename metadata and data columns according to new spec Also use transformation methods for names in all code + tests to make switching again easier	2020-03-02 09:34:51 +00:00
Juliusz Stasiewicz	cf24ae86f3	cdc: distinguishing update from insert When incoming mutation contains live row marker the `operation` is described as "insert", not as an "update". Also, I extended the test case "test_row_delete" with one insert, which is expected to log different value of `operation` than update or delete. Renamed the test case accordingly. Test cases that relied on "update" being the same as "insert" are updated accordingly (`test_pre_image_logging`, `test_cdc_across_shards`, `test_add_columns`). Fixes #5723	2020-03-01 17:50:08 +02:00
Juliusz Stasiewicz	836183b847	cdc: fix `operation` value for row deletes Column `operation` now contains `operation::row_delete` (== 2) after queries like `delete from tbl where pk=x AND ck=y;`. Before this patch row deletes were treated as updates, which was incorrect because updates do not contain row tombstones (and row deletes do). Refs #5709	2020-02-26 11:58:50 +01:00
Calle Wilund	a3a764fd10	cdc: Handle non-atomic columns Fixes #5669 This implements non-atomic collection and UDT handling for both cdc preimage + delta. To be able to express deltas in a meaningful way (and reconstruct using it), non-atomic values are represented somewhat differently from regular values: * maps - stored as is (frozen) * sets - stored as is (frozen) * lists - stored as map<timeuuid, value> (frozen) this allows reconstructing the list, as otherwise things like list[0] = value cannot be represented in a meaningful way * udt - stored as tuple<tuple<field0>, tuple<field1>...> (frozen) UDTs are normally just tuples + metadata, but we need to distinguish the case of outer tuple element == null, meaning "no info/does not partake in mutation" from tuple element being a tuple(null) (i.e. empty tuple), meaning "set field to null"	2020-02-25 19:34:54 +02:00
Piotr Dulikowski	82a2bdf39f	cdc: distinguish open and closed ranges for range delete This patch causes inclusive and exclusive range deletes to be distinguished in cdc log. Previously, operations `range_delete_start` and `range_delete_end` were used for both inclusive and exclusive bounds in range deletes. Now, old operations were renamed to `range_delete__inclusive`, and for exclusive deletes, new operations `range_delete__exclusive` are used. Tests: unit(dev)	2020-02-20 11:39:06 +01:00
Piotr Jastrzebski	f0f6e220ea	cdc: stop using partitioners CDC can get all it needs from a config and does not need partitioner. For base table specific operations CDC is using partitioner from that table (obtained with schema::get_partitioner). Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-02-17 10:59:15 +01:00
Piotr Jastrzebski	2d7532f87f	dht: add dht::get_token and replace all calls to dht::global_partitioner().get_token dht::get_token is better because it takes schema and uses it to obtain partitioner instead of using a global partitioner. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-02-17 10:59:15 +01:00
Piotr Sarna	e620181832	Merge 'cdc: TTLs on CDC log cells' from Juliusz Cells in CDC logs used to be created while completely neglecting TTLs (the TTLs from cdc = {...'ttl':600}). This patch adds TTLs to all cells; there are no row markers, so wee need not set TTL there. Fixes #5688 * jul-stas/5688-set-ttl-in-cdc-log-table: tests/cdc: added test for TTL on log table cells cdc: set TTLs on CDC log cells	2020-02-16 11:22:30 +02:00
Pavel Emelyanov	b11cf6e950	cql3/query_processor.hh: Debloat from other headers This gives ~30% less (251 jobs -> 181 jobs) recompile when touching it Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20200212225828.3374-1-xemul@scylladb.com>	2020-02-16 11:22:30 +02:00
Nadav Har'El	9fad494572	merge: Reduce #include bloat around cql3 internals from non-cql3 users Merged pull request https://github.com/scylladb/scylla/pull/5755 from Avi Kivity: This series removes some #include dependencies around cql3. It results in 30k line (6.6%) reduction in the preprocessed size of database.i, mainly due to elimination of boost::regex (which was brought in in turn by like_matcher). This should result in fewer and faster recompiles. commits: tracing: remove #include of modification_statement.hh from table_helper cql3: selection: remove now-unneeded include of statement_restrictions.hh cql3: deinline result_set_builder::restrictions_filter constructor view_info: remove include of select_statement.hh cql3: selection: remove unnecessary include of selector_factories cql3: query_processor: reduce #includes	2020-02-11 15:58:29 +02:00
Juliusz Stasiewicz	67b92c584f	cdc: set TTLs on CDC log cells Cells in CDC logs used to be created while completely neglecting TTLs (the TTLs from `cdc = {...'ttl':600}`). This patch adds TTLs to all cells; there are no row markers, so wee need not set TTL there. Fixes #5688	2020-02-11 12:56:41 +01:00
Piotr Sarna	b977aa034b	Merge 'cdc: disallow negative TTL values in CDC options' from Juliusz Setting TTL = -1 in cdc_options prevents any writes to CDC log. But enabling CDC and having unwritable log table makes no sense. Notably, normal writes USING TTL -1 are forbidden. This patch does the same to TTLs in CDC options. Fixes #5747 * jul-stas/5747-cdc-disallow-negative-ttl: tests/cdc: added test for exception when TTL < 0 cdc: disallow negative TTL values in CDC	2020-02-11 09:23:56 +01:00
Piotr Dulikowski	6fe4f9ded8	cdc: restrict permissions on _scylla_cdc_log tables Disallows DROP permission on CDC log tables.	2020-02-10 15:40:48 +01:00
Piotr Dulikowski	0c18742997	cdc: refuse to enable cdc when table _scylla_cdc_log exists	2020-02-10 15:40:48 +01:00
Juliusz Stasiewicz	133156ddcf	cdc: disallow negative TTL values in CDC	2020-02-10 13:50:00 +01:00
Avi Kivity	dcab666d52	cql3: query_processor: reduce #includes query_processor is a central class, so reducing its includes can reduce dependencies treewite. This patch removes includes for parsed_statement, cf_statement, and untyped_result_set and fixes up the rest of the tree to include what it lacks as a result of these removals.	2020-02-09 12:24:24 +02:00
Piotr Dulikowski	534e9ba27d	cdc: store information on ttl in "ttl" column, not in tuples This patch changes the way TTL is stored in the CDC log table. Instead of including TTL of cell `X` in the third element of the tuple in column `_X`, TTL is written to the previously unused column `ttl`. This is done for cosmetic purposes. This implementation works under assumption that there will be only one TTL included in a mutation coming from a CQL write. This might not be the case when writing a batch that modifies the same row twice, e.g.: ``` BATCH INSERT INTO ks.t (pk, ck, v1) VALUES (1,2,3) USING TTL 10; INSERT INTO ks.t (pk, ck, v2) VALUES (1,2,3) USING TTL 20; END BATCH ``` In this case, this implementation will choose only one TTL value to be written in the CDC log: ``` ... \| batch_seq_no \| _ck \| _pk \| _v1 \| _v2 \| operation \| ttl ...-+--------------+-----+-----+--------+--------+-----------+----- ... \| 0 \| 2 \| 1 \| (0, 3) \| (0, 3) \| 1 \| 20 ``` This behavior might be changed as a part of issue #5719, which considers splitting a batch write mutation when it contains multiple writes to the same row. Refs #5689 Tests: unit(dev)	2020-02-08 11:10:09 +02:00
Nadav Har'El	8b6925790f	Reduce usage of global_partitioner() Merged pull request https://github.com/scylladb/scylla/pull/5733 from Piotr Jastrzębski: In many places we use global_partitioner() to obtain parameters that are available in config. This PR replaces number of global_partitioner() calls with equivalent non-global ways. tests: unit(dev) * 'reduce_global_usage' of github.com:haaawk/scylla: storage_service: reduce number of global_partitioner calls cdc: remove partitioner from db_context gossiper: stop calling global_partitioner() system_keyspace: stop calling global_partitioner() transport/server: stop calling global_partitioner() thrift: stop calling global_partitioner() partitioner: move cpu_sharding_algorithm_name to token-sharding.hh	2020-02-06 12:10:38 +02:00
Piotr Jastrzebski	97262bec82	cdc: remove partitioner from db_context partitioner from cdc::db_context is no longer used so it can be removed. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-02-06 08:00:01 +01:00
Juliusz Stasiewicz	5127568cc4	cdc: cdc per-table options put into schema extensions With this patch, client tools (in particular cqlsh) get the access to cdc options and will be able to print them with `DESC TABLE`. Fixes #5589	2020-02-05 13:44:39 +01:00
Piotr Jastrzebski	50cfe81331	murmur3: move sharding logic to token and i_partitioner Since token representation is fixed now, all the partitioners will share the sharding logic. It makes sense now to keep the logic in common super class and separate header that's included only in i_partitioner.cc. shard_of and token_for_next_shard are now implemented in i_partitioner. They would be non-virtual but we have to keep them virtual because one test is overriding them to enforce some specific sharding. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-02-05 09:31:32 +01:00
Kamil Braun	5fb5925fb4	test: add cdc::find_timestamp tests	2020-02-03 10:57:31 +01:00
Kamil Braun	bd42b10df1	cdc: rename cdc/cdc.{hh,cc} to cdc/log.{hh,cc} To increase modularity, making it easier to find what is where and maintain. The 'log' module (cdc/log.{hh,cc}) is responsible for updating CDC log tables when base table writes are performed. The 'generation' module (cdc/generation.{hh,cc}) handles stream generation changes in response to topology change events. cdc/metadata.{hh,cc} contains a helper class which holds the currently used generation of streams. It is used by both aforementioned modules: 'log' queries it, while 'generation' updates it.	2020-01-30 11:10:39 +01:00
Kamil Braun	1a56310687	locator: remove get_shard_count and get_ignore_msb_bits from snitch Snitch forms a class hierarchy which get_shard_count and get_ignore_msb_bits ignore (their returned values only depend on the gossiper's state). Besides, these functions just don't belong there. Snitch has nothing to do with shard_count or ignore_msb_bits.	2020-01-30 11:10:08 +01:00
Kamil Braun	e91af78cf5	cdc: update streams description table Inform CDC users about newly generated streams.	2020-01-30 11:10:08 +01:00
Kamil Braun	cbe510d1b8	cdc: use stream generations Change the CDC code to use the global CDC stream generations. The per-base-table CDC description table was removed. The code instead uses cdc::metadata which is updated on gossip events. The per-table description tables were replaced by a global description table to be used by clients when searching for streams.	2020-01-30 11:10:08 +01:00
Kamil Braun	834c2ca997	cdc: add cdc::metadata class The class stores a queue of CDC generations to be used for choosing streams when writing to the CDC log. This data structure will be updated on some gossip events (when a new node joins the cluster and proposes a new generation of CDC streams).	2020-01-30 11:10:08 +01:00

... 6 7 8 9 10

481 Commits