scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-22 01:20:39 +00:00

Author	SHA1	Message	Date
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Botond Dénes	a7866f783f	position_in_partition: add after_key(position_in_partition_view)	2021-12-10 15:48:49 +02:00
Kamil Braun	ea6310961c	test: sstable_conforms_to_mutation_source_test: extend `test_sstable_reversing_reader_random_schema` with fast-forwarding The test would check whether the forward and reverse readers returned consistent results when created in non-forwarding mode with slicing. Do the same but using fast-forwarding instead of slicing. To do this we require a vector of `position_range`s. We also need a vector of `clustering_range`s for the existing test. We modify the existing `random_ranges` function to return `position_range`s instead of `clustering_range`s since `position_range`s are easier to reason about, especially when we consider non-full clustering key prefixes. A function is introduced to convert a `position_range` to a `clustering_range` for the existing test.	2021-11-29 11:10:46 +01:00
Avi Kivity	fd8beeaea9	treewide: handle switch statements that return A switch statement where every case returns triggers a gcc warning if the surrounding function doesn't return/abort. Fix by adding an abort(). The abort() will never trigger since we have a warning on unhandled switch cases.	2021-10-10 18:16:50 +03:00
Tomasz Grabiec	2b3ae6aca4	position_in_partition: Introduce reversed() transformation It transforms the position from a forward-clustering-order schema domain to a reversed-clustering-order schema domain. The object still refers to the same element of the space of keys under this transformation. However, the identification of the position, the position_in_partition object, is schema-dependent, it is always interpreted relative to some schema. Hence the need to transform it when switching schema domains. Message-Id: <20210917102612.308149-1-tgrabiec@scylladb.com>	2021-09-27 14:23:09 +03:00
Avi Kivity	0909e3c17d	treewide: remove redundant "x <=> 0" compares If x is of type std::strong_ordering, then "x <=> 0" is equivalent to x. These no-ops were inserted during #1449 fixes, but are now unnecessary. They have potential for harm, since they can hide an accidental of the type of x to an arithmetic type, so remove them. Ref #1449.	2021-07-28 13:30:32 +03:00
Avi Kivity	e52ebe2da5	types: convert abstract_type::compare and related to std::strong_ordering Change comparators around types to std::strong_ordering. Ref #1449.	2021-07-28 13:19:24 +03:00
Botond Dénes	5e77f07263	position_in_paritition{_view}: add has_key()	2021-07-12 07:11:29 +03:00
Tomasz Grabiec	53568f6939	sstables: k_l: reader: Trim range tombstones to query ranges This is needed to change the guarantees of flat_mutation_reader v1 to produce only range tombstones trimmed to clustering restrictions. The reason for this is so that v2 has a canonical representation in which all fragments have position inside clustering restrictions. Conversion from v1 to v2 can guarantee that only if v1 trims range tombstones.	2021-06-15 13:14:45 +02:00
Tomasz Grabiec	08f471043e	position_range: Introduce contains() check for ranges	2021-06-15 13:14:45 +02:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Pavel Emelyanov	92e72c62dc	position_in_partition: Convert tri_compare to strong_ordering All its users are now ready to accept both - int and the strong_ordering value, so the change is pretty straightforward. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-04-09 18:20:39 +03:00
Michał Chojnowski	dbcf987231	keys, compound: switch from bytes_view to managed_bytes_view The keys classes (partition_key et al) already use managed_bytes, but they assume the data is not fragmented and make liberal use of that by casting to bytes_view. The view classes use bytes_view. Change that to managed_bytes_view, and adjust return values to managed_bytes/managed_bytes_view. The callers are adjusted. In some places linearization (to_bytes()) is needed, but this isn't too bad as keys are always <= 64k and thus will not be fragmented when out of LSA. We can remove this linearization later. The serialize_value() template is called from a long chain, and can be reached with either bytes_view or managed_bytes_view. Rather than trace and adjust all the callers, we patch it now with constexpr if. operator bytes_view (in keys) is converted to operator managed_bytes_view, allowing callers to defer or avoid linearization.	2021-01-08 14:16:08 +01:00
Avi Kivity	f802356572	Revert "Revert "Merge "raft: fix replication if existing log on leader" from Gleb"" This reverts commit `dc77d128e9`. It was reverted due to a strange and unexplained diff, which is now explained. The HEAD on the working directory being pulled from was set back, so git thought it was merging the intended commits, plus all the work that was committed from HEAD to master. So it is safe to restore it.	2020-12-08 19:19:55 +02:00
Avi Kivity	dc77d128e9	Revert "Merge "raft: fix replication if existing log on leader" from Gleb" This reverts commit `0aa1f7c70a`, reversing changes made to `72c59e8000`. The diff is strange, including unrelated commits. There is no understanding of the cause, so to be safe, revert and try again.	2020-12-06 11:34:19 +02:00
Kamil Braun	d158921966	sstables: add `may_have_partition_tombstones` method For sstable versions greater or equal than md, the `min_max_column_names` sstable metadata gives a range of position-in-partitions such that all clustering rows stored in this sstable have positions in this range. Partition tombstones in this context are understood as covering the entire range of clustering keys; thus, if the sstable contains at least one partition tombstone, the sstable position range is set to be the range of all clustered rows. Therefore, by checking that the position range is not the range of all clustered rows we know that the sstable cannot have any partition tombstones. Closes #7678	2020-11-23 23:30:19 +02:00
Botond Dénes	d7d93aef49	position_in_partition_view: add position_in_partition_view before_key() overload	2020-09-25 12:09:00 +03:00
Dejan Mircevski	fb6c011b52	everywhere: Insert space after `switch` Quoth @avikivity: "switch is not a function, and we celebrate that by putting a space after it like other control-flow keywords." https://github.com/scylladb/scylla/pull/7052#discussion_r471932710 Tests: unit (dev) Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-08-18 14:31:04 +03:00
Tomasz Grabiec	9885d0e806	position_in_partition: Introduce external_memory_usage()	2020-06-16 16:15:24 +02:00
Avi Kivity	6728b96df7	clustering_interval_set: split to own header file clustering_interval_set is a rarely used class, but one that requires boost/icl, which is quite heavyweight. To speed up compilation, move it to its own header and sprinkle #includes where needed. Tests: unit (dev) Message-Id: <20200214190507.1137532-1-avi@scylladb.com>	2020-02-16 17:40:47 +02:00
Avi Kivity	488c42408a	position_in_partition_view: add type-aware printer If the position_in_partition_view represents a clustering key, we can now see it with the clustering key decoded according to the schema. Message-Id: <20191231151315.602559-1-avi@scylladb.com>	2020-01-07 12:15:09 +01:00
Benny Halevy	a37acee68f	position_in_partition: define operator=(position_in_partition_view) The respective constructor is explicit. Define this assignment operator to be used by flat_mutation_reader mutation_fragment_stream_validator filter so that it can use mutation_fragment::position() verbatim and keep its internal state as position_in_partition. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-09-09 15:30:59 +03:00
Botond Dénes	4cb873abfe	query::trim_clustering_row_ranges_to(): fix handling of non-full prefix keys Non-full prefix keys are currently not handled correctly as all keys are treated as if they were full prefixes, and therefore they represent a point in the key space. Non-full prefixes however represent a sub-range of the key space and therefore require null extending before they can be treated as a point. As a quick reminder, `key` is used to trim the clustering ranges such that they only cover positions >= then key. Thus, `trim_clustering_row_ranges_to()` does the equivalent of intersecting each range with (key, inf). When `key` is a prefix, this would exclude all positions that are prefixed by key as well, which is not desired. Fixes: #4839 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20190819134950.33406-1-bdenes@scylladb.com>	2019-08-20 00:24:51 +02:00
Botond Dénes	20c06adf80	position_in_partition: add for_partition_start()	2019-08-13 09:47:55 +03:00
Botond Dénes	1b4e88b972	position_in_partition_view: add get_bound_weight()	2019-08-13 09:47:55 +03:00
Botond Dénes	fd925f6049	position_in_partition_view: add constructor with bound_weight This is a low level constructor which allows directly providing a bound weight to go with the key.	2019-08-09 10:54:27 +03:00
Botond Dénes	b30af48c83	position_in_partition_view: add region() accessor	2019-04-26 11:38:12 +03:00
Asias He	165d3053b1	position_in_partition: Add get_type, get_bound_weight and get_clustering_key_prefix Needed by the RPC serialization code.	2018-12-12 16:49:01 +08:00
Asias He	4e55d22a8f	position_in_partition: Switch _bound_weight to use enum The _bound_weight in position_in_partition will be sent on wire in rpc. Make it enum instead of int.	2018-12-12 16:49:01 +08:00
Asias He	5bc109e1ee	position_in_partition: Add bound_weight It will be used to change _bound_weight to use enum instead of int8_t.	2018-12-12 16:49:01 +08:00
Asias He	05c663b932	position_in_partition: Use std::optional for clustering_key_prefix The new row level repair code will access clustering_key_prefix and it uses std::optional everywhere. Convert position_in_partition to use std::optional.	2018-12-12 16:49:01 +08:00
Asias He	0b31d7059b	position_in_partition: Make partition_region uint8_t It will be sent over rpc. Make the type explicit.	2018-12-12 16:49:01 +08:00
Botond Dénes	a594fd39ce	position_in_partition: add region() accessor	2018-12-04 08:51:05 +02:00
Piotr Jastrzebski	bff49345cd	Add position_in_partition_view::as_end_bound_view This will be used in sstables 3. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-09-25 17:55:52 -07:00
Vladimir Krivopalov	593d8faf7d	position_in_partition: Add a constructor from range_tag_t{}, bound_kind and clustering_key_prefix. This facilitates position_in_partition creation when parsing range tombstones bounds from SSTables files. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-07-20 13:50:17 -07:00
Avi Kivity	31151cadd4	Merge "row_cache: Fix violation of continuity on concurrent eviction and population" from Tomasz " The problem happens under the following circumstances: - we have a partially populated partition in cache, with a gap in the middle - a read with no clustering restrictions trying to populate that gap - eviction of the entry for the lower bound of the gap concurrent with population The population may incorrectly mark the range before the gap as continuous. This may result in temporary loss of writes in that clustering range. The problem heals by clearing cache. Caught by row_cache_test::test_concurrent_reads_and_eviction, which has been failing sporadically. The problem is in ensure_population_lower_bound(), which returns true if current clustering range covers all rows, which means that the populator has a right to set continuity flag to true on the row it inserts. This is correct only if the current population range actually starts since before all clustering rows. Otherwise, we're populating since _last_row and should consult it. Fixes #3608. " * 'tgrabiec/fix-violation-of-continuity-on-concurrent-read-and-eviction' of github.com:tgrabiec/scylla: row_cache: Fix violation of continuity on concurrent eviction and population position_in_partition: Introduce is_before_all_clustered_rows()	2018-07-18 10:11:34 +03:00
Tomasz Grabiec	8d47d21149	position_in_partition: Introduce is_before_all_clustered_rows()	2018-07-17 16:43:21 +02:00
Tomasz Grabiec	ac772cbd81	clustering_interval_set: Introduce contained_in()	2018-07-17 16:30:01 +02:00
Tomasz Grabiec	d24ebe8565	clustering_interval_set: Introduce add() overload accepting another interval set	2018-07-17 16:30:01 +02:00
Vladimir Krivopalov	82f76b0947	Use std::reference_wrapper instead of a plain reference in bound_view. The presence of a plain reference prohibits the bound_view class from being copyable. The trick employed to work around that was to use 'placement new' for copy-assigning bound_view objects, but this approach is ill-formed and causes undefined behaviour for classes that have const and/or reference members. The solution is to use a std::reference_wrapper instead. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com> Message-Id: <a0c951649c7aef2f66612fc006c44f8a33713931.1530113273.git.vladimir@scylladb.com>	2018-06-28 11:24:06 +01:00
Vladimir Krivopalov	03cf20676c	Revert "Add missing enum values to bound_kind." This reverts commit `3ecc9e9ce4`. It also adds another enum to be used instead.	2018-06-18 14:22:12 -07:00
Vladimir Krivopalov	3ecc9e9ce4	Add missing enum values to bound_kind. bound_kind::clustering, bound_kind::excl_end_incl_start and bound_kind::incl_end_excl_start are used during SSTables 3.0 writing. bound_kind::static_clustering is not used yet but added for completeness and parity with the Origin. For #1969. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-04-26 14:34:20 -07:00
Duarte Nunes	12507fb9ce	keys: Replace feed_hash() member function with appending_hash Replace the feed_hash() member function of partition_key and clustering_key_prefix with the specialization of appending_hash, so that we can use the general feed_hash() function. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-02-01 00:22:50 +00:00
Tomasz Grabiec	bbd9ef6b59	position_in_partition: Introduce after_key()	2018-01-18 11:32:48 +01:00
Tomasz Grabiec	0c0d52a933	clustering_interval_set: Introduce overlaps()	2017-12-22 11:06:33 +01:00
Tomasz Grabiec	c1d96bda88	clustering_interval_set: Extract private make_interval()	2017-12-22 11:06:33 +01:00
Tomasz Grabiec	7e5d243a95	Introduce clustering_interval_set Will make it easy to represent and manipulate continuity in tests. Could also replace clustering_row_ranges in the future, which is currently a naked vector<> with no semantic methods.	2017-12-08 12:01:27 +01:00
Tomasz Grabiec	72028bb048	mutation_partition: Allow creating rows_entry at any clustered position_in_partition In preparation for supporting setting continuity of arbitrary clustering range.	2017-11-02 11:05:19 +01:00
Tomasz Grabiec	9ac1b515e1	position_in_partition: Do not use -2 and +2 weights ::weight() is using those values for excl_end and excl_start in order to be able to represent non-overlapping ranges. In their model the end bound is inclusive. We don't need this, since position_range has end bound exclusive. This change makes that: position_in_partition::after_key(y) == position_in_partition::for_range_end(clutering_range::make({x}, {y})	2017-11-02 11:05:19 +01:00
Tomasz Grabiec	34cb13939f	position_in_partition: Introduce before_key()	2017-11-02 11:05:19 +01:00

1 2

66 Commits