scylladb

Author	SHA1	Message	Date
Benny Halevy	a085ef74ff	atomic_cell: compare_atomic_cell_for_merge: compare ttl if expiry is equal Following up on `a57c087c89`, compare_atomic_cell_for_merge should compare the ttl value in the reverse order since, when comparing two cells that are identical in all attributes but their ttl, we want to keep the cell with the smaller ttl value rather than the larger ttl, since it was written at a later (wall-clock) time, and so would remain longer after it expires, until purged after gc_grace seconds. Fixes #10173 Test: mutation_test.test_cell_ordering, unit(dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220302154328.2400717-1-bhalevy@scylladb.com> Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220306091913.106508-1-bhalevy@scylladb.com>	2022-03-07 11:05:30 +02:00
Benny Halevy	a57c087c89	atomic_cell: compare_atomic_cell_for_merge: compare ttl if expiry is equal Unlike atomic_cell_or_collection::equals, compare_atomic_cell_for_merge currently returns std::strong_ordering::equal if two cells are equal in every way except their ttl:s. The problem with that is that the cells' hashes are different and this will cause repair to keep trying to repair discrepancies caused by the ttl being different. This may be triggered by e.g. the spark migrator that computes the ttl based on the expiry time by subtracting the expiry time from the current time to produce a respective ttl. If the cell is migrated multiple times at different times, it will generate cells that the same expiry (by design) but have different ttl values. Fixes #10156 Test: mutation_test.test_cell_ordering, unit(dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220302154328.2400717-1-bhalevy@scylladb.com>	2022-03-03 15:27:16 +02:00
Benny Halevy	d43da5d6dc	atomic_cell: compare_atomic_cell_for_merge: fixup indentation Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220302113833.2308533-2-bhalevy@scylladb.com>	2022-03-03 14:13:14 +02:00
Benny Halevy	be865a29b8	atomic_cell: compare_atomic_cell_for_merge: simplify expiry/deltion_time comparison No need to check first the the cells' expiry is different or that deletion_time is different before comparing them with `<=>`. If they are the same the function returns std::strong_ordering::equal anyhow and that is the same as `<=>` comparing identical values. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20220302113833.2308533-1-bhalevy@scylladb.com>	2022-03-03 14:12:44 +02:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Avi Kivity	6c53717a39	replica, atomic_cell: move atomic_cell merge code from replica module to atomic_cell.cc compare_atomic_cell_for_merge() was placed in database.cc, before atomic_cell.cc existed. Move it to its correct place. Closes #9889	2022-01-09 11:08:10 +02:00
Avi Kivity	1aff7d19c2	treewide: replace seastar::fmt_print() with fmt::print() We shouldn't be using Seastar as a text formatting library; that's not its focus. Use fmt directly instead. fmt::print() doesn't return the output stream which is a minor inconvenience, but that's life. Closes #9556	2021-11-01 10:05:16 +02:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Michał Chojnowski	a24f83852e	atomic_cell: fix operator<< for atomic_cell_or_collection operator<< used the wrong criterium for deciding whether the data is stored as atomic_cell or collection_mutation, resulting in catastrophical failure if it was used with frozen collections or UDTs. Since frozen collections and UDTs are stored as atomic_cell, not collection_mutation, the correct criterium is not is_collection(), but is_multi_cell(). Closes #8134	2021-02-22 14:45:34 +02:00
Botond Dénes	ba7a9d2ac3	imr: switch back to open-coded description of structures Commit `aab6b0ee27` introduced the controversial new IMR format, which relied on a very template-heavy infrastructure to generate serialization and deserialization code via template meta-programming. The promise was that this new format, beyond solving the problems the previous open-coded representation had (working on linearized buffers), will speed up migrating other components to this IMR format, as the IMR infrastructure reduces code bloat, makes the code more readable via declarative type descriptions as well as safer. However, the results were almost the opposite. The template meta-programming used by the IMR infrastructure proved very hard to understand. Developers don't want to read or modify it. Maintainers don't want to see it being used anywhere else. In short, nobody wants to touch it. This commit does a conceptual revert of `aab6b0ee27`. A verbatim revert is not possible because related code evolved a lot since the merge. Also, going back to the previous code would mean we regress as we'd revert the move to fragmented buffers. So this revert is only conceptual, it changes the underlying infrastructure back to the previous open-coded one, but keeps the fragmented buffers, as well as the interface of the related components (to the extent possible). Fixes: #5578	2021-02-16 23:43:07 +01:00
Botond Dénes	192bcc5811	data/cell: don't overshoot target allocation sizes data::cell targets 8KB as its maximum allocations size to avoid pressuring the allocator. This 8KB target is used for internal storage -- values small enough to be stored inside the cell itself -- as well for external storage. Externally stored values use 8KB fragment sizes. The problem is that only the size of data itself was considered when making the allocations. For example when allocating the fragments (chunks) for external storage, each fragment stored 8KB of data. But fragments have overhead, they have next and back pointers. This resulted in a 8KB + 2 * sizeof(void*) allocation. IMR uses the allocation strategy mechanism, which works with aligned allocations. As the seastar allocation only guarantees aligned allocations for power of two sizes, it ends up allocating a 16KB slot. This results in the mutation fragment using almost twice as much memory as would be required. This is a huge waste. This patch fixes the problem by considering the overhead of both internal and external storage ensuring allocations are 8KB or less. Fixes: #6043 Tests: unit(debug, dev, release) Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200910171359.1438029-1-bdenes@scylladb.com>	2020-09-14 14:21:46 +03:00
Juliusz Stasiewicz	0ea17216fe	atomic_cell: special rule for printing counter cells Until now, attempts to print counter update cell would end up calling abort() because `atomic_cell_view::value()` has no specialized visitor for `imr::pod<int64_t>::basic_view<is_mutable>`, i.e. counter update IMR type. Such visitor is not easy to write if we want to intercept counters only (and not all int64_t values). Anyway, linearized byte representation of counter cell would not be helpful without knowing if it consists of counter shards or counter update (delta) - and this must be known upon `deserialize`. This commit introduces simple approach: it determines cell type on high level (from `atomic_cell_view`) and prints counter contents by `counter_cell_view` or `atomic_cell_view::counter_update_value()`. Fixes #5616	2020-02-24 17:11:34 +02:00
Avi Kivity	12bc965f71	atomic_cell: consistently use comma as separator in pretty-printers The atomic_cell pretty printers use a mix of commas and semicolons. This change makes them use commas everywhere, for consistency. Message-Id: <20200116133327.2610280-1-avi@scylladb.com>	2020-01-16 17:26:33 +01:00
Avi Kivity	e3310201dd	atomic_cell_or_collection: type-aware print atomic_cell or collection components Now that atomic_cell_view and collection_mutation_view have type-aware printers, we can use them in the type-aware atomic_cell_or_collection printer. Message-Id: <20191231142832.594960-1-avi@scylladb.com>	2020-01-14 10:01:06 +02:00
Avi Kivity	972127e3a8	atomic_cell: add type-aware pretty printing The standard printer for atomic_cell prints the value as hex, because atomic_cell does not include the type. Add a type-aware printer that allows the user to provide the type.	2019-12-30 18:27:04 +02:00
Avi Kivity	19f68412ad	atomic_cell: move pretty printers from database.cc to atomic_cell.cc atomic_cell.cc is the logical home for atomic_cell pretty printers, and since we plan to add more pretty printers, start by tidying up.	2019-12-30 18:20:30 +02:00
Kamil Braun	c0d3e6c773	atomic_cell: move collection_mutation(_view) to a new file. The classes 'collection_mutation' and 'collection_mutation_view' were moved to a separate header, collection_mutation.hh. Implementations of functions that operate on these classes, including some methods of collection_type_impl, were moved to a separate compilation unit, collection_mutation.cc. This makes it easier to modify these structures in future commits in order to generalize them for non-frozen User Defined Types. Some additional documentation has been written for collection_mutation.	2019-10-25 10:19:45 +02:00
Rafael Ávila de Espíndola	ddac002fd4	Make atomic_cell comparison symmetrical I noticed a test failure with Mutation inequality is not symmetric for ... And the difference between the two mutations was that one atomic_cell was live and the other wasn't. Looking at the code I found a few cases where the comparison was not symmetrical. This patch fixes them. This patch will not fix the test, as it will now fail with a "Mutations differ" error, but that is probably an independent issue. Ref #3975. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20190325194647.54950-1-espindola@scylladb.com>	2019-03-26 11:14:22 +02:00
Piotr Jastrzebski	5a5201a50b	Move collection_type_impl out of types.hh to types/collection.hh Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-01-24 09:56:38 +01:00
Paweł Dziepak	637b9a7b3b	atomic_cell_or_collection: make operator<< show cell content After the new in-memory representation of cells was introduced there was a regression in atomic_cell_or_collection::operator<< which stopped printing the content of the cell. This makes debugging more incovenient are time-consuming. This patch fixes the problem. Schema is propagated to the atomic_cell_or_collection printer and the full content of the cell is printed. Fixes #3571. Message-Id: <20181024095413.10736-1-pdziepak@scylladb.com>	2018-10-24 13:29:51 +03:00
Paweł Dziepak	cc87263bd8	atomic_cell: accept fragmented_temporary_buffer::view values	2018-07-18 12:28:06 +01:00
Tomasz Grabiec	2ffb621271	Merge "Fix atomic_cell_or_collection::external_memory_usage()" from Paweł After the transition to the new in-memory representation in `aab6b0ee27` 'Merge "Introduce new in-memory representation for cells" from Paweł' atomic_cell_or_collection::external_memory_usage() stopped accounting for the externally stored data. Since, it wasn't covered by the unit tests the bug remained unnotices until now. This series fixes the memory usage calculation and adds proper unit tests. * https://github.com/pdziepak/scylla.git fix-external-memory-usage/v1: tests/mutation: properly mark atomic_cells that are collection members imr::utils::object: expose size overhead data::cell: expose size overhead of external chunks atomic_cell: add external chunks and overheads to external_memory_usage() tests/mutation: test external_memory_usage()	2018-07-03 14:58:10 +02:00
Paweł Dziepak	92700c6758	atomic_cell: allow passing value via ser::buffer_view	2018-06-28 22:11:19 +01:00
Paweł Dziepak	8153df7684	atomic_cell: add external chunks and overheads to external_memory_usage()	2018-06-28 19:20:23 +01:00
Paweł Dziepak	a040d37cd5	atomic_cell: switch to new IMR-based cell reperesentation This patch changes the implementation of atomic_cell and atomic_cell_or_collection to use the data::cell implementation which is based on the new in-memory representation infrastructure.	2018-05-31 15:51:11 +01:00

25 Commits