scylladb

Author	SHA1	Message	Date
Avi Kivity	f85cd289bc	Merge "repair: make sure there is one permit per repair with count res" from Botond " Repair obtains a permit for each repair-meta instance it creates. This permit is supposed to track all resources consumed by that repair as well as ensure concurrency limit is respected. However when the non-local reader path is used (shard config of master != shard config of follower), a second permit will be obtained -- for the shard reader of the multishard reader. This creates a situation where the repair-meta's permit can block the shard permit, creating a deadlock situation. This patch solves this by dropping the count resource on the repair-meta's permit when a non-local reader path is executed -- that is a multishard reader is created. Fixes: #9751 " * 'repair-double-permit-block/v4' of https://github.com/denesb/scylla: repair: make sure there is one permit per repair with count res reader_permit: add release_base_resource() (cherry picked from commit `52b7778ae6`)	2022-01-17 16:02:55 +02:00
Beni Peled	5e661af9a4	release: prepare for 4.6.rc3	2022-01-17 13:11:54 +02:00
Calle Wilund	5629b67d25	messaging_service: Make dc/rack encryption check for connection more strict Fixes #9653 When doing an outgoing connection, in an internode_encryption=dc/rack situation we should not use endpoint/local broadcast solely to determine if we can downgrade a connection. If gossip/message_service determines that we will connect to a different address than the "official" endpoint address, we should use this to determine association of target node, and similarly, if we bind outgoing connection to interface != bc we need to use this to decide local one. Note: This will effectively _disable_ internode_encryption=dc/rack on ec2 etc until such time that gossip can give accurate info on dc/rack for "internal" ip addresses of nodes. (cherry picked from commit `4778770814`)	2022-01-16 19:10:57 +02:00
Takuya ASADA	ad632cf7fc	dist: fix scylla-housekeeping uuid file chmod call Should use chmod() on a file, not fchmod() Fixes #9683 Closes #9802 (cherry picked from commit `7064ae3d90`)	2022-01-10 16:57:34 +02:00
Botond Dénes	ca24bebcf2	sstables/partition_index_cache: destroy entry ptr on error The error-handling code removes the cache entry but this leads to an assertion because the entry is still referenced by the entry pointer instance which is returned on the normal path. To avoid this clear the pointer on the error path and make sure there are no additional references kept to it. Fixes #9887 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20220105140859.586234-2-bdenes@scylladb.com> (cherry picked from commit `92727ac36c`)	2022-01-07 21:21:44 +01:00
Calle Wilund	7dc5abb6f8	commitlog: Don't allow error_handler to swallow exception Fixes #9798 If an exception in allocate_segment_ex is (sub)type of std::system_error, commit_error_handler might _not_ cause throw (doh), in which case the error handling code would forget the current exception and return an unusable segment. Now only used as an exception pointer replacer. Closes #9870 (cherry picked from commit `3c02cab2f7`)	2022-01-06 14:10:18 +02:00
Yaron Kaikov	e8a1cfb6f8	release: prepare for 4.6.rc2 (tag: scylla-4.6.rc2)	2022-01-02 09:15:47 +02:00
Tomasz Grabiec	fc312b3021	lsa: Fix segment leak on memory reclamation during alloc_buf alloc_buf() calls new_buf_active() when there is no active segment to allocate a new active segment. new_buf_active() allocates memory (e.g. a new segment) so may cause memory reclamation, which may cause segment compaction, which may call alloc_buf() and re-enter new_buf_active(). The first call to new_buf_active() would then override _buf_active and cause the segment allocated during segment compaction to be leaked. This then causes abort when objects from the leaked segment are freed because the segment is expected to be present in _closed_segments, but isn't. boost::intrusive::list::erase() will fail on assertion that the object being erased is linked. Introduced in `b5ca0eb2a2`. Fixes #9821 Fixes #9192 Fixes #9825 Fixes #9544 Fixes #9508 Refs #9573 Message-Id: <20211229201443.119812-1-tgrabiec@scylladb.com> (cherry picked from commit `7038dc7003`)	2021-12-30 18:56:28 +02:00
Nadav Har'El	7b82aaf939	alternator: fix error on UpdateTable for non-existent table When the UpdateTable operation is called for a non-existent table, the appropriate error is ResourceNotFoundException, but before this patch we ran into an exception, which resulted in an ugly "internal server error". In this patch we use the existing get_table() function which most other operations use, and which does all the appropriate verifications and generates the appropriate Alternator api_error instead of letting internal Scylla exceptions escape to the user. This patch also includes a test for UpdateTable on a non-existent table, which used to fail before this patch and pass afterwards. We also add a test for DeleteTable in the same scenario, and see it didn't have this bug. As usual, both tests pass on DynamoDB, which confirms we generate the right error codes. Fixes #9747. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20211206181605.1182431-1-nyh@scylladb.com> (cherry picked from commit `31eeb44d28`)	2021-12-29 22:59:25 +02:00
Nadav Har'El	894a4abfae	commitlog: fix missing wait for semaphore units Commit `dcc73c5d4e` introduced a semaphore for excluding concurrent recalculations - _reserve_recalculation_guard. Unfortunately, the two places in the code which tried to take this guard just called get_units() - which returns a future<units>, not units - and never waited for this future to become available. So this patch adds the missing "co_await" needed to wait for the units to become available. Fixes #9770. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20211214122612.1462436-1-nyh@scylladb.com> (cherry picked from commit `b8786b96f4`)	2021-12-29 13:18:59 +02:00
Takuya ASADA	4dcf023470	scylla_raid_setup: workaround for mdmonitor.service issue on CentOS8 On CentOS8, mdmonitor.service does not work correctly when using mdadm-4.1-15.el8.x86_64 and later versions. Until we find a solution, let's pin the package version to an older one which does not cause the issue (4.1-14.el8.x86_64). Fixes #9540 Closes #9782 (cherry picked from commit `0d8f932f0b`)	2021-12-28 11:38:04 +02:00
Benny Halevy	283788828e	compaction: scrub_validate_mode_validate_reader: throw compaction_stopped_exception if stop is requested Currently when scrub/validate is stopped (e.g. via the api), scrub_validate_mode_validate_reader co_return:s without closing the reader passed to it - causing a crash due to internal error check, see #9766. Throwing a compaction_stopped_exception rather than co_return:ing an exception will be handled as any other exception, including closing the reader. Fixes #9766 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20211213125528.2422745-1-bhalevy@scylladb.com> (cherry picked from commit `c89876c975`)	2021-12-15 15:03:59 +02:00
Pavel Emelyanov	730a147ba6	row-cache: Handle exception (un)safety of rows_entry insertion The B-tree's insert_before() is a throwing operation, its caller must account for that. When the rows_entry's collection was switched to B-tree all the risky places were fixed by `ee9e1045`, but a few places went under the radar. In the cache_flat_mutation_reader there's a place where a C-pointer is inserted into the tree, thus potentially leaking the entry. In the partition_snapshot_row_cursor there are two places that not only leak the entry, but also leave it in the LRU list. The latter is quite nasty, because those entries can be evicted, eviction code tries to get rows_entry iterator from "this", but the hook happens to be unattached (because insertion threw) and fails the assert. fixes: #9728 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> (cherry picked from commit `ee103636ac`)	2021-12-14 15:53:42 +02:00
Pavel Emelyanov	9897e83029	partition_snapshot_row_cursor: Shuffle ensure_result creation Both places get the C-pointer on the freshly allocated rows_entry, insert it where needed and return back the dereferenced pointer. The C-pointer is going to become smart-pointer that would go out of scope before return. This change prepares for that by constructing the ensure_result from the iterator, that's returned from insertion of the entry. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> (cherry picked from commit `9fd8db318d`) Ref #9728	2021-12-14 15:52:37 +02:00
Asias He	1a9b64e6f6	storage_service: Wait for seastar::get_units in node_ops The seastar::get_units returns a future, we have to wait for it. Fixes #9767 Closes #9768 (cherry picked from commit `9859c76de1`)	2021-12-12 18:42:20 +02:00
Takuya ASADA	49fe9e2c8e	dist: allow running scylla-housekeeping with strict umask setting To avoid failing scylla-housekeeping in strict umask environment, we need to chmod a+r on repository file and housekeeping.uuid. Fixes #9683 Closes #9739 (cherry picked from commit `ea20f89c56`)	2021-12-12 14:25:57 +02:00
Takuya ASADA	d0580c41ee	dist: add support for im4gn/is4gen instances on AWS Add support for next-generation, storage-optimized ARM64 instance types. Fixes #9711 Closes #9730 (cherry picked from commit `097a6ee245`)	2021-12-08 14:29:44 +02:00
Beni Peled	542394c82f	release: prepare for 4.6.rc1 (tag: scylla-4.6.rc1)	2021-12-08 11:08:45 +02:00
Avi Kivity	018ad3f6f4	test: refine test suite names exposed via xunit format The test suite names seen by Jenkins are suboptimal: there is no distinction between modes, and the ".cc" suffix of file names is interpreted as a class name, which is converted to a tree node that must be clicked to expand. Massage the names to remove unnecessary information and add the mode. Closes #9696 (cherry picked from commit `ef3edcf848`) Fixes #9738.	2021-12-05 19:58:22 +02:00
Avi Kivity	9b8b7efb54	tests: consolidate boost xunit result files The recent parallelization of boost unit tests caused an increase in xml result files. This is challenging to Jenkins, since it appears to use rpc-over-ssh to read the result files, and as a result it takes more than an hour to read all result files when the Jenkins main node is not on the same continent as the agent. To fix this, merge the result files in test.py and leave one result file per mode. Later we can leave one result file overall (integrating the mode into the testsuite name), but that can wait. Tested on a local Jenkins instance (just reading the result files, not the entire build). Closes #9668 (cherry picked from commit `b23af15432`) Fixes #9738	2021-12-05 19:57:39 +02:00
Botond Dénes	1c3e63975f	Merge 'Backport of #9348 (Exceptions in commitlog::segment_manager::delete_segments could cause footprint counters to lose track)' from Calle Wilund Backport of series to 4.6 Upstream merge commit: `e2c27ee743`. Refs #9348 Closes #9702 * github.com:scylladb/scylla: commitlog: Recalculate footprint on delete_segment exceptions commitlog_test: Add test for exception in alloc w. deleted underlying file commitlog: Ensure failed-to-create-segment is re-deleted commitlog::allocate_segment_ex: Don't re-throw out of function	2021-12-02 09:22:19 +02:00
Calle Wilund	11bb03e46d	commitlog: Recalculate footprint on delete_segment exceptions Fixes #9348 If we get exceptions in delete_segments, we can, and probably will, lose track of footprint counters. We need to recompute the used disk footprint, otherwise we will flush too often, and even block indefinitely on new_seg iff using hard limits.	2021-11-29 14:56:48 +00:00
Calle Wilund	810e410c5d	commitlog_test: Add test for exception in alloc w. deleted underlying file Tests that we can handle exception-in-alloc cleanup if the file actually does not exist. This however uncovers another weakness (addressed in next patch) - that we can lose track of disk footprint here, and w. hard limits end up waiting for disk space that never comes. Thus test does not use hard limit.	2021-11-29 14:56:43 +00:00
Calle Wilund	97f6da0c3e	commitlog: Ensure failed-to-create-segment is re-deleted Fixes #9343 If we fail in allocate_segment_ex, we should push the file opened/created to the delete set to ensure we reclaim the disk space. We should also ensure that if we did not recycle a file in delete_segments, we still wake up any recycle waiters iff we made a file delete instead. Included a small unit test.	2021-11-29 14:51:39 +00:00
Calle Wilund	c229fe9694	commitlog::allocate_segment_ex: Don't re-throw out of function Fixes #9342 commitlog_error_handler rethrows. But we want to not. And run post-handler cleanup (co_await)	2021-11-29 14:51:39 +00:00
Tomasz Grabiec	ee1ca8ae4d	lsa: Add sanity checks around lsa_buffer operations We've been observing hard to explain crashes recently around lsa_buffer destruction, where the containing segment is absent in _segment_descs which causes log_heap::adjust_up to abort. Add more checks to catch certain impossible scenarios which can lead to this sooner. Refs #9192. Message-Id: <20211116122346.814437-1-tgrabiec@scylladb.com> (cherry picked from commit `bf6898a5a0`)	2021-11-24 15:17:37 +01:00
Tomasz Grabiec	6bfd322e3b	lsa: Mark compact_segment_locked() as noexcept We cannot recover from a failure in this method. The implementation makes sure it never happens. Invariants will be broken if this throws. Detect violations early by marking as noexcept. We could make it exception safe and try to leave the data structures in a consistent state but the reclaimer cannot make progress if this throws, so it's pointless. Refs #9192 Message-Id: <20211116122019.813418-1-tgrabiec@scylladb.com> (cherry picked from commit `4d627affc3`)	2021-11-24 15:17:35 +01:00
Tomasz Grabiec	afc18d5070	cql: Fix missing data in indexed queries with base table short reads Indexed queries are using paging over the materialized view table. Results of the view read are then used to issue reads of the base table. If base table reads are short reads, the page is returned to the user and paging state is adjusted accordingly so that when paging is resumed it will query the view starting from the row corresponding to the next row in the base which was not yet returned. However, paging state's "remaining" count was not reset, so if the view read was exhausted the reading will stop even though the base table read was short. Fix by restoring the "remaining" count when adjusting the paging state on short read. Tests: - index_with_paging_test - secondary_index_test Fixes #9198 Message-Id: <20210818131840.1160267-1-tgrabiec@scylladb.com> (cherry picked from commit `1e4da2dcce`) (tag: scylla-4.6.rc0)	2021-11-23 11:22:00 +02:00
Tomasz Grabiec	2ec22c2404	sstables: partition_index_cache: Avoid abort due to benign bad_alloc inside allocating section shared_promise::get_shared_future() is marked noexcept, but can allocate memory. It is invoked by sstable partition index cache inside an allocating section, which means that allocations can throw bad_alloc even though there is memory to reclaim, so under normal conditions. Fix by allocating the shared_promise in a stable memory, in the standard allocator via lw_shared_ptr<>, so that it can be accessed outside allocating section. Fixes #9666 Tests: - build/dev/test/boost/sstable_partition_index_cache_test Message-Id: <20211122165100.1606854-1-tgrabiec@scylladb.com> (cherry picked from commit `1d84bc6c3b`)	2021-11-23 11:21:27 +02:00
Avi Kivity	19da778271	Merge "Run gossiper message handlers in a gate" from Pavel E " When gossiper processes its messages in the background some of the continuations may pop up after the gossiper is shutdown. This, in turn, may result in unwanted code to be executed when it doesn't expect. In particular, storage_service notification hooks may try to update system keyspace (with "fresh" peer info/state/tokens/etc). This update doesn't work after drain because drain shuts down commitlog. The intention was that gossiper did _not_ notify anyone after drain, because it's shut down during drain too. But since there are background continuations left, it's not working as expected. refs: #9567 tests: unit(dev), dtest.concurrent_schema_changes.snapshot(dev) " * 'br-gossiper-background-messages-2' of https://github.com/xemul/scylla: gossiper: Guard background processing with gate gossiper: Helper for background messaging processing (cherry picked from commit `9e2b6176a2`)	2021-11-19 07:25:26 +02:00
Avi Kivity	cbd4c13ba6	Merge 'Revert "scylla_util.py: return bool value on systemd_unit.is_active()"' from Takuya ASADA In scylla_util.py, we provide `systemd_unit.is_active()` to return `systemctl is-active` output. When we introduced the systemd_unit class, we just returned the `systemctl is-active` output as a string, but we later changed the return value to bool (`2545d7fd43`). This was to avoid a scripting bug: `if unit.is_active():` always evaluates to True, even when it returns "failed" or "inactive". However, this was probably a mistake, because a systemd unit does not have just two states, like "start" / "stop"; there are many states. And we already use multiple unit states ("activating", "failed", "inactive", "active") in our Cloud image login prompt: https://github.com/scylladb/scylla-machine-image/blob/next/common/scylla_login#L135 After we merged `2545d7fd43`, the login prompt is broken, because it does not return a string as the script expects (https://github.com/scylladb/scylla-machine-image/issues/241). I think we should revert `2545d7fd43`; it should return exactly the same value as `systemctl is-active` reports. Fixes #9627 Fixes scylladb/scylla-machine-image#241 Closes #9628 * github.com:scylladb/scylla: scylla_ntp_setup: use string in systemd_unit.is_active() Revert "scylla_util.py: return bool value on systemd_unit.is_active()" (cherry picked from commit `c17101604f`)	2021-11-18 11:44:11 +02:00
Pavel Emelyanov	338871802d	generic_server: Keep server alive during conn background processing There's at least one tiny race in generic_server code. The trailing .handle_exception after the conn->process() captures this, but since the whole continuation chain happens in the background, that this can be released thus causing the whole lambda to execute on freed generic_server instance. This, in turn, is not nice because captured this is used to get a _logger from. The fix is based on the observation that all connections pin the server in memory until all of them (connections) are destructed. Said that, to keep the server alive in the aforementioned lambda it's enough to make sure the conn variable (it's lw_shared_ptr on the connection) is alive in it. Not to generate a bunch of tiny continuations with identical set of captures -- tail the single .then_wrapped() one and do whatever is needed to wrap up the connection processing in it. tests: unit(dev) fixes: #9316 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20211115105818.11348-1-xemul@scylladb.com> (cherry picked from commit `ba16318457`)	2021-11-17 10:21:11 +02:00
Yaron Kaikov	8b5b1b8af6	dist/docker/debian/build_docker.sh: debian version fix for rc releases When building a docker image we rely on the `VERSION` value from `SCYLLA-VERSION-GEN`. For `rc` releases only, there is a difference between the configured version (X.X.rcX) and the actual debian package we generate (X.X~rcX). Using a similar solution as I did in `dcb10374a5` Fixes: #9616 Closes #9617 (cherry picked from commit `060a91431d`)	2021-11-12 20:07:19 +02:00
Takuya ASADA	ea89eff95d	dist/docker: fix bashrc filename for Ubuntu For Debian variants, correct filename is /etc/bash.bashrc. Fixes #9588 Closes #9589 (cherry picked from commit `201a97e4a4`)	2021-11-10 14:25:27 +02:00
Michał Radwański	96421e7779	memtable: fix gcc function argument evaluation order induced use after move clang evaluates function arguments from left to right, while gcc does so in reverse. Therefore, this code can be correct on clang and incorrect on gcc: ``` f(x.sth(), std::move(x)) ``` This patch fixes one such instance of this bug, in memtable.cc. Fixes #9605. Closes #9606 (cherry picked from commit `eff392073c`)	2021-11-10 08:58:09 +02:00
Botond Dénes	142336ca53	mutation_writer/feed_writer: don't drop readers with small amount of content Due to an error in transforming the above routine, readers who have <= a buffer worth of content are dropped without consuming them. This is due to the outer consume loop being conditioned on `is_end_of_stream()`, which will be set for readers that eagerly pre-fill their buffer and also have no more data than what is in their buffer. Change the condition to also check for `is_buffer_empty()` and only drop the reader if both of these are true. Fixes: #9594 Tests: unit(mutation_writer_test --repeat=200, dev) Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20211108092923.104504-1-bdenes@scylladb.com> (cherry picked from commit `4b6c0fe592`)	2021-11-09 14:13:21 +02:00
Calle Wilund	492f12248c	commitlog: Add explicit track var for "wasted space" to avoid double counting Refs #9331 In segment::close() we add space to managers "wasted" counter. In destructor, if we can cleanly delete/recycle the file we remove it. However, if we never went through close (shutdown - ok, exception in batch_cycle - not ok), we can end up subtracting numbers that were never added in the first place. Just keep track of the bytes added in a var. Observed behaviour in above issue is timeouts in batch_cycle, where we declare the segment closed early (because we cannot add anything more safely - chunks could get partial/misplaced). Exception will propagate to caller(s), but the segment will not go through actual close() call -> destructor should not assume such. Closes #9598 (cherry picked from commit `3929b7da1f`)	2021-11-09 14:07:04 +02:00
Yaron Kaikov	7eb7a0e5fe	release: prepare for 4.6.rc0	2021-11-08 09:18:26 +02:00
Botond Dénes	e991604918	schema: make private constructor invokable via make_lw_shared The schema has a private constructor, which means it can't be constructed with `make_lw_shared()` even by classes which are otherwise able to invoke the private constructor themselves. This results in such classes (`schema_builder`) resorting to building a local schema object, then invoking `make_lw_shared()` with the schema's public move constructor. Moving a schema is not cheap at all however, so each `schema_builder::build()` call results in two expensive schema construction operations. We could make `make_lw_shared()` a friend of `schema` to resolve this, but then we'd de-facto open the private constructor to the world. Instead this patch introduces a private tag type, which is added to the private constructor, which is then made public. Everybody can invoke the constructor but only friends can create the private tag instance required to actually call it. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20211105085940.359708-1-bdenes@scylladb.com>	2021-11-07 12:51:09 +02:00
Tomasz Grabiec	31bc1eb681	Merge 'Memtable reversing reader: fix computing rt slice, if there was previously emitted range tombstone.' from Michał Radwański This PR started by realizing that in the memtable reversing reader, it never happened on tests that `do_refresh_state` was called with `last_row` and `last_rts` which are not `std::nullopt`. Changes - fix memtable test (`test_memtable_with_many_versions_conforms_to_mutation_source`), so that there is a background job forcing state refreshes, - fix the way rt_slice is computed (was `(last_rts, cr_range_snapshot.end]`, now is `[cr_range_snapshot.start, last_rts)`). Fixes #9486 Closes #9572 * github.com:scylladb/scylla: partition_snapshot_reader: fix indentation in fill_buffer range_tombstone_list: {lower,upper,}slice share comparator implementation test: memtable: add full_compaction in background partition_snapshot_reader: fix obtaining rt_slice, if Reversing and _last_rts was set range_tombstone_list: add lower_slice	2021-11-05 15:27:03 +01:00
Michał Radwański	ee601b7d87	partition_snapshot_reader: fix indentation in fill_buffer	2021-11-05 10:51:58 +01:00
Michał Radwański	35b1c3ff52	range_tombstone_list: {lower,upper,}slice share comparator implementation slice (2 overloads), upper_slice, lower_slice previously had implementations of a comparator. Move out the common structs, so that all 4 of them can share implementation.	2021-11-05 10:51:58 +01:00
Michael Livshin	60f76155a7	build: have configure.py create compile_commands.json compile_commands.json (a.k.a. "compdb", https://clang.llvm.org/docs/JSONCompilationDatabase.html) is intended to help stand-alone C-family LSP servers index the codebase as precisely as possible. The actively maintained LSP servers with good C++ support are: - Clangd (https://clangd.llvm.org/) - CCLS (https://github.com/MaskRay/ccls) This change causes a successful invocation of configure.py to create a unified Scylla+Seastar+Abseil compdb for every selected build mode, and to leave a valid symlink in the source root (if a valid symlink already exists, it will be left alone). Signed-off-by: Michael Livshin <michael.livshin@scylladb.com> Closes #9558	2021-11-05 11:28:37 +02:00
Raphael S. Carvalho	4950ce539c	schema: replace outdated comment on default compaction strategy Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20211104210043.199156-1-raphaelsc@scylladb.com>	2021-11-05 00:35:41 +02:00
Nadav Har'El	5e52858295	rjson, alternator: rename set() functions add() The rjson::set() sounds like it can set any member of a JSON object (i.e., map), but that's not true :-( It calls the RapidJson function AddMember() so it can only add a member to an object which doesn't have a member with the same name (i.e., key). If it is called with a key that already has a value, the result may have two values for the same key, which is ill-formed and can cause bugs like issue #9542. So in this patch we begin by renaming rjson::set() and its variant to rjson::add() - to suggest to its user that this function only adds members, without checking if they already exist. After this rename, I was left with dozens of calls to the set() functions that need to be changed to either add() - if we're sure that the object cannot already have a member with the same name - or to replace() if it might. The vast majority of the set() calls were starting with an empty item and adding members with fixed (string constant) names, so these can be trivially changed to add(). It turns out that all other set() calls - except the one fixed in issue #9542 - can also use add() because there are various "excuses" why we know the member names will be unique. A typical example is a map with column-name keys, where we know that the column names are unique. I added comments in front of such non-obvious uses of add() which are safe. Almost all uses of rjson except a handful are in Alternator, so I verified that all Alternator test cases continue to pass after this patch. Fixes #9583 Refs #9542 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20211104152540.48900-1-nyh@scylladb.com>	2021-11-04 16:35:38 +01:00
Nadav Har'El	b95e431228	alternator: fix bug in ReturnValues=ALL_NEW This patch fixes a bug in UpdateItem's ReturnValues=ALL_NEW, which in some cases returned the OLD (pre-modification) value of some of the attributes, instead of its NEW value. The bug was caused by a confusion in our JSON utility function, rjson::set(), which sounds like it can set any member of a map, but in fact may only be used to add a new member - if a member with the same name (key) already existed, the result is undefined (two values for the same key). In ReturnValues=ALL_NEW we did exactly this: we started with a copy of the original item, and then used set() to override some of the members. This is not allowed. So in this patch, we introduce a new function, rjson::replace(), which does what we previously thought that rjson::set() does - i.e., replace a member if it exists, or if not, add it. We call this function in the ReturnValues=ALL_NEW code. This patch also adds a test case that reproduces the incorrect ALL_NEW results - and gets fixed by this patch. In an upcoming patch, we should rename the confusingly-named set() functions and audit all their uses. But we don't do this in this patch yet. We just add some comments to clarify what set() does - but don't change it, and just add one new function for replace(). Fixes #9542 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20211104134937.40797-1-nyh@scylladb.com>	2021-11-04 16:34:58 +01:00
Michał Radwański	cac9ac5126	test: memtable: add full_compaction in background Add full compaction in test_memtable_with_many_versions_conforms_to_mutation_source in background. Without it, some paths in the partition snapshot reader weren't covered, as the tests always managed to read all range tombstones and rows which cover a given clustering range from just a single snapshot. Now, when full_compaction happens in process of reading from a clustering range, we can force state refresh with non-nullopt positions of last row and last range tombstone. Note: this inability to test affected only the reversing reader.	2021-11-04 16:19:54 +01:00
Michał Radwański	94b263e356	partition_snapshot_reader: fix obtaining rt_slice, if Reversing and _last_rts was set If Reversing and _last_rts was set, the created rt_slice still contained range tombstones between _last_rts and (snapshot) clustering range end. This is wrong - the correct range is between (snapshot) clustering range begin and _last_rts.	2021-11-04 16:10:07 +01:00
Pavel Emelyanov	6e97d2ce87	Merge branch 'compaction_cleanup_and_improvements_v2' from Raphael S. Carvalho Cleanup and improvements for compaction * 'compaction_cleanup_and_improvements_v2' of https://github.com/raphaelsc/scylla: compaction: fix outdated doc of compact_sstables() table: fix indentation in compact_sstables() table: give a more descriptive name to compaction_data in compact_sstables() compaction_manager: rename submit_major_compaction to perform_major_compaction compaction: fix indentation in compaction.hh compaction: move incremental_owned_ranges_checker into cleanup_compaction compaction: make owned ranges const in cleanup_compaction compaction: replace outdated comment in regular_compaction compaction: give a more descriptive name to compaction_data compaction_manager: simplify creation of compaction_data	2021-11-04 17:27:07 +03:00
Raphael S. Carvalho	132a840ed5	compaction: fix outdated doc of compact_sstables() Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2021-11-04 11:09:24 -03:00
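
The private-tag technique described in commit `e991604918` can be illustrated with a minimal sketch. This is not the Scylla code: the class bodies are hypothetical stand-ins, `private_tag` and `built_name()` are invented names, and `std::make_shared` is used in place of Seastar's `make_lw_shared()` so the example is self-contained.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <type_traits>
#include <utility>

// Hypothetical stand-in for the real schema class; members are illustrative.
class schema {
    // Private tag type: only schema and its friends can name or create it.
    struct private_tag {
        explicit private_tag() = default;  // explicit, so a bare {} cannot create one
    };
    friend class schema_builder;

    std::string _name;

public:
    // The constructor is public, so std::make_shared can invoke it, but a
    // caller still has to supply a private_tag instance.
    schema(private_tag, std::string name) : _name(std::move(name)) {}

    const std::string& name() const { return _name; }
};

// Code that is not a friend cannot construct a schema from a name alone.
static_assert(!std::is_constructible_v<schema, std::string>);

class schema_builder {
    std::string _name;

public:
    explicit schema_builder(std::string name) : _name(std::move(name)) {}

    // Builds the schema directly inside the shared allocation, so no
    // temporary schema has to be built and then moved.
    std::shared_ptr<schema> build() const {
        return std::make_shared<schema>(schema::private_tag{}, _name);
    }
};

// Usage sketch: the builder is the only way to obtain a schema here.
inline std::string built_name() {
    auto s = schema_builder("ks.tbl").build();
    assert(s != nullptr);
    return s->name();
}
```

Compared with befriending the allocation helper, this keeps construction restricted to the classes that `schema` names as friends, whichever helper ends up calling the constructor.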

28831 Commits
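
The use-after-move pattern fixed in commit `96421e7779` (`f(x.sth(), std::move(x))`) can be sketched as follows. The `store` and `store_safely` functions and the `stored` struct are hypothetical and are not taken from memtable.cc; the sketch only shows one common way to make the read happen before the move.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>

// Hypothetical result type, used only to observe what the callee received.
struct stored {
    std::size_t reported_size;
    std::string payload;
};

// Takes ownership of payload by value: the move out of the caller's object
// happens while the arguments of the call are being evaluated.
inline stored store(std::size_t reported_size, std::string payload) {
    return {reported_size, std::move(payload)};
}

// Risky form, left as a comment: C++ does not specify the order in which the
// two arguments are evaluated, so x may already be moved from when x.size()
// runs, depending on the compiler.
//   store(x.size(), std::move(x));

// Safe form: read everything needed from x in a separate statement, which is
// sequenced before the call that moves x.
inline stored store_safely(std::string x) {
    const std::size_t size = x.size();
    return store(size, std::move(x));
}
```

Hoisting the read into a named local costs nothing at run time and removes the dependence on compiler-specific evaluation order.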