scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-01 12:36:56 +00:00

Author	SHA1	Message	Date
Raphael Carvalho	370b1336fe	service: fix refresh Vlad and I were working on finding the root of the problems with refresh. We found that refresh was deleting existing sstable files because of a bug in a function that was supposed to return the maximum generation of a column family. The intention of this function is to get generation from last element of column_family::_sstables, which is of type std::map. However, we were incorrectly using std::map::end() to get last element, so garbage was being read instead of maximum generation. If the garbage value is lower than the minimum generation of a column family, then reshuffle_sstables() would set generation of all existing sstables to a lower value. That would confuse our mechanism used to delete sstables because sstables loaded at boot stage were touched. Solution to this problem is about using rbegin() instead of end() to get last element from column_family::_sstables. The other problem is that refresh will only load generations that are larger than or equal to X, so new sstables with lower generation will not be loaded. Solution is about creating a set with generation of live SSTables from all shards, and using this set to determine whether a generation is new or not. The last change was about providing an unused generation to reshuffle procedure by adding one to the maximum generation. That's important to prevent reshuffle from touching an existing SSTable. Tested 'refresh' under the following scenarios: 1) Existing generations: 1, 2, 3, 4. New ones: 5, 6. 2) Existing generations: 3, 4, 5, 6. New ones: 1, 2. 3) Existing generations: 1, 2, 3, 4. New ones: 7, 8. 4) No existing generation. No new generation. 5) No existing generation. New ones: 1, 2. I also had to adapt existing testcase for reshuffle procedure. Fixes #1073. Signed-off-by: Raphael Carvalho <raphaelsc@scylladb.com> Message-Id: <1c7b8b7f94163d5cd00d90247598dd7d26442e70.1458694985.git.raphaelsc@scylladb.com>	2016-03-23 10:21:58 +02:00
Benoît Canet	1594bdd5bb	dist/ubuntu: Fix the init script variable sourcing The variable sourcing was crashing the init script on ubuntu. Fix it with the suggestion from Avi. Signed-off-by: Benoît Canet <benoit@scylladb.com> Message-Id: <1458685099-1160-1-git-send-email-benoit@scylladb.com>	2016-03-23 09:03:17 +02:00
Tomasz Grabiec	5f44afa311	cql3: batch_statement: Execute statements sequentially Currently we execute all statements in parallel, but some statements depend on order, in particular list append/prepend. Fix by executing sequentially. Fixes cql_additional_tests.py:TestCQL.batch_and_list_test dtest. Fixes #1075. Message-Id: <1458672874-4749-1-git-send-email-tgrabiec@scylladb.com>	2016-03-22 20:59:40 +02:00
Pekka Enberg	354fca9d56	Revert "streaming: Simplify session completion logic" This reverts commit `208b7fa7ba`. It breaks Glauber's upcoming repair series.	2016-03-22 20:37:50 +02:00
Pekka Enberg	1f29a698d5	Revert "streaming: Start to send mutations after PREPARE_DONE_MESSAGE" This reverts commit `4c06221766`. It breaks Glauber's upcoming repair series.	2016-03-22 20:37:22 +02:00
Avi Kivity	7df21768d6	Merge "Fix row_cache_alloc_stress test" from Tomasz "The test predates LSA zones and was not anticipating that LSA would take much more free memory from the system than it needs in its assertions. Fix by accounting for the fact properly."	2016-03-22 18:46:31 +02:00
Avi Kivity	b8f80bb2be	Update scylla-ami submodule * dist/ami/files/scylla-ami 56f1ab7...89e7436 (1): > Merge "iotune packaging fix for scylla-ami" from Takuya	2016-03-22 17:55:00 +02:00
Takuya ASADA	dac2bc3055	dist: on scylla_io_setup, SMP and CPUSET should be empty when the parameter not present Fixes #1060 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1458659928-2050-1-git-send-email-syuu@scylladb.com>	2016-03-22 17:49:06 +02:00
Avi Kivity	8cf785e53a	Merge "Merge "iotune packaging fix" from Takuya "This implements #1065 - iotune will NOT be a part of scylla service - remove the scylla.io.service - User will have to run it manually - using a script called scylla_io_tune_setup (that will do the exact same thing the service does today) - if they won't, and do not use --developer-mode, scylla init will fail with a proper error - scylla will not start (in the same manner it does not start if you run scylla on non XFS FS) - For c3,m3,i2 we will use the evaluation formula we have (that takes the number of disks, cores etc.) - For other instances we will set --developer-mode. If the user logs into the instance, he will get a developer-mode warning - No iotune on AWS" Fixes #1065.	2016-03-22 17:46:32 +02:00
Takuya ASADA	9889712d43	dist: remove scylla-io-setup.service and make it standalone script	2016-03-22 17:45:58 +02:00
Takuya ASADA	2cedab07f2	dist: on scylla_io_setup print out message both for stdout and syslog	2016-03-22 17:45:58 +02:00
Takuya ASADA	83112551bb	dist: introduce dev-mode.conf and scylla_dev_mode_setup	2016-03-22 17:45:58 +02:00
Tomasz Grabiec	a4e3adfbec	Fix assertion in row_cache_alloc_stress Fixes the following assertion failure: row_cache_alloc_stress: tests/row_cache_alloc_stress.cc:120: main(int, char**)::<lambda()>::<lambda()>: Assertion `mt->occupancy().used_space() < memory::stats().free_memory()' failed. memory::stats()::free_memory() may be much lower than the actual amount of reclaimable memory in the system since LSA zones will try to keep a lot of free segments to themselves. Fix by using actual amount of reclaimable memory in the check.	2016-03-22 16:31:04 +01:00
Tomasz Grabiec	a0cba3c86f	logalloc: Introduce tracker::occupancy() Returns occupancy information for all memory allocated by LSA, including segment pools / zones.	2016-03-22 16:28:10 +01:00
Yoav Kleinberger	97bb7a35d9	tools/scyllatop: some sensible default metrics Previously, if the user did not specify any metrics, scyllatop used whatever it could find. Now we have some preset defaults which are probably more interesting. Signed-off-by: Yoav Kleinberger <yoav@scylladb.com> Message-Id: <1458658804-377-1-git-send-email-yoav@scylladb.com>	2016-03-22 17:04:13 +02:00
Tomasz Grabiec	529c8b8858	logalloc: Rename tracker::occupancy() to region_occupancy()	2016-03-22 14:56:44 +01:00
Pekka Enberg	5019b709ba	service/migration_manager: Simplify verb unregistration You can safely unregister verbs even if they're not registered yet. Simplify code in migration manager by dropping the redundant checks. Message-Id: <1458027669-6517-1-git-send-email-penberg@scylladb.com>	2016-03-22 15:24:55 +02:00
Pekka Enberg	3e1a660839	Merge seastar upstream * seastar c193821...9f2b868 (4): > memory: set free memory to non-zero value in debug mode > Merge "Increase IOTune's robustness by including a timeout" from Glauber > shared_future: add companion class, shared_promise > rpc: fix client connection stopping	2016-03-22 15:16:21 +02:00
Asias He	4c06221766	streaming: Start to send mutations after PREPARE_DONE_MESSAGE Below are 3 possible cases in a stream session, after commit `208b7fa7ba` (streaming: Simplify session completion logic). We might close the session before the exchange of the PREPARE_DONE_MESSAGE message in case 1). To fix, we defer the sending of mutations after PREPARE_DONE_MESSAGE is sent at the initiator node. 1) Initiator Follower tx rx tx rx 1 0 0 1 send prepare send back prepare recv prepare send mutations (close the session before prepare_done msg is sent) recv mutations (close session before prepare_done msg is received) send prepare_done recv prepare_done and send no mutations 2) Initiator Follower tx rx tx rx 0 1 1 0 send prepare send back prepare recv prepare nothing to send send prepare_done recv prepare_done and send mutations (close session) recv mutations (close session) 3) Initiator Follower tx rx tx rx 1 1 1 1 send prepare send back prepare recv prepare send mutations recv mutations, can not close session since we have mutations to send send prepare_done recv prepare_done and send mutations (close session) recv mutations (close session) Message-Id: <d6510b558565db23202164fa491b883ef3796e58.1458634037.git.asias@scylladb.com>	2016-03-22 15:05:57 +02:00
Takuya ASADA	6b2a8a2f70	dist: enable collectd on scylla_setup by default, to make scyllatop usable Fixes #1037 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1458324769-9152-1-git-send-email-syuu@scylladb.com>	2016-03-22 15:02:18 +02:00
Tomasz Grabiec	ca08db504b	managed_bytes: Make operator[] work for large blobs as well Fixes assertion in mutation_test: mutation_test: ./utils/managed_bytes.hh:349: blob_storage::char_type* managed_bytes::data(): Assertion `!_u.ptr->next' Introduced in `ea7c2dd085` Message-Id: <1458648786-9127-1-git-send-email-tgrabiec@scylladb.com>	2016-03-22 14:43:52 +02:00
Gleb Natapov	1e6352e398	messaging: do not admit new requests during messaging service shutdown. Sending a message may open new client connection which will never be closed in case messaging service is shutting down already. Fixes #1059 Message-Id: <1458639452-29388-3-git-send-email-gleb@scylladb.com>	2016-03-22 13:00:18 +02:00
Gleb Natapov	357c91a076	messaging: do not delete client during messaging service shutdown Messaging service stop() method calls stop() on all clients. If remove_rpc_client_one() is called while those stops are running, client::stop() will be called twice, which is not supposed to happen. Fix it by ignoring client remove request during messaging service shutdown. Fixes #1059 Message-Id: <1458639452-29388-2-git-send-email-gleb@scylladb.com>	2016-03-22 13:00:18 +02:00
Asias He	b8abd88841	messaging_service: Take reference of ms in send_message_timeout_and_retry Take a reference of messaging_service object inside send_message_timeout_and_retry to make sure it is not freed during the life time of send_message_timeout_and_retry operation.	2016-03-22 12:32:19 +02:00
Pekka Enberg	ae33e9fe76	dist/ubuntu: Use tilde for release candidate builds The version number ordering rules are different for rpm and deb. Use tilde ('~') for the latter to ensure a release candidate is ordered _before_ a final version. Message-Id: <1458627524-23030-1-git-send-email-penberg@scylladb.com>	2016-03-22 11:52:05 +02:00
Avi Kivity	5a20a70728	Merge "CQL syntax extension to handle sstable loader lists" from Calle "Adds an extension function SCYLLA_TIMEUUID_LIST_INDEX to CQL syntax for collection element indexing, which, if the target is a list, will attempt to directly index the list (which is really a map) by the ordering time uuid (as index parameter)."	2016-03-22 11:42:47 +02:00
Duarte Nunes	36571a2018	init: Trim spaces in seeds list This patch ensures we are resilient against spaces before or after IP addresses in the seeds list. Fixes #958 Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1458637617-5761-1-git-send-email-duarte@scylladb.com>	2016-03-22 11:10:29 +02:00
Avi Kivity	1798889e85	Merge "Make apply() exception-safe" from Tomasz "We cannot leave partially applied mutation behind when the write fails. It may fail if memory allocation fails in the middle of apply(). This for example would violate write atomicity, readers should either see the whole write or none at all. This fix makes apply() revert partially applied data upon failure, by the means of ReversiblyMergeable concept. In a nutshell the idea is to store old state in the source mutation as we apply it and swap back in case of exception. At cell level this swapping is inexpensive, just rewiring pointers. For this to work, the source mutation needs to be brought into mutable form, so frozen mutations need to be unfrozen. In practice this doesn't increase amount of cell allocations in the memtable apply path because incoming data will usually be newer and we will have to copy it into LSA anyway. There are extra allocations though for the data structures which hold cells. I didn't see significant change in performance of: build/release/tests/perf/perf_simple_query -c1 -m1G --write --duration 13 The score fluctuates around ~77k ops/s. The change was tested with a unit test (patch to mutation_test) which generates random mutations and injects allocation failures at every possible allocation site in the apply path. This also uncovered other preexisting bugs."	2016-03-22 10:43:41 +02:00
Gleb Natapov	ea92064d38	avoid invoke_on_all during developer-mode application if possible Message-Id: <20160315145327.GW6117@scylladb.com>	2016-03-22 10:40:30 +02:00
Nadav Har'El	2eb0627665	sstable: fix use-after-free of temporary ioclass copy Commit `6a3872b355` fixed some use-after-free bugs but introduced a new one because of a typo: Instead of capturing a reference to the long-living io-class object, as all the code does, one place in the code accidentally captured a copy of this object. This copy had a very temporary life, and when a reference to that copy was passed to sstable reading code which assumed that it lives at least as long as the read call, a use-after-free resulted. Fixes #1072 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <1458595629-9314-1-git-send-email-nyh@scylladb.com>	2016-03-21 22:28:05 +01:00
Tomasz Grabiec	6e73c3f3dc	perf_simple_query: Make duration configurable	2016-03-21 21:49:53 +01:00
Tomasz Grabiec	2fbb55929d	mutation_test: Add allocation failure stress test for apply() The test injects allocation failures at every allocation site during apply(). Only allocations through allocation_strategy are instrumented, but currently those should include all allocations in the apply() path. The target and source mutations are randomized.	2016-03-21 21:49:53 +01:00
Tomasz Grabiec	8ede27f9c6	mutation_test: Add more apply() tests	2016-03-21 21:49:53 +01:00
Tomasz Grabiec	36575d9f01	mutation_test: Hoist make_blob() to a function	2016-03-21 21:49:53 +01:00
Tomasz Grabiec	4c85d06df7	mutation_test: Make make_blob() return different blob each time random_bytes was constructed with the same seed each time.	2016-03-21 21:49:53 +01:00
Tomasz Grabiec	19b3df9f0f	mutation_test: Fix use-after-free The problem was that verify_row() was returning a future which was not waited on. Fix by running the code in a thread.	2016-03-21 21:49:53 +01:00
Tomasz Grabiec	a7966e9b71	mutation_partition: Fix friend declarations Missing "class" confuses CLion IDE.	2016-03-21 21:49:53 +01:00
Tomasz Grabiec	dc290f0af7	mutation_partition: Make apply() atomic even in case of exception We cannot leave partially applied mutation behind when the write fails. It may fail if memory allocation fails in the middle of apply(). This for example would violate write atomicity, readers should either see the whole write or none at all. This fix makes apply() revert partially applied data upon failure, by the means of ReversiblyMergeable concept. In a nutshell the idea is to store old state in the source mutation as we apply it and swap back in case of exception. At cell level this swapping is inexpensive, just rewiring pointers. For this to work, the source mutation needs to be brought into mutable form, so frozen mutations need to be unfrozen. In practice this doesn't increase amount of cell allocations in the memtable apply path because incoming data will usually be newer and we will have to copy it into LSA anyway. There are extra allocations though for the data structures which hold cells. I didn't see significant change in performance of: build/release/tests/perf/perf_simple_query -c1 -m1G --write --duration 13 The score fluctuates around ~77k ops/s. Fixes #283.	2016-03-21 21:49:52 +01:00
Tomasz Grabiec	e09d186c7c	mutation_partition: Make intrusive sets ReversiblyMergeable	2016-03-21 21:49:52 +01:00
Tomasz Grabiec	f1a4feb1fc	mutation_partition: Make row_tombstones_entry ReversiblyMergeable	2016-03-21 19:26:24 +01:00
Tomasz Grabiec	e4a576a90f	mutation_partition: Make rows_entry ReversiblyMergeable	2016-03-21 19:26:24 +01:00
Tomasz Grabiec	aadcd75d89	mutation_partition: Make row_marker ReversiblyMergeable	2016-03-21 19:26:24 +01:00
Tomasz Grabiec	ea7c2dd085	mutation_partition: Make row ReversiblyMergeable	2016-03-21 19:26:24 +01:00
Tomasz Grabiec	c9d4f5a49c	atomic_cell_or_collection: Introduce as_atomic_cell_ref() Needed for setting the REVERT flag on existing cell.	2016-03-21 19:25:54 +01:00
Tomasz Grabiec	1ffe06165d	atomic_cell_hash: Specialize appending_hash<> for atomic_cell and collection_mutation	2016-03-21 18:41:27 +01:00
Tomasz Grabiec	bfc6413414	atomic_cell: Add REVERT flag Needed to make atomic cells ReversiblyMergeable.	2016-03-21 18:41:27 +01:00
Tomasz Grabiec	7fcfa97916	tombstone: Make ReversiblyMergeable	2016-03-21 18:41:27 +01:00
Tomasz Grabiec	1407173186	Introduce the concept of ReversiblyMergeable	2016-03-21 18:41:27 +01:00
Tomasz Grabiec	9fc7f8a5ed	mutation_partition: row: Add empty()	2016-03-21 18:41:27 +01:00
Tomasz Grabiec	d5e66a5b0d	mutation_partition: row: Allow storing empty cells internally Currently only "set" storage could store empty cells, but not the "vector" one, because there an empty cell means the cell is missing. To implement rollback, we need to be able to distinguish empty cells from missing ones. Solve by making vector storage use a bitmap for presence checking instead of emptiness. This adds 4 bytes to vector storage.	2016-03-21 18:41:27 +01:00
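
The presence-bitmap idea from commit d5e66a5b0d can be illustrated with a minimal C++ sketch. This is not the Scylla implementation: the `vector_storage` class, the `cell` alias, and the 32-column limit are all assumptions made for illustration (the 32-bit mask is chosen only to match the "4 bytes" mentioned in the message). It shows how a bitmap lets a dense vector tell an empty cell apart from a missing one.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a cell; an empty string models an "empty cell".
using cell = std::string;

// Illustrative vector-backed storage that tracks presence in a bitmap
// rather than inferring it from whether the cell is empty.
class vector_storage {
    static constexpr std::size_t max_columns = 32;  // assumption: one bit per column
    std::uint32_t _present = 0;                     // the extra 4 bytes
    std::vector<cell> _cells;
public:
    // Stores a cell (possibly empty) and marks the column as present.
    void set(std::size_t id, cell c) {
        assert(id < max_columns);
        if (id >= _cells.size()) {
            _cells.resize(id + 1);
        }
        _cells[id] = std::move(c);
        _present |= std::uint32_t(1) << id;
    }
    // Marks the column as missing; the slot itself stays in the vector.
    void erase(std::size_t id) {
        if (id < _cells.size()) {
            _cells[id].clear();
            _present &= ~(std::uint32_t(1) << id);
        }
    }
    // Presence is decided by the bitmap alone.
    bool has(std::size_t id) const {
        return id < max_columns && ((_present >> id) & 1u) != 0;
    }
    // Returns nullptr for a missing cell, a valid pointer for an empty one.
    const cell* find(std::size_t id) const {
        return has(id) ? &_cells[id] : nullptr;
    }
};
```

With this layout, `set(2, "")` leaves column 2 present but empty, while column 1 (never set) is reported as missing, which is the distinction the rollback logic is described as needing.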


11716 Commits
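
The `end()` versus `rbegin()` bug described in commit 370b1336fe ("service: fix refresh") can be sketched as follows. This is a minimal illustration under stated assumptions, not the code from that commit: `sstable_map`, `max_generation`, and `next_unused_generation` are hypothetical names, the map value type is a placeholder, and treating an empty map as "maximum 0" is an assumption. The sketch requires C++17 for `std::optional`.

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <optional>
#include <string>

// Hypothetical stand-in for column_family::_sstables: generation -> sstable.
using sstable_map = std::map<std::int64_t, std::string>;

// Returns the largest generation, or nullopt when there are no sstables.
std::optional<std::int64_t> max_generation(const sstable_map& sstables) {
    if (sstables.empty()) {
        return std::nullopt;
    }
    // end() is a past-the-end iterator and must not be dereferenced;
    // rbegin() points at the last element, which holds the largest key.
    return sstables.rbegin()->first;
}

// One past the maximum, so a reshuffle never reuses an existing generation.
// Assumption: with no sstables the maximum is treated as 0.
std::int64_t next_unused_generation(const sstable_map& sstables) {
    return max_generation(sstables).value_or(0) + 1;
}
```

For existing generations 3, 4, 5, 6 (scenario 2 in the commit message), `max_generation` yields 6 and `next_unused_generation` yields 7.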