scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-31 20:16:43 +00:00

Author	SHA1	Message	Date
Avi Kivity	e06e795031	Merge "database: assign proper io priority for streaming view updates" from Piotr " Streamed view updates parasitized on writing io priority, which is reserved for user writes - it's now properly bound to streaming write priority. Verified manually by checking appropriate io metrics: scylla_io_queue_total_bytes{class="streaming_write" ...} vs scylla_io_queue_total_bytes{class="query" ...} Tests: unit(dev) " Fixes #4615. * 'assign_proper_io_priority_to_streaming_view_updates' of https://github.com/psarna/scylla: db,view: wrap view update generation in stream scheduling group database: assign proper io priority for streaming view updates (cherry picked from commit `2c7435418a`)	2019-08-22 16:20:19 +03:00
Piotr Sarna	7d56e8e5bb	storage_proxy: fix iterator liveness issue in on_down (#4876 ) The loop over view update handlers used a guard in order to ensure that the object is not prematurely destroyed (thus invalidating the iterator), but the guard itself was not in the right scope. Fixed by replacinga 'for' loop with a 'while' loop, which moves the iterator incrementation inside the scope in which it's still guarded and valid. Fixes #4866 (cherry picked from commit `526f4c42aa`)	2019-08-21 19:04:56 +03:00
Avi Kivity	417250607b	relocatable: switch from run-time relocation to install-time relocation Our current relocation works by invoking the dynamic linker with the executable as an argument. This confuses gdb since the kernel records the dynamic linker as the executable, not the real executable. Switch to install-time relocation with patchelf: when installing the executable and libraries, all paths are known, and we can update the path to the dynamic loader and to the dynamic libraries. Since patchelf itself is dynamically linked, we have to relocate it dynamically (with the old method of invoking it via the dynamic linker). This is okay since it's a one-time operation and since we don't expect to debug core dumps of patchelf crashes. We lose the ability to run scylla directly from the uninstalled tarball, but since the nonroot installer is already moving in the direction of requiring install.sh, that is not a great loss, and certainly the ability to debug is more important. dh_strip barfs on some binaries which were treated with patchelf, so exclude them from dh_strip. This doesn't lose any functionality, since these binaries didn't have debug information to begin with (they are already-stripped Fedora executables). Fixes #4673. (cherry-picked from commit `698b72b501`) Backport notes: - 3.1 doesn't call install.sh from the debian packager, so add an adjust_bin and call it from the debian rules file directly - adjusted install.sh for 3.1 prefix (/usr) compared to master prefix (/opt/scylladb)	2019-08-20 17:08:49 +03:00
Pekka Enberg	d06bcef3b7	Merge "docker: relax permission checks" from Avi "Commit `e3f7fe4` added file owner validation to prevent Scylla from crashing when it tries to touch a file it doesn't own. However, under docker, we cannot expect to pass this check since user IDs are from different namespaces: the process runs in a container namespace, but the data files usually come from a mounted volume, and so their uids are from the host namespace. So we need to relax the check. We do this by reverting `b1226fb`, which causes Scylla to run as euid 0 in docker, and by special-casing euid 0 in the ownership verification step. Fixes #4823." * 'docker-euid-0' of git://github.com/avikivity/scylla: main: relax file ownership checks if running under euid 0 Revert "dist/docker/redhat: change user of scylla services to 'scylla'" (cherry picked from commit `595434a554`)	2019-08-14 08:31:10 +03:00
Tomasz Grabiec	50c5cb6861	Merge "Multishard combining reader more robust reader recreation" from Botond Make the reader recreation logic more robust, by moving away from deciding which fragments have to be dropped based on a bunch of special cases, instead replacing this with a general logic which just drops all already seen fragments (based on their position). Special handling is added for the case when the last position is a range tombstone with a non full prefix starting position. Reproducer unit tests are added for both cases. Refs #4695 Fixes #4733 (cherry picked from commit `0cf4fab2ca`)	2019-08-14 08:30:53 +03:00
Kamil Braun	70f5154109	Fix command line argument parsing in main. Command line arguments are parsed twice in Scylla: once in main and once in Seastar's app_template::run. The first parse is there to check if the "--version" flag is present --- in this case the version is printed and the program exists. The second parsing is correct; however, most of the arguments were improperly treated as positional arguments during the first parsing (e.g., "--network host" would treat "host" as a positional argument). This happened because the arguments weren't known to the command line parser. This commit fixes the issue by moving the parsing code until after the arguments are registered. Resolves #4141. Signed-off-by: Kamil Braun <kbraun@scylladb.com> (cherry picked from commit `f155a2d334`)	2019-08-13 20:11:26 +03:00
Rafael Ávila de Espíndola	329c419c30	Always close commitlog files We were using segment::_closed to decide whether _file was already closed. Unfortunately they are not exactly the same thing. As far as I understand it, segments can be closed and reused without actually closing the file. Found with a seastar patch that asserts on destroying an open append_challenged_posix_file_impl. Fixes #4745. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20190721171332.7995-1-espindola@scylladb.com> (cherry picked from commit `636e2470b1`)	2019-08-13 19:59:13 +03:00
Avi Kivity	062d43c76e	Merge "Unbreak the Unbreakable Linux" from Glauber " scylla_setup is currently broken for OEL. This happens because the OS detection code checks for RHEL and Fedora. CentOS returns itself as RHEL, but OEL does not. " Fixes #4842. * 'unbreakable' of github.com:glommer/scylla: scylla_setup: be nicer about unrecognized OS scylla_util: recognize OEL as part of the RHEL family (cherry picked from commit `1cf72b39a5`)	2019-08-13 16:52:05 +03:00
Avi Kivity	cf4c238b28	Merge "Catch unclosed partition sstable write #4794 " from Tomasz " Not emitting partition_end for a partition is incorrect. SStable writer assumes that it is emitted. If it's not, the sstable will not be written correctly. The partition index entry for the last partition will be left partially written, which will result in errors during reads. Also, statistics and sstable key ranges will not include the last partition. It's better to catch this problem at the time of writing, and not generate bad sstables. Another way of handling this would be to implicitly generate a partition_end, but I don't think that we should do this. We cannot trust the mutation stream when invariants are violated, we don't know if this was really the last partition which was supposed to be written. So it's safer to fail the write. Enabled for both mc and la/ka. Passing --abort-on-internal-error on the command line will switch to aborting instead of throwing an exception. The reason we don't abort by default is that it may bring the whole cluster down and cause unavailability, while it may not be necessary to do so. It's safer to fail just the affected operation, e.g. repair. However, failing the operation with an exception leaves little information for debugging the root cause. So the idea is that the user would enable aborts on only one of the nodes in the cluster to get a core dump and not bring the whole cluster down. " * 'catch-unclosed-partition-sstable-write' of https://github.com/tgrabiec/scylla: sstables: writer: Validate that partition is closed when the input mutation stream ends config, exceptions: Add helper for handling internal errors utils: config_file: Introduce named_value::observe() (cherry picked from commit `95c0804731`)	2019-08-08 13:13:42 +02:00
Amnon Heiman	20090c1992	init: do not allow replace-address for seeds If a node is a seed node, it can not be started with replace-address-first-boot or the replace-address flag. The issue is that as a seed node it will generate new tokens instead of replacing the existing one the user expect it to replaec when supplying the flags. This patch will throw a bad_configuration_error exception in this case. Fixes #3889 Signed-off-by: Amnon Heiman <amnon@scylladb.com> (cherry picked from commit `399d79fc6f`)	2019-08-07 22:04:58 +03:00
Raphael S. Carvalho	8ffb567474	table: do not rely on undefined behavior in cleanup_sstables It shouldn't rely on argument evaluation order, which is ub. Fixes #4718. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> (cherry picked from commit `0e732ed1cf`)	2019-08-07 21:48:44 +03:00
Tomasz Grabiec	710ec83d12	Merge "Fix the system.size_estimates table" from Kamil Fixes a segfault when querying for an empty keyspace. Also, fixes an infinite loop on smp > 1. Queries to system.size_estimates table which are not single-partition queries caused Scylla to go into an infinite loop inside multishard_combining_reader::fill_buffer. This happened because multishard_combinind_reader assumes that shards return rows belonging to separate partitions, which was not the case for size_estimates_mutation_reader. Fixes #4689. (cherry picked from commit `14700c2ac4`)	2019-08-07 21:38:38 +03:00
Asias He	8d7c489436	streaming: Move stream_mutation_fragments_cmd to a new file (#4812 ) Avoid including the lengthy stream_session.hh in messaging_service. More importantly, fix the build because currently messaging_service.cc and messaging_service.hh does not include stream_mutation_fragments_cmd. I am not sure why it builds on my machine. Spotted this when backporting the "streaming: Send error code from the sender to receiver" to 3.0 branch. Refs: #4789 (cherry picked from commit `49a73aa2fc`)	2019-08-07 19:11:51 +02:00
Asias He	6ec558e3a0	streaming: Send error code from the sender to receiver In case of error on the sender side, the sender does not propagate the error to the receiver. The sender will close the stream. As a result, the receiver will get nullopt from the source in get_next_mutation_fragment and pass mutation_fragment_opt with no value to the generating_reader. In turn, the generating_reader generates end of stream. However, the last element that the generating_reader has generated can be any type of mutation_fragment. This makes the sstable that consumes the generating_reader violates the mutation_fragment stream rule. To fix, we need to propagate the error. However RPC streaming does not support propagate the error in the framework. User has to send an error code explicitly. Fixes: #4789 (cherry picked from commit `bac987e32a`) (cherry picked from commit `288371ce75`)	2019-08-07 19:11:33 +02:00
Tomasz Grabiec	b1e2842c8c	sstables: ka/la: reader: Make sure push_ready_fragments() does not miss to emit partition_end Currently, if there is a fragment in _ready and _out_of_range was set after row end was consumer, push_ready_fragments() would return without emitting partition_end. This is problematic once we make consume_row_start() emit partiton_start directly, because we will want to assume that all fragments for the previous partition are emitted by then. If they're not, then we'd emit partition_start before partition_end for the previous partition. The fix is to make sure that push_ready_fragments() emits everything. Fixes #4786 (cherry picked from commit `9b8ac5ecbc`) Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-08-01 13:06:08 +03:00
Avi Kivity	5a273737e3	Update seastar submodule * seastar 82637dcab4...c59d019d6b (1): > reactor: fix deadlock of stall detector vs dlopen Fixes #4759.	2019-07-31 18:31:40 +03:00
Avi Kivity	b0d2312623	toppartitions: fix race between listener removal and reads Data listener reads are implemented as flat_mutation_readers, which take a reference to the listener and then execute asynchronously. The listener can be removed between the time when the reference is taken and actual execution, resulting in a dangling pointer dereference. Fix by using a weak_ptr to avoid writing to a destroyed object. Note that writes don't need protection because they execute atomically. Fixes #4661. Tests: unit (dev) (cherry picked from commit `e03c7003f1`)	2019-07-28 13:53:40 +03:00
Avi Kivity	2f007d8e6b	sstable: index_reader: close index_reader::reader more robustly If we had an error while reading, then we would have failed to close the reader, which in turn can cause memory corruption. Make the closing more robust by using then_wrapped (that doesn't skip on exception) and log the error for analysis. Fixes #4761. (cherry picked from commit `b272db368f`)	2019-07-27 18:19:57 +03:00
yaronkaikov	bebfd7b26c	release: prepare for 3.1.0.rc3 scylla-3.1.0.rc3	2019-07-25 12:15:55 +03:00
Tomasz Grabiec	03b48b2caf	database: Add missing partition slicing on streaming reader recreation streaming_reader_lifecycle_policy::create_reader() was ignoring the partition_slice passed to it and always creating the reader for the full slice. That's wrong because create_reader() is called when recreating a reader after it's evicted. If the reader stopped in the middle of partition we need to start from that point. Otherwise, fragments in the mutation stream will appear duplicated or out of ordre, violating assumptions of the consumers. This was observed to result in repair writing incorrect sstables with duplicated clustering rows, which results in malformed_sstable_exception on read from those sstables. Fixes #4659. In v2: - Added an overload without partition_slice to avoid changing existing users which never slice Tests: - unit (dev) - manual (3 node ccm + repair) Backport: 3.1 Reviewd-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <1563451506-8871-1-git-send-email-tgrabiec@scylladb.com> (cherry picked from commit `7604980d63`)	2019-07-22 15:08:09 +03:00
Avi Kivity	95362624bc	Merge "Fix disable_sstable_write synchronization with on_compaction_completion" from Benny " disable_sstable_write needs to acquire _sstable_deletion_sem to properly synchronize with background deletions done by on_compaction_completion to ensure no sstables will be created or deleted during reshuffle_sstables after storage_service::load_new_sstables disables sstable writes. Fixes #4622 Test: unit(dev), nodetool_additional_test.py migration_test.py " * 'scylla-4622-fix-disable-sstable-write' of https://github.com/bhalevy/scylla: table: document _sstables_lock/_sstable_deletion_sem locking order table: disable_sstable_write: acquire _sstable_deletion_sem table: uninline enable_sstable_write table: reshuffle_sstables: add log message (cherry picked from commit `43690ecbdf`)	2019-07-22 13:47:25 +03:00
Asias He	7865c314a5	repair: Avoid deadlock in remove_repair_meta Start n1, n2 Create ks with rf = 2 Run repair on n2 Stop n2 in the middle of repair n1 will notice n2 is DOWN, gossip handler will remove repair instance with n2 which calls remove_repair_meta(). Inside remove_repair_meta(), we have: ``` 1 return parallel_for_each(*repair_metas, [repair_metas] (auto& rm) { 2 return rm->stop(); 3 }).then([repair_metas, from] { 4 rlogger.debug("Removed all repair_meta for single node {}", from); 5 }); ``` Since 3.1, we start 16 repair instances in parallel which will create 16 readers.The reader semaphore is 10. At line 2, it calls ``` 6 future<> stop() { 7 auto gate_future = _gate.close(); 8 auto writer_future = _repair_writer.wait_for_writer_done(); 9 return when_all_succeed(std::move(gate_future), std::move(writer_future)); 10 } ``` The gate protects the reader to read data from disk: ``` 11 with_gate(_gate, [] { 12 read_rows_from_disk 13 return _repair_reader.read_mutation_fragment() --> calls reader() to read data 14 }) ``` So line 7 won't return until all the 16 readers return from the call of reader(). The problem is, the reader won't release the reader semaphore until the reader is destroyed! So, even if 10 out of the 16 readers have finished reading, they won't release the semaphore. As a result, the stop() hangs forever. To fix in short term, we can delete the reader, aka, drop the the repair_meta object once it is stopped. Refs: #4693 (cherry picked from commit `8774adb9d0`)	2019-07-21 13:31:08 +03:00
Asias He	0e6b62244c	streaming: Do not open rpc stream connection if ranges are not relevant to a shard Given a list of ranges to stream, stream_transfer_task will create an reader with the ranges and create a rpc stream connection on all the shards. When user provides ranges to repair with -st -et options, e.g., using scylla-manger, such ranges can belong to only one shard, repair will pass such ranges to streaming. As a result, only one shard will have data to send while the rpc stream connections are created on all the shards, which can cause the kernel run out of ports in some systems. To mitigate the problem, do not open the connection if the ranges do not belong to the shard at all. Refs: #4708 (cherry picked from commit `64a4c0ede2`)	2019-07-21 10:23:49 +03:00
Kamil Braun	9d722a56b3	Fix timestamp_type_impl::timestamp_from_string. Now it accepts the 'z' or 'Z' timezone, denoting UTC+00:00. Fixes #4641. Signed-off-by: Kamil Braun <kbraun@scylladb.com> (cherry picked from commit `4417e78125`)	2019-07-17 21:54:47 +03:00
Eliran Sinvani	7009d5fb23	auth: Prevent race between role_manager and pasword_authenticator When scylla is started for the first time with PasswordAuthenticator enabled, it can be that a record of the default superuser will be created in the table with the can_login and is_superuser set to null. It happens because the module in charge of creating the row is the role manger and the module in charge of setting the default password salted hash value is the password authenticator. Those two modules are started together, it the case when the password authenticator finish the initialization first, in the period until the role manager completes it initialization, the row contains those null columns and any loging attempt in this period will cause a memory access violation since those columns are not expected to ever be null. This patch removes the race by starting the password authenticator and autorizer only after the role manger finished its initialization. Tests: 1. Unit tests (release) 2. Auth and cqlsh auth related dtests. Fixes #4226 Signed-off-by: Eliran Sinvani <eliransin@scylladb.com> Message-Id: <20190714124839.8392-1-eliransin@scylladb.com> (cherry picked from commit `997a146c7f`)	2019-07-15 21:18:05 +03:00
Takuya ASADA	eb49fae020	reloc: provide libthread_db.so.1 to debug thread on gdb In scylla-debuginfo package, we have /usr/lib/debug/opt/scylladb/libreloc/libthread_db-1.0.so-666.development-0.20190711.73a1978fb.el7.x86_64.debug but we actually does not have libthread_db.so.1 in /opt/scylladb/libreloc since it's not available on ldd result with scylla binary. To debug thread, we need to add the library in a relocatable package manually. Fixes #4673 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190711111058.7454-1-syuu@scylladb.com> (cherry picked from commit `842f75d066`)	2019-07-15 14:50:56 +03:00
Asias He	92bf928170	repair: Allow repair when a replica is down Since commit `bb56653` (repair: Sync schema from follower nodes before repair), the behaviour of handling down node during repair has been changed. That is, if a repair follower is down, it will fail to sync schema with it and the repair of the range will be skipped. This means a range can not be repaired unless all the nodes for the replicas are up. To fix, we filter out the nodes that is down and mark the repair is partial and repair with the nodes that are still up. Tests: repair_additional_test:RepairAdditionalTest.repair_with_down_nodes_2b_test Fixes: #4616 Backports: 3.1 Message-Id: <621572af40335cf5ad222c149345281e669f7116.1562568434.git.asias@scylladb.com> (cherry picked from commit `39ca044dab`)	2019-07-11 11:44:49 +03:00
Rafael Ávila de Espíndola	deac0b0e94	mc writer: Fix exception safety when closing _index_writer This fixes a possible cause of #4614. From the backtrace in that issue, it looks like a file is being closed twice. The first point in the backtrace where that seems likely is in the MC writer. My first idea was to add a writer::close and make it the responsibility of the code using the writer to call it. That way we would move work out of the destructor. That is a bit hard since the writer is destroyed from flat_mutation_reader::impl::~consumer_adapter and that would need to get a close function too. This patch instead just fixes an exception safety issue. If _index_writer->close() throws, _index_writer is still valid and ~writer will try to close it again. If the exception was thrown after _completed.set_value(), that would explain the assert about _completed.set_value() being called twice. With this patch the path outside of the destructor now moves the writer to a local variable before trying to close it. Fixes #4614 Message-Id: <20190710171747.27337-1-espindola@scylladb.com> (cherry picked from commit `281f3a69f8`)	2019-07-11 11:42:48 +03:00
kbr-	c294000113	Implement tuple_type_impl::to_string_impl. (#4645 ) Resolves #4633. Signed-off-by: Kamil Braun <kbraun@scylladb.com> (cherry picked from commit `8995945052`)	2019-07-08 11:08:10 +03:00
Avi Kivity	18bb2045aa	Update seastar submodule * seastar 4cdccae53b...82637dcab4 (1): > perftune.py: fix the i3 metal detection pattern Ref #4057.	2019-07-02 13:49:21 +03:00
Avi Kivity	5e3276d08f	Update seastar submodule to point to scylla-seastar.git This allows us to add 3.1 specific patches to Seastar.	2019-07-02 13:47:50 +03:00
Piotr Sarna	acff367ea8	main: stop view builder conditionally The view builder is started only if it's enabled in config, via the view_building=true variable. Unfortunately, stopping the builder was unconditional, which may result in failed assertions during shutdown. To remedy this, view building is stopped only if it was previously started. Fixes #4589 (cherry picked from commit `efa7951ea5`)	2019-06-26 10:45:50 +03:00
Tomasz Grabiec	e39724a343	Merge "Sync schema before repair" from Asias This series makes sure new schema is propagated to repair master and follower nodes before repair. Fixes #4575 * dev.git asias/repair_pull_schema_v2: migration_manager: Add sync_schema repair: Sync schema from follower nodes before repair (cherry picked from commit `269e65a8db`)	2019-06-26 09:35:42 +02:00
Asias He	31c4db83d8	repair: Avoid searching all the rows in to_repair_rows_on_wire The repair_rows in row_list are sorted. It is only possible for the current repair_row to share the same partition key with the last repair_row inserted into repair_row_on_wire. So, no need to search from the beginning of the repair_rows_on_wire to avoid quadratic complexity. To fix, look at the last item in repair_rows_on_wire. Fixes #4580 Message-Id: <08a8bfe90d1a6cf16b67c210151245879418c042.1561001271.git.asias@scylladb.com> (cherry picked from commit `b99c75429a`)	2019-06-25 12:48:37 +02:00
Tomasz Grabiec	433cb93f7a	Merge "Use same schema version for repair nodes" from Asias This patch set fixes repair nodes using different schema version and optimizes the hashing thanks to the fact now all nodes uses same schema version. Fixes: #4549 * seastar-dev.git asias/repair_use_same_schema.v3: repair: Use the same schema version for repair master and followers repair: Hash column kind and id instead of column name and type name (cherry picked from commit `cd1ff1fe02`)	2019-06-23 20:57:16 +03:00
Avi Kivity	f553819919	Merge "Fix infinite paging for indexed queries" from Piotr " Fixes #4569 This series fixes the infinite paging for indexed queries issue. Before this fix, paging indexes tended to end up in an infinite loop of returning pages with 0 results, but has_more_pages flag set to true, which confused the drivers. Tests: unit(dev) Branches: 3.0, 3.1 " * 'fix_infinite_paging_for_indexed_queries' of https://github.com/psarna/scylla: tests: add test case for finishing index paging cql3: fix infinite paging for indexed queries (cherry picked from commit `9229afe64f`)	2019-06-23 20:54:27 +03:00
Nadav Har'El	48c34e7635	storage_proxy: fix race and crash in case of MV and other node shutdown Recently, in merge commit `2718c90448`, we added the ability to cancel pending view-update requests when we detect that the target node went down. This is important for view updates because these have a very long timeout (5 minutes), and we wanted to make this timeout even longer. However, the implementation caused a race: Between creating the update's request handler (create_write_response_handler()) and actually starting the request with this handler (mutate_begin()), there is a preemption point and we may end up deleting the request handler before starting the request. So mutate_begin() must gracefully handle the case of a missing request handler, and not crash with a segmentation fault as it did before this patch. Eventually the lifetime management of request handlers could be refactored to avoid this delicate fix (which requires more comments to explain than code), or even better, it would be more correct to cancel individual writes when a node goes down, not drop the entire handler (see issue #4523). However, for now, let's not do such invasive changes and just fix bug that we set out to fix. Fixes #4386. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190620123949.22123-1-nyh@scylladb.com> (cherry picked from commit `6e87bca65d`)	2019-06-23 20:53:01 +03:00
Nadav Har'El	7f85b30941	Fix deciding whether a query uses indexing The code that decides whether a query should used indexing was buggy - a partition key index might have influenced the decision even if the whole partition key was passed in the query (which effectively means that indexing it is not necessary). Fixes #4539 Closes https://github.com/scylladb/scylla/pull/4544 Merged from branch 'fix_deciding_whether_a_query_uses_indexing' of git://github.com/psarna/scylla tests: add case for partition key index and filtering cql3: fix deciding if a query uses indexing (cherry picked from commit `6aab1a61be`)	2019-06-18 13:25:18 +03:00
Hagit Segev	7d14514b8a	release: prepare for 3.1.0.rc2 scylla-3.1.0.rc2	2019-06-16 20:28:31 +03:00
Piotr Sarna	35f906f06f	tests: add a test case for filtering clustering key The test cases makes sure that clustering key restriction columns are fetched for filtering if they form a clustering key prefix, but not a primary key prefix (partition key columns are missing). Ref #4541 Message-Id: <3612dc1c6c22c59ac9184220a2e7f24e8d18407c.1560410018.git.sarna@scylladb.com> (cherry picked from commit `2c2122e057`)	2019-06-16 14:36:52 +03:00
Piotr Sarna	2c50a484f5	cql3: fix qualifying clustering key restrictions for filtering Clustering key restrictions can sometimes avoid filtering if they form a prefix, but that can happen only if the whole partition key is restricted as well. Ref #4541 Message-Id: <9656396ee831e29c2b8d3ad4ef90c4a16ab71f4b.1560410018.git.sarna@scylladb.com> (cherry picked from commit `c4b935780b`)	2019-06-16 14:36:52 +03:00
Piotr Sarna	24ddb46707	cql3: fix fetching clustering key columns for filtering When a column is not present in the select clause, but used for filtering, it usually needs to be fetched from replicas. Sometimes it can be avoided, e.g. if primary key columns form a valid prefix - then, they will be optimized out before filtering itself. However, clustering key prefix can only be qualified for this optimization if the whole partition key is restricted - otherwise the clustering columns still need to be present for filtering. This commit also fixes tests in cql_query_test suite, because they now expect more values - columns fetched for filtering will be present as well (only internally, the clients receive only data they asked for). Fixes #4541 Message-Id: <f08ebae5562d570ece2bb7ee6c84e647345dfe48.1560410018.git.sarna@scylladb.com> (cherry picked from commit `adeea0a022`)	2019-06-16 14:36:52 +03:00
Dejan Mircevski	f2fc3f32af	tests: Add cquery_nofail() utility Most tests await the result of cql_test_env::execute_cql(). Most would also benefit from reporting errors with top-level location included. Signed-off-by: Dejan Mircevski <dejan@scylladb.com> (cherry picked from commit `a9849ecba7`)	2019-06-16 14:36:52 +03:00
Asias He	c9f488ddc2	repair: Avoid writing row with same partition key and clustering key more than once Consider master: row(pk=1, ck=1, col=10) follower1: row(pk=1, ck=1, col=20) follower2: row(pk=1, ck=1, col=30) When repair runs, master fetches row(pk=1, ck=1, col=20) and row(pk=1, ck=1, col=30) from follower1 and follower2. Then repair master sends row(pk=1, ck=1, col=10) and row(pk=1, ck=1, col=30) to follower1, follower1 will write the row with the same pk=1, ck=1 twice, which violates uniqueness constraints. To fix, we apply the row with same pk and ck into the previous row. We only needs this on repair follower because the rows can come from multiple nodes. While on repair master, we have a sstable writer per follower, so the rows feed into sstable writer can come from only a single node. Tests: repair_additional_test.py:RepairAdditionalTest.repair_same_row_diff_value_3nodes_test Fixes: #4510 Message-Id: <cb4fbba1e10fb0018116ffe5649c0870cda34575.1560405722.git.asias@scylladb.com> (cherry picked from commit `9079790f85`)	2019-06-16 10:23:58 +03:00
Asias He	46498e77b8	repair: Allow repair_row to initialize partially On repair follower node, only decorated_key_with_hash and the mutation_fragment inside repair_row are used in apply_rows() to apply the rows to disk. Allow repair_row to initialize partially and throw if the uninitialized member is accessed to be safe. Message-Id: <b4e5cc050c11b1bafcf997076a3e32f20d059045.1560405722.git.asias@scylladb.com> (cherry picked from commit `912ce53fc5`)	2019-06-16 10:23:50 +03:00
Piotr Jastrzebski	440f33709e	sstables: distinguish empty and missing cellpath Before this patch mc sstables writer was ignoring empty cellpaths. This is a wrong behaviour because it is possible to have empty key in a map. In such case, our writer creats a wrong sstable that we can't read back. This is becaus a complex cell expects cellpath for each simple cell it has. When writer ignores empty cellpath it writes nothing and instead it should write a length of zero to the file so that we know there's an empty cellpath. Fixes #4533 Tests: unit(release) Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <46242906c691a56a915ca5994b36baf87ee633b7.1560532790.git.piotr@scylladb.com> (cherry picked from commit `a41c9763a9`)	2019-06-16 09:04:24 +03:00
Pekka Enberg	34696e1582	dist/docker: Switch to 3.1 release repository	2019-06-14 08:10:02 +03:00
Takuya ASADA	43bb290705	dist/docker/redhat: change user of scylla services to 'scylla' On branch-3.1 / master, we are getting following error: ERROR 2019-06-11 10:58:49,156 [shard 0] database - /var/lib/scylla/data: File not owned by current euid: 0. Owner is: 999 ERROR 2019-06-11 10:58:49,156 [shard 0] init - Failed owner and mode verification: std::runtime_error (File not owned by current euid: 0. Owner is: 999) ERROR 2019-06-11 10:58:49,156 [shard 0] database - /var/lib/scylla/hints: File not owned by current euid: 0. Owner is: 999 ERROR 2019-06-11 10:58:49,156 [shard 0] init - Failed owner and mode verification: std::runtime_error (File not owned by current euid: 0. Owner is: 999) ERROR 2019-06-11 10:58:49,156 [shard 0] database - /var/lib/scylla/commitlog: File not owned by current euid: 0. Owner is: 999 ERROR 2019-06-11 10:58:49,156 [shard 0] init - Failed owner and mode verification: std::runtime_error (File not owned by current euid: 0. Owner is: 999) ERROR 2019-06-11 10:58:49,156 [shard 0] database - /var/lib/scylla/view_hints: File not owned by current euid: 0. Owner is: 999 ERROR 2019-06-11 10:58:49,156 [shard 0] init - Failed owner and mode verification: std::runtime_error (File not owned by current euid: 0. Owner is: 999) It seems like owner verification of data directory fails because scylla-server process is running in root but data directory owned by scylla, so we should run services as scylla user. Fixes #4536 Message-Id: <20190611113142.23599-1-syuu@scylladb.com> (cherry picked from commit `b1226fb15a`)	2019-06-14 08:02:45 +03:00
Calle Wilund	53980816de	api.hh: Fix bool parsing in req_param Fixes #4525 req_param uses boost::lexical cast to convert text->var. However, lexical_cast does not handle textual booleans, thus param=true causes not only wrong values, but exceptions. Message-Id: <20190610140511.15478-1-calle@scylladb.com> (cherry picked from commit `26702612f3`)	2019-06-13 11:56:27 +03:00
Vlad Zolotarov	c1f4617530	fix_system_distributed_tables.py: declare the 'port' argument as 'int' If a port value passed as a string this makes the cluster.connect() to fail with Python3.4. Let's fix this by explicitly declaring a 'port' argument as 'int'. Fixes #4527 Signed-off-by: Vlad Zolotarov <vladz@scylladb.com> Message-Id: <20190606133321.28225-1-vladz@scylladb.com> (cherry picked from commit `20a610f6bc`)	2019-06-13 11:45:54 +03:00

1 2 3 4 5 ...

18631 Commits