scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-28 12:17:02 +00:00

Author	SHA1	Message	Date
Piotr Sarna	add40d4e59	cql3: lift infinite bound check if it's supported If the database supports infinite bound range deletions, CQL layer will no longer throw an error indicating that both ranges need to be specified. [bhalevy] Update test_range_deletion_scenarios unit test accordingly. Fixes #432 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-06-24 15:58:34 +03:00
Piotr Sarna	c19fdc4c90	service: enable infinite bound range deletions with mc As soon as it's agreed that the cluster supports sstables in mc format, infinite bound range deletions in statements can be safely enabled. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-06-24 15:58:28 +03:00
Piotr Sarna	e77ef849af	database: add flag for infinite bound range deletions Database can only support infinite bound range deletions if sstable mc format is supported. As a first step to implement these checks, an appropriate flag is added to database.	2019-06-24 15:57:47 +03:00
Benny Halevy	b1e78313fe	log_histogram: log_heap_options::bucket_of: avoid calling pow2_rank(0) pow2_rank is undefined for 0. bucket_of currently works around that by using a bitmask of 0. To allow asserting that count_{leading,trailing}_zeros are not called with 0, we want to avoid it at all call sites. Fixes #4153 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190623162137.2401-1-bhalevy@scylladb.com>	2019-06-23 19:32:51 +03:00
Avi Kivity	779b378785	Merge "Fix partitioned_sstable_set by making it self sufficient" from Raphael & Benny " partitioned_sstable_set is not self sufficient because it relies on compatible_ring_position_view, which in turn relies on lifetime of sstable object. This leads to use-after-free. Fix this problem by introducing compatible_ring_position and using it in p__s__s. Fixes #4572. Test: unit (dev), compaction dtests (dev) " * 'projects/fix_partitioned_sstable_set/v4' of ssh://github.com/bhalevy/scylla: tests: Test partitioned sstable set's self-sufficiency sstables: Fix partitioned_sstable_set by making it self sufficient Introduce compatible_ring_position and compatible_ring_position_or_view	2019-06-23 17:17:18 +03:00
Raphael S. Carvalho	14fa7f6c02	tests: Test partitioned sstable set's self-sufficiency Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-06-23 16:29:13 +03:00
Raphael S. Carvalho	293557a34e	sstables: Fix partitioned_sstable_set by making it self sufficient Partitioned sstable set is not self sufficient, because it uses compatible_ring_position_view as key for interval map, which is constructed from a decorated key in sstable object. If sstable object is destroyed, like when compaction releases it early, partitioned set potentially no longer works because c__r__p__v would store information that is already freed, meaning its use implies use-after-free. Therefore, the problem happens when partitioned set tries to access the interval of its interval map and uses freed information from c__r__p__v. Fix is about using the newly introduced compatible_ring_position_or_view which can hold a ring_position, meaning that partitioned set is no longer dependent on lifetime of sstable object. Retire compatible_ring_position_view.hh as it is now unused. Fixes #4572. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-06-23 16:29:13 +03:00
Raphael S. Carvalho	9a83561700	Introduce compatible_ring_position and compatible_ring_position_or_view The motivation for supporting ring position is that containers using it can be self sufficient. The existing compatible_ring_position_view could lead to use after free when the ring position data, it was built from, is gone. The motivation for compatible_ring_position_or_view is to allow lookup on containers that don't support different key types using c__r__p, and also to avoid unnecessary copies. If the user is provided only with a ring_position_view, c__r__p__or_v could be built from it and used for lookups. Converting ring_position_view to ring_position is very bug prone because there could be information lost in the process. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-06-23 16:29:12 +03:00
Rafael Ávila de Espíndola	65ac0a831c	Add to_string_impl that takes a data_value Currently to_string takes raw bytes. This means that to print a data_value it has to first be serialized to be passed to to_string, which will then deserializes it. This patch adds a virtual to_string_impl that takes a data_value and implements a now non virtual to_sting on top of it. I don't expect this to have a performance impact. It mostly documents how to access a data_value without converting it to bytes. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20190620183449.64779-3-espindola@scylladb.com>	2019-06-23 16:03:06 +03:00
Rafael Ávila de Espíndola	3bd5dd7570	Add a few more tests of data_value::to_string I found that no tests covered this code while refactoring it. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20190620183449.64779-2-espindola@scylladb.com>	2019-06-23 16:03:06 +03:00
Nadav Har'El	6e87bca65d	storage_proxy: fix race and crash in case of MV and other node shutdown Recently, in merge commit `2718c90448`, we added the ability to cancel pending view-update requests when we detect that the target node went down. This is important for view updates because these have a very long timeout (5 minutes), and we wanted to make this timeout even longer. However, the implementation caused a race: Between creating the update's request handler (create_write_response_handler()) and actually starting the request with this handler (mutate_begin()), there is a preemption point and we may end up deleting the request handler before starting the request. So mutate_begin() must gracefully handle the case of a missing request handler, and not crash with a segmentation fault as it did before this patch. Eventually the lifetime management of request handlers could be refactored to avoid this delicate fix (which requires more comments to explain than code), or even better, it would be more correct to cancel individual writes when a node goes down, not drop the entire handler (see issue #4523). However, for now, let's not do such invasive changes and just fix bug that we set out to fix. Fixes #4386. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190620123949.22123-1-nyh@scylladb.com>	2019-06-23 16:03:06 +03:00
Asias He	b99c75429a	repair: Avoid searching all the rows in to_repair_rows_on_wire The repair_rows in row_list are sorted. It is only possible for the current repair_row to share the same partition key with the last repair_row inserted into repair_row_on_wire. So, no need to search from the beginning of the repair_rows_on_wire to avoid quadratic complexity. To fix, look at the last item in repair_rows_on_wire. Fixes #4580 Message-Id: <08a8bfe90d1a6cf16b67c210151245879418c042.1561001271.git.asias@scylladb.com>	2019-06-23 16:03:06 +03:00
Benny Halevy	883cb4318f	Merge pull request #4583 from bhalevy/init-and-shutdown-logging Init and shutdown logging	2019-06-23 16:03:06 +03:00
Rafael Ávila de Espíndola	3660caff77	Reduce memory used by all tests Tests without custom flags were already being run with -m2G. Tests with custom flags have to manually specify it, but some were missing it. This could cause tests to fail with std::bad_alloc when two concurrent tests tried to allocate all the memory. This patch adds -m2G to all tests that were missing it. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20190620002921.101481-1-espindola@scylladb.com>	2019-06-23 16:03:06 +03:00
Avi Kivity	9229afe64f	Merge "Fix infinite paging for indexed queries" from Piotr " Fixes #4569 This series fixes the infinite paging for indexed queries issue. Before this fix, paging indexes tended to end up in an infinite loop of returning pages with 0 results, but has_more_pages flag set to true, which confused the drivers. Tests: unit(dev) Branches: 3.0, 3.1 " * 'fix_infinite_paging_for_indexed_queries' of https://github.com/psarna/scylla: tests: add test case for finishing index paging cql3: fix infinite paging for indexed queries	2019-06-23 16:03:06 +03:00
Takuya ASADA	2135d2ae7f	dist/debian: install capabilities.conf on postinst script We still has "{{^jessie}}" tag on scylla-server systemd unit file to skip using AmbientCapabilities on Debian 8, but it does not able to work anymore since we moved to single binary .deb package for all debian variants, we must share same systemd unit file across all Debian variants. To do so we need to have separated file on /etc/systemd to define AmbientCapabilities, create the file while running postinst script only if distribution is not Debian 8, just like we do in .rpm. See #3344 See #3486 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190619064224.23035-1-syuu@scylladb.com>	2019-06-23 16:03:06 +03:00
Tomasz Grabiec	46341bd63f	gdb: Print coordinator stats related to memory usage from 'scylla memory' Example: Coordinator: fg writes: 150 bg writes: 39980, 21429280 B fg reads: 0 bg reads: 0 hints: 0 B view hints: 0 B Reviewed-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <1559906745-17150-1-git-send-email-tgrabiec@scylladb.com>	2019-06-23 16:03:06 +03:00
Tomasz Grabiec	f7e79b07d1	lsa: Respect the reclamation step hint from seastar allocator This will allow us to reduce the amount of segment compaction when reclaiming on behlaf of a large allocation because we'll evict much more up front. Tests: - unit (dev) Reviewed-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <1559906584-16770-1-git-send-email-tgrabiec@scylladb.com>	2019-06-23 16:03:06 +03:00
Tomasz Grabiec	c5184b3dd0	gdb: Print region_impl pointer from scylla lsa Reviewed-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <1559906684-17019-1-git-send-email-tgrabiec@scylladb.com>	2019-06-23 16:03:06 +03:00
Alexys Jacob	98bc9edf6f	thrift/: support version 0.11+ after THRIFT-2221 Thrift 0.11 changed to generate c++ code with std::shared_ptr instead of boost::shared_ptr. - https://issues.apache.org/jira/browse/THRIFT-2221 This was forcing scylla to stick with older versions of thrift. Fixes issue #3097. thrift: add type aliases to build with old and new versions. update to using namespace =	2019-06-23 16:03:06 +03:00
Takuya ASADA	e4320d6537	dist/debian: run 'systemctl daemon-reload' automatically on package install/uninstall Since we cannot use dh --with=systemd because we don't want to automatically enabling systemd units, manage them by our setup scripts, we have to do 'systemctl daemon-reload' manually. (On dh --with=systemd, systemd helper automatically provides such scirpts) Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20190618000210.28972-1-syuu@scylladb.com>	2019-06-23 16:03:06 +03:00
Rafael Ávila de Espíndola	8c067c26d9	Add support for the sanitize build mode in scylla Running tests in debug mode takes 25:22.08 in my machine. Using sanitize instead takes that down to 10:46.39. The mode is opt in, in that it must be explicitly selected with "configure.py --mode=sanitize" or "ninja sanitize". It must also be explicitly passed to test.py. Unfortunately building with asan, optimizations and debug info is very slow and there is nothing like -gline-tables-only in gcc. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20190617170007.44117-1-espindola@scylladb.com>	2019-06-23 16:03:06 +03:00
Benny Halevy	1fd91eb616	main: add logging for deferred stopping Increase visibility of init messages to help diagnose init and shutdown issues. Ref #4384 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-06-20 13:04:36 +03:00
Benny Halevy	cbbe5a519a	main: improve init logging Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-06-20 13:04:36 +03:00
Benny Halevy	e96b1afdbd	supervisor::notify log at info level rather than trace Increase visibility of init messages to help diagnose init and shutdown issues. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-06-20 13:04:36 +03:00
Piotr Sarna	b8cadc928c	tests: add test case for finishing index paging The test case makes sure that paging indexes does not result in an infinite loop. Refs #4569	2019-06-19 14:10:13 +02:00
Piotr Sarna	88f3ade16f	cql3: fix infinite paging for indexed queries Indexed queries need to translate between view table paging state and base table paging state, in order to be able to page the results correctly. One of the stages of this translation is overwriting the paging state obtained from the base query, in order to return view paging state to the user, so it can be used for fetching next pages. Unfortunately, in the original implementation the paging state was overwritten only if more pages were available, while if 'remaining' pages were equal to 0, nothing was done. This is not enough, because the paging state of the base query needs to be overwritten unconditionally - otherwise a guard paging state value of 'remaining == 0' is returned back to the client along with 'has_more_pages = true', which will result in an infinite loop. This patch correctly overwrites the base paging state unconditionally. Fixes #4569	2019-06-19 14:10:13 +02:00
Tomasz Grabiec	cd1ff1fe02	Merge "Use same schema version for repair nodes" from Asias This patch set fixes repair nodes using different schema version and optimizes the hashing thanks to the fact now all nodes uses same schema version. Fixes: #4549 * seastar-dev.git asias/repair_use_same_schema.v3: repair: Use the same schema version for repair master and followers repair: Hash column kind and id instead of column name and type name	2019-06-18 12:42:53 +02:00
Asias He	4285801af9	repair: Hash column kind and id instead of column name and type name It is guaranteed repair nodes use the same schema. It is faster to hash column kind and id. Changing the hashing of mutation fragment causes incompatibility with mixed clusters. Let's backport to the 3.1 release, which includes row level repair for the first time and is not released yet. Refs: #4549 Backports: 3.1	2019-06-18 18:27:21 +08:00
Asias He	3db136f81e	repair: Use the same schema version for repair master and followers Before this patch, repair master and followers use their own schema version at the point repair starts independently. The schemas can be different due to schema change. Repair uses the schema to serialize mutation_fragment and deserialize the mutation_fragment received from peer nodes. Using different schema version to serialize and deserialize cause undefined behaviour. To fix, we use the schema the repair master decides for all the repair nodes involved. On top of this patch, we could do another step to make sure all nodes has the latest schema. But let's do it in a separate patch. Fixes: #4549 Backports: 3.1	2019-06-18 18:27:21 +08:00
Rafael Ávila de Espíndola	8672eddff2	Document the best practices for when to use asserts/exceptions/logs The intention is just to document what is currently done. If someone wants to propose changes, that can be done after the current practices have been documented. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20190524135109.29436-1-espindola@scylladb.com>	2019-06-18 12:13:01 +03:00
Rafael Ávila de Espíndola	26c0814a88	Add test large collection warning This was already working, but we were not testing for it. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20190617181706.66490-1-espindola@scylladb.com>	2019-06-18 10:27:55 +02:00
Nadav Har'El	6aab1a61be	Fix deciding whether a query uses indexing The code that decides whether a query should used indexing was buggy - a partition key index might have influenced the decision even if the whole partition key was passed in the query (which effectively means that indexing it is not necessary). Fixes #4539 Closes https://github.com/scylladb/scylla/pull/4544 Merged from branch 'fix_deciding_whether_a_query_uses_indexing' of git://github.com/psarna/scylla tests: add case for partition key index and filtering cql3: fix deciding if a query uses indexing	2019-06-18 01:01:14 +03:00
Takuya ASADA	7320c966bc	dist/common/scripts/scylla_setup: don't proceed with empty NIC name Currently NIC selection prompt on scylla_setup just proceed setup when user just pressed Enter key on the prompt. The prompt should ask NIC name again until user input correct NIC name. Fixes #4517 Message-Id: <20190617124925.11559-1-syuu@scylladb.com>	2019-06-17 15:52:29 +03:00
Avi Kivity	938b74f47a	Merge "Fix gcc9 build" from Paweł " These patches fix remaining issues with gcc9 build, that involve a gcc9 bug, a gcc9 bug, and a stricter warning. Tests: unit(debug, dev, release). " * 'fix-gcc9-build' of https://github.com/pdziepak/scylla: dht/ring_position: silence complaints about uninitialised _token_bound xx_hasher: disable -Warray-bounds api/column_family: work around gcc9 bug in seastar::future<std::any>	2019-06-17 15:23:24 +03:00
Tomasz Grabiec	f798f724c8	frozen_mutation: Guard against unfreezing using wrong schema Currently, calling unfreeze() using the wrong version of the schema results in undefined behavior. That can cause hard-to-debug problems. Better to throw in such cases. Refs #4549. Tests: - unit (dev) Message-Id: <1560459022-23786-1-git-send-email-tgrabiec@scylladb.com>	2019-06-17 15:23:24 +03:00
Asias He	f32371727b	repair: Avoid copying position in to_repair_rows_list No need to make a copy because it is not used to construct repair_row any more since commit `9079790f85` (repair: Avoid writing row with same partition key and clustering key more than once). Use mf->position() instead. Refs: #4510 Backports: 3.1 Message-Id: <7b21edcc3368036b6357b5136314c0edc22ad4d2.1560753672.git.asias@scylladb.com>	2019-06-17 15:23:24 +03:00
Paweł Dziepak	483f66332b	dht/ring_position: silence complaints about uninitialised _token_bound	2019-06-17 13:11:20 +01:00
Paweł Dziepak	82b8450922	xx_hasher: disable -Warray-bounds In release mode gcc9 has a false positive warning about out of bound access in xxhash implementation: ./xxHash/xxhash.c:799:27: error: array subscript -3 is outside array bounds of ‘long unsigned int [1]’ [-Werror=array-bounds] This is solved by disabling -Warray-bounds in the xxhash code.	2019-06-17 13:09:54 +01:00
Paweł Dziepak	8a13d96203	api/column_family: work around gcc9 bug in seastar::future<std::any> There is a gcc9 bug https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90415 that makes it impossible to pass std::any through a seastar::future<T>. Fortunately, there is only one user of seastar::future<std::any> in Scylla and it is not performance-critical. This patch avoids the gcc9 bug by using seastar::future<std::unique_ptr<std::any>>.	2019-06-17 13:06:28 +01:00
Glauber Costa	91b71a0b1a	do not allow multiple snapshot operations at the same time We saw a node crashing today with nodetool clearsnapshot being called. After investigation, the reason is that nodetool clearsnapshot ws called at the same time a new snapshot was created with the same tag. nodetool clearsnapshot can't delete all files in the directory, because new files had by then been created in that directory, and crashes on I/O error. There are, many problems with allowing those operations to proceed in parallel. Even if we fix the code not to crash and return an error on directory non-empty, the moment they do any amount of work in parallel the result of the operation becomes undefined. Some files in the snapshot may have been deleted by clear, for example, and a user may then not be able to properly restore from the backup if this snapshot was used to generate a backup. Moreover, although we could lock at the granularity of a keyspace or column family, I think we should use a big hammer here and lock the entire snapshot creation/deletion to avoid surprises (for example, if a user requests creation of a snapshot for all keyspaces, and another process requests clear of a single keyspace) Fixes #4554 Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20190614174438.9002-1-glauber@scylladb.com>	2019-06-16 10:30:13 +03:00
Rafael Ávila de Espíndola	44eb939aa6	Use the sanitizer flags from seastar In practice, we always want to use the same sanitizer flags with seastar and scylla. Seastar was already marking its sanitizer flags public, so what was missing was exporting the link flags via pkgconfig and dropping the duplicates from scylla. I am doing this after wasting some time editing the wrong file. This depends on the seastar patch to export the sanitizer flags in pkgconfig. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-06-16 09:21:10 +03:00
Takuya ASADA	f582a759ee	dist: merge /usr/lib/scylla to /opt/scylladb We used to use /opt/scylladb just for Scylla build toolchain and dependency libraries, not for Scylla main package. But since we merged relocatable package, Scylla main binary and dependency libraries are all located under /opt/scylladb, only setup scripts remained on /usr/lib/scylla. It strange to keep using both /usr/lib/<app name> and /opt/<app name>, we should merge them into single place. Message-Id: <20190614011038.17827-1-syuu@scylladb.com>	2019-06-14 21:03:36 +03:00
Piotr Jastrzebski	a41c9763a9	sstables: distinguish empty and missing cellpath Before this patch mc sstables writer was ignoring empty cellpaths. This is a wrong behaviour because it is possible to have empty key in a map. In such case, our writer creats a wrong sstable that we can't read back. This is becaus a complex cell expects cellpath for each simple cell it has. When writer ignores empty cellpath it writes nothing and instead it should write a length of zero to the file so that we know there's an empty cellpath. Fixes #4533 Tests: unit(release) Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <46242906c691a56a915ca5994b36baf87ee633b7.1560532790.git.piotr@scylladb.com>	2019-06-14 20:36:41 +03:00
Asias He	9079790f85	repair: Avoid writing row with same partition key and clustering key more than once Consider master: row(pk=1, ck=1, col=10) follower1: row(pk=1, ck=1, col=20) follower2: row(pk=1, ck=1, col=30) When repair runs, master fetches row(pk=1, ck=1, col=20) and row(pk=1, ck=1, col=30) from follower1 and follower2. Then repair master sends row(pk=1, ck=1, col=10) and row(pk=1, ck=1, col=30) to follower1, follower1 will write the row with the same pk=1, ck=1 twice, which violates uniqueness constraints. To fix, we apply the row with same pk and ck into the previous row. We only needs this on repair follower because the rows can come from multiple nodes. While on repair master, we have a sstable writer per follower, so the rows feed into sstable writer can come from only a single node. Tests: repair_additional_test.py:RepairAdditionalTest.repair_same_row_diff_value_3nodes_test Fixes: #4510 Message-Id: <cb4fbba1e10fb0018116ffe5649c0870cda34575.1560405722.git.asias@scylladb.com>	2019-06-13 17:19:19 +02:00
Asias He	912ce53fc5	repair: Allow repair_row to initialize partially On repair follower node, only decorated_key_with_hash and the mutation_fragment inside repair_row are used in apply_rows() to apply the rows to disk. Allow repair_row to initialize partially and throw if the uninitialized member is accessed to be safe. Message-Id: <b4e5cc050c11b1bafcf997076a3e32f20d059045.1560405722.git.asias@scylladb.com>	2019-06-13 17:18:53 +02:00
Benny Halevy	2fd2713fda	conf: update conf/scylla.yaml default large data warning thresholds They are currently inconsistent with db/config.cc and missing compaction_large_cell_warning_threshold_mb Fixes #4551 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190613133657.15370-1-bhalevy@scylladb.com>	2019-06-13 16:45:27 +03:00
Benny Halevy	4ad06c7eeb	tests/perf: provide random-seed option Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190613114307.31038-2-bhalevy@scylladb.com>	2019-06-13 14:45:49 +03:00
Benny Halevy	43e4631e6a	tests: random-utils: use seastar::testing::local_random_engine To provide test reproducibility use the seastar local_random_engine. To reproduce a run, use the --random-seed command line option with the seed printed accordingly. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190613114307.31038-1-bhalevy@scylladb.com>	2019-06-13 14:45:48 +03:00
Benny Halevy	fe2d629e20	mutation_reader_test: test_multishard_combining_reader_reading_empty_table: fix non-atomic sharing of shards_touched It needs to be a std::vector<std::atomic<bool>> otherwise threads step on wach other in shared memory. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190613112359.21884-1-bhalevy@scylladb.com>	2019-06-13 14:44:43 +03:00

1 2 3 4 5 ...

18794 Commits