scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-26 11:30:36 +00:00

Author	SHA1	Message	Date
Benny Halevy	e4132edef3	stream_session: prepare: fix missing string format argument As seen in mv_populating_from_existing_data_during_node_decommission_test dtest: ``` ERROR 2021-02-11 06:01:32,804 [shard 0] stream_session - failed to log message: fmt::v7::format_error (argument not found) ``` Fixes #8067 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20210211100158.543952-1-bhalevy@scylladb.com> (cherry picked from commit `d01e7e7b58`)	2021-02-14 13:08:20 +02:00
Shlomi Livne	492f0802fb	scylla_io_setup did not configure pre tuned gce instances correctly scylla_io_setup condition for nr_disks was using the bitwise operator (&) instead of logical and operator (and) causing the io_properties files to have incorrect values Fixes #7341 Reviewed-by: Lubos Kosco <lubos@scylladb.com> Signed-off-by: Shlomi Livne <shlomi@scylladb.com> Closes #8019 (cherry picked from commit `718976e794`)	2021-02-14 13:08:00 +02:00
Takuya ASADA	34f22e1df1	dist/debian: install scylla-node-exporter.service correctly node-exporter systemd unit name is "scylla-node-exporter.service", not "node-exporter.service". Fixes #8054 Closes #8053 (cherry picked from commit `856fe12e13`)	2021-02-14 13:07:29 +02:00
Nadav Har'El	acb921845f	cql-pytest: fix flaky timeuuid_test.py The test timeuuid_test.py::testTimeuuid sporadically failed, and it turns out the reason was a bug in the test - which this patch fixes. The buggy test created a timeuuid and then compared the time stored in it to the result of the dateOf() CQL function. The problem is that dateOf() returns a CQL "timestamp", which has millisecond resolution, while the timeuuid may have finer than millisecond resolution. The reason why this test rarely failed is that in our implementation, the timeuuid almost always gets a millisecond-resolution timestamp. Only if now() gets called more than once in one millisecond, does it pick a higher time incremented by less than a millisecond. What this patch does is to truncate the time read from the timeuuid to millisecond resolution, and only then compare it to the result of dateOf(). We cannot hope for more. Fixes #8060 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210211165046.878371-1-nyh@scylladb.com> (cherry picked from commit `a03a8a89a9`)	2021-02-14 13:06:59 +02:00
Botond Dénes	5b6c284281	query: use local limit for non-limited queries in mixed cluster Since `fea5067df` we enforce a limit on the memory consumption of otherwise non-limited queries like reverse and non-paged queries. This limit is sent down to the replicas by the coordinator, ensuring that each replica is working with the same limit. This however doesn't work in a mixed cluster, when upgrading from a version which doesn't have this series. This has been worked around by falling back to the old max_result_size constant of 1MB in mixed clusters. This however resulted in a regression when upgrading from a pre `fea5067df` to a post `fea5067df` one. Pre `fea5067df` already had a limit for reverse queries, which was generalized to also cover non-paged ones too by `fea5067df`. The regression manifested in previously working reverse queries being aborted. This happened because even though the user has set a generous limit for them before the upgrade, in the mix cluster replicas fall back to the much stricter 1MB limit temporarily ignoring the configured limit if the coordinator is an old node. This patch solves this problem by using the locally configured limit instead of the max_result_size constant. This means that the user has to take extra care to configure the same limit on all replicas, but at least they will have working reverse queries during the upgrade. Fixes: #8022 Tests: unit(release), manual test by user who reported the issue Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20210209075947.1004164-1-bdenes@scylladb.com> (cherry picked from commit `3d001b5587`)	2021-02-09 18:06:43 +02:00
Yaron Kaikov	7d15319a8a	release: prepare for 4.4 Update Docker parameters for the 4.4 release. Closes #7932	2021-02-09 09:42:53 +02:00
Amnon Heiman	a06412fd24	API: Fix aggregation in column_familiy Few method in column_familiy API were doing the aggregation wrong, specifically, bloom filter disk size. The issue is not always visible, it happens when there are multiple filter files per shard. Fixes #4513 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Closes #8007 (cherry picked from commit `4498bb0a48`)	2021-02-08 17:03:45 +02:00
Avi Kivity	2500dd1dc4	Merge 'dist/offline_installer/redhat: fix umask error' from Takuya ASADA Since makeself script changes current umask, scylla_setup causes "scylla does not work with current umask setting (0077)" error. To fix that we need to use the latest version of makeself, and specify the --keep-umask option. Fixes #6243 Closes #6244 * github.com:scylladb/scylla: dist/offline_redhat: fix umask error dist/offline_installer/redhat: support cross build (cherry picked from commit `bb202db1ff`)	2021-02-01 13:03:06 +02:00
Hagit Segev	fd868722dd	release: prepare for 4.4.rc1 scylla-4.4.rc1	2021-01-31 14:09:44 +02:00
Pekka Enberg	f470c5d4de	Update tools/python3 submodule * tools/python3 c579207...199ac90 (1): > dist: debian: adjust .orig tarball name for .rc releases	2021-01-25 09:26:33 +02:00
Pekka Enberg	3677a72a21	Update tools/python3 submodule * tools/python3 1763a1a...c579207 (1): > dist/debian: handle rc version correctly	2021-01-22 09:36:54 +02:00
Hagit Segev	46e6273821	release: prepare for 4.4.rc0 scylla-4.4.rc0	2021-01-18 20:29:53 +02:00
Jenkins Promoter	ce7e31013c	release: prepare for 4.4	2021-01-18 15:49:55 +02:00
Avi Kivity	60f5ec3644	Merge 'managed_bytes: switch to explicit linearization' from Michał Chojnowski This is a revival of #7490. Quoting #7490: The managed_bytes class now uses implicit linearization: outside LSA, data is never fragmented, and within LSA, data is linearized on-demand, as long as the code is running within with_linearized_managed_bytes() scope. We would like to stop linearizing managed_bytes and keep it fragmented at all times, since linearization can require large contiguous chunks. Large contiguous allocations are hard to satisfy and cause latency spikes. As a first step towards that, we remove all implicitly linearizing accessors and replace them with an explicit linearization accessor, with_linearized(). Some of the linearization happens long before use, by creating a bytes_view of the managed_bytes object and passing it onwards, perhaps storing it for later use. This does not work with with_linearized(), which creates a temporary linearized view, and does not work towards the longer term goal of never linearizing. As a substitute a managed_bytes_view class is introduced that acts as a view for managed_bytes (for interoperability it can also be a view for bytes and is compatible with bytes_view). By the end of the series, all linearizations are temporary, within the scope of a with_linearized() call and can be converted to fragmented consumption of the data at leisure. This has limited practical value directly, as current uses of managed_bytes are limited to keys (which are limited to 64k). However, it enables converting the atomic_cell layer back to managed_bytes (so we can remove IMR) and the CQL layer to managed_bytes/managed_bytes_view, removing contiguous allocations from the coordinator. Closes #7820 * github.com:scylladb/scylla: test: add hashers_test memtable: fix accounting of managed_bytes in partition_snapshot_accounter test: add managed_bytes_test utils: fragment_range: add a fragment iterator for FragmentedView keys: update comments after changes and remove an unused method mutation_test: use the correct preferred_max_contiguous_allocation in measuring_allocator row_cache: more indentation fixes utils: remove unused linearization facilities in `managed_bytes` class misc: fix indentation treewide: remove remaining `with_linearized_managed_bytes` uses memtable, row_cache: remove `with_linearized_managed_bytes` uses utils: managed_bytes: remove linearizing accessors keys, compound: switch from bytes_view to managed_bytes_view sstables: writer: add write_* helpers for managed_bytes_view compound_compat: transition legacy_compound_view from bytes_view to managed_bytes_view types: change equal() to accept managed_bytes_view types: add parallel interfaces for managed_bytes_view types: add to_managed_bytes(const sstring&) serializer_impl: handle managed_bytes without linearizing utils: managed_bytes: add managed_bytes_view::operator[] utils: managed_bytes: introduce managed_bytes_view utils: fragment_range: add serialization helpers for FragmentedMutableView bytes: implement std::hash using appending_hash utils: mutable_view: add substr() utils: fragment_range: add compare_unsigned utils: managed_bytes: make the constructors from bytes and bytes_view explicit utils: managed_bytes: introduce with_linearized() utils: managed_bytes: constrain with_linearized_managed_bytes() utils: managed_bytes: avoid internal uses of managed_bytes::data() utils: managed_bytes: extract do_linearize_pure() thrift: do not depend on implicit conversion of keys to bytes_view clustering_bounds_comparator: do not depend on implicit conversion of keys to bytes_view cql3: expression: linearize get_value_from_mutation() eariler bytes: add to_bytes(bytes) cql3: expression: mark do_get_value() as static	2021-01-18 11:01:28 +02:00
Avi Kivity	ab44464911	Revert "docker: remove sshd from the image" This reverts commit `32fd38f349`. Some tests (in scylla-cluster-tests) depend on it.	2021-01-17 14:34:40 +02:00
Raphael S. Carvalho	00c29e1e24	table: Move notify_bootstrap_or_replace_*() out of line Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20210117045747.69891-9-raphaelsc@scylladb.com>	2021-01-17 10:36:13 +02:00
Michał Chojnowski	5b72fb65ae	test: add hashers_test This test is a sanity check. It verifies that our wrappers over well known hashes (xxhash, md5, sha256) actually calculate exactly those hashes. It also checks that the `update()` methods of used hashers are linear with respect to concatenation: that is, `update(a + b)` must be equivalent to `update(a); update(b)`. This wasn't relied on before, but now we need to confirm that hashing fragmented keys without linearizing them won't break backward compatibility.	2021-01-15 18:28:24 +01:00
Michał Chojnowski	85048b349b	memtable: fix accounting of managed_bytes in partition_snapshot_accounter managed_bytes has a small overhead per each fragment. Due to that, managed_bytes containing the same data can have different total memory usage in different allocators. The smaller the preferred max allocation size setting is, the more fragments are needed and the greater total per-fragment overhead is. In particular, managed_bytes allocated in the LSA could grow in memory usage when copied to the standard allocator, if the standard allocator had a preferred max allocation setting smaller than the LSA. partition_snapshot_accounter calculates the amount of memory used by mutation fragments in the memtable (where they are allocated with LSA) based on the memory usage after they are copied to the standard allocator. This could result in an overestimation, as explained above. But partition_snapshot_accounter must not overestimate the amount of freed memory, as doing otherwise might result in OOM situations. This patch prevents the overaccounting by adding minimal_external_memory_usage(): a new version of external_memory_usage(), which ignores allocator-dependent overhead. In particular, it includes the per-fragment overhead in managed_bytes only once, no matter how many fragments there are.	2021-01-15 18:21:13 +01:00
Michał Chojnowski	d31771c0b2	test: add managed_bytes_test	2021-01-15 18:21:13 +01:00
Michał Chojnowski	72ecbd6936	utils: fragment_range: add a fragment iterator for FragmentedView A stylistic change. Iterators are the idiomatic way to iterate in C++.	2021-01-15 14:05:44 +01:00
Michał Chojnowski	2e38647a95	keys: update comments after changes and remove an unused method The comments were outdated after the latest changes (bytes_view vs managed_bytes_view). compound_view_wrapper::get_component() is unused, so we remove it.	2021-01-15 14:05:44 +01:00
Piotr Sarna	6ae94d31c1	treewide: remove shared pointer usage from the pager The pager interface doesn't really need to be virtual, so the next step could be to remove the need for pointers entirely, but migrating from shared_ptr to unique_ptr is a low-hanging fruit. Message-Id: <a5bdecb17ae58e914da020fb58a41f4574565c66.1610709560.git.sarna@scylladb.com>	2021-01-15 15:03:14 +02:00
Avi Kivity	f20736d93d	Merge 'Support unofficial distributions' from Takuya ASADA Since we introduced relocatable package and offline installer, scylla binary itself can run almost any distributions. However, setup scripts are not designed to run in unsupported distributions, it causes error on such environment. This PR adds minimal support to run offline installation on unsupported distributions, tested on SLES, Arch Linux and Gentoo. Closes #7858 * github.com:scylladb/scylla: dist: use sysconfig_parser to parse gentoo config file dist: add package name translation dist: support SLES/OpenSUSE install.sh: add systemd existance check install.sh: ignore error missing sysctl entries dist: show warning on unsupported distributions dist: drop Ubuntu 14.04 code dist: move back is_amzn2() to scylla_util.py dist: rename is_gentoo_variant() to is_gentoo() dist: support Arch Linux dist: make sysconfig directory detectable	2021-01-14 16:59:49 +02:00
Raphael S. Carvalho	97e076365e	Fix stalls on Memtable flush by preempting across fragment generation if needed Flush is facing stalls because partition_snapshot_flat_reader::fill_buffer() generates mutation fragment until buffer is full[1] without yielding. this is the code path: flush_reader::fill_buffer() <---------\| flat_mutation_reader::consume_pausable() <--------\| partition_snapshot_flat_reader::fill_buffer() -\| [1]: https://github.com/scylladb/scylla/blob/6cfc949e/partition_snapshot_reader.hh#L261 This is fixed by breaking the loop in do_fill_buffer() if preemption is needed, allowing do_until() to yield in sequence, and when it resumes, continue from where it left off, until buffer is full. Fixes #7885. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20210114141417.285175-1-raphaelsc@scylladb.com>	2021-01-14 16:30:55 +02:00
Ivan Prisyazhnyy	32fd38f349	docker: remove sshd from the image implicit revert of `6322293263` sshd previously was used by the scylla manager 1.0. new version does not need it. there is no point of having it currently. it also confuses everyone. Signed-off-by: Ivan Prisyazhnyy <ivan@scylladb.com> Closes #7921	2021-01-14 12:52:24 +02:00
Pavel Emelyanov	2b31be0daa	client-state,cdc: Remove call for storage_service from permissions check The client_state::check_access() calls for global storage service to get the features from it and check if the CDC feature is on. The latter is needed to perform CDC-specific checks. However it was noticed, that the check for the feature is excessive as all the guarded if-s will resolve to false in case CDC is off and the check_access will effectively work as it would with the feature check. With that observation, it's possible to ditch one more global storage service reference. tests: unit(dev), dtest(dev, auth) Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20210105063651.7081-1-xemul@scylladb.com>	2021-01-14 12:52:24 +02:00
Takuya ASADA	7a74f8cd2e	dist: use sysconfig_parser to parse gentoo config file Use sysconfig_parser instead of regex, to improve code readability.	2021-01-13 21:34:23 +09:00
Takuya ASADA	2a4d293841	dist: add package name translation Translate package name from CentOS package to different distribution package name, to use single package name for pkg_install().	2021-01-13 21:27:14 +09:00
Takuya ASADA	0a9843842d	dist: support SLES/OpenSUSE Add support SLES/OpenSUSE on setup script.	2021-01-13 19:32:46 +09:00
Takuya ASADA	a34edf8169	install.sh: add systemd existance check offline installer can run in non-systemd distributions, but it won't work since we only have systemd units. So check systemd existance and print error message.	2021-01-13 19:32:45 +09:00
Takuya ASADA	b8c35772b3	install.sh: ignore error missing sysctl entries On some kernel may not have specified sysctl parameter, so we should ignore the error.	2021-01-13 19:32:45 +09:00
Takuya ASADA	e8f74e800c	dist: show warning on unsupported distributions Add warning message on unsupported distributions, for scylla_cpuscaling_setup and scylla_ntp_setup.	2021-01-13 19:32:45 +09:00
Takuya ASADA	2f344cf50d	dist: drop Ubuntu 14.04 code We don't support Ubuntu 14.04 anymore, drop them	2021-01-13 19:32:45 +09:00
Takuya ASADA	8e59f70080	dist: move back is_amzn2() to scylla_util.py Distribution detection functions should be placed same place, so move back it to scylla_util.py	2021-01-13 19:32:45 +09:00
Takuya ASADA	921b1676c0	dist: rename is_gentoo_variant() to is_gentoo() is_redhat_variant() is the function to detect RHEL/CentOS/Fedora/OEL, and is_debian_variant() is the function to detect Debian/Ubuntu. Unlike these functions, is_gentoo_variant() does not detect "Gentoo variants", we should rename it to is_gentoo().	2021-01-13 19:32:45 +09:00
Takuya ASADA	fffa8f5ded	dist: support Arch Linux Add support Arch Linux on setup script.	2021-01-13 19:32:45 +09:00
Takuya ASADA	0d11f9463d	dist: make sysconfig directory detectable Currently, install.sh provides a way to customize the sysconfig directory, but the sysconfig directory is hardcoded in the scripts. Also, /etc/sysconfig seems correct to use as the default value, but the current code specifies /etc/default for non-redhat distributions. Instead of hardcoding, generate a python script in install.sh to save the specified sysconfig directory path in python code.	2021-01-13 19:32:45 +09:00
Wojciech Mitros	93613e20a3	api: remove potential large allocation in /column_family/ GET request handler The reply to a /column_family/ GET request contains info about all column families. Currently, all this info is stored in a single string when replying, and this string may require a big allocation when there are many column families. To avoid that allocation, instead of a single string, use a body_writer function, which writes chunks of the message content to the output stream. Fixes #7916 Signed-off-by: Wojciech Mitros <wojciech.mitros@scylladb.com> Closes #7917	2021-01-13 12:04:18 +02:00
Avi Kivity	ed53b3347e	Merge 'idl: remove the large allocation in mutation_partition_view::rows()' from Wojciech Mitros After these changes the generated code deserializes the stream into a chunked vector, instead of an contiguous one, so even if there are many fields in it, there won't be any big allocations. I haven't run the scylla cluster test with it yet but it passes the unit tests. Closes #7919 * github.com:scylladb/scylla: idl: change the type of mutation_partition_view::rows() to a chunked_vector idl-compiler: allow fields of type utils::chunked_vector	2021-01-13 11:07:29 +02:00
Nadav Har'El	711b311d47	cql-pytest: tests for fromJson() integer overflow Numbers in JSON are not limited in range, so when the fromJson() function converts a number to a limited-range integer column in Scylla, this conversion can overflow. The following tests check that this conversion should result in an error (FunctionFailure), not silent truncation. Scylla today does silently wrap around the number, so these tests xfail. They pass on Cassandra. Refs #7914. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210112151041.3940361-1-nyh@scylladb.com>	2021-01-13 11:07:29 +02:00
Nadav Har'El	617e1be1b6	cql-pytest: expand tests for fromJson() failures This patch adds more (failing) tests for issue #7911, where fromJson() failures should be reported as a clean FunctionFailure error, not an internal server error. The previous tests we had were about JSON parse failures, but a different type of error we should support is valid JSON which returned the wrong type - e.g., the JSON returning a string when an integer was expected, or the JSON returning a string with non-ASCII characters when ASCII was expected. So this patch adds more such tests. All of them xfail on Scylla, and pass on Cassandra. Refs #7911. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210112122211.3932201-1-nyh@scylladb.com>	2021-01-13 11:07:29 +02:00
Nadav Har'El	2ebe8055ee	cql-pytest: add test for fromJson() null parameter. This patch adds a reproducer test for issue #7912, which is about passing a null parameter to the fromJson() function supposed to be legal (and return a null value), and is legal in Cassandra, but isn't allowed in Scylla. There are two tests - for a prepared and unprepared statement - which fail in different ways. The issue is still open so the tests xfail on Scylla - and pass on Cassandra. Refs #7912. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210112114254.3927671-1-nyh@scylladb.com>	2021-01-13 11:07:29 +02:00
dgarcia360	78e9f45214	docs: update url Related issue scylladb/sphinx-scylladb-theme#88 Once this commit is merged, the docs will be published under the new domain name https://scylla.docs.scylladb.com Frequently asked questions: Should we change the links in the README/docs folder? GitHub automatically handles the redirections. For example, https://scylladb.github.io/sphinx-scylladb-theme/stable/examples/index.html redirects to https://sphinx-theme.scylladb.com/stable/examples/index.html Nevertheless, it would be great to change URLs progressively to avoid the 301 redirections. Do I need to add this new domain in the custom dns domain section on GitHub settings? It is not necessary. We have already edited the DNS for this domain and the theme creates programmatically the required CNAME file. If everything goes well, GitHub should detect the new URL after this PR is merged. The DNS doesn't seem to have the right SSL certificates GitHub handles the certificate provisioning but is not aware of the subdomain for this repo yet. make multi-version will create a new file "CNAME". This is published in gh-pages branch, therefore GitHub should create the missing cert. Closes #7877	2021-01-13 11:07:29 +02:00
Avi Kivity	d508a63d4b	row_cache: linearize key in cache_entry::do_read() do_read() does not linearize cache_entry::_key; this can cause a crash with keys larger than 13k. Fixes #7897. Closes #7898	2021-01-13 11:07:29 +02:00
dgarcia360	36f8d35812	docs: added multiversion_regex_builder Fixed makefile Added path Closes #7876	2021-01-13 11:07:29 +02:00
Benny Halevy	5e41228fe8	test: everywhere: use seastar::testing::local_random_engine Use the thread_local seastar::testing::local_random_engine in all seastar tests so they can be reproduced using the --random-seed option. Test: unit(dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20210112103713.578301-2-bhalevy@scylladb.com>	2021-01-13 11:07:29 +02:00
Benny Halevy	43ab094c88	configure: add utf8_test to pure_boost_tests Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20210112103713.578301-1-bhalevy@scylladb.com>	2021-01-13 11:07:29 +02:00
Dejan Mircevski	d79c2cab63	cql3: Use correct comparator in timeuuid min/max The min/max aggregators use aggregate_type_for comparators, and the aggregate_type_for<timeuuid> is regular uuid. But that yields wrong results; timeuuids should be compared as timestamps. Fix it by changing aggregate_type_for<timeuuid> from uuid to timeuuid, so aggregators can distinguish between the two. Then specialize the aggregation utilities for timeuuid. Add a cql-pytest and change some unit tests, which relied on naive uuid comparators. Fixes #7729. Tests: unit (dev, debug) Signed-off-by: Dejan Mircevski <dejan@scylladb.com> Closes #7910	2021-01-13 11:07:29 +02:00
Avi Kivity	96d64b7a1f	Merge "Wire interposer consumer for memtable flush" from Raphael " Without interposer consumer on flush, it could happen that a new sstable, produced by memtable flush, will not conform to the strategy invariant. For example, with TWCS, this new sstable could span multiple time windows, making it hard for the strategy to purge expired data. If interposer is enabled, the data will be correctly segregated into different sstables, each one spanning a single window. Fixes #4617. tests: - mode(dev). - manually tested it by forcing a flush of memtable spanning many windows " * 'segregation_on_flush_v2' of github.com:raphaelsc/scylla: test: Add test for TWCS interposer on memtable flush table: Wire interposer consumer for memtable flush table: Add write_memtable_to_sstable variant which accepts flat_mutation_reader table: Allow sstable write permit to be shared across monitors memtable: Track min timestamp table: Extend cache update to operate a memtable split into multiple sstables	2021-01-13 11:07:29 +02:00
Nadav Har'El	8164c52871	cql-pytest: add test for fromJson() parse error This patch adds a reproducer test for issue #7911, which is about a parse error in JSON string passed to the fromJson() function causing an internal error instead of the expected FunctionFailure error. The issue is still open so the test xfails on Scylla (and passes on Cassandra). Refs #7911. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210112094629.3920472-1-nyh@scylladb.com>	2021-01-13 11:07:29 +02:00
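
Note on commit 492f0802fb (scylla_io_setup): the fix turns on a Python precedence detail. The bitwise operator `&` binds tighter than comparison operators, so a condition joined with `&` is parsed as one chained comparison rather than as two separate tests. The sketch below only illustrates that pitfall; the function names and the disk-count range are hypothetical and are not taken from scylla_io_setup itself.

```python
def in_range_bitwise(nr_disks):
    # Hypothetical condition written with '&'.
    # Python parses this as: nr_disks >= (1 & nr_disks) < 4
    return nr_disks >= 1 & nr_disks < 4


def in_range_logical(nr_disks):
    # The intended condition: two independent comparisons joined by 'and'.
    return nr_disks >= 1 and nr_disks < 4


if __name__ == "__main__":
    # Print both results side by side for a few disk counts.
    for n in (0, 2, 4, 8):
        print(n, in_range_bitwise(n), in_range_logical(n))
```

For small in-range values the two forms can agree, which is one way such a bug may go unnoticed until an out-of-range value is evaluated.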
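
Note on commit 5b72fb65ae (test: add hashers_test): the property it checks, that `update(a + b)` is equivalent to `update(a); update(b)`, can be sketched with Python's standard `hashlib`. This is an illustration only, not the C++ test from the repository; the helper names are hypothetical, and xxhash is left out because it is not in the Python standard library.

```python
import hashlib


def digest_whole(algorithm, data):
    # Hash the contiguous ("linearized") buffer in a single update() call.
    h = hashlib.new(algorithm)
    h.update(data)
    return h.hexdigest()


def digest_fragments(algorithm, fragments):
    # Hash the same bytes fragment by fragment, without concatenating them.
    h = hashlib.new(algorithm)
    for fragment in fragments:
        h.update(fragment)
    return h.hexdigest()


if __name__ == "__main__":
    fragments = [b"partition", b"-", b"key"]
    whole = b"".join(fragments)
    for algorithm in ("md5", "sha256"):
        same = digest_whole(algorithm, whole) == digest_fragments(algorithm, fragments)
        print(algorithm, same)
```

If the two digests ever differed, hashing fragmented keys without linearizing them would change the computed hash, which is the backward-compatibility concern the commit message raises.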

24861 Commits