scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-05 14:33:08 +00:00

Author	SHA1	Message	Date
Avi Kivity	8e8975730d	Update seastar submoodule * seastar cb7026c16f...b3fb4aaab3 (10): > Revert "scheduling groups: Adding per scheduling group data support" > scheduling groups: Adding per scheduling group data support > rpc: check that two servers are not created with the same streaming id > future: really ignore exceptions in ignore_ready_future > iostream: Constify eof() function > apply.hh: add missing #include for size_t > scheduling_group_demo: add explicit yields since future::get() no longer does > Fix buffer size used when calling accept4() > future-util: reduce allocations and continuations in parallel_for_each > rpc: lz4_decompressor: Add a static constexpr variable decleration for Cpp14 compatibility	2019-09-09 15:15:34 +03:00
Gleb Natapov	9e9f64d90e	messaging_service: configure different streaming domain for each rpc server A streaming domain identifies a server across shards. Each server should have different one. Fixes: #4953 Message-Id: <20190908085327.GR21540@scylladb.com>	2019-09-08 14:05:40 +03:00
Piotr Sarna	01410c9770	transport: make sure returning connection errors happens inside the gate. Previously, the gate could get closed too early, which would result in shutting down the server before it had an opportunity to respond to the client. Refs #4818	2019-09-08 13:23:20 +03:00
Avi Kivity	5663218fac	Merge "types: Fix decimal to integer and varint to integer conversion" from Rafael " The release notes for boost 1.67.0 includes: Breaking Change: When converting a multiprecision integer to a narrower type, if the value is too large (or negative) to fit in the smaller type, then the result is either the maximum (or minimum) value of the target Since we just moved out of boost 1.66, we have to update our code. This fixes issue #4960 " * 'espindola/fix-4960' of https://github.com/espindola/scylla: types: fix varint to integer conversion types: extract a from_varint_to_integer from make_castas_fctn_from_decimal_to_integer types: fix decimal to integer conversion types: extract helper for converting a decimal to a cppint types: rename and detemplate make_castas_fctn_from_decimal_to_integer	2019-09-08 10:45:42 +03:00
Avi Kivity	244218e483	Merge "simplify date type" from Rafael " With this patch series one has to be explicit to create a date_type_impl and now there is only the one documented difference between date_type_impl and timestamp_type_impl. " * 'espindola/simplify-date-type' of https://github.com/espindola/scylla: types: Reduce duplication around date_type_impl types: Don't use date_type_native_type when we want a timestamp types: Remove timestamp_native_type types: Don't specialize data_type_for for db_clock::time_point types: Make it harder to create date_type	2019-09-08 10:21:48 +03:00
Rafael Ávila de Espíndola	3bac4ebac7	types: Reduce duplication around date_type_impl According to the comments, the only different between date_type_impl and timestamp_type_impl is the comparison function. This patch makes that explicit by merging all code paths except: * The warning when converting between the two * The compare function The date_type_impl type can still be user visible via very old sstables or via the thrift protocol. It is not clear if we still need to support either, but with this patch it is easy to do so. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-07 10:07:33 -07:00
Rafael Ávila de Espíndola	36d40b4858	types: Don't use date_type_native_type when we want a timestamp In these cases it is pretty clear that the original code wanted to create a timestamp_type data_value but was creating a date_type one because of the old defaults. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-07 10:07:33 -07:00
Rafael Ávila de Espíndola	01cd21c04d	types: Remove timestamp_native_type Now that we know that anything expecting a date_type has been converted to date_type_native_type, switch to using db_clock::time_point when we want a timestamp_type. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-07 10:07:33 -07:00
Rafael Ávila de Espíndola	df6c2d1230	types: Don't specialize data_type_for for db_clock::time_point This also moves every user to date_type_native_type. A followup patch will convert to timestamp_type when appropriate. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-07 10:07:33 -07:00
Rafael Ávila de Espíndola	e09fa2dcff	types: Make it harder to create date_type date_type was replaced with timestamp_type, but it was very easy to create a date_type instead of a timestamp_type by accident. This patch changes the code so that a date_type is no longer implicitly used when constructing a data_value. All existing code that was depending on this is converted to explicitly using date_type_native_type. A followup patch will convert to timestamp_type when appropriate. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-07 10:07:33 -07:00
Gleb Natapov	f78b2c5588	transport: remove remaining craft related to cql's server load balancing Commit `7e3805ed3d` removed the load balancing code from cql server, but it did not remove most of the craft that load balancing introduced. The most of the complexity (and probably the main reason the code never worked properly) is around service::client_state class which is copied before been passed to the request processor (because in the past the processing could have happened on another shard) and then merged back into the "master copy" because a request processing may have changed it. This commit remove all this copying. The client_request is passed as a reference all the way to the lowest layer that needs it and it copy construction is removed to make sure nobody copies it by mistake. tests: dev, default c-s load of 3 node cluster Message-Id: <20190906083050.GA21796@scylladb.com>	2019-09-07 18:17:53 +03:00
Avi Kivity	3b5aa13437	Merge "Optimize type find" from Rafael " This avoids a double dispatch on _kind and also removes a few shared_ptr copies. The extra work was a small regression from the recent types refactoring. " * 'espindola/optimize_type_find' of https://github.com/espindola/scylla: types: optimize type find implementation types: Avoid shared_ptr copies	2019-09-07 18:14:36 +03:00
Gleb Natapov	5b9dc00916	test: fix query_processor_test::test_query_counters to use SERIAL consistency correctly It is not possible to scan a table with SERIAL consistency only to read a single partition. Message-Id: <20190905143023.GQ21540@scylladb.com>	2019-09-07 18:07:01 +03:00
Gleb Natapov	e52ebfb957	cql3: remove unused next_timestamp() function next_timestamp() just calls get_timestamp() directly and nobody uses it anyway. Message-Id: <20190905101648.GO21540@scylladb.com>	2019-09-05 17:20:21 +03:00
Botond Dénes	783277fb02	stream_session: STREAM_MUTATION_FRAGMENTS: print errors in receive and distribute phase Currently when an error happens during the receive and distribute phase it is swallowed and we just return a -1 status to the remote. We only log errors that happen during responding with the status. This means that when streaming fails, we only know that something went wrong, but the node on which the failure happened doesn't log anything. Fix by also logging errors happening in the receive and distribute phase. Also mention the phase in which the error happened in both error log messages. Refs: #4901 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20190903115735.49915-1-bdenes@scylladb.com>	2019-09-05 13:43:00 +02:00
Rafael Ávila de Espíndola	dd81e94684	types: fix varint to integer conversion The previous code was using the boost::multiprecision::cpp_int to integer conversion, but that doesn't have the same semantics an cql for signed numbers. This fixes the dtest cql_cast_test.py:CQLCastTest.cast_varint_test. Fixes #4960 Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-04 15:08:14 -07:00
Rafael Ávila de Espíndola	263e18b625	types: extract a from_varint_to_integer from make_castas_fctn_from_decimal_to_integer It will be used when converting varint to integer too. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-04 15:08:14 -07:00
Rafael Ávila de Espíndola	2d453b8e17	types: fix decimal to integer conversion The previous code was using the boost::multiprecision::cpp_rational to integer conversion, but that doesn't have the same semantics an cql. This patch avoids creating a cpp_rational in the first place and works just with integers. This fixes the dtest cql_cast_test.py:CQLCastTest.cast_decimal_test. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-04 15:08:14 -07:00
Rafael Ávila de Espíndola	fb760774dd	types: extract helper for converting a decimal to a cppint It will also be used in the decimal to integer conversion. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-04 15:08:07 -07:00
Rafael Ávila de Espíndola	40e6882906	types: rename and detemplate make_castas_fctn_from_decimal_to_integer It was only ever used for varint. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-04 14:54:47 -07:00
Avi Kivity	301246f6c0	storage_proxy: protect _view_update_handlers_list iterators from invalidation on_down() iterates over _view_update_handlers_list, but it yields during iteration, and while it yields, elements in that list can be removed, resulting in a use-after-free. Prevent this by registering iterators that can be potentially invalidated, and any time we remove an element from the list, check whether we're removing an element that is being pointed to by a live iterator. If that is the case, advance the iterator so that it points at a valid element (or at the end of the list). Fixes #4912. Tests: unit (dev)	2019-09-04 17:19:28 +03:00
Tomasz Grabiec	9f5826fd4b	Merge "Use canonical mutations for background schema sync" from Botond Currently the background schema sync (push/pull) uses frozen mutation to send the schema mutations over the wire to the remote node. For this to work correctly, both nodes have to have the exact same schema for the system schema tables, as attempting to unpack the frozen mutation with the wrong schema leads to undefined behaviour. To avoid this and to ensure syncing schema between nodes with different schema table schema versions is defined we migrate the background schema sync to use canonical mutations for the transfer of the schema mutations. Canonical mutations are immune to this problem, as they support deserializing with any version of the schema, older or newer one. The foreground schema sync mechanisms -- the on-demand schema pulls on reads and writes -- already use canonical mutations to transmit the schema mutations. It is important to note that due to this change, column-level incompatibilities between the schema mutations and the schema used to deserialize them will be hidden. This is undesired and should be fixed in a follow-up (#4956). Table level incompatibilities are detected and schema mutations containing such mutations will be rejected just like before. This patch adds canonical mutation support to the two background schema sync verbs: * `DEFINITIONS_UPDATE` (schema push) * `MIGRATION_REQUEST` (schema pull) Both verbs still support the old frozen mutation schema transfer, albeit that path is now much less efficient. After all nodes are upgraded, the pull verb can effectively avoid sending frozen mutations altogether, completely migrating to canonical mutations. Unfortunately this was not possible for the push verb, so that one now has an overhead as it needs to send both the frozen and canonical mutations. Fixes: #4273	2019-09-04 13:58:14 +02:00
Rafael Ávila de Espíndola	000514e7cc	sstable: close file_writer if an exception in thrown The previous code was not exception safe and would eventually cause a file to be destroyed without being closed, causing an assert failure. Unfortunately it doesn't seem to be possible to test this without error injection, since using an invalid directory fails before this code is executed. Fixes #4948 Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20190904002314.79591-1-espindola@scylladb.com>	2019-09-04 13:28:55 +03:00
Botond Dénes	7adc764b6e	messaging_service: add canonical_support to schema pull and push verbs The verbs are: * DEFINITIONS_UPDATE (push) * MIGRATION_REQUEST (pull) Support was added in a backward-compatible way. The push verb, sends both the old frozen mutation parameter, and the new optional canonical mutation parameter. It is expected that new nodes will use the latter, while old nodes will fall-back to the former. The pull verb has a new optional `options` parameter, which for now contains a single flag: `remote_supports_canonical_mutation_retval`. This flag, if set, means that the remote node supports the new canonical mutation return value, thus the old frozen mutations return value can be left empty.	2019-09-04 10:32:44 +03:00
Botond Dénes	d9a8ff15d8	service::migration_manager: add canonical_mutation merge_schema_from() overload Add an overload which takes a vector of canonical mutations. Going forward, this is the overload to use.	2019-09-04 10:32:44 +03:00
Botond Dénes	e02b93cae1	schema_tables: convert_schema_to_mutations: return canonical_mutations In preparation to the schema push/pull migrating to use canonical mutations, convert the method producing the schema mutations to return a vector of canonical mutations. The only user, MIGRATION_REQUEST verb, converts the canonical mutations back to frozen mutations. This is very inefficient, but this path will only be used in mixed clusters. After all nodes are upgraded the verb will be sending the canonical mutations directly instead.	2019-09-04 08:47:20 +03:00
Rafael Ávila de Espíndola	b100f95adc	types: optimize type find implementation This turns find into a template so there is only one switch over the kind of each type in the search. To evaluate the change in code size sizes, I added [[noinline]] to find and obtained the following results. The release columns for release in the before case have an extra column because the functions are sufficiently complex to trigger gcc to split them in hot + cold. before: dev release (hot + cold split) find 0x35f = 863 0x3d5 + 0x112 = 1255 references_duration 0x62 + 0x22 + 0x8 = 140 0x55 + 0x1f + 0x2a + 0x8 = 166 references_user_type 0x6b + 0x26 + 0x111 = 418 0x65 + 0x1f + 0x32 + 0x11b = 465 after: dev release find 0xd6 + 0x1b4 = 650 0xd2 + 0x1f5 = 711 references_duration 0x13 = 19 0x13 = 19 references_user_type 0x1a = 26 0x21 = 33 Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-03 08:23:21 -07:00
Rafael Ávila de Espíndola	e0065b414e	types: Avoid shared_ptr copies They are somewhat expensive (in code size at least) and not needed everywhere. Inside the getter the variables are 'const data_type&', so we can return that. Everything still works when a copy is needed, but in code that just wants to check a property we avoid the copy. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-03 07:43:35 -07:00
Benny Halevy	bdfb73f67d	scripts/create-relocatable-package: ldd: print executable name in exception Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190903080511.534-1-bhalevy@scylladb.com>	2019-09-03 15:34:38 +03:00
Avi Kivity	294a86122e	Merge "nonroot installer" from Takuya " This is nonroot installer patchset v9. " * 'nonroot_v9' of https://github.com/syuu1228/scylla: dist/common/scripts: support nonroot mode on setup scripts reloc/python3: add install.sh on python relocatable package install.sh: add --nonroot mode dist/common/systemd: untemplataize .service, use drop-in units instead dist/debian: delete debian/.install, debian/*.dirs	2019-09-03 15:33:20 +03:00
Piotr Sarna	7b297865e1	transport: wait for the connections to finish when stopping (#4818 ) During CQL request processing, a gate is used to ensure that the connection is not shut down until all ongoing requests are done. However, the gate might have been left too early if the database was not ready to respond immediately - which could result in trying to respond to an already closed connection later. This issue is solved by postponing leaving the gate until the continuation chain that handles the request is finished. Refs #4808	2019-09-03 14:49:11 +03:00
Avi Kivity	8fb59915bb	Merge "Minor cleanup patches for sstables" from Asias * 'cleanup_sstables' of https://github.com/asias/scylla: sstables: Move leveled_compaction_strategy implementation to source file sstables: Include dht/i_partitioner.hh for dht::partition_range	2019-09-03 14:47:44 +03:00
Takuya ASADA	31ddb2145a	dist/common/scripts: support nonroot mode on setup scripts Since nonroot mode requires to run everything on non-privileged user, most of setup scripts does not able to use nonroot mode. We only provide following functions on nonroot mode: - EC2 check - IO setup - Node exporter installer - Dev mode setup Rest of functions will be skipped on scylla_setup. To implement nonroot mode on setup scripts, scylla_util provides utility functions to abstract difference of directory structure between normal installation and nonroot mode.	2019-09-03 20:06:35 +09:00
Takuya ASADA	cfa8885ae1	reloc/python3: add install.sh on python relocatable package To support nonroot installation on scylla-python3, add install.sh on scylla-python3 relocatable package.	2019-09-03 20:06:30 +09:00
Takuya ASADA	2de14e0800	install.sh: add --nonroot mode This implements the way to install Scylla without requires root privilege, not distribution dependent, does not uses package manager.	2019-09-03 20:06:24 +09:00
Takuya ASADA	cde798dba5	dist/common/systemd: untemplataize *.service, use drop-in units instead Since systemd unit can override parameters using drop-in unit, we don't need mustache template for them. Also, drop --disttype and --target options on install.sh since it does not required anymore, introduce --sysconfdir instead for non-redhat distributions.	2019-09-03 20:06:15 +09:00
Takuya ASADA	49a360f234	dist/debian: delete debian/.install, debian/.dirs Since `ac9b115`, we switched to install.sh on Debian so we don't rely on .deb specific packaging scripts anymore. Signed-off-by: Takuya ASADA <syuu@scylladb.com>	2019-09-03 20:06:09 +09:00
Nadav Har'El	6c4ad93296	api/compaction_manager: do not hold map on the stack Merged patch series by Amnon Heiman: This patch fixes a bug that a map is held on the stack and then is used by a future. Instead, the map is now moved to the relevant lambda function. Fixes #4824	2019-09-01 13:16:34 +03:00
Avi Kivity	e962beea20	toolchain: update to Fedora 30 and gcc 9.2 In Fedora 30 we have a new boost version, so we no longer need to use our patched boost, so we also remove the scylladb/toolchain copr.	2019-09-01 12:05:26 +03:00
Piotr Sarna	23c891923e	main: make sure view_builder doesn't propagate semaphore errors Stopping services which occurs in a destructor of deferred_action should not throw, or it will end the program with terminate(). View builder breaks a semaphore during its shutdown, which results in propagating a broken_semaphore exception, which in turn results in throwing an exception during stop().get(). In order to fix that issue, semaphore exceptions are explicitly ignored, since they're expected to appear during shutdown. Fixes #4875	2019-09-01 11:59:57 +03:00
Tomasz Grabiec	c8f8a9450f	Merge "Improve cpu instruction set support checks" from Avi To prevent termination with SIGILL, tighten the instruction set support checks. First, check for CLMUL too. Second, add a check in scylla_prepare to catch the problem early. Fixes #4921.	2019-08-30 16:54:04 +02:00
Avi Kivity	07010af44c	scylla_prepare: verify processor satisfies instruction set requirements Scylla requires the CLMUL and SSE 4.2 instruction sets and will fail without them. There is a check in main(), but that happens after the code is running and it may already be too late. Add a check in scylla_prepare which runs before the main executable.	2019-08-29 15:34:29 +03:00
Avi Kivity	9579946e72	main: extend CPU feature check to verify that PCLMUL is available Since `79136e895f`, we use the pclmul instruction set, so check it is there.	2019-08-29 15:13:32 +03:00
Gleb Natapov	e61a86bbb2	to_string: Add operator<< overload for std::tuple. Message-Id: <20190829100902.GN21540@scylladb.com>	2019-08-29 13:35:02 +03:00
Rafael Ávila de Espíndola	036f51927c	sstables: Remove unused include Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20190827210424.37848-1-espindola@scylladb.com>	2019-08-28 11:32:44 +03:00
Benny Halevy	869b518dca	sstables: auto-delete unsealed sstables Fixes #4807 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190827082044.27223-1-bhalevy@scylladb.com>	2019-08-28 09:46:17 +03:00
Botond Dénes	969aa22d51	configure.py: promote unused result warning to error Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20190827111428.6829-2-bdenes@scylladb.com>	2019-08-28 09:46:17 +03:00
Botond Dénes	480b42b84f	tests/gossip_test: silence discarded future warning Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20190827111428.6829-1-bdenes@scylladb.com>	2019-08-28 09:46:17 +03:00
Avi Kivity	d85339e734	Update seastar submodule * seastar 20bfd61955...cb7026c16f (2): > net: dpdk: suppress discarded future warning > Merge "Optimize promises in then/then_wrapped" from Rafael	2019-08-28 09:46:17 +03:00
Avi Kivity	f1d73d0c13	Merge "systemd: put scylla processes in systemd slices. #4743 " from Glauber " It is well known that seastar applications, like Scylla, do not play well with external processes: CPU usage from external processes may confuse the I/O and CPU schedulers and create stalls. We have also recently seen that memory usage from other application's anonymous and page cache memory can bring the system to OOM. Linux has a very good infrastructure for resource control contributed by amazingly bright engineers in the form of cgroup controllers. This infrastructure is exposed by SystemD in the form of slices: a hierarchical structure to which controllers can be attached. In true systemd way, the hierarchy is implicit in the filenames of the slice files. a "-" symbol defines the hierarchy, so the files that this patch presents, scylla-server and scylla-helper, essentially create a "scylla" cgroup at the top level with "server" and "helper" children. Later we mark the Services needed to run scylla as belonging to one or the other through the Slice= directive. Scylla DBAs can benefit from this setup by using the systemd-run utility to fire ad-hoc commands. Let's say for example that someone wants to hypothetically run a backup and transfer files to an external object store like S3, making sure that the amount of page cache used won't create swap pressure leading to database timeouts. One can then run something like: sudo systemd-run --uid=id -u scylla --gid=id -g scylla -t --slice=scylla-helper.slice /path/to/my/magical_backup_tool (or even better, the backup tool can itself be a systemd timer) " * 'slices' of https://github.com/glommer/scylla: systemd: put scylla processes in systemd slices. move postinst steps to an external script	2019-08-26 20:16:55 +03:00

1 2 3 4 5 ...

19349 Commits