scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-05 14:33:08 +00:00

Author	SHA1	Message	Date
Tomasz Grabiec	9f5826fd4b	Merge "Use canonical mutations for background schema sync" from Botond Currently the background schema sync (push/pull) uses frozen mutation to send the schema mutations over the wire to the remote node. For this to work correctly, both nodes have to have the exact same schema for the system schema tables, as attempting to unpack the frozen mutation with the wrong schema leads to undefined behaviour. To avoid this and to ensure syncing schema between nodes with different schema table schema versions is defined we migrate the background schema sync to use canonical mutations for the transfer of the schema mutations. Canonical mutations are immune to this problem, as they support deserializing with any version of the schema, older or newer one. The foreground schema sync mechanisms -- the on-demand schema pulls on reads and writes -- already use canonical mutations to transmit the schema mutations. It is important to note that due to this change, column-level incompatibilities between the schema mutations and the schema used to deserialize them will be hidden. This is undesired and should be fixed in a follow-up (#4956). Table level incompatibilities are detected and schema mutations containing such mutations will be rejected just like before. This patch adds canonical mutation support to the two background schema sync verbs: * `DEFINITIONS_UPDATE` (schema push) * `MIGRATION_REQUEST` (schema pull) Both verbs still support the old frozen mutation schema transfer, albeit that path is now much less efficient. After all nodes are upgraded, the pull verb can effectively avoid sending frozen mutations altogether, completely migrating to canonical mutations. Unfortunately this was not possible for the push verb, so that one now has an overhead as it needs to send both the frozen and canonical mutations. Fixes: #4273	2019-09-04 13:58:14 +02:00
Rafael Ávila de Espíndola	000514e7cc	sstable: close file_writer if an exception in thrown The previous code was not exception safe and would eventually cause a file to be destroyed without being closed, causing an assert failure. Unfortunately it doesn't seem to be possible to test this without error injection, since using an invalid directory fails before this code is executed. Fixes #4948 Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20190904002314.79591-1-espindola@scylladb.com>	2019-09-04 13:28:55 +03:00
Botond Dénes	7adc764b6e	messaging_service: add canonical_support to schema pull and push verbs The verbs are: * DEFINITIONS_UPDATE (push) * MIGRATION_REQUEST (pull) Support was added in a backward-compatible way. The push verb, sends both the old frozen mutation parameter, and the new optional canonical mutation parameter. It is expected that new nodes will use the latter, while old nodes will fall-back to the former. The pull verb has a new optional `options` parameter, which for now contains a single flag: `remote_supports_canonical_mutation_retval`. This flag, if set, means that the remote node supports the new canonical mutation return value, thus the old frozen mutations return value can be left empty.	2019-09-04 10:32:44 +03:00
Botond Dénes	d9a8ff15d8	service::migration_manager: add canonical_mutation merge_schema_from() overload Add an overload which takes a vector of canonical mutations. Going forward, this is the overload to use.	2019-09-04 10:32:44 +03:00
Botond Dénes	e02b93cae1	schema_tables: convert_schema_to_mutations: return canonical_mutations In preparation to the schema push/pull migrating to use canonical mutations, convert the method producing the schema mutations to return a vector of canonical mutations. The only user, MIGRATION_REQUEST verb, converts the canonical mutations back to frozen mutations. This is very inefficient, but this path will only be used in mixed clusters. After all nodes are upgraded the verb will be sending the canonical mutations directly instead.	2019-09-04 08:47:20 +03:00
Benny Halevy	bdfb73f67d	scripts/create-relocatable-package: ldd: print executable name in exception Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190903080511.534-1-bhalevy@scylladb.com>	2019-09-03 15:34:38 +03:00
Avi Kivity	294a86122e	Merge "nonroot installer" from Takuya " This is nonroot installer patchset v9. " * 'nonroot_v9' of https://github.com/syuu1228/scylla: dist/common/scripts: support nonroot mode on setup scripts reloc/python3: add install.sh on python relocatable package install.sh: add --nonroot mode dist/common/systemd: untemplataize .service, use drop-in units instead dist/debian: delete debian/.install, debian/*.dirs	2019-09-03 15:33:20 +03:00
Piotr Sarna	7b297865e1	transport: wait for the connections to finish when stopping (#4818 ) During CQL request processing, a gate is used to ensure that the connection is not shut down until all ongoing requests are done. However, the gate might have been left too early if the database was not ready to respond immediately - which could result in trying to respond to an already closed connection later. This issue is solved by postponing leaving the gate until the continuation chain that handles the request is finished. Refs #4808	2019-09-03 14:49:11 +03:00
Avi Kivity	8fb59915bb	Merge "Minor cleanup patches for sstables" from Asias * 'cleanup_sstables' of https://github.com/asias/scylla: sstables: Move leveled_compaction_strategy implementation to source file sstables: Include dht/i_partitioner.hh for dht::partition_range	2019-09-03 14:47:44 +03:00
Takuya ASADA	31ddb2145a	dist/common/scripts: support nonroot mode on setup scripts Since nonroot mode requires to run everything on non-privileged user, most of setup scripts does not able to use nonroot mode. We only provide following functions on nonroot mode: - EC2 check - IO setup - Node exporter installer - Dev mode setup Rest of functions will be skipped on scylla_setup. To implement nonroot mode on setup scripts, scylla_util provides utility functions to abstract difference of directory structure between normal installation and nonroot mode.	2019-09-03 20:06:35 +09:00
Takuya ASADA	cfa8885ae1	reloc/python3: add install.sh on python relocatable package To support nonroot installation on scylla-python3, add install.sh on scylla-python3 relocatable package.	2019-09-03 20:06:30 +09:00
Takuya ASADA	2de14e0800	install.sh: add --nonroot mode This implements the way to install Scylla without requires root privilege, not distribution dependent, does not uses package manager.	2019-09-03 20:06:24 +09:00
Takuya ASADA	cde798dba5	dist/common/systemd: untemplataize *.service, use drop-in units instead Since systemd unit can override parameters using drop-in unit, we don't need mustache template for them. Also, drop --disttype and --target options on install.sh since it does not required anymore, introduce --sysconfdir instead for non-redhat distributions.	2019-09-03 20:06:15 +09:00
Takuya ASADA	49a360f234	dist/debian: delete debian/.install, debian/.dirs Since `ac9b115`, we switched to install.sh on Debian so we don't rely on .deb specific packaging scripts anymore. Signed-off-by: Takuya ASADA <syuu@scylladb.com>	2019-09-03 20:06:09 +09:00
Nadav Har'El	6c4ad93296	api/compaction_manager: do not hold map on the stack Merged patch series by Amnon Heiman: This patch fixes a bug that a map is held on the stack and then is used by a future. Instead, the map is now moved to the relevant lambda function. Fixes #4824	2019-09-01 13:16:34 +03:00
Avi Kivity	e962beea20	toolchain: update to Fedora 30 and gcc 9.2 In Fedora 30 we have a new boost version, so we no longer need to use our patched boost, so we also remove the scylladb/toolchain copr.	2019-09-01 12:05:26 +03:00
Piotr Sarna	23c891923e	main: make sure view_builder doesn't propagate semaphore errors Stopping services which occurs in a destructor of deferred_action should not throw, or it will end the program with terminate(). View builder breaks a semaphore during its shutdown, which results in propagating a broken_semaphore exception, which in turn results in throwing an exception during stop().get(). In order to fix that issue, semaphore exceptions are explicitly ignored, since they're expected to appear during shutdown. Fixes #4875	2019-09-01 11:59:57 +03:00
Tomasz Grabiec	c8f8a9450f	Merge "Improve cpu instruction set support checks" from Avi To prevent termination with SIGILL, tighten the instruction set support checks. First, check for CLMUL too. Second, add a check in scylla_prepare to catch the problem early. Fixes #4921.	2019-08-30 16:54:04 +02:00
Avi Kivity	07010af44c	scylla_prepare: verify processor satisfies instruction set requirements Scylla requires the CLMUL and SSE 4.2 instruction sets and will fail without them. There is a check in main(), but that happens after the code is running and it may already be too late. Add a check in scylla_prepare which runs before the main executable.	2019-08-29 15:34:29 +03:00
Avi Kivity	9579946e72	main: extend CPU feature check to verify that PCLMUL is available Since `79136e895f`, we use the pclmul instruction set, so check it is there.	2019-08-29 15:13:32 +03:00
Gleb Natapov	e61a86bbb2	to_string: Add operator<< overload for std::tuple. Message-Id: <20190829100902.GN21540@scylladb.com>	2019-08-29 13:35:02 +03:00
Rafael Ávila de Espíndola	036f51927c	sstables: Remove unused include Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20190827210424.37848-1-espindola@scylladb.com>	2019-08-28 11:32:44 +03:00
Benny Halevy	869b518dca	sstables: auto-delete unsealed sstables Fixes #4807 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190827082044.27223-1-bhalevy@scylladb.com>	2019-08-28 09:46:17 +03:00
Botond Dénes	969aa22d51	configure.py: promote unused result warning to error Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20190827111428.6829-2-bdenes@scylladb.com>	2019-08-28 09:46:17 +03:00
Botond Dénes	480b42b84f	tests/gossip_test: silence discarded future warning Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20190827111428.6829-1-bdenes@scylladb.com>	2019-08-28 09:46:17 +03:00
Avi Kivity	d85339e734	Update seastar submodule * seastar 20bfd61955...cb7026c16f (2): > net: dpdk: suppress discarded future warning > Merge "Optimize promises in then/then_wrapped" from Rafael	2019-08-28 09:46:17 +03:00
Avi Kivity	f1d73d0c13	Merge "systemd: put scylla processes in systemd slices. #4743 " from Glauber " It is well known that seastar applications, like Scylla, do not play well with external processes: CPU usage from external processes may confuse the I/O and CPU schedulers and create stalls. We have also recently seen that memory usage from other application's anonymous and page cache memory can bring the system to OOM. Linux has a very good infrastructure for resource control contributed by amazingly bright engineers in the form of cgroup controllers. This infrastructure is exposed by SystemD in the form of slices: a hierarchical structure to which controllers can be attached. In true systemd way, the hierarchy is implicit in the filenames of the slice files. a "-" symbol defines the hierarchy, so the files that this patch presents, scylla-server and scylla-helper, essentially create a "scylla" cgroup at the top level with "server" and "helper" children. Later we mark the Services needed to run scylla as belonging to one or the other through the Slice= directive. Scylla DBAs can benefit from this setup by using the systemd-run utility to fire ad-hoc commands. Let's say for example that someone wants to hypothetically run a backup and transfer files to an external object store like S3, making sure that the amount of page cache used won't create swap pressure leading to database timeouts. One can then run something like: sudo systemd-run --uid=id -u scylla --gid=id -g scylla -t --slice=scylla-helper.slice /path/to/my/magical_backup_tool (or even better, the backup tool can itself be a systemd timer) " * 'slices' of https://github.com/glommer/scylla: systemd: put scylla processes in systemd slices. move postinst steps to an external script	2019-08-26 20:16:55 +03:00
Benny Halevy	20083be9f6	sstables: delete_atomically: fix misplaced parenthesis in pending_delete_log warning message Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20190818064637.9207-1-bhalevy@scylladb.com>	2019-08-26 19:50:21 +03:00
Avi Kivity	b9e9d7d379	Merge "Resolve discarded future warnings" from Botond " The warning for discarded futures will only become useful, once we can silence all present warnings and flip the flag to make it become error. Then it will start being useful in finding new, accidental discarding of futures. This series silences all remaining warnings in the Scylla codebase. For those cases where it was obvious that the future is discarded on purpose, the author taking all necessary precaution (handling exception) the warning was simply silenced by casting the future to void and adding a relevant comment. Where the discarding seems to have been done in error, I have fixed the code to not discard it. To the rest of the sites I added a FIXME to fix the discarding. " * 'resolve-discarded-future-warnings/v4.2' of https://github.com/denesb/scylla: treewide: silence discarded future warnings for questionable discards treewide: silence discarded future warnings for legit discards tests: silence discarded future warnings tests/cql_query_test.cc: convert some tests to thread	2019-08-26 19:40:25 +03:00
Botond Dénes	136fc856c5	treewide: silence discarded future warnings for questionable discards This patches silences the remaining discarded future warnings, those where it cannot be determined with reasonable confidence that this was indeed the actual intent of the author, or that the discarding of the future could lead to problems. For all those places a FIXME is added, with the intent that these will be soon followed-up with an actual fix. I deliberately haven't fixed any of these, even if the fix seems trivial. It is too easy to overlook a bad fix mixed in with so many mechanical changes.	2019-08-26 19:28:43 +03:00
Botond Dénes	fddd9a88dd	treewide: silence discarded future warnings for legit discards This patch silences those future discard warnings where it is clear that discarding the future was actually the intent of the original author, and they did the necessary precautions (handling errors). The patch also adds some trivial error handling (logging the error) in some places, which were lacking this, but otherwise look ok. No functional changes.	2019-08-26 18:54:44 +03:00
Botond Dénes	cff4c4932d	tests: silence discarded future warnings	2019-08-26 18:54:44 +03:00
Botond Dénes	486fa8c10c	tests/cql_query_test.cc: convert some tests to thread Some tests are currently discarding futures unjustifiably, however adding code to wait on these futures is quite inconvenient due to the continuation style code of these tests. Convert them to run in a seastar thread to make the fix easier.	2019-08-26 18:54:44 +03:00
Tomasz Grabiec	ac5ff4994a	service: Announce the new schema version when features are enabled Introduced in `c96ee98`. We call update_schema_version() after features are enabled and we recalculate the schema version. This method is not updating gossip though. The node will still use it's database::version() to decide on syncing, so it will not sync and stay inconsistent in gossip until the next schema change. We should call updatE_schema_version_and_announce() instead so that the gossip state is also updated. There is no actual schema inconsistency, but the joining node will think there is and will wait indefinitely. Making a random schema change would unbock it. Fixes #4647. Message-Id: <1566825684-18000-1-git-send-email-tgrabiec@scylladb.com>	2019-08-26 17:54:59 +03:00
Avi Kivity	a7b82af4c3	Update seastar submodule * seastar afc5bbf511...20bfd61955 (18): > reactor: closing file used to check if direct_io is supported > future: set_coroutine(): s/state()/_state/ > tests/perf/perf_test.hh: suppress discarded future warning > tests: rpc: fix memory leak in timeout wraparound tests > Revert "future-util: reduce allocations and continuations in parallel_for_each" > reactor: fix rename_priority_class() build failure in C++14 mode > future: mark future_state_base::failed() as unlikely > future-util: reduce allocations and continuations in parallel_for_each > future-utils: generalize when_all_estimate_vector_capacity() > output_stream: Add comment on sequentiality > docs/tutorial.md: minor cleanups in first section > core: fix a race in execution stages (Fixes #4856, fixes #4766) > semaphore: use semaphore's clock type in with_semaphore()/get_units() > future: fix doxygen documentation for promise<> > sharded: fixed detecting stop method when building with clang > reactor: fixed locking error in rename_priority_class > Assert that append_challenged_posix_file_impl are closed. > rpc: correctly handle huge timeouts	2019-08-26 15:37:58 +03:00
Asias He	3ea1255020	storage_service: Use sleep_abortable instead of sleep (#4697 ) Make the sleep abortable so that it is able to break the loop during shutdown. Fixes #4885	2019-08-26 13:35:44 +03:00
Asias He	2f24fd9106	sstables: Move leveled_compaction_strategy implementation to source file It is better than putting everything in header.	2019-08-26 16:49:48 +08:00
Asias He	b69138c4e4	sstables: Include dht/i_partitioner.hh for dht::partition_range Get rid of one FIXME.	2019-08-26 16:35:18 +08:00
Nadav Har'El	b60d201a11	API: column_family.cc Add get_built_indexes implmentation Merged patch series from Amnon Heiman amnon@scylladb.com This Patch adds an implementation of the get built index API and remove a FIXME. The API returns a list of secondary indexes belongs to a column family and have already been fully built. Example: CREATE KEYSPACE scylla_demo WITH replication = {'class': 'SimpleStrategy', 'replication_factor': '1'}; CREATE TABLE scylla_demo.mytableID ( uid uuid, text text, time timeuuid, PRIMARY KEY (uid, time) ); CREATE index on scylla_demo.mytableID (time); $ curl -X GET 'http://localhost:10000/column_family/built_indexes/scylla_demo%3Amytableid' ["mytableid_time_idx"]	2019-08-25 18:37:44 +03:00
Amnon Heiman	2d3185fa7d	column_family.cc: remove unhandle future The sum_ratio struct is a helper struct that is used when calculating ratio over multiple shards. Originally it was created thinking that it may need to use future, in practice it was never used and the future was ignore. This patch remove the future from the implementation and reduce an unhandle future warning from the compilation. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2019-08-25 16:51:14 +03:00
Amnon Heiman	21dee3d8ef	API:column_family.cc Add get_build_index implmentation This Patch adds an implementation of the get build index API and remove a FIXME. The API returns the list of the built secondary indexes belongs to a column family. Example: CREATE KEYSPACE scylla_demo WITH replication = {'class': 'SimpleStrategy', 'replication_factor': '1'}; CREATE TABLE scylla_demo.mytableID ( uid uuid, text text, time timeuuid, PRIMARY KEY (uid, time) ); CREATE index on scylla_demo.mytableID (time); $ curl -X GET 'http://localhost:10000/column_family/built_indexes/scylla_demo%3Amytableid' ["mytableid_time_idx"] Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2019-08-25 16:46:49 +03:00
Juliana Oliveira	711ed76c82	auth: standard_role_manager: read null columns as false When a role is created through the `create role` statement, the 'is_superuser' and 'can_login' columns are set to false by default. Likewise, `list roles`, `alter roles` and `* roles` operations expect to find a boolean when reading the same columns. This is not the case, though, when a user directly inserts to `system_auth.roles` and doesn't set those columns. Even though manually creating roles is not a desired day-to-day operation, it is an insert just like any other and it should work. `* roles` operations, on the other hand, are not prepared for this deviations. If a user manually creates a role and doesn't set boolean values to those columns, `* roles` will return all sorts of errors. This happens because `* roles` is explicitly expecting a boolean and casting for it. This patch makes `* roles` more friendly by considering the boolean variable `false` - inside `* roles` context - if the actual value is `null`; it won't change the `null` value. Fixes #4280 Signed-off-by: Juliana Oliveira <juliana@scylladb.com> Message-Id: <20190816032617.61680-1-juliana@scylladb.com>	2019-08-25 11:52:43 +03:00
Pekka Enberg	118a141f5d	scylla_blocktune.py: Kill btrfs related FIXME The scylla_blocktune.py has a FIXME for btrfs from 2016, which is no longer relevant for Scylla deployments, as Red Hat dropped support for the file system in 2017. Message-Id: <20190823114013.31112-1-penberg@scylladb.com>	2019-08-24 20:40:08 +03:00
Botond Dénes	18581cfb76	multishard_mutation_query: create_readr(): use the caller's priority class The priority class the shard reader was created with was hardcoded to be `service::get_local_sstable_query_read_priority()`. At the time this code was written, priority classes could not be passed to other shards, so this method, receiving its priority class parameters from another shard, could not use it. This is now fixed, so we can just use whatever the caller wants us to use. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20190823115111.68711-1-bdenes@scylladb.com>	2019-08-23 16:10:43 +02:00
Tomasz Grabiec	080989d296	Merge "cql3: cartesian product limits" from Avi Cartesian products (generated by IN restrictions) can grow very large, even for short queries. This can overwhelm server resources. Add limit checking for cartesian products, and configuration items for users that are not satisfied with the default of 100 records fetched. Fixes #4752. Tests: unit (dev), manual test with SIGHUP.	2019-08-21 19:35:59 +02:00
Avi Kivity	67b0d379e0	main: add glue between db::config and cql3::cql_config Copy values between the flat db::config and the hierarchical cql_config, adding observers to keep the values updated.	2019-08-21 19:35:59 +02:00
Avi Kivity	8c7ad1d4cd	cql: single_column_clustering_key_restrictions: limit cartesian products Cartesian products (via IN restrictions) make it easy to generate huge primary key sets with simple queries, overflowing server resources. Limit them in the coordinator and report an exception instead of trying to execute a query that would consume all of our memory. A unit test is added.	2019-08-21 19:35:59 +02:00
Avi Kivity	3a44fa9988	cql3, treewide: introduce empty cql3::cql_config class and propagate it We need a way to configure the cql interpreter and runtime. So far we relied on accessing the configuration class via various backdoors, but that causes its own problems around initialization order and testability. To avoid that, this patch adds an empty cql_config class and propagates it from main.cc (and from tests) to the cql interpreter via the query_options class, which is already passed everywhere. Later patches will fill it with contents.	2019-08-21 19:35:59 +02:00
Rafael Ávila de Espíndola	86c29256eb	types: Fix references_user_type This was broken since the type refactoring. It was checking the static type, which is always abstract_type. Unfortunately we only had dtests for this. This can probably be optimized to avoid the double switch over kind, but it is probably better to do the simple fix first. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20190821155354.47704-1-espindola@scylladb.com>	2019-08-21 19:13:59 +03:00
Dejan Mircevski	ea9d358df9	cql3: Optimize LIKE regex construction Currently we create a regex from the LIKE pattern for every row considered during filtering, even though the pattern is always the same. This is wasteful, especially since we require costly optimization in the regex compiler. Fix it by reusing the regex whenever the pattern is unchanged since the last call. Tests: unit (dev) Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-08-21 16:45:47 +03:00

1 2 3 4 5 ...

19326 Commits