scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-08 16:03:20 +00:00

Author	SHA1	Message	Date
Botond Dénes	cf24f4fe30	imr: move documentation to docs/ Where all the other documentation is, and hence where people would be looking for it. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20191128144612.378244-1-bdenes@scylladb.com>	2019-11-28 16:47:52 +02:00
Avi Kivity	36dd0140a8	Update seastar submodule * seastar 5c25de907a...8eb6a67a4b (1): > util/backtrace.hh: add missing print.hh include	2019-11-28 16:47:16 +02:00
Benny Halevy	7aef39e400	tracing: one_session_records: keep local tracing ptr Similar to trace_state keep shared_ptr<tracing> _local_tracing_ptr in one_session_records when constructed so it can be used during shutdown. Fixes #5243 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-11-28 15:24:10 +01:00
Gleb Natapov	75499896ab	client_state: store _user as optional instead of shared_ptr _user cannot outlive client_state class instance, so there is no point in holding it in shared_ptr. Tested: debug test.py and dtest auth_test.py Message-Id: <20191128131217.26294-5-gleb@scylladb.com>	2019-11-28 15:48:59 +02:00
Gleb Natapov	1538cea043	cql: modification_statement: store _restrictions as optional instead of shared_ptr _restrictions can be optional since its lifetime is managed by modification_statement class explicitly. Message-Id: <20191128131217.26294-4-gleb@scylladb.com>	2019-11-28 15:48:54 +02:00
Gleb Natapov	ce5d6d5eee	storage_service: store thrift server as an optional instead of shared_ptr Only do_stop_rpc_server uses the shared_ptr to prolong server's lifetime until stop() completes, but do_with() can be used to achieve the same. Message-Id: <20191128131217.26294-3-gleb@scylladb.com>	2019-11-28 15:48:51 +02:00
Gleb Natapov	b9b99431a8	storage_service: store cql server as an optional instead of shared_ptr Only do_stop_native_transport() uses the shared_ptr to prolong server's lifetime until stop() completes, but do_with() can be used to achieve the same. Message-Id: <20191128131217.26294-2-gleb@scylladb.com>	2019-11-28 15:48:47 +02:00
Avi Kivity	2b7e97514a	Update seastar submodule * seastar 6f0ef32514...5c25de907a (7): > shared_future: Fix crash when all returned futures time out Fixes #5322. > future: don't create temporaries on get_value(). > reactor: lower the default stall threshold to 200ms > reactor: Simplify network initialization > reactor: Replace most std::function with noncopyable_function > futures: Avoid extra moves in SEASTAR_TYPE_ERASE_MORE mode > inet_address: Make inet_address == operator ignore scope (again)	2019-11-28 14:48:01 +02:00
Nadav Har'El	fde336a882	Merged "5139 minmax bad printing" Merged pull request https://github.com/scylladb/scylla/pull/5311 from Juliusz Stasiewicz: This is a partial solution to #5139 (only for two types) because of the above and because collections are much harder to do. They are coming in a separate PR.	2019-11-28 14:06:43 +02:00
Juliusz Stasiewicz	3b9ebca269	tests/cql_query_test: add test for aggregates on inet+time_type This is a test to max(), min() and count() system functions on the arguments of types: `net::inet_address` and `time_native_type`.	2019-11-28 11:20:43 +01:00
Juliusz Stasiewicz	9c23d89531	cql3/functions: add missing min/max/count for inet and time type References #5139. Aggregate functions, like max(), when invoked on `inet_address' and `time_native_type' used to choose max(blob)->blob overload, with casting of argument and result to bytes. This is because appropriate calls to `aggregate_fcts::make_XXX_function()' were missing. This commit adds them. Functioning remains the same but now clients see user-friendly representations of aggregate result, not binary. Comparing inet addresses without inet::operator< is performed by trick, where ADL is bypassed by wrapping the name of std::min/max and providing an overload of wrapper on inet type.	2019-11-28 11:18:31 +01:00
Pavel Emelyanov	8532093c61	cql: The cql_server does not need proxy reference Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20191127153842.4098-1-xemul@scylladb.com>	2019-11-28 10:58:46 +01:00
Amos Kong	d4a26f2ad0	scylla_util: get_scylla_dirs: return default data/commitlog directories if they aren't set (#5358 ) The default values of data_file_directories and commitlog_directory were commented by commit `e0f40ed16a`. It causes scylla_util.py:get_scylla_dirs() to fail in checking the values. This patch changed get_scylla_dirs() to return default data/commitlog directories if they aren't set. Fixes #5358 Reviewed-by: Pavel Emelyanov <xemul@scylladb.com> Signed-off-by: Amos Kong <amos@scylladb.com>	2019-11-27 13:52:05 +02:00
Nadav Har'El	cb1ed5eab2	alternator-test: test Query's Limit parameter Add a test, test_query.py::test_query_limit, to verify that the Limit parameter correctly limits the number of rows returned by the Query. This was supposed to already work correctly - but we never had a test for it. As we hoped, the test passes (on both Alternator and DynamoDB). Another test, test_query.py::test_query_limit_paging, verifies that paging can be done with any setting of Limit. We already had tests for paging of the Scan operation, but not for the Query operation. Refs #5153 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2019-11-27 12:27:26 +01:00
Nadav Har'El	c01ca661a0	alternator-test: Select parameter of Query and Scan This is a comprehensive test for the "Select" parameter of Query and Scan operations, but only for the base-table case, not index, so another future patch should add similar tests in test_gsi.py and test_lsi.py as well. The main use of the Select parameter is to allow returning just the count of items, instead of their content, but it also has other esoteric options, all of which we test here. The test currently succeeds on AWS DynamoDB, demonstrating that the test is correct, but fails on Alternator because the "Select" parameter is not yet supported. So the test is marked xfail. Refs #5058 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2019-11-27 12:22:33 +01:00
Botond Dénes	9d09f57ba5	scylla-gdb.py: scylla_smp_queues: use lazy initalization Currently the command tries to read all seastar smp queues in its initialization code in the constructor. This constructor is run each time `scylla-gdb.py` is sourced in `gdb` which leads to slowdowns and sometimes also annoying errors because the sourcing happens in the wrong context and seastar symbols are not available. Avoid this by running this initializing code lazily, on the first invocation. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20191127095408.112101-1-bdenes@scylladb.com>	2019-11-27 12:04:57 +01:00
Tomasz Grabiec	87b72dad3e	Merge "treewide: add missing const qualifiers" from Pavel Solodovnikov This patchset adds missing "const" function qualifiers throughout the Scylla code base, which would make code less error-prone. The changeset incorporates Kostja's work regarding const qualifiers in the cql code hierarchy along with a follow-up patch addressing the review comment of the corresponding patch set (the patch subject is "cql: propagate const property through prepared statement tree.").	2019-11-27 10:56:20 +01:00
Rafael Ávila de Espíndola	91b43f1f06	dbuild: fix podman with selinux enabled With this change I am able to run tests using docker-podman. The option also exists in docker. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20191126194101.25221-1-espindola@scylladb.com>	2019-11-26 21:50:56 +02:00
Rafael Ávila de Espíndola	480055d3b5	dbuild: Fix missing docker options With the recent changes docker was missing a few options. In particular, it was missing -u. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20191126194347.25699-1-espindola@scylladb.com>	2019-11-26 21:45:31 +02:00
Rafael Ávila de Espíndola	c0a2cd70ff	lua: fix test with boost 1.66 The boost 1.67 release notes says Changed maximum supported year from 10000 to 9999 to resolve various issues So change the test to use a larger number so that we get an exception with both boost 1.66 and boost 1.67. Fixes #5344 Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20191126180327.93545-1-espindola@scylladb.com>	2019-11-26 21:17:15 +02:00
Pavel Solodovnikov	55a1d46133	cql: some more missing const qualifiers There are several virtual functions in public interfaces named "is_*" that clearly should be marked as "const", so fix that.	2019-11-26 17:57:51 +03:00
Pavel Solodovnikov	412f1f946a	cql: remove "mutable" on _opts in select_statement _opts initialization can be safely done in the constructor, hence no need to make it mutable.	2019-11-26 17:55:10 +03:00
Piotr Sarna	d90dbd6ab0	Merge "support podman as a replacement to docker" from Avi Docker on Fedora 31 is flakey, and is not supported at all on RHEL 8. Podman is a drop-in replacement for docker; this series adds support for using podman in dbuild. Apart from actually working on Fedora 31 hosts, podman is nicer in being more secure and not requiring a daemon. Fixes #5332	2019-11-26 15:17:49 +01:00
Tomasz Grabiec	5c9fe83615	Merge "Sanitize sub-modules shutting down" from Pavel As suggested in issue #4586 here is the helper that prints "shutting down foo" message, then shuts the foo down, then prints the "[it] was successull" one. In between it catches the exception (if any) and warns this in logs. By "then" I mean literally then, not the seastar's then() :) Fixes: #4586	2019-11-26 15:14:22 +02:00
Piotr Sarna	9c5a5a5ac2	treewide: add names to semaphores By default, semaphore exceptions bring along very little context: either that a semaphore was broken or that it timed out. In order to make debugging easier without introducing significant runtime costs, a notion of named semaphore is added. A named semaphore is simply a semaphore with statically defined name, which is present in its errors, bringing valuable context. A semaphore defined as: auto sem = semaphore(0); will present the following message when it breaks: "Semaphore broken" However, a named semaphore: auto named_sem = named_semaphore(0, named_semaphore_exception_factory{"io_concurrency_sem"}); will present a message with at least some debugging context: "Semaphore broken: io_concurrency_sem" It's not much, but it would really help in pinpointing bugs without having to inspect core dumps. At the same time, it does not incur any costs for normal semaphore operations (except for its creation), but instead only uses more CPU in case an error is actually thrown, which is considered rare and not to be on the hot path. Refs #4999 Tests: unit(dev), manual: hardcoding a failure in view building code	2019-11-26 15:14:21 +02:00
Avi Kivity	6fbb724140	conf: remove unsupported options from scylla.yaml (#5299 ) These unsupported options do nothing except to confuse users who try to tune them. Options removed: hinted_handoff_throttle_in_kb max_hints_delivery_threads batchlog_replay_throttle_in_kb key_cache_size_in_mb key_cache_save_period key_cache_keys_to_save row_cache_size_in_mb row_cache_save_period row_cache_keys_to_save counter_cache_size_in_mb counter_cache_save_period counter_cache_keys_to_save memory_allocator saved_caches_directory concurrent_reads concurrent_writes concurrent_counter_writes file_cache_size_in_mb index_summary_capacity_in_mb index_summary_resize_interval_in_minutes trickle_fsync trickle_fsync_interval_in_kb internode_authenticator native_transport_max_threads native_transport_max_concurrent_connections native_transport_max_concurrent_connections_per_ip rpc_server_type rpc_min_threads rpc_max_threads rpc_send_buff_size_in_bytes rpc_recv_buff_size_in_bytes internode_send_buff_size_in_bytes internode_recv_buff_size_in_bytes thrift_framed_transport_size_in_mb concurrent_compactors compaction_throughput_mb_per_sec sstable_preemptive_open_interval_in_mb inter_dc_stream_throughput_outbound_megabits_per_sec cross_node_timeout streaming_socket_timeout_in_ms dynamic_snitch_update_interval_in_ms dynamic_snitch_reset_interval_in_ms dynamic_snitch_badness_threshold request_scheduler request_scheduler_options throttle_limit default_weight weights request_scheduler_id	2019-11-26 15:14:21 +02:00
Amos Kong	817f34d1a9	ami: support new aws instance types: c5d, m5d, m5ad, r5d, z1d (#5330 ) Currently scylla_io_setup will skip in scylla_setup, because we didn't support those new instance types. I manually executed scylla_io_setup, and the scylla-server started and worked well. Let's apply this patch first, then check if there is some new problem in ami-test. Signed-off-by: Amos Kong <amos@scylladb.com>	2019-11-26 15:14:21 +02:00
Konstantin Osipov	90346236ac	cql: propagate const property through prepared statement tree. cql_statement is a class representing a prepared statement in Scylla. It is used concurrently during execution, so it is important that its change is not changed by execution. Add const qualifier to the execution methods family, throghout the cql hierarchy. Mark a few places which do mutate prepared statement state during execution as mutable. While these are not affecting production today, as code ages, they may become a source of latent bugs and should be moved out of the prepared state or evaluated at prepare eventually: cf_property_defs::_compaction_strategy_class list_permissions_statement::_resource permission_altering_statement::_resource property_definitions::_properties select_statement::_opts	2019-11-26 14:18:17 +03:00
Pavel Solodovnikov	2f442f28af	treewide: add const qualifiers throughout the code base	2019-11-26 02:24:49 +03:00
Pavel Emelyanov	50a1ededde	main: Remove now unused defer-with-log helper Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-25 18:47:03 +03:00
Pavel Emelyanov	a0f92d40ee	main: Shut down sighup handler with verbose helper And (!) fix the misprinted variable name. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-25 18:47:03 +03:00
Pavel Emelyanov	0719369d83	repair: Remove extra logging on shutdown The shutdown start/finish messages are already printed in verbose_shutdown() Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-25 18:47:03 +03:00
Pavel Emelyanov	2d64fc3a3e	main: Shut down database with verbose_shutdown helper Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-25 18:47:03 +03:00
Pavel Emelyanov	636c300db5	main: Shut down prometheus with verbose_shutdown() Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> --- v2: - Have stop easrlier so that exception in start/listen do not prevent prometheu.stop from calling	2019-11-25 18:47:03 +03:00
Pavel Emelyanov	804b152527	main: Sanitize shutting down callbacks As suggested in issue #4586 here is the helper that prints "shutting down foo" message, then shuts the foo down, then prints the "shutting down foo was successfull". In between it catches the exception (if any) and warns this in logs. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-25 18:45:49 +03:00
Nadav Har'El	4160b3630d	Merge "Return preimage from CDC only when it's enabled" Merged pull request https://github.com/scylladb/scylla/pull/5218 from Piotr Jastrzębski: Users should be able to decide whether they need preimage or not. There is already an option for that but it's not respected by the implementation. This PR adds support for this functionality. Tests: unit(dev). Individual patches: cdc: Don't take storage_proxy as transformer::pre_image_select param cdc::append_log_mutations: use do_with instead of shared_ptr cdc::append_log_mutations: fix undefined behavior cdc: enable preimage in test_pre_image_logging test cdc: Return preimage only when it's requested cdc: test both enabled and disabled preimage in test_pre_image_logging	2019-11-25 14:32:17 +02:00
Pavel Emelyanov	f6ac969f1e	mm: Stop migration manager Before stopping the db itself, stop the migration service. It must be stopped before RPC, but RPC is not stopped yet itself, so we should be safe here. Here's the tail of the resulting logs: INFO 2019-11-20 11:22:35,193 [shard 0] init - shutdown migration manager INFO 2019-11-20 11:22:35,193 [shard 0] migration_manager - stopping migration service INFO 2019-11-20 11:22:35,193 [shard 1] migration_manager - stopping migration service INFO 2019-11-20 11:22:35,193 [shard 0] init - Shutdown database started INFO 2019-11-20 11:22:35,193 [shard 0] init - Shutdown database finished INFO 2019-11-20 11:22:35,193 [shard 0] init - stopping prometheus API server INFO 2019-11-20 11:22:35,193 [shard 0] init - Scylla version 666.development-0.20191120.25820980f shutdown complete. Also -- stop the mm on drain before the commitlog it stopped. [Tomasz: mm needs the cl because pulling schema changes from other nodes involves applying them into the database. So cl/db needs to be stopped after mm is stopped.] The drain logs would look like ... INFO 2019-11-25 11:00:40,562 [shard 0] migration_manager - stopping migration service INFO 2019-11-25 11:00:40,562 [shard 1] migration_manager - stopping migration service INFO 2019-11-25 11:00:40,563 [shard 0] storage_service - DRAINED: and then on stop ... INFO 2019-11-25 11:00:46,427 [shard 0] init - shutdown migration manager INFO 2019-11-25 11:00:46,427 [shard 0] init - Shutdown database started INFO 2019-11-25 11:00:46,427 [shard 0] init - Shutdown database finished INFO 2019-11-25 11:00:46,427 [shard 0] init - stopping prometheus API server INFO 2019-11-25 11:00:46,427 [shard 0] init - Scylla version 666.development-0.20191125.3eab6cd54 shutdown complete. Fixes #5300 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20191125080605.7661-1-xemul@scylladb.com>	2019-11-25 12:59:01 +01:00
Asias He	6ec602ff2c	repair: Fix rx_hashes_nr metrics (#5213 ) In get_full_row_hashes_with_rpc_stream and repair_get_row_diff_with_rpc_stream_process_op which were introduced in the "Repair switch to rpc stream" series, rx_hashes_nr metrics are not updated correctly. In the test we have 3 nodes and run repair on node3, we makes sure the following metrics are correct. assertEqual(node1_metrics['scylla_repair_tx_hashes_nr'] + node2_metrics['scylla_repair_tx_hashes_nr'], node3_metrics['scylla_repair_rx_hashes_nr']) assertEqual(node1_metrics['scylla_repair_rx_hashes_nr'] + node2_metrics['scylla_repair_rx_hashes_nr'], node3_metrics['scylla_repair_tx_hashes_nr']) assertEqual(node1_metrics['scylla_repair_tx_row_nr'] + node2_metrics['scylla_repair_tx_row_nr'], node3_metrics['scylla_repair_rx_row_nr']) assertEqual(node1_metrics['scylla_repair_rx_row_nr'] + node2_metrics['scylla_repair_rx_row_nr'], node3_metrics['scylla_repair_tx_row_nr']) assertEqual(node1_metrics['scylla_repair_tx_row_bytes'] + node2_metrics['scylla_repair_tx_row_bytes'], node3_metrics['scylla_repair_rx_row_bytes']) assertEqual(node1_metrics['scylla_repair_rx_row_bytes'] + node2_metrics['scylla_repair_rx_row_bytes'], node3_metrics['scylla_repair_tx_row_bytes']) Tests: repair_additional_test.py:RepairAdditionalTest.repair_almost_synced_3nodes_test Fixes: #5339 Backports: 3.2	2019-11-25 13:57:37 +02:00
Piotr Jastrzebski	2999cb5576	cdc: test both enabled and disabled preimage in test_pre_image_logging Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-11-25 12:43:39 +01:00
Piotr Jastrzebski	222b94c707	cdc: Return preimage only when it's requested Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-11-25 12:43:39 +01:00
Piotr Jastrzebski	c94a5947b7	cdc: enable preimage in test_pre_image_logging test Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-11-25 12:43:39 +01:00
Piotr Jastrzebski	595c9f9d32	cdc::append_log_mutations: fix undefined behavior The code was iterating over a collection that was modified at the same time. Iterators were used for that and collection modification can invalidate all iterators. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-11-25 12:43:39 +01:00
Piotr Jastrzebski	f0f44f9c51	cdc::append_log_mutations: use do_with instead of shared_ptr This will not only safe some allocations but also improve code readability. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-11-25 12:43:39 +01:00
Piotr Jastrzebski	b8d9158c21	cdc: Don't take storage_proxy as transformer::pre_image_select param transformer has access to storage_proxy through its _ctx field. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-11-25 12:43:39 +01:00
Nadav Har'El	3eab6cd549	Merged "toolchain: update to Fedora 31" Merged pull request https://github.com/scylladb/scylla/pull/5310 from Avi Kivity: This is a minor update as gcc and boost versions did not change. A noteable update is patchelf 0.10, which adds support to large binaries. A few minor issues exposed by the update are fixed in preparatory patches. Patches: dist: rpm: correct systemd post-uninstall scriptlet build: force xz compression on rpm binary payload tools: toolchain: update to Fedora 31	2019-11-24 13:38:45 +02:00
Tomasz Grabiec	e3d025d014	row_cache: Fix abort on bad_alloc during cache update Since `90d6c0b`, cache will abort when trying to detach partition entries while they're updated. This should never happen. It can happen though, when the update fails on bad_alloc, because the cleanup guard invalidates the cache before it releases partition snapshots (held by "update" coroutine). Fix by destroying the coroutine first. Fixes #5327. Tests: - row_cache_test (dev) Message-Id: <1574360259-10132-1-git-send-email-tgrabiec@scylladb.com>	2019-11-24 12:06:51 +02:00
Rafael Ávila de Espíndola	8599f8205b	rpmbuild: don't use dwz By default rpm uses dwz to merge the debug info from various binaries. Unfortunately, it looks like addr2line has not been updated to handle this: // This works $ addr2line -e build/release/scylla 0x1234567 $ dwz -m build/release/common.debug build/release/scylla.debug build/release/iotune.debug // now this fails $ addr2line -e build/release/scylla 0x1234567 I think the issue is https://sourceware.org/bugzilla/show_bug.cgi?id=23652 Fixes #5289 Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20191123015734.89331-1-espindola@scylladb.com>	2019-11-24 11:35:29 +02:00
Rafael Ávila de Espíndola	25d5d39b3c	reloc: Force using sha1 for build-ids The default build-id used by lld is xxhash, which is 8 bytes long. rpm requires build-ids to be at least 16 bytes long (https://github.com/rpm-software-management/rpm/issues/950). We force using sha1 for now. That has no impact in gold and bfd since that is their default. We set it in here instead of configure.py to not slow down regular builds. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20191123020801.89750-1-espindola@scylladb.com>	2019-11-24 11:35:29 +02:00
Rafael Ávila de Espíndola	b5667b9c31	build: don't compress debug info in executables By default we were compressing debug info only in release executables. The idea, if I understand it correctly, is that those are the ones we ship, so we want a more compact binary. I don't think that was doing anything useful. The compression is just gzip, so when we ship a .tar.xz, having the debug info compressed inside the scylla binary probably reduces the overall compression a bit. When building a rpm the situation in amusing. As part of the rpm build process the debug info is decompressed and extracted to an external file. Given that most of the link time goes to compressing debug info, it is probably a good idea to just skip that. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20191123022825.102837-1-espindola@scylladb.com>	2019-11-24 11:35:29 +02:00
Tomasz Grabiec	d84859475e	Merge "Refactor test.py and cleanup resources" from Kostja Structure the code to be able to introduce futures. Apply trivial cleanups. Switch to asyncio and use it to work with processes and handle signals. Cleanup all processes upon signal.	2019-11-24 11:35:29 +02:00

1 2 3 4 5 ...

20297 Commits