scylladb

Author	SHA1	Message	Date
Benny Halevy	3fab0f8694	storage_proxy: convert to shared_token_metadata get() the latest token_metadata_ptr from the shared_token_metadata before each use. expose get_token_metadata_ptr() rather than get_token_metadata() so that caller can keep it across continuations. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-11-11 14:20:23 +02:00
Pavel Solodovnikov	5ff5df1afd	storage_proxy: un-hardcode force sync flag for `mutate_locally(mutation)` overload Corresponding overload of `storage_proxy::mutate_locally` was hardcoded to pass `db::commitlog::force_sync::no` to the `database::apply`. Unhardcode it and substitute `force_sync::no` to all existing call sites (as it were before). `force_sync::yes` will be used later for paxos learn writes when trying to apply mutations upgraded from an obsolete schema version (similar to the current case when applying locally a `frozen_mutation` stored in accepted proposal). Tests: unit(dev) Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Message-Id: <20200716124915.464789-1-pa.solodovnikov@scylladb.com>	2020-07-16 16:38:48 +03:00
Piotr Sarna	92aadb94e5	treewide: propagate trace state to write path In order to add tracing to places where it can be useful, e.g. materialized view updates and hinted handoff, tracing state is propagated to all applicable call sites.	2020-05-18 16:05:23 +02:00
Rafael Ávila de Espíndola	c5795e8199	everywhere: Replace engine().cpu_id() with this_shard_id() This is a bit simpler and might allow removing a few includes of reactor.hh. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200326194656.74041-1-espindola@scylladb.com>	2020-03-27 11:40:03 +03:00
Pavel Emelyanov	7cdfd94207	batchlog: Use token_metadata from proxy This kills the second global reference on storage_service from batchlog code and breaks the dependency loop between these two. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-10 20:54:32 +03:00
Pavel Emelyanov	b4e66ddf1d	batchlog: Use in-config ring-delay This kills the first (out of two) global reference on storage_service Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-10 20:54:32 +03:00
Pavel Emelyanov	d361894b9d	batchlog_manager: Speed up token_metadata endpoints counting a bit In this place we only need to know the number of endpoints, while current code additionally shuffles them before counting. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-12-23 14:22:45 +02:00
Botond Dénes	fddd9a88dd	treewide: silence discarded future warnings for legit discards This patch silences those future discard warnings where it is clear that discarding the future was actually the intent of the original author, and they did the necessary precautions (handling errors). The patch also adds some trivial error handling (logging the error) in some places, which were lacking this, but otherwise look ok. No functional changes.	2019-08-26 18:54:44 +03:00
Gleb Natapov	6a4207f202	Pass service permit to storage_proxy Current cql transport code acquires a permit before processing a query and releases it when the query gets a reply, but some queries leave work behind. If the work is allowed to accumulate without any limit a server may eventually run out of memory. To prevent that the permit system should account for the background work as well. The patch is a first step in this direction. It passes a permit down to storage proxy where it will be later held by background work.	2019-08-12 10:20:43 +03:00
Gleb Natapov	95c6d19f6c	batchlog_manager: fix array out of bound access endpoint_filter() function assumes that each bucket of std::unordered_multimap contains elements with the same key only, so its size can be used to know how many elements with a particular key are there. But this is not the case, elements with multiple keys may share a bucket. Fix it by counting keys in other way. Fixes #3229 Message-Id: <20190501133127.GE21208@scylladb.com>	2019-05-01 17:30:11 +03:00
Asias He	af579a055b	gossip: Get rid of the gms::get_local_failure_detector static object Store the failure_detector object inside gossiper object. - No more the global object sharded<failure_detector> - No need to initialize sharded<failure_detector> manually which simplifies the code in tests/cql_test_env.cc and init.cc.	2019-03-22 09:08:51 +08:00
Duarte Nunes	fa2b0384d2	Replace std::experimental types with C++17 std version. Replace stdx::optional and stdx::string_view with the C++ std counterparts. Some instances of boost::variant were also replaced with std::variant, namely those that called seastar::visit. Scylla now requires GCC 8 to compile. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20190108111141.5369-1-duarte@scylladb.com>	2019-01-08 13:16:36 +02:00
Avi Kivity	30745eeb72	query_processor: replace sharded<database> with the local shard query_processor uses storage_proxy to access data, and the local database object to access replicated metadata. While it seems strange that the database object is not used to access data, it is logical when you consider that a sharded<database> only contains this node's data, not the cluster data. Take advantage of this to replace sharded<database> with a single database shard.	2018-12-29 11:02:15 +02:00
Avi Kivity	89be47e291	batchlog_manager: remove dependency on db::config Extract configuration into a new struct batchlog_manager_config and have the callers populate it using db::config. This reduces dependencies on global objects.	2018-12-09 20:11:38 +02:00
Avi Kivity	d77e044cde	db: convert sprint() to format() sprint() recently became more strict, throwing on sprint("%s", 5). Replace with the more modern format(). Mechanically converted with https://github.com/avikivity/unsprint.	2018-11-01 13:16:17 +00:00
Tomasz Grabiec	cd201d1987	db/batchlog_manager: Do not return a value from timer callback Timer callbacks are std::function<void()>. Exposed by changing callback_t to noncopyable_function<>. Message-Id: <1536138045-29209-1-git-send-email-tgrabiec@scylladb.com>	2018-09-05 12:32:21 +03:00
Nadav Har'El	25bd139508	cross-tree: clean up use of std::random_device() std::random_device() uses the relatively slow /dev/urandom, and we rarely if ever intend to use it directly - we normally want to use it to seed a faster random_engine (a pseudo-random number generator). In many places in the code, we first created a random_device variable, and then using it created a random_engine variable. However, this practice created the risk of a programmer accidentally using the random_device object, instead of the random_engine object, because both have the same API; this hurts performance. This risk materialized in just two places in the code, utils/uuid.cc and gms/gossiper.cc. A patch for uuid.cc was sent previously by Pawel and is not included in this patch, and the fix for gossiper.{cc,hh} is included here. To avoid risking the same mistake in the future, this patch switches across the code to an idiom where the random_device object is not named, so cannot be accidentally used. We use the following idiom: std::default_random_engine _engine{std::random_device{}()}; Here std::random_device{}() creates the random device (/dev/urandom) and pulls a random integer from it. It then uses this seed to create the random_engine (the pseudo-random number generator). The std::random_device{} object is temporary and unnamed, and cannot be unintentionally used directly. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180726154958.4405-1-nyh@scylladb.com>	2018-07-26 16:54:58 +01:00
Avi Kivity	512baf536f	storage_proxy: implement write timeouts Require a timeout parameter for storage_proxy::mutate_begin() and all its callers (all the way to thrift and cql modification_statement and batch_statement). This should fix spurious debug-mode test failures, where overcommit and general debug slowness result in the default timeouts being exceeded. Since the tests use infinite timeouts, they should not time out any more. Tests: unit (release), with an extra patch that aborts when a non-infinite timeout is detected. Message-Id: <20180707204424.17116-1-avi@scylladb.com>	2018-07-08 10:27:03 +01:00
Avi Kivity	7c01e66d53	cql3: query_processor: store and use just local shard reference of storage_proxy Since storage_proxy provides access to the entire cluster, a local shard reference is sufficient. Adjust query_processor to store a reference to just the local shard, rather than a seastar::sharded<storage_proxy> and adjust callers. This simplifies the code a little. Message-Id: <20180415142656.25370-3-avi@scylladb.com>	2018-04-16 10:20:50 +02:00
José Guilherme Vanz	380bc0aa0d	Swap arguments order of mutation constructor Swap arguments in the mutation constructor keeping the same standard from the constructor variants. Refs #3084 Signed-off-by: José Guilherme Vanz <guilherme.sft@gmail.com> Message-Id: <20180120000154.3823-1-guilherme.sft@gmail.com>	2018-01-21 12:58:42 +02:00
Avi Kivity	e44517851e	untyped_result_set: reduce dependencies Forward-declare untyped_result_set and untyped_result_set_row, and remove the include from query_processor.hh. Message-Id: <20170916170859.27612-3-avi@scylladb.com>	2017-09-18 15:15:15 +02:00
Avi Kivity	ebaeefa02b	Merge seastar upstream (seastar namespace) - introduced "seastarx.hh" header, which does a "using namespace seastar"; - 'net' namespace conflicts with seastar::net, renamed to 'netw'. - 'transport' namespace conflicts with seastar::transport, renamed to cql_transport. - "logger" global variables now conflict with logger global type, renamed to xlogger. - other minor changes	2017-05-21 12:26:15 +03:00
Duarte Nunes	9e88b60ef5	mutation: Set cell using clustering_key_prefix Change the clustering key argument in mutation::set_cell from exploded_clustering_prefix to clustering_key_prefix, which allows for some overall code simplification and fewer copies. This mostly affects the cql3 layer. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-05-04 15:59:50 +02:00
Vlad Zolotarov	a9f6e5f8da	db::batchlog_manager: move collectd registration to the metrics registration layer Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2017-01-10 16:24:54 -05:00
Avi Kivity	c94fb1bf12	build: reduce inclusions of messaging_service.hh Remove inclusions from header files (primary offender is fb_utilities.hh) and introduce new messaging_service_fwd.hh to reduce rebuilds when the messaging service changes. Message-Id: <1475584615-22836-1-git-send-email-avi@scylladb.com>	2016-10-05 11:46:49 +03:00
Vlad Zolotarov	b36b69c1d6	service::storage_proxy: remove a default value for a tracing::trace_state_ptr parameter Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-07-19 18:21:59 +03:00
Calle Wilund	88ffe60138	batchlog_manager: Change replay mutation CL to ALL Try to emulate the origin behaviour for batch replay. They use an explicit write handler, combining 1.) Hinting to all known dead endpoints 2.) Sending to all presumed live, requiring ack from all 3.) Hinting to endpoint to which send failed. We don't have hints, so try to work around by doing send with cl=ALL, and if send fails (wholly or partially), retain the batch in the log. This is still a slight behavioural difference, and we also risk filling up the batch log in extreme cases. (Though probably not in any real environment). Refs #1222 Message-Id: <1466444170-23797-1-git-send-email-calle@scylladb.com>	2016-06-21 09:41:09 +03:00
Vlad Zolotarov	4ef5b11e9b	batchlog_manager: add a counter for a total number of write attempts Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-04-21 11:29:21 +03:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Asias He	5550aeba1d	batchlog_manager: Avoid stopping batchlog_manager more than once We can stop batchlog_manager in decommission and drain. Avoid stopping it more than once. Fix the following error: $ nodetool decommission $ nodetool drain storage_service - DECOMMISSIONING: stop_gossiping done storage_service - messaging_service stopped storage_service - DECOMMISSIONING: stop messaging_service done storage_service - DECOMMISSIONING: set_bootstrap_state done storage_service - DECOMMISSIONED: storage_service - DECOMMISSIONING: done storage_service - DRAINING: starting drain process gossip - gossip is already stopped scylla: ./seastar/core/gate.hh:93: future<> seastar::gate::close(): Assertion `!_stopped && "seastar::gate::close() cannot be called more than once"' failed.	2016-03-30 20:54:30 +08:00
Asias He	cdb43c5586	batchlog_manager: Allow user initiated batchlog replay operation During decommission, the storage_service::unbootstrap() needs to initiate a batchlog replay operation. To sync the replay operation initiated by the timer in batchlog_manager and storage_service, a semaphore is introduced. To simplify the semaphore locking, the management code now always runs on shard zero, but the real work is distributed to all shards.	2016-03-30 20:54:30 +08:00
Tomasz Grabiec	697d9bfa56	serializer: Introduce as_input_stream(bytes_view)	2016-02-26 12:26:13 +01:00
Paweł Dziepak	1b52264dfd	batchlog_manager: use new canonical_mutation serializers Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-02-19 23:12:00 +00:00
Gleb Natapov	c509e48674	Parallelize batchlog replay Current code is serialized by get_truncated_at(). Use map_reduce to make it run in parallel. Message-Id: <1454421603-13080-4-git-send-email-gleb@scylladb.com>	2016-02-02 17:08:54 +01:00
Gleb Natapov	42e3999a00	Check batchlog version before replaying In case batchlog serialization format changes check it before trying to interpret raw data. Message-Id: <1454421603-13080-3-git-send-email-gleb@scylladb.com>	2016-02-02 17:08:54 +01:00
Tomasz Grabiec	ec12b75426	batchlog_manager: Store canonical_mutations We need to be able to replay mutations created using older versions of the table's schema. frozen_mutation can be only read using the version it was serialized with, and there is no guarantee that the node will know this version at the time of replay. Currently versions are kept in-memory so a node forgets all past versions when it restarts. To solve this, let's store canonical_mutations which, like data in sstables, can be read using any later schema version of given table.	2016-01-19 13:46:28 +01:00
Tomasz Grabiec	e21049328f	batchlog_manager: Add more debug logging	2016-01-19 13:46:28 +01:00
Tomasz Grabiec	a9c00cbc11	batchlog_manager: Use requested schema version	2016-01-11 10:34:52 +01:00
Asias He	f57ba6902b	storage_service: Introduce ring_delay_ms option It is hard-coded as 30 seconds at the moment. Usage: $ scylla --ring-delay-ms 5000 Time a node waits to hear from other nodes before joining the ring in milliseconds. Same as -Dcassandra.ring_delay_ms in cassandra.	2015-12-25 15:08:22 +08:00
Tomasz Grabiec	179b587d62	Abstract timestamp creation behind new_timestamp() Replace db_clock::now_in_usec() and db_clock::now() * 1000 accesses where the intent is to create a new auto-generate cell timestamp with a call to new_timestamp(). Now the knowledge of how to create timestamps is in a single place.	2015-12-15 15:16:04 +02:00
Avi Kivity	47499dcf18	data_value: make conversion from bytes explicit Since bytes is a very generic value that is returned from many calls, it is easy to pass it by mistake to a function expecting a data_value, and to get a wrong result. It is impossible for the data_value constructor to know if the argument is a genuine bytes variable, a data_value of another type, but serialized, or some other serialized data type. To prevent misuse, make the data_value(bytes) constructor (and the complementary data_value(optional<bytes>)) explicit.	2015-11-13 17:12:29 +02:00
Calle Wilund	42c086a5cd	batchlog_manager: Fixup includes + exception handling * Fix exception handling in batch loop (report + still re-arm) * Cleanup seastar include reference style	2015-10-07 17:06:34 +03:00
Calle Wilund	a4c14d3d1d	batchlog_manager: Add hint of which cpu timer callback is running on	2015-10-07 14:57:55 +02:00
Calle Wilund	b46496da34	batchlog_manager: Rename logger * More useful/referrable on command line (--log) Matches class name (though not origin)	2015-10-07 14:30:09 +02:00
Calle Wilund	6f94a3bdad	batchlog_manager: Use gate instead of semaphore Since that exists now.	2015-10-07 14:30:09 +02:00
Calle Wilund	874da0eb67	batchlog_manager: Run timer loop on only one shard Since replay is a "node global" operation, we should not attempt to do it in parallel on each shard. It will just overlap/interfere. Could just run this on cpu 0, but since this _could_ be a lengthy operation, each timer callback is round-robined across shards just in case...	2015-10-07 14:30:09 +02:00
Calle Wilund	b3c95ce42d	system_keyspace: Change truncation record method to use context qp Align with rest of file (for better or worse). This allows calls from entity without query_processor handy (i.e. storage_proxy). Added "minimal" setup method for the "global" state, to facilitate tests. Doing a full setup either in cql_test_env or after it is created breaks badly. (Not sure why). So quick workaround. Updated the current two users (batchlog_manager and commitlog_replayer) callsites to conform.	2015-09-30 09:09:41 +02:00
Pekka Enberg	f43f0d6f04	keys: Add compound_wrapper::from_singular() Clean up code by adding a from_singular() helper function to compound wrapper and use it in.	2015-09-28 16:29:44 +02:00
Avi Kivity	d5cf0fb2b1	Add license notices	2015-09-20 10:43:39 +03:00
Paweł Dziepak	830a86258b	db: avoid copying mutations Signed-off-by: Paweł Dziepak <pdziepak@cloudius-systems.com>	2015-09-03 10:30:32 +02:00

56 Commits