scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-01 13:45:53 +00:00

Author	SHA1	Message	Date
Nadav Har'El	4e2bf28b84	alternator-test: make Alternator tests runnable from test.py To make the tests in alternator-test runnable by test.py, we need to move the directory alternator-test/ to test/alternator, because test.py only looks for tests in subdirectories of test/. Then, we need to create a test/alternator/suite.yaml saying that this test directory is of type "Run", i.e., it has a single run script "run" which runs all its tests. The "run" script had to be slightly modified to be aware of its new location relative to the source directory. To run the Alternator tests from test.py, do: ./test.py --mode dev alternator Note that in this version, the "--mode" has no effect - test/alternator/run always runs the latest compiled Scylla, regardless of the chosen mode. The Alternator tests can still be run manually and individually against a running Scylla or DynamoDB as before - just go to the test/alternator directory (instead of alternator-test previously) and run "pytest" with the desired parameters. Fixes #6046 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-04-12 16:27:45 +03:00
Nadav Har'El	0cccb5a630	test.py: add xunit XML output file for "Run" tests Assumes that "Run" tests can take the --junit-xml=<path> option, and pass it to ask the test to generate an XML summary of the run to a file like testlog/dev/xml/run.1.xunit.xml. This option is honored by the Alternator tests. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-04-12 16:26:50 +03:00
Nadav Har'El	0ae3136900	test.py: add new test type "Run" This patch adds a new test type, "Run". A test subdirectory of type "Run" has a script called "run" which is expected to run all the tests in that directory. This will be used, in the next patch, by the Alternator functional tests. These tests indeed have a "run" script, which runs Scylla and then runs all of Alternator's tests, finishing fairly quickly (in less than a minute). All of that will become one test.py test. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-04-12 16:26:50 +03:00
Nadav Har'El	36e44972f1	test.py: flag for aborting tests with SIGTERM, not SIGKILL Today, if test.py is interrupted with SIGINT or SIGTERM, the ongoing test is killed with SIGKILL. Some types of tests - such as Alternator's test - may depend on being killed politely (e.g., with SIGTERM) to clean up files. We cannot yet change the signal to SIGTERM for all tests, because Seastar tests often don't deal well with signals, but we can at least add a flag that certain test types - that know they can be killed gently - will use. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-04-12 16:26:50 +03:00
Nadav Har'El	24fcc0c0ff	alternator-test: change "run" script to pick random IP address Before this patch, the Alternator tests "run" script ran Scylla on a fixed listening address, 127.0.0.1. There is a problem that there might be other concurrent runs of Scylla using the same IP address - e.g., CCM (used by dtest) uses exactly this IP address for its first node. Luckily, Linux's loopback device actually allows us to pick any of over a million addresses in 127.0.0.0/8 to listen on - we don't need to use 127.0.0.1 specifically. So the code in this patch picks an address in 127.1.., so it cannot collide with CCM (which uses 127.0.0.* for up to 255 nodes). Moreover, the last two bytes of the listen address are picked based on the process ID of the run script; This allows multiple copies of this script to run concurrently - in case anybody wishes to do that. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-04-12 16:26:31 +03:00
Nadav Har'El	1aec4baa51	alternator-test: add "--url" option to choose Alternator's URL The "--aws" and "--local" test options chooses between two useful default URLs - Amazon's, or http://localhost:8000 for a local installation. However, sometimes one wants to run Scylla on a different IP address or port, so in this patch we add a "--url" option to choose a specific URL to connect to. For example, "--url http://127.1.2.3:1234". We will later use this option in the alternator-test/run script, to pick a random IP address on which to run Scylla, and then run the test against this address. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-04-12 16:25:04 +03:00
Piotr Sarna	ea827d42b9	test: move config to heap in config_test ... in order to get rid of a large stack warning. Tests: unit(dev) Message-Id: <010517a6029a70de069d5952cc853f5724280eea.1586422630.git.sarna@scylladb.com>	2020-04-09 11:22:49 +02:00
Ivan Prisyazhnyy	1c444b7e1e	api: support table auto compaction control This patch adds API endpoint /column_family/autocompaction/{name} that listen to GET and POST requests to pick and control table background compactions. To implement that the patch introduces "_compaction_disabled_by_user" flag that affects if CompactionManager is allowed to push background compactions jobs into the work. It introduces table::enable_auto_compaction(); table::disable_auto_compaction(); bool table::is_auto_compaction_disabled_by_user() const to control auto compaction state. Fixes #1488 Fixes #1808 Fixes #440 Tests: unit(sstable_datafile_test autocompaction_control_test), manual	2020-04-08 21:18:38 +03:00
Botond Dénes	aa9a582f4a	cql3: functions/castas_fcts: allow self-casting any type Casting a type to itself doesn't make sense, but it is harmless so allow it instead of reporting a confusing error message that makes even less sense: InvalidRequest: Error from server: code=2200 [Invalid query] message="org.apache.cassandra.db.marshal.BooleanType cannot be cast to org.apache.cassandra.db.marshal.BooleanType" Note that some types already supported self-casting, this patch just extends this to all types in a forward compatible way. Fixes: #5102 Tests: unit(dev), manual test casting boolean to boolean. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200408135041.854981-1-bdenes@scylladb.com>	2020-04-08 18:52:36 +03:00
Piotr Sarna	123edfc10c	alternator: fix failure on incorrect table name with no indexes If a table name is not found, it may still exist as a local index, but the check tried to fetch a local index name regardless if it was present in the request, which was a nullptr dereference bug. Fixes #6161 Tests: alternator-test(local, remote) Message-Id: <428c21e94f6c9e450b1766943677613bd46cbc68.1586347130.git.sarna@scylladb.com>	2020-04-08 15:33:48 +03:00
Botond Dénes	196dd5fa9b	treewide: throw std::bad_function_call with backtraces We typically use `std::bad_function_call` to throw from mandatory-to-implement virtual functions, that cannot have a meaningful implementation in the derived class. The problem with `std::bad_function_call` is that it carries absolutely no information w.r.t. where was it thrown from. I originally wanted to replace `std::bad_function_call` in our codebase with a custom exception type that would allow passing in the name of the function it is thrown from to be included in the exception message. However after I ended up also including a backtrace, Benny Halevy pointed out that I might as well just throw `std:bad_function_call` with a backtrace instead. So this is what this patch does. All users are various unimplemented methods of the `flat_mutation_reader::impl` interface. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200408075801.701416-1-bdenes@scylladb.com>	2020-04-08 13:54:06 +02:00
Avi Kivity	a490cb669b	Update seastar submodule * seastar fd9af3a26...cce2ddac1 (6): > rpc: fix build failures in C++14 mode due to std::string_view > util/backtrace: introduce make_backtraced_exception_ptr() > future: make do_for_each noexcept > fair_queue rename the fair_queue_descriptor and change its default init > future: do_with: make noexcept > io_queue: batch communication with the fair_queue for ready requests	2020-04-08 13:54:06 +02:00
Botond Dénes	f0530c7d41	configure.py: add {mode}-test, {mode}-check, test and check targets The test target builds all tests and runs them. The check target compiles all the headers in addition to this. The {mode} variants do these just for the respective mode. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200407132641.598412-1-bdenes@scylladb.com> Reviewed-by: Pekka Enberg <penberg@scylladb.com>	2020-04-08 13:54:06 +02:00
Calle Wilund	65a6ebbd73	cdc: Postimage must check iff we have (pre-)image row data for non-touched columns Fixes #6143 When doing post-image generation, we also write values for columns not in delta (actual update), based on data selected in pre-image row. However, if we are doing initial update/insert with only a subset of columns, when the pre-image result set is nil, this cannot be done. Adds check to non-touched column post-image code. Also uses the pre-image value extractor to handle non-atomic sets properly. Tests updated.	2020-04-08 13:48:54 +02:00
Tomasz Grabiec	55240e9db2	Merge "Fix open-ended tombstone issues in alternator" from Piotr Sarna This miniseries provides workarounds for open-ended range tombstones reportedly appearing in alternator tables. The issue was that row tombstones created for tables without clustering keys look like open-ended range tombstones, which confuses the LA/KA format writer. Tests: alternator-test(local) Fixes #6035 Refs #6157	2020-04-08 13:43:40 +02:00
Pavel Solodovnikov	3206c1bf66	paxos_state: introduce error injections for testing timeouts in paxos stages The following sleep injections are added to paxos_state: * paxos_state_prepare_timeout (timeouts in paxos_state::prepare) * paxos_state_accept_timeout (timeouts in paxos_state::accept) * paxos_state_learn_timeout (timeouts in paxos_state::learn) Tests: unit ({dev}), unit ({debug}) Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com> Message-Id: <20200403092107.181057-1-alejo.sanchez@scylladb.com>	2020-04-08 10:47:15 +02:00
Piotr Sarna	a4da07f8b3	alternator-test: mark identical gsi test as skipped Creating an index on a table with only the partition key can lead to open-ended range tombstones appearing, if the indexed column is also the very same partition key - which is quite a useless case, but it's allowed both by alternator and DynamoDB. In order to make the tests pass when KA/LA sstables are used, this test case is hereby skipped until further notice. Refs #6157	2020-04-08 08:11:39 +02:00
Piotr Sarna	0a2d7addc0	alternator: use partition tombstone if there's no clustering key As @tgrabiec helpfully pointed out, creating a row tombstone for a table which does not have a clustering key in its schema creates something that looks like an open-ended range tombstone. That's problematic for KA/LA sstable formats, which are incapable of writing such tombstones, so a workaround is provided in order to allow using KA/LA in alternator. Fixes #6035	2020-04-08 08:08:45 +02:00
Glauber Costa	54a0928a85	systemd: disable start timeout I am about to change resharding to block the start of the node. Being a somewhat slow operation, the timeout of 900 sec is guaranteed to trigger in large nodes with lots of data. This patch effectively disables the start timeout, while keeping the stop timeout unchanged. My preference would have been to use a timeout extension mechanism during resharding. Systemd actually has such mechanism, where we can send a message through sd_notify asking the timeout to be extended. However such mechanism is not present in SystemD v219, used by RHEL7. That means for RHEL7 we need a different way to deal with the timeout anyway. The second preference is also obviously to write "infinity" as the timeout value. But guess what? SystemD v219 also has a bug in which infinity is interepreted as zero (https://bugzilla.redhat.com/show_bug.cgi?id=1446015) Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20200407155754.10020-1-glauber@scylladb.com>	2020-04-08 08:14:35 +03:00
Botond Dénes	16246d1c99	frozen_schema: make freezing constructor explicit Freezing is an expensive operation, that involves serializing the entire mutation. Having an implicit freezing constructor means this can happen as part of an implicit type conversion without the programmer even noticing, even when this is not really necessary. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200407080245.234021-1-bdenes@scylladb.com>	2020-04-07 12:00:36 +03:00
Benny Halevy	89b3974e56	sstables: print invalid boundary type as unsigned int Otherwise it prints a binary value to the log and corrupting it. Seen when testing scrub with randomly-corrupted sstable using scrub_with_one_node_expect_data_loss_test as of https://github.com/scylladb/scylla-dtest/pull/1414 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20200407055617.1045977-1-bhalevy@scylladb.com>	2020-04-07 10:18:19 +02:00
Benny Halevy	a20c85713b	storage_proxy: paxos_response_handler::prune: fixup indentation Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20200405115046.733450-2-bhalevy@scylladb.com>	2020-04-07 08:47:38 +03:00
Benny Halevy	4e37aee8a2	storage_proxy: paxos_response_handler::prune: no need for futurize_apply parallel_for_each already futurize_invoke's the lambda passed to it since seastar commit c5e158e5f173e25a62308997a3da4348053b2a0f Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20200405115046.733450-1-bhalevy@scylladb.com>	2020-04-07 08:47:38 +03:00
Raphael S. Carvalho	044f80b1b5	cql3: don't reset default TTL when not explicitly specified in alter table statement Any alter table statement that doesn't explicitly set the default time to live will reset it to 0. That can be very dangerous for time series use cases, which rely on all data being eventually expired, and a default TTL of 0 means data never being expired. Fixes #5048. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20200402211653.25603-1-raphaelsc@scylladb.com>	2020-04-07 08:47:38 +03:00
Avi Kivity	0bc90756db	tools: toolchain: add note explaining how to use podman to build images podman is compatible with docker, but by default emits a manifest format that is not understood by old docker clients, so give it an extra flag to generate the old format instead. Message-Id: <20200406134526.21521-1-avi@scylladb.com>	2020-04-07 08:47:38 +03:00
Glauber Costa	80f414ed6e	sstables: restore ident Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20200401162722.28780-3-glauber@scylladb.com>	2020-04-06 16:02:31 +03:00
Glauber Costa	463d0ab37c	compaction: move rewrite_sstables to the compaction_manager There is no reason why the table code has to be aware of the efforts of rewriting (cleanup, scrub, upgrade) an SSTable versus compacting it. Rewrite is special, because we need to do it one SSTable at a time, without lumping it together. However, the compaction manager is totally capable of doing that itself. If we do that, the special "table::rewrite_sstables" can be killed. This code would maybe be better off as a thread, where we wouldn't need to keep state. However there are some methods like maybe_stop_on_error() that expect a future so I am leaving this be for now. This is a cleanup that can be done later. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20200401162722.28780-2-glauber@scylladb.com>	2020-04-06 16:02:30 +03:00
Nadav Har'El	ac43a9e2aa	merge: Fix generating base keys from empty indexing paging state Merged pull request https://github.com/scylladb/scylla/pull/6136 from Piotr Sarna: An empty partition/clustering key pair is a valid state of the query paging state. Unfortunately, recent attempts at debugging a flaky test (#5856) resulted in introducing an assertion (`7616290`) which breaks when trying to generate a key from such a pair. In order to keep the assertion (since it still makes sense in its scope), but at the same time translate empty keys properly, empty keys are now explicitly processed at the beginning of the function. This behaviour was 100% reproducible in a secondary index dtest below. Fixes #6134 Refs #5856 Tests: unit(dev), dtest(TestSecondaryIndexes.test_truncate_base)	2020-04-06 15:23:39 +03:00
Takuya ASADA	3ce6cdc6d8	install.sh: suppoprt --upgrade To use install.sh as Scylla install script w/o using .rpm/.deb package, we need to provide a way to upgrade Scylla version, not just install. With --upgrade option, install.sh does not overwrite config files. It will install <filename>.new file on same directory, when old config file and new config file does not contain same data. If old one and new one is exactly same, it will nothing. To implement this, rewriting api_ui_dir/api_doc_dir path on scylla.yaml moved from .rpm/.deb scriptlet to install.sh. Fixes #5874	2020-04-06 15:07:28 +03:00
Takuya ASADA	5f18964763	dist/common/scripts/scylla_coredump_setup: bind-mount coredump directory, add coredump test On some environment systemd-coredump does not work with symlink directory, we can use bind-mount instead. Also, it's better to check systemd-coredump is working by generating coredump. To fix #5916, drop scylla_coredump_setup from .rpm %post scriptlet. Fixes #5753 Fixes #5916	2020-04-06 15:03:11 +03:00
Avi Kivity	e9e2b75a76	Merge "Allow Major compactions for TWCS" from Glauber " This patch makes makes major compaction aware of time buckets for TWCS. That means that calling a major compaction with TWCS will not bundle all SSTables together, but rather split them based on their timestamps. There are two motivations for this work: Telling users not to ever major compact is easier said than done: in practice due to a variety of circumstances it might end up being done in which case data will have a hard time expiring later. We are about to start working with offstrategy compactions, which are compactions that work in parallel with the main compactions. In those cases we may be converting SSTables from one format to another and it might be necessary to split a single big STCS SSTable into something that TWCS expects In order to achieve that, we start by changing the way resharding works: it will now work with a read interposer, similar to the one TWCS uses for streaming data. Once we do that, a lot of assumptions that exist in the compaction code can be simplified and supporting TWCS major compactions become a matter of simply enabling its interposer in the compaction code as well. There are many further simplifications that this work exposes: The compaction method create_new_sstable seems out of place. It is not used by resharding, and it seems duplicated for normal compactions. We could clean it up with more refactoring in a later patch. The whole logic of the feed_writer could be part of the consumer code. Testing details: scylla unit tests (dev, release) sstable_datafile_test (debug) dtests (resharding_test.py) manual scylla resharding Fixes #1431 " Reviewed-by: Raphael S. Carvalho <raphaelsc@scylladb.com> * 'twcs-major-v3' of github.com:glommer/scylla: compaction: make major compaction time-aware with TWCS compaction: do resharding through an interposer mutation_writer: introduce shard_based splitting writer mutation_writer: factor out part of the code for the timestamp splitter compaction: abort if create_new_sstable is called from resharding	2020-04-06 12:54:08 +03:00
Gleb Natapov	e5f7ccc4c8	lwt: fix possible leak of "prune" counter If get_schema_for_read() fails "prune" counter will not be decremented. The patch fixes it by creating RAI object earlier. Also return releasing of a mutation in release_mutation() which was dropped by mistake. Fixes #6124 Message-Id: <20200405080233.GA22509@scylladb.com>	2020-04-06 11:30:38 +02:00
Nadav Har'El	d9d50362af	alternator: remove mentions of experimental status of LWT Since commit `9948f548a5`, the LWT no longer requires an "experimental" flag, so Alternator documents and scripts which referred to the need for enabling experimental LWT, are fixed here to no longer do that. Fixes #6118. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200405143237.12693-1-nyh@scylladb.com>	2020-04-06 12:12:08 +03:00
Piotr Sarna	8fea5075f2	test: fix manual gossip test When trying to get rid of a large stack warning for gossip test, I found out that it actually does not run at all for multiple reasons: 1. It segfaults due to wrong initialization order 2. After fixing that, it segfaults on use-after-free (due to capturing a shared pointer by reference instead of by copy) 3. After that, cleanups are in order: * seastar thread does not need to be spawned inside another thread; * default captures are harmful, so they're made explicit instead; * db::config is moved to heap, to finally get rid of the warning. Tests: manual(gossip) Message-Id: <feaca415d0d29a16c541f9987645365310663630.1585128338.git.sarna@scylladb.com>	2020-04-06 11:07:10 +02:00
Piotr Sarna	88913e9d44	test: add cases for empty paging state for index queries In order to check regressions related to #6136 and similar issues, test cases for handling paging state with empty partition/clustering key pair are added.	2020-04-06 08:59:40 +02:00
Piotr Sarna	45751ee24f	cql3: fix generating base keys from empty index paging state An empty partition/clustering key pair is a valid state of the query paging state. Unfortunately, recent attempts at debugging a flaky test resulted in introducing an assertion which breaks when trying to generate a key from such a pair. In order to keep the assertion (since it still makes sense in its scope), but at the same time translate empty keys properly, empty keys are now explicitly processed at the beginning of the function. This behaviour was 100% reproducible in a secondary index dtest below. Fixes #6134 Refs #5856 Tests: unit(dev), dtest(TestSecondaryIndexes.test_truncate_base)	2020-04-06 07:49:06 +02:00
Avi Kivity	4e6f543676	tools: toolchain: use "docker build --pull" in instructions for building an image Specify --pull in order to refresh the base image (some Fedora release). Usually this is not important, because we run `dnf update`. But if the cached image happens to be a pre-release version of Fedora, the image will have the update-testing repository enabled, and we may get some unwanted updates. It's sad that we need two separate flags for correctness (the other is --no-cache. Message-Id: <20200405164227.8210-1-avi@scylladb.com>	2020-04-05 19:48:25 +03:00
Piotr Sarna	0bb211a65f	alternator: defuse a serialization path time bomb The default serialization path for items was subtly broken - instead of parsing JSON string representation of objects, it tried to parse a regular string implementation - which is often also a valid JSON, but nothing guarantees that it actually is. Tests: alternator-test(local) Message-Id: <e1668bf4e9029f2675a4ac28bb4598714575efeb.1586096732.git.sarna@scylladb.com>	2020-04-05 18:55:54 +03:00
Nadav Har'El	c1a7a071ea	merge: Remove most inclusions of reactor.hh Merged patch series from Avi Kivity: This patchset removes most inclusions of reactor.hh, by switching to new namespace-scoped API:s instead of those using engine() as a way to get the reactor. With this, we are down to 12 translation units depending on reactor.hh, mostly for deprecated API:s like reactor::at_exit(). Avi Kivity (3): logalloc: use namespace-scope seastar::idle_cpu_handler and related rather than reactor scope test: sstable-utils: deinline do_make_keys() treewide: replace calls to engine().some_api() with some_api() configure.py \| 14 +++----- auth/common.hh \| 3 +- checked-file-impl.hh \| 4 +-- db/system_keyspace_view_types.hh \| 2 +- flat_mutation_reader.hh \| 1 + lister.hh \| 2 +- message/messaging_service.hh \| 2 +- redis/server.hh \| 2 +- sstables/compress.hh \| 2 +- sstables/integrity_checked_file_impl.hh \| 2 +- test/lib/sstable_utils.hh \| 35 ++++--------------- test/lib/test_services.hh \| 2 +- thrift/server.hh \| 2 +- transport/server.hh \| 2 +- utils/error_injection.hh \| 3 +- utils/joinpoint.hh \| 2 +- utils/loading_cache.hh \| 2 +- utils/logalloc.hh \| 6 ++-- utils/rate_limiter.hh \| 2 +- api/system.cc \| 1 + auth/default_authorizer.cc \| 2 +- auth/password_authenticator.cc \| 2 +- database.cc \| 1 + db/commitlog/commitlog.cc \| 4 +-- db/hints/resource_manager.cc \| 3 +- db/system_distributed_keyspace.cc \| 2 +- dht/i_partitioner.cc \| 2 +- gms/feature_service.cc \| 3 +- lister.cc \| 4 +-- locator/ec2_snitch.cc \| 3 +- locator/gce_snitch.cc \| 1 + main.cc \| 1 + reader_concurrency_semaphore.cc \| 2 +- redis/server.cc \| 4 +-- sstables/sstables.cc \| 11 +++--- table.cc \| 3 +- test/boost/commitlog_test.cc \| 2 +- test/boost/database_test.cc \| 2 +- test/boost/flush_queue_test.cc \| 2 +- test/boost/gossip_test.cc \| 2 +- .../gossiping_property_file_snitch_test.cc \| 1 + test/boost/loading_cache_test.cc \| 2 +- test/boost/sstable_3_x_test.cc \| 1 + test/boost/sstable_datafile_test.cc \| 1 + test/boost/sstable_test.cc \| 1 + test/lib/sstable_utils.cc \| 26 ++++++++++++++ test/manual/gossip.cc \| 2 +- test/manual/hint_test.cc \| 2 +- test/manual/sstable_scan_footprint_test.cc \| 2 +- test/perf/perf_mutation.cc \| 1 + test/perf/perf_row_cache_update.cc \| 1 + test/perf/perf_sstable.cc \| 1 + test/tools/cql_repl.cc \| 2 +- thrift/server.cc \| 2 +- transport/server.cc \| 4 +-- utils/config_file.cc \| 3 +- utils/file_lock.cc \| 2 +- utils/logalloc.cc \| 14 ++++---- utils/updateable_value.cc \| 2 +- 59 files changed, 119 insertions(+), 98 deletions(-)	2020-04-05 13:47:39 +03:00
Nadav Har'El	dcfdd917e1	merge: Guard against potential races in view builder Merge patch series from Piotr Sarna: This series adds extra precautions against potential races in view building. In particular, it was based on the following scenario: 1. View builder detects that a view V is no longer here, so it schedules removing its info from bookkeeping, without any semaphores, and this continuation gets preempted immediately. 2. A view is deleted and recreated with the same name - V. 3. View V building is finished. 4. The continuation from (1.) is finally executed, and it removes old view V info from bookkeeping - which is a problem, since view building bookkeeping is based on names, not uuids - consequently, the new view bookkeeping info is erroneously removed. The issue is solved by putting startup code (which also does cleanup from point (1.)) under the same semaphore as other bookkeeping operations. With that, it will be impossible to execute step (2.) before (1.) ends, which effectively prevents the race. Refs #6094 (possible fixes it too, but since I could not reproduce the issue...) Tests: unit(dev) Piotr Sarna (4): db,view: fix waiting for a view building future db,view: remove unneeded implicit capture-by-reference db,view: nitpick: change & operator to && for booleans db,view: guard view builder startup with a semaphore db/view/view.cc \| 11 +++++++---- 1 file changed, 7 insertions(+), 4 deletions(-)	2020-04-05 13:19:23 +03:00
Avi Kivity	88ade3110f	treewide: replace calls to engine().some_api() with some_api() This removes the need to include reactor.hh, a source of compile time bloat. In some places, the call is qualified with seastar:: in order to resolve ambiguities with a local name. Includes are adjusted to make everything compile. We end up having 14 translation units including reactor.hh, primarily for deprecated things like reactor::at_exit(). Ref #1	2020-04-05 12:46:04 +03:00
Avi Kivity	5e32ecb514	test: sstable-utils: deinline do_make_keys() This hides a call to engine_is_ready() which is only available in reactor.hh. Dependencies are adjusted so tests link. Ref #1.	2020-04-05 12:46:04 +03:00
Avi Kivity	1799cfa88a	logalloc: use namespace-scope seastar::idle_cpu_handler and related rather than reactor scope This allows us to drop a #include <reactor.hh>, reducing compile time. Several translation units that lost access to required declarations are updated with the required includes (this can be an include of reactor.hh itself, in case the translation unit that lost it got it indirectly via logalloc.hh) Ref #1.	2020-04-05 12:45:08 +03:00
Piotr Sarna	1a9083b342	db,view: guard view builder startup with a semaphore The startup routine performs some bookkeeping operations on views, and so do these events: - on_create_view; - on_drop_view; - on_update_view. Since the above events are guarded with a semaphore, the startup routine should also take the same semaphore - in order to ensure that all bookkeeping operations are serialized. Refs #6094	2020-04-05 11:41:26 +02:00
Piotr Sarna	8da4a5b78c	db,view: nitpick: change & operator to && for booleans Although it's technically correct to use the bitwise and operator on booleans as well, it's slightly confusing for the reader.	2020-04-05 11:41:25 +02:00
Piotr Sarna	e49805b7b8	db,view: remove unneeded implicit capture-by-reference The lambda does not use any other captures, so it does not to implicitly capture anything by reference.	2020-04-05 11:41:25 +02:00
Piotr Sarna	3f19865493	db,view: fix waiting for a view building future The future was marked with a `FIXME: discarded future`, but there's really no reason not to wait for it, and it was probably meant to be waited for since its implementation.	2020-04-05 11:41:25 +02:00
Piotr Sarna	76969ea619	test: move config to heap in gossip_test ... in order to get rid of a large stack warning. Tests: unit(dev) Message-Id: <da4349b89554265ec419544b63ce084eab25ac0f.1586068467.git.sarna@scylladb.com>	2020-04-05 10:18:14 +03:00
Rafael Ávila de Espíndola	c59a307f17	table_helper: Use CanInvoke instead of CanApply The CanApply predicate is deprecated. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200403225907.7910-1-espindola@scylladb.com>	2020-04-05 08:36:29 +02:00
Tomasz Grabiec	df48b5ec9d	gossip: Fix a confusing parameter name Message-Id: <1585940635-1194-1-git-send-email-tgrabiec@scylladb.com>	2020-04-05 08:24:51 +02:00

1 2 3 4 5 ...

21811 Commits