scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-31 03:56:42 +00:00

Author	SHA1	Message	Date
Konstantin Osipov	4095ab08c8	test.py: remove tests_to_run Avoid storing each test twice, use per-tests list to construct a global iterable.	2020-01-15 10:53:24 +03:00
Konstantin Osipov	169128f80b	test.py: virtualize Test.run(), to introduce CqlTest.Run next	2020-01-15 10:53:24 +03:00
Konstantin Osipov	d05f6c3cc7	test.py: virtualize test search pattern per TestSuite CQL tests have .cql extension, while unit tests have .cc.	2020-01-15 10:53:24 +03:00
Konstantin Osipov	abcc182ab3	test.py: virtualize write_xunit_report() Make sure any non-boost test can participate in the report.	2020-01-15 10:53:24 +03:00
Konstantin Osipov	18aafacfad	test.py: ensure print_summary() is agnostic of test type Introduce a virtual Test.print_summary() to print a failed test summary.	2020-01-15 10:53:24 +03:00
Konstantin Osipov	21fbe5fa81	test.py: tidy up print_summary() Now that we have tabular output, make print_summary() more concise.	2020-01-15 10:53:24 +03:00
Konstantin Osipov	c171882b51	test.py: introduce base class Test for CQL and Unit tests	2020-01-15 10:53:24 +03:00
Konstantin Osipov	fd6897d53e	test.py: move the default arguments handling to UnitTestSuite Move UnitTeset default seastar argument handling to UnitTestSuite (cleanup).	2020-01-15 10:53:24 +03:00
Konstantin Osipov	d3126f08ed	test.py: move custom unit test command line arguments to suite.yaml Load the command line arguments, if any, from suite.yaml, rather than keep them hard-coded in test.py. This is allows operations team to have easier access to these. Note I had to sacrifice dynamic smp count for mutation_reader_test (the new smp count is fixed at 3) since this is part of test configuration now.	2020-01-15 10:53:24 +03:00
Konstantin Osipov	ef6cebcbd2	test.py: move command line argument processing to UnitTestSuite	2020-01-15 10:53:24 +03:00
Konstantin Osipov	4a20617be3	test.py: introduce add_test(), which is suite-specific	2020-01-15 10:53:24 +03:00
Konstantin Osipov	7e10bebcda	test.py: move long test list to suite.yaml Use suite.yaml for long tests	2020-01-15 10:53:24 +03:00
Konstantin Osipov	32ffde91ba	test.py: move test id assignment to TestSuite Going forward finding and creating tests will be a responsibility of TestSuite, so the id generator needs to be shared.	2020-01-15 10:53:24 +03:00
Konstantin Osipov	b5b4944111	test.py: move repeat handling to TestSuite This way we can avoid iterating over all tests to handle --repeat. Besides, going forward the tests will be stored in two places: in the global list of all tests, for the runner, and per suite, for suite-based reporting, so it's easier if TestSuite if fully responsible for finding and adding tests.	2020-01-15 10:53:24 +03:00
Konstantin Osipov	34a1b49fc3	test.py: move add_test_list() to TestSuite	2020-01-15 10:53:24 +03:00
Konstantin Osipov	44e1c4267c	test.py: introduce test suites - UnitTestSuite - for test/unit tests - BoostTestSuite - a tweak on UnitTestSuite, with options to log xml test output to a dedicated file	2020-01-15 10:53:24 +03:00
Konstantin Osipov	eed3201ca6	test.py: use path, rather than test kind, for search pattern Going forward there may be multiple suites of the same kind.	2020-01-15 10:53:24 +03:00
Konstantin Osipov	f95c97667f	test.py: support arbitrary number of test suites Scan entire test/ for folders that contain suite.yaml, and load tests from these folders. Skip the rest. Each folder with a suite.yaml is expected to have a valid suite configuration in the yaml file. A suite is a folder with test of the same type. E.g. it can be a folder with unit tests, boost tests, or CQL tests. The harness will use suite.yaml to create an appropriate suite test driver, to execute tests in different formats.	2020-01-15 10:53:24 +03:00
Konstantin Osipov	c1f8169cd4	test.py: add suite.yaml to boost and unit tests The plan is to move suite-specific settings to the configuration file.	2020-01-15 10:53:24 +03:00
Konstantin Osipov	ec9ad04c8a	test.py: move 'success' to TestUnit class There will be other success attributes: program return status 0 doesn't mean the test is successful for all tests.	2020-01-15 10:53:24 +03:00
Konstantin Osipov	b4aa4d35c3	test.py: save test output in tmpdir It is handy to have it so that a reference of a failed test is available without re-running it.	2020-01-15 10:53:24 +03:00
Konstantin Osipov	f4efe03ade	test.py: always produce xml output, derive output paths from tmpdir It reduces the number of configurations to re-test when test.py is modified. and simplifies usage of test.py in build tools, since you no longer need to bother with extra arguments.	2020-01-15 10:53:24 +03:00
Konstantin Osipov	d2b546d464	test.py: output job count in the log	2020-01-15 10:53:24 +03:00
Konstantin Osipov	233f921f9d	test.py: make test output brief&tabular New format: % ./test.py --verbose --mode=release ================================================================================ [N/TOTAL] TEST MODE RESULT ------------------------------------------------------------------------------ [1/111] boost/UUID_test release [ PASS ] [2/111] boost/enum_set_test release [ PASS ] [3/111] boost/like_matcher_test release [ PASS ] [4/111] boost/observable_test release [ PASS ] [5/111] boost/allocation_strategy_test release [ PASS ] ^C % ./test.py foo ================================================================================ [N/TOTAL] TEST MODE RESULT ------------------------------------------------------------------------------ [3/3] unit/memory_footprint_test debug [ PASS ] ------------------------------------------------------------------------------	2020-01-15 10:53:24 +03:00
Konstantin Osipov	879bea20ab	test.py: add a log file Going forward I'd like to make terminal output brief&tabular, but some test details are necessary to preserve so that a failure is easy to debug. This information now goes to the log file. - open and truncate the log file on each harness start - log options of each invoked test in the log, so that a failure is easy to reproduce - log test result in the log Since tests are run concurrently, having an exact trace of concurrent execution also helps debugging flaky tests.	2020-01-15 10:53:24 +03:00
Konstantin Osipov	cbee76fb95	test.py: gitignore the default ./test.py tmpdir, ./testlog	2020-01-15 10:53:24 +03:00
Konstantin Osipov	1de69228f1	test.py: add --tmpdir It will be used for test log files.	2020-01-15 10:53:24 +03:00
Konstantin Osipov	caf742f956	test.py: flake8 style fix	2020-01-15 10:53:24 +03:00
Konstantin Osipov	dab364c87d	test.py: sort imports	2020-01-15 10:53:24 +03:00
Konstantin Osipov	7ec4b98200	test.py: make name a positional argument. Accept multiple test names, treat test name as a substring, and if the same name is given multiple times, run the test multiple times.	2020-01-15 10:53:24 +03:00
Dejan Mircevski	bb2e04cc8b	alternator: Improve comments on comparators Some comparator methods in conditions.cc use unexpected operators; explain why. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-01-14 22:25:55 +02:00
Tomasz Grabiec	c8a5a27bd9	Merge "storage_service: Move load_broadcaster away" from Pavel E. The storage_service struct is a collection of diverse things, most of them requiring only on start and on stop and/or runing on shard 0 (but is nonetheless sharded). As a part of clearing this structure and generated by it inter- -componenes dependencies, here's the sanitation of load_broadcaster.	2020-01-14 19:26:06 +01:00
Calle Wilund	313ed91ab0	cdc: Listen for migration callbacks on all shards Fixes #5582 ... but only populate log on shard 0. Migration manager callbacks are slightly assymetric. Notifications for pre-create/update mutations are sent only on initiating shard (neccesary, because we consider the mutations mutable). But "created" callbacks are sent on all shards (immutable). We must subscribe on all shards, but still do population of cdc table only once, otherwise we can either miss table creat or populate more than once. v2: - Add test case Message-Id: <20200113140524.14890-1-calle@scylladb.com>	2020-01-14 16:35:41 +01:00
Avi Kivity	2138657d3a	Update seastar submodule * seastar 36cf5c5ff0...3f3e117de3 (16): > memcached: don't use C++17-only std::optional > reactor: Comment why _backend is assigned in constructor body > log: restore --log-to-stdout for backward compatibility > used_size.hh: Include missing headers > core: Move some code from reactor.cc to future.cc > future-util: move parallel_for_each to future-util.cc > task: stop wrapping tasks with unique_ptr > Merge "Setup timer signal handler in backend constructor" from Pavel Fixes #5524 > future: avoid a branch in future's move constructor if type is trivial > utils: Expose used_size > stream: Call get_future early > future-util: Move parallel_for_each_state code to a .cc > memcached: log exceptions > stream: Delete dead code > core: Turn pollable_fd into a simple proxy over pollable_fd_state. > Merge "log to std::cerr" from Benny	2020-01-14 16:56:25 +02:00
Pavel Emelyanov	e1ed8f3f7e	storage_service: Remove _shadow_token_metadata This is the part of de-bloating storage_service. The field in question is used to temporary keep the _token_metadata value during shard-wide replication. There's no need to have it as class member, any "local" copy is enough. Also, as the size of token_metadata is huge, and invoke_on_all() copies the function for each shard, keep one local copy of metadata using do_with() and pass it into the invoke_on_all() by reference. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Reviewed-by: Asias He <asias@scylladb.com> Message-Id: <20200113171657.10246-1-xemul@scylladb.com>	2020-01-14 16:29:10 +02:00
Rafael Ávila de Espíndola	054f5761a7	types: Refactor code into a serialize_varint helper This is a bit cleaner and avoids a boost::multiprecision::cpp_int copy while serializing a decimal. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200110221422.35807-1-espindola@scylladb.com>	2020-01-14 16:28:27 +02:00
Avi Kivity	6c84dd0045	cql3: update_statement: do not set query option always_return_static_content for list read-before-write The query option always_return_static_content was added for lightweight transations in commits `e0b31dd273` (infrastructure) and `65b86d155e` (actual use). However, the flag was added unconditionally to update_parameters::options. This caused it to be set for list read-modify-write operations, not just for lightweight transactions. This is a little wasteful, and worse, it breaks compatibility as old nodes do not understand the always_return_static_content flag and complain when they see it. To fix, remove the always_return_static_content from update_parameters::options and only set it from compare-and-swap operations that are used to implement lightweight transactions. Fixes #5593. Reviewed-by: Gleb Natapov <gleb@scylladb.com> Message-Id: <20200114135133.2338238-1-avi@scylladb.com>	2020-01-14 16:15:20 +02:00
Alejo Sanchez	6909d4db42	cql3: BYPASS CACHE query counter This patch is the first part of requested full scan metrics. It implements a counter of SELECT queries with BYPASS CACHE option. In scope of #5209 Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com> Message-Id: <20200113222740.506610-2-alejo.sanchez@scylladb.com>	2020-01-14 12:19:00 +02:00
Rafael Ávila de Espíndola	dca1bc480f	everywhere: Use serialized(foo) instead of data_value(foo).serialize() This is just a simple cleanup that reduces the size of another patch I am working on and is an independent improvement. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200114051739.370127-1-espindola@scylladb.com>	2020-01-14 12:17:12 +02:00
Pavel Emelyanov	b9f28e9335	storage_service: Remove dead drain branch The drain_in_progress variable here is the future that's set by the drain() operation itself. Its promise is set when the drain() finishes. The check for this future in the beginning of drain() is pointless. No two drain()-s can run in parallels because of run_with_api_lock() protection. Doing the 2nd drain after successfull 1st one is also impossible due to the _operation_mode check. The 2nd drain after _exceptioned_ (and thus incomplete) 1st one will deadlock, after this patch will try to drain for the 2nd time, but that should by ok. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20200114094724.23876-1-xemul@scylladb.com>	2020-01-14 12:07:29 +02:00
Piotr Sarna	36ec43a262	Merge "add table with connected cql clients" from Juliusz This change introduces system.clients table, which provides information about CQL clients connected. PK is the client's IP address, CK consists of outgoing port number and client_type (which will be extended in future to thrift/alternator/redis). Table supplies also shard_id and username. Other columns, like connection_stage, driver_name, driver_version..., are currently empty but exist for C* compatibility and future use. This is an ordinary table (i.e. non-virtual) and it's updated upon accepting connections. This is also why C*'s column request_count was not introduced. In case of abrupt DB stop, the table should not persist, so it's being truncated on startup. Resolves #4820	2020-01-14 10:01:07 +02:00
Avi Kivity	1f46133273	Merge "data: make cell::make_collection() exception safe" from Botond " Most of the code in `cell` and the `imr` infrastructure it is built on is `noexcept`. This means that extra care must be taken to avoid rouge exceptions as they will bring down the node. The changes introduced by 0a453e5d3a did just that - introduced rouge `std::bad_alloc` into this code path by violating an undocumented and unvalidated assumption -- that fragment ranges passed to `cell::make_collection()` are nothrow copyable and movable. This series refactors `cell::make_collection()` such that it does not have this assumption anymore and is safe to use with any range. Note that the unit test included in this series, that was used to find all the possible exception sources will not be currently run in any of our build modes, due to `SEASTAR_ENABLE_ALLOC_FAILURE_INJECTION` not being set. I plan to address this in a followup because setting this flags fails other tests using the failure injection mechanism. This is because these tests are normally run with the failure injection disabled so failures managed to lurk in without anyone noticing. Fixes: #5575 Refs: #5341 Tests: unit(dev, debug) " * 'data-cell-make-collection-exception-safety/v2' of https://github.com/denesb/scylla: test: mutation_test: add exception safety test for large collection serialization data/cell.hh: avoid accidental copies of non-nothrow copiable ranges utils/fragment_range.hh: introduce fragment_range_view	2020-01-14 10:01:06 +02:00
Nadav Har'El	5b08ec3d2c	alternator: error on unsupported ScanIndexForward=false We do not yet support the ScanIndexForward=false option for reversing the sort order of a Query operation, as reported in issue #5153. But even before implementing this feature, it is important that we produce an error if a user attempts to use it - instead of outright ignoring this parameter and giving the user wrong results. This is what this patch does. Before this patch, the reverse-order query in the xfailing test test_query.py::test_query_reverse seems to succeed - yet gives results in the wrong order. With this patch, the query itself fails - stating that the ScanIndexForward=false argument is not supported. Refs #5153 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200105113719.26326-1-nyh@scylladb.com>	2020-01-14 10:01:06 +02:00
Pavel Emelyanov	c4bf532d37	storage_service: Fix race in removenode/force_removenode/other Here's another theoretical problem, that involves 3 sequential calls to respectively removenode, force_removenode and some other operation. Let's walk through them First goes the removenode: run_with_api_lock _operation_in_progress = "removenode" storage_service::remove_node sleep in replicating_nodes.empty() loop Now the force_removenode can run: run_with_no_api_lock storage_service::force_removenode check _operation_in_progress (not empty) _force_remove_completion = true sleep in _operation_in_progress.empty loop Now the 1st call wakes up and: if _force_remove_completion == true throw <some exception> .finally() handler in run_with_api_lock _operation_in_progress = <empty> At this point some other operation may start. Say, drain: run_with_api_lock _operation_in_progress = "drain" storage_service::drain ... go to sleep somewhere No let's go back to the 1st op that wakes up from its sleep. The code it executes is while (!ss._operation_in_progress.empty()) { sleep_abortable() } and while the drain is running it will never exit. However (! and this is the core of the race) should the drain operation happen _before_ the force_removenode, another check for _operation_in_progress would have made the latter exit with the "Operation drain is in progress, try again" message. Fix this inconsistency by making the check for current operation every wake-up from the sleep_abortable. Fixes #5591 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-14 10:01:06 +02:00
Pavel Emelyanov	cc92683894	storage_service: Fix race and deadlock in removenode/force_removenode Here's a theoretical problem, that involves 3 sequential calls to respectively removenode, force_removenode and removenode (again) operations. Let's walk through them First goes the removenode: run_with_api_lock _operation_in_progress = "removenode" storage_service::remove_node sleep in replicating_nodes.empty() loop Now the force_removenode can run: run_with_no_api_lock storage_service::force_removenode check _operation_in_progress (not empty) _force_remove_completion = true sleep in _operation_in_progress.empty loop Now the 1st call wakes up and: if _force_remove_completion == true _force_remove_completion = false throw <some exception> .finally() handler in run_with_api_lock _operation_in_progress = <empty> ! at this point we have _force_remove_completion = false and _operation_in_progress = <empty>, which opens the following opportunity for the 3d removenode: run_with_api_lock _operation_in_progress = "removenode" storage_service::remove_node sleep in replicating_nodes.empty() loop Now here's what we have in 2nd and 3rd ops: 1. _operation_in_progress = "removenode" (set by 3rd) prevents the force_removenode from exiting its loop 2. _force_remove_completion = false (set by 1st on exit) prevents the removenode from waiting on replicating_nodes list One can start the 4th call with force_removenode, it will proceed and wake up the 3rd op, but after it we'll have two force_removenode-s running in parallel and killing each other. I propose not to set _force_remove_completion to false in removenode, but just exit and let the owner of this flag unset it once it gets the control back. Fixes #5590 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-14 10:01:06 +02:00
Benny Halevy	ff55b5dca3	cql3: functions: limit sum overflow detection to integral types Other types do not have a wider accumulator at the moment. And static_cast<accumulator_type>(ret) != _sum evaluates as false for NaN/Inf floating point values. Fixes #5586 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20200112183436.77951-1-bhalevy@scylladb.com>	2020-01-14 10:01:06 +02:00
Avi Kivity	e3310201dd	atomic_cell_or_collection: type-aware print atomic_cell or collection components Now that atomic_cell_view and collection_mutation_view have type-aware printers, we can use them in the type-aware atomic_cell_or_collection printer. Message-Id: <20191231142832.594960-1-avi@scylladb.com>	2020-01-14 10:01:06 +02:00
Avi Kivity	931b196d20	mutation_partition: row: resolve column name when in schema-aware printer Instead of printing the column id, print the full column name. Message-Id: <20191231142944.595272-1-avi@scylladb.com>	2020-01-14 10:01:06 +02:00
Nadav Har'El	4aa323154e	merge: Pretty print canonical_mutation objects Merged pull request https://github.com/scylladb/scylla/pull/5533 from Avi Kivity: canonical_mutation objects are used for schema reconciliation, which is a fragile area and thus deserves some debugging help. This series makes canonical_mutation objects printable.	2020-01-14 10:01:06 +02:00
Takuya ASADA	5241deda2d	dist: nonroot: fix CLI tool path for nonroot (#5584 ) CLI tool path is hardcorded, need to specify correct path on nonroot.	2020-01-14 10:01:06 +02:00

1 2 3 4 5 ...

20651 Commits