If an sstable unit test case is run individually, it fails with an
exception saying that the S3_... environment is not set. It's better to
skip the test case rather than fail it. If someone wants to run it from
a shell, they will have to prepare an S3 server (minio or a public AWS
bucket) and provide the proper environment for the test case.
refs: #13569
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Closes#13755
When test.py starts it activates a minio server inside test-dir and
configures an anonymous bucket for test cases to run on
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
When Fedora 37 came out, we discovered that its "pytest" script started
to run Python with the "-s" option, which caused problems for packages
installed personally via pip. We fixed this by adding our own wrapper
script test/pytest.
But this bug (https://bugzilla.redhat.com/show_bug.cgi?id=2152171) was
already fixed in Fedora 37, and the new version already reached our
dbuild. So we no longer need this wrapper script. Let's remove it.
Fixes#12412
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Closes#13083
Improve logging by printing the cluster at the end of each test.
Stop performing operations like attempting queries or dropping keyspaces on dirty clusters. Dirty clusters might be completely dead and these operations would only cause more "errors" to happen after a failed test, making it harder to find the real cause of failure.
Mark cluster as dirty when a test that uses it fails - after a failed test, we shouldn't assume that the cluster is in a usable state, so we shouldn't reuse it for another test.
Rely on the `is_dirty` flag in `PythonTest`s and `CQLApprovalTest`s, similarly to what `TopologyTest`s do.
Closes#12652
* github.com:scylladb/scylladb:
test.py: rely on ScyllaCluster.is_dirty flag for recycling clusters
test/topology: don't drop random_tables keyspace after a failed test
test/pylib: mark cluster as dirty after a failed test
test: pylib, topology: don't perform operations after test on a dirty cluster
test/pylib: print cluster at the end of test
`TopologyTest`s (used by `topology/` suite and friends) already relied
on the `is_dirty` flag stored in `ScyllaCluster` thanks to
`ScyllaClusterManager` (which passes the flag when returning a cluster
to the pool).
But `PythonTest`s (cql-pytest/ suite) and `CQLApprovalTest`s (cql/
suite) had different ways to decide whether a cluster should be
recycled. For example, `PythonTest` would recycle a cluster if
`after_test` raised an exception. This depended on a post-condition
check made by `after_test`: it would query the number of keyspaces and
throw an exception if it was different than when the test started. If
the cluster (which for `PythonTest` is always single-node) was dead,
this query would fail.
However, we modified the behavior of `after_test` in earlier commits -
it no longer performs the post-condition check on dirty clusters. So
it's also no longer reliable to use the exception raised by `after_test`
to decide that we should recycle the cluster.
Unify the behavior of `PythonTest` and `CQLApprovalTest` with what
`TopologyTest` does - using the `is_dirty` flag to decide that we should
recycle a cluster. Thanks to earlier commits, this flag is set to `True`
whenever a test fails, so it should cover most cases where we want to
recycle a cluster. (The only case not currently covered is if a
non-dirty cluster crashes after we perform the keyspace post-condition
check, which seems quite improbable.)
Note that this causes us to recycle clusters more often in these tests:
previously, when a `PythonTest` or `CQLApprovalTest` failed, but the
cluster was still alive and the post-condition check passed, we would
use the cluster for the next test. Now we recycle a cluster whenever a
test that used it fails.
We don't expect the cluster to be functioning at all after a failed
test. The whole cluster might have crashed, for example. In these
situations the framework would report multiple errors (one for the
actual failure, another for a failed post-condition check because the
cluster was down) which would only obscure the report and make debugging
harder. It's also not safe in general to reuse the cluster in another
test - if the previous test failed, we should not assume that it's in a
valid state.
Therefore, mark the cluster as dirty after a failed test. This will let
us recycle the cluster based on the dirty flag and it will disable
post-condition check after a failed test (which is only done on
non-dirty clusters).
To implement this in topology tests, we use the
`pytest_runtest_makereport` hook which executes after a test finishes
but before fixtures finish. There we store a test-failed flag in a stash
provided by pytest, then access the flag in the `manager` fixture.
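A minimal conftest.py sketch of this mechanism (the stash key, the `manager_internal` fixture, and the `mark_dirty` method are assumptions for illustration, not the actual test/pylib code):

```python
import pytest

# Stash key holding whether the test case failed; the name is an assumption.
TEST_FAILED_KEY = pytest.StashKey[bool]()

@pytest.hookimpl(hookwrapper=True)
def pytest_runtest_makereport(item, call):
    # Runs after the test body finishes, but before fixture teardown.
    outcome = yield
    report = outcome.get_result()
    if report.when == "call":
        item.stash[TEST_FAILED_KEY] = report.failed

@pytest.fixture
async def manager(request, manager_internal):
    yield manager_internal
    # Teardown: consult the flag stored by the hook above.
    if request.node.stash.setdefault(TEST_FAILED_KEY, False):
        await manager_internal.mark_dirty()
```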
The tests can now optionally run with multiple compaction groups via
the option --x-log2-compaction-groups.
This includes boost tests and the ones that run against either
one instance (e.g. cql) or many instances (e.g. topology).
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
The pool usage was kind of awkward previously: if the user of a pool
decided that a previously borrowed object should no longer be used,
it was their responsibility to destroy the object (releasing associated
resources and so on) and then call `steal()` on the pool to free space
for a new object.
Change the interface. Now the `Pool` constructor takes a `destroy`
function in addition to the `build` function. The user calls `put` to
return both objects that are still usable and those that aren't; for
the latter, they set `is_dirty=True`. The pool will
'destroy' the object with the provided function, which could mean e.g.
releasing associated resources.
For example, instead of:
```
if self.cluster.is_dirty:
    self.clusters.stop()
    self.clusters.release_ips()
    self.clusters.steal()
else:
    self.clusters.put(self.cluster)
```
we can now use:
```
self.clusters.put(self.cluster, is_dirty=self.cluster.is_dirty)
```
(assuming that `self.clusters` is a pool constructed with a `destroy`
function that stops the cluster and releases its IPs.)
Also extend the interface of the context manager obtained by
`instance()` - the user must now pass a flag `dirty_on_exception`. If
the context manager exits due to an exception and that flag was `True`,
the object will be considered dirty. The dirty flag can also be set
manually on the context manager. For example:
```
async with (cm := pool.instance(dirty_on_exception=True)) as server:
    cm.dirty = await run_test(test, server)
    # It will also be considered dirty if run_test throws an exception
```
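A runnable sketch of such a pool (simplified; the real `Pool` also implements the `instance()` context manager, and the exact signatures here are assumptions based on the description above):

```python
import asyncio

class Pool:
    """Pool that destroys dirty objects itself, instead of making the
    caller destroy them and then call steal()."""

    def __init__(self, size, build, destroy):
        self.build = build        # async () -> object
        self.destroy = destroy    # async (object) -> None
        self.sem = asyncio.Semaphore(size)  # caps total live objects
        self.free = []

    async def get(self):
        await self.sem.acquire()
        return self.free.pop() if self.free else await self.build()

    async def put(self, obj, is_dirty=False):
        if is_dirty:
            # e.g. stop the cluster and release its IPs
            await self.destroy(obj)
        else:
            self.free.append(obj)
        self.sem.release()
```

With a `destroy` function that stops the cluster and releases its IPs, the awkward branch shown earlier collapses into a single `put` call.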
Instead of complex async with logic, use manual cluster pool handling.
Revert the discard() logic in Pool from a recent commit.
Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>
Even though a test can't fail both before and after, make the logic
explicit in case the code changes in the future.
Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>
If the after-test check fails (!is_after_test_ok), discard the cluster
and raise an exception so the context manager (pool) does not recycle
it. Ignore the Pool exception re-raised by the context manager.
Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>
The log file produced by test.py combines logs coming from multiple
concurrent test runs. Each test has its own log file as well, but this
"global" log file is useful when debugging problems with topology tests,
since many events related to managing clusters are stored there.
Make the logs easier to read by including information about the test case
that's currently performing operations such as adding new servers to
clusters and so on. This includes the mode, test run name and the name
of the test case.
We do this by using custom `Logger` objects (instead of calling
`logging.info` etc. which uses the root logger) with `LoggerAdapter`s
that include the prefixes. A bit of boilerplate 'plumbing' through
function parameters is required but it's mostly straightforward.
This doesn't apply to all events, e.g. boost test cases, which don't
set up a "real" Scylla cluster. These events don't have additional
prefixes.
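The `LoggerAdapter` approach can be sketched as follows (the adapter name and exact prefix format are assumptions based on the log excerpt below):

```python
import logging

class PrefixAdapter(logging.LoggerAdapter):
    """Prepend a per-test-case prefix like mode/run.case to each message."""
    def process(self, msg, kwargs):
        return f"[{self.extra['prefix']}] {msg}", kwargs

logger = logging.getLogger("test.py")
case_log = PrefixAdapter(logger, {"prefix": "dev/topology.test_topology.1"})
case_log.info("adding server...")
# logged as: [dev/topology.test_topology.1] adding server...
```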
Example:
```
17:41:43.531 INFO> [dev/topology.test_topology.1] Cluster ScyllaCluster(name: 7a414ffc-903c-11ed-bafb-f4d108a9e4a3, running: ScyllaServer(1, 127.40.246.1, 29c4ec73-8912-45ca-ae19-8bfda701a6b5), ScyllaServer(4, 127.40.246.4, 75ae2afe-ff9b-4760-9e19-cd0ed8d052e7), ScyllaServer(7, 127.40.246.7, 67a27df4-be63-4b4c-a70c-aeac0506304f), stopped: ) adding server...
17:41:43.531 INFO> [dev/topology.test_topology.1] installing Scylla server in /home/kbraun/dev/scylladb/testlog/dev/scylla-10...
17:41:43.603 INFO> [dev/topology.test_topology.1] starting server at host 127.40.246.10 in scylla-10...
17:41:43.614 INFO> [dev/topology.test_topology.2] Cluster ScyllaCluster(name: 7a497fce-903c-11ed-bafb-f4d108a9e4a3, running: ScyllaServer(2, 127.40.246.2, f59d3b1d-efbb-4657-b6d5-3fa9e9ef786e), ScyllaServer(5, 127.40.246.5, 9da16633-ce53-4d32-8687-e6b4d27e71eb), ScyllaServer(9, 127.40.246.9, e60c69cd-212d-413b-8678-dfd476d7faf5), stopped: ) adding server...
17:41:43.614 INFO> [dev/topology.test_topology.2] installing Scylla server in /home/kbraun/dev/scylladb/testlog/dev/scylla-11...
17:41:43.670 INFO> [dev/topology.test_topology.2] starting server at host 127.40.246.11 in scylla-11...
```
The logs often mention the test run and the current test case in a given
run, such as `test_topology.1` and
`test_topology.1::test_add_server_add_column`. However, if we run
test.py in multiple modes, the different modes might be running the same
test case and the logs become confusing. To disambiguate, prefix the
test run/case names with the mode name.
Example:
```
Leasing Scylla cluster ScyllaCluster(name: 7a414ffc-903c-11ed-bafb-f4d108a9e4a3, running: ScyllaServer(1, 127.40.246.1, 29c4ec73-8912-45ca-ae19-8bfda701a6b5), ScyllaServer(4, 127.40.246.4, 75ae2afe-ff9b-4
760-9e19-cd0ed8d052e7), ScyllaServer(7, 127.40.246.7, 67a27df4-be63-4b4c-a70c-aeac0506304f), stopped: ) for test dev/topology.test_topology.1::test_add_server_add_column
```
raft_group0 used to register RPC verbs only on shard 0. This worked on
clusters with the same --smp setting on all nodes, since RPCs in this
case are processed on the same shard as the calling code, and
raft_group0 methods only run on shard 0.
A new test test_nodes_with_different_smp was added to identify the
problem. Since --smp can only be specified via the command line, a
corresponding parameter was added to the ManagerClient.server_add
method. It allows overriding the default parameters set by the
SCYLLA_CMDLINE_OPTIONS variable by changing, adding or deleting
individual items.
Fixes: #12252
Closes#12374
* github.com:scylladb/scylladb:
raft: raft_group0, register RPC verbs on all shards
raft: raft_append_entries, copy entries to the target shard
test.py, allow to specify the node's command line in test
An optional parameter cmdline has been added to the
ManagerClient.server_add method. It allows overriding the default
parameters set by the SCYLLA_CMDLINE_OPTIONS variable by changing,
adding or deleting individual items. To change or add a parameter, just
specify its name and value one after the other. To remove a parameter,
use the special keyword __remove__ as the value. To set a parameter
without a value (such as --overprovisioned), use the special keyword
__missing__ as the value.
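A sketch of these merging semantics (`merge_cmdline` is a hypothetical helper illustrating the description, not the actual pylib code):

```python
def merge_cmdline(default, override):
    """Merge SCYLLA_CMDLINE_OPTIONS-style defaults with a per-test
    cmdline. Both are flat lists like ["--smp", "2"]."""
    def to_dict(args):
        opts, i = {}, 0
        while i < len(args):
            name = args[i]
            if i + 1 < len(args) and not args[i + 1].startswith("--"):
                opts[name] = args[i + 1]
                i += 2
            else:
                # flag with no value, e.g. --overprovisioned
                opts[name] = "__missing__"
                i += 1
        return opts

    opts = to_dict(default)
    opts.update(to_dict(override))   # change or add individual items
    out = []
    for name, value in opts.items():
        if value == "__remove__":
            continue                 # drop the option entirely
        out.append(name)
        if value != "__missing__":
            out.append(value)
    return out
```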
ea99750de7 ("test: give tests less-unique identifiers") made the
disambiguating ids unique only within a single test case. This made all
tests named "run" end up with the same name, "run.1".
Fix that by adding the suite name everywhere: in test paths, and
in junit test case names.
Fixes#12310.
Closes#12313
The format_unidiff() function takes str, not a pathlib PosixPath, so
convert it to str.
This prevented the diff output of an unexpected result from being shown
in the log file.
Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>
Test identifiers are globally unique, but this makes them less useful
in the Jenkins Test Result Analyzer view. For example, counter_test can
be counter_test.432 in one run and counter_test.442 in another. Jenkins
considers them different tests, so we don't see a trend.
Limit the id uniqueness to within a test case, so that we'll have
counter_test.{1, 2, 3} consistently. Those tests will be grouped
together so we can see pass/fail trends.
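The per-test-case numbering can be sketched like this (a hypothetical helper, not the actual test.py code):

```python
from collections import defaultdict

_next_id = defaultdict(int)  # shortname -> last id handed out

def allocate_test_id(shortname):
    """Ids restart at 1 for each test case name, so repeated runs of
    counter_test get counter_test.1, .2, .3 consistently."""
    _next_id[shortname] += 1
    return f"{shortname}.{_next_id[shortname]}"
```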
Closes#11946
`ScyllaServer`s were constructed without IP addresses. They leased an
IP address from `HostRegistry` and released it in `uninstall`.
This responsibility is now moved into `ScyllaCluster`, which leases an
IP address for a server before constructing it and passes it to the
constructor. It releases the addresses of its servers when uninstalling
itself.
This will allow the cluster to reuse the IP address of an existing
server in that cluster when adding a new server which wants to replace
the existing one. Instead of leasing a new address, it will pass
the existing IP address to the new server's constructor.
The refactor is also nice in that it establishes an invariant for
`ScyllaServer`, simplifying reasoning about the class: now it has
an `ip_addr` field at all times.
`host_registry` was moved from `ScyllaServer` to `ScyllaCluster`.
`ScyllaCluster` constructor takes a function `create_server` which
itself now takes 3 parameters, and soon it will take a 4th. The list
of parameters is repeated at the constructor definition and at the call
site of the constructor; with this many parameters it becomes tiresome.
Refactor the list of parameters to a `NamedTuple`.
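For illustration, the refactor looks roughly like this (the field names are assumptions, not the actual pylib definitions):

```python
from typing import NamedTuple

class CreateServerParams(NamedTuple):
    # Field names are assumptions for illustration.
    cluster_name: str
    ip_addr: str
    seeds: list

def create_server(params: CreateServerParams) -> str:
    # Stand-in for the real factory; returns a description string here.
    return f"ScyllaServer({params.cluster_name}, {params.ip_addr})"
```

Both the constructor definition and its call sites now spell the parameter list once, in the tuple type.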
`self.artifacts` was calling `ScyllaServer.stop` and
`ScyllaServer.uninstall`. Now it calls `ScyllaCluster.stop` and
`ScyllaCluster.uninstall`, which underneath stops/uninstalls
servers in this cluster.
We must be a bit more careful now in case installing/starting a
server inside a cluster fails: there are no server cleanup artifacts,
and a server is added to cluster's `running` map only after
`install_and_start` finishes (until that happens,
`ScyllaCluster.stop/uninstall` won't catch this server).
So handle failures explicitly in `install_and_start`.
This commit does not logically change how the tests are running - every
started server belongs to some cluster, so it will be cleaned up
- but it's an important refactor.
It will allow us to move IP address (de)allocation code outside
`ScyllaServer`, into `ScyllaCluster`, which in turn will allow us to
implement node replace operation for the case where we want to reuse
the replaced node's IP.
Also, `ScyllaCluster.uninstall` was unused before this change; now it
is used.
Some tests may take longer than a few seconds to run. We want to
mark such tests in some way, so that we can run them selectively.
This patch proposes to use pytest markers for this. The markers
from the test.py command line are passed to pytest
as is via the -m parameter.
By default, the marker filter is not applied and all tests
will be run without exception. To exclude e.g. slow tests
you can write --markers 'not slow'.
The --markers parameter is currently only supported
by Python tests, other tests ignore it. We intend to
support this parameter for other types of tests in the future.
Another possible improvement is not to run suites for which
all tests have been filtered out by markers. The markers are
currently handled by pytest, which means that the logic in
test.py (e.g., running a scylla test cluster) will be run
for such suites.
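The plumbing can be sketched as follows (the helper name and paths are hypothetical; the real logic lives in test.py):

```python
import sys

def pytest_command(test_path, markers=None):
    """Build the pytest invocation; a --markers value from the test.py
    command line is forwarded verbatim via pytest's -m option."""
    cmd = [sys.executable, "-m", "pytest", test_path]
    if markers is not None:
        cmd += ["-m", markers]   # e.g. "not slow"
    return cmd
```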
Closes#11713
Modern (as of Fedora 37) pytest has the "-sP" flags in the Python command
line, as found in /usr/bin/pytest. This means it will reject the
site-packages directory, where we install the Scylla Python driver. This
causes all the tests to fail.
Work around it by supplying an alternative pytest script that does not
have this change.
Closes#11764
Instead of `test.py.log`, use:
`test.py.dev.log`
when running with `--mode dev`,
`test.py.dev-release.log`
when running with `--mode dev --mode release`,
and so on.
This is useful in Jenkins which is running test.py multiple times in
different modes; a later run would overwrite a previous run's test.py
file. With this change we can preserve the test.py files of all of these
runs.
Closes#11678
Fix the type of `create_server`, rename `topology_for_class` to `get_cluster_factory`, simplify the suite definitions and parameters passed to `get_cluster_factory`
Closes#11590
* github.com:scylladb/scylladb:
test.py: replace `topology` with `cluster_size` in Topology tests
test.py: rename `topology_for_class` to `get_cluster_factory`
test/pylib: ScyllaCluster: fix create_server parameter type
First, a reminder of a few basic concepts in Scylla:
- "topology" is a mapping: for each node, its DC and Rack.
- "replication strategy" is a method of calculating replica sets in
a cluster. It is not a cluster-global property; each keyspace can have
a different replication strategy. A cluster may have multiple
keyspaces.
- "cluster size" is the number of nodes in a cluster.
Replication strategy is orthogonal to topology. Cluster size can be
derived from topology and is also orthogonal to replication strategy.
test.py conflated these three concepts. For some reason,
Topology suites were specifying a "topology" parameter which contained
replication strategy details - having nothing to do with topology. Also
it's unclear why a test suite would specify anything to do with
replication strategies - after all, a test may create keyspaces with
different replication strategies, and a suite may contain multiple
different tests.
Get rid of the "topology" parameter, replace it with a simple
"cluster_size". In the future we may re-introduce it when we actually
implement the possibility to start clusters with custom topologies
(which involves configuring the snitch etc.). Simplify the test.py
code.
The previous name had nothing to do with what the function calculated
and returned (it returned a `create_cluster` function; the standard name
for a function that constructs objects would be 'factory', so
`get_cluster_factory` is an appropriate name for a function that returns
cluster factories).
The only usage of `ScyllaCluster` constructor passed a `create_server`
function which expected a `List[str]` for the second parameter, while
the constructor specified that the function should expect an
`Optional[List[str]]`. There was no reason for the latter, we can easily
fix this type error.
Also give a type hint for `create_cluster` function in
`PythonTestSuite.topology_for_class`. This is actually what caught the
type error.
Start ScyllaClusterManager within error handling so that the
ScyllaCluster logs are available in case of an error starting up.
Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>
Previously, if the suite.yaml file provided
`extra_scylla_config_options` but didn't provide values for `authorizer`
or `authenticator` inside the config options, the harness wouldn't give
any defaults for these keys. It would only provide defaults for these
keys if suite.yaml didn't specify `extra_scylla_config_options` at all.
It makes sense to give the user the ability to provide extra options
while relying on harness defaults for `authenticator` and `authorizer`
if the user doesn't care about them.
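A sketch of the intended merge (the helper name is hypothetical; the default values are the ones mentioned elsewhere in this log):

```python
def scylla_config(extra_scylla_config_options=None):
    """Start from the suite's extra options, then fill in harness
    defaults only for the keys the suite did not set."""
    opts = dict(extra_scylla_config_options or {})
    opts.setdefault("authenticator", "PasswordAuthenticator")
    opts.setdefault("authorizer", "CassandraAuthorizer")
    return opts
```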
Enable pytest log capture for Python suite. This will help debugging
issues in remote machines.
Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>
- Remove `ScyllaCluster.__getitem__()` (pending request by @kbr- in a previous pull request), for this remove all direct access to servers from caller code
- Increase Python driver timeouts (req by @nyh)
- Improve `ManagerClient` API requests: use `http+unix://<sockname>/<resource>` instead of `http://localhost/<resource>` and callers of the helper method only pass the resource
- Improve lint and type hints
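The URL scheme can be sketched like this, following the requests-unixsocket convention of percent-encoding the socket path (whether test.py encodes it exactly this way is an assumption):

```python
from urllib.parse import quote

def resource_url(sockname, resource):
    """Callers pass only the resource, e.g. "/cluster/up"; this builds
    the full http+unix:// URL for the manager socket."""
    return f"http+unix://{quote(sockname, safe='')}{resource}"
```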
Closes#11305
* github.com:scylladb/scylladb:
test.py: remove ScyllaCluster.__getitem__()
test.py: ScyllaCluster check keyspace with any server
test.py: ScyllaCluster server error log method
test.py: ScyllaCluster read_server_log()
test.py: save log point for all running servers
test.py: ScyllaCluster provide endpoint
test.py: build host param after before_test
test.py: manager client disable lint warnings
test.py: scylla cluster lint and type hint fixes
test.py: increase more timeouts
test.py: ManagerClient improve API HTTP requests
Provide server error logs to caller (test.py).
Avoids direct access to list of servers.
To be done later: pick the failed server. For now it just provides the
log of one server.
While there, fix type hints.
Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>
Instead of accessing the first server, now test.py asks ScyllaCluster
for the server log.
In a later commit, ScyllaCluster will pick the appropriate server.
Also removes another direct access to the list of servers we want to get
rid of.
For error reporting, a mark of the log position is saved before a
test. Previously, this was only done for the log of the first server.
Now it's done for all running servers.
While there, remove direct access to servers on test.py.
Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>
If no server started, there is no server in the cluster list. So only
build the pytest --host param after the before_test check is done.
Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>
Preparing for topology tests with changing clusters, run before and
after checks per test case.
Change the scope of pytest fixtures to function as we need them per
test case.
Add server and client API logic.
Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>
Add an API via Unix socket to Manager so pytests can query information
about the cluster. Requests are managed by ManagerClient helper class.
The socket is placed inside a unique temporary directory for the
Manager (as a safe temporary socket filename is not possible in
Python).
The initial API services are: manager up, cluster up, whether the
cluster is dirty, CQL port, configured replicas (RF), and list of host
ids.
Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>
Instead of only using the last started server as seed, use all started
servers as seeds for new servers.
This also avoids tracking the last server's state.
Pass an empty list instead of None.
Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>
Preparing to cycle clusters modified (dirty) and use multiple clusters
per topology pytest, introduce Topology tests and Manager class to
handle clusters.
Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>
For Scylla servers, keep the default PasswordAuthenticator and
CassandraAuthorizer but allow this to be configurable per test suite.
Use AllowAll* for the topology test suite.
Disabling authentication avoids complications later for topology tests,
as the system_auth keyspace starts with RF=1 and tests take down nodes.
The keyspace would need to change RF and run repair. Using AllowAll
avoids this problem altogether.
A different cql fixture, without auth, is created for topology tests.
Topology tests require servers without auth in the scylla.yaml conf.
Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>
The code in test.py using a ScyllaCluster gets a server id and takes
logs from only the first server.
If there is a failure in another server, it's not reported properly.
And the CQL connection will go only to the first server.
Also, it might be better to have ScyllaCluster to handle these matters
and be more opaque.
Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>