scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-24 18:40:38 +00:00

Author	SHA1	Message	Date
Benny Halevy	79c19a166c	sstables: writer: keep track of large data stats In the next patch, this is will be written to the sstable's scylla_metadata component. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-12-01 15:19:41 +02:00
Benny Halevy	8ab053bd44	large_data_handler: expose methods to get threshold To be used for keeping large_data statistics in sstable. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-12-01 15:18:14 +02:00
Benny Halevy	f1257dfdc0	sstables: kl/writer: never record too many rows rows_count is not tracked prior to the mc format. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-12-01 15:18:14 +02:00
Benny Halevy	dd7422a713	large_data_handler: indicate recording of large data entries Return true from the maybe_{record,log}_* methods if a large data record or log entry were emitted. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-12-01 15:18:14 +02:00
Benny Halevy	873107821b	large_data_handler: move constructor out of line No need for it to be inlined. Also, add debug logging to the large data handler options. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-12-01 15:18:14 +02:00
Avi Kivity	ea9c058be3	Merge 'Don't use secondary indices for multi-column restrictions' from Dejan Mircevski Fix #7680 by never using secondary index for multi-column restrictions. Modify expr::is_supported_by() to handle multi-column correctly. Tests: unit (dev) Closes #7699 * github.com:scylladb/scylla: cql3/expr: Clarify multi-column doesn't use indexing cql3: Don't use index for multi-column restrictions test: Add eventually_require_rows	2020-11-30 12:38:26 +02:00
Avi Kivity	12c20c4101	Merge 'test/cql-pytest: tests for string validation (UTF-8 and ASCII)' from Nadav Har'El The first two patches in this series are small improvements to cql-pytest to prepare for the third and main patch. This third patch adds cql-pytest tests which check that we fail CQL queries that try to inject non-ASCII and non-UTF-8 strings for ascii and text columns, respectively. The tests do not discover any unknown bug in Scylla, however, they do show that Scylla is more strict in its definition of "valid UTF-8" compared to Cassandra. Closes #7719 * github.com:scylladb/scylla: test/cql-pytest: add tests for validation of inserted strings test/cql-pytest: add "scylla_only" fixture test/cpy-pytest: enable experimental features	2020-11-30 12:26:25 +02:00
Piotr Wojtczak	3560acd311	cql_metrics: Add metrics for CQL errors This change adds tracking of all the CQL errors that can be raised in response to a CQL message from a client, as described in the CQL v4 protocol and with Scylla's CDC_WRITE_FAILUREs included. Fixes #5859 Closes #7604	2020-11-30 12:18:37 +02:00
Takuya ASADA	6238d105d9	dist/redhat: drop Conflicts with older kernel We have "Conflicts: kernel < 3.10.0-514" on rpm package to make sure the environment is running newer kernel. However, user may use non-standard kernel which has different package name, like kernel-ml or kernel-uek. On such environment Conflicts tag does not works correctly. Even the system running with newer kernel, rpm only checks "kernel" package version number. To avoid such issue, we need to drop Conflicts tag. Fixes #7675	2020-11-30 11:38:42 +02:00
Nadav Har'El	48c78ade33	test/cql-pytest: add tests for validation of inserted strings This patch adds comprehensive cql-pytest tests for checking the validation of strings - ASCII or UTF-8 - in CQL. Strings can be represented in CQL using several methods - a strings can be a string literal as part of the statement, can be encoded as a blob (0x...), or can be a binding parameter for a prepared statement, or returned by user-defined functions - and these tests check all of them. We already have low-level unit tests for UTF-8 parsing in test/boost/utf8_test.cc, but the new tests here confirms that we really call these low-level functions in the correct way. Moreover, since these are CQL tests, they can also be run against Cassandra, and doing that demonstrated that Scylla's UTF-8 parsing is stricter than Cassandra's - Scylla's UTF-8 parser rejects the following sequences which Cassandra's accepts: 1. \xC0\x80 as another non-minimal representation of null. Note that other non-minimal encodings are rejected by Cassandra, as expected. 2. Characters beyond the official Unicode range (or what Scylla considers the end of the range). 3. UTF-16 surrogates - these are not considered valid UTF-8, but Cassandra accepts them, and Scylla does not. In the future, we should consider whether Scylla is more correct than Cassandra here (so we're fine), or whether compatibility is more important than correctness (so this exposed a bug). The ASCII tests reproduces issue #5421 - that trying to insert a non-ASCII string into an "ascii" column should produce an error on insert - not later when fetching the string. This test now passes, because issue 5421 was already fixed. These tests did not exposed any bug in Scylla (other than the differences with Cassandra mentioned a bug), so all of them pass on Scylla. Two of the tests fail on Cassandra, because Cassandra does not recognize some invalid UTF-8 (according to Scylla's definition) as invalid. Refs #5421. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-11-29 17:43:20 +02:00
Dejan Mircevski	5bc7e31284	restrictions: Forbid mixing ck=0 and (ck)=(0) Reject the previously accepted case where the multi-column restriction applied to just a single column, as it causes a crash downstream. The user can drop the parentheses to avoid the rejection. Fixes #7710 Signed-off-by: Dejan Mircevski <dejan@scylladb.com> Closes #7712	2020-11-29 17:06:41 +02:00
Avi Kivity	0584db1eb3	Merge "Unstall cleanup_compaction::get_ranges_for_invalidation" from Benny " This series adds maybe_yield called from cleanup_compaction::get_ranges_for_invalidation to avoid reactor stalls. To achieve that, we first extract bool_class can_yield to utils/maybe_yield.hh, and add a convience helper: utils::maybe_yield(can_yield) that conditionally calls seastar::thread::maybe_yield if it can (when called in a seastar thread). With that, we add a can_yield parameter to dht::to_partition_ranges and dht::partition_range::deoverlap (defaults to false), and use it from cleanup_compaction::get_ranges_for_invalidation, as the latter is always called from `consume_in_thread`. Fixes #7674 Test: unit(dev) " * tag 'unstall-get_ranges_for_invalidation-v2' of github.com:bhalevy/scylla: compaction: cleanup_compaction: get_ranges_for_invalidation: add yield points dht/i_partitioner: to_partition_ranges: support yielding locator: extract can_yield to utils/maybe_yield.hh	2020-11-29 14:10:39 +02:00
Asias He	0a3a2a82e1	api: Add force_remove_endpoint for gossip It is used to force remove a node from gossip membership if something goes wrong. Note: run the force_remove_endpoint api at the same time on _all_ the nodes in the cluster in order to prevent the removed nodes come back. Becasue nodes without running the force_remove_endpoint api cmd can gossip around the removed node information to other nodes in 2 * ring_delay (2 * 30 seconds by default) time. For instance, in a 3 nodes cluster, node 3 is decommissioned, to remove node 3 from gossip membership prior the auto removal (3 days by default), run the api cmd on both node 1 and node 2 at the same time. $ curl -X POST --header "Accept: application/json" "http://127.0.0.1:10000/gossiper/force_remove_endpoint/127.0.0.3" $ curl -X POST --header "Accept: application/json" "http://127.0.0.2:10000/gossiper/force_remove_endpoint/127.0.0.3" Then run 'nodetool gossipinfo' on all the nodes to check the removed nodes are not present. Fixes #2134 Closes #5436	2020-11-29 13:58:46 +02:00
Nadav Har'El	0864933d4d	test/cql-pytest: add "scylla_only" fixture This patch adds a fixture "scylla_only" which can be used to mark tests for Scylla-specific features. These tests are skipped when running against other CQL servers - like Apache Cassandra. We recognize Scylla by looking at whether any system table exists with the name "scylla" in its name - Scylla has several of those, and Cassandra has none. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-11-29 10:18:58 +02:00
Nadav Har'El	91ccb2afb5	test/cpy-pytest: enable experimental features Enable experimental features, and in particular UDF, so we can test those features in our tests. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-11-29 10:18:58 +02:00
Lubos Kosco	4d0587ed11	scylla_util.py: fix metadata gcp call for disks to get details disk parsing expects output from recursive listing of GCP metadata REST call, the method used to do it by default, but now it requires a boolean flag to run in recursive mode Fixes #7684 Closes #7685	2020-11-27 15:20:56 +02:00
Pekka Enberg	c84754a634	Update tools/java submodule * tools/java ad48b44a26...8080009794 (1): > sstableloader: Fix command line parsing of "ignore-missing-columns"	2020-11-27 15:19:48 +02:00
Avi Kivity	390e07d591	dist: sysctl: configure more inotify instances Since `f3bcd4d205` ("Merge 'Support SSL Certificate Hot Reloading' from Calle"), we reload certificates as they are modified on disk. This uses inotify, which is limited by a sysctl fs.inotify.max_user_instances, with a default of 128. This is enough for 64 shards only, if both rpc and cql are encrypted; above that startup fails. Increase to 1200, which is enough for 6 instances * 200 shards. Fixes #7700. Closes #7701	2020-11-26 23:44:48 +02:00
Takuya ASADA	5f81f97773	install.sh: apply sysctl.d files on non-packaging installation We don't apply sysctl.d files on non-packaging installation, apply them just like rpm/deb taking care of that. Fixes #7702 Closes #7705	2020-11-26 09:52:14 +02:00
Takuya ASADA	ba4d54efa3	dist/redhat: packaging dependencies.conf as normal file, not ghost When we introduced dependencies.conf, we mistakenly added it on rpm as %ghost, but it should be normal file, should be installed normally on package installation. Fixes #7703 Closes #7704	2020-11-26 09:50:05 +02:00
Dejan Mircevski	7f8ed811c1	cql3/expr: Clarify multi-column doesn't use indexing Although not currently used, the old code was wrong and confusing to readers. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-11-25 10:59:13 -05:00
Avi Kivity	956f031a68	Merge 'Add missing shaded<>::stop in exceptional startup code for CQL/redis' from Calle Wilund Fixes #7211 If we start a sharded<> object, then proceed to do potentially exceptional stuff, we should destroy it on said exception. Otherwise, the exception propagation will abort on RAII destruction of the sharded<>. And we get no exception logging. Closes #7697 * github.com:scylladb/scylla: redis::service: Shut down sharded<> subobject on startup exception transport::controller: Shut down distributed object on startup exception	2020-11-25 17:57:53 +02:00
Calle Wilund	55acf09662	redis::service: Shut down sharded<> subobject on startup exception Refs #7211 If we start a sharded<> object, then proceed to do potentially exceptional stuff, we should destroy it on said exception. Otherwise, the exception propagation will abort on RAII destruction of the sharded<>. And we get no exception logging.	2020-11-25 15:52:47 +00:00
Calle Wilund	ae4d5a60ca	transport::controller: Shut down distributed object on startup exception Fixes #7211 If we start a sharded<> object, then proceed to do potentially exceptional stuff, we should destroy it on said exception. Otherwise, the exception propagation will abort on RAII destruction of the sharded<>. And we get no exception logging.	2020-11-25 15:52:47 +00:00
Dejan Mircevski	db63b40347	cql3: Don't use index for multi-column restrictions The downstream code expects a single-column restriction when using an index. We could fix it, but we'd still have to filter the rows fetched from the index table, unlike the code that queries the base table directly. For instance, WHERE (c1,c2,c3) = (1,2,3) with an index on c3 can fetch just the right rows from the base table but all the c3=3 rows from the index table. Fixes #7680 Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-11-25 10:39:04 -05:00
Dejan Mircevski	ab7aa57b24	test: Add eventually_require_rows Makes it easier to combine eventually{assert_that} with useful error messages. Refs #7573. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-11-25 10:34:44 -05:00
Benny Halevy	e1fe1f18c7	compaction: cleanup_compaction: get_ranges_for_invalidation: add yield points Avoid reactor stalls by allowing yielding in long-running loops as seen in #7674. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-11-25 13:46:32 +02:00
Tomasz Grabiec	0aa1f7c70a	Merge "raft: fix replication if existing log on leader" from Gleb * scylla-dev/add_dummy_v2: raft: test: replication works on leader change without adding an entry raft: commit a dummy entry after leader change raft: test: fix snapshot correctness check sstables: add `may_have_partition_tombstones` method	2020-11-24 11:35:18 +01:00
Gleb Natapov	51d1d20687	raft: test: replication works on leader change without adding an entry Check that a newly elected leader commits all the entries in its log without waiting for more entries to be submitted.	2020-11-24 11:35:18 +01:00
Gleb Natapov	6130fb8b39	raft: commit a dummy entry after leader change After a node becomes leader it needs to do two things: send an append message to establish its leadership and commit one entry to make sure all previous entries with smaller terms are committed as well.	2020-11-24 11:35:18 +01:00
Gleb Natapov	e3a886738b	raft: test: fix snapshot correctness check Snapshot index cannot be used to check snapshot correctness since some entries may not be command and thus do not affect snapshot value. Lest use applied entries count instead.	2020-11-24 11:35:18 +01:00
Benny Halevy	37e971ad87	dht/i_partitioner: to_partition_ranges: support yielding Allow yielding to prevent reactor stalls when called with a long vector of ranges. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-11-24 12:23:56 +02:00
Benny Halevy	157a964a63	locator: extract can_yield to utils/maybe_yield.hh Move the definition of bool_class can_yield to a standalone header file and define there a maybe_yield(can_yield) helper. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-11-24 12:23:56 +02:00
Asias He	1b2155eb1d	repair: Use same description for the same metric In commit `9b28162f88` (repair: Use label for node ops metrics), we switched to use label for different node operations. We should use the same description for the same metric name. Fixes #7681 Closes #7682	2020-11-24 09:35:39 +02:00
Avi Kivity	e8ff77c05f	Merge 'sstables: a bunch of refactors' from Kamil Braun 1. sstables: move `sstable_set` implementations to a separate module All the implementations were kept in sstables/compaction_strategy.cc which is quite large even without them. `sstable_set` already had its own header file, now it gets its own implementation file. The declarations of implementation classes and interfaces (`sstable_set_impl`, `bag_sstable_set`, and so on) were also exposed in a header file, sstable_set_impl.hh, for the purposes of potential unit testing. 2. mutation_reader: move `mutation_reader::forwarding` to flat_mutation_reader.hh Files which need this definition won't have to include mutation_reader.hh, only flat_mutation_reader.hh (so the inclusions are in total smaller; mutation_reader.hh includes flat_mutation_reader.hh). 3. sstables: move sstable reader creation functions to `sstable_set` Lower level functions such as `create_single_key_sstable_reader` were made methods of `sstable_set`. The motivation is that each concrete sstable_set may decide to use a better sstable reading algorithm specific to the data structures used by this sstable_set. For this it needs to access the set's internals. A nice side effect is that we moved some code out of table.cc and database.hh which are huge files. 4. sstables: pass `ring_position` to `create_single_key_sstable_reader` instead of `partition_range`. It would be best to pass `partition_key` or `decorated_key` here. However, the implementation of this function needs a `partition_range` to pass into `sstable_set::select`, and `partition_range` must be constructed from `ring_position`s. We could create the `ring_position` internally from the key but that would involve a copy which we want to avoid. 5. sstable_set: refactor `filter_sstable_for_reader_by_pk` Introduce a `make_pk_filter` function, which given a ring position, returns a boolean function (a filter) that given a sstable, tells whether the sstable may contain rows with the given position. The logic has been extracted from `filter_sstable_for_reader_by_pk`. Split from #7437. Closes #7655 * github.com:scylladb/scylla: sstable_set: refactor filter_sstable_for_reader_by_pk sstables: pass ring_position to create_single_key_sstable_reader sstables: move sstable reader creation functions to `sstable_set` mutation_reader: move mutation_reader::forwarding to flat_mutation_reader.hh sstables: move sstable_set implementations to a separate module	2020-11-24 09:23:57 +02:00
Kamil Braun	d158921966	sstables: add `may_have_partition_tombstones` method For sstable versions greater or equal than md, the `min_max_column_names` sstable metadata gives a range of position-in-partitions such that all clustering rows stored in this sstable have positions in this range. Partition tombstones in this context are understood as covering the entire range of clustering keys; thus, if the sstable contains at least one partition tombstone, the sstable position range is set to be the range of all clustered rows. Therefore, by checking that the position range is not the range of all clustered rows we know that the sstable cannot have any partition tombstones. Closes #7678	2020-11-23 23:30:19 +02:00
Kamil Braun	72c59e8000	flat_mutation_reader: document assumption about fast_forward_to It is not legal to fast forward a reader before it enters a partition. One must ensure that there even is a partition in the first place. For this one must fetch a `partition_start` fragment. Closes #7679	2020-11-23 17:39:46 +01:00
Pavel Emelyanov	fea4a5492f	system-keyspace: Remove dead code Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20201123151453.27341-1-xemul@scylladb.com>	2020-11-23 17:16:15 +02:00
Tomasz Grabiec	36f9da6420	Merge "raft: testing: snapshots and partitioning elections" from Alejo Fixes, features needed for testing, snapshot testing. Free election after partitioning (replication test) . * https://github.com/alecco/scylla/tree/raft-ale-tests-05e: raft: replication test: partitioning with leader raft: replication test: run free election after partitioning raft: expose fsm tick() to server for testing raft: expose is_leader() for testing raft: replication test: test take and load snapshot raft: fix a bug in leader election raft: fix default randomized timeout raft: replication test: fix custom next leader raft: replication test: custom next leader noop for same raft: replication test: fix failure detector for disconnected	2020-11-23 14:36:39 +01:00
Kamil Braun	6c8b0af505	sstable_set: refactor filter_sstable_for_reader_by_pk Introduce a `make_pk_filter` function, which given a ring position, returns a boolean function (a filter) that given a sstable, tells whether the sstable may contain rows with the given position. The logic has been extracted from `filter_sstable_for_reader_by_pk`.	2020-11-23 12:35:10 +01:00
Kamil Braun	68663d0de0	sstables: pass ring_position to create_single_key_sstable_reader instead of partition_range. It would be best to pass `partition_key` or `decorated_key` here. However, the implementation of this function needs a `partition_range` to pass into `sstable_set::select`, and `partition_range` must be constructed from `ring_position`s. We could create the `ring_position` internally from the key but that would involve a copy which we want to avoid.	2020-11-23 12:33:24 +01:00
Takuya ASADA	b90ddc12c9	scylla_prepare: add --tune system when SET_CLOCKSOURCE=yes perftune.py only run clocksource setup when --tune system specified, so we need to add it on the parameter when SET_CLOCKSOURCE=yes. Fixes #7672	2020-11-23 10:51:16 +02:00
Avi Kivity	f8e0517bc7	cql: do not advance timeouts on internal pages Currently, each internal page fetched during aggregating gets a timeout based on the time the page fetch was started, rather than the query start time. This means the query can continue processing long after the client has abandoned it due to its own timeout, which is based on the query start time. Fix by establishing the timeout once when the query starts, and not advancing it. Test: manual (SELECT count(*) FROM a large table). Fixes #1175. Closes #7662	2020-11-23 08:14:18 +01:00
Alejo Sanchez	1f8ca4e06d	raft: replication test: partitioning with leader For test simplicity support partition{leader{A},B,C,D} Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2020-11-22 22:39:00 -04:00
Avi Kivity	3eac976e24	build: remove non-C/C++ jobs from submodule_pools The C and C++ sub-builds were placed in submodule_pool to reduce concurrency, as they are memory intensive (well, at least the C++ jobs are), and we choose build concurrency based on memory. But the other submodules are not memory intensives, and certainly the packaging jobs are not (and they are single-threaded too). To allow these simple jobs to utilize multicores more efficiently, remove them from submodule_pool so they can run in parallel. Closes #7671	2020-11-23 00:32:41 +02:00
Avi Kivity	bcced9f56b	build: compress unified package faster The unified package is quite large (1GB compressed), and it is the last step in the build so its build time cannot be parallized with other tasks. Compress it with pigz to take advantage of multiple cores and speed up the build a little. Closes #7670	2020-11-23 00:31:04 +02:00
Takuya ASADA	3fefa520bd	dist/common/scripts: drop run() and out(), swtich to subprocess.run() We initially implemented run() and out() functions because we couldn't use subprocess.run() since we were on Python 3.4. But since we moved to relocatable python3, we don't need to implement it ourselves. Why we keep using these functions are, because we needed to set environemnt variable to set PATH. Since we recently moved away these codes to python thunk, we finally able to drop run() and out(), switch to subprocess.run().	2020-11-22 17:59:27 +02:00
Alejo Sanchez	f12fed0809	raft: replication test: run free election after partitioning When partitioning without keeping the existing leader, run an election without forcing a particular leader. To force a leader after partitioning, a test can just set it with new_leader{X}. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2020-11-22 10:32:34 -04:00
Alejo Sanchez	d610d5a7b8	raft: expose fsm tick() to server for testing For tests to advance servers they need to invoke tick(). This is needed to advance free elections. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2020-11-22 10:32:34 -04:00
Alejo Sanchez	9e7e14fc50	raft: expose is_leader() for testing Expose fsm leader check to allow tests to find out the leader after an election. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2020-11-22 10:32:34 -04:00

1 2 3 4 5 ...

24450 Commits