scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-02 04:56:58 +00:00

Author	SHA1	Message	Date
Kamil Braun	a67e887dea	sstables: fix sstable file I/O CQL tracing when reading multiple files (#5285 ) CQL tracing would only report file I/O involving one sstable, even if multiple sstables were read from during the query. Steps to reproduce: create a table with NullCompactionStrategy insert row, flush memtables insert row, flush memtables restart Scylla tracing on select * from table The trace would only report DMA reads from one of the two sstables. Kudos to @denesb for catching this. Related issue: #4908	2019-11-17 00:38:37 -08:00
Tomasz Grabiec	a384d0af76	Merge "A set of cleanups over main() code" from Pavel E. There are ... signs of massive start/stop code rework in the main() function. While fixing the sub-modules interdependencies during start/stop I've polished these signs too, so here's the simplest ones.	2019-11-15 15:25:18 +01:00
Pavel Emelyanov	1dc490c81c	tracing: Move register_tracing_keyspace_backend forward decl into proper header Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-14 19:59:03 +03:00
Pavel Emelyanov	7e81df71ba	main: Shorten developer_mode() evaluation Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-14 19:59:03 +03:00
Pavel Emelyanov	1bd68d87fc	main: Do not carry pctx all over the code v2: - do not use struct initialization extention Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-14 19:59:03 +03:00
Pavel Emelyanov	655b6d0d1e	main: Hide start_thrift Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-14 19:59:03 +03:00
Pavel Emelyanov	26f2b2ce5e	main,db: Kill some unused .hh includes Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-14 19:59:03 +03:00
Pavel Emelyanov	f5b345604f	main: Factor out get_conf_sub Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-14 19:59:03 +03:00
Pavel Emelyanov	924d52573d	main: Remove unused return_value variable (and capture) Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-14 19:59:03 +03:00
Pavel Emelyanov	2195edb819	gitignore: Add tags file This file is generated by ctags utility for navigation, so it is not to be tracked by git. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20191031221339.19030-1-xemul@scylladb.com>	2019-11-14 16:50:11 +01:00
Gleb Natapov	e0668f806a	lwt: change format of partition key serialization for system.paxos table Serialize provided partition_key in such a way that the serialized value will hash to the same token as the original key. This way when system.paxos table is updated the update is shard local. Message-Id: <20191114135449.GU10922@scylladb.com>	2019-11-14 15:07:16 +01:00
Avi Kivity	19b665ea6b	Merge "Correctly handle null/unset frozen collection/UDT columns in INSERT JSON." from Kamil " When using INSERT JSON with frozen collection/UDT columns, if the columns were left unspecified or set to null, the statement would create an empty non-null value for these columns instead of using null values as it should have. For example: cqlsh:b> create table t (k text primary key, l frozen<list<int>>, m frozen<map<int, int>>, s frozen<set<int>>, u frozen<ut>); cqlsh:b> insert into t JSON '{"k": "insert_json"}'; cqlsh:b> select * from t; k \| l \| m \| s \| u -------------------+------+------+------+------ insert_json \| [] \| {} \| {} \| This PR fixes this. Resolves #5246 and closes #5270. " * 'frozen-json' of https://github.com/kbr-/scylla: tests: add null/unset frozen collection/UDT INSERT JSON test cql3: correctly handle frozen null/unset collection/UDT columns in INSERT JSON cql3: decouple execute from term binding in user_type::setter	2019-11-14 15:29:30 +02:00
Avi Kivity	4544aa0b34	Update seastar submodule * seastar 75e189c6ba...6f0ef32514 (6): > Merge "Add named semaphores" from Piotr > parallel_for_each_state: pass rvalue reference to add_future > future: Pass rvalue to uninitialized_wrapper::uninitialized_set. > dependencies: Add libfmt-dev to debian > log: Fix logger behavior when logging both to stdout and syslog. > README.md: list Scylla among the projects using Seastar	2019-11-14 15:01:18 +02:00
Vladimir Davydov	ab42b72c6d	cql: fix SERIAL consistency check for batch statements If CONSISTENCY is set to SERIAL or LOCAL SERIAL, all write requests must fail according to Cassandra's documentation. However, batched writes bypass this check. Fix this.	2019-11-14 12:15:39 +01:00
Vladimir Davydov	25aeefd6f3	cql: fix CAS consistency level validation This patch resurrects Cassandra's code validating a consistency level for CAS requests. Basically, it makes CAS requests use a special function instead of validate_for_write to make error messages more coherent. Note, we don't need to resurrect requireNetworkTopologyStrategy as EACH_QUORUM should work just fine for both CAS and non-CAS writes. Looks like it is just an artefact of a rebase in the Cassandra repository.	2019-11-14 12:15:39 +01:00
Avi Kivity	cd075e9132	reloc: do not install dependencies when building the relocatable package The dependencies are provided by the frozen toolchain. If a dependency is missing, we must update the toolchain rather than rely on build-time installation, which is not reproducible (as different package versions are available at different times). Luckily "dnf install" does not update an already-installed package. Had that been a case, none of our builds would have been reproducible, since packages would be updated to the latest version as of the build time rather than the version selected by the frozen toolchain. So, to prevent missing packages in the frozen toolchain translating to an unreproducible build, remove the support for installing dependencies from reloc/build_reloc.sh. We still parse the --nodeps option in case some script uses it. Fixes #5222. Tests: reloc/build_reloc.sh.	2019-11-14 09:37:14 +02:00
Gleb Natapov	552c56633e	storage_proxy: do not release mutation if not all replies were received MV backpressure code frees mutation for delayed client replies earlier to save memory. The commit `2d7c026d6e` that introduced the logic claimed to do it only when all replies are received, but this is not the case. Fix the code to free only when all replies are received for real. Fixes #5242 Message-Id: <20191113142117.GA14484@scylladb.com>	2019-11-13 16:23:19 +02:00
Raphael S. Carvalho	3e70523111	distributed_loader: Release disk space of SSTables deleted by resharding Resharding is responsible for the scheduling the deletion of sstables resharded, but it was not refreshing the cache of the shards those sstables belong to, which means cache was incorrectly holding reference to them even after they were deleted. The consequence is sstables deleted by resharding not having their disk space freed until cache is refreshed by a subsequent procedure that triggers it. Fixes #5261. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20191107193550.7860-1-raphaelsc@scylladb.com>	2019-11-13 16:03:27 +02:00
Avi Kivity	6aed3b7471	Merge "cql: trivial cleanup" from Vova * 'cql-trivial-cleanup' of ssh://github.com/scylladb/scylla-dev: cql: rename modification_statement::_sets_a_collection to _selects_a_collection cql: rename _column_conditions to _regular_conditions cql: remove unnecessary optional around prefetch_data	2019-11-13 15:12:10 +02:00
Avi Kivity	1cb9f9bdfe	Merge "Use a fixed-size bitset for column set" from Kostja " Use a fixed-size, rather than a dynamically growing bitset for column mask. This avoids unnecessary memory reallocation in the most common case. " * 'column_set' of ssh://github.com/scylladb/scylla-dev: schema: pre-allocate the bitset of column_set schema: introduce schema::all_columns_count() schema: rename column_mask to column_set	2019-11-13 15:08:13 +02:00
Tomasz Grabiec	f68e17eb52	Merge "Partition/row hit/miss counters for memtable write operations" from Piotr D. Adds per-table metrics for counting partition and row reuse in memtables. New metrics are as follows: - memtable_partition_writes - number of write operations performed on partitions in memtables, - memtable_partition_hits - number of write operations performed on partitions that previously existed in a memtable, - memtable_row_writes - number of row write operations performed in memtables, - memtable_row_hits - number of row write operations that ovewrote rows previously present in a memtable. Tests: unit(release)	2019-11-13 13:11:51 +01:00
Juliusz Stasiewicz	8318a6720a	cql3: error msg w/ arg counts for prepared stmts with wrong arg cnt Fixes #3748. Very small change: added argument count (expectation vs. reality) to error msg within `invalid_request_exception'.	2019-11-13 13:43:37 +02:00
Nadav Har'El	ccb9038c69	alternator: Implement Expected operators LT and GT Merged patch series from Dejan Mircevski. Implements the "LT" and "GT" operators of the Expected update option (i.e., conditional updates), and enables the pre-existing tests for them.	2019-11-13 12:07:44 +02:00
Konstantin Osipov	6159c012db	schema: pre-allocate the bitset of column_set The number of columns is usually small, and avoiding a resize speeds up bit manipulation functions.	2019-11-13 11:41:51 +03:00
Konstantin Osipov	e95d675567	schema: introduce schema::all_columns_count() schema::all_columns_count() will be used to reserve memory of the column_set bitmask.	2019-11-13 11:41:42 +03:00
Konstantin Osipov	191acec7ab	schema: rename column_mask to column_set Since it contains a precise set of columns, it's more accurate to call it a set, not a mask. Besides, the name column_mask is already used for column options on storage level.	2019-11-13 11:41:30 +03:00
Kamil Braun	d6446e352e	tests: add null/unset frozen collection/UDT INSERT JSON test When using INSERT JSON with null/unspecified frozen collection/UDT columns, the columns should be set to null. See #5270.	2019-11-12 18:24:47 +01:00
Vladimir Davydov	8110178e5d	cql: rename modification_statement::_sets_a_collection to _selects_a_collection This is merely to avoid confusion: we use _sets prefix to indicate that there are operations over static/regular columns (_sets_static_columns, _sets_regular_columns), but _sets_a_collection is set for both operations and conditions. So let's rename it to _selects_a_collection and add some comments.	2019-11-12 20:15:42 +03:00
Vladimir Davydov	a19192950e	cql: rename _column_conditions to _regular_conditions It's weird that modification_statement has _static_conditions for conditions on static columns and _column_conditions for conditions on regular columns, as if conditions on static columns are not column conditions. Let's rename _column_conditions to _regular_conditions to avoid confusion.	2019-11-12 20:15:35 +03:00
Konstantin Osipov	0ad0369684	cql: remove unnecessary optional around prefetch_data	2019-11-12 20:15:24 +03:00
Kamil Braun	6c04c5bed5	cql3: correctly handle frozen null/unset collection/UDT columns in INSERT JSON Before this commit, an empty non-null value was created for frozen collection/UDT columns when an INSERT JSON statement was executed with the value left unspecified or set to null. This was incompatible with Cassandra which inserted a null (dead cell). Fixes #5270.	2019-11-12 18:05:01 +01:00
Kamil Braun	0ad7d71f31	cql3: decouple execute from term binding in user_type::setter This commit makes it possible to pass a bound value terminal directly to the setter. Continuation of commit `bfe3c20035`.	2019-11-12 18:02:21 +01:00
Takuya ASADA	614ec6fc35	install.sh: drop --pkg option, use .install file on .deb package --pkg option on install.sh is introduced for .deb packaging since it requires different install directory for each subpackage. But we actually able to use "debian/tmp" for shared install directory, then we can specify file owner of the package using .install files. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20191030203142.31743-1-syuu@scylladb.com>	2019-11-12 16:50:37 +02:00
Piotr Dulikowski	59fbbb993f	memtables: add partition/row hit/miss counters Adds per-table metrics for counting partition and row reuse in memtables. New metrics are as follows: - memtable_partition_writes - number of write operations performed on partitions in memtables, - memtable_partition_hits - number of write operations performed on partitions that previously existed in a memtable, - memtable_row_writes - number of row write operations performed in memtables, - memtable_row_hits - number of row write operations that ovewrote rows previously present in a memtable. Tests: unit(release)	2019-11-12 13:35:41 +01:00
Piotr Dulikowski	48f7b2e4fb	table: move out table::stats to table_stats This change was done in order to be able to forward-declare the table::stats structure.	2019-11-12 13:35:41 +01:00
Avi Kivity	cf7291462d	Merge "cql3/functions: add missing min/max/count functions for ascii type" from Piotr " Adds missing overloads of functions count, min, max for type ascii. Now they work: cqlsh> CREATE KEYSPACE ks WITH replication = {'class': 'SimpleStrategy', 'replication_factor': 1}; cqlsh> USE ks; cqlsh:ks> CREATE TABLE test_ascii (id int PRIMARY KEY, value ascii); cqlsh:ks> INSERT INTO test_ascii (id, value) VALUES (0, 'abcd'); cqlsh:ks> INSERT INTO test_ascii (id, value) VALUES (1, 'efgh'); cqlsh:ks> INSERT INTO test_ascii (id, value) VALUES (2, 'ijkl'); cqlsh:ks> SELECT * FROM test_ascii; id \| value ----+------- 1 \| efgh 0 \| abcd 2 \| ijkl (3 rows) cqlsh:ks> SELECT count(value) FROM test_ascii; system.count(value) --------------------- 3 (1 rows) cqlsh:ks> SELECT min(value) FROM test_ascii; system.min(value) ------------------- abcd (1 rows) cqlsh:ks> SELECT max(value) FROM test_ascii; system.max(value) ------------------- ijkl (1 rows) Tests: unit(release) cql_group_functions_tests.py (with added check for ascii type) Fixes #5147. " * '5147-fix-min-max-count-for-ascii' of https://github.com/piodul/scylla: tests/cql_query_test: add aggregate functions test cql3/functions: add missing min/max/count for ascii	2019-11-12 14:15:14 +02:00
Piotr Dulikowski	41cb16a526	tests/cql_query_test: add aggregate functions test Adds a test for min, max and avg functions for those primitive types for which those functions are working at the moment.	2019-11-12 13:01:34 +01:00
Piotr Dulikowski	6d78d7cc69	cql3/functions: add missing min/max/count for ascii Adds missing overloads of functions `count`, `min`, `max` for type `ascii`. Now they work: cqlsh> CREATE KEYSPACE ks WITH replication = {'class': 'SimpleStrategy', 'replication_factor': 1}; cqlsh> USE ks; cqlsh:ks> CREATE TABLE test_ascii (id int PRIMARY KEY, value ascii); cqlsh:ks> INSERT INTO test_ascii (id, value) VALUES (0, 'abcd'); cqlsh:ks> INSERT INTO test_ascii (id, value) VALUES (1, 'efgh'); cqlsh:ks> INSERT INTO test_ascii (id, value) VALUES (2, 'ijkl'); cqlsh:ks> SELECT * FROM test_ascii; id \| value ----+------- 1 \| efgh 0 \| abcd 2 \| ijkl (3 rows) cqlsh:ks> SELECT count(value) FROM test_ascii; system.count(value) --------------------- 3 (1 rows) cqlsh:ks> SELECT min(value) FROM test_ascii; system.min(value) ------------------- abcd (1 rows) cqlsh:ks> SELECT max(value) FROM test_ascii; system.max(value) ------------------- ijkl (1 rows) Tests: - unit(release) - cql_group_functions_tests.py (with added check for `ascii` type) Fixes #5147.	2019-11-12 13:01:34 +01:00
Yaron Kaikov	4a9b2a8d96	dist/docker: Add SCYLLA_REPO_URL argument to Dockerfile (#5264 ) This change adds a SCYLLA_REPO_URL argument to Dockerfile, which defines the RPM repository used to install Scylla from. When building a new Docker image, users can specify the argument by passing the --build-arg SCYLLA_REPO_URL=<url> option to the docker build command. If the argument is not specified, the same RPM repository is used as before, retaining the old default behavior. We intend to use this in release engineering infrastructure to specify RPM repositories for nightly builds of release branches (for example, 3.1.x), which are currently only using the stable RPMs.	2019-11-07 09:21:05 +02:00
Pavel Emelyanov	486e3f94d0	deps: Add libunistring-dev to debian With this, previous patch to seastar and (suddenly) xenial repo for scylla-libthrift010-dev scylla-antlr35-c++-dev the build on debian buster finally passes. Signed-off-by: Pavel Emelyanov <xemul@scyladb.com> Message-Id: <CAHTybb-QFyJ7YQW0b6pjhY_xUr-_b1w_O3K1=1FOwrNM55BkLQ@mail.gmail.com>	2019-11-01 09:03:39 +02:00
Dejan Mircevski	859883b31d	alternator: Implement GT operator in Expected Add cmp_gt and use it in check_compare() to handle the GT case. Also reactivate GT tests. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-10-31 17:18:22 -04:00
Dejan Mircevski	0f7d837757	alternator: Factor out check_compare() Code for check_LT(), check_GT(), etc. will be nearly identical, so factor it out into a single function that takes a comparator object. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-10-31 17:01:29 -04:00
Dejan Mircevski	a47b768959	alternator: Implement LT operator in Expected Add check_LT() function and reactivate LT tests. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-10-31 16:07:29 -04:00
Dejan Mircevski	ceae3c182f	alternator: Overload base64_decode on rjson::value In `1ca9dc5d47`, it was established that the correct way to base64-decode a JSON value is via string_view, rather than directly from GetString(). This patch adds a base64_decode(rjson::value) overload, which automatically uses the correct procedure. It saves typing, ensures correctness (fixing one incorrect call found), and will come in handy for future EXPECTED comparisons. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-10-31 15:56:03 -04:00
Dejan Mircevski	9955f0342f	alternator: Make unwrap_number() visible unwrap_number() is now a public function in serialization.hh instead of a static function visible only in executor.cc. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-10-31 10:46:30 -04:00
Nadav Har'El	3f859adebd	Merge: Fix filtering static columns on empty partitions Merged patch series from Piotr Sarna: An otherwise empty partition can still have a valid static column. Filtering didn't take that fact into account and only filtered full-fledged rows, which may result in non-matching rows being returned to the client. Fixes #5248	2019-10-31 10:50:21 +02:00
Pavel Emelyanov	5fe4757725	docs: The scylla's dpdk config is boolean Docs say one can say --disable-dpdk , while it's not so. It's the seastar's configure.py that has tristate -dpdk option, the scylla's one can only be enabled. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <CAHTybb-rxP8DbH-wW4Zf-w89iuCirt6T6-PjZAUfVFj7C5yb=A@mail.gmail.com>	2019-10-31 10:12:17 +02:00
Vladimir Davydov	9ea8114f8c	cql: fix CAS metric label "type" label is already in use for the counter type ("derive", "gauge", etc). Using the same label for "cas" / "non-cas" overwrites it. Let's instead call the new label "conditional" and use "yes" / "no" for its value, as suggested by Kostja. Message-Id: <3082b16e4d6797f064d58da95fb4e50b59ab795c.1572451480.git.vdavydov@scylladb.com>	2019-10-30 17:14:17 +01:00
Avi Kivity	398c482cd0	Merge "combined reader gallop mode" from Piotr " In case when a single reader contributes a stream of fragments and keeps winning over other readers, mutation_reader_merger will enter gallop mode, in which it is assumed that the reader will keep winning over other readers. Currently, a reader needs to contribute 3 fragments to enter that mode. In gallop mode, fragments returned by the galloping reader will be compared with the best fragment from _fragment_heap. If it wins, the fragment is directly returned. Otherwise, gallop mode ends and merging performed as in general case, which involves heap operations. In current implementation, when the end of partition is encountered while in gallop mode, the gallop mode is ended unconditionally. A microbenchmark was added in order to test performance of the galloping reader optimization. A combining reader that merges results from four other readers is created. Each sub-reader provides a range of 32 clustering rows that is disjoint from others. All sub-readers return rows from the same partition. An improvement can be observed after introducing the galloping reader optimization. As for other benchmarks from the "combined" group, results are pretty close to the old ones. The only one that seems to have suffered slightly is combined.many_overlapping. Median times from a single run of perf_mutation_readers.combined: (1s run duration, 5 runs per benchmark, release mode) test name before after improvement one_row 49.070ns 48.287ns 1.60% single_active 61.574us 61.235us 0.55% many_overlapping 488.193us 514.977us -5.49% disjoint_interleaved 57.462us 57.111us 0.61% disjoint_ranges 56.545us 56.006us 0.95% overlapping_partitions_disjoint_rows 127.039us 80.849us 36.36% Same results, normalized per mutation fragment: test name before after improvement one_row 16.36ns 16.10ns 1.60% single_active 109.46ns 108.86ns 0.55% many_overlapping 216.97ns 228.88ns -5.49% disjoint_interleaved 102.15ns 101.53ns 0.61% disjoint_ranges 100.52ns 99.57ns 0.95% overlapping_partitions_disjoint_rows 246.38ns 156.80ns 36.36% Tested on AMD Ryzen Threadripper 2950X @ 3.5GHz. Tests: unit(release) Fixes #3593. " * '3593-combined_reader-gallop-mode' of https://github.com/piodul/scylla: mutation_reader: gallop mode microbenchmark mutation_reader: combined reader gallop tests mutation_reader: gallop mode for combined reader mutation_reader: refactor prepare_next	2019-10-30 17:34:47 +02:00
Piotr Sarna	dd00470a44	tests: add a test case for filtering on static columns The test case covers filtering with an empty partition. Refs #5248	2019-10-30 15:34:10 +01:00

1 2 3 4 5 ...

20106 Commits