scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-04 14:03:06 +00:00

Author	SHA1	Message	Date
Piotr Jastrzebski	cb84ca8abb	Pass sstable_version_types to parse methods Parsing will depend on the sstable version when we have support for both 2_x and 3_x formats. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-24 11:30:26 +02:00
Piotr Jastrzebski	444b468d46	Add test for reading filter Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-24 11:30:26 +02:00
Piotr Jastrzebski	ff06d2153c	Add test for read_summary Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-24 11:30:26 +02:00
Piotr Jastrzebski	10f9b06145	sstables 3.x: Add test for reading TOC Make sure DigestCRC32 is handled correctly. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-24 11:30:26 +02:00
Piotr Jastrzebski	561ca34ec2	sstable: Make component_map version dependent Introduce sstable_version_constants that will be a proxy serving correct constants depending on the format version. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-24 11:30:26 +02:00
Piotr Jastrzebski	7aef74c55f	sstable::component_type: add operator<< Make it possible to print out component_type. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-24 11:30:26 +02:00
Piotr Jastrzebski	d492e92b15	Extract sstable::component_type to separete header It will be used in other places which won't depend on sstable. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-24 11:29:57 +02:00
Piotr Jastrzebski	279b426ee8	Remove unused sstable::get_shared_components Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-24 09:45:55 +02:00
Piotr Jastrzebski	7248752698	sstable_version_types: add mc version This is the latest version of 3.x SSTable format. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-24 09:45:55 +02:00
Raphael S. Carvalho	11940ca39e	sstables: Fix bloom filter size after resharding by properly estimating partition count We were feeding the total estimation partition count of an input shared sstable to the output unshared ones. So sstable writer thinks, from estimation, that each sstable created by resharding will have the same data amount as the shared sstable they are being created from. That's a problem because estimation is feeded to bloom filter creation which directly influences its size. So if we're resharding all sstables that belong to all shards, the disk usage taken by filter components will be multiplied by the number of shards. That becomes more of a problem with #3302. Partition count estimation for a shard S will now be done as follow: // // TE, the total estimated partition count for a shard S, is defined as // TE = Sum(i = 0...N) { Ei / Si }. // // where i is an input sstable that belongs to shard S, // Ei is the estimated partition count for sstable i, // Si is the total number of shards that own sstable i. Fixes #2672. Refs #3302. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20180423151001.9995-1-raphaelsc@scylladb.com>	2018-04-23 18:11:20 +03:00
Avi Kivity	8a8f688dbf	Merge "Materialized views: Fixes to update generation" from Duarte " Fixes to several issues around view update generation, pertaining to timestamp and TTL management. Fixes #3361 Fixes #3360 Fixes #3140 Refs #3362 Tests: unit(release, debug), dtest(materialized_views.py) " Reviewed-by: Nadav Har'El <nyh@scylladb.com> * 'materialized-views/fixes-galore/v2' of http://github.com/duarten/scylla: mutation_partition: Clarify comment about emptiness tests: Add view_complex_test tests/view_schema_test: Complete test db/view: Move cells instead of copying in add_cells_to_view() db/view: Handle unselected base columns and corner cases mutation_partition: Regular base column in view determines row liveness db/view: Don't avoid read-before-write when view PK matches base db/view: Process base updates to column unselected by its views db/view: Consider partition tombstone when generating updates tests/view_schema_test: Remove unneeded test mutation_fragment: Allow querying if row is live view_info: Add view_column() overload view_info: Explicitly initialize base-dependent fields cql3/alter_table_statement: Forbid dropping columns of MV base tables	2018-04-23 16:49:29 +03:00
Nadav Har'El	1ec5688b0b	Materialized Views: fix incorrect limitations on row filtering This patch fixes several cases where it was disallowed to create a materialized view with a filter ("where ..."), for no good reason. After this patch, these cases will be allowed. Fixes #2367. In ordinary SELECT queries, certain types of filtering which is known to be deceptively inefficient is now allowed. For example, trying to query a range of partition keys cannot be done without reading the entire database (because the murmur3 tokenizer randomizes the order of partitions). Restricting two partition key components also cannot be done without reading excessive amount of the entire partition. So Scylla, following Cassandra, chooses to disallow such SELECT queries, and give an error message. However, the same SELECT statements should be allowed when defining a materialized view. In this case, the filter is just used to check an individual row - not to search for one - so there is no performance concern. Unfortunately the existing code did these validations while building the SELECT statement's "restrictions", in code shared by both uses of SELECT (query and MV definition). It was easy to move one of the validations to later code which runs after the restriction has already been built (and knows if it is working for query or MV), but because of the way the "restrictions" objects (translated from Cassandra 2's code) hide what they contain, many of the checks are harder to perform after having built the restrictions object. So instead, we add in strategic places in the restriction-handling code a new "allow_filtering" flag. If restrictions are built with allow_filtering=true, the extra performance-oriented tests on the filtering restrictions is not done. Materialized views sets allow_filtering=true. The allow_filtering flag will also be useful later when we want to support the "ALLOW FILTERING" query option which is currently not supported properly (we have several open issues on that). However note that this patch doesn't complete that support: I left a FIXME in the spot where we set allow_filtering in the Materialized Views case, but in the futre also need to set it if the user specified "ALLOWED FILTERING" in the query. This patch also enables several unit tests written by Duarte which used to fail because of this bug, and now pass. These tests verify that the restrictions are now allowed and filter the view as desired; But I also added test code to verify that the same restrictions are still forbidden, as before, when used in ordinary SELECT queries. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180423124343.17591-1-nyh@scylladb.com>	2018-04-23 14:08:04 +01:00
Avi Kivity	ff055a291a	Merge "Improve "out-of-the-box" build experience on centos" from Botond " Make sure install_dependencies.sh installs all the right dependencies and that the example `configure.py` invokation can just be copy-pasted into the terminal and will "just work". Ref: #3208 " * 'fix_centos_compile/v2' of https://github.com/denesb/scylla: install_dependencies.sh: update centos package list and example configure.py: add --with-ragel option configure.py: add --with-antlr3 configure.py: check compiler version first	2018-04-23 15:49:27 +03:00
Botond Dénes	bfe741c03d	install_dependencies.sh: update centos package list and example Add missing packages to `yum install` list: * scylla-boost163-static * scylla-python34-pyparsing20 Update the configure.py example so that it just works: * Change g++ to 7.3 * Add --with-antlr3 pointing to antlr3 installed from scylla 3rdparty	2018-04-23 15:46:43 +03:00
Botond Dénes	1efcf215b6	configure.py: add --with-ragel option To allow the user to select the exact ragel executable they whish to use.	2018-04-23 15:46:43 +03:00
Botond Dénes	784be9cc43	configure.py: add --with-antlr3 To allow the user to select the exact antlr3 executable they whish to use.	2018-04-23 15:46:43 +03:00
Botond Dénes	ea8d8f9fbf	configure.py: check compiler version first Before checking anything else (presence of boost, its version, etc.) check that the compiler is present and can compile and link a simple c++ program. Before if the compiler was not set up correctly configure.py would fail at one of the other try_compile checks, whichever came first (usually the one checking for boost). This lead the user into chasing some false-positive error when in fact the compiler wasn't working.	2018-04-23 15:46:43 +03:00
Takuya ASADA	7b92c3fd3f	dist: Drop AmbientCapabilities from scylla-server.service for Debian 8 Debian 8 causes "Invalid argument" when we used AmbientCapabilities on systemd unit file, so drop the line when we build .deb package for Debian 8. For other distributions, keep using the feature. Fixes #3344 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20180423102041.2138-1-syuu@scylladb.com>	2018-04-23 13:27:14 +03:00
Avi Kivity	269207fdf6	Merge "Introducing INSERT JSON and fromJson to CQL3" from Piotr " This series complements JSON support with INSERT JSON and fromJson cql function. INSERT JSON implementation tries hard to interfere as little as possible with regular INSERT path. So, after being parsed, insertJsonStatement exists as a separate statement and is handled in a special way. Overridden add_update_for_key extracts values from JSON map and applies them to columns. Converting from insert_json_statement to insert_statement uses auxiliary from_json_object methods to convert JSON-encoded types to bytes. Then, terms are matched to appropriate column names and cells are updated. fromJson CQL function uses the same from_json_object helper methods, but applies them to single arguments, not whole rows. Existing json handling functions from json.hh and libjsoncpp were used where possible. Things implemented: * expanding CQL grammar to accept INSERT JSON * converting JSON representation of cql values to cql terms * serving 'INSERT INTO xxx JSON yyy' clause * tests for INSERT JSON and fromJson() " * 'json_ops_2' of https://github.com/psarna/scylla: tests: add cql unit tests for INSERT JSON cql3: add fromJson() function cql3: add INSERT JSON parsing to CQL grammar cql3: add support for INSERT JSON clause cql3: decouple execute from term binding in setters cql3: change operation::make_* functions to static cql3: add from_json_object function to types cql3: Make literals::NULL_VALUE public	2018-04-23 13:19:54 +03:00
Piotr Sarna	97e89f2efb	tests: add cql unit tests for INSERT JSON This commit adds tests for INSERT JSON clause, which is expected to accept JSON strings and insert appropriate values to columns defined there. The tests also cover fromJson function calls and inserting prepared batch statements with INSERT JSON inside. References #2058	2018-04-23 12:00:57 +02:00
Piotr Sarna	cd76a01747	cql3: add fromJson() function This function extends JSON support with fromJson() function, which can be used in UPDATE clause to transform JSON value into a value with proper CQL type. fromJson() accepts strings and may return any type, so its instances, like toJson(), are generated during calls. This commit also extends functions::get() with additional 'receiver' parameter. This parameter is used to extract receiver type information neeeded to generate proper fromJson instance. Receiver is known only during insert/update, so functions::get() also accepts a nullptr if receiver is not known (e.g. during selection). References #2058	2018-04-23 12:00:57 +02:00
Piotr Sarna	9dd34bf34d	cql3: add INSERT JSON parsing to CQL grammar This commit makes it possible to parse INSERT JSON statement in CQL grammar, so it's available via cqlsh. References #2058	2018-04-23 12:00:57 +02:00
Piotr Sarna	cdcbf654a8	cql3: add support for INSERT JSON clause This commit adds the implementation of INSERT JSON clause which accepts JSON object as parameter and inserts appropriate values into appropriate columns, as defined in given JSON. Example: INSERT INTO testme JSON '{ "id" : 77, "name" : "Jones", "ranking" : 8.5 }' References #2058	2018-04-23 12:00:57 +02:00
Piotr Sarna	bfe3c20035	cql3: decouple execute from term binding in setters This commit makes it possible to pass values to setters, instead of having to pass cql3::term instances. Thanks to that previously prepared terminals can be directly used in a setter execution. References #2058	2018-04-23 12:00:56 +02:00
Piotr Sarna	2b729a10bc	cql3: change operation::make_* functions to static This commit makes operation::make* functions static, because they don't access any instance-specific data anyway. It is later needed to decouple setter execution from binding a cql3::term.	2018-04-23 12:00:56 +02:00
Piotr Sarna	1d40d2186e	cql3: add from_json_object function to types This commit adds a 'from_json_object' method which will be used for converting JSON representation of a value to raw bytes representing the same value. This functionality will be needed by 'INSERT JSON' clause implementation, which can turn these raw bytes into cql3::term. References #2058	2018-04-23 12:00:56 +02:00
Piotr Sarna	e3dfa2193b	cql3: Make literals::NULL_VALUE public This commit makes NULL_VALUE public for future use in JSON translation. References #2058	2018-04-23 12:00:56 +02:00
Botond Dénes	c34b69f4b2	Add PULL_REQUEST_TEMPLATE.md Hopefully it will guide people wanting to contribute to the mailing list. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <73c5d9c9884d8595b466412486494d6aa45d1d55.1524476490.git.bdenes@scylladb.com>	2018-04-23 10:45:25 +01:00
Avi Kivity	1a6b891ce2	Update scylla-ami submodule * dist/ami/files/scylla-ami 9b4be70...02b1853 (1): > scylla_install_ami: remove the host id file after scylla_setup	2018-04-23 12:43:56 +03:00
Avi Kivity	b7b3d2bfec	tests: continuous_data_consumer_test: increase coverage Cover also values in the ranges 0 to 1 and 2^63 to 2^64 - 1. Message-Id: <20180422150938.29143-2-avi@scylladb.com>	2018-04-23 11:39:06 +03:00
Avi Kivity	732177d2b0	tests: continuous_data_consumer_test: reduce runtime continuous_data_consumer_test takes an unreasonable amount of time to run, especially in debug mode. Reduce the run time by reducing the number of loops. Message-Id: <20180422150938.29143-1-avi@scylladb.com>	2018-04-23 11:39:06 +03:00
Duarte Nunes	c8baba4e3a	mutation_partition: Clarify comment about emptiness empty() doesn't distinguish between live and dead data, so clarify that in its comment. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:03 +01:00
Duarte Nunes	cc6c96bc92	tests: Add view_complex_test This patch introduces view_complex_test and adds more test coverage for materialized views. A new file was introduced to avoid making view_schema_test slower. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:03 +01:00
Duarte Nunes	7ba1291731	tests/view_schema_test: Complete test Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:03 +01:00
Duarte Nunes	844e0b41d1	db/view: Move cells instead of copying in add_cells_to_view() Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:03 +01:00
Duarte Nunes	4b4d1dbd1f	db/view: Handle unselected base columns and corner cases When a view's PK only contains the columns that form the base's PK, then the liveness of a particular view row is determined not only by the base row's marker, but also by the selected and, more importantly, unselected columns. This patch ensures that unselected columns are considered as much as possible, even though some limitations will still exist. In particular, we need to represent multiple timestamps (from all the unselected columns), but have only mechanisms to record a single timestamp. We also have some issues when dealing with selected column, and the way we currently delete them. Consider the following: create table cf (p int, c int, a int, b int, primary key (p, c)) create materialized view vcf as select a, b from cf where p is not null and c is not null primary key (p, c) 1) update cf using timestamp 10 set a = 1 where p = 1 and c = 1 2) delete a from cf using timestamp 11 where p = 1 and c = 1 3) update cf using timestamp 1 set a = 2 where p = 1 and c = 1 After 1), the MV should include a row with row marker @ ts10, p = 1, c = 1, a = 1. After 2), this row should be removed. At 3), we should add a row with row marker @ ts1, p = 1, c = 1, a = 1, with a lower timestamp. This means that the delete should not insert a row tombstone with timestamp @ 11, as we do now but it should just delete the view's row marker (which exists) with ts1. Refs #3362 Fixes #3140 Fixes #3361 Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:02 +01:00
Duarte Nunes	67dac67c46	mutation_partition: Regular base column in view determines row liveness When views contain a primary key column that is not part of the base table primary key, that column determines whether the row is live or not. We need to ensure that when that cell is dead, and thus the derived row marker, either by normal deletion of by TTL, so is the rest of the row. This patch introduces the idea of shawdowing row marker. We map the status of the regular base column in the view's PK to the view row's marker. If this marker is dead, so is that cell in the base table, and so should the view row become. To enforce that, a view row's dead marker shadows the whole row if that view includes a base regular column in its PK. Fixes #3360 Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:02 +01:00
Duarte Nunes	4dfce4d369	db/view: Don't avoid read-before-write when view PK matches base When a view's PK only contains the columns that form the base's PK, then the liveness of a particular view row is determined not only by the base row's marker, but also by the selected and, more importantly, unselected columns. When calculating the view's row marker we need to access those unselected columns, so we can't avoid the read-before-write as we were doing. Refs #3362 Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:02 +01:00
Duarte Nunes	bd3cedd240	db/view: Process base updates to column unselected by its views When a view's PK only contains the columns that form the base's PK, then the liveness of a particular view row is determined not only by the base row's marker, but also by the selected and, more importantly, unselected columns. So, process base updates to columns unselected by any of its views. Refs #3362 Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:02 +01:00
Duarte Nunes	ac9b93eb89	db/view: Consider partition tombstone when generating updates Not adding the partition tombstone to the current list of tombstones may cause updates to be incorrectly generated. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:02 +01:00
Duarte Nunes	e6467f46b7	tests/view_schema_test: Remove unneeded test Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:02 +01:00
Duarte Nunes	b0cb5480d5	mutation_fragment: Allow querying if row is live For clustering_row and static_row, allow querying whether they are live or not. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:02 +01:00
Duarte Nunes	164f043768	view_info: Add view_column() overload For when we already have the base's column_definition. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:02 +01:00
Duarte Nunes	31370fd7b1	view_info: Explicitly initialize base-dependent fields Instead of lazily-initializing the regular base column in the view's PK field, explicitly initialize it. This will be used by future patches that don't have access to the schema when wanting to obtain that column. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:02 +01:00
Duarte Nunes	b77b71436d	cql3/alter_table_statement: Forbid dropping columns of MV base tables When a view's PK only contains the columns that form the base's PK, then the liveness of a particular view row is determined not only by the base row's marker, but also by the selected and, more importantly, unselected columns. The fact that unselected columns can keep a view row alive also requires that users cannot drop columns of base tables with materialized views, which this patch implements. Refs #3362 Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:02 +01:00
Avi Kivity	28be4ff5da	Revert "Merge "Implement loading sstables in 3.x format" from Piotr" This reverts commit `513479f624`, reversing changes made to `01c36556bf`. It breaks booting. Fixes #3376.	2018-04-23 06:47:00 +03:00
Avi Kivity	513479f624	Merge "Implement loading sstables in 3.x format" from Piotr " Pass sstable version to parse, write and describe_type methods to make it possible to handle different versions. For now serialization header from 3.x format is ignored. Tests: units (release) " * 'haaawk/sstables3/loading_v3' of ssh://github.com/scylladb/seastar-dev: Add test for loading the whole sstable Add test for loading statistics Add support for 3_x stats metadata Pass sstable version to describe_type Pass sstable version to write methods metadata_type: add Serialization type Pass sstable_version_types to parse methods Add test for reading filter Add test for read_summary sstables 3.x: Add test for reading TOC sstable: Make component_map version dependent sstable::component_type: add operator<< Extract sstable::component_type to separete header Remove unused sstable::get_shared_components sstable_version_types: add mc version	2018-04-22 16:18:39 +03:00
Piotr Jastrzebski	0288121c0a	Add test for loading the whole sstable Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-22 15:07:03 +02:00
Piotr Jastrzebski	fbe9ee72d6	Add test for loading statistics Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-22 15:07:03 +02:00
Piotr Jastrzebski	b683870644	Add support for 3_x stats metadata Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-22 15:06:51 +02:00

1 2 3 4 5 ...

15121 Commits