scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-31 03:56:42 +00:00

Author	SHA1	Message	Date
Avi Kivity	7161244130	Merge seastar upstream * seastar 70aecca...ac02df7 (5): > Merge "Prefix preprocessor definitions" from Jesse > cmake: Do not enable warnings transitively > posix: prevent unused variable warning > build: Adjust DPDK options to fix compilation > io_scheduler: adjust property names DEBUG, DEFAULT_ALLOCATOR, and HAVE_LZ4_COMPRESS_DEFAULT macro references prefixed with SEASTAR_. Some may need to become Scylla macros.	2018-04-29 11:03:21 +03:00
Vladimir Krivopalov	b3572acd6e	A few improvements to encoding_stats structure. - Use the same default epoch as Origin - Use default value for the encoding_stats parameter in sstable::write_components() Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com> Message-Id: <846c6d2cbb97d2dd25968cb00b8557c86ff5e35c.1524854727.git.vladimir@scylladb.com>	2018-04-27 22:03:38 +03:00
Tomasz Grabiec	b1465291cf	db: schema_tables: Treat drop of scylla_tables.version as an alter After upgrade from 1.7 to 2.0, nodes will record a per-table schema version which matches that on 1.7 to support the rolling upgrade. Any later schema change (after the upgrade is done) will drop this record from affected tables so that the per-table schema version is recalculated. If nodes perform a schema pull (they detect schema mismatch), then the merge will affect all tables and will wipe the per-table schema version record from all tables, even if their schema did not change. If then only some nodes get restarted, the restarted nodes will load tables with the new (recalculated) per-table schema version, while not restarted nodes will still use the 1.7 per-table schema version. Until all nodes are restarted, writes or reads between nodes from different groups will involve a needless exchange of schema definition. This will manifest in logs with repeated messages indicating schema merge with no effect, triggered by writes: database - Schema version changed to 85ab46cd-771d-36c9-bc37-db6d61bfa31f database - Schema version changed to 85ab46cd-771d-36c9-bc37-db6d61bfa31f database - Schema version changed to 85ab46cd-771d-36c9-bc37-db6d61bfa31f The sync will be performed if the receiving shard forgets the foreign version, which happens if it doesn't process any request referencing it for more than 1 second. This may impact latency of writes and reads. The fix is to treat schema changes which drop the 1.7 per-table schema version marker as an alter, which will switch in-memory data structures to use the new per-table schema version immediately, without the need for a restart. Fixes #3394 Tests: - dtest: schema_test.py, schema_management_test.py - reproduced and validated the fix with run_upgrade_tests.sh from git@github.com:tgrabiec/scylla-dtest.git - unit (release) Message-Id: <1524764211-12868-1-git-send-email-tgrabiec@scylladb.com>	2018-04-27 17:12:33 +03:00
Vladimir Krivopalov	77fdfa3e7a	Add tests for writing data and index files in SSTables 3.0 ('mc') format. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-04-26 14:34:20 -07:00
Avi Kivity	5119c1e9c1	Merge "Implement reading simple table from sstable 3.x" from Piotr " This patchset prepares everything for support of both 2.x and 3.x formats and implements reading from sstable 3.x very simple table with just partition keys. Tests: units (release) " * 'haaawk/sstables3/read_only_partitions_v4' of ssh://github.com/scylladb/seastar-dev: (22 commits) Test for reading sstable in MC format with no columns Use new mp_row_consumer_m and data_consume_rows_context_m Introduce mp_row_consumer_m Rename mp_row_consumer to mp_row_consumer_k_l Introduce consumer_m and data_consume_rows_context_m Use read_short_length_bytes in RANGE_TOMBSTONE Use read_short_length_bytes in ATOM_START Use read_short_length_bytes in ROW_START Add continuous_data_consumer::read_short_length_bytes Reduce duplication with continuous_data_consumer::read_partial_int Add test for a simple table with just partition key Add test for reading index Extract mp_row_consumer to separate header Make sstable_mutation_reader independent from mp_row_consumer Make sstable_mutation_reader a template Make data_consume_context a template Move data_consume_rows_context from row.cc to row.hh Decouple sstable.hh and row.hh Reduce visibility of sstable::data_consume_* Move data_consume_context to separate header ...	2018-04-26 14:35:42 +03:00
Piotr Jastrzebski	5c223c13d6	Test for reading sstable in MC format with no columns Just a simple table with only partition key. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-26 12:49:38 +02:00
Piotr Jastrzebski	9a3f93a42b	Add test for a simple table with just partition key Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-26 12:49:37 +02:00
Piotr Jastrzebski	c6d4f49abb	Add test for reading index Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-26 12:49:37 +02:00
Piotr Jastrzebski	9fad5831df	Make data_consume_context a template Parametrize it with the type of data consume rows context. There will be different implementations used for different sstable file formats. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-26 12:49:37 +02:00
Piotr Jastrzebski	bcf5717753	Reduce visibility of sstable::data_consume_* They are used just in partition.cc, row.cc and sstables_test.cc so it is usefull to cut their scope by moving them to data_consume_context.hh. This will make it much easier to turn data_consume_context into a template. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-26 12:49:37 +02:00
Piotr Jastrzebski	578aa6826f	Move data_consume_context to separate header It's used only in row.cc, partition.cc and sstables_test.cc so it's better to reduce the dependency just to those files. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-26 12:49:37 +02:00
Vladimir Krivopalov	948c4d79d3	Collect encoding statistics for memtable updates. We keep track of all updates and store the minimal values of timestamps, TTLs and local deletion times across all the inserted data. These values are written as a part of serialization_header for Statistics.db and used for delta-encoding values when writing Data.db file in SSTables 3.0 (mc) format. For #1969. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-04-25 15:39:14 -07:00
Calle Wilund	b1edf75c8b	types: Make seastar::inet_address the "native" type for CQL inet. Fixes #3187 Requires seastar "inet_address: Add constructor and conversion function from/to IPv4" Implements support IPv6 for CQL inet data. The actual data stored will now vary between 4 and 16 bytes. gms::inet_address has been augumented to interop with seastar::inet_address, though of course actually trying to use an Ipv6 address there or in any of its tables with throw badly. Tests assuming ipv4 changed. Storing a ipv4_address should be transparent, as it now "widens". However, since all ipv4 is inet_address, but not vice versa, there is no implicit overloading on the read paths. I.e. tests and system_keyspace (where we read ip addresses from tables explicitly) are modified to use the proper type. Message-Id: <20180424161817.26316-1-calle@scylladb.com>	2018-04-24 23:12:07 +01:00
Duarte Nunes	f5eeafe1bf	tests/secondary_index_test: Add test for dropping index-backing MV Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20180424140745.7144-2-duarte@scylladb.com>	2018-04-24 17:02:59 +01:00
Nadav Har'El	d674b6f672	secondary index: fix bug in indexing case-sensitive column names CQL normally folds identifiers such as column names to lowercase. However, if the column name is quoted, case-sensitive column names and other strange characters can be used. We had a bug where such columns could be indexed, but then, when trying to use the index in a SELECT statement, it was not found. The existing code remembered the index's column after converting it to CQL format (adding quotes). But such conversion was unnecessary, and wrong, because the rest of the code works with bare strings and does not involve actual CQL statements. So the fix avoids this mistaken conversion. This patch also includes a test to reproduce this problem. Fixes #3154. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180424154920.15924-1-nyh@scylladb.com>	2018-04-24 16:57:17 +01:00
Piotr Sarna	d323b5cddc	tests: add missing case-sensitive JSON tests This commit complements cql_query_test with case-sensitivity cases for both SELECT JSON and INSERT JSON statements. Message-Id: <20bc7df2ec644618727183e09f2352ca5546a9b9.1524576066.git.sarna@scylladb.com>	2018-04-24 16:30:56 +03:00
Avi Kivity	13ea1a89b5	Merge "Implement loading sstables in 3.x format" from Piotr " Pass sstable version to parse, write and describe_type methods to make it possible to handle different versions. For now serialization header from 3.x format is ignored. Tests: units (release) " * 'haaawk/sstables3/loading_v4' of ssh://github.com/scylladb/seastar-dev: Add test for loading the whole sstable Add test for loading statistics Add support for 3_x stats metadata Pass sstable version to describe_type Pass sstable version to write methods metadata_type: add Serialization type Pass sstable_version_types to parse methods Add test for reading filter Add test for read_summary sstables 3.x: Add test for reading TOC sstable: Make component_map version dependent sstable::component_type: add operator<< Extract sstable::component_type to separete header Remove unused sstable::get_shared_components sstable_version_types: add mc version	2018-04-24 12:49:41 +03:00
Piotr Jastrzebski	6310fc5f1c	Add test for loading the whole sstable Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-24 11:30:26 +02:00
Piotr Jastrzebski	9e78b6d4c6	Add test for loading statistics Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-24 11:30:26 +02:00
Piotr Jastrzebski	444b468d46	Add test for reading filter Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-24 11:30:26 +02:00
Piotr Jastrzebski	ff06d2153c	Add test for read_summary Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-24 11:30:26 +02:00
Piotr Jastrzebski	10f9b06145	sstables 3.x: Add test for reading TOC Make sure DigestCRC32 is handled correctly. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-24 11:30:26 +02:00
Piotr Jastrzebski	d492e92b15	Extract sstable::component_type to separete header It will be used in other places which won't depend on sstable. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-24 11:29:57 +02:00
Nadav Har'El	9605059a2b	secondary index: move tests to separate source file Move the two tests we have for the secondary indexing feature from the huge tests/cql_query_test.cc to a new file, secondary_index_test.cc. Having these tests in a separate file will make it easier and faster to write more tests for this feature, and to run these tests together. This patch doesn't change anything in the tests' code - it's just a code move. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180424084700.28816-1-nyh@scylladb.com>	2018-04-24 11:49:57 +03:00
Vladimir Krivopalov	fc644a8778	Fix Scylla to compile with older versions of JsonCpp (<= 1.7.0). Old versions of JsonCpp declare the following typedefs for internally used aliases: typedef long long int Int64; typedef unsigned long long int UInt64; In newer versions (1.8.x), those are declared as: typedef int64_t Int64; typedef uint64_t UInt64; Those base types are not identical so in cases when a type has constructors overloaded only for specific integral types (such as Json::Value in JsonCpp or data_value in Scylla), an attempt to pack/unpack an integer from/to a JSON object causes ambiguous calls. Fixes #3208 Tests: unit {release}. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com> Message-Id: <e9fff9f41e0f34b15afc90b5439be03e4295623e.1524556258.git.vladimir@scylladb.com>	2018-04-24 10:58:38 +03:00
Raphael S. Carvalho	11940ca39e	sstables: Fix bloom filter size after resharding by properly estimating partition count We were feeding the total estimation partition count of an input shared sstable to the output unshared ones. So sstable writer thinks, from estimation, that each sstable created by resharding will have the same data amount as the shared sstable they are being created from. That's a problem because estimation is feeded to bloom filter creation which directly influences its size. So if we're resharding all sstables that belong to all shards, the disk usage taken by filter components will be multiplied by the number of shards. That becomes more of a problem with #3302. Partition count estimation for a shard S will now be done as follow: // // TE, the total estimated partition count for a shard S, is defined as // TE = Sum(i = 0...N) { Ei / Si }. // // where i is an input sstable that belongs to shard S, // Ei is the estimated partition count for sstable i, // Si is the total number of shards that own sstable i. Fixes #2672. Refs #3302. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20180423151001.9995-1-raphaelsc@scylladb.com>	2018-04-23 18:11:20 +03:00
Avi Kivity	8a8f688dbf	Merge "Materialized views: Fixes to update generation" from Duarte " Fixes to several issues around view update generation, pertaining to timestamp and TTL management. Fixes #3361 Fixes #3360 Fixes #3140 Refs #3362 Tests: unit(release, debug), dtest(materialized_views.py) " Reviewed-by: Nadav Har'El <nyh@scylladb.com> * 'materialized-views/fixes-galore/v2' of http://github.com/duarten/scylla: mutation_partition: Clarify comment about emptiness tests: Add view_complex_test tests/view_schema_test: Complete test db/view: Move cells instead of copying in add_cells_to_view() db/view: Handle unselected base columns and corner cases mutation_partition: Regular base column in view determines row liveness db/view: Don't avoid read-before-write when view PK matches base db/view: Process base updates to column unselected by its views db/view: Consider partition tombstone when generating updates tests/view_schema_test: Remove unneeded test mutation_fragment: Allow querying if row is live view_info: Add view_column() overload view_info: Explicitly initialize base-dependent fields cql3/alter_table_statement: Forbid dropping columns of MV base tables	2018-04-23 16:49:29 +03:00
Nadav Har'El	1ec5688b0b	Materialized Views: fix incorrect limitations on row filtering This patch fixes several cases where it was disallowed to create a materialized view with a filter ("where ..."), for no good reason. After this patch, these cases will be allowed. Fixes #2367. In ordinary SELECT queries, certain types of filtering which is known to be deceptively inefficient is now allowed. For example, trying to query a range of partition keys cannot be done without reading the entire database (because the murmur3 tokenizer randomizes the order of partitions). Restricting two partition key components also cannot be done without reading excessive amount of the entire partition. So Scylla, following Cassandra, chooses to disallow such SELECT queries, and give an error message. However, the same SELECT statements should be allowed when defining a materialized view. In this case, the filter is just used to check an individual row - not to search for one - so there is no performance concern. Unfortunately the existing code did these validations while building the SELECT statement's "restrictions", in code shared by both uses of SELECT (query and MV definition). It was easy to move one of the validations to later code which runs after the restriction has already been built (and knows if it is working for query or MV), but because of the way the "restrictions" objects (translated from Cassandra 2's code) hide what they contain, many of the checks are harder to perform after having built the restrictions object. So instead, we add in strategic places in the restriction-handling code a new "allow_filtering" flag. If restrictions are built with allow_filtering=true, the extra performance-oriented tests on the filtering restrictions is not done. Materialized views sets allow_filtering=true. The allow_filtering flag will also be useful later when we want to support the "ALLOW FILTERING" query option which is currently not supported properly (we have several open issues on that). However note that this patch doesn't complete that support: I left a FIXME in the spot where we set allow_filtering in the Materialized Views case, but in the futre also need to set it if the user specified "ALLOWED FILTERING" in the query. This patch also enables several unit tests written by Duarte which used to fail because of this bug, and now pass. These tests verify that the restrictions are now allowed and filter the view as desired; But I also added test code to verify that the same restrictions are still forbidden, as before, when used in ordinary SELECT queries. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180423124343.17591-1-nyh@scylladb.com>	2018-04-23 14:08:04 +01:00
Piotr Sarna	97e89f2efb	tests: add cql unit tests for INSERT JSON This commit adds tests for INSERT JSON clause, which is expected to accept JSON strings and insert appropriate values to columns defined there. The tests also cover fromJson function calls and inserting prepared batch statements with INSERT JSON inside. References #2058	2018-04-23 12:00:57 +02:00
Avi Kivity	b7b3d2bfec	tests: continuous_data_consumer_test: increase coverage Cover also values in the ranges 0 to 1 and 2^63 to 2^64 - 1. Message-Id: <20180422150938.29143-2-avi@scylladb.com>	2018-04-23 11:39:06 +03:00
Avi Kivity	732177d2b0	tests: continuous_data_consumer_test: reduce runtime continuous_data_consumer_test takes an unreasonable amount of time to run, especially in debug mode. Reduce the run time by reducing the number of loops. Message-Id: <20180422150938.29143-1-avi@scylladb.com>	2018-04-23 11:39:06 +03:00
Duarte Nunes	cc6c96bc92	tests: Add view_complex_test This patch introduces view_complex_test and adds more test coverage for materialized views. A new file was introduced to avoid making view_schema_test slower. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:03 +01:00
Duarte Nunes	7ba1291731	tests/view_schema_test: Complete test Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:03 +01:00
Duarte Nunes	e6467f46b7	tests/view_schema_test: Remove unneeded test Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:02 +01:00
Avi Kivity	28be4ff5da	Revert "Merge "Implement loading sstables in 3.x format" from Piotr" This reverts commit `513479f624`, reversing changes made to `01c36556bf`. It breaks booting. Fixes #3376.	2018-04-23 06:47:00 +03:00
Avi Kivity	513479f624	Merge "Implement loading sstables in 3.x format" from Piotr " Pass sstable version to parse, write and describe_type methods to make it possible to handle different versions. For now serialization header from 3.x format is ignored. Tests: units (release) " * 'haaawk/sstables3/loading_v3' of ssh://github.com/scylladb/seastar-dev: Add test for loading the whole sstable Add test for loading statistics Add support for 3_x stats metadata Pass sstable version to describe_type Pass sstable version to write methods metadata_type: add Serialization type Pass sstable_version_types to parse methods Add test for reading filter Add test for read_summary sstables 3.x: Add test for reading TOC sstable: Make component_map version dependent sstable::component_type: add operator<< Extract sstable::component_type to separete header Remove unused sstable::get_shared_components sstable_version_types: add mc version	2018-04-22 16:18:39 +03:00
Piotr Jastrzebski	0288121c0a	Add test for loading the whole sstable Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-22 15:07:03 +02:00
Piotr Jastrzebski	fbe9ee72d6	Add test for loading statistics Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-22 15:07:03 +02:00
Piotr Jastrzebski	9b448b9082	Add test for reading filter Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-22 13:46:12 +02:00
Piotr Jastrzebski	6bb5468ba0	Add test for read_summary Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-22 13:46:12 +02:00
Piotr Jastrzebski	6c2cf40ce8	sstables 3.x: Add test for reading TOC Make sure DigestCRC32 is handled correctly. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-22 13:46:12 +02:00
Piotr Jastrzebski	82d483a1d3	Extract sstable::component_type to separete header It will be used in other places which won't depend on sstable. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-22 13:45:29 +02:00
Avi Kivity	70220d8f85	tests: sstable_datafile_test: peel off redundant parentheses around compression_parameters initializer The compression_parameter constructor is called with an extra level of parentheses. Presumably this caused a temporary object to be constructed and then moved into the argument being initialized, but gcc 8 complains about ambiguity. Make it happy by stripping off the redundant parentheses. Message-Id: <20180421121854.12314-1-avi@scylladb.com>	2018-04-21 13:53:29 +01:00
Avi Kivity	7a141c0240	tests: network_topology_strategy_test: peel off redundant parentheses around token initializer The token constructor is called with an extra level of parentheses. Presumably this caused a temporary object to be constructed and then moved into the variable being initialized, but gcc 8 complains about ambiguity. Make it happy by stripping off the redundant parentheses. Message-Id: <20180421121736.12136-1-avi@scylladb.com>	2018-04-21 13:53:29 +01:00
Avi Kivity	2c2175ab34	Merge "Add support for reading variant integers from SSTables" from Piotr " Enhance continuous_data_consumer to use existing vint serialization for reading variant integers from SSTables. Also available at: https://github.com/scylladb/seastar-dev/commits/haaawk/sstables3/unsigned-vint-v6 Tests: units (release) " * 'haaawk/sstables3/unsigned-vint-v6' of ssh://github.com/scylladb/seastar-dev: sstables: add test for continuous_data_consumer::read_unsigned_vint buffer_input_stream: make it possible to specify chunk size Add tests for make_limiting_data_source Introduce make_limiting_data_source sstables: add continuous_data_consumer::read_unsigned_vint Cover serialized_size_from_first_byte in tests core: add unsigned_vint::serialized_size_from_first_byte sstables: add all dependant headers to consumer.hh sstables: add all dependant headers to exceptions.hh core: add #pragma once to vint-serialization.hh	2018-04-17 10:09:38 +03:00
Piotr Jastrzebski	c5dda1c0c9	sstables: add test for continuous_data_consumer::read_unsigned_vint Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-16 21:14:34 +02:00
Piotr Jastrzebski	4406d11095	Add tests for make_limiting_data_source Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-16 21:00:35 +02:00
Piotr Jastrzebski	4431c1bbe7	Cover serialized_size_from_first_byte in tests Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-04-16 20:26:44 +02:00
Avi Kivity	7c01e66d53	cql3: query_processor: store and use just local shard reference of storage_proxy Since storage_proxy provides access to the entire cluster, a local shard reference is sufficient. Adjust query_processor to store a reference to just the local shard, rather than a seastar::sharded<storage_proxy> and adjust callers. This simplifies the code a little. Message-Id: <20180415142656.25370-3-avi@scylladb.com>	2018-04-16 10:20:50 +02:00
Piotr Sarna	5a6fcebed6	cql3: add toJson function This commit extends JSON support with toJson() function, which can be used in SELECT clause to transform a single argument to JSON form. toJson() accepts any type including nested collection types, so instead of being declared with concrete types, proper toJson() instances are generated during calls. This commit also supplements JSON CQL query tests with toJson calls. Finally, it refactors JSON tests so they use do_with_cql_env_thread. References #2058 Message-Id: <a7833650428e9ef590765a14e91c4d42532588f4.1523528698.git.sarna@scylladb.com>	2018-04-14 15:23:47 +03:00

1 2 3 4 5 ...

2201 Commits