scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-08 16:03:20 +00:00

Author	SHA1	Message	Date
Gleb Natapov	da20d86423	Configure authorized_prepared_statment_cache memory limit during object creation	2018-06-11 15:34:14 +03:00
Gleb Natapov	b38ced0fcd	Configure logalloc memory size during initialization	2018-06-11 15:34:14 +03:00
Gleb Natapov	04727acee9	Configure querier_cache size limit during object creation	2018-06-11 15:34:13 +03:00
Gleb Natapov	ac88935baa	Provide available memory size to storage_proxy object during creation	2018-06-11 15:34:13 +03:00
Gleb Natapov	f41575a156	Provide available memory size to database object during creation	2018-06-11 15:34:13 +03:00
Gleb Natapov	461f20e7b1	Configure prepared_statements_cache memory limit from outside Pass desirable memory limit during construction instead of querying memory size explicitly.	2018-06-11 15:34:13 +03:00
Tomasz Grabiec	a91974af7a	tests: row_cache: Reduce concurrency limit to avoid bad_alloc The test uses random mutations. We saw it failing with bad_alloc from time to time. Reduce concurrency to reduce memory footprint. Message-Id: <20180611090304.16681-1-tgrabiec@scylladb.com>	2018-06-11 10:06:56 +01:00
Vladimir Krivopalov	9c9c85cde5	tests: Add test writing UDT data to SSTables 3.x. Original data and index files are generated using Cassandra 3.11. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com> Message-Id: <d0ea8146d6f2a76a5f661271500b35390962a9d4.1528420647.git.vladimir@scylladb.com>	2018-06-10 19:20:42 +03:00
Avi Kivity	74469ecc09	Merge "Support reading collections" from Piotr " Implement and test support for reading collections in SSTables 3. Tests: unit {release} " * 'haaawk/sstables3/read-collections-v1' of ssh://github.com/scylladb/seastar-dev: sstables 3: Add tests for reading collections flat_mutation_reader_assertions: add more flexible asserts data_consume_rows_context_m: add support for collections mp_row_consumer_m: Add support for collections data_consume_rows_context_m: introduce cell_path Use column_translation::_is_collection in reading column_translation: add _column_is_collection() column_flags_m: add HAS_COMPLEX_DELETION Use read_unsigned_vint_length_bytes for COLUMN_VALUE Use read_unsigned_vint_length_bytes for CK_BLOCKS Implement read_unsigned_vint_length_bytes	2018-06-10 17:10:52 +03:00
Avi Kivity	2582f53b44	Merge "database and API: Add column_family::get_sstables_by_key" from Amnon " This is series is for nodetool getsstables. This patch is based on: `8daaf9833a` With some minor adjustments because of the code change in sstables. The idea is to allow searching for all the sstables that contains a given key. After this patch if there is a table t1 in keyspace k1 and it has a key called aa. curl -X GET "http://localhost:10000/column_family/sstables/by_key/k1%3At1?key=aa" Will return the list of sstables file names that contains that key. " * 'amnon/sstable_for_key_v4' of github.com:scylladb/seastar-dev: Add the API implementation to get_sstables_by_key api: column_family.json make the get_sstables_for_key doc clearer column_family: Add the get_sstables_by_partition_key method sstable test: add has_partition_key test sstable: Add has_partition_key method keys_test: add a test for nodetool_style string keys: Add from_nodetool_style_string factory method	2018-06-10 16:53:56 +03:00
Amnon Heiman	b8e5029991	sstable test: add has_partition_key test This patch adds a test to the has_partition_key method, it creates an sstable with a partition key and then used that key in the has_partition_key method to verify that it is there. It creates a different key and use that to verify that a non exist key return false.	2018-06-10 16:12:12 +03:00
Piotr Jastrzebski	7d3abb0668	sstables 3: Add tests for reading collections Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-06-07 23:40:10 +02:00
Piotr Jastrzebski	176305c2f2	flat_mutation_reader_assertions: add more flexible asserts Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-06-07 23:39:51 +02:00
Avi Kivity	6f23403137	Merge "Virtualize IndexInfo system table" from Duarte " The IndexInfo table tracks the secondary indexes that have already been populated. Since our secondary index implementation is backed by materialized views, we can virtualize that table so queries are actually answered by built_views. Fixes #3483 " * 'built-indexes-virtual-reader/v2' of github.com:duarten/scylla: tests/virtual_reader_test: Add test for built indexes virtual reader db/system_keysace: Add virtual reader for IndexInfo table db/system_keyspace: Explain that table_name is the keyspace in IndexInfo index/secondary_index_manager: Expose index_table_name() db/legacy_schema_migrator: Don't migrate indexes	2018-06-06 17:35:51 +03:00
Tomasz Grabiec	9975135110	row_cache: Make sure reader makes forward progress after each fill_buffer() If reader's buffer is small enough, or preemption happens often enough, fill_buffer() may not make enough progress to advance _lower_bound. If also iteartors are constantly invalidated across fill_buffer() calls, the reader will not be able to make progress. See row_cache_test.cc::test_reading_progress_with_small_buffer_and_invalidation() for an examplary scenario. Also reproduced in debug-mode row_cache_test.cc::test_concurrent_reads_and_eviction Message-Id: <1528283957-16696-1-git-send-email-tgrabiec@scylladb.com>	2018-06-06 16:01:52 +03:00
Duarte Nunes	833d34e88a	Merge 'Make rows in a secondary index ordered by token' from Piotr " As in #3423, ensuring token order on secondary index queries can be done by adding an additional column to views that back secondary indexes. This column is a first clustering column and contains token value, computed on updates. This series also updates tests and comments refering to issue 3423. Tests: unit (release, debug) " * 'order_by_token_in_si_5' of https://github.com/psarna/scylla: cql3: update token order comments index, tests: add token column to secondary index schema view: add handling of a token column for secondary indexes view: add is_index method	2018-06-06 10:07:43 +01:00
Piotr Sarna	4a9bf7ed5b	index, tests: add token column to secondary index schema Additional token column is now present in every view schema that backs a secondary index. This column is always a first part of the clustering key, so it forces token order on queries. Column's name is ideally idx_token, but can be postfixed with a number to ensure its uniqueness. It also updates tests to make them acknowledge the new token order. Fixes #3423	2018-06-06 09:02:33 +02:00
Piotr Jastrzebski	25a7f03f7f	Add test for compression in sstables 3.x Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-06-04 18:41:10 +02:00
Piotr Jastrzebski	be9c7391aa	Extract test_partition_key_with_values_of_different_types_read It will be used also for testing compression. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-06-04 18:41:10 +02:00
Piotr Jastrzebski	1f324b7fc8	sstable_3_x_test: use SEASTAR_THREAD_TEST_CASE Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-06-04 18:40:52 +02:00
Piotr Jastrzebski	3e3ccdb323	Drop UNCOMPRESSD_ when code will be used for compressed too Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-06-04 18:29:02 +02:00
Duarte Nunes	01676a2cda	tests/virtual_reader_test: Add test for built indexes virtual reader Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-06-04 11:31:29 +01:00
Avi Kivity	6f2d3b7f9f	Merge "Fix previous row size calculation for SSTables 3.x" from Vladimir " SSTables 3.x format ('m') stores the size of previous row or RT marker inside each row/marker. That potentially allows to traverse rows/markers in reverse order. The previous code calculating those sizes appeared to produce invalid values for all rows except the first one. The problem with detecting this bug was that neither Cassandra itself nor the sstabledump tool use those values, they are simply rejected on reading. From UnfilteredSerializer.deserializeRowBody() method, https://github.com/apache/cassandra/blob/cassandra-3.11/src/java/org/apache/cassandra/db/rows/UnfilteredSerializer.java#L562 : if (header.isForSSTable()) { in.readUnsignedVInt(); // Skip row size in.readUnsignedVInt(); // previous unfiltered size } So while the previous test files were technically correct in that they contained valid data readable by Cassandra/sstabledump, they didn't follow the format specification. This patchset fixes the code to produce correct values and replaces incorrect data files with correct ones. The newly generated data files have been validated to be identical to files generated with Cassandra using same data and timestamps as unit tests. Tests: Unit {release} " * 'projects/sstables-30/fix-prev-row_size/v1' of https://github.com/argenet/scylla: tests: Fix test files to use correct previous row sizes. sstables: Fix calculation of previous row size for SSTables 3.x sstables: Factor out code building promoted index blocks into separate helpers.	2018-06-03 11:38:22 +03:00
Avi Kivity	a43b3e22fc	Merge "Fix clustering blocks serialization for SSTables 3.x" from Vladimir " This patchset contains two fixes to the clustering key prefixes serialization logic for SSTables 3.x. First, it fixes a vexing typo: a bitwise-and (&) has been used instead of a remainder operator (%) for truncating the shift value. This did not show up in existing tests because they all had non-empty clustering columns values. Added tests to cover empty clustering columns values. Second, it fixes the logic of serialization to write values up to the prefix length, not the length of the clustering key as defined by schema. This matches the way it is done by the Origin. There is, however, a special case where the prefix size is smaller than that of a clustering key but we still need to serialize up to the full size. This is the case when a compact table is being used and some rows in it are added using incomplete clustering keys (containing null for trailing columns). In Cassandra, these prefixes still have a full length and missing columns are just set to 'null'. In our code those prefixes have their real length, but since we need to serialize beyond it, we pass a flag to indicate this. " * 'projects/sstables-30/fix-clustering-blocks/v1' of https://github.com/argenet/scylla: tests: Add test covering compact table with non-full clustering key. sstables: Improve clustering blocks writing, use logical clustering prefix size. tests: Add test covering large clustering keys (>32 columns) for SSTables 3.x tests: Add unit test covering empty values in clustering key. sstables: Fix typo in clustering blocks write helper.	2018-06-03 11:35:49 +03:00
Avi Kivity	1071e481ed	Merge "Implement support for missing columns in SSTable 3.0" from Piotr " Add handling for missing columns and tests for it. There are 3 cases: 1. Number of columns in a table is smaller than 64 2. Number of columns in a table is greater than 64 2a. and less than half of all possible columns are present in sstable 2b. and at least half of all possible columns are present in sstable Case 1 is implemented using bit mask and column is present if mask & (1 << <column number>) == 0 Case 2 is implemented by storing list of column numbers for each present column case 3 is implemented by storing list of column numbers for each absent column " * 'haaawk/sstables3/read-missing-columns-v3' of ssh://github.com/scylladb/seastar-dev: sstables 3: add test for reading big dense subset of columns sstables 3: support reading big dense subsets of columns sstables 3: add test for reading big sparse subset of columns sstables 3: support reading big sparse subsets of columns sstables 3: add test for reading small subset of columns sstables 3: support reading small subsets of columns	2018-06-03 10:42:00 +03:00
Piotr Jastrzebski	2fd0566eb7	sstables 3: add test for reading big dense subset of columns Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-06-02 10:41:18 +02:00
Piotr Jastrzebski	4e4972ffea	sstables 3: add test for reading big sparse subset of columns Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-06-02 10:40:56 +02:00
Piotr Jastrzebski	24e9ab4ab6	sstables 3: add test for reading small subset of columns Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-06-01 21:34:03 +02:00
Vladimir Krivopalov	b6511d1b07	tests: Add test covering compact table with non-full clustering key. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-05-31 17:30:36 -07:00
Vladimir Krivopalov	3f404f19dc	tests: Add test covering large clustering keys (>32 columns) for SSTables 3.x Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-05-31 17:30:36 -07:00
Vladimir Krivopalov	487796de85	tests: Add unit test covering empty values in clustering key. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-05-31 17:30:36 -07:00
Avi Kivity	aab6b0ee27	Merge "Introduce new in-memory representation for cells" from Paweł " This is the first part of the first step of switching Scylla. It covers converting cells to the new serialisation format. The actual structure of the cells doesn't differ much from the original one with a notable exception of the fact that large values are now fragmented and linearisation needs to be explicit. Counters and collections still partially rely on their old, custom serialisation code and their handling is not optimial (although not significantly worse than it used to be). The new in-memory representation allows objects to be of varying size and makes it possible to provide deserialisation context so that we don't need to keep in each instance of an IMR type all the information needed to interpret it. The structure of IMR types is described in C++ using some metaprogramming with the hopes of making it much easier to modify the serialisation format that it would be in case of open-coded serialisation functions. Moreover, IMR types can own memory thanks to a limited support for destructors and movers (the latter are not exactly the same thing as C++ move constructors hence a different name). This makes it (relatively) to ensure that there is an upper bound on the size of all allocations. For now the only thing that is converted to the IMR are atomic_cells and collections which means that the reduction in the memory footprint is not as big as it can be, but introducing the IMR is a big step on its own and also paves the way towards complete elimination of unbounded memory allocations. The first part of this patchset contains miscellaneous preparatory changes to various parts of the Scylla codebase. They are followed by introduction of the IMR infrastructure. Then structure of cells is defined and all helper functions are implemented. Next are several treewide patches that mostly deal with propagating type information to the cell-related operations. Finally, atomic_cell and collections are switched to used the new IMR-based cell implementation. The IMR is described in much more detail in imr/IMR.md added in "imr: add IMR documentation". Refs #2031. Refs #2409. perf_simple_query -c4, medians of 30 results: ./perf_base ./perf_imr diff read 308790.08 309775.35 0.3% write 402127.32 417729.18 3.9% The same with 1 byte values: ./perf_base1 ./perf_imr1 diff read 314107.26 314648.96 0.2% write 463801.40 433255.96 -6.6% The memory footprint is reduced, but that is partially due to removal of small buffer optimisation (whether it will be restored depends on the exact mesurements of the performance impact). Generally, this series was not expected to make a huge difference as this would require converting whole rows to the IMR. Memory footprint: Before: mutation footprint: - in cache: 1264 - in memtable: 986 After: mutation footprint: - in cache: 1104 - in memtable: 866 Tests: unit (release, debug) " * tag 'imr-cells/v3' of https://github.com/pdziepak/scylla: (37 commits) tests/mutation: add test for changing column type atomic_cell: switch to new IMR-based cell reperesentation atomic_cell: explicitly state when atomic_cell is a collection member treewide: require type for creating collection_mutation_view treewide: require type for comparing cells atomic_cell: introduce fragmented buffer value interface treewide: require type to compute cell memory usage treewide: require type to copy atomic_cell treewide: require type info for copying atomic_cell_or_collection treewide: require type for creating atomic_cell atomic_cell: require column_definition for creating atomic_cell views tests: test imr representation of cells types: provide information for IMR data: introduce cell data: introduce type_info imr/utils: add imr object holder imr: introduce concepts imr: add helper for allocating objects imr: allow creating lsa migrators for IMR objects imr: introduce placeholders ...	2018-05-31 19:21:15 +03:00
Tomasz Grabiec	b5e42bc6a0	tests: row_cache: Do not hang when only one of the readers throws Message-Id: <20180531122729.3314-1-tgrabiec@scylladb.com>	2018-05-31 18:00:22 +03:00
Paweł Dziepak	131a47dea3	tests/mutation: add test for changing column type With the introduction of the new in-memory representation changing column type has become a more complex operation since it needs to handle switch from fixed-size to variable-size types. This commit adds an explicit test for such cases.	2018-05-31 15:51:11 +01:00
Paweł Dziepak	aa25f0844f	atomic_cell: introduce fragmented buffer value interface As a prepratation for the switch to the new cell representation this patch changes the type returned by atomic_cell_view::value() to one that requires explicit linearisation of the cell value. Even though the value is still implicitly linearised (and only when managed by the LSA) the new interface is the same as the target one so that no more changes to its users will be needed.	2018-05-31 15:51:11 +01:00
Paweł Dziepak	418c159057	treewide: require type to copy atomic_cell	2018-05-31 15:51:11 +01:00
Paweł Dziepak	27014a23d7	treewide: require type info for copying atomic_cell_or_collection	2018-05-31 15:51:11 +01:00
Paweł Dziepak	e9d6fc48ac	treewide: require type for creating atomic_cell	2018-05-31 15:51:11 +01:00
Paweł Dziepak	93130e80fb	atomic_cell: require column_definition for creating atomic_cell views	2018-05-31 15:51:11 +01:00
Paweł Dziepak	b25cc61a13	tests: test imr representation of cells	2018-05-31 15:51:11 +01:00
Duarte Nunes	f8626c7c93	tests/view_schema_test: Test view correctness under base schema changes Reproducer for #3443. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20180530194536.51202-2-duarte@scylladb.com>	2018-05-31 12:10:50 +03:00
Paweł Dziepak	8c38f09fbc	tests/imr: add tests for destructor and mover methods	2018-05-31 10:09:01 +01:00
Paweł Dziepak	a29a88c9d9	tests/imr/compound: add tests for structure<...>	2018-05-31 10:09:01 +01:00
Paweł Dziepak	466d91f652	tests/imr/compound: add tests for variant<Ts...>	2018-05-31 10:09:01 +01:00
Paweł Dziepak	7c28c9eda8	tests/imr: add test for optional<T>	2018-05-31 10:09:01 +01:00
Paweł Dziepak	eb2479fa9a	tests: add test for new in memory representation	2018-05-31 10:09:01 +01:00
Paweł Dziepak	0092076167	tests: add helpers for generating random data	2018-05-31 10:09:01 +01:00
Paweł Dziepak	cc76480174	tests: introduce tests for metaprogramming helpers	2018-05-31 10:09:01 +01:00
Vladimir Krivopalov	0886c189bf	tests: Fix test files to use correct previous row sizes. Since sstabledump and Cassandra do not use row size values, the new files have been validated to be identical to files generated by Cassandra with the same data inserted at same timestamps. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-05-30 18:18:35 -07:00
Nadav Har'El	a1cbeeffcd	tests/view_complex_test.cc: fix and enable buggy test tests/view_complex_test.cc contained a #ifdef'ed-out test claiming to be a reproducer for issue #3362. Unfortunately, it it is not - after earlier commits the only reason this test still fails is a mistake in the test, which expects 0 rows in a case where the real result is 1 row. Issue #3362 does not have to be fixed to fix this test. So this patch fixes the broken test, and enables it. It also adds comments explaining what this test is supposed to do, and why it works the way it does. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180530142214.29398-1-nyh@scylladb.com>	2018-05-30 15:39:25 +01:00

1 2 3 4 5 ...

2345 Commits