scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-09 08:23:29 +00:00

Author	SHA1	Message	Date
Raphael S. Carvalho	d29482dce8	sstables: deprecate sstable metadata's ancestors The reason for that is that it's not available in sstable format mc, so we can no longer rely on it in common code for the currently supported formats. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20181121170057.20900-1-raphaelsc@scylladb.com>	2018-11-23 19:38:32 +01:00
Tomasz Grabiec	564b328b2e	Merge 'Add tests for schema changes' from Paweł This series adds a generic test for schema changes that generates various schema and data before and after an ALTER TABLE operation. It is then used to check correctness of mutation::upgrade() and sstable readers and led to the discovery of #3924 and #3925. Fixes #3925. * https://github.com/pdziepak/scylla.git schema-change-test/v3.1 schema_builder: make member function names less confusing converting_mutation_partition_applier: fix collection type changes converting_mutation_partition_applier: do not emit empty collections sstable: use format() instead of sprint() tests/random-utils: make functions and variables inline tests: add models for schemas and data tests: generate schema changes tests/mutation: add test for schema changes tests/sstable: add test for schema changes	2018-11-23 15:11:31 +01:00
Paweł Dziepak	09439cd809	tests/sstable: add test for schema changes for_each_schema_change() is used for testing reading an sstable that was written with a different schema. Because of #3924, for now the mc format is not verified this way.	2018-11-23 12:14:06 +00:00
Paweł Dziepak	dc7f9fea5b	tests/mutation: add test for schema changes	2018-11-23 12:14:06 +00:00
Paweł Dziepak	35f9f424e9	tests: generate schema changes This patch adds for_each_schema_change() functions, which generate schemas and data before and after some modification to the schema (e.g. adding a column, changing its type). It can be used to test schema upgrades.	2018-11-23 12:14:06 +00:00
Paweł Dziepak	daee4bd3b8	tests: add models for schemas and data This patch introduces a model of Scylla schemas and data, implemented using simple standard library primitives. It can be used for testing the actual schemas, mutation_partitions, etc. used by the schema by comparing the results of various actions. The initial use case for this model was to test schema changes, but there is no reason why in the future it cannot be extended to test other things as well.	2018-11-23 12:14:06 +00:00
Takuya ASADA	cf0d00b81a	dist/ami: fix 'unknown configuration key: "enhanced_networking"' error while building AMI packer 1.3.2 no longer supports the enhanced_networking directive, so we need to use the new directives ("sriov_support" and "ena_support") to build with the new version. packer provides an automatic configuration file fixing tool, so the new scylla.json is generated by the following command: ./packer/packer fix scylla.json Fixes #3938 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20181123053719.32451-1-syuu@scylladb.com>	2018-11-23 08:15:47 +02:00
Paweł Dziepak	91793c0a43	bytes_ostream: drop appending_hash specialisation appending_hash is used for computing hashes that become part of the binary interface. They cannot change between Scylla versions and the same data needs to always result in the same hash. At the moment, appending_hash<bytes_ostream> doesn't fulfil those requirements since it leaks information about how the underlying buffer is fragmented. Fortunately, it has no users so it doesn't cause any compatibility issues. Moreover, bytes_ostream is usually used as an output of some serialisation routine (e.g. frozen_mutation_fragment or CQL response). Those serialisation formats do not guarantee that there is a single representation of a given data and therefore are not fit to be hashed by appending_hash. Removing appending_hash<bytes_ostream> may help prevent such incorrect uses. Message-Id: <20181122163823.12759-1-pdziepak@scylladb.com>	2018-11-22 23:53:54 +00:00
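To illustrate why a hash that depends on buffer fragmentation is unsuitable for a binary interface, here is a minimal, self-contained sketch. It does not reproduce Scylla's appending_hash or bytes_ostream: the `fragments` alias, `hash_leaky()` and `hash_stable()` are hypothetical names, FNV-1a stands in for the real hash, and feeding the fragment length into the hash is only an assumed way in which fragmentation could leak.

```cpp
#include <cassert>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical stand-in for a fragmented buffer: each string is one fragment.
using fragments = std::vector<std::string>;

constexpr uint64_t fnv_offset = 14695981039346656037ULL;
constexpr uint64_t fnv_prime = 1099511628211ULL;

// One FNV-1a step: mix a single byte into the running hash.
inline uint64_t fnv1a_step(uint64_t h, uint8_t byte) {
    return (h ^ byte) * fnv_prime;
}

// Fragmentation-sensitive (assumed failure mode): each fragment's length is
// mixed in, so the same bytes split differently feed different input.
inline uint64_t hash_leaky(const fragments& fs) {
    uint64_t h = fnv_offset;
    for (const auto& f : fs) {
        assert(f.size() < 256);  // toy model: the length is fed as one byte
        h = fnv1a_step(h, static_cast<uint8_t>(f.size()));
        for (unsigned char c : f) {
            h = fnv1a_step(h, c);
        }
    }
    return h;
}

// Fragmentation-independent: only the logical byte sequence is hashed.
inline uint64_t hash_stable(const fragments& fs) {
    uint64_t h = fnv_offset;
    for (const auto& f : fs) {
        for (unsigned char c : f) {
            h = fnv1a_step(h, c);
        }
    }
    return h;
}
```

With `hash_stable()`, the buffers `{"abcd"}` and `{"ab", "cd"}` hash identically because they carry the same bytes; with `hash_leaky()` the two feed different input to the hash, which is the kind of instability the commit message describes.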
Tomasz Grabiec	fb38f0e9f8	Update seastar submodule * seastar b924495...1fbb633 (3): > rpc: Reduce code duplication > tests: perf: Make do_not_optimize() take the argument by const& > doc: Fix import paths in the tutorial	2018-11-22 23:53:54 +00:00
Paweł Dziepak	2a0e929830	tests/random-utils: make functions and variables inline random-utils.hh is a header which may be included in multiple translation units so all members should be non-static inline to avoid any duplication.	2018-11-22 11:30:31 +00:00
Paweł Dziepak	edb5402a73	sstable: use format() instead of sprint() The format message was using the new style formatting markers ("{}") which are understood by format() but not by sprint() (the latter is basically deprecated).	2018-11-22 11:30:31 +00:00
Paweł Dziepak	1fbe33791d	converting_mutation_partition_applier: do not emit empty collections This patch changes the behaviour of the schema upgrade code so that if all cells and the tombstones of a collection are removed during the upgrade the collection is not emitted (as opposed to emitting an empty one). Both behaviours are valid, but the new one makes it more consistent with how atomic cells are upgraded and how schema upgrades work for sstable readers.	2018-11-22 11:30:31 +00:00
Paweł Dziepak	7b12aaa093	converting_mutation_partition_applier: fix collection type changes ALTER TABLE allows changing the type of a collection to a compatible one. This includes changes from a fixed-sized type to a variable-sized one. If that happens the atomic_cells representing collection elements need to be rewritten so that the value size is included. The logic for rewriting atomic cells already exists (for those that are not collection members) and is reused in this patch. Fixes #3925.	2018-11-22 11:30:31 +00:00
Paweł Dziepak	43e0201ec6	schema_builder: make member function names less confusing Right now, schema_builder member functions have names that very poorly convey the actions that are performed by them. This is made even worse by some overloads which drastically change the semantics. For example: schema_builder() .with_column("v1", /* ... */) .without_column("v1", removal_timestamp); Creates a column "v1" and adds information that there was a column with that name that was removed at 'removal_timestamp'. schema_builder() .with_column("v1") .without_column(utf8_type->decompose("v1")); This adds column "v1" and then immediately removes it. In order to clean up this mess the names were changed so that: * with_/without_ functions only add information to the schema (e.g. info that a column was removed, but without removing a column of that name if one exists) * functions whose names start with a verb actually perform that action, e.g. the new remove_column() removes the column (and adds information that it used to exist) as in the second example.	2018-11-22 11:30:31 +00:00
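The naming rule described in the commit above can be sketched with a toy builder. This is not Scylla's schema_builder: `toy_schema_builder`, its members, and the explicit timestamp argument to `remove_column()` are hypothetical and exist only to show the difference between functions that add information and functions named after a verb.

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <set>
#include <string>

// Toy model (hypothetical): live columns plus a record of dropped columns.
class toy_schema_builder {
    std::set<std::string> _columns;
    std::map<std::string, int64_t> _dropped;  // name -> removal timestamp
public:
    // with_*: only adds information (here, a live column).
    toy_schema_builder& with_column(const std::string& name) {
        _columns.insert(name);
        return *this;
    }
    // without_*: only records that a column of this name was once removed;
    // a live column with the same name is left untouched.
    toy_schema_builder& without_column(const std::string& name, int64_t ts) {
        _dropped[name] = ts;
        return *this;
    }
    // Verb-named: performs the action, i.e. removes the live column and
    // records that it used to exist.
    toy_schema_builder& remove_column(const std::string& name, int64_t ts) {
        _columns.erase(name);
        _dropped[name] = ts;
        return *this;
    }
    bool has_column(const std::string& name) const {
        return _columns.count(name) != 0;
    }
    bool was_dropped(const std::string& name) const {
        return _dropped.count(name) != 0;
    }
};

// Usage example mirroring the two cases from the commit message.
inline void example_usage() {
    toy_schema_builder a;
    a.with_column("v1").without_column("v1", 10);
    assert(a.has_column("v1") && a.was_dropped("v1"));

    toy_schema_builder b;
    b.with_column("v1").remove_column("v1", 10);
    assert(!b.has_column("v1") && b.was_dropped("v1"));
}
```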
Benny Halevy	dcd18e2b62	remove exec permission from top_k source files This was introduced by `32525f2694` Cc: Rafi Einstein <rafie@scylladb.com> Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20181121163352.13325-1-bhalevy@scylladb.com>	2018-11-21 18:38:50 +02:00
Gleb Natapov	b4a8802edc	hints: make hints manager more resilient to unexpected directory content Currently if hints directory contains unexpected directories Scylla fails to start with unhandled std::invalid_argument exception. Make the manager ignore malformed files instead and try to proceed anyway. Message-Id: <20181121134618.29936-2-gleb@scylladb.com>	2018-11-21 14:53:03 +00:00
Gleb Natapov	9433d02624	hints: add auxiliary function for scanning high level hints directory We scan hints directory in two places: to search for files to replay and to search for directories to remove after resharding. The code that translates directory name to a shard is duplicated. It is simple now, so not a big issue, but in case it grows it is better to have it in one place. Message-Id: <20181121134618.29936-1-gleb@scylladb.com>	2018-11-21 14:53:03 +00:00
Paweł Dziepak	4aa5d83590	Merge "Optimize sstable writing of the MC format" from Tomasz " Tested with perf_fast_forward from: github.com/tgrabiec/scylla.git perf_fast_forward-for-sst3-opt-write-v1 Using the following command line: build/release/tests/perf/perf_fast_forward_g --populate --sstable-format=mc \ --data-directory /tmp/perf-mc --rows=10000000 -c1 -m4G \ --datasets small-part The average reported flush throughput was (stdev for the averages is around 4k): - for mc before the series: 367848 frag/s - for lc before the series: 463458 frag/s (= mc.before +25%) - for mc after the series: 429276 frag/s (= mc.before +16%) - for lc after the series: 466495 frag/s (= mc.before +26%) Refs #3874. " * tag 'sst3-opt-write-v2' of github.com:tgrabiec/scylla: sstables: mc: Avoid serialization of promoted index when empty sstables: mc: Avoid double serialization of rows tests: sstable 3.x: Do not compare Statistics component utils: Introduce memory_data_sink schema: Optimize column count getters sstables: checksummed_file_data_sink_impl: Bypass output_stream	2018-11-21 13:11:40 +00:00
Tomasz Grabiec	049926bfb8	sstables: mc: Avoid serialization of promoted index when empty calculate_write_size() adds some overhead, even if we're not going to write anything.	2018-11-21 14:04:27 +01:00
Tomasz Grabiec	0a9f5b563a	sstables: mc: Avoid double serialization of rows The old code was serializing the row twice. Once to get the size of its block on disk, which is needed to write the block length, and then to actually write the block. This patch avoids this by serializing once into a temporary buffer and then appending that buffer to the data file writer. I measured about 10% improvement in memtable flush throughput with this for the small-part dataset in perf_fast_forward.	2018-11-21 14:04:27 +01:00
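The single-pass approach from the commit above can be sketched as follows. This is a simplified illustration, not the sstable writer: `buffer`, `serialize_row()` and `write_row_block()` are hypothetical names, and a one-byte length prefix is assumed in place of the real on-disk length encoding.

```cpp
#include <cassert>
#include <cstdint>
#include <string>
#include <vector>

using buffer = std::vector<uint8_t>;

// Hypothetical row serializer: appends the row's bytes to `out`.
inline void serialize_row(const std::string& row, buffer& out) {
    out.insert(out.end(), row.begin(), row.end());
}

// Writes a length-prefixed block. The row is serialized exactly once into a
// temporary buffer; its size gives the block length, and the same bytes are
// then appended to the data file instead of being serialized a second time.
inline void write_row_block(const std::string& row, buffer& data_file) {
    buffer tmp;
    serialize_row(row, tmp);
    assert(tmp.size() < 256);  // toy model: one-byte length prefix (assumption)
    data_file.push_back(static_cast<uint8_t>(tmp.size()));
    data_file.insert(data_file.end(), tmp.begin(), tmp.end());
}
```

The trade-off is one extra copy of the serialized bytes in exchange for skipping a second serialization pass.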
Tomasz Grabiec	8f686af9af	tests: sstable 3.x: Do not compare Statistics component The Statistics component recorded in the test was generated using a buggy version of Scylla, and is not correct. Exposed by fixing the bug in the way statistics are generated. Rather than comparing binary content, we should have explicit checks for statistics.	2018-11-21 14:04:27 +01:00
Tomasz Grabiec	143fd6e1c2	utils: Introduce memory_data_sink	2018-11-21 14:04:27 +01:00
Tomasz Grabiec	789fac9884	schema: Optimize column count getters	2018-11-21 14:04:27 +01:00
Tomasz Grabiec	8e8b96c6ed	sstables: checksummed_file_data_sink_impl: Bypass output_stream We can avoid the data copying by switching from this: sink -> stream -> sink to this: sink -> sink	2018-11-21 14:04:27 +01:00
Avi Kivity	bb85a21a8f	Merge "compress: Restore lz4 as default compressor" from Duarte " Enables sstable compression with LZ4 by default, which was the long-time behavior until a regression turned off compression by default. Fixes #3926 " * 'restore-default-compression/v2' of https://github.com/duarten/scylla: tests/cql_query_test: Assert default compression options compress: Restore lz4 as default compressor tests: Be explicit about absence of compression	2018-11-21 14:20:39 +02:00
Benny Halevy	76b1c184b7	conf: clean up cassandra references in scylla.yaml Indicate the default scylla directories, rather than Cassandra's. Provide links to Scylla documentation where possible, update links to Cassandra documentation otherwise. Clean up a few typos. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20181119141912.28830-1-bhalevy@scylladb.com>	2018-11-21 13:04:24 +02:00
Rafael Ávila de Espíndola	7fa7e9716d	Mention scylla-tools-java and scylla-jmx in HACKING.md I struggled a bit finding out why nodetool was not working, so it might be a good idea to expand the documentation a bit. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20181120233358.25859-1-espindola@scylladb.com>	2018-11-21 12:55:17 +02:00
Tomasz Grabiec	349c9f7a69	HACKING.md: Add a link to the slides about core dump debugging tools Message-Id: <1542793207-1620-1-git-send-email-tgrabiec@scylladb.com>	2018-11-21 11:45:23 +02:00
Michael Munday	53fdde75f6	dht: use little endian byte order explicitly for token hash This avoids a difference between little and big endian systems. We now also calculate a full murmur hash for tokens with less than 8 bytes, however in practice the token size is always 8. Message-Id: <20181120214733.43800-1-mike.munday@ibm.com>	2018-11-21 11:44:29 +02:00
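A minimal sketch of what reading a value in explicit little-endian order looks like, independent of the host's native byte order. `read_le64()` is a hypothetical helper written for illustration and is not taken from the patch.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Assembles a 64-bit value from 8 bytes stored in little-endian order.
// The shifts operate on values, not memory layout, so the result is the
// same on little- and big-endian hosts.
inline uint64_t read_le64(const uint8_t* p) {
    assert(p != nullptr);
    uint64_t v = 0;
    for (size_t i = 0; i < 8; ++i) {
        v |= static_cast<uint64_t>(p[i]) << (8 * i);
    }
    return v;
}
```

By contrast, copying the 8 bytes straight into a `uint64_t` would yield different values on hosts with different byte orders, which is the kind of discrepancy the commit avoids.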
Michael Munday	360374cfde	tests: fix compilation of partitioner_test with boost 1.68 on IBM Z The boost multiprecision library that I am compiling against seems to be missing an overload for the cast to a string. The easy workaround seems to be to call str() directly instead. This also fixes #3922. Message-Id: <20181120215709.43939-1-mike.munday@ibm.com>	2018-11-21 11:43:42 +02:00
Duarte Nunes	9464fffc8c	tests/cql_query_test: Assert default compression options Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-11-20 22:47:27 +00:00
Duarte Nunes	36dc9e3280	compress: Restore lz4 as default compressor Fixes a regression introduced in `74758c87cd`, where tables started to be created without compression by default (before they were created with lz4 by default). Fixes #3926 Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-11-20 22:47:27 +00:00
Duarte Nunes	5f64e34fcc	tests: Be explicit about absence of compression Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-11-20 22:47:26 +00:00
Avi Kivity	775b7e41f4	Update seastar submodule * seastar d59fcef...b924495 (2): > build: Fix protobuf generation rules > Merge "Restructure files" from Jesse Includes fixup patch from Jesse: " Update Seastar `#include`s to reflect restructure All Seastar header files are now prefixed with "seastar" and the configure script reflects the new locations of files. Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com> Message-Id: <5d22d964a7735696fb6bb7606ed88f35dde31413.1542731639.git.jhaberku@scylladb.com> "	2018-11-21 00:01:44 +02:00
Takuya ASADA	42baf6a6f7	dist/ami: update packer Update packer to latest version, 1.3.2. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20181031110441.16284-2-syuu@scylladb.com>	2018-11-20 21:29:57 +02:00
Takuya ASADA	b9a42e83ad	dist/ami: enable AMI build log To make it easier to debug AMI build errors, enable logging. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20181031110441.16284-1-syuu@scylladb.com>	2018-11-20 21:29:57 +02:00
Takuya ASADA	72411f95cb	reloc/build_reloc.sh: find ninja-build after executing install-dependencies.sh The build environment may not have ninja-build installed before running install-dependencies.sh, so look for it after running the script. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <20181031110737.17755-1-syuu@scylladb.com>	2018-11-20 21:29:57 +02:00
Avi Kivity	183c2369f3	Update seastar submodule * seastar a44cedf...d59fcef (10): > dns: Set tcp output stream buffer size to zero explicitly > tests: add libc-ares to travis dependencies > tests: add dns_test to test suite > build: drop bundled c-ares package > prometheus: replace the instance label with an optional one > build: Refactor C++ dialect detection > build: add libatomic to install-dependencies.sh > core: use std::underlying_type for open_flags > core: introduce open_flags::operator& > core: Fix build for `gnu++14`	2018-11-20 21:29:57 +02:00
Tomasz Grabiec	57e25fa0f8	utils: phased_barrier: Make advance_and_await() have strong exception guarantees Currently, when advance_and_await() fails to allocate the new gate object, it will throw bad_alloc and leave the phased_barrier object in an invalid state. Calling advance_and_await() again on it will result in undefined behavior (typically SIGSEGV) because _gate will be disengaged. One place affected by this is table::seal_active_memtable(), which calls _flush_barrier.advance_and_await(). If this throws, subsequent flush attempts will SIGSEGV. This patch rearranges the code so that advance_and_await() has strong exception guarantees. Message-Id: <1542645562-20932-1-git-send-email-tgrabiec@scylladb.com>	2018-11-20 16:15:12 +00:00
Glauber Costa	9f403334c8	remove monitor if sstable write failed In (almost) all SSTable write paths, we need to inform the monitor that the write has failed as well. The monitor will remove the SSTable from controller's tracking at that point. Except there is one place where we are not doing that: streaming of big mutations. Streaming of big mutations is an interesting use case, in which it is done in 2 parts: if the writing of the SSTable fails right away, then we do the correct thing. But the SSTables are not committed at that point and the monitors are still kept around with the SSTables until a later time, when they are finally committed. Between those two points in time, it is possible that the streaming code will detect a failure and manually call fail_streaming_mutations(), which marks the SSTable for deletion. At that point we should propagate that information to the monitor as well, but we don't. Fixes #3732 (hopefully) Tests: unit (release) Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20181114213618.16789-1-glauber@scylladb.com>	2018-11-20 16:15:12 +00:00
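The strong exception guarantee described in the phased_barrier commit can be illustrated with a toy class. This is not the Seastar/Scylla phased_barrier: `toy_barrier`, `toy_gate` and the `fail_next_alloc` hook are hypothetical, the waiting part of advance_and_await() is omitted, and the "unsafe" ordering is only one assumed way such a bug can arise.

```cpp
#include <cassert>
#include <memory>
#include <new>
#include <utility>

struct toy_gate {
    int phase;
};

class toy_barrier {
    std::unique_ptr<toy_gate> _gate = std::make_unique<toy_gate>(toy_gate{0});
public:
    bool fail_next_alloc = false;  // test hook simulating allocation failure

    // Unsafe ordering: the old gate is moved out before the new one exists.
    // If allocation throws, _gate stays disengaged.
    void advance_unsafe() {
        auto old = std::move(_gate);
        _gate = make_gate(old->phase + 1);  // may throw
    }

    // Strong guarantee: do everything that can throw first, then commit
    // with an operation that cannot throw.
    void advance_safe() {
        auto fresh = make_gate(_gate->phase + 1);  // may throw; state untouched
        _gate = std::move(fresh);                  // unique_ptr move: noexcept
    }

    bool engaged() const { return static_cast<bool>(_gate); }
    int phase() const {
        assert(_gate);
        return _gate->phase;
    }
private:
    std::unique_ptr<toy_gate> make_gate(int phase) {
        if (fail_next_alloc) {
            fail_next_alloc = false;
            throw std::bad_alloc();
        }
        return std::make_unique<toy_gate>(toy_gate{phase});
    }
};
```

After a failed `advance_safe()` the object is exactly as it was before the call, so a retry works; after a failed `advance_unsafe()` the gate pointer is null and any later dereference is undefined behavior.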
Gleb Natapov	d144e6ceac	messaging_service: enable port load balancing algorithm for RPC server In a homogeneous cluster this will reduce number of internal cross-shard hops per request since RPC calls will arrive to correct shard. Message-Id: <20181118150817.GF2062@scylladb.com>	2018-11-20 16:15:12 +00:00
Michael Munday	b9a2f4a228	dht: fix byte ordered partitioner midpoint calculation New versions of boost saturate the output of the convert_to method so we need to mask the part we want to extract. Updates #3922. Message-Id: <20181116191441.35000-1-mike.munday@ibm.com>	2018-11-16 21:19:06 +02:00
Glauber Costa	c6811bd877	sstables: correctly parse estimated histograms In commit `a33f0d6`, we changed the way we handle arrays during the write and parse code to avoid reactor stalls. Some potentially big loops were transformed into futurized loops, and also some calls to vector resizes were replaced by a reserve + push_back idiom. The latter broke parsing of the estimated histogram. The reason being that the vectors that are used here are already initialized internally by the estimated_histogram object. Therefore, when we push_back, we don't fill the array all the way from index 0, but end up with a zeroed beginning and only push back some of the elements we need. We could revert this array to a resize() call. After all, the reason we are using reserve + push_back is to avoid calling the constructor member for each element, but we don't really expect the integer specialization to do any of that. However, to avoid confusion with future developers that may feel tempted to convert this as well for the sake of consistency, it is safer to just make sure these arrays are zeroed. Fixes #3918 Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <20181116130853.10473-1-glauber@scylladb.com>	2018-11-16 20:52:44 +02:00
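The pitfall described in the histogram commit, reserve + push_back on a vector that is already sized, can be reproduced in a few lines. `toy_histogram`, `parse_buggy()` and `parse_fixed()` are hypothetical names, and the fix shown (emptying the vector before appending) is one possible remedy, not necessarily the one the patch applies.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a histogram whose bucket vector is already
// sized and zero-filled by its constructor.
struct toy_histogram {
    std::vector<int64_t> buckets;
    explicit toy_histogram(size_t n) : buckets(n, 0) {}
};

// Buggy pattern: reserve() does not change size(), so push_back() appends
// after the pre-existing zeros instead of filling from index 0.
inline void parse_buggy(toy_histogram& h, const std::vector<int64_t>& parsed) {
    h.buckets.reserve(parsed.size());
    for (auto v : parsed) {
        h.buckets.push_back(v);
    }
}

// One possible fix: start from an empty vector, then reserve and append.
inline void parse_fixed(toy_histogram& h, const std::vector<int64_t>& parsed) {
    h.buckets.clear();
    h.buckets.reserve(parsed.size());
    for (auto v : parsed) {
        h.buckets.push_back(v);
    }
    assert(h.buckets.size() == parsed.size());
}
```

For a histogram constructed with three buckets and parsed input `{1, 2, 3}`, `parse_buggy()` leaves six elements with zeros at the front, while `parse_fixed()` leaves exactly the three parsed values.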
Avi Kivity	d708dabab9	doc: add reference to Linux' submitting-patches document Since our development process is a derivative of Linux, almost everything there is pertinent. Message-Id: <20181115184037.5256-1-avi@scylladb.com>	2018-11-16 20:15:40 +02:00
Vladimir Krivopalov	759fbbd5f6	random_mutation_generator: Add row_marker to rows regardless of whether they're deleted. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com> Message-Id: <f55b91f1349f0e98def6b7ca9755b5ccf4f48a3e.1542308626.git.vladimir@scylladb.com>	2018-11-16 13:17:07 +01:00
Avi Kivity	6548a404b2	Remove patch file committed by mistake	2018-11-15 19:47:55 +02:00
Duarte Nunes	6fbf792777	db/view/view_builder: Don't timeout waiting for view to be built Remove the timeout argument to db::view::view_builder::wait_until_built(), a test-only function to wait until a given materialized view has finished building. This change is motivated by the fact that some tests running on slow environments will timeout. Instead of incrementally increasing the timeout, remove it completely since tests are already run under an exterior timeout. Fixes #3920 Tests: unit release(view_build_test, view_schema_test) Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20181115173902.19048-1-duarte@scylladb.com>	2018-11-15 19:41:43 +02:00
Amnon Heiman	25378916bc	API: column_family.hh yield in map_reduce_column_families_locally map_reduce_column_families_locally iterates over all tables (column families) in a shard. If the number of tables is big it can cause latency spikes. This patch replaces the current loop with a do_for_each allowing preemption inside the loop. Fixes #3886 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <20181115154825.23430-1-amnon@scylladb.com>	2018-11-15 18:58:23 +02:00
Nadav Har'El	45f05b06d2	view_complex_test: fix another ttl In a previous patch I fixed most TTLs in the view_complex_test.cc tests from low numbers to 100 seconds. I missed one. This one never caused problems in practice, but for good form, let's fix it too. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20181115160234.26478-1-nyh@scylladb.com>	2018-11-15 18:03:28 +02:00
Nadav Har'El	78ed7d6d0c	Materialized Views and Secondary Index: no longer experimental After this patch, the Materialized Views and Secondary Index features are considered generally-available and no longer require passing an explicit "--experimental=on" flag to Scylla. The "--experimental=on" flag and the db::config::check_experimental() function remain unused, as we graduated the only two features which used this flag. However, we leave the support for experimental features in the code, to make it easier to add new experimental features in the future. Another reason to leave the command-line parameter behind is so existing scripts that still use it will not break. Fixes #3917 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20181115144456.25518-1-nyh@scylladb.com>	2018-11-15 17:59:27 +02:00


16983 Commits