scylladb

Author	SHA1	Message	Date
Paweł Dziepak	4e43cb84ff	mests/sstables: test reading sstable with duplicated range tombstones Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com> (cherry picked from commit `b405ff8ad2`)	2016-07-27 14:09:02 +03:00
Paweł Dziepak	a39bec0e24	tests: extract streamed_mutation assertions Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com> (cherry picked from commit `50469e5ef3`)	2016-07-27 14:05:43 +03:00
Raphael S. Carvalho	66ebef7d10	tests: add new test for date tiered strategy This test set the time window to 1 hour and checks that the strategy works accordingly. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> (cherry picked from commit `cf54af9e58`)	2016-07-21 12:00:26 +03:00
Raphael S. Carvalho	7b9cf528ad	tests: fix occassional failure in date tiered test That was a bug in the test itself. It could happen that a sstable would incorrectly belong to the next time window if the current minute is approaching its end. Fix is about having all sstables that we want in the same time window with the same min/max timestamp. Fixes #1448. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <ee25d49e7ed12b4cf7d018a08163404c3d122e56.1468782787.git.raphaelsc@scylladb.com>	2016-07-18 15:18:29 +02:00
Paweł Dziepak	93cc4454a6	streamed_mutation: emit range_tombstones directly Originally, streamed_mutations guaranteed that emitted tombstones are disjoint. In order to achieve that two separate objects were produced for each range tombstone: range_tombstone_begin and range_tombstone_end. Unfortunately, this forced sstable writer to accumulate all clustering rows between range_tombstone_begin and range_tombstone_end. However, since there is no need to write disjoint tombstones to sstables (see #1153 "Write range tombstones to sstables like Cassandra does") it is also not necessary for streamed_mutations to produce disjoint range tombstones. This patch changes that by making streamed_mutation produce range_tombstone objects directly. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-07-13 09:51:18 +01:00
Raphael S. Carvalho	b5ec4d46c6	tests: add test for date tiered compaction strategy Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-07-06 02:11:47 -03:00
Raphael S. Carvalho	cab2892866	tests: add test for sstables::get_fully_expired_sstables Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-07-06 02:11:47 -03:00
Raphael S. Carvalho	69b3860662	tests: add test for leveled_manifest::overlapping Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-07-06 02:11:45 -03:00
Raphael S. Carvalho	1118cfc51a	tests: test that sstable max_local_deletion_time is properly updated Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-07-06 01:13:34 -03:00
Avi Kivity	2a46410f4a	Change sstable_list from a map to a set sstable_list is now a map<generation, sstable>; change it to a set in preparation for replacing it with sstable_set. The change simplifies a lot of code; the only casualty is the code that computes the highest generation number.	2016-07-03 10:26:57 +03:00
Paweł Dziepak	71e961427a	test/sstables: test reading sstables with incorrect ordering Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-06-20 21:29:50 +01:00
Paweł Dziepak	b6f78a8e2f	sstable: make sstable reads return streamed_mutation Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-06-20 21:29:50 +01:00
Paweł Dziepak	125c4e20e2	tests/sstables: add test for sliced mutation reads Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-06-20 21:29:50 +01:00
Paweł Dziepak	737eb73499	mutation_reader: make readers return streamed_mutations Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-06-20 21:29:50 +01:00
Duarte Nunes	91aac30f12	mutations: Row tombstones are now a set of ranges This patch changes the type of the mutation partition's row_tombstones to be a range_tombstone_list, so that they are now represented as a set of disjoint ranges. All of its usages are updated accordingly. Fixes #1155 Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-06-02 16:21:59 +02:00
Duarte Nunes	dc8319ed91	keys: Remove schema argument from make_empty An empty key is independent of the schema. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-06-02 16:21:36 +02:00
Raphael S. Carvalho	cbc2e96a58	tests: check that overlapping sstable has its level changed to 0 Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-05-17 11:11:05 -03:00
Raphael S. Carvalho	5aeeb0b3e8	compaction: add support to parallel compaction on the same column family It was noticed that small sstables will accumulate for a column family because scylla was limited to two compaction per shard, and a column family could have at most one compaction running at a given shard. With the number of sstables increasing rapidly, read performance is degraded. At the moment, our compaction manager works by running two compaction task handlers that run in parallel to the rest of the system. Each task handler gets to run when needed, gets a column family from compaction manager queue, runs compaction on it, and goes to sleep again. That's basically its cycle. Compaction manager only allows one instance of a column family to be on its queue, meaning that it's impossible for a column family to be compacted in parallel. One compaction starts after another for a given column family. To solve the problem described, we want to concurrently run compaction jobs of a column family that have different "size tier" (or "weight"). For those unfamiliar, compaction job contains a list of sstables that will be compacted together. The "size tier" of a compaction job is the log of the total size of the input sstables. So a compaction job only gets to run if its "size tier" is not the same of an ongoing compaction. There is no point in compacting concurrently at the same "size tier", because that slows down both compactions. We will no longer queue column families in compaction manager. Instead, we create a new fiber to run compaction on demand. This fiber that runs asynchronously will do the following: 1) Get a compaction job from compaction strategy. 2) Calculate "size tier" of compaction job. 3) Run compaction job if its "size tier" is not the same of an ongoing compaction for the given column family. As before, it may decide to re-compact a column family based on a stat stored in column family object. Ran all compaction-related dtests. Fixes #1216. Reviewed-by: Nadav Har'El <nyh@scylladb.com> Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <d30952ff136192a522bde4351926130addec8852.1462311908.git.raphaelsc@scylladb.com>	2016-05-04 11:46:09 +03:00
Raphael S. Carvalho	eb51c93a5a	tests: fix use-after-free in sstable test After commit `a843aea547`, a gate was introduced to make sure that an asynchronous operation is finished before column family is destroyed. A sstable testcase was not stopping column family, instead it just removed column family from compaction manager. That could cause an user-after-free if column family is destroyed while the asynchronous operation is running. Let's fix it by stopping column family in the test. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <ed910ec459c1752148099e6dc503e7f3adee54da.1461177411.git.raphaelsc@scylladb.com>	2016-04-20 22:08:08 +03:00
Raphael S. Carvalho	beaacbda2e	tests: test that leveled strategy was fixed L1 wasn't being compacted into L2. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <1a357896a448eafa7da4d28bc56fa02b89d4193e.1460508373.git.raphaelsc@scylladb.com>	2016-04-13 11:14:28 +03:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Glauber Costa	a339296385	database: turn sstable generation number into an optional This patch makes sure that every time we need to create a new generation number - the very first step in the creation of a new SSTable, the respective CF is already initialized and populated. Failure to do so can lead to data being overwritten. Extensive details about why this is important can be found in Scylla's Github Issue #1014 Nothing should be writing to SSTables before we have the chance to populate the existing SSTables and calculate what should the next generation number be. However, if that happens, we want to protect against it in a way that does not involve overwriting existing tables. This is one of the ways to do it: every column family starts in an unwriteable state, and when it can finally be written to, we mark it as writeable. Note that this cannot be a part of add_column_family. That adds a column family to a db in memory only, and if anybody is about to write to a CF, that was most likely already called. We need to call this explicitly when we are sure we're ready to issue disk operations safely. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2016-03-10 21:06:05 -05:00
Raphael S. Carvalho	7f0371129c	tests: sstable_test: submit compaction request through column family That's needed for reverted commit `9586793c` to work. It's also the correct thing to do, i.e. column family submits itself to manager. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <2a1d141ad929c1957933f57412083dd52af0390b.1456415398.git.raphaelsc@scylladb.com>	2016-02-25 18:02:00 +02:00
Calle Wilund	159dbe3a64	sstable_datafile_tests: Replace '---' with auto Fixes compilation issues on some g++. Message-Id: <1454323749-21933-1-git-send-email-calle@scylladb.com>	2016-02-01 12:58:33 +02:00
Raphael S. Carvalho	a46aa47ab1	make sstables::compact_sstables return list of created sstables Now, sstables::compact_sstables() receives as input a list of sstables to be compacted, and outputs a list of sstables generated by compaction. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <0d8397f0395ce560a7c83cccf6e897a7f464d030.1454110234.git.raphaelsc@scylladb.com>	2016-01-31 12:39:20 +02:00
Raphael S. Carvalho	2164aa8d5b	move compaction manager from /utils to /sstables Compaction manager was initially created at utils because it was more generic, and wasn't only intended for compaction. It was more like a task handler based on futures, but now it's only intended to manage compaction tasks, and thus should be moved elsewhere. /sstables is where compaction code is located. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-01-21 15:23:05 -02:00
Raphael S. Carvalho	0c67b1d22b	compaction: filter out mutation that doesn't belong to shard When compacting sstable, mutation that doesn't belong to current shard should be filtered out. Otherwise, mutation would be duplicated in all shards that share the sstable being compacted. sstable_test will now run with -c1 because arbitrary keys are chosen for sstables to be compacted, so test could fail because of mutations being filtered out. fixes #527. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <1acc2e8b9c66fb9c0c601b05e3ae4353e514ead5.1453140657.git.raphaelsc@scylladb.com>	2016-01-19 10:16:41 +01:00
Tomasz Grabiec	5a1587353f	tests: Don't depend on partition_key representation Representation format is an implementation detail of partition_key. Code which compares a value to representation makes assumptions about key's representation. Compare keys to keys instead. Message-Id: <1453136316-18125-1-git-send-email-tgrabiec@scylladb.com>	2016-01-18 19:01:56 +02:00
Pekka Enberg	7d3a3bd201	Merge "column family cleanup support" from Raphael "This patch is intended to add support to column family cleanup, which will make 'nodetool cleanup' possible. Why is this feature needed? Remove irrelevant data from a node that loses part of its token range to a newly added node."	2016-01-18 10:15:05 +02:00
Paweł Dziepak	dbf23fdff5	tests/sstable: add test for multi schema Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-01-15 13:12:40 +01:00
Raphael S. Carvalho	d44a5d1e94	compaction: filter out compacting sstables The implementation is about storing generation of compacting sstables in an unordered set per column family, so before strategy is called, compaction manager will filter out compacting sstables. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-01-12 01:18:29 -02:00
Raphael S. Carvalho	68619211f5	tests: add test to a sstable rewrite Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-01-11 21:43:41 -02:00
Raphael S. Carvalho	a81b660c0d	tests: check that tombstone is purged under certain conditions Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-01-05 15:19:21 +01:00
Glauber Costa	74fbd8fac0	do not call open_file_dma directly We have an API that wraps open_file_dma which we use in some places, but in many other places we call the reactor version directly. This patch changes the latter to match the former. It will have the added benefit of allowing us to make easier changes to these interfaces if needed. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <29296e4ec6f5e84361992028fe3f27adc569f139.1451950408.git.glauber@scylladb.com>	2016-01-05 10:37:57 +02:00
Paweł Dziepak	b39d1fb1fc	test/sstables: add more key_reader tests This patch introduces a test for reading keys from a single sstable with the range begining and end being the keys present in the index summary. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-12-18 17:24:29 +01:00
Tomasz Grabiec	179b587d62	Abstract timestamp creation behind new_timestamp() Replace db_clock::now_in_usec() and db_clock::now() * 1000 accesses where the intent is to create a new auto-generate cell timestamp with a call to new_timestamp(). Now the knowledge of how to create timestamps is in a single place.	2015-12-15 15:16:04 +02:00
Avi Kivity	47499dcf18	data_value: make conversion from bytes explicit Since bytes is a very generic value that is returned from many calls, it is easy to pass it by mistake to a function expecting a data_value, and to get a wrong result. It is impossible for the data_value constructor to know if the argument is a genuine bytes variable, a data_value of another type, but serialized, or some other serialized data type. To prevent misuse, make the data_value(bytes) constructor (and complementary data_value(optional<bytes>) explicit.	2015-11-13 17:12:29 +02:00
Avi Kivity	2c3591cbd9	data_value de-any-fication We use boost::any to convert to and from database values (stored in serlialized form) and native C++ values. boost::any captures information about the data type (how to copy/move/delete etc.) and stores it inside the boost::any instance. We later retrieve the real value using boost::any_cast. However, data_value (which has a boost::any member) already has type information as a data_type instance. By teaching data_type intances about the corresponding native type, we can elimiante the use of boost::any. While boost::any is evil and eliminating it improves efficiency somewhat, the real goal is growing native type support in data_type. We will use that later to store native types in the cache, enabling O(log n) access to collections, O(1) access to tuples, and more efficient large blob support.	2015-10-30 17:38:51 +01:00
Raphael S. Carvalho	5cdc886f84	tests: add test to read_indexes Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2015-10-23 16:57:41 -02:00
Avi Kivity	5f3a46eabb	Merge "load_new_sstables" from Glauber "This patchset implements load_new_sstables, allowing one to move tables inside the data directory of a CF, and then call "nodetool refresh" to start using them. Keep in mind that for Cassandra, this is deemed an unsafe operation: https://issues.apache.org/jira/browse/CASSANDRA-6245 It is still for us something we should not recommend - unless the CF is totally empty and not yet used, but we can do a much better job in the safety front. To guarantee that, the process works in four steps: 1) All writes to this specific column family are disabled. This is a horrible thing to do, because dirty memory can grow much more than desired during this. Throughout out this implementation, we will try to keep the time during which the writes are disabled to its bare minimum. While disabling the writes, each shard will tell us about the highest generation number it has seen. 2) We will scan all tables that we haven't seen before. Those are any tables found in the CF datadir, that are higher than the highest generation number seen so far. We will link them to new generation numbers that are sequential to the ones we have so far, and end up with a new generation number that is returned to the next step 3) The generation number computed in the previous step is now propagated to all CFs, which guarantees that all further writes will pick generation numbers that won't conflict with the existing tables. Right after doing that, the writes are resumed. 4) The tables we found in step 2 are passed on to each of the CFs. They can now load those tables while operations to the CF proceed normally."	2015-10-22 13:42:24 +03:00
Glauber Costa	f3bad2032d	database: fix type for sstable generation. Avoid using long for it, and let's use a fixed size instead. Let's do signed instead of unsigned to avoid upsetting any code that we may have converted. Signed-off-by: Glauber Costa <glommer@scylladb.com>	2015-10-21 18:01:20 +02:00
Paweł Dziepak	68e1b1f613	tests/sstables: test sstable key_reader Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-10-20 20:27:47 +02:00
Raphael S. Carvalho	c3a9d342f4	range: rename overlap to overlaps overlaps() is more grammatical. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2015-10-20 17:23:35 +03:00
Raphael S. Carvalho	28ef8feffa	tests: fix test for leveled compaction test_setup::do_with_test_directory is missing. For some reason, the test wasn't failing without it until now. Adding it is the correct thing to do anyway. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2015-10-19 17:07:08 +03:00
Raphael S. Carvalho	e555ad6370	tests: add tests for leveled compaction Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com>	2015-10-16 01:57:08 -03:00
Raphael S. Carvalho	35b75e9b67	adapt compaction procedure to support leveled strategy Adapt our compaction code to start writing a new sstable if the one being written reached its maximum size. Leveled strategy works with that concept. If a strategy other than leveled is being used, everything will work as before. Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com>	2015-10-16 01:54:52 -03:00
Raphael S. Carvalho	d05a5fbeb4	tests: add testcase for bug on consumer parser problem described by commit: `3926748594` Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com>	2015-10-13 11:49:57 -03:00
Glauber Costa	639ba2b99d	incremental backups: move control to the CF level Currently, we control incremental backups behavior from the storage service. This creates some very concrete problems, since the storage service is not always available and initialized. The solution is to move it to the column family (and to the keyspace so we can properly propagate the conf file value). When we change this from the api, we will have to iterate over all of them, changing the value accordingly. Signed-off-by: Glauber Costa <glommer@scylladb.com>	2015-10-05 13:16:11 +02:00
Avi Kivity	d5cf0fb2b1	Add license notices	2015-09-20 10:43:39 +03:00
Paweł Dziepak	b17f5c442f	tests/sstable: uncomment part of compaction test Signed-off-by: Paweł Dziepak <pdziepak@cloudius-systems.com>	2015-09-07 21:21:38 +02:00

1 2

65 Commits