scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-21 00:50:35 +00:00

Author	SHA1	Message	Date
Tomasz Grabiec	8c4b5e4283	db: Avoiding checking bloom filters during compaction Checking bloom filters of sstables to compute max purgeable timestamp for compaction is expensive in terms of CPU time. We can avoid calculating it if we're not about to GC any tombstone. This patch changes compacting functions to accept a function instead of ready value for max_purgeable. I verified that bloom filter operations no longer appear on flame graphs during compaction-heavy workload (without tombstones). Refs #1322.	2016-07-10 09:54:20 +02:00
Paweł Dziepak	983321f194	tests/mutation: do not create memtable on stack Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-06-20 21:29:51 +01:00
Paweł Dziepak	e4ae7894d4	tests/mutation: test slicing mutations Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-06-20 21:29:51 +01:00
Paweł Dziepak	737eb73499	mutation_reader: make readers return streamed_mutations Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-06-20 21:29:50 +01:00
Duarte Nunes	dc8319ed91	keys: Remove schema argument from make_empty An empty key is independent of the schema. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-06-02 16:21:36 +02:00
Duarte Nunes	a15ed3c60f	mutation_test: Specify tmp data dir Otherwise we attempt to create sstable files under /. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1464618602-1124-1-git-send-email-duarte@scylladb.com>	2016-05-30 20:34:47 +02:00
Avi Kivity	db03295c8a	Merge "Fix query digest mismatch" from Tomasz "Currently data query digest includes cells and tombstones which may have expired or be covered by higher-level tombstones. This causes digest mismatch between replicas if some elements are compacted on one of the nodes and not on others. This mismatch triggers read-repair which doesn't resolve because mutations received by mutation queries are not differing, they are compacted already. The fix adds compacting step before writing and digesting query results by reusing the algorithm used by mutation query. This is not the most optimal way to fix this. The compaction step could be folded with the query writing, there is redundancy in both steps. However such change carries more risk, and thus was postponed. perf_simple_query test (cassandra-stress-like partitions) shows regression from 83k to 77k (7%) ops/s. Fixes #1165."	2016-04-08 12:13:29 +03:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Tomasz Grabiec	474a35ba6b	tests: Add test for query digest calculation	2016-04-07 19:57:19 +02:00
Tomasz Grabiec	5d768d0681	tests: mutation_test: Move mutation generator to mutation_source_test.hh So that it can be reused.	2016-04-07 19:57:19 +02:00
Tomasz Grabiec	30d25bc47a	tests: mutation_test: Add test case for querying of expired cells	2016-04-07 19:57:19 +02:00
Tomasz Grabiec	2fbb55929d	mutation_test: Add allocation failure stress test for apply() The test injects allocation failures at every allocation site during apply(). Only allocations throug allocation_strategy are instrumented, but currently those should include all allocations in the apply() path. The target and source mutations are randomized.	2016-03-21 21:49:53 +01:00
Tomasz Grabiec	8ede27f9c6	mutation_test: Add more apply() tests	2016-03-21 21:49:53 +01:00
Tomasz Grabiec	36575d9f01	mutation_test: Hoist make_blob() to a function	2016-03-21 21:49:53 +01:00
Tomasz Grabiec	4c85d06df7	mutation_test: Make make_blob() return different blob each time random_bytes was constructed with the same seed each time.	2016-03-21 21:49:53 +01:00
Tomasz Grabiec	19b3df9f0f	mutation_test: Fix use-after-free The problem was that verify_row() was returning a future which was not waited on. Fix by running the code in a thread.	2016-03-21 21:49:53 +01:00
Benoît Canet	1fb9a48ac5	exception: Optionally shutdown communication on I/O errors. I/O errors cannot be fixed by Scylla the only solution is to shutdown the database communications. Signed-off-by: Benoît Canet <benoit@scylladb.com> Message-Id: <1458154098-9977-1-git-send-email-benoit@scylladb.com>	2016-03-17 15:02:52 +02:00
Glauber Costa	a339296385	database: turn sstable generation number into an optional This patch makes sure that every time we need to create a new generation number - the very first step in the creation of a new SSTable, the respective CF is already initialized and populated. Failure to do so can lead to data being overwritten. Extensive details about why this is important can be found in Scylla's Github Issue #1014 Nothing should be writing to SSTables before we have the chance to populate the existing SSTables and calculate what should the next generation number be. However, if that happens, we want to protect against it in a way that does not involve overwriting existing tables. This is one of the ways to do it: every column family starts in an unwriteable state, and when it can finally be written to, we mark it as writeable. Note that this cannot be a part of add_column_family. That adds a column family to a db in memory only, and if anybody is about to write to a CF, that was most likely already called. We need to call this explicitly when we are sure we're ready to issue disk operations safely. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2016-03-10 21:06:05 -05:00
Tomasz Grabiec	4e5a52d6fa	db: Make read interface schema version aware The intent is to make data returned by queries always conform to a single schema version, which is requested by the client. For CQL queries, for example, we want to use the same schema which was used to compile the query. The other node expects to receive data conforming to the requested schema. Interface on shard level accepts schema_ptr, across nodes we use table_schema_version UUID. To transfer schema_ptr across shards, we use global_schema_ptr. Because schema is identified with UUID across nodes, requestors must be prepared for being queried for the definition of the schema. They must hold a live schema_ptr around the request. This guarantees that schema_registry will always know about the requested version. This is not an issue because for queries the requestor needs to hold on to the schema anyway to be able to interpret the results. But care must be taken to always use the same schema version for making the request and parsing the results. Schema requesting across nodes is currently stubbed (throws runtime exception).	2016-01-11 10:34:52 +01:00
Tomasz Grabiec	036974e19b	Make mutation interfaces support multiple versions Schema is tracked in memtable and cache per-entry. Entries are upgraded lazily on access. Incoming mutations are upgraded to table's current schema on given shard. Mutating nodes need to keep schema_ptr alive in case schema version is requested by target node.	2016-01-11 10:34:51 +01:00
Tomasz Grabiec	5184381a0b	memtable: Deconstify memtable in readers We want to upgrade entries on read and for that we need mutating permission.	2016-01-11 10:34:51 +01:00
Tomasz Grabiec	3e447e4ad1	tests: mutation_test: Add tests for equality and hashing	2016-01-11 10:34:50 +01:00
Tomasz Grabiec	4b92ef01fc	test: Add tests for mutation upgrade	2016-01-08 21:10:26 +01:00
Raphael S. Carvalho	03eee06784	remove empty rows in mutation_partition::do_compact do_compact() wasn't removing an empty row that is covered by a tombstone. As a result, an empty partition could be written to a sstable. To solve this problem, let's make trim_rows remove a row that is considered to be empty. A row is empty if it has no tombstone, no marker and no cells. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-01-05 15:19:21 +01:00
Avi Kivity	5c5331d910	tests: test large blobs in memtables	2015-12-08 15:17:09 +02:00
Tomasz Grabiec	7e0f99cc3b	Merge tag 'native-preparatory/v1' from https://github.com/avikivity/scylla.git Assorted patches that pave the way for native storage (while not committing us in any way).	2015-11-16 10:01:38 +01:00
Glauber Costa	0989f80c97	provide cf_stats where one is needed Recently, I have introduced cf_stats into the database, propagating all the way back to the column family. The problem, however, is that some tests create a column family config themselves instead of going through make_column_family. That is ultimately ok if those tests are not expected to flush memtables. But if they are, the cf_stats pointer will be null and we will crash. Although there are many solutions to this, the one that is in tune with our current practices is to have the test that requires it provide an empty cf_stats storage area that can be written to. That's already how we handle the disk directory and other things like compaction properties. With this patch, test.py passes again. Signed-off-by: Glauber Costa <glommer@scylladb.com>	2015-11-15 10:31:32 +02:00
Avi Kivity	47499dcf18	data_value: make conversion from bytes explicit Since bytes is a very generic value that is returned from many calls, it is easy to pass it by mistake to a function expecting a data_value, and to get a wrong result. It is impossible for the data_value constructor to know if the argument is a genuine bytes variable, a data_value of another type, but serialized, or some other serialized data type. To prevent misuse, make the data_value(bytes) constructor (and complementary data_value(optional<bytes>) explicit.	2015-11-13 17:12:29 +02:00
Avi Kivity	2c3591cbd9	data_value de-any-fication We use boost::any to convert to and from database values (stored in serlialized form) and native C++ values. boost::any captures information about the data type (how to copy/move/delete etc.) and stores it inside the boost::any instance. We later retrieve the real value using boost::any_cast. However, data_value (which has a boost::any member) already has type information as a data_type instance. By teaching data_type intances about the corresponding native type, we can elimiante the use of boost::any. While boost::any is evil and eliminating it improves efficiency somewhat, the real goal is growing native type support in data_type. We will use that later to store native types in the cache, enabling O(log n) access to collections, O(1) access to tuples, and more efficient large blob support.	2015-10-30 17:38:51 +01:00
Paweł Dziepak	740e2166c5	tests/mutation: add test for mutation diff Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-10-22 12:08:53 +02:00
Glauber Costa	639ba2b99d	incremental backups: move control to the CF level Currently, we control incremental backups behavior from the storage service. This creates some very concrete problems, since the storage service is not always available and initialized. The solution is to move it to the column family (and to the keyspace so we can properly propagate the conf file value). When we change this from the api, we will have to iterate over all of them, changing the value accordingly. Signed-off-by: Glauber Costa <glommer@scylladb.com>	2015-10-05 13:16:11 +02:00
Avi Kivity	d5cf0fb2b1	Add license notices	2015-09-20 10:43:39 +03:00
Tomasz Grabiec	b2c2eb6cd2	tests: Add test exploiting flush while scanning issue	2015-09-09 10:38:43 +02:00
Tomasz Grabiec	b5845e96e5	tests: Fix liveness issue in mutation_test	2015-09-09 10:38:43 +02:00
Tomasz Grabiec	29e33dee4a	tests: mutation_test: Restore indentation	2015-08-31 21:50:17 +02:00
Tomasz Grabiec	ff8c81b25f	memtable: Encapsulate unsafe accessors	2015-08-31 21:50:17 +02:00
Raphael S. Carvalho	9823164c89	db: introduce compaction manager Currently, each column family creates a fiber to handle compaction requests in parallel to the system. If there are N column families, N compactions could be running in parallel, which is definitely horrible. To solve that problem, a per-database compaction manager is introduced here. Compaction manager is a feature used to service compaction requests from N column families. Parallelism is made available by creating more than one fiber to service the requests. That being said, N compaction requests will be served by M fibers. A compaction request being submitted will go to a job queue shared between all fibers, and the fiber with the lowest amount of pending jobs will be signalled. Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com>	2015-08-11 17:25:46 +03:00
Avi Kivity	c720cddc5c	tests: mv tests/urchin/* -> tests/ Now that seastar is in a separate repository, we can use the tests/ directory.	2015-08-05 14:16:52 +03:00

38 Commits