scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-23 01:50:35 +00:00

Author	SHA1	Message	Date
Avi Kivity	522f23b830	Merge "Schema table cleanups" from Pekka "Clean up the schema table code. Be explicit that we don't support Cassandra 3.0 and eliminate some dead code."	2015-08-05 15:09:59 +03:00
Raphael S. Carvalho	3ddb9be984	db: fix compaction on an empty column family When forcing a compaction on a column family with no sstables, an assert will fail because there is no sstables to be compacted. This problem is fixed by ignoring a compaction request when no sstable is provided. Fixes #61. Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com> Reviewed-by: Nadav Har'El <nyh@cloudius-systems.com>	2015-08-05 14:04:22 +03:00
Pekka Enberg	99a80050e3	db: Rename legacy_schema_tables to schema_tables There's nothing legacy about it so rename legacy_schema_tables to schema_tables. The naming comes from a Cassandra 3.x development branch which is not relevant for us in the near future. Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-08-05 13:56:47 +03:00
Avi Kivity	55ca295154	Merge "Initial CQL event support" from Pekka "This series implements initial support for CQL events. We introduce migration_listener hook in migration manager as well as event notifier in the CQL server that's built on top of it to send out the events via CQL binary protocol. We also wire up create keyspace events to the system so subscribed clients are notified when a new keyspace is created. There's still more work to be done to support all the events. That requires some work to restructure existing code so it's better to merge this initial series now and avoid future code conflicts."	2015-08-05 12:56:37 +03:00
Pekka Enberg	618ba067bf	database: Wire up create keyspace listener hook Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-08-05 11:50:52 +03:00
Pekka Enberg	05c23c7f73	database: Add create_keyspace_on_all() helper Add a create_keyspace_on_all() helper which is needed for sending just one event notification per created keyspace, not one per shard. Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-08-05 11:50:52 +03:00
Paweł Dziepak	8a0d21b8b8	query: support option distinct in partition_slice In case of SELECT DISTINCT statments we are not intersted in clustering keys at all. The only important information is whether partition key exists and what's in static row (if it exists). Signed-off-by: Paweł Dziepak <pdziepak@cloudius-systems.com>	2015-08-04 15:39:42 +02:00
Pekka Enberg	a3c95235e6	migration_manager: Make stateful with sharded<> In preparation for adding listener state to migration manager, use sharded<> for migration manager. Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-08-04 11:23:23 +03:00
Raphael S. Carvalho	34eaeedff2	db: remove imprecise log message about compaction This message is printed when we are about to run the strategy code which may not decide to compact anything. Compaction is already properly logged in sstables::compact_sstables(). Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com>	2015-08-04 00:15:50 +03:00
Avi Kivity	9663ea86e9	db: fix test for whether an sstable includes a shard's range Spotted by Raphael and Nadav.	2015-08-04 00:14:22 +03:00
Avi Kivity	c1a2831d41	db: ignore sstables that clearly don't belong to this shard	2015-08-03 20:17:41 +03:00
Pekka Enberg	e22f5a1cd7	database: Add CF UUID validation to update Add CF UUID validation to update table paths to make us behave like Origin for parallel table creation. Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-08-03 13:41:16 +03:00
Pekka Enberg	0b762338c1	database: Futurize update_column_family() Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-08-03 13:41:16 +03:00
Shlomi Livne	199f4d2545	Add enable-in-memory-data-store,enable-commitlog,enable-cache config Abillity to enable/disable specific sub-modules - this settings do not affect system tables which are allways persisted,cached and written to commitlog enable-in-memory-data-store marks if tables will be written/read to/from disk enable-commitllog marks if tables will be written to commitlog enable-cache marks if tables will be written/read to/from cache Please note in-memory-data-store does not change the read path so "old" sstables are still read and cache may be used to cache their data Signed-off-by: Shlomi Livne <shlomi@cloudius-systems.com>	2015-08-02 17:19:30 +03:00
Avi Kivity	98ec451d6a	Extract range<> into its own header It's not just for queries any more.	2015-08-02 16:07:42 +03:00
Raphael S. Carvalho	6bc822dd71	db: fix problem with initialization of a column family We should only call column_family::start after the checks because if a check failed, column_family would be destroyed without column_family::stop being called first, and that would lead to a problem, such as _compaction_done future not being resolved. Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com>	2015-07-31 13:03:07 +03:00
Raphael S. Carvalho	d791438a43	db: enable automatic compaction by default So far, automatic compaction was disabled, but now that we support size-tiered strategy, the default compaction strategy algorithm, we could definitely enable automatic compaction by default. Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com>	2015-07-29 19:02:16 +03:00
Raphael S. Carvalho	5a70c8c8f4	db: implement retry policy for compaction Currently, compaction will no longer happen for a column family which a compaction failed for some unexpected reason. We want to implement a retry policy that will sleep for a while until the next compaction attempt. This patch implements retry policy for compaction using exponential_backoff_retry. With exponential_backoff_retry, the sleep time grows exponentially with the number of retries until the maximum sleep time is reached. For compaction specifically, the base sleep time will be 5 seconds and the maximum sleeping time will be 300 seconds, i.e. 5 minutes. If compaction succeeded after a retry, the sleep time will be reset to the base sleep time. Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com>	2015-07-28 18:24:04 -03:00
Avi Kivity	b3b0d5150a	main: fix system tables vs. sstables initialization race We must wait for the system tables to be loaded on all shards before populating the other keyspaces, or we might miss some keyspaces or column families. This is hinted at by the fact that we use storage_proxy, which isn't usable until the system keyspace is ready. Credit to Tomek for identifying the problem and the fix.	2015-07-28 09:49:11 +02:00
Avi Kivity	2e745bebad	Merge "use compaction strategy options" from Raphael	2015-07-27 17:06:43 +03:00
Raphael S. Carvalho	15bbb71b7b	db: handle compaction exception outside keep doing Otherwise, we would needlessly handle it twice. Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com>	2015-07-24 19:12:34 -03:00
Raphael S. Carvalho	5f89f80ae5	Revert "db: dont rethrow exceptions for termination of compaction fiber" Actually we should rethrow exceptions because they are needed for keep_doing() to finish. Otherwise, the future _compaction_done will never be resolved. This reverts commit `89698b0d1c`.	2015-07-24 19:07:47 -03:00
Raphael S. Carvalho	634d00511b	compaction: use compaction options in strategy Support to compaction strategy options was recently added. Previously, we were using default values in compaction strategy for options, but now we can use the options defined in the schema. Currently, we only support size-tiered strategy, so let's start with it. Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com>	2015-07-23 15:26:47 -03:00
Glauber Costa	d1496944d9	sstables: handle compaction strategy Signed-off-by: Glauber Costa <glommer@cloudius-systems.com>	2015-07-23 00:02:11 -04:00
Avi Kivity	8870bf1bf8	Merge "Handling of non-full partition range queries" from Tomasz	2015-07-22 15:18:02 +03:00
Tomasz Grabiec	f9da612581	memtable: Implement range queries	2015-07-22 13:14:33 +02:00
Tomasz Grabiec	152582a869	sstables: Add read_range_rows() variant which takes a partition_range	2015-07-22 13:13:38 +02:00
Pekka Enberg	791031fbc7	database: Extract update_schema_version_and_announce() function It's needed in storage proxy. Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-07-22 11:57:00 +03:00
Tomasz Grabiec	0b0ea04958	range: Remove start_value() and end_value() It's easy to miss that they may be undefined. start() and end(), which return optional<bound> const&, make it clear.	2015-07-22 10:27:47 +02:00
Tomasz Grabiec	4a18693a23	db: Remove dead code	2015-07-22 10:27:47 +02:00
Raphael S. Carvalho	89698b0d1c	db: dont rethrow exceptions for termination of compaction fiber broken_semaphore and seastar::gate_closed_exception exceptions are used for regular termination of compaction fiber, which otherwise would live forever. We shouldn't re-throw these exceptions, but instead only print a log message. Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com>	2015-07-22 11:23:58 +03:00
Avi Kivity	8ba5d19db5	db: avoid ubsan false-positive in query_state move constructor The value is moved before initialization due to a do_with(). It's harmless, but better to silence the warning.	2015-07-21 12:19:54 +03:00
Raphael S. Carvalho	6ae3ffa319	database: add get_sstables to column_family Returns all sstables added to a given column_family. Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com>	2015-07-20 10:08:09 -03:00
Raphael S. Carvalho	ebbc7aa43e	database: add compact_sstables to column_family compact_all_sstables is about selecting all available sstables for compaction and executing a compaction code on them. This compaction code was moved to a more generic function called compact_sstables, which will compact a list of given sstables. Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com>	2015-07-20 10:08:02 -03:00
Avi Kivity	6ade74b7c3	db: recover sstable generation counter on startup Don't attempt to overwrite an existing sstable.	2015-07-20 12:00:34 +02:00
Glauber Costa	4250b7dd64	database: do not use commitlog constructor if there is no commitlog Tomek pointed out that we shouldn't be passing a reference to commitlog every time we use the add_column_family interface, because that will at times pass a reference to a null object. Test that, and pass no_commitlog if there is none. Signed-off-by: Glauber Costa <glommer@cloudius-systems.com>	2015-07-16 20:04:29 +03:00
Pekka Enberg	81cddec777	database: Add versioning support Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-07-16 14:53:30 +03:00
Raphael S. Carvalho	719898d0e5	introduce automatic compaction As the name implies, this patch introduces the concept of automatic compaction for sstables. Compaction task is triggered whenever a new sstable is written. Concurrent compaction on the same column family isn't supported, so compaction may be postponed if there is an ongoing compression. In addition, seastar::gate is used both to prevent a new compaction from starting and to wait for an ongoing compaction to finish, when the system is asked for a shutdown. This patch also introduces an abstract class for compaction strategy, which is really useful for supporting multiple strategies. Currently, null and major compaction strategies are supported. As the name implies, null compaction strategy does nothing. Major compaction strategy is about compacting all sstables into one. This strategy may end up being helpful when adding support to major compaction via nodetool. Signed-off-by: Raphael S. Carvalho <raphaelsc@cloudius-systems.com>	2015-07-16 12:00:12 +03:00
Glauber Costa	9c464aff9b	database: clean up various APIs In much of our column_families APIs, we need to pass a pointer to the database. The only reason we do that, is so we can properly handle the commit log entries after we seal the current memtables into sstables. Now that we store a pointer to the commit log in the CF itself at the time it is created, we no longer have to do it. As a result, the APIs are a lot cleaner, with no gratuitous parameters. My motivation for this was the flush method, but as a result, apply() also gets cleaner. Signed-off-by: Glauber Costa <glommer@cloudius-systems.com>	2015-07-15 10:24:20 -04:00
Glauber Costa	ad46daa6aa	column_family: add the commitlog as a parameter When we create a column family, we can pass as an extra parameter, the commitlog - or lack thereof. Because the commitlog is optional to begin with - it won't exist if we don't call init_commitlog, we can have this to be empty meaning no commit log. The creation of a column family should be always done through add_column_family. And if that is the case, we have the database's commitlog right there and can get the pointer through the db. Only tests are not creating the column family this way, and for them, it is fine. We want to do that, because some column family operations will use the commit log. Right now, they are forcing us to add parameters to APIs that would be much cleaner without it. So while separation is good, this level of coupling is a net win as it allows us to clean up some visible APIs. Signed-off-by: Glauber Costa <glommer@cloudius-systems.com>	2015-07-15 10:24:20 -04:00
Avi Kivity	99a15de9e5	logger: de-thread_local-ize logger The logger class constructor registers itself with the logger registry, in order to enable dynamically setting log levels. However, since thread_local variables may be (and are) initialized at the time of first use, when the program starts up no loggers are registered. Fix by making loggers global, not thread_local. This requires that the registry use locking to prevent registration happening on different threads from corrupting the registry. Note that technically global variables can also be initialized at the point of first use, and there is no portable way for classes to self-register. However this is the best we can do.	2015-07-14 17:18:11 +03:00
Tomasz Grabiec	9bea6aa0a3	db: Introduce mutation query interface Mutation query differs from data query in that returns information needed to reconcile data slice with that retruned by other data sources. There is a generic mutation_query() algorithm introduced, which can work with any mutation_source. database::query_mutations() is a shard-local interface for mutation queries. The reconcilable_result is introduced as a medium for mutation query results. It piggy backs on frozen_mutation as a medium for reconcilable data.	2015-07-12 12:51:38 +02:00
Tomasz Grabiec	9724b84bb3	db: Fix query of partitions with no live clustered rows When partition has no live regular rows, but has some data live in the static row, then it should appear in the results, even though we didn't select any static column. To reproduce: create table cf (k blob, c blob, v blob, s1 blob static, primary key (k, c)); update cf set s1 = 0x01 where k = 0x01; update cf set s1 = 0x02 where k = 0x02; select k from cf; The "select" statement should return 2 rows, but was returning 0. The following query worked fine, because static columns were included: select * from cf; The data query should contain only live data, so we shouldn't write a partition entry if it's supposed to be absent from the results. We can'r tell that though until we've processed all the data. To solve this problem, query result writer is using an optimistic approach, where the partition header will be retracted from the buffer (cheaply), if it turns out there's no live data in it.	2015-07-09 19:55:00 +02:00
Tomasz Grabiec	09ed972068	mutation_partition: Remove redundant slice parameter from query() The slice used by partition_writer must match the one used by query() anyway.	2015-07-09 19:47:32 +02:00
Tomasz Grabiec	8a18d2b699	Extract memtable implementation to memtable.cc	2015-07-09 19:46:29 +02:00
Avi Kivity	5d9222d935	Merge "Filter sstable data not belonging to current shard" from Tomasz "We don't want multiple shards to respond with the same data. Higher level code assumes that shard data is non-overlapping. It's cheaper to drop duplicates as soon as possible. Memtable reader for example will never have overlapping data, so cache hitting queries will never need to pay for this. Compaction process may also rely on this."	2015-07-07 18:12:35 +03:00
Tomasz Grabiec	66dfeb33d7	db: Filter out sstable partitions not belonging to current shard	2015-07-07 16:56:25 +02:00
Tomasz Grabiec	d035c499b8	db: Move database::shard_of() to dht::shard_of()	2015-07-07 16:56:25 +02:00
Pekka Enberg	a358990855	db/legacy_schema_tables: Pass storage_proxy by reference We always operate on the local storage proxy so pass it by reference. This simplifies DEFINITIONS_UPDATE message handler where all we have is a "this" pointer to the local storage proxy. Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-07-07 16:27:58 +03:00
Glauber Costa	5044e5191d	gate: use with_gate idiom Aside from guaranteeing that we will always leave correctly, it will also allow us to change the implementation of the enter / leave pair without disrupting existing code. Signed-off-by: Glauber Costa <glommer@cloudius-systems.com>	2015-07-05 19:18:34 +03:00

1 2 3 4 5 ...

290 Commits