scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-22 17:40:34 +00:00

Author	SHA1	Message	Date
Raphael S. Carvalho	11b74050a1	partitioned_sstable_set: fix quadratic space complexity streaming generates lots of small sstables with large token range, which triggers O(N^2) in space in interval map. level 0 sstables will now be stored in a structure that has O(N) in space complexity and which will be included for every read. Fixes #2287. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20170417185509.6633-1-raphaelsc@scylladb.com>	2017-04-18 13:04:38 +03:00
Asias He	e5485f3ea6	Get rid of query::partition_range Use dht::partition_range instead	2016-12-19 08:09:25 +08:00
Asias He	d1178fa299	Convert to use dht::token_range	2016-12-19 08:04:29 +08:00
Raphael S. Carvalho	02541e15c1	sstable_set: introduce incremental selector Incrementally select sstables from sstable set using token in ascending order. For leveled strategy, it returns all sstables that belong to current interval. For other strategies, it just return all sstables from the set. Useful for compaction which needs all sstables that overlap with key being currently compacted to calculate maximum purgeable timestamp. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-12-09 16:17:16 -02:00
Raphael S. Carvalho	b30a2cb21a	lcs: generate info that preserves token distribution in higher levels The information (last compacted keys) is lost after node is restarted or schema is updated, which causes strategy to be rebuilt. We need it for strategy to guarantee uniform distribution of token range across sstables, or we could end up with 1 sstable of level L overlapping with lots of sstables of level L+1, and that results in a compaction of undesired length. That information can be generated from scratch by getting last key of newest sstable in each level > 0. Fixes #1906. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <35ebd15977d5a8418239febb160c796cdc0e98fa.1480533805.git.raphaelsc@scylladb.com>	2016-12-01 11:19:58 +02:00
Raphael S. Carvalho	a16425833c	size_tiered: do not recreate bucket when it goes beyond max threshold Problem will cause size tiered to return small jobs when there are more than max_threshold sstables of similar size. For example, if max_threshold is 32, and there are 36 sstables of similar size, strategy will only return 4 sstables to be compacted. That's because we incorrectly create a new bucket when it meets the max threshold. What we should do is to allow buckets to grow beyond max threshold and trim them when selecting the most suitable one for compaction. Important to mention that estimation for size tiered will now work better when there are more than max_threshold sstables of similar size. Fixes #1901. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <080bad70d6cb86eaf52ac1bdd6765ac47aab5b03.1478316140.git.raphaelsc@scylladb.com>	2016-11-29 16:56:02 +02:00
Raphael S. Carvalho	a8ab4b8f37	lcs: fix starvation at higher levels When max sstable size is increased, higher levels are suffering from starvation because we decide to compact a given level if the following calculation results in a number greater than 1.001: level_size(L) / max_size_for_level_l(L) Fixes #1720. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-09-30 14:09:49 -03:00
Raphael S. Carvalho	a3bf7558f2	lcs: fix broken token range distribution at higher levels Uniform token range distribution across sstables in a level > 1 was broken, because we were only choosing sstable with lowest first key, when compacting a level > 0. This resulted in performance problem because L1->L2 may have a huge overlap over time, for example. Last compacted key will now be stored for each level to ensure sort of "round robin" selection of sstables for compactions at level >= 1. That's also done by C*, and they were once affected by it as described in https://issues.apache.org/jira/browse/CASSANDRA-6284. Fixes #1719. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-09-30 14:09:16 -03:00
Raphael S. Carvalho	dffb41f9d8	sstables: remove schema parameter from some sstable methods schema can now be found in the sstable object itself. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <0fa44fedbe784d924522d7eeca77c16294479c6e.1473959677.git.raphaelsc@scylladb.com>	2016-09-19 13:25:58 +02:00
Raphael S. Carvalho	a2dc88889d	db: enable clustering optimization only on dtcs Leveled strategy will not benefit from this strategy because there's only a few sstables that will contain a given partition key, which means that a clustering key that belongs to a specific partition key can only be in a few sstables as well. Date tiered strategy is the one that will actually benefit the most from this optimization. Size tiered may benefit from it too if clustering key isn't overwritten, but it will not use the clustering optimization. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-09-02 11:31:07 -03:00
Avi Kivity	7a140a306e	Revert "sstables: optimize selection of sstables for leveled strategy" This reverts commit c75b07fc34f0e7267a8e49276b96bbd4686cb78d; does not deduplicate the sstable list.	2016-09-01 18:34:08 +03:00
Raphael S. Carvalho	c75b07fc34	sstables: optimize selection of sstables for leveled strategy It's possible to copy sstables directly into vector, and that will improve performance. my benchmark tool[1] shows that new version reduces running time of copy procedure by factor of two after 1024^2 calls. Switching to back_inserter improves throughput even further. [1]: gist.github.com/raphaelsc/a4b27290f362cdecdef399770dda759c Refs #1632. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <7153514a9b5f5eb24dff518ee9fa3680e0881dae.1472741401.git.raphaelsc@scylladb.com>	2016-09-01 18:08:53 +03:00
Raphael S. Carvalho	b699ef2de3	compaction: wire up date tiered compaction strategy After this commit, date tiered compaction strategy is supported on Scylla. To understand how it works, take a look at our wiki page: https://github.com/scylladb/scylla/wiki/SSTable-compaction#date-tiered-compaction Fixes #511. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-07-06 02:11:47 -03:00
Raphael S. Carvalho	43926026c3	compaction: introduce compaction strategy method to estimate pending compaction At the moment, it's not possible to know how many compaction are needed for compaction strategy to be satisfied. It's not possible to know exactly the number of pending compaction, but the strategy can provide an estimation. For size tiered, it's based on number of sstables in each bucket. By dividing bucket size by max threshold, we get number of compaction needed to compact that single bucket. For leveled, it's about the number of sstables that exceeds the limit in each level. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <e209e52f6159ee274a8358b69961a7c0ce357f7d.1467667054.git.raphaelsc@scylladb.com>	2016-07-05 19:03:11 +03:00
Avi Kivity	c8237fc262	compaction_strategy: introduce make_sstable_set() Allow compaction_strategy to create a container for sstables that is optimized for the strategy. Most compaction_strategies return bag_sstable_set; leveled compaction returns the specialized partitioned_sstable_set.	2016-07-03 10:27:01 +03:00
Avi Kivity	168696c558	Introduce partitioned_sstable_set partitioned_sstable_set assumes that sstable are mostly partitioned along the token range: only a few sstables will be needed to access a particular token. It is implemented as an interval_map.	2016-07-03 10:27:00 +03:00
Avi Kivity	64e4357461	Introduce bag_sstable_set bag_sstable_set is a generic sstable_set implementation: it assumes nothing about the sstables. It is implemented as a vector, and any select will return the entire sstable set.	2016-07-03 10:27:00 +03:00
Avi Kivity	85e9cf4616	Introduce sstable_set sstable_set abstracts the notion of a container of sstables, allowing different compaction strategies to supply their own implementation. The intended user is leveled compaction strategy; since it partitions sstables, it can quickly restrict the number of sstables that participate in a query by looking at the min/max partition key. sstable_set also maintains an internal lw_shared_ptr<sstable_list>, in parallel with the abstract container. This is to support column_family::get_sstable(), which returns a lw_shared_ptr<sstable_list> which must be anchored somewhere if it is not saved at the caller side, as it isn't in most current callers.	2016-07-03 10:27:00 +03:00
Avi Kivity	2a46410f4a	Change sstable_list from a map to a set sstable_list is now a map<generation, sstable>; change it to a set in preparation for replacing it with sstable_set. The change simplifies a lot of code; the only casualty is the code that computes the highest generation number.	2016-07-03 10:26:57 +03:00
Raphael S. Carvalho	588ce915d6	compaction: disable parallel compaction for leveled strategy It was discussed that leveled strategy may not benefit from parallel compaction feature because almost all compaction jobs will have similar size. It was also found that leveled strategy wasn't working correctly with it because two overlapping sstable (targetting the same level) could be created in parallel by two ongoing compaction. Fixes #1293. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <60fe165d611c0283ca203c6d3aa2662ab091e363.1464883077.git.raphaelsc@scylladb.com>	2016-06-05 18:20:00 +03:00
Raphael S. Carvalho	bf03cd1ea6	sstables: kill unused code from size tiered strategy Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <485b1e49419cb052218ab4558f27270ce3bd03b4.1460761821.git.raphaelsc@scylladb.com>	2016-04-19 08:46:06 +03:00
Raphael S. Carvalho	29db5f5e1f	sstables: move compaction strategy code to a new source file Moving compaction strategy code from sstables/compaction.cc to sstables/compaction_strategy.cc That improves readability. Strategy code should be separated from the generic compaction code. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <5af6fc8f7321351a071fc0ce03c80ffea21f8396.1460761821.git.raphaelsc@scylladb.com>	2016-04-19 08:45:43 +03:00

22 Commits