scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-23 18:10:39 +00:00

Author	SHA1	Message	Date
Glauber Costa	dddc7e1676	add a token_view Ideally we would like tokens to be trivially destructible, so that we can easily dispose of giant vectors holding them. While that is hard to do with our current infrastructure, we can introduce a token_view, which holds a bytes_view elements instead of the real data - making it trivially destructible. The comparators are then changed to take a token_view, and an implicit conversion function is provided from tokens so they get compared. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2018-03-15 12:24:09 -04:00
Avi Kivity	025c6b45b2	dht: extend i_partitioner::next_token_for_shard() Right now, next_token_for_shard() only allows iterating linearly in shard order. Add the ability to select a specific shard to skip to (in case we're only interested in a single shard), and to select larger ranges (so that exponential increases are not implemented by iteration).	2017-05-17 12:30:03 +03:00
Avi Kivity	302fec8293	dht: make i_partitioner::name() const	2017-05-17 12:30:03 +03:00
Avi Kivity	f462c4327e	dht: make i_partitioner keep track of the number of shards it was configured with Useful for testing classes layered on top of the partitioner (the sharders).	2017-05-17 12:30:03 +03:00
Avi Kivity	8b1d689de8	partitioner: add ignore_msb parameters to byte ordered and random partitioners Ignored; doesn't make sense on byte ordered, and random is deprecated.	2016-11-22 21:56:42 +02:00
Avi Kivity	1f88d103a8	partitioner: add i_partitioner::token_for_next_shard() When performing a range query, we want to iterate over shards, running the query on each shard in order until the query range is exhausted or we have the right number of rows. To be able to do this, introduce token_for_next_shard(), which allows us to determine the boundary between shards. It is a sort-of inverse to shard_of(), in that shard_of(token_for_next_range(t)) == shard_of(t) + 1	2016-11-03 19:09:23 +02:00
Avi Kivity	6320181b97	partitioner: const correctness for comparators	2016-11-03 11:27:40 +02:00
Avi Kivity	470826d127	partitioner: change partitioners to have shard counts independent from smp::count Useful for testing.	2016-11-03 11:27:40 +02:00
Duarte Nunes	862f51cddf	partitioner: Parse token from bytes This patch adds the from_bytes() function to the i_partitioner class, whose purpose is parse a particular token and explicitly handle the case when the minimum token is specified. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-09-30 11:17:02 +00:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Avi Kivity	d5cf0fb2b1	Add license notices	2015-09-20 10:43:39 +03:00
Glauber Costa	229ce6cd85	dht: provide a from_sstring method Only the partitioner knows how to convert a token to a sstring. Conversely, only the partitioner can know how to convert it back. Signed-off-by: Glauber Costa <glommer@cloudius-systems.com>	2015-08-17 11:03:35 -07:00
Glauber Costa	5f807784bf	dht: fix to_sstring methods to account for min tokens Right now, we are converting the _data part of the token to a sstring, which may be latter stored somewhere - in a system sstable, for instance. Later on, we will have to get it back, but the way the code currently stands, we will get undefined results for min and max tokens, since they have the _data field empty. For murmur3, strictly speaking, the correct solution would be to change long_token to account for that. However, when we compare values, we already do kind comparations explicitly. Inserting them there would only make that operation branchier == costlier, which being a very common one, we don't want to. Signed-off-by: Glauber Costa <glommer@cloudius-systems.com>	2015-08-17 10:23:19 -07:00
Glauber Costa	e1968c389e	dht: use tri_compare for token comparisons Loading data from memory tends to be the most expensive part of the comparison operations. Because we don't have a tri_compare function for tokens, we end up having to do an equality test, which will load the token's data in memory, and then, because all we know is that they are not equal, we need to do another one. Having two dereferences is harmful, and shows up in my simple benchmark. This is because before writing to sstables, we must order the keys in decorated key order, which is heavy on the comparisons. The proposed change speeds up index write benchmark by 8.6%: Before: 41458.14 +- 1.49 partitions / sec (30 runs) After: 45020.81 +- 3.60 partitions / sec (30 runs) Parameters: --smp 6 --partitions 500000 Signed-off-by: Glauber Costa <glommer@cloudius-systems.com>	2015-08-12 09:23:42 -05:00
Avi Kivity	f915ff1fcd	dht: introduce i_partitioner::shard_of() and implement msb sharding Make sharding partitioner-specific, since different partitioners interpret the byte content differently. Implement it by extracting the shard from the most significant bits, which can be used to minimize cross shard traffic for range queries, and reduces sstable sharing.	2015-08-03 20:17:40 +03:00
Paweł Dziepak	ede9886f50	dht: add byte_ordered_partitioner Some of the tests in DTEST take advantage of the fact that ByteOrderedPartitioner guarantees certain ordering of partition keys. Signed-off-by: Paweł Dziepak <pdziepak@cloudius-systems.com>	2015-07-09 23:43:16 +02:00

16 Commits