scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-22 01:20:39 +00:00

Author	SHA1	Message	Date
Paweł Dziepak	af4fa6152b	partition_start: make partition_tombstone() const	2017-11-13 16:49:52 +00:00
Paweł Dziepak	f648f94464	partition_checksum: introduce compute() for flat_mutation_reader	2017-11-13 16:49:52 +00:00
Paweł Dziepak	37640f223b	db: drop single-range make_streaming_reader()	2017-11-13 16:49:52 +00:00
Paweł Dziepak	e2481a89e1	fragment_and_freeze: drop streamed_mutation overload	2017-11-13 16:49:52 +00:00
Paweł Dziepak	6f1e0d3ed8	stream_transfer_task: switch to flat_mutation_reader	2017-11-13 16:49:52 +00:00
Paweł Dziepak	50a1d76c1f	tests/flat_mutation_reader: add test for fragment_and_freeze Based on streamed_mutation_test:test_fragmenting_and_freezing_streamed_mutations	2017-11-13 16:49:52 +00:00
Paweł Dziepak	f5c40e0861	flat_mutation_reader_from_mutations: take vector by value	2017-11-13 16:49:51 +00:00
Paweł Dziepak	9854b8a450	fragment_and_freeze: work on flat_mutation_readers	2017-11-13 16:49:47 +00:00
Paweł Dziepak	8bb672502d	fragment_and_freeze: allow callback to stop iteration There is a user of fragment_and_freeze() (streaming) that will need to be able to break the loop. Right now, it does that between streamed_mutation, but that won't be possible after we switch to flat readers.	2017-11-13 16:44:33 +00:00
Paweł Dziepak	73b8f54cf4	test/mutation_source_test: generate sets of mutations	2017-11-13 16:42:56 +00:00
Tomasz Grabiec	3536d2156c	tests: row_cache: Add reproducer for issue #2948 Message-Id: <1510229584-14398-2-git-send-email-tgrabiec@scylladb.com>	2017-11-13 15:20:21 +00:00
Tomasz Grabiec	8402728747	row_cache: Call open_version() under region's allocator partition_entry::read() calls open_version() under standard allocator, but it may allocate a new partition version if a snapshot already exists which was created in an earlier phase. Versions are supposed to be allocated using region's allocator, they will be freed using region's allocator. LSA will delegate free() to the standard allocator correctly in this case, but it will subtract from its _non_lsa_occupancy, assuming the allocation was done through it. This will corrupt occupancy() for cache region. Fixes #2948. Message-Id: <1510229584-14398-1-git-send-email-tgrabiec@scylladb.com>	2017-11-13 15:20:08 +00:00
Avi Kivity	061f6830fa	Merge "thrift/server: Ensure stop() waits for accepts" from Duarte "Ensure stop() waits for the accept loop to complete to avoid crashes during shutdown." * 'thrift-server-stop/v4' of https://github.com/duarten/scylla: thrift/server: Restore code format thrift/server: Stopping the server waits for connection shutdown thrift/server: Abort listeners on stop() thrift/server: Avoid manual memory management thrift/server: Add move ctor for connection thrift/server: Extract retry logic thrift/server: Retry with backoff for some error types thrift/server: Retry accept in case of error	2017-11-13 12:48:05 +02:00
Duarte Nunes	049fbb58f3	thrift/server: Restore code format Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-11-13 11:21:54 +01:00
Duarte Nunes	7b25e3200a	thrift/server: Stopping the server waits for connection shutdown This patch ensures the future returned from stop() resolves only when all connections and listeners are no longer in use. Fixes #2657 Fixes #2942 Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-11-13 11:21:53 +01:00
Duarte Nunes	f523a0f845	thrift/server: Abort listeners on stop() Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-11-13 11:19:44 +01:00
Duarte Nunes	8e0e2363e9	thrift/server: Avoid manual memory management Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-11-13 11:19:44 +01:00
Duarte Nunes	75d04be96f	thrift/server: Add move ctor for connection	2017-11-13 11:19:44 +01:00
Duarte Nunes	9d3322ff1a	thrift/server: Extract retry logic Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-11-13 11:19:43 +01:00
Duarte Nunes	b5cf1a152f	thrift/server: Retry with backoff for some error types Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-11-13 11:19:19 +01:00
Duarte Nunes	f367dbe1ed	thrift/server: Retry accept in case of error In case of errors like ECONNABORTED, we want to retry accepting connections. Right now we immediately retry the accept, but in subsequent patches we introduce a backoff for other types of errors. We also consider fatal errors like EBADFD, which should not trigger a retry. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-11-13 11:19:03 +01:00
Avi Kivity	d57395dce9	cql: prevent overflow when computing averages Currently, we use the type of the column as the accumulator when we average it. This can easily overflow, e.g. (2^31-1)+(3) = overflow. Fix by using __int128 for the accumulator. It's not standard, but it's way more efficient and simpler than the alternatives. Inspired by CASSANDRA-12417, but much simpler due to the availability of __int128. Message-Id: <20171112173529.30764-1-avi@scylladb.com>	2017-11-13 08:53:59 +01:00
Piotr Jastrzebski	acfc6fef55	Simplify flat_mutation_reader wrappers If a wrapper takes a flat_mutation_reader in a constructor then it does not have to take schema_ptr because it can obtain it from the inner flat_mutation_reader. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <88c3672df08d2ac465711e9138d426e43ae9c62b.1510331382.git.piotr@scylladb.com>	2017-11-13 08:53:34 +01:00
Avi Kivity	f8af4f507b	Merge "Support for varint and decimal in aggregate functions" from Daniel "This patch adds support for varint and decimal to aggregate functions. Some other types (like byte or smallint) weren't supported and they are supported by C. So their aggregate functions were added as well. To allow aggregate functions for big_decimal, following methods were added to big_decimal type: Division by int64_t that preserves number of decimal digits. * Operator += . * Comparison operators. Fixes #2842." * 'danfiala/scylla-2842-send-002' of https://github.com/hagrid-the-developer/scylla: tests: Add tests for aggregate functions. tests: Add tests for big_decimal type. cql3/functions: Add aggregate functions for big_decimal. utils/big_decimal: Added necessary operators and methods for aggregate functions. cql3/functions: Add aggregate functions for types for which it is trivial.	2017-11-12 17:11:33 +02:00
Daniel Fiala	bc20484c47	tests: Add tests for aggregate functions. Signed-off-by: Daniel Fiala <daniel@scylladb.com>	2017-11-12 15:53:22 +01:00
Daniel Fiala	ee1d69502b	tests: Add tests for big_decimal type. Signed-off-by: Daniel Fiala <daniel@scylladb.com>	2017-11-12 15:53:22 +01:00
Daniel Fiala	74c5f70b0a	cql3/functions: Add aggregate functions for big_decimal. Signed-off-by: Daniel Fiala <daniel@scylladb.com>	2017-11-12 15:53:13 +01:00
Daniel Fiala	ce2f010859	utils/big_decimal: Added necessary operators and methods for aggregate functions. Signed-off-by: Daniel Fiala <daniel@scylladb.com>	2017-11-12 15:51:29 +01:00
Daniel Fiala	115668fe70	cql3/functions: Add aggregate functions for types for which it is trivial. Signed-off-by: Daniel Fiala <daniel@scylladb.com>	2017-11-12 13:56:20 +01:00
Tomasz Grabiec	484dde692f	Merge "make sure that cache updates don't overflow dirty memory" from Glauber Since we started accounting virtual dirty memory we no longer have a cap on real dirty memory. In most situations that is not needed, since real dirty will just be at most twice as much as virtual dirty (current flushing memtable plus new memtable). However, due to things like cache updates and component flushing we can end up having a lot of memtables that are virtually freed but not yet fully released, leading real dirty memory to explode using all the box' memory. This patch adds a cap on real dirty memory as well. Because of the hierarchical nature of region_group, if the parent blocks due to memory depletion, so will the child (virtual dirty region group). After that is done, we need to make sure that dirty memory is not seen as freed until the cache update is done. Until a particular partition is moved to the cache it is not evictable. As a result we can OOM the system if we have a lot of pending cache updates as the writes will not be throttled and memory won't be made available. This patch pins the memory used by the region as real dirty before the cache update starts, and unpins it when it is over. In the mean time it gradually releases memory of the partitions that are being moved to cache. I have verified in a couple of workloads that the amount of memory accounted through this is the same amount of memory accounted through the memtable flush procedure. 
Fixes #1942 * git@github.com:glommer/scylla.git glommer/update-cache-v4: row_cache: modernize use of seastar threads mutation_partition: estimate size of partition memtable: factor out calculation of memtable_entry memory size memtable: add a method to export memtable's dirty memory manager dirty_memory_manager: block if we hit the real dirty limit dirty_memory_manager: add functions to manipulate real dirty partition: add method to calculate memory size of a partition row cache: pin real dirty during cache updates.	2017-11-10 13:55:12 +01:00
Piotr Jastrzebski	e7a0732f72	Add schema_ptr to flat_mutation_reader It is useful to have a schema inside a flat reader the same way we had schema inside a streamed_mutation. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <b37e0dbf38810c00bd27fb876b69e1754c16a89f.1510312137.git.piotr@scylladb.com>	2017-11-10 13:54:55 +01:00
Pekka Enberg	0c192c835c	cql3: Fix 'DROP INDEX' to also drop index view This patch fixes 'DROP INDEX' CQL statement to also drop the underlying index view automatically so that we don't leave unused materialized views behind. Message-Id: <1510303421-15945-1-git-send-email-penberg@scylladb.com>	2017-11-10 10:52:08 +01:00
Duarte Nunes	73f6c9a612	Merge seastar upstream * seastar 8040cab...11ad0b1 (7): > alloc_failure_injector: Fix compilation error with gcc 7.1 > core/gate: Add is_closed() function > doc: code formatting and fix function call > doc: tutoral code formatting > build: adjust -Wno-error=cpp for clang > build: don't error out on preprocessor #warning > Merge 'Enhancements of allocation failure injector' from Tomasz Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-11-09 14:42:06 +01:00
Takuya ASADA	f607a01cc5	dist/debian: link boost statically Since we switched scylla-boost163 which isn't provided by distribution repo, we need to link them statically. Fixes #2946 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1510229553-29801-1-git-send-email-syuu@scylladb.com>	2017-11-09 14:51:00 +02:00
Glauber Costa	1d7617723d	row cache: pin real dirty during cache updates. Right now, once a region is moved to the cache it is no longer visible to the dirty memory system. Not as real dirty nor virtual dirty. The problem is that until a particular partition is moved to the cache it is not evictable. As a result we can OOM the system if we have a lot of pending cache updates as the writes will not be throttled and memory won't be made available. This patch pins the memory used by the region as real dirty before the cache update starts, and unpins it when it is over. In the mean time it gradually releases memory of the partitions that are being moved to cache. I have verified in a couple of workloads that the amount of memory accounted through this is the same amount of memory accounted through the memtable flush procedure. Fixes #1942 Signed-off-by: Glauber Costa <glauber@scylladb.com>	2017-11-08 19:46:36 -05:00
Glauber Costa	c2f49da609	partition: add method to calculate memory size of a partition Once that is added, also add a method to a memtable entry to calculate the entire size of a memtable entry. Right now we only have one method to calculate the size minus rows. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2017-11-08 16:21:44 -05:00
Glauber Costa	b02ab991b9	dirty_memory_manager: add functions to manipulate real dirty There are times in which we want to add and remove real dirty memory without impacting virtual dirty. One such example is the cache update process, where real dirty is the limiting factor. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2017-11-08 16:21:44 -05:00
Glauber Costa	a6b2226562	dirty_memory_manager: block if we hit the real dirty limit Since we started accounting virtual dirty memory we no longer have a cap on real dirty memory. In most situations that is not needed, since real dirty will just be at most twice as much as virtual dirty (current flushing memtable plus new memtable). However, due to things like cache updates and component flushing we can end up having a lot of memtables that are virtually freed but not yet fully released, leading real dirty memory to explode using all the box' memory. This patch adds a cap on real dirty memory as well. Because of the hierarchical nature of region_group, if the parent blocks due to memory depletion, so will the child (virtual dirty region group). A next step is to add a controller that will increase the priority of the tasks involved in releasing real dirty memory if we get dangerously close to the threshold. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2017-11-08 16:21:44 -05:00
Glauber Costa	b98a48657e	memtable: add a method to export memtable's dirty memory manager It will be used by the cache update process to gradually return real dirty memory to the manager. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2017-11-08 16:21:44 -05:00
Glauber Costa	ec36b9eddc	memtable: factor out calculation of memtable_entry memory size The total size is the sum of two components. Add a method that does that sum so this code gets easier to reuse. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2017-11-08 16:21:44 -05:00
Glauber Costa	d49ecae201	mutation_partition: estimate size of partition In the memtable flusher, we account for the size of a partition as we read them. However, there are other points in the architecture where we would like to calculate the size of a partition in a point in which we are not reading it. One such example is the cache update process. This patch enhances the mutation_partition adding a method that returns the total size for this partition. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2017-11-08 16:21:44 -05:00
Glauber Costa	b836005555	row_cache: modernize use of seastar threads For a while now we have an async() function, that simplifies the code by not needing to issue an explicit join. This patch converts the row cache to use async() as well, which most of our code already does. Doing so will make it easier to make changes to update_cache. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2017-11-08 16:21:44 -05:00
Paweł Dziepak	b69f94fece	Merge "Implement flat_mutation_reader::consume" from Piotr "Implement flat_mutation_reader::consume and add tests for it. For that implement flat_mutation_reader_from_mutations and read_mutation_from_flat_mutation_reader." * 'haaawk/flat_reader_consume_v3' of github.com:scylladb/seastar-dev: Add tests for flat_mutation_reader::consume Add tests for flat_mutation_reader utils Introduce read_mutation_from_flat_mutation_reader Make mutation_rebuilder streamed_mutation independent flat_mutation_reader_from_mutation: support multiple mutations Introduce flat_mutation_reader::consume Move FlattenedConsumer concept to flat_mutation_reader.hh	2017-11-08 15:08:47 +00:00
Paweł Dziepak	0373f357a8	Merge "Make memtable::make_reader return flat_mutation_reader" from Piotr "This patchset introduces memtable::make_flat_reader that returns flat_mutation_reader and converts internal memtable readers into flat_mutation_readers. It also introduces some utility methods like make_forwardable and make_partition_snapshot_flat_reader." * 'haaawk/flat_reader_memtable_v4' of github.com:scylladb/seastar-dev: Turn scanning_reader into flat_mutation_reader Change memtable_entry::read to return flat_mutation_reader Make iterator_reader independent from mutation_reader Introduce make_partition_snapshot_flat_reader Prepare partition_snapshot_flat_reader Introduce flat_mutation_reader_from_mutation Prepare flat_mutation_reader_from_mutation Introduce make_forwardable Prepare make_forwardable for flat_mutation_reader Introduce empty_flat_reader memtable: Introduce make_flat_reader	2017-11-08 14:24:26 +00:00
Piotr Jastrzebski	29d409de2f	Add tests for flat_mutation_reader::consume Make sure that flat_mutation_reader::consume stops as it's asked by the consumer. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-11-08 14:26:10 +01:00
Piotr Jastrzebski	d42e53982d	Add tests for flat_mutation_reader utils Test flat_mutation_reader_from_mutations and read_mutation_from_flat_mutation_reader. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-11-08 14:26:10 +01:00
Piotr Jastrzebski	4b58a05053	Introduce read_mutation_from_flat_mutation_reader This helper method reads a single mutation from a flat_mutation_reader. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-11-08 14:26:10 +01:00
Piotr Jastrzebski	6718ecab82	Make mutation_rebuilder streamed_mutation independent mutation_rebuilder will be used not only with streamed_mutations but also with flat_mutation_readers so it's better for it to be independent from streamed_mutation. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-11-08 14:26:10 +01:00
Piotr Jastrzebski	aa16cd7eef	flat_mutation_reader_from_mutation: support multiple mutations Rename flat_mutation_reader_from_mutation to flat_mutation_reader_from_mutations. Make it work with std::vector<mutation> instead of a single mutation. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-11-08 14:26:10 +01:00
Piotr Jastrzebski	bcd5415413	Introduce flat_mutation_reader::consume This is equivalent to consume_flattened for old mutation_reader. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-11-08 14:25:28 +01:00
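
Commit d57395dce9 in the list above replaces a column-typed accumulator with __int128 when averaging. The sketch below illustrates that idea only; it is not the code from the commit. The function name average_int32 and the use of std::vector are assumptions made for illustration, and __int128 is a GCC/Clang extension rather than standard C++.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical helper, not ScyllaDB's actual implementation.
// Averages int32_t values using a 128-bit accumulator, so the running
// sum cannot overflow the way an int32_t accumulator would.
int32_t average_int32(const std::vector<int32_t>& values) {
    assert(!values.empty());  // an average of zero values is undefined here
    __int128 sum = 0;         // GCC/Clang extension, not standard C++
    for (int32_t v : values) {
        sum += v;
    }
    // The mean of int32_t values always fits in int32_t, so the
    // narrowing cast is safe. Integer division truncates toward zero.
    return static_cast<int32_t>(sum / static_cast<__int128>(values.size()));
}
```

With an int32_t accumulator, the example from the commit message, (2^31-1)+3, would overflow before the division; with the wider accumulator the sum is held exactly and only the final quotient is narrowed.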

13514 Commits