scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-28 12:17:02 +00:00

Author	SHA1	Message	Date
Paweł Dziepak	b7b7b2bd63	combined_mutation_reader: implement fast_forward_to() Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-10-19 15:29:08 +01:00
Paweł Dziepak	2c0cdd55fc	mutation_reader: make combinded_reader public We want to be able to fast forward sstable readers. However, just implementing fast_forward_to() for combined_reader is not enough as the sstables we are reading from may need to change. Following patches are going to introduce a combined sstable reader that derives from combined_reader. To make that possible we first need to make combined_reader public. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-10-19 15:29:08 +01:00
Paweł Dziepak	7dcd70124a	tests/sstables: add test for fast forwarding reader Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-10-19 15:29:08 +01:00
Paweł Dziepak	5534dc2817	tests: add more helpers to mutation reader assertions Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-10-19 15:29:08 +01:00
Paweł Dziepak	cf024975fe	sstables: enable fast forwarding for range readers Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-10-19 15:29:08 +01:00
Paweł Dziepak	62c9492d33	mutation_reader: introduce fast_forward_to() This patch introduces the interface for fast forwarding mutation readers. The main user of this feature is going to be cache which, while serving range query, may need to read multiple small ranges from the sstables to populate itself with the missing entries. Fast forwarding is an alternative to recreating a reader with different range. Its main advantage is fact that it avoids dropping data that has already been read. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-10-19 15:29:08 +01:00
Paweł Dziepak	c63e88d556	sstables: implement mutation_reader::impl::fast_forward_to() This patch allows sstable readers to be fast forwarded without making it necessary to recreate the reader (and dropping all buffers in the process). It is built on top of index_reader and ability of data_consume_context to be fast forwarded. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-10-19 15:29:08 +01:00
Paweł Dziepak	a530762277	sstables: introduce index_reader index_reader is a helper that implements index lookups. Its goal is to avoid dropping read buffers if they still may be needed (for example to get end bound of the range or after fast forwarding the reader). Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-10-19 15:29:08 +01:00
Paweł Dziepak	f49a9e0d64	sstables: drop unused read_range_rows() overload That overload was used only by unit test and violated guarantee that partition range lives until mutation reader is done. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-10-19 15:29:08 +01:00
Paweł Dziepak	0bc873ace5	sstables: add fast_forward_to() to continuous_data_consumer Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-10-19 15:29:08 +01:00
Paweł Dziepak	25b91c51e2	ssables: add data_consume_rows_context::reset() reset() is going to be used to restore valid state after fast forwarding the reader. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-10-19 15:29:08 +01:00
Paweł Dziepak	2124d08b88	sstables: add skip() to compressed_file_data_source Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-10-19 15:29:08 +01:00
Paweł Dziepak	54069162f5	Merge "Add test for partition version list consistency after compaction" from Tomek	2016-10-18 11:03:25 +01:00
Tomasz Grabiec	308434f891	tests: memtable: Add test for partition version list consistency after compaction	2016-10-18 11:57:14 +02:00
Tomasz Grabiec	6548132423	lsa: Make logalloc::tracker::full_compaction() compact all reclaimable regions is_compactible() will pass on very small regions. full_compaction() is only used in tests to force objects to be moved due to compaction, so we want all reclaimable regions to be compacted.	2016-10-18 11:16:08 +02:00
Tomasz Grabiec	ecf85cbffb	mutation: Define + operation It's more convenient to write m1 + m2 in tests than to do more elaborate constructs with copy constructors and apply().	2016-10-18 11:16:08 +02:00
Tomasz Grabiec	fe387f8ba0	partition_version: Fix corruption of partition_version list The move constructor of partition_version was not invoking move constructor of anchorless_list_base_hook. As a result, when partition_version objects were moved, e.g. during LSA compaction, they were unlinked from their lists. This can make readers return invalid data, because not all versions will be reachable. It also casues leaks of the versions which are not directly attached to memtable entry. This will trigger assertion failure in LSA region destructor. This assetion triggers with row cache disabled. With cache enabled (default) all segments are merged into the cache region, which currently is not destroyed on shutdown, so this problem would go unnoticed. With cache disabled, memtable region is destroyed after memtable is flushed and after all readers stop using that memtable. Fixes #1753. Message-Id: <1476778472-5711-1-git-send-email-tgrabiec@scylladb.com>	2016-10-18 09:25:38 +01:00
Takuya ASADA	587d375e19	main: exit with 1 when verify_seastar_io_scheduler() failed Since we are exiting Scylla process in engine().at_exit() using ::_exit(0), even verify_seastar_io_scheduler() throwing an exception, scylla always exit with 0. Systemd misunderstands scylla-server.service was shutdown successfully because of this, so we need to pass correct exit code to ::_exit() here. Fixes #1674 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1475065607-15486-1-git-send-email-syuu@scylladb.com>	2016-10-17 13:57:00 +03:00
Avi Kivity	163088c6af	Merge seastar upstream * seastar 207bf3d...ccd8649 (3): > Merge "Augment semaphore with non-blocking operations" from Glauber > Merge "More dynamic fstream patches" from Paweł > Merge "fstream: add dynamic adjustments based on stream history" from Paweł	2016-10-17 12:49:17 +03:00
Avi Kivity	65c27ccf21	bytes_ostream: make max_chunk_size() an inline function Fixes debug build looking for a variable definition and not finding it.	2016-10-17 11:49:33 +03:00
Avi Kivity	c0a1ad0b77	bytes_ostream: use larger allocations A 1MB response will require 2000 allocations with the current 512-byte chunk size. Increase it exponentially to reduce allocation count for larger responses (still respecting the upper limit). Message-Id: <1476369152-1245-1-git-send-email-avi@scylladb.com>	2016-10-16 10:05:48 +01:00
Tomasz Grabiec	d836e8f64b	tests: memtable: Add tests for flushing reader Message-Id: <1476454187-11462-1-git-send-email-tgrabiec@scylladb.com>	2016-10-14 15:11:06 +01:00
Tomasz Grabiec	63784fd921	db: Fix corruption of partition_entry Memory accounting code was attaching partition_snapshot to partition_entry in order to calculate the size of partition_version object. However, it is only allowed if partition_entry doesn't have any snapshot attached already. In this case it always has one, created by the flushing reader. Change the accounting code to reuse existing partition_snapshot reference. Fixes #1746 Message-Id: <1476449160-9252-1-git-send-email-tgrabiec@scylladb.com>	2016-10-14 15:10:48 +01:00
Paweł Dziepak	d08cffd3c7	lsa: avoid exceptions during segment_zone creation LSA tries to allocate zones as large as possible (while still leaving enough free space for the standard allocator). It uses the amount of free memory in order to guess how much it can get, but that obviously doesn't account for fragmentation and the allocation attempt may fail. This patch changes the LSA code so that it doesn't throw in case zone couldn't be created but just returns a null pointer which should be more performant if the LSA memory cannot grow any more. Fixes #1394. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com> Message-Id: <1476435031-5601-1-git-send-email-pdziepak@scylladb.com>	2016-10-14 11:08:24 +02:00
Amnon Heiman	7829da13b4	scylla_setup: Reorder questions and actions The expected behaviour in the scylla_setup script is that a question will be followed by the answer. For example, after asking if the scylla should be run as a service the relevant actions will be taken before the following question. This patch address two such mis-orders: 1. the scylla-housekeeping depends on the scylla-server, but the setup should first setup the scylla-server service and only then ask (and install if needed) the scylla-housekeeping. 2. The node_exporter should be placed after the io_setup is done. Fixes #1739 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <1476370098-25617-1-git-send-email-amnon@scylladb.com>	2016-10-13 18:29:36 +03:00
Pekka Enberg	3b4e6cdc5e	abstract_replication_strategy: Fix exception type if class not found Change abstract_replication_strategy::create_replication_strategy() to throw exceptions::configuration_error if replication strategy class lookup to make sure the error is converted to the correct CQL response. Fixes #1755 Message-Id: <1476361262-28723-1-git-send-email-penberg@scylladb.com>	2016-10-13 17:39:28 +03:00
Tomasz Grabiec	e617bcd8a7	logalloc: disable abort on allocation failure in places in which it is benign Some places start big expecting allocation failure, then reduce the requested size. Let's not abort in such cases. Message-Id: <1476295120-32047-1-git-send-email-tgrabiec@scylladb.com>	2016-10-13 10:53:32 +03:00
Avi Kivity	13e9d4c8e3	Merge seastar upstream * seastar f937fb0...207bf3d (11): > Merge "iotune: gracefully exit on predictable exceptions" (Fixes #1623) > core/semaphore: Add semaphore_units::release() > Merge "rometheus API with grafana uses labels" from Amnon > core/thread: Fix stack alloc-dealloc mismatch > core/thread: Make jmp_buf_link::yield_at use the same time point as thread_scheduling_group > file: support for XFS on older kernels > reactor: fix bug when handling EBADF in flush_pending_aio() > prometheus CPU should start in 0 > Collectd: bytes ordering depends on the type > tests: Check that backtrace() doesn't corrupt signal mask > core/thread: Add stack guards to seastar thread stacks	2016-10-12 23:47:12 +03:00
Avi Kivity	63f053e9b7	storage_proxy: fix mutation reordering with wrapping ranges If we have a range query involving a wrapping range (i.e., from thrift), and mutations from both halves of the result are involved, then we will return the results in the wrong order (and potentially the wrong partitions) since we order by token, so the results from the second half of the wrapping range end up before the first. Fix by splitting the two queries, and merging the second half with lower priority compared to the first half. Note: this will be fixed in a better way once we have the sharding iterator, as then we can query sequentially. Fixes #1761. Message-Id: <1476262693-30162-1-git-send-email-avi@scylladb.com>	2016-10-12 15:59:16 +02:00
Avi Kivity	1506b06617	Merge "node_exporter service on ubuntu 16" from Amnon "This series address two issues that interfere with running the node_exporter as a service in ubuntu 16. 1. The service file should be packed in the deb file 2. When setting the node_exporter as a service it doesn't need to run with scylla use" * 'amnon/node_exporter_ubuntu_v2' of github.com:cloudius-systems/seastar-dev: node-exporter service: No need to run as scylla user debian package: Include the node_exporter service file	2016-10-12 12:11:18 +03:00
Amnon Heiman	1bd50789e0	node-exporter service: No need to run as scylla user the node-exporter does not need to run as scylla user. It can run without scylla or without the scylla user being configure. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-10-11 12:44:27 +03:00
Amnon Heiman	d523bf56ed	debian package: Include the node_exporter service file This will include the node_exporter service script for ubuntu distribution with systemd support. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-10-11 12:44:14 +03:00
Avi Kivity	f6998bb260	Merge "Implement describe_splits_ex based on Cassandra" from Duarte "This patch-set re-implements the describe_splits_ex() verb to more closely follow Cassandra's implementation, on which some clients rely. Ref #1139 Ref #693" * 'describe-splits/v2' of github.com:duarten/scylla: thrift: Implement describe_splits_ex based on Cassandra storage_service: Implement get_splits() function sstables: Add function to get key samples sstables/key: Add to_partition_key function size_estimates_recorder: Increase estimate accuracy sstables: Get estimates for a particular range sstables/key: Make key::kind public	2016-10-11 11:13:35 +03:00
Takuya ASADA	0007f2d838	dist/common/sbin: add scylla_cpuset_setup and scylla_dev_mode_setup to /usr/sbin We haven't added symlinks to /usr/sbin for newly created scripts, so add them. Fixes #1702 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1474879711-31793-1-git-send-email-syuu@scylladb.com>	2016-10-11 11:02:14 +03:00
Takuya ASADA	ccad720bb1	dist/common/script/scylla_io_setup: handle comma correctly when parsing cpuset The script mistakenly split value at "," when cpuset list is separated by comma. Instead of matching possible patterns of the argument, let's pass all characters until reach to space delimiter or end of line. Fixes #1716 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1476171037-32373-1-git-send-email-syuu@scylladb.com>	2016-10-11 10:42:32 +03:00
Duarte Nunes	d8cfc56376	thrift: Implement describe_splits_ex based on Cassandra This patch re-implements the describe_splits_ex() verb to more closely follow Cassandra's implementation, on which some clients rely. Ref #1139 Ref #693 Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-10-10 22:32:10 +02:00
Duarte Nunes	01ab2081cd	storage_service: Implement get_splits() function This patch implements the get_splits() function in storage_service, used to split a particular token range in slices of approximately the specified size, using the sample keys and estimates of the CF's sstables. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-10-10 22:32:08 +02:00
Duarte Nunes	c36dbaf0f1	sstables: Add function to get key samples This patch implements the get_key_samples() function, on which a future patch will base an implementation of the describe_splits() thrift verb closer to Cassandra's. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-10-10 19:50:14 +02:00
Duarte Nunes	fc07b66678	sstables/key: Add to_partition_key function Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-10-10 19:50:11 +02:00
Duarte Nunes	c19c633299	size_estimates_recorder: Increase estimate accuracy This patch uses the estimated_keys_for_range() function to get better estimates. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-10-10 17:52:16 +02:00
Duarte Nunes	ceed09b23e	sstables: Get estimates for a particular range This patch adds the estimated_keys_for_range() function, which estimates the number of keys present between the specified range. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-10-10 17:52:15 +02:00
Duarte Nunes	8c223b31c8	sstables/key: Make key::kind public Needed to create synthetic keys without any value but with ordering properties. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-10-10 17:47:24 +02:00
Avi Kivity	b305d92a65	Merge "housekeeping: check version during setup" from Amnon "The version is taken from the installation rather than the API, a mode command line indicated that this is part of the setup and uuid is used for the interaction with the checkversion server." * 'amnon/check_version_on_startup_v3' of github.com:cloudius-systems/seastar-dev: scylla_setup: Check and report the scylla version scylla-housekeeping: check version during setup	2016-10-10 16:37:14 +03:00
Vlad Zolotarov	ab748e829d	docs: tracing.md: initial commit Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com> Message-Id: <1475686745-20383-1-git-send-email-vladz@cloudius-systems.com>	2016-10-10 16:12:02 +03:00
Tomasz Grabiec	4357d0a6d9	db: Add counter for writes blocked on dirty memory There is already queue_length-requests_blocked_memory, but it's a gauge so does not reflect what happened between the sampling points. total_operations-requests_blocked_memory will allow to see if there were any (and how many) requests which were blocked by dirty memory. Message-Id: <1476098616-12682-1-git-send-email-tgrabiec@scylladb.com>	2016-10-10 14:25:22 +03:00
Pekka Enberg	3b75ff1496	docs/docker: Tag `--listen-address` as 1.4 feature The Docker Hub documentation is the same for all image versions. Tag `--listen-address` as 1.4 feature. Message-Id: <1475819164-7865-1-git-send-email-penberg@scylladb.com>	2016-10-10 13:26:16 +03:00
Vlad Zolotarov	006999f46c	api::storage_service::slow_query: don't use duration_cast in GET The slow_query_record_ttl() and slow_query_threshold() return the duration of the appropriate type already - no need for an additional cast. In addition there was a mistake in a cast of ttl. Fixes #1734 Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com> Message-Id: <1475669400-5925-1-git-send-email-vladz@cloudius-systems.com>	2016-10-09 18:09:13 +03:00
Takuya ASADA	469e9af1f4	dist/common/scripts/scylla_setup: use 'swapon -s' instead of 'swapon --show' Since Ubuntu 14.04 doesn't supported --show option, we need to prevent use it. Fixes #1740 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1475788340-22939-2-git-send-email-syuu@scylladb.com>	2016-10-09 18:05:14 +03:00
Takuya ASADA	8452045b85	dist/ubuntu: add realpath to dependency, requires for scylla_setup We need dependency to realpath, since scylla_setup using it. Fixes #1740. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1475788340-22939-1-git-send-email-syuu@scylladb.com>	2016-10-09 18:05:14 +03:00
Tomasz Grabiec	41e66ebce2	gdb: Introduce 'scylla heapprof' Presents current heap profile recording. Works in text mode or dumps to collapsed stacks format from which flame graph can be generated. To generate a flamegraph: (gdb) scylla heapprof --flame Wrote heapprof.stacks $ flamegraph.pl --colors mem < heapprof.stacks > heapprof.svg flamegraph.pl comes from: https://github.com/brendangregg/FlameGraph.git Text mode example: (gdb) scylla heapprof --min 100000000 All (274699676, #10213) \-- void* memory::cpu_pages::allocate_large_and_trim<memory::cpu_pages::allocate_large_aligned(unsigned int, unsigned int)::{lambda(unsigned int, unsigned int)#1}>(unsigned int, memory::cpu_pages::allocate_large_aligned(unsigned int, unsigned int)::{lambda(unsigned int, unsigned int)#1}) + 169 (268435456, #1) memory::allocate_large_aligned(unsigned long, unsigned long) + 87 memory::allocate_aligned(unsigned long, unsigned long) + 48 aligned_alloc + 9 logalloc::segment_zone::segment_zone() + 304 logalloc::segment_pool::allocate_segment() + 477 logalloc::segment_pool::segment_pool() + 304 __tls_init.part.801 + 72 logalloc::region_group::release_requests() + 1333 logalloc::region_group::add(logalloc::region_group*) + 514 The branches are formatted like this: -- <symbol> (<size>, #<count>) Where <size> is total size of live objects and <count> is total number of live objects, for all objects allocated from paths going through this node. Nodes which share the same <size> and <count> are stacked like this: -- <symbol_1> (<size>, #<count>) <symbol_2> <symbol_3> Message-Id: <1475583334-19524-1-git-send-email-tgrabiec@scylladb.com>	2016-10-09 10:54:08 +03:00

1 2 3 4 5 ...

10542 Commits