scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-31 20:16:43 +00:00

Author	SHA1	Message	Date
Takuya ASADA	ed4cd1908f	dist/common/scripts/scylla_selinux_setup: correct CentOS/RHEL detection CentOS/RHEL is using SELinux, and it's NOT Debian variant, so fixed from "is_debian_variant" to "! is_debian_variant". Fixes #1930 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1481643873-28984-1-git-send-email-syuu@scylladb.com>	2016-12-13 18:29:29 +02:00
Takuya ASADA	6c0dc55495	dist/common/scripts/scylla_selinux_setup: to use is_debian_variant(), need to source /usr/lib/scylla/scylla_lib.sh This fixes following command not found error: ``` /usr/sbin/scylla_selinux_setup: line 7: is_debian_variant: command not found ``` Fixes #1929 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1481643308-28637-1-git-send-email-syuu@scylladb.com>	2016-12-13 18:29:13 +02:00
Takuya ASADA	3b74c50546	dist/ubuntu: add uuidgen to package dependency We haven't added uuidgen to Ubuntu/Debian package dependency, so scylla_setup script may abort because of command not found. Fixes #1928 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1481642385-27941-1-git-send-email-syuu@scylladb.com>	2016-12-13 18:28:48 +02:00
Duarte Nunes	1e75a4950e	database: Complete query when hitting partition limit Currently, we weren't completing a query as early as possible if it reached the partition limit, we instead had to wait until reaching the end of the specified partition ranges. This patches fixes that by including a check to the partition limit in the termination condition. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20161213114559.26438-1-duarte@scylladb.com>	2016-12-13 14:53:46 +02:00
Tomasz Grabiec	f451014785	schema: Implement operator<< for column_mapping Message-Id: <1481310679-14074-1-git-send-email-tgrabiec@scylladb.com>	2016-12-13 12:20:46 +02:00
Tomasz Grabiec	059a1a4f22	db: Fix commitlog replay to not drop cell mutations with older schema column_mapping is not safe to access across shards, because data_type is not safe to access. One of the manifestation of this is that abstract_type::is_value_compatible_with() always fails if the two types belong to different shards. During replay, column_mapping lives on the replaying shard, and is used by converting_mutation_partition_applier against the schema on the target shard. Since types in the mapping will be considered incompatible with types in the schema, all cells will be dropped. Fix by using column_mapping in a safe way, by copying it to the target shard if necessary. Each shard maintains its own cache of column mappings. Fixes #1924. Message-Id: <1481310463-13868-1-git-send-email-tgrabiec@scylladb.com>	2016-12-13 12:19:32 +02:00
Avi Kivity	32d55bbb4c	Merge seastar upstream * seastar 0773e98...6fbd792 (2): > tls: Only run our "verify" function in client session > Merge "Clean the metric definition" from Amnon Includes patch from Amnon adjusting the metrics registration due to seastar API changes.	2016-12-13 12:17:14 +02:00
Avi Kivity	6f9c317b91	Merge "Use uuid file in housekeeping" from Amnon "This patch adds the use of uuid file to the housekeeping daily version check. uuid file are optional, if a file is missing no uuid will be used."	2016-12-13 10:52:44 +02:00
Avi Kivity	c67782f169	Merge seastar upstream * seastar 0a74317...0773e98 (6): > tls: Add support for client cetrificate verification & priority strings > semaphore: add consume_units > semaphore: add available_units() > thread: check need_preempt for threads in a scheduling group as well > tutorial: fix semaphore example, and text > stop_iteration: add && and \|\| operators	2016-12-12 18:06:19 +02:00
Avi Kivity	c801cc4bd1	Merge "streaming and repair updates" from Asias "This series: - We can make reader with ranges - Fix possible use after free of 'si' - Streaming ranges now are sorted and merged - Fix shard_begin shard_end end loop in both streaming and repair"	2016-12-12 11:32:42 +02:00
Asias He	ba54654af3	streaming: Use interval_set to sort and merge ranges So that the ranges are sorted and have no overlaps. We can have less ranges to deal with and it can help the mutation readers to optimize. Here is an exmaple of ranges generated by repair: Before: INFO 2016-12-07 17:44:21,185 [shard 0] stream_session - cf_id = dec9fa90-bc3b-11e6-af78-000000000001, before ranges = {(-3383928698815274642, -3376937163195039606], (-7260764223708720005, -7251657821052234309], (-4767213984179237293, -4747032371925842389], (-7645879646119667643, -7589962743703481776], (-2340199306656526861, -2320523117224780931], (-576028861239229331, -560973674020019962], (-4070378863644120252, -3987599893827407860], (-2551584407739673151, -2498779102482524711], (-5416061903556353312, -5354212455975869358], (37594980457713898, 67885601051654285], (3083778975065200884, 3091232478835418439], (3131345970514528877, 3187922544267434961], (5765437476661317163, 5778671293583720541], (5960610072466058818, 5972289771228014343], (7749618183851698485, 7758080813117351135], (-3987599893827407860, -3899198931034439776], (-7251657821052234309, -7131649010279865221], (-3576581915808403133, -3383928698815274642], (-417850207760366422, -327959672080599465], (-2671876682129336880, -2551584407739673151], (-1305178847032904465, -1137497074548854552], (8540448858050275827, 8610171849752115483], (-560973674020019962, -417850207760366422], (-2498779102482524711, -2340199306656526861], (2394447940525988167, 2523396860109747637], (-6703329224557608009, -6517757811218772762], (-3675103288021821677, -3576581915808403133], (-5622185785296846551, -5416061903556353312], (8610171849752115483, 8742605005068551458], (8068079250973315241, 8185655671734937642], (560264964510741191, 790641981923757238], (5581202487214475094, 5765437476661317163], (8742605005068551458, 8923908282731801645], (-6038176423022601107, -5622185785296846551], (5778671293583720541, 5960610072466058818], (-3899198931034439776, -3675103288021821677], (8356739976149429222, 8540448858050275827], (-6517757811218772762, -6038176423022601107], (-8052600134279395253, -7645879646119667643], (-327959672080599465, 37594980457713898], (7758080813117351135, 8019254284118543066], (4781565016737645510, 5067070718000527886], (2523396860109747637, 3083778975065200884], (-5354212455975869358, -4767213984179237293], (6784138025918878582, 7190719703944308372], (67885601051654285, 447405341661896387], (-2190610927722759275, -1305178847032904465], (-4747032371925842389, -4070378863644120252]}, size=48 After: INFO 2016-12-07 17:44:21,185 [shard 0] stream_session - cf_id = dec9fa90-bc3b-11e6-af78-000000000001, after ranges = {(-8052600134279395253, -7589962743703481776], (-7260764223708720005, -7131649010279865221], (-6703329224557608009, -3376937163195039606], (-2671876682129336880, -2320523117224780931], (-2190610927722759275, -1137497074548854552], (-576028861239229331, 447405341661896387], (560264964510741191, 790641981923757238], (2394447940525988167, 3091232478835418439], (3131345970514528877, 3187922544267434961], (4781565016737645510, 5067070718000527886], (5581202487214475094, 5972289771228014343], (6784138025918878582, 7190719703944308372], (7749618183851698485, 8019254284118543066], (8068079250973315241, 8185655671734937642], (8356739976149429222, 8923908282731801645]}, size=15	2016-12-12 11:09:26 +08:00
Asias He	e523803a5d	token_metadata: Introduce interval_to_range helper It is used to convert a boost::icl::interval<token> interval back to a range<token>.	2016-12-12 11:09:26 +08:00
Asias He	af3d76e6ac	repair: Fix a typo in the log sucessfully -> successfully	2016-12-12 11:09:26 +08:00
Asias He	374324e6fb	repair: Fix shard_begin and shard_end A range now alternates between different shards: the first part of the range goes to shard X, the next to shard X+1, but after a while we go back to shard X. So we can't do a simple loop between shard_begin and shard_end. Fix by using the newly introduced dht::split_range_to_shards Use the cf.make_streaming_reader with ranges to simplify the code a bit.	2016-12-12 11:09:26 +08:00
Asias He	1987264beb	streaming: Make streaming reader with ranges Now that we have the new interface to make readers with ranges, we can simplify the code a lot. 1) Less readers are needed before: number of ranges of readers after: smp::count readers at most 2) No foreign_ptr is needed There is no need to forward to a shard to make the foreign_ptr for send_info in the first phase and forward to that shard to execute the send_info in the second phase. 3) No do_with is needed in send_mutations since si now is a lw_shared_ptr 4) Fix possible user after free of 'si' in do_send_mutations We need to take a reference of 'si' when sending the mutation with send_stream_mutation rpc call, otherwise: msg1 got exception si->mutations_done.broken() si is freed msg2 got exception si is used again The issue is introduced in `dc50ce0ce5` (streaming: Make the mutation readers when streaming starts) which is master only, branch 1.5 is not affected.	2016-12-12 09:04:21 +08:00
Asias He	463cc4fbde	dht: Introduce split_ranges_to_shards Split a ranges into shard ranges map with ring_position_range_sharder helper.	2016-12-12 09:04:21 +08:00
Asias He	044c4ff44c	dht: Introduce split_range_to_shards Split a range into shard ranges map with ring_position_range_sharder helper.	2016-12-12 09:04:21 +08:00
Asias He	cd2105b8bd	database: make_streaming_reader for ranges Allow to make a streaming reader with a vector of ranges in addition to a single range. This will be used soon in following streaming patch. We can make the reader more efficient later.	2016-12-12 09:04:21 +08:00
Duarte Nunes	ada2f1092e	dht: Make i_partitioner::tri_compare pure virtual This patch makes the i_partitioner::tri_compare() function pure virtual as it is overridden by all partitioners. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20161211172037.16496-1-duarte@scylladb.com>	2016-12-11 19:29:37 +02:00
Duarte Nunes	bb66b051ed	dht: Make i_partitioner::tri_compare memory safe This patch fixes a typo in i_partitioner::tri_compare() where we were using std::max instead of std::min, thus avoiding accessing random memory and getting random results. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20161211165043.17816-1-duarte@scylladb.com>	2016-12-11 18:58:10 +02:00
Amnon Heiman	08dcd8cb4a	scylla housekeeping ubuntu service: use uuid file This patch adds uuid file support for ubuntu system. It also split the behaviour between restart and daily checks. The first run in r mode and the second in d mode. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-12-11 16:35:07 +02:00
Amnon Heiman	6fef24aaf0	housekeeping systemd service: use uuid file This set the housekeeping systemd service to use a uuid file and use daily mode. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-12-11 16:02:16 +02:00
Amnon Heiman	17b8306bc4	scylla-housekeeping support uuid file Allows scylla-housekeeping getting the uuid from a file instead of the command line. If the file is missing no uuid will be used. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-12-11 16:00:34 +02:00
Avi Kivity	299d1fad0b	Merge "reduce bloom filter overhead in compaction" from Raphael "Function to calculate maximum purgeable timestamp is made 10 times faster when compacting sstables overlap with 10% of all sstables. That's possible with an incremental selector that will incrementally select sstables based on key being compacted. Currently, we iterate through all non-compacting sstables and consult their bloom filter to determine max purgeable timestamp, and that will be very expensive for compactions that are frequently deciding whether or not to purge tombstones." * 'filter_overhead_fix_v4' of github.com:raphaelsc/scylla: compaction: reduce bloom filter overhead with incremental selector tests: add test for sstable set's incremental selector sstable_set: introduce incremental selector compatible_ring_position: add function to return token	2016-12-11 09:46:58 +02:00
Glauber Costa	5803957ab5	compaction: fix build Commit `732ee275` moved tracking of one statistics value inside a lambda without capturing this in that lambda. Compilation fails as a result. Signed-off-by: Glauber Costa <glauber@scylladb.com> Reviewed-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <68860640f4533dd43e43f341f1620e25464b700b.1481313455.git.glauber@scylladb.com>	2016-12-10 09:00:20 +02:00
Raphael S. Carvalho	fcfc84e836	compaction: reduce bloom filter overhead with incremental selector The procedure to calculate max purgeable timestamp is optimized by only visiting sstables that overlap with key being currently compacted. That's done using incremental sstable selector. Function to calculate maximum purgeable timestamp is made 10 times faster when compacting sstables overlap with 10% of all sstables. Fixes #1322. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-12-09 16:17:17 -02:00
Raphael S. Carvalho	548f6066c5	tests: add test for sstable set's incremental selector Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-12-09 16:17:17 -02:00
Raphael S. Carvalho	02541e15c1	sstable_set: introduce incremental selector Incrementally select sstables from sstable set using token in ascending order. For leveled strategy, it returns all sstables that belong to current interval. For other strategies, it just return all sstables from the set. Useful for compaction which needs all sstables that overlap with key being currently compacted to calculate maximum purgeable timestamp. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-12-09 16:17:16 -02:00
Glauber Costa	9b5e6d6bd8	commitlog: correctly report requests blocked The semaphore future may be unavailable for many reasons. Specifically, if the task quota is depleted right between sem.wait() and the .then() clause in get_units() the resulting future won't be available. That is particularly visible if we decrease the task quota, since those events will be more frequent: we can in those cases clearly see this counter going up, even though there aren't more requests pending than usual. This patch improves the situation by replacing that check. We now verify whether or not there are waiters in the semaphore. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <113c0d6b43cd6653ce972541baf6920e5765546b.1481222621.git.glauber@scylladb.com>	2016-12-09 15:02:26 +02:00
Raphael S. Carvalho	732ee275f8	compaction: fix running compaction counter when splitting sstables The counter was being increased before taking the semaphore, so every pending split would count as a running compaction which misleads the user as a result. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <f2050cc3599cee7af29d4579368a154708b37731.1481248048.git.raphaelsc@scylladb.com>	2016-12-09 15:01:43 +02:00
Raphael S. Carvalho	453620a316	compatible_ring_position: add function to return token Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-12-08 14:25:29 -02:00
Avi Kivity	872b5ef5f0	sstables: fix probe with Unknown component Commit `53b7b7def3` ("sstables: handle unrecognized sstable component") ignores unrecognized components, but misses one code path during probe_file(). Ignore unrecognized components there too. Fixes #1922. Message-Id: <20161208131027.28939-1-avi@scylladb.com>	2016-12-08 15:24:25 +01:00
Glauber Costa	733d87fcc6	database: try to acquire semaphore before we start flush As Tomek pointed out, as we are starting the flush before we acquire the semaphore, we are not really limiting parallelism, but only delaying the end of the flush instead. Fixes #1919 Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <6cbf9ec2f3a341c76becf94f794cfa16539c5192.1481120410.git.glauber@scylladb.com>	2016-12-08 12:18:32 +01:00
Tomasz Grabiec	3511bf4a81	Merge branch 'tgrabiec/memtable-gentle-clearing' from seastar-dev.git When row cache is disabled, update_cache() will do nothing to the memtable. Active readers may keep the memtable alive for unbounded amount of time, preventing it from going away. This doesn't play well with virtual dirty accounting. Soon before calling update_cache(), the memory which was subtracted during flush is added back to the amount of virtual dirty memory. If there was write pressure all along, we will be at the dirty memory limit. When we give back subtracted memory this will put virtual dirty way above the limit. This will stall all writes until another memtable flush drags virtual dirty down or readers finally release the memtable. We want to prevent upward jumps of virtual dirty. First part of the fix is to ensure that as long as the memtable's region is in the dirty group, we will not revert flushed memory. This must happen synchronously from region's memory being removed from the group in order to prevent upward virtual dirty jumps. To make this easier, tracking of flushed memory was moved to the memtable object. Another part of the fix is to gradually clear the memtable when cache is disabled in a similar fashion as when it's moved to cache. This ensures that the actual memory held by memtable's region is released sooner than it dies. Refs #1879	2016-12-08 12:18:32 +01:00
Gleb Natapov	a05516f14c	storage_proxy: wire up range_slice_timeouts, range_slice_unavailables and read_unavailables counters Message-Id: <20161206105154.GL1866@scylladb.com>	2016-12-08 11:42:52 +02:00
Avi Kivity	5530a61975	stables: fix build with older boost (boost::variant::get<T&>) Older boost doesn't support boost::variant::get<T&> (where the type parameter is reference qualified); remove (unneeded anyway).	2016-12-08 10:56:05 +02:00
Pekka Enberg	0bc3ce7e09	Merge "sstables: remove sharding metadata from Statistics component" from Avi "Due to my misreading of Cassandra code, I thought it would ignore new components in the Statistics component; however, it doesn't, and the change (introduced in `bdd11648ac` ("sstables: add intra-node sharding metadata") breaks sstable2json and likely any Cassandra code that touches sstables. To fix, move the sharding data into a new component ("Scylla.db"), which Cassandra does ignore. The new component is designed to be extensible so we don't experience the same issue later on."	2016-12-08 10:14:07 +02:00
Avi Kivity	7f26f9c0f9	Merge "repair refactor and fix" from Asias * tag 'asias/repair/subranges/refactor_fix/v1' of github.com:cloudius-systems/seastar-dev: repair: Limit the number of sub ranges repair: Use estimated_keys_for_range in repair_cf_range repair: Extract the target_partitions into repair_info class repair: Put request_transfer_ranges into repair_info class repair: Introduce check_failed_ranges helper repair: Introduce do_streaming helper repair: Make the neighbors const reference repair: Introduce repair_info repair: Attach the repair id in the stream plan name	2016-12-08 10:06:39 +02:00
Tomasz Grabiec	f7197dabf8	commitlog: Fix replay to not delete dirty segments The problem is that replay will unlink any segments which were on disk at the time the replay starts. However, some of those segments may have been created by current node since the boot. If a segment is part of reserve for example, it will be unlinked by replay, but we will still use that segment to log mutations. Those mutations will not be visible to replay after a crash though. The fix is to record preexisting segents before any new segments will have a chance to be created and use that as the replay list. Introduced in `abe7358767`. dtest failure: commitlog_test.py:TestCommitLog.test_commitlog_replay_on_startup Message-Id: <1481117436-6243-1-git-send-email-tgrabiec@scylladb.com>	2016-12-07 15:54:47 +02:00
Avi Kivity	4fedbf8430	Merge "service::storage_proxy: rework collectd counters registration" from Vlad - Add "coordinator" and "replica" categories - Use a new seastar/metrics_registration framework * 'rearrange-storage-proxy-stats-v4' of github.com:cloudius-systems/seastar-dev: service::storage_proxy: rework the collectd counters registration service/storage_proxy: regroup collectd statistics	2016-12-07 15:38:40 +02:00
Avi Kivity	3c3a18f222	sstables: move sharding metadata from Statistics component to a new Scylla component The Cassandra derived sstable tools (and likely Cassandra itself) object to a new sub-component in the Statistics component; create a new Scylla component instead to host this data.	2016-12-07 15:20:13 +02:00
Avi Kivity	24140ec8c6	sstables: add support for sets of discriminated union types Allow declaring discriminated unions (with an enum type as the discriminant and any sstable serializable type as a value) and sets of these unions, with the disciminant as the key. Parsers and writers are auto-generated.	2016-12-07 13:27:52 +02:00
Avi Kivity	e0cce9d299	Merge "streaming: Improve logging" from Asias "This seires adds streaming bandwidth and streaming plan name to the log when streaming is finished."	2016-12-07 12:21:47 +02:00
Amos Kong	f32f7993cc	systemd: reset housekeeping timer at each start Currently housekeeping timer won't be reset when we restart scylla-server. We expect the service to be run at each start, it will be consistent with upstart script in Ubuntu 14.04 When we restart scylla-server, housekeepting timer will also be restarted, so let's replace "OnBootSec" with "OnActiveSec". Fixes: #1601 Signed-off-by: Amos Kong <amos@scylladb.com> Message-Id: <a22943cc11a3de23db266c52fd476c08014098c4.1480607401.git.amos@scylladb.com>	2016-12-06 18:33:37 +02:00
Takuya ASADA	5a5ab51254	dist/ubuntu/dep: fix incorrect file path to detect previously built .deb existance check Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1480667672-9453-4-git-send-email-syuu@scylladb.com>	2016-12-06 12:06:30 +02:00
Takuya ASADA	6dd6b868a6	scripts/scylla_install_pkg: support Debian Supported Debian on installation script. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1480667672-9453-3-git-send-email-syuu@scylladb.com>	2016-12-06 12:06:30 +02:00
Takuya ASADA	7f2df8f86e	dist/common/scripts: introduce scylla_lib.sh To reduce duplicated code and simplified scripts introduce scylla_lib.sh for shellscripts which provides functions to classify distributions, and load all sysconfig files. This also fixes script bugs to misdetect Debian and RHEL. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1480667672-9453-2-git-send-email-syuu@scylladb.com>	2016-12-06 12:06:30 +02:00
Takuya ASADA	8464903021	dist/common/systemd/scylla-housekeeping.timer: workaround to avoid crash of systemd on RHEL 7.3 RHEL 7.3's systemd contains known bug on timer.c: https://github.com/systemd/systemd/issues/2632 This is workaround to avoid hitting bug. Fixes #1846 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1480452194-11683-1-git-send-email-syuu@scylladb.com>	2016-12-06 10:48:28 +02:00
Takuya ASADA	b2c0059da3	dist/common/scripts/scylla_coredump_setup: use systemd-coredump on Ubuntu 16.04 Ubuntu 16.04 has systemd-coredump, better to use it. Fixes #1916 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1480679267-30844-1-git-send-email-syuu@scylladb.com>	2016-12-05 17:09:38 +02:00
Takuya ASADA	2976799ef2	main: fix startup failing on Ubuntu 15.10/16.04 Since Ubuntu 15.10/16.04 still uses Upstart to manage GUI session (not as init), when we directly launch Scylla on Ubuntu's GUI Terminal(not using systemctl or initctl), raise(SIGSTOP) mistakenly calls (Because GUI session has "UPSTART_JOB" environment variable, won't happen when running Scylla as systemd service). To avoid this, we need to verify UPSTART_JOB == "scylla-server". If it's part of GUI session UPSTART_JOB has to be "unity7", we need to avoid raise(SIGSTOP) in that case. Fixes #1199 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1480620421-28967-1-git-send-email-syuu@scylladb.com>	2016-12-05 16:28:25 +02:00

1 2 3 4 5 ...

10875 Commits