scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-01 04:26:48 +00:00

Author	SHA1	Message	Date
Takuya ASADA	898243929f	dist/common/scripts: specify queue settings for posix_net_conf.sh on scylla_prepare posix_net_conf.sh wants -sq/-mq options, so detect number of queues and specify the option in scylla_prepare.	2016-05-19 06:26:23 +09:00
Takuya ASADA	f84b7b094f	dist/common/scripts: drop special condition to enable SET_NIC on AMI, do this on AMI installation script Remove special case of SET_NIC in AMI, do this in scylla-ami-setup.service.	2016-05-19 06:25:41 +09:00
Takuya ASADA	49cdd0b786	dist: move '--cpuset' and '--smp' configuration to scylla_cpuset_setup / cpuset.conf These parameters are only required for AMI, not for non-AMI environment which want to enable SET_NIC, so split them to indivisual script / conf file, call it from AMI install script.	2016-05-19 06:25:28 +09:00
Takuya ASADA	46fa80a5a6	dist/common/scripts: replace IFNAME variable when --nic specified to scylla_sysconfig_setup scylla_sysconfig_setup has bug that it not replaces IFNAME variable, so fixed.	2016-05-19 06:25:15 +09:00
Glauber Costa	4eff07d773	database: reorder initialization In a preparation move for the LSA throttler, we have reordered the initialization fields in database.hh so that the sizes of the regions are computed before the initialization of the region. However, that seemingly innocent move broke one of our tests. The reason behind that, is that if we don't destroy the column families before destroying the region, we may end up with a use after free in the memtable destructor - that itself expects to call into the region. This patch reorders the initialization so that the CF list still comes after the dirty regions (therefore being destroyed first), while maintaining the relative ordering between size / region that we needed in the first place. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <0669984b5bccdb2c950f2444bdee4427abad56ba.1463508884.git.glauber@scylladb.com>	2016-05-18 11:02:40 +03:00
Asias He	eb9ac9ab91	gms: Optimize gossiper::is_alive In perf-flame, I saw in service::storage_proxy::create_write_response_handler (2.66% cpu) gossiper::is_alive takes 0.72% cpu locator::token_metadata::pending_endpoints_for takes 1.2% cpu After this patch: service::storage_proxy::create_write_response_handler (2.17% cpu) gossiper::is_alive does not show up at all locator::token_metadata::pending_endpoints_for takes 1.3% cpu There is no need to copy the endpoint_state from the endpoint_state_map to check if a node is alive. Optimize it since gossiper::is_alive is called in the fast path. Message-Id: <2144310aef8d170cab34a2c96cb67cabca761ca8.1463540290.git.asias@scylladb.com>	2016-05-18 10:12:38 +03:00
Avi Kivity	6ec0000df8	Merge "fix migration of tables with level > 0" from Rapahel	2016-05-17 19:14:01 +03:00
Raphael S. Carvalho	cbc2e96a58	tests: check that overlapping sstable has its level changed to 0 Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-05-17 11:11:05 -03:00
Raphael S. Carvalho	ee0f66eef6	db: fix migration of sstables with level greater than 0 Refresh will rewrite statistics of any migrated sstable with level > 0. However, this operation is currently not working because O_EXCL flag is used, meaning that create will fail. It turns out that we don't actually need to change on-disk level of a sstable by overwriting statistics file. We can only set in-memory level of a sstable to 0. If Scylla reboots before all migrated sstables are compacted, leveled strategy is smart enough to detect sstables that overlap, and set their in-memory level to 0. Fixes #1124. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-05-17 11:08:08 -03:00
Gleb Natapov	76e0eb426e	gossiper: simplify mark_alive() The code runs in a thread so there is no need to use heap to communicate between statements. Message-Id: <20160517120245.GK984@scylladb.com>	2016-05-17 15:37:21 +03:00
Avi Kivity	4413176051	Merge "reduce performance degradation when adding node" from Asias "With this series, the operations per second drop during adding node period gets much better. Before: 45K to 10K After: 45k to 38K Refs: #1223 "	2016-05-17 14:31:31 +03:00
Asias He	089734474b	token_metadata: Speed up pending_endpoints_for pending_endpoints_for is called frequently by storage_proxy::create_write_response_handler when doing cql query. Before this patch, each call to pending_endpoints_for involves converting a multimap (std::unordered_multimap<range<token>, inet_address>>) to map (std::unordered_map<range<token>, std::unordered_set<inet_address>>). To speed up the token to pending endpoint mapping search, a interval map is introduced. It is faster than searching the map linearly and can avoid caching the token/pending endpoint mapping. With this patch, the operations per second drop during adding node period gets much better. Before: 45K to 10K After: 45k to 38K (The number is measured with the streaming code skipping to send data to rule out the streaming factor.) Refs: #1223	2016-05-17 17:32:15 +08:00
Asias He	ee0585cee9	dht: Add default constructor for token It is needed to put token in to a boost interval_map in the following patch.	2016-05-17 17:32:15 +08:00
Amnon Heiman	ad34f80e6f	API: change cache_service, column_family and storage_proxy to rate object The API would expose now the rate_moving_average and rate_moving_average_and_histogram. The old end points remains for the transition period, but marked as depricated. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-05-17 11:56:52 +03:00
Amnon Heiman	b33ed48527	API Definition: change cache_service, column_family and storage_proxy to use rate objects This patch replaces the latency histogram to rate_moving_avrage_and_histogram and the counters to rate_moving_average. The old endpoints where left unchagned but marked as depricated when needed.	2016-05-17 11:55:06 +03:00
Amnon Heiman	20a48b0f20	API: column family stats break the map_reduce functionality This patch replaces the helper function for column family with two function, one that collect the relevant column family from all shareds and another one that do the translation to json object. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-05-17 11:53:15 +03:00
Amnon Heiman	750f30cf07	column_family: Change histogram to timed_rate_moving_average_and_histogram As part of moving the derived statistic in to scylla, this replaces the histogram object in the column_family to timed_rate_moving_average_and_histogram. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-05-17 11:53:15 +03:00
Amnon Heiman	468bcfbf1f	row_cache: Change counter to timed_rate_moving_average_and_histogram As part of moving the derived statistic in to scylla, this replaces the counter in the row_cache stats to timed_rate_moving_average_and_histogram. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-05-17 11:53:15 +03:00
Amnon Heiman	64e0c8cd1b	storage_proxy: Change histogram to timed_rate_moving_average_and_histogram As part of moving the derived statistic in to scylla, this replaces the histogram object in the storage_proxy to timed_rate_moving_average_and_histogram. and the read, write and range counters where replaced by rate_moving_average. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-05-17 11:52:16 +03:00
Amnon Heiman	f6a5a4e3da	API: Add helper function for the rate objects This patch adds the helper function that are used to sum the rate_moving_average and rate_moving_average_and_histogram. The current sum functionality for histogram was modified to support rate and histogram but return a histogram. This way current endpoints would continue to behave the same. It also cleans the histogram related method by using the plus operator in the histogram. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-05-17 11:49:34 +03:00
Amnon Heiman	8ef25ceb05	Add waited avrage rate related object This patch adds a few data structure for derived and accumulative statistics that are similiar to the yammer implementation used by the JMX. It also adds a plus operator to histogram which cleans the histogram usage. moving_average - An exponentially-weighted moving average. calculate an event rate on a given interval. rate_moving_average and timed_rate_moving_average - Calculate 1m, 5m and 15m ewma an all time avrage and a counter. rate_moving_average_and_histogram and timed_rate_moving_average_and_histogram - Combines a histogram with a rate_moving_average. It also expose a histogram API so it will be an easy task to replace a histogram with a timed_rate_moving_average_and_histogram. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-05-17 11:47:49 +03:00
Glauber Costa	17b9203719	database: invert order of elements So that the sizes of the region can be initialized first Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <dc3df186a977b492d83c0a397f206c2db940aa37.1463448522.git.glauber@scylladb.com>	2016-05-17 11:28:39 +03:00
Glauber Costa	2ff6d38d0c	database: use a single constructor for the column family We've been keeping two constructors for the column family to allow for a version without the commitlog. But it's by now quite complicated to maintain the two, because changes always have to be made in two places. This patch adds a private constructor that does the actual construction, and have the public constructors to call it. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <dd3cb0b9c20ad154a6131bad6ece619f70ed5025.1463448522.git.glauber@scylladb.com>	2016-05-17 11:28:39 +03:00
Glauber Costa	8fede5b98e	memtables: isolate logic for disk writes disabled When we have disk writes disabled, we exit immediately from the flush function. We can just encode that separately and pass a different function in the memtable_list creation. That simplifies the memtable flush a bit. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <908e3b5eb2c6ee84b8ad7b31c3673be5531a087c.1463448522.git.glauber@scylladb.com>	2016-05-17 11:28:38 +03:00
Glauber Costa	4981362f57	memtables: always seal through memtable_list seal function I would like to be able to apply a function at the end of every flush, that is common for both memtables and streaming memtables. For instance, to unthrottle current waiters. Right now some calls to seal_active_memtable are open coded, calling the column family's function directly, for both the main memtable list and the streaming list. This patch moves all the current open code callers to call the respective memtable_list function. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <0c780254f3c4eb03e2bcd856b83941cf49a84b85.1463448522.git.glauber@scylladb.com>	2016-05-17 11:28:37 +03:00
Takuya ASADA	4972a72380	dist: drop 'sudo -E' and SETENV for security reason, source envfile from scripts As Nadav pointed out, SETENV and sudo -E might be causes security hole: https://github.com/scylladb/scylla/issues/1028#issuecomment-196202171 So drop them now, sourcing envfiles from scylla_prepare / scylla_stop scripts instead. Also on "[PATCH] ubuntu: Fix the init script variable sourcing" thread we have problem to passing variables from envfiles to scylla_prepare / scylla_stop on Ubuntu, it seems better to sourcing from these scripts. Additionally, this fixes #1249 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1462989906-30062-1-git-send-email-syuu@scylladb.com>	2016-05-17 10:31:03 +03:00
Pekka Enberg	9c450f673c	cql3: Clean up prepared_metadata class Return vectors by const reference in prepared_metadata class and add a FIXME to result_message class. Message-Id: <1463425756-20225-1-git-send-email-penberg@scylladb.com>	2016-05-17 10:02:14 +03:00
Pekka Enberg	217c1ffa95	cql3: Specify result set flag ABI explicitly As Avi points out, the flag values are an ABI. So specify them explicitly. Message-Id: <1463413379-8355-1-git-send-email-penberg@scylladb.com>	2016-05-16 19:00:52 +03:00
Avi Kivity	a3b23d75b9	Merge "Fix Prepared message metadata serialization" "The Prepared message has a metadata section that's similar to result set metadata but not exactly the same. Fix serialization by introducing a separate prepared_metadata class like Origin has and implement serialization as per the CQL protocol specification. This fixes one CQL binary protocol version 4 issue that we currently have. The changes have been verified by running the gocql integration tests using v4. Please note that this series does not enable v4 for clients because Cassandra 2.1.x series only supports CQL binary protocol v3."	2016-05-16 18:59:54 +03:00
Pekka Enberg	868ff5107c	cql3: Introduce prepared_metadata class Introduce a new prepared_metadata class that holds prepared statement metadata and implement CQL binary protocol serialization that works for all versions.	2016-05-16 18:06:01 +03:00
Tomasz Grabiec	272e89846d	Merge branch 'cache' from git@github.com:haaawk/scylla.git From Piotr: Fixes #656. It makes it possible to slice using clustering ranges in mutation readers. We don't have row index yet so the slicing is just ignoring data which is out of range.	2016-05-16 14:44:33 +02:00
Piotr Jastrzebski	dcba6f5c45	Pass clustering_row_ranges to mutation readers. This will allow readers to reduce the amount of data read. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2016-05-16 14:36:57 +02:00
Pekka Enberg	a68671e247	cql3: Add column_specification::all_in_same_table() helper We need it the prepared_metadata class that we're about to introduce.	2016-05-16 14:13:31 +03:00
Takuya ASADA	80037aa95b	dist/common/scripts: don't proceed to run scylla_raid_setup when disks not selected, on interactive RAID setup When disks not selected, run disk select prompt again. Fixes #1260 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1463388933-3640-1-git-send-email-syuu@scylladb.com>	2016-05-16 13:45:17 +03:00
Pekka Enberg	adfb4d7bbd	cql3: Move result_set class implementation to source file	2016-05-16 13:20:45 +03:00
Pekka Enberg	8552f222f5	cql3: Clean up result_set class Kill some left-over ifdef'd code from the result_set class. Message-Id: <1463392997-22921-1-git-send-email-penberg@scylladb.com>	2016-05-16 13:09:37 +03:00
Piotr Jastrzebski	23c23abe53	Make memtable mutation_reader slice using clustering ranges. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2016-05-16 11:46:41 +02:00
Piotr Jastrzebski	484d2ecd0a	Slice data with clustering key range in sstable reader Add additional parameters to mp_row_consumer to be able to fetch only cells for given clustering key ranges This will be used in row_cache when it will work on clustering key level instead of partition key level. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2016-05-16 11:46:30 +02:00
Piotr Jastrzebski	8307681975	Introduce clustering_ranges type. It will be used to slice data returned by mutation_readers. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2016-05-16 11:46:09 +02:00
Amnon Heiman	7e07d97e4b	API utils: Adding rate moving avrage rate_moving_average and rate_moving_average_and_histogram are type that are used by the JMX. They are based on the yammer meter and timer and are used to collect derivative information. Specificlly: rate_moving_average calculate rates and rate_moving_average_and_histogram collect rates and histogram. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-05-16 11:40:19 +03:00
Pekka Enberg	17765b6c06	Merge seastar upstream * seastar 3dec26f...6a849ac (4): > seastar::socket: Be resilient against ENOTCONN > Merge " improve performance and predictability of syscall thread communications" from Glauber > rpc_test: Shutdown properly > [PATCH} future: better detect get_future() on already used promise	2016-05-16 08:04:47 +03:00
Yoav Kleinberger	de7952a8db	tools/scyllatop: log input from collectd for easier debugging When running with DEBUG verbosity, scyllatop will now log every single value it receives from collectd. When you suspect that scyllatop is somehow distorting values, this is a good way to check it. Signed-off-by: Yoav Kleinberger <yoav@scylladb.com> Message-Id: <1463320730-6631-1-git-send-email-yoav@scylladb.com>	2016-05-15 19:17:10 +03:00
Tomasz Grabiec	1eabe9b840	storage_proxy: Add trace-level logging for mutating Message-Id: <1462978554-31217-1-git-send-email-tgrabiec@scylladb.com>	2016-05-12 13:52:56 +03:00
Tomasz Grabiec	7207cc8b1a	storage_proxy: Improve error reporting Knowing the source node can help in debugging the issue. Message-Id: <1462978535-31164-1-git-send-email-tgrabiec@scylladb.com>	2016-05-12 13:52:39 +03:00
Pekka Enberg	b5d9aa866d	Merge "Fixes for schema synchronization" from Tomek "Writes may start to be rejected by replicas after issuing alter table which doesn't affect columns. This affects all versions with alter table support. Fixes #1258"	2016-05-12 09:43:25 +03:00
Duarte Nunes	7dbeef3c39	storage_service: Fix ignored future in on_alive This patch ensures the future created by invoke_on_all is not ignored by waiting on it, which is safe to do since we are within a seastar::async context. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1462989837-7326-1-git-send-email-duarte@scylladb.com>	2016-05-12 09:03:46 +03:00
Tomasz Grabiec	13d8cd0ae9	migration_manager: Invalidate prepared statements on every schema change Currently we only do that when column set changes. When prepared statements are executed, paramaters like read repair chance are read from schema version stored in the statement. Not invalidating prepared statements on changes of such parameters will appear as if alter took no effect. Fixes #1255. Message-Id: <1462985495-9767-1-git-send-email-tgrabiec@scylladb.com>	2016-05-12 08:58:40 +03:00
Tomasz Grabiec	90c31701e3	tests: Add unit tests for schema_registry	2016-05-11 17:31:22 +02:00
Tomasz Grabiec	443e5aef5a	schema_registry: Fix possible hang in maybe_sync() if syncer doesn't defer Spotted during code review. If it doesn't defer, we may execute then_wrapped() body before we change the state. Fix by moving then_wrapped() body after state changes.	2016-05-11 17:31:22 +02:00
Tomasz Grabiec	8703136a4f	migration_manager: Fix schema syncing with older version The problem was that "s" would not be marked as synced-with if it came from shard != 0. As a result, mutation using that schema would fail to apply with an exception: "attempted to mutate using not synced schema of ..." The problem could surface when altering schema without changing columns and restarting one of the nodes so that it forgets past versions. Fixes #1258. Will be covered by dtest: SchemaManagementTest.test_prepared_statements_work_after_node_restart_after_altering_schema_without_changing_columns	2016-05-11 17:29:14 +02:00

... 46 47 48 49 50 ...

11716 Commits