scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-31 12:06:44 +00:00

Author	SHA1	Message	Date
Takuya ASADA	0c3bb2ee63	dist/common/scripts/scylla_prepare: drop unnecesarry multiqueue NIC detection code on scylla_prepare Right now scylla_prepare specifies -mq option to posix_net_conf.sh when number of RX queues > 1, but on posix_net_conf.sh it sets NIC mode to sq when queues < ncpus / 2. So the logic is different, and actually posix_net_conf.sh does not need to specify -sq/-mq now, it autodetects queue mode. So we need to drop detection logic from scylla_prepare, let posix_net_conf.sh to detect it. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1472544875-2033-1-git-send-email-syuu@scylladb.com>	2016-08-30 16:51:15 +03:00
Pekka Enberg	eff14bae0e	transport/server: Explict CQL type IDs The CQL type IDs are specified as hex in the CQL binary protocol specification. Define CQL type IDs in the code explicitly to make reviewing the code and adding new types easier. Message-Id: <1472537971-26053-1-git-send-email-penberg@scylladb.com>	2016-08-30 09:45:26 +03:00
Avi Kivity	809d739ae8	Merge seastar upstream * seastar 2b07b1f...0303e0c (3): > scripts/posix_net_conf.sh: add support --cpu-mask mode > file: improve tmpfs support > file::close: remove trailing newline in log message	2016-08-29 13:26:04 +03:00
Pekka Enberg	2d3aee73a6	systemd: Don't start Scylla service until network is up Alexandr Porunov reports that Scylla fails to start up after reboot as follows: Aug 25 19:44:51 scylla1 scylla[637]: Exiting on unhandled exception of type 'std::system_error': Error system:99 (Cannot assign requested address) The problem is that because there's no dependency to network service, Scylla simply attempts to start up too soon in the boot sequence and fails. Fixes #1618. Message-Id: <1472212447-21445-1-git-send-email-penberg@scylladb.com>	2016-08-29 13:15:39 +03:00
Takuya ASADA	74d994f6a1	dist/common/scripts/scylla_setup: support enabling services on Ubuntu 15.10/16.04 Right now it ignores Ubuntu, but we shareing .service between Fedora/CentOS and Ubuntu >= 15.10, so support it. Fixes #1556. Message-Id: <1471932814-17347-1-git-send-email-syuu@scylladb.com>	2016-08-29 13:13:14 +03:00
Avi Kivity	fb3a83a811	Merge "Slow query logging" from Vlad "This series introduces a "slow query logging" feature that allows logging the queries that take more than a specified threshold time to complete. Once such a query detected, it will be logged in a system_traces.node_slow_log table. In addition all trace for that query that have been collected on a Coordinator are going to be written as well. If the handling time on a replica in the context of a query takes more than (the same) threshold they are going to be written too. The raw in a node_slow_log contains a session_id of a corresponding tracing session, thereby allowing the user to query the system_traces tables for the corresponding trace records. The schema of the node_slow_log table is as follows: CREATE TABLE system_traces.node_slow_log ( node_ip inet, shard int, session_id uuid, date timestamp, start_time timeuuid, command text, duration int, parameters map<text, text>, source_ip inet, table_names set<text>, username text, PRIMARY KEY (start_time, node_ip, shard)) WITH default_time_to_live = 86400 where - node_ip: IP of the coordinator Node. - shard: shard ID on a Coordinator where the query was handled. - session_id: ID of a corresponding tracing session. - date: a time when the query has began. - start_time: a time-based UUID for this query (needed for a primary key mostly). - command: a query string. - duration: a time it took to handle this query (in microseconds). - parameters: a map of query parameters (like in system_traces.sessions). - source_ip: IP of a Client that sent this query. - table_names: a set of "<keyspace>.<table name>" strings representing column families used in this query. - username: a user name used for this query. The good thing is that most of the data we needed is already collected by the regular tracing framework. The only missing ones are a username and tables' names. So, this series makes the framework collect them too. The whole feature is integrated in the Tracing framework. The main changes to the framework that were made are as follows: - Store the constant capabilities of the tracing session in an enum_set, e.g.: - primary/secondary. - write on close. - Introduce two new capabilities to a tracing session of a specific query: - full tracing: collect all traces for this query (as it is before this series). - log slow query: log this query if its duration is above the threshold. These two capabilities may be defined independently. - Add the logic that handles the "log slow query"-only case: - Build the parameters<sstring, sstring> map only if the "duration" is above the given threshold. - The same about writing the trace entries. - In a not-only "log slow query" case: - Write the node_slow_log entry. - Extend the trace_info struct to pass slow query threshold and TTL to the replica Node. In addition to above this series add the capability to configure the slow query logging threshold and a TTL for the node_slow_log records. The heaviest patch in the series is the last one. The series contains a few cosmetic (renaming) patches that are meant to align the naming of the existing methods with the ones the last one is going to add."	2016-08-29 13:11:36 +03:00
Gleb Natapov	a2cdddb795	storage_proxy: forward mutation write with correct timeout value Now that mutation handler knows how much time is left for mutation write to be handled it can use this knowledge to set correct timeout for forwarded mutations. Message-Id: <20160828080637.GE9243@scylladb.com>	2016-08-29 13:06:36 +03:00
Avi Kivity	6cb796f38b	Merge seastar upstream * seastar ef063c5...2b07b1f (1): > file: make close() more robust against concurrent calls	2016-08-29 12:25:57 +03:00
Avi Kivity	f5f58b46c7	sstables: enable write-behind Write-behind allows a single sstable write to saturate the disk, improving throughput. Later we can take advantage of this to reduce the number of sstables being written concurrently.	2016-08-29 12:25:15 +03:00
Pekka Enberg	c5e5e7bb40	dist/docker: Clean up Scylla description for Docker image Message-Id: <1472145307-3399-1-git-send-email-penberg@scylladb.com>	2016-08-29 10:48:06 +03:00
Vlad Zolotarov	a491ac0f18	tracing: introduce a log_slow_query logic The main idea is to log queries that take "too long" to complete. The "too long" is above the given threshold. To achieve the above this patch does the following: - Introduce two new properties to the tracing::trace_state: - "Full tracing": when the tracing of this query was explicitly requested. In this state we will record all possible traces related to this query: both on the coordinator and on any replica involved. - "Log slow query": when slow query logging is enabled. If slow query logging is enabled and a session's "duration" is above the specified threshold we will create a record in the "slow queries log" and write all trace records created on the coordinator and on a replica if a replica's session lasts longer than that threshold. (We will propagate the Coordinator's slow query logging threshold to replicas in the context of a specific tracing/logging session). The properties above are independent, namely they may be enabled and/or disabled independently and any combination of them is legal (naturally, creating a tracing session when both states above are disabled makes no sense). - Instrument the tracing::tracing service to allow the following: - Enable/disable slow query logging. - Set/get the slow query duration threshold (in microseconds). - Set/get the slow query log record TTL value (in seconds). - Instrument the trace_keyspace_helper to write a slow query log entry when requested. - The slow query logging is disabled by default and the threshold is set to half a second. - The TTL of a slow log record is set to 86400 seconds by default. - It makes sense to use the same "slow query logging threshold" and a "slow query record TTL" both on a coordinator and on a replica Nodes in a context of the same tracing session: - Pass both TTL and a threshold to the replica in a trace_info. This patch also implements the new slow query logging specific logic: - Don't write the pending tracing records before the end of a tracing session until "duration" reaches the logging threshold. - Don't build the parameters<sstring, sstring> map unless we know we will write it to I/O. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-28 18:28:44 +03:00
Avi Kivity	e81c1df557	Merge seastar upstream * seastar 6fadd98...ef063c5 (2): > rpc: pass a timeout to a verb's server handler if the one was specified by a client > rpc: cleanup the old metaprogramming craft	2016-08-25 17:53:19 +03:00
Paweł Dziepak	6012a7e733	mutation_partition: fix iterator invalidation in trim_rows Reversed iterators are adaptors for 'normal' iterators. These underlying iterators point to different objects that the reversed iterators themselves. The consequence of this is that removing an element pointed to by a reversed iterator may invalidate reversed iterator which point to a completely different object. This is what happens in trim_rows for reversed queries. Erasing a row can invalidate end iterator and the loop would fail to stop. The solution is to introduce reversal_traits::erase_dispose_and_update_end() funcion which erases and disposes object pointed to by a given iterator but takes also a reference to and end iterator and updates it if necessary to make sure that it stays valid. Fixes #1609. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com> Message-Id: <1472080609-11642-1-git-send-email-pdziepak@scylladb.com>	2016-08-25 16:52:35 +03:00
Paweł Dziepak	5f84348ce1	test.py: add missing nonwrapping_range_test Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com> Message-Id: <1472126087-15484-1-git-send-email-pdziepak@scylladb.com>	2016-08-25 15:36:10 +03:00
Raphael S. Carvalho	d8be32d93a	api: use estimation of pending tasks in compaction manager too We have API for getting pending compaction tasks both in column family and compaction manager. Column family is already returning pending tasks properly. Compaction manager's one is used by 'nodetool compactionstats', and was returning a value which doesn't reflect pending compaction. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <a20b88938ad39e95f98bfd7f93e4d1666d1c6f95.1471641211.git.raphaelsc@scylladb.com>	2016-08-24 14:00:23 +03:00
Takuya ASADA	b9e02dad2e	dist/ami: install scylla metapackage on AMI Mistakenly we didn't install scylla metapackage on AMI, so install it. Fixes #1572 Message-Id: <1471977742-21984-1-git-send-email-syuu@scylladb.com>	2016-08-24 12:55:01 +03:00
Vlad Zolotarov	8609900621	tracing: introduce trace_state capabilities bit field - Instead of keeping separate booleans introduce a trace_state_props_set enum_set and pass it around instead of separate booleans. - Change the trace_info to hold this value in addition to write_on_close. Initialize a corresponding bit in an enum_set based on a write_on_close value in a trace_info constructor for a backward compatibility. - Separate a trace_state constructor into two: - For a primary session object. - For a secondary session object. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-23 18:34:36 +03:00
Amnon Heiman	2b98335da4	housekeeping: Silently ignore check version if Scylla is not available Normally, the check version should start and stop with the scylla-server service. If it fails to find scylla server, there is no need to check the version, nor to report it, so it can stop silently. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-08-23 18:08:59 +03:00
Amnon Heiman	4598674673	housekeeping: Use curl instead of Python's libraries There is a problem with Python SSL's in Ubuntu 14.04: ubuntu@ip-10-81-165-156:~$ /usr/lib/scylla/scylla-housekeeping -q version Traceback (most recent call last): File "/usr/lib/scylla/scylla-housekeeping", line 94, in <module> args.func(args) File "/usr/lib/scylla/scylla-housekeeping", line 71, in check_version latest_version = get_json_from_url(version_url + "?version=" + current_version)["version"] File "/usr/lib/scylla/scylla-housekeeping", line 50, in get_json_from_url response = urllib2.urlopen(req) File "/usr/lib/python2.7/urllib2.py", line 127, in urlopen return _opener.open(url, data, timeout) File "/usr/lib/python2.7/urllib2.py", line 404, in open response = self._open(req, data) File "/usr/lib/python2.7/urllib2.py", line 422, in _open '_open', req) File "/usr/lib/python2.7/urllib2.py", line 382, in _call_chain result = func(*args) File "/usr/lib/python2.7/urllib2.py", line 1222, in https_open return self.do_open(httplib.HTTPSConnection, req) File "/usr/lib/python2.7/urllib2.py", line 1184, in do_open raise URLError(err) urllib2.URLError: <urlopen error [Errno 1] _ssl.c:510: error:14077410:SSL routines:SSL23_GET_SERVER_HELLO:sslv3 alert handshake failure> Instead of using Python libraries to connect to the check version server, we will use curl for that. Fixes #1600 Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-08-23 18:07:05 +03:00
Amnon Heiman	91944b736e	housekeeping: Add curl as a dependency To work around an SSL problem with Python on Ubuntu 14.04, we need to use curl. Add it as a dependency so that it's available on the host. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-08-23 18:06:13 +03:00
Vlad Zolotarov	c8cf2ef82c	tracing::trace_state: introduce is_in_state() and set_state() accessors Use these new methods to manipulate trace_state::_state value. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-23 17:58:42 +03:00
Vlad Zolotarov	39b23cd084	tracing::trace_state: rename: get_write_on_close() -> write_on_close() Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-23 17:58:42 +03:00
Vlad Zolotarov	09624f704f	tracing::trace_state: rename: get_type() -> type() Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-23 17:58:42 +03:00
Vlad Zolotarov	b40a819d1e	tracing::trace_state: rename: get_session_id() -> session_id() Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-23 17:58:42 +03:00
Vlad Zolotarov	ed21398ce9	trace_keyspace_helper: create a system_traces.node_slow_log table This table is going to be used to store information about queries which are slower than a specified threshold. Also added a column caching and mutation creation functions Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-23 17:58:42 +03:00
Vlad Zolotarov	efeb62e72f	tracing: trace_keyspace_helper: introduce a check_column_definition() helper function Checks if a given column definition exists and has a requested type. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-23 17:58:42 +03:00
Vlad Zolotarov	6c3e1935b0	tracing::session_record: change a type of a "ttl" field to be std::chrono::seconds TTL is always defined in seconds - make its type explicitly reflect that. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-23 17:58:42 +03:00
Vlad Zolotarov	e017533229	tracing: set a username session parameter Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-23 17:58:42 +03:00
Vlad Zolotarov	93c2502be4	tracing: set a table_name in a BATCH query Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-23 17:58:42 +03:00
Vlad Zolotarov	25be28bb3c	tracing: set a table_name parameter in a SELECT statement Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-23 17:58:42 +03:00
Vlad Zolotarov	abae3b05e7	tracing: set table_name in a modification statement Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-23 17:58:42 +03:00
Vlad Zolotarov	372da7e71b	tracing: add support for setting a username and a table name parameters - "username" is a name used in the authentication process. - "table name" is a <keyspace>.<cf name> string representing a name of a table used for a query in question. Note that there may be more than one table name in a batch query. Therefore we store an unordered set of tables names. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-23 17:58:42 +03:00
Vlad Zolotarov	eaf5db66a8	tracing::session_record: store "parameters" data in an std::map instead of in an unordered_map Avoid sorting (and creating a new one) container at a backend code when a sorted container is needed. The overhead for the backends where it's not needed is minimal since the size of the map is very small. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-23 17:58:10 +03:00
Takuya ASADA	80f7449095	dist/ubuntu: support scylla-housekeeping service on all Ubuntu versions Current scylla-housekeeping support on Ubuntu has bug, it does not installs .service/.timer for Ubuntu 16.04. So fix it to make it work. Fixes #1502 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Tested-by: Amos Kong <amos@scylladb.com> Message-Id: <1471607903-14889-1-git-send-email-syuu@scylladb.com>	2016-08-23 13:49:44 +03:00
Takuya ASADA	aac60082ae	dist/common/systemd: don't use .in for scylla-housekeeping.*, since these are not template file .in is the name for template files witch requires to rewrite on building time, but these systemd unit files does not require rewrite, so don't name .in, reference directly from .spec. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1471607533-3821-1-git-send-email-syuu@scylladb.com>	2016-08-23 13:49:09 +03:00
Duarte Nunes	440c1b2189	thrift: Avoid always recording size estimates Size estimates for a particular column family are recorded every 5 minutes. However, when a user calls the describe_splits(_ex) verbs, they may want to see estimates for a recently created and updated column family; this is legitimate and common in testing. However, a client may also call describe_splits(_ex) very frequently and recording the estimates on every call is wasteful and, worse, can cause clients to give up. This patch fixes this by only recording estimates if the first attempt to query them produces no results. Refs #1139 Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1471900595-4715-1-git-send-email-duarte@scylladb.com>	2016-08-23 13:08:25 +03:00
Takuya ASADA	383148de13	dist/common/scripts/scylla_bootparam_setup: fix failing setup hugepages= variable on boot parameter This is caused because mistakenly dropped sourcing sysconfig file, so source it again. Fixes #1599 Message-Id: <1471943742-19684-1-git-send-email-syuu@scylladb.com>	2016-08-23 12:41:39 +03:00
Takuya ASADA	1ad578ecf1	dist/common/scripts/scylla_bootparam_setup: use distribution standard grub.cfg update command on Ubuntu Result is almost same, but let's do it in ubuntu/debian flavor. Message-Id: <1471943898-24490-1-git-send-email-syuu@scylladb.com>	2016-08-23 12:41:34 +03:00
Paweł Dziepak	5feed84e32	sstables: do not call consume_end_partition() after proceed::no After state_processor().process_state() returns proceed::no the upper layer should have a chance to act before more data is pushed to the consumer. This means that in case of proceed::no verify_end_state() should not be called immediately since it may invoke consume_end_partition(). Fixes #1605. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com> Message-Id: <1471943032-7290-1-git-send-email-pdziepak@scylladb.com>	2016-08-23 12:24:39 +03:00
Pekka Enberg	9d1d8baf37	dist/docker: Separate supervisord config files Move scylla-server and scylla-jmx supervisord config files to separate files and make the main supervisord.conf scan /etc/supervisord.conf.d/ directory. This makes it easier for people to extend the Docker image and add their own services. Message-Id: <1471588406-25444-1-git-send-email-penberg@scylladb.com>	2016-08-22 17:20:23 +03:00
Avi Kivity	55f2cf1626	thrift: do not generate wrapping ring_position ranges As part of the move to unwrap ranges, don't generate wrapping ranges from thrift. A little extra motivation is to avoid the need for the solution to #1573 to be able to handle wrapping ranges. This patch may also be fixing a bug in that the range (token, token] was previously translated as (-inf, +inf), while now it is translated as {(token, +inf), (-inf, token]}; the new translation respects ordering better. Reviewed-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1471869587-12972-1-git-send-email-avi@scylladb.com>	2016-08-22 14:43:45 +01:00
Avi Kivity	46caff5b06	Merge "Store frozen_mutation in fragmented buffer" from Paweł "This series switches frozen_mutations and to use bytes_ostream internally so that the size of a single allocation is bounded. Deserializers are also enhanced so that they can cope with reading from fragmented buffers. The goal of the change is to reduce memory pressure in case of large partitions. Performance as measured by perf_simple_query (median of 30). before after diff read 705270.74 702906.35 -0.3% write 814504.81 836462.33 +2.7% Refs #1440. Refs #1545. Fixes #1546."	2016-08-22 13:01:34 +03:00
Paweł Dziepak	1315090bf0	query-result: no need to linearize buffer any more Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-08-22 09:31:33 +01:00
Paweł Dziepak	3fe5ed3cd9	query: use result_view::consume() where appropriate Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-08-22 09:31:33 +01:00
Paweł Dziepak	cb2a557cf7	query::result: reduce chunk count Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-08-22 09:31:33 +01:00
Paweł Dziepak	ea3ac0a270	frozen_mutation: reduce chunk count in constructor Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-08-22 09:31:33 +01:00
Paweł Dziepak	1daf4c73a3	frozen_mutation: avoid buffer linearization and copy Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-08-22 09:31:33 +01:00
Paweł Dziepak	3707d7fec3	frozen_mutation: use bytes_ostream internally Unlike bytes, bytes_ostream supports fragmented buffers, thus reducing the pressure on the memory allocator caused by large frozen partitions. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-08-22 09:31:33 +01:00
Paweł Dziepak	c0425b63ff	frozen_mutation: add mutation_view() Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-08-22 09:31:33 +01:00
Paweł Dziepak	89f7b46f61	idl: switch to utils::input_stream Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-08-22 09:31:33 +01:00

1 2 3 4 5 ...

10326 Commits