scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-25 11:00:35 +00:00

Author	SHA1	Message	Date
Pekka Enberg	fcaa743e3d	cql3: TINYINT and SMALLINT data type support This adds support for the TINYINT and SMALLINT data types introduced in CQL 3.3.1. Refs #1284	2017-01-05 10:57:35 +02:00
Pekka Enberg	257fa541f1	types: Fix integer_type_impl::parse_int() for bytes The integer_type_impl::parse_int() function uses boost::lexical_cast() under the hood, which parses 8-bit numbers as characters. Fix the function to lexical cast to 64-bit integer and convert the result to integer_type_impl template type.	2017-01-05 10:57:35 +02:00
Nadav Har'El	45f19f2633	main: better error message on failing to start Prometheus Previously, if the Prometheus port (by default, 0.0.0.0:9180) could not be opened, the following message appeared in the log about 10 seconds into the run, and Scylla crashed. ERROR 2017-01-01 19:31:04,066 [shard 0] seastar - Exiting on unhandled exception: std::system_error (error system:98, Address already in use) The puzzled user would have no idea which address was already in use, why, or why Scylla stopped. In this patch, before the above message we get the much more informative message: ERROR 2017-01-01 19:58:19,080 [shard 0] init - Could not start Prometheus API server on 0.0.0.0:9180: std::system_error (error system:98, Address already in use) We continue to print the original message - and exit - in this case, under the assumption that it's better not to run the database while improperly configured. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20170102121304.2060-1-nyh@scylladb.com>	2017-01-04 14:58:26 +02:00
Tzach Livyatan	0c746b22e0	Fix a typo in scylla_setup housekeeping prompt Signed-off-by: Tzach Livyatan <tzach@scylladb.com> Message-Id: <1483362474-22113-1-git-send-email-tzach@scylladb.com>	2017-01-04 14:54:22 +02:00
Takuya ASADA	43655512e1	dist/redhat: add python-setuptools on dependency since it requires for scylla-housekeeping scylla-housekeeping breaks when python-setuptools doesn't installed, so add it on dependency. Fixes #1884 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1483525828-7507-1-git-send-email-syuu@scylladb.com>	2017-01-04 14:32:10 +02:00
Pekka Enberg	060841b756	tests/types_test: Fix int32 type string conversion boundary case The test case is interested in the upper boundary of 32-bit integer because we already test the lower boundary in assertions below. The old test passed, of course, but it wasn't very interesting. Message-Id: <1483522773-6008-1-git-send-email-penberg@scylladb.com>	2017-01-04 11:57:02 +01:00
Avi Kivity	3232d47d4f	dist: remove another bc dependency No longer used.	2017-01-01 11:13:34 +02:00
Tzach Livyatan	2bfa7cc086	dist/common/scripts: improve scylla_setup wording Fix a few minor typos and improve the user prompt text Signed-off-by: Tzach Livyatan <tzach@scylladb.com> Message-Id: <1482918340-19375-1-git-send-email-tzach@scylladb.com>	2016-12-30 13:18:08 +02:00
Tzach Livyatan	436ce7ae49	conf/scylla.yaml: Move broadcast_rpc_address to the supported section Fixes #1779 Signed-off-by: Tzach Livyatan <tzach@scylladb.com> Message-Id: <1483021417-8415-1-git-send-email-tzach@scylladb.com>	2016-12-29 16:24:56 +02:00
Takuya ASADA	e48cc9cf01	dist/ubuntu: check lsb_release existance since it's not included minimal Debian installation Ubuntu has it in minimal installation but Debian doesn't, so add it. Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1483003565-2753-1-git-send-email-syuu@scylladb.com>	2016-12-29 11:33:21 +02:00
Pekka Enberg	a443dfa95e	tracing: Add seastar/core/scollectd.hh include Fix the following build breakage: FAILED: build/release/gen/cql3/CqlParser.o g++ -MMD -MT build/release/gen/cql3/CqlParser.o -MF build/release/gen/cql3/CqlParser.o.d -std=gnu++1y -g -Wall -Werror -fvisibility=hidden -pthread -I/home/penberg/scylla/seastar -I/home/penberg/scylla/seastar/fmt -I/home/penberg/scylla/seastar/build/release/gen -march=nehalem -Ifmt -DBOOST_TEST_DYN_LINK -Wno-overloaded-virtual -DFMT_HEADER_ONLY -DHAVE_HWLOC -DHAVE_NUMA -DHAVE_LZ4_COMPRESS_DEFAULT -O2 -DBOOST_TEST_DYN_LINK -Wno-maybe-uninitialized -DHAVE_LIBSYSTEMD=1 -I. -I build/release/gen -I seastar -I seastar/build/release/gen -c -o build/release/gen/cql3/CqlParser.o build/release/gen/cql3/CqlParser.cpp In file included from ./query-request.hh:31:0, from ./locator/token_metadata.hh:51, from ./locator/abstract_replication_strategy.hh:29, from ./database.hh:26, from ./service/storage_proxy.hh:44, from ./db/schema_tables.hh:43, from ./db/system_keyspace.hh:46, from ./cql3/functions/function_name.hh:45, from ./cql3/selection/selectable.hh:48, from ./cql3/selection/writetime_or_ttl.hh:45, from build/release/gen/cql3/CqlParser.hpp:63, from build/release/gen/cql3/CqlParser.cpp:44: ./tracing/tracing.hh:357:5: error: ‘scollectd’ does not name a type scollectd::registrations _registrations; ^~~~~~~~~ Message-Id: <1482939751-8756-1-git-send-email-penberg@scylladb.com>	2016-12-28 18:40:18 +02:00
Nadav Har'El	d49aa7abd2	storage_service: make is_joined() an immediate function Commit `d41cd48a` made the is_joined() method a future<bool> because only cpu 0 knows its real value. This makes this function inconvenient to use. So this patch reverts commit `d41cd48a`, and instead sets this flag's value on all shards, so each shard can read its value locally (and immediately). Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20161228160450.5831-1-nyh@scylladb.com>	2016-12-28 18:37:22 +02:00
Pekka Enberg	2aee7f6334	Merge seastar upstream * seastar f32e4c2...1c8e389 (2): > Merge "migrate network related seastar collectd metrics to the new metrics registration API" from Vlad > file: add dup() support	2016-12-28 17:04:11 +02:00
Duarte Nunes	1444a52fae	position_in_partition: Add tri_comparator Will be needed to order view updates with the existing mutations. Signed-off-by: Duarte Nunes <duarte@scylladb.com> [pdziepak: corrected component name in commit message] Message-Id: <1482880989-3086-2-git-send-email-duarte@scylladb.com>	2016-12-28 13:04:16 +01:00
Duarte Nunes	c6b0387f31	clustering_bounds_comparator: Add tri_comparator This patch adds a tri_comparator for bound_view, which will be used by to add a tri comparator to position_in_partition. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1482880989-3086-1-git-send-email-duarte@scylladb.com>	2016-12-28 13:02:57 +01:00
Duarte Nunes	adb727f7dc	clustering_row: Add apply() overload This patch adds an overload to the apply() function, which takes a clustering_row by reference, to copy. This will be needed by future patches, when merging base table updates with the existing data. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1482881106-3202-1-git-send-email-duarte@scylladb.com>	2016-12-28 12:45:12 +01:00
Pekka Enberg	302035577e	cql3/statements: Make batch_statement::_type private The _type member variable is never accessed outside of the batch_statement class so make it private. Message-Id: <1482921073-28485-1-git-send-email-penberg@scylladb.com>	2016-12-28 12:08:05 +01:00
Pekka Enberg	20daf43403	cql3/statements: Move batch_statement implementation to source file Clean up batch_statement class by moving implementation to the batch_statement.cc source file to make it easier to modify the class. Message-Id: <1482920872-28303-1-git-send-email-penberg@scylladb.com>	2016-12-28 12:30:03 +02:00
Duarte Nunes	86a109915d	streamed_mutations: Update comments This patch removes references to the old begin_range_tombstone and end_range_tombstone mutation_fragments, which have been replaced by a single range_tombstone fragment. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1482880820-2831-1-git-send-email-duarte@scylladb.com>	2016-12-28 09:06:49 +01:00
Gleb Natapov	4ca58959ad	storage_proxy: do not deref unengaged stdx:optional Fixes intentional short reads. Message-Id: <20161227142133.GE1829@scylladb.com>	2016-12-27 16:30:03 +02:00
Vlad Zolotarov	9606db2f08	api::set_tracing_probability: prevent a server from returning 500 for a bad probability value - Change an exception type thrown by a tracing::tracing::set_trace_probability() to make it different from the one thrown by an std::stod() when it fails to parse a given string. - Catch the std::out_of_range exception thrown by a tracing::tracing::set_trace_probability() and wrap the exception string into the httpd::bad_param_exception() object. - Throw a httpd::bad_param_exception() with a "Bad format in a probability value: <a user given probability string value>" message if std::invalid_argument is caught. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com> Message-Id: <1465300738-1557-1-git-send-email-vladz@cloudius-systems.com>	2016-12-27 12:07:09 +02:00
Avi Kivity	339cc0c2fa	main: verify sufficient memory per shard Refuse to boot if we don't have at least 1 GiB per shard, unless in developer mode. The primary violator here is docker, but since it starts in developer mode, it won't get fixed. We need some extra logic for this case. Message-Id: <20161221090222.28677-1-avi@scylladb.com>	2016-12-27 12:05:52 +02:00
Avi Kivity	868b4d110c	Merge "Fixes for intentional short reads" from Paweł "This patchset contains fixes for the changes introduced in "Query result size limiting". It also improves handling of short data reads. I order to minimise chances of digest mismatch during data queries replicas that were asked just to return a digest also keep track of the size of the data (in the IDL representation) so that they would stop at the same point nodes doing full data queries would. Moreover, data queries are not affected by per-shard memory limit and the coordinator sends individual result size limits to replicas in order not to depend on hardcoded values. It is still possible to get digest mismatches if the IDL changes (e.g. a new field is added), but, hopefully, that won't be a serious problem." * 'pdziepak/short-read-fixes/v4' of github.com:cloudius-systems/seastar-dev: query: introduce result_memory_accounter::foreign_state storage_proxy: fix short reads in parallel range queries storage_proxy: pass maximum result size to replicas mutation_partition: use result limiter for digest reads query: make result_memory_limiter constants available for linker result_memory_limiter: add accounter for digest reads idl: allow writers to use any output stream result_memory_limiter: split new_read() to new_{data, mutation}_read() idl: is_short_read() was added in 1.6 mutation_partition: honour allowed_short_read for static rows storage_proxy: fix _is_short_read computation storage_proxy: disallow short reads if got no live rows storage_proxy: don't stop after result with no live rows	2016-12-26 10:42:49 +02:00
Avi Kivity	1d9ee358f1	Revert "Merge "Reduce the size of mutation_partition" from Piotr" This reverts commit `aa392810ff`, reversing changes made to a24ff47c637e6a5fd158099b8a65f1191fc2d023; it uses boost::intrusive::detail directly, which it must not, and doesn't compile on all boost versions as a consequence.	2016-12-25 16:07:48 +02:00
Avi Kivity	59d389bd46	Merge seastar upstream * seastar 0b98024...f32e4c2 (11): > Merge "Moving the reactor counters to the metric layer" from Amnon > metrics: Metrics function should take variable as a refernce > Revert "Merge ""Moving the reactor counters to the metric layer from Amnon" > Merge ""Moving the reactor counters to the metric layer from Amnon > Revert "fstream: Auto-close data_sink and data_source" > rpc: Avoid resource unit leaks on failure > fstream: Auto-close data_sink and data_source > http: Move metrics registration to the metrics layer > output_stream: add batching to zero copy interface > Revert "slab: Move the metrics registration to the metrics layer" > slab: Move the metrics registration to the metrics layer	2016-12-25 15:50:09 +02:00
Amnon Heiman	70b2a1bfd4	Set the prometheus prefix to scylla This patch make the prometheus prefix configurable and set the default value to scylla. Fixes #1964 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <1482671970-21487-1-git-send-email-amnon@scylladb.com>	2016-12-25 15:21:53 +02:00
Avi Kivity	b99a0fc076	licenses: clarify that licenses in this directory do not cover entire work	2016-12-25 12:59:38 +02:00
Avi Kivity	aa392810ff	Merge "Reduce the size of mutation_partition" from Piotr "Reduce the size of mutation_partition by implementing intrusive set using bi::rbtree_algorithms directly and using tree nodes optimized for size. This will reduce the size of mutation_partition by: 24 bytes + <number of cql rows> * 8 bytes This should have a positive impact on performance because mutation_partitions are stored both in memtable and cache. Fixes #742." * 'haaawk/742' of github.com:cloudius-systems/seastar-dev: intrusive_set: rename size() to calculate_size() Make intrusive_set_external_comparator::_value_traits static Implement intrusive set using rbtree_algorithms mutation_partition: make apply_reversibly_intrusive_set nongeneric mutation_partition: take schema in find_row and clustered_row mutation_partition: Extract intrusive set logic to a class. mutation_partition: Replace value_comp with key_comp calls	2016-12-25 12:56:10 +02:00
Benoît Canet	a24ff47c63	scylla_setup: Use blkid or ls to list potentials block devices blkid does not list root raw device. Revert to lsblk while taking care of having a fallback path in case the -p option is not supported. Fixes #1963. Suggested-by: Avi Kivity <avi@scylladb.com> Signed-off-by: Benoît Canet <benoit@scylladb.com> Message-Id: <20161225100204.13297-1-benoit@scylladb.com>	2016-12-25 12:03:40 +02:00
Takuya ASADA	f3e45bc9ef	dist/redhat: don't try to adduser when user is already exists Currently we get "failed adding user 'scylla'" on .rpm installation when user is already exists, we can skip it to prevent error. Fixes #1958 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1482550075-27939-1-git-send-email-syuu@scylladb.com>	2016-12-25 11:37:25 +02:00
Piotr Jastrzebski	345ed5b6ff	intrusive_set: rename size() to calculate_size() This hopefully will make it more apparent that the time complexity of this method is O(N) not O(1). Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2016-12-23 11:32:13 +01:00
Piotr Jastrzebski	151fa3aaf0	Make intrusive_set_external_comparator::_value_traits static _value_traits can be shared among all instances and there's no need to store it in every single one. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2016-12-23 11:32:13 +01:00
Piotr Jastrzebski	671affc36c	Implement intrusive set using rbtree_algorithms This new implementation takes less memory because it does not store comparator. It also uses tree nodes optimized for size. This means that instead of storing an enum field \|color\| they embed this information inside pointer to parent. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2016-12-23 11:32:13 +01:00
Piotr Jastrzebski	b0f712a4e8	mutation_partition: make apply_reversibly_intrusive_set nongeneric apply_reversibly_intrusive_set is used only in one place and always with rows_type. There's no need for it to be generic. This will allow changing intrusive set implementation. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2016-12-23 11:29:07 +01:00
Piotr Jastrzebski	2af6ff68d9	mutation_partition: take schema in find_row and clustered_row This will allow intrusive set implementation that does not store schema. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2016-12-23 11:29:07 +01:00
Piotr Jastrzebski	b3b924dec9	mutation_partition: Extract intrusive set logic to a class. It will make it easier to change the implementation of the intrusive set. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2016-12-23 11:29:07 +01:00
Piotr Jastrzebski	ac7481f4b2	mutation_partition: Replace value_comp with key_comp calls This will reduce the size of bi::set API being used. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2016-12-23 11:29:07 +01:00
Tomasz Grabiec	f2a63270d1	sstables: Fix double close on index and data files when writing fails file output streams take the responsibility of closing the file, they will close the file as part of closing the stream. During sstable writing we create sstable object and keep file references there as well. Sstable object also has responsibility for closing the files, and does so from sstable::~sstable(). Double close was supposed to be avoided by a construct like this: writer.close().get(); _file = {}; However if close() failed, which can happen when write-ahead failed, _file would not be cleared, and both the writer and sstable would close the file. This will result in a crash in append_challenged_posix_file_impl::close(), which is not prepared to be closed twice. Another problem is that if exception happened before we reached that construct, we still should close the writer. Currently we don't, so there's no double close on the file, but that's a bug which needs to be fixed and once that's fixed double close on _file will be even more likely. The fix employed here is to not keep files inside sstable object when writing. As soon as the writer is constructed, it's the only owner of the file. Fixes #1764. Message-Id: <1482428648-22553-1-git-send-email-tgrabiec@scylladb.com>	2016-12-23 11:44:43 +02:00
Raphael S. Carvalho	fd80499b3d	database: make column_family::add_sstable() private again Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <38226308bee2970a91b0e35370d6a646b85ecfe9.1482459877.git.raphaelsc@scylladb.com>	2016-12-23 11:42:16 +02:00
Paweł Dziepak	e6d27ac529	query: introduce result_memory_accounter::foreign_state Range queries used to be performed sequentially and the shard performing part of the read was reading state of the merger's memory accounter directly. Now, they may be performed in parallel so it is safer to just pass relevant data by value to the intersted shards so that they are not reading something that another shard is modyfing at the same time. Since query is done in parallel there is a chance of overread. However, the parallelism is high only in sparsely populated tables and that's when the overread is less serious problem.	2016-12-22 17:16:24 +01:00
Paweł Dziepak	49d675223e	storage_proxy: fix short reads in parallel range queries Since `a1cafed370` "storage_proxy: handle range scans of sparsely populated tables" nonsingular range queries may be performed in parallel on multiple shards. The consequence of this that result may be added to the merger out of order. This requires more complex logic for handling short reads. As soon as mutation_result_merger gets a short read it starts to discard all subsequently received results that are known to contain partitions with larger keys. Then when the final result is being prepared the merger may need to combine and sorts results which ordering is not known. If at least one of these results is a short one all partitions with larger keys are removed. Due to request being performed in parallel it is possible that even though there was a short read the merger has got enough live data to satisfy specified limits. If this has happened the short read flag is not set on the final result.	2016-12-22 17:16:24 +01:00
Paweł Dziepak	1a52569f7d	storage_proxy: pass maximum result size to replicas We may want to change the default individual result size limit in the future. If it is provided by the coordinator and not hardcoded in the replicas this can be done without causing data query digest mismatches or wasteful mutation query results.	2016-12-22 17:16:23 +01:00
Paweł Dziepak	40176ca2f8	mutation_partition: use result limiter for digest reads Even if we are performing a digest query we should do proper result memory accounting so that the result ends exactly in the same place that it would if it was a data query. This is to avoid digest mismatches between replicas.	2016-12-22 17:16:23 +01:00
Avi Kivity	8686a59ea5	dht: use nonwrapping_ranges in ring_position_range_sharder It was the observation that ring_position_range_sharder doesn't support wrapping ranges that started the nonwrapping_range madness, but that class still has some leftover wrapping ranges. Close the circle by removing them. Message-Id: <20161123153113.8944-1-avi@scylladb.com>	2016-12-22 14:40:30 +01:00
Takuya ASADA	7c3b98806d	dist/common/scripts/scylla_setup: improve the message of disk selection prompt Not to confuse users, describe we only list up unmounted disks. Fixes #1841 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Message-Id: <1479720708-6021-1-git-send-email-syuu@scylladb.com>	2016-12-22 15:36:46 +02:00
Paweł Dziepak	a7d694654a	query: make result_memory_limiter constants available for linker	2016-12-22 13:35:04 +01:00
Paweł Dziepak	a0523df8d6	result_memory_limiter: add accounter for digest reads Digest reads differ from data reads in a way that they do not really consume any memory. We still want them to stop in the same place that data reads would, but the per-shard semaphore shouldn't be updated by them.	2016-12-22 13:35:04 +01:00
Paweł Dziepak	38ee69dee0	idl: allow writers to use any output stream Original IDL generated code was hardcoded to always use bytes_ostream. This patch makes the output stream a template parameter so that any valid output stream can be used. Unfortunately, making IDL writers generic requires updates in the code that uses them, this is fixed in C++17 which would be able to deduce the parameter in most cases.	2016-12-22 13:35:04 +01:00
Paweł Dziepak	aa083d3d85	result_memory_limiter: split new_read() to new_{data, mutation}_read() For data queries it is very important that all replicas get limited in the same place (this includes replicas returning only digest). That's why they shouldn't be affected by per-shard result memory limit. Moreover, we should make sure that individual memory limits are the same, making the coordinator provide it for replicas which allow to safely change it in the future. Mutation queries are not as sensitive but it is still beneficial to make sure that all replicas use the same individual limit.	2016-12-22 13:35:04 +01:00
Paweł Dziepak	b8e29cc99c	idl: is_short_read() was added in 1.6	2016-12-22 13:35:04 +01:00

1 2 3 4 5 ...

11078 Commits