By default we were compressing debug info only in release
executables. The idea, if I understand it correctly, is that those are
the ones we ship, so we want a more compact binary.
I don't think that was doing anything useful. The compression is just
gzip, so when we ship a .tar.xz, having the debug info pre-compressed
inside the scylla binary probably hurts the overall compression a bit,
since already-compressed data compresses poorly under xz.
When building an rpm, the situation is amusing: as part of the rpm
build process, the debug info is decompressed and extracted to an
external file.
Given that most of the link time goes to compressing debug info, it is
probably a good idea to just skip that.
Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>
Message-Id: <20191123022825.102837-1-espindola@scylladb.com>
Use pkg-config to search for Lua dependencies rather
than hard-coding include and link paths.
Avoid using Boost internals that are not present in earlier
versions of Boost.
Reviewed-by: Rafael Avila de Espindola <espindola@scylladb.com>
Message-Id: <20191120170005.49649-1-kostja@scylladb.com>
Use the `-Wl,--threads` flag to enable multi-threaded linking when
using the `ld.gold` linker.
An additional compile-time test is required because support depends on
whether the `gold` linker was built with the `--enable-threads` option.
This patch introduces a substantial improvement to the link times of
the `scylla` binary in release and debug modes (around 30 percent).
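A configure-time probe of this kind can be sketched as follows (hypothetical helper name; the real check lives in Scylla's configure.py): try to compile and link a trivial program with the candidate flag, and only use the flag if the toolchain accepts it.

```python
import os
import subprocess
import tempfile

def linker_flag_works(compiler, flag):
    """Try to compile and link an empty program with the given flag.

    Returns True only if the toolchain accepts the flag; a gold linker
    built without --enable-threads would reject -Wl,--threads here.
    """
    with tempfile.TemporaryDirectory() as tmp:
        src = os.path.join(tmp, "probe.c")
        with open(src, "w") as f:
            f.write("int main(void) { return 0; }\n")
        result = subprocess.run(
            [compiler, flag, src, "-o", os.path.join(tmp, "probe")],
            stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL)
        return result.returncode == 0

# Add the flag only when the probe succeeds, e.g.:
# link_flags = ["-Wl,--threads"] if linker_flag_works("gcc", "-Wl,--threads") else []
```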
Local setup reports the following numbers with release build for
linking only build/release/scylla:
Single-threaded mode:
Elapsed (wall clock) time (h:mm:ss or m:ss): 1:09.30
Multi-threaded mode:
Elapsed (wall clock) time (h:mm:ss or m:ss): 0:51.57
Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>
Message-Id: <20191120163922.21462-1-pa.solodovnikov@scylladb.com>
This document provides the detailed design and implementation of the
Redis API in Scylla.
v2: build: work around ragel 7 generated code bug (suggested by Avi)
Ragel 7 incorrectly emits some unused variables that break compilation.
As a workaround, sed them away.
Signed-off-by: Peng Jian <pengjian.uestc@gmail.com>
Signed-off-by: Amos Kong <amos@scylladb.com>
"
This patch series adds only UDF support; UDA will be in the next patch series.
With this, all CQL types are mapped to Lua. Right now we set up a new
Lua state and copy the values for each argument and return value. This
will be optimized once profiled.
We require --experimental to enable UDF in case there is some change
to the table format.
"
* 'espindola/udf-only-v4' of https://github.com/espindola/scylla: (65 commits)
Lua: Document the conversions between Lua and CQL
Lua: Implement decimal subtraction
Lua: Implement decimal addition
Lua: Implement support for returning decimal
Lua: Implement decimal to string conversion
Lua: Implement decimal to floating point conversion
Lua: Implement support for decimal arguments
Lua: Implement support for returning varint
Lua: Implement support for returning duration
Lua: Implement support for duration arguments
Lua: Implement support for returning inet
Lua: Implement support for inet arguments
Lua: Implement support for returning time
Lua: Implement support for time arguments
Lua: Implement support for returning timeuuid
Lua: Implement support for returning uuid
Lua: Implement support for uuid and timeuuid arguments
Lua: Implement support for returning date
Lua: Implement support for date arguments
Lua: Implement support for returning timestamp
...
Add mode_list rule to ninja build and use it by default when searching
for tests in test.py.
Now it is no longer necessary to explicitly specify the test mode when
invoking test.py.
(cherry picked from commit a211ff30c7f2de12166d8f6f10d259207b462d4b)
With this it is possible to create user defined functions and
aggregates and they are saved to disk and the schema change is
propagated.
It is just not possible to call them yet.
Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>
Support single-statement conditional updates as well as batches.
This patch almost fully rewrites column_condition.cc, implementing
is_satisfied_by().
Most of the remaining complications in the column_condition implementation
come from the need to properly handle frozen and multi-cell
collections in predicates - until now it was not possible
to compare entire collection values with each other. This is further
complicated by the fact that multi-cell lists and sets are returned as maps.
We can no longer assume that the columns fetched by the prefetch operation
are non-frozen collections. An IF EXISTS/IF NOT EXISTS condition
fetches all columns; besides, a column may be needed to check another
condition.
When fetching the old row for LWT or to apply updates on lists,
we now calculate precisely the list of columns to fetch.
The primary key columns are also included in CAS batch result set,
and are thus also prefetched (the user needs them to figure out which
statements failed to apply).
The patch is cross-checked for compatibility with cassandra-3.11.4-1545-g86812fa502,
but deviates from the original in its handling of conditions on static
row cells. This will be addressed in a future series.
This patch adds all functionality needed for the Paxos protocol. The
implementation does not strictly adhere to the Paxos paper, since the
original paper allows setting a value only once, while for LWT we need to
be able to run another Paxos round after the "learn" phase completes,
which requires things like repair to be introduced.
The Paxos protocol has three stages: prepare, accept, learn. This patch adds
an RPC verb for each of those stages. To stay terminologically compatible
with Cassandra, the patch calls those stages prepare, propose, and commit.
The Paxos protocol relies on replicas having state that persists across
crashes/restarts. This patch defines such state and stores it in the
database itself, in the paxos table, to make it persistent.
The stored state is:
in_progress_ballot - promised ballot
proposal - accepted value
proposal_ballot - the ballot of the accepted value
most_recent_commit - most recently learned value
most_recent_commit_at - the ballot of the most recently learned value
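The role this persistent state plays in the prepare stage can be sketched as follows (a minimal, simplified model: ballots are integers here and values are opaque, whereas Scylla uses timeuuid ballots and mutations; names mirror the columns listed above):

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class PaxosState:
    # Mirrors the paxos-table columns described above.
    in_progress_ballot: int = 0            # promised ballot
    proposal: Optional[str] = None         # accepted value
    proposal_ballot: Optional[int] = None  # ballot of the accepted value
    most_recent_commit: Optional[str] = None
    most_recent_commit_at: Optional[int] = None

def handle_prepare(state, ballot):
    """Prepare stage: promise the ballot only if it is newer than any
    promise made so far, and report any previously accepted value so the
    coordinator can complete ("repair") an in-flight round."""
    if ballot <= state.in_progress_ballot:
        return None  # reject: already promised an equal or higher ballot
    state.in_progress_ballot = ballot
    return (state.proposal_ballot, state.proposal,
            state.most_recent_commit_at, state.most_recent_commit)
```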
This patch adds two data structures that will be used by Paxos. The first
is "proposal", which contains a ballot and a mutation representing
the value the Paxos protocol is trying to set. The second is
"prepare_response", the value returned by the Paxos prepare stage.
It contains the currently accepted value (if any) and the most recently
learned value (again, if any). The latter is used to "repair" replicas
that missed a previous "learn" message.
"
Introduce the traced_file class which wraps a file, adding CQL trace messages before and after every operation that returns a future.
Use this file to trace reads from SSTable data and index files.
Fixes #4908.
"
* 'traced_file' of https://github.com/kbr-/scylla:
sstables: report sstable index file I/O in CQL tracing
sstables: report sstable data file I/O in CQL tracing
tracing: add traced_file class
The classes 'collection_mutation' and 'collection_mutation_view'
were moved to a separate header, collection_mutation.hh.
Implementations of functions that operate on these classes,
including some methods of collection_type_impl, were moved
to a separate compilation unit, collection_mutation.cc.
This makes it easier to modify these structures in future commits
in order to generalize them for non-frozen User Defined Types.
Some additional documentation has been written for collection_mutation.
At the moment, this test only checks that table
creation and alteration set the cdc_options property
on a table correctly.
Future patches will extend this test to cover more
CDC aspects.
Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>
A function for computing the auth signature from user requests
is added, along with helper functions. The implementation
is based on gnutls's HMAC.
Refs #5046
Migrating from libjsoncpp to rapidjson proved to be beneficial
for parsing performance. As a first step, a set of helper functions
is provided to ease the migration process.
Filtering is currently only implemented for the equality operator
on non-key attributes.
Next steps (TODO) involve:
1. Implementing filtering for key restrictions
2. Implementing non-key attribute filtering for operators other than EQ.
That, in turn, may involve introducing a 'map value restrictions' notion
to Scylla, since it currently only allows equality restrictions on map
values (alternator attributes are currently kept in a CQL map).
3. Implementing FilterExpression in addition to deprecated QueryFilter
The DynamoDB protocol is based on JSON, and most DynamoDB requests describe
the operation and its parameters via JSON objects such as maps and lists.
However, in some types of requests an "expression" is passed as a single
string, and we need to parse this string. These cases include:
1. Attribute paths, such as "a[3].b.c", are used in projection
expressions as well as inside other expressions described below.
2. Condition expressions, such as "(NOT (a=b OR c=d)) AND e=f",
used in conditional updates, filters, and other places.
3. Update expressions, such as "SET #a.b = :x, c = :y DELETE d"
This patch introduces the framework to parse these expressions, and
an implementation of parsing update expressions. These update expressions
will be used in the UpdateItem operation in the next patch.
All these expression syntaxes are very simple: Most of them could be
parsed as regular expressions, or at most a simple hand-written lexical
analyzer and recursive-descent parser. Nevertheless, we decided to specify
these parsers in the same ANTLR3 language already used in the Scylla
project for parsing CQL, hopefully making these parsers easier to reason
about, and easier to change if needed - and reducing the amount of
boilerplate code.
The parsing of update expressions is mostly complete, except that in SET
actions only the "path = value" form is supported, and not yet forms
such as "path1 = path2" (which does read-before-write),
"path1 = path1 + value", or "path = function(...)".
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Attributes used to be written into the database in raw JSON format,
which is far from optimal. This patch introduces more robust
serialization routines for simple alternator types: S, B, BOOL, N.
Serialization uses the first byte to encode attribute type
and follows with serializing data in binary form.
More complex types (sets, lists, etc.) are currently still
serialized in raw JSON and will be optimized in follow-up patches.
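The tag-byte scheme described above can be sketched like this (the tag values and the number encoding here are purely illustrative assumptions; Scylla's actual on-disk format may differ):

```python
# Hypothetical tag bytes; the values actually used by Scylla may differ.
TAG_S, TAG_B, TAG_BOOL, TAG_N = 0x01, 0x02, 0x03, 0x04

def serialize(attr_type, value):
    """First byte encodes the attribute type; the rest is the value in
    binary form."""
    if attr_type == "S":
        return bytes([TAG_S]) + value.encode("utf-8")
    if attr_type == "B":
        return bytes([TAG_B]) + value
    if attr_type == "BOOL":
        return bytes([TAG_BOOL, 1 if value else 0])
    if attr_type == "N":
        # Illustrative only: a real implementation needs a sortable
        # binary number encoding, not decimal text.
        return bytes([TAG_N]) + str(value).encode("ascii")
    raise ValueError("complex types are still serialized as raw JSON")
```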
Message-Id: <10955606455bbe9165affb8ac8fba4d9e7c3705f.1559646761.git.sarna@scylladb.com>
For some unknown reason we put the list of alternator source files
in configure.py inside the "api" list. Let's move it into a separate
list.
We could have just put it in the scylla_core list, but that would cause
frequent and annoying patch conflicts when people add alternator source
files and Scylla core source files concurrently.
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
The DynamoDB API uses base64 encoding to encode binary blobs as JSON
strings. So we need functions to do these conversions.
This code was "inspired" by https://github.com/ReneNyffenegger/cpp-base64
but doesn't actually copy code from it.
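The round-trip the commit needs (in C++ in Scylla) is the standard base64 one; Python's stdlib shows the shape:

```python
import base64

def blob_to_json_string(blob):
    # DynamoDB's B type: binary payloads travel as base64 text in JSON.
    return base64.b64encode(blob).decode("ascii")

def json_string_to_blob(s):
    return base64.b64decode(s)
```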
I didn't write any specific unit tests for this code, but it will be
exercised and tested in a following patch which tests Alternator's use
of these functions.
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
This patch adds a statistics framework to Alternator: Executor has (for
each shard) a _stats object which contains counters for various events,
and also is in charge of making these counters visible via Scylla's regular
metrics API (http://localhost:9180/metrics).
This patch includes a counter for each of DynamoDB's operation types,
and we increment the ones we support when they are handled. We also added
counters for total operations and unsupported operations (operation types
we don't yet handle). In the future we can easily add many more counters:
define the counter in stats.hh, export it in stats.cc, and increment it
where relevant in executor.cc (or server.cc).
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
The interface works on port 8000 by default and provides
the most basic alternator operations - it's an incomplete
set without validation, meant to allow testing as early as possible.
The verbs are:
* DEFINITIONS_UPDATE (push)
* MIGRATION_REQUEST (pull)
Support was added in a backward-compatible way. The push verb sends
both the old frozen mutation parameter and the new optional canonical
mutation parameter. It is expected that new nodes will use the latter,
while old nodes will fall back to the former. The pull verb has a new
optional `options` parameter, which for now contains a single flag:
`remote_supports_canonical_mutation_retval`. This flag, if set, means
that the remote node supports the new canonical mutation return value,
thus the old frozen mutations return value can be left empty.
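The pull-side negotiation can be sketched as follows (a simplified model with hypothetical names; `to_canonical`/`to_frozen` stand in for the real serialization paths):

```python
def make_pull_response(mutations, options):
    """If the requesting node advertised support for canonical mutations,
    send only those and leave the legacy frozen-mutation list empty;
    otherwise fall back to sending frozen mutations too."""
    supports = bool(options and options.get(
        "remote_supports_canonical_mutation_retval"))
    canonical = [to_canonical(m) for m in mutations]
    frozen = [] if supports else [to_frozen(m) for m in mutations]
    return {"frozen_mutations": frozen, "canonical_mutations": canonical}

def to_canonical(m):  # stand-in for canonical-mutation serialization
    return ("canonical", m)

def to_frozen(m):     # stand-in for frozen-mutation serialization
    return ("frozen", m)
```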
* 'cleanup_sstables' of https://github.com/asias/scylla:
sstables: Move leveled_compaction_strategy implementation to source file
sstables: Include dht/i_partitioner.hh for dht::partition_range
Currently, seastar is built in seastar/build/{mode}. This means we have two build
directories: build/{mode} and seastar/build/{mode}.
This patch changes that to have only a single build directory (build/{mode}). It
does that by calling Seastar's cmake directly instead of through Seastar's
./configure.py. However, to support dpdk, if that is enabled it calls cmake
through Seastar's ./cooking.sh (similar to what Seastar's ./configure.py does).
All ./configure.py flags are translated to cmake variables, in the same way that
Seastar does.
Contains fix from Rafael to pass the flags for the correct mode.
* seastar d199d27681...a1cf07858b (1):
> Merge 'Do not return a variadic future form server_socket::accept()' from Avi
Seastar's configure.py now has --api-level=1, to keep us on the old
variadic-future server_socket::accept() API.
This adds the option to compress sstables using the Zstandard algorithm
(https://facebook.github.io/zstd/).
To use, pass 'sstable_compression': 'org.apache.cassandra.io.compress.ZstdCompressor'
to the 'compression' argument when creating a table.
You can also specify a 'compression_level'. See Zstd documentation for the available
compression levels.
Resolves #2613.
Signed-off-by: Kamil Braun <kbraun@scylladb.com>
(At least) on Ubuntu 19 'thrift -version' prints the expected
string but its exit status is non-zero:
$ thrift -version
Thrift version 0.9.1
$ echo $?
1
We don't really care about the exit status, but rather about the printed
version string. If there is some problem with the command,
e.g. it's missing, the printed string won't be as expected
anyway - let's verify that explicitly by checking the format of the
returned string instead.
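The check described above amounts to: run the command, discard the exit status, and validate the printed string. A sketch (hypothetical helper names; the real check is in Scylla's configure script):

```python
import re
import subprocess

def parse_thrift_version(output):
    """Return the version from `thrift -version` output, or None if the
    output doesn't look like a version string at all."""
    m = re.match(r"Thrift version (\d+\.\d+(\.\d+)?)", output)
    return m.group(1) if m else None

def thrift_version(cmd="thrift"):
    # Deliberately ignore proc.returncode: on some platforms
    # (e.g. Ubuntu 19) `thrift -version` exits non-zero anyway.
    proc = subprocess.run([cmd, "-version"], capture_output=True, text=True)
    return parse_thrift_version(proc.stdout)
```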
Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>
Message-Id: <20190722211729.24225-1-vladz@scylladb.com>
Fixes a segfault when querying for an empty keyspace.
Also, fixes an infinite loop on smp > 1. Queries to
system.size_estimates table which are not single-partition queries
caused Scylla to go into an infinite loop inside
multishard_combining_reader::fill_buffer. This happened because
multishard_combining_reader assumes that shards return rows belonging
to separate partitions, which was not the case for
size_estimates_mutation_reader.
Fixes#4689.
configure.py currently takes some time to write build.ninja. If the user
interrupts (e.g., control-C) configure.py, it can leave behind a partial
or even empty build.ninja file. This is most frustrating when the user
didn't explicitly run "configure.py", but rather just ran "ninja" and
ninja decided to run configure.py, and after interrupting it the user
cannot run "ninja" again because build.ninja is gone. Another result of
losing build.ninja is that the user now needs to remember which parameters
to run "configure.py", because the old ones stored in build.ninja were lost.
The solution in this patch is simple: we write the new build.ninja contents
into a temporary file, not directly into build.ninja. Then, only when the
entire file has been successfully written, do we rename the temporary file
to its intended name - build.ninja.
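The write-then-rename pattern looks like this (a sketch; the helper name is illustrative, and it relies on rename within one filesystem being atomic on POSIX):

```python
import os
import tempfile

def write_atomically(path, contents):
    """Write to a temporary file in the same directory, then rename it
    over the target. An interrupted run leaves the old file intact,
    since readers see either the old file or the new one, never a
    partial write."""
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp = tempfile.mkstemp(dir=directory, prefix=".build.ninja.")
    try:
        with os.fdopen(fd, "w") as f:
            f.write(contents)
        os.replace(tmp, path)  # atomic rename on POSIX
    except BaseException:
        os.unlink(tmp)         # clean up the partial temporary file
        raise
```

The temporary file must live in the same directory as the target: rename is only atomic within a single filesystem.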
Fixes#4706
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Reviewed-by: Botond Dénes <bdenes@scylladb.com>
Message-Id: <20190715122129.16033-1-nyh@scylladb.com>
Move the implementation of size_estimates_mutation_reader
to a separate compilation unit to speed up compilation times
and increase readability.
Refactor tests to use seastar::thread.
The updateable_value and updateable_value_source classes allow broadcasting
configuration changes across the application. The updateable_value_source class
represents a value that can be updated, and updateable_value tracks its source
and reflects changes. A typical use replaces "uint64_t config_item" with
"updateable_value<uint64_t> config_item", and from now on changes to the source
will be reflected in config_item. For more complicated uses, which must run some
callback when configuration changes, you can also call
config_item.observe(callback) to be actively notified of changes.
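The source/value/observer shape described above can be sketched in a few lines (the real classes are C++ templates in Seastar/Scylla; this is only a model of the interface):

```python
class updateable_value_source:
    """Holds the authoritative value and notifies observers on change."""
    def __init__(self, value):
        self._value = value
        self._observers = []

    def set(self, value):
        self._value = value
        for callback in self._observers:
            callback(value)

    def get(self):
        return self._value

class updateable_value:
    """Tracks a source: reads always reflect the latest value, and
    observe() registers a callback for active notification."""
    def __init__(self, source):
        self._source = source

    def __call__(self):
        return self._source.get()

    def observe(self, callback):
        self._source._observers.append(callback)
```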
This writer implements the core logic of time-window based data
segregation. It splits the fragment stream provided by a reader, such
that each atom (cell) in the stream is written to a consumer based on
the time window its timestamp belongs to. The end result is that each
consumer only sees fragments whose atoms all have timestamps belonging
to the same time window.
When a mutation fragment has atoms belonging to different time-windows,
it is split into as many fragments as needed so each has only atoms
that belong to the same time-window.
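The bucketing step can be sketched as follows (a simplified model: cells are (name, timestamp) pairs and windows are fixed-size timestamp ranges; the real writer works on mutation fragments):

```python
from collections import defaultdict

def window_for(timestamp, window_size):
    # All timestamps in [k*window_size, (k+1)*window_size) share window k.
    return timestamp // window_size

def split_fragment(cells, window_size):
    """Split one fragment's cells into per-window fragments, so that
    every output fragment carries atoms from a single time window."""
    buckets = defaultdict(list)
    for name, timestamp in cells:
        buckets[window_for(timestamp, window_size)].append((name, timestamp))
    return dict(buckets)
```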
Currently there is a single mutation_writer: `multishard_writer`,
however in the next patch we are going to add another one. This is the
right moment to move these into a common namespace (and folder), we
have way too much stuff scattered already in the top-level namespace
(and folder).
Also rename `tests/multishard_writer_test.cc` to
`tests/mutation_writer_test.cc`; this test suite will be the home of all
the different mutation writers' unit test cases.
random_schema is a utility class that provides methods for generating
random schemas as well as generating data (mutations) for them. The aim
is to make using random schemas in tests as simple and convenient as
using `simple_schema`. For this reason the interface of `random_schema`
closely follows that of `simple_schema`, to the extent that it makes
sense. An important difference is that `random_schema` relies on
`data_model` to actually build mutations, so all its mutation-related
operations work with `data_model::mutation_description` instead of
actual `mutation` objects. Once the user has arrived at the desired
mutation description, they can generate an actual mutation via
`data_model::mutation_description::build()`.
In addition to the `random_schema` class, the `random_schema.hh` header
exposes the generic utility classes for generating types and values
that it internally uses.
random_schema is fully deterministic. Using the same seed and the same
set of operations is guaranteed to result in generating the same schema
and data.
Thrift 0.11 changed to generate c++ code with
std::shared_ptr instead of boost::shared_ptr.
- https://issues.apache.org/jira/browse/THRIFT-2221
This was forcing scylla to stick with older versions
of thrift.
Fixes issue #3097.
thrift: add type aliases to build with old and new versions.
update to using namespace =