scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-06 15:03:06 +00:00

Author	SHA1	Message	Date
Juliusz Stasiewicz	7fdc8563bf	system_keyspace: Added infrastructure for table `system.clients' I used the following as a reference: https://github.com/apache/cassandra/blob/trunk/src/java/org/apache/cassandra/db/virtual/ClientsTable.java At this moment there is only info about IP, clients outgoing port, client 'type' (i.e. CQL/thrift/alternator), shard ID and username. Column `request_count' is NOT present and CK consists of (`port', `client_type'), contrary to what C's has: (`port'). Code that notifies `system.clients` about new connections goes to top-level files `connection_notifier.`. Currently only CQL clients are observed, but enum `client_type` can be used in future to notify about connections with other protocols.	2019-12-17 11:31:28 +01:00
Benny Halevy	9ec98324ed	messaging_service: unregister_handler: return rpc unregister_handler future Now that seastar returns it. Fixes https://github.com/scylladb/scylla/issues/5228 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20191212143214.99328-1-bhalevy@scylladb.com>	2019-12-12 16:38:36 +02:00
Konstantin Osipov	bc482ee666	test.py: remove an unused option Message-Id: <20191204142622.89920-2-kostja@scylladb.com>	2019-12-12 15:53:35 +02:00
Avi Kivity	64cade15cc	Merge "bouncing lwt request to an owning shard" from Gleb " LWT is much more efficient if a request is processed on a shard that owns a token for the request. This is because otherwise the processing will bounce to an owning shard multiple times. The patch proposes a way to move request to correct shard before running lwt. It works by returning an error from lwt code if a shard is incorrect one specifying the shard the request should be moved to. The error is processed by the transport code that jumps to a correct shard and re-process incoming message there. " * 'gleb/bounce_lwt_request' of github.com:scylladb/seastar-dev: lwt: take raw lock for entire cas duration lwt: drop invoke_on in paxos_state prepare and accept lwt: Process lwt request on a owning shard storage_service: move start_native_transport into a thread transport: change make_result to takes a reference to cql result instead of shared_ptr	2019-12-12 15:50:22 +02:00
Nadav Har'El	9f62a3538c	alternator: fix BEGINS_WITH operator for blobs The implementation of Expected's BEGINS_WITH operator on blobs was incorrect, naively comparing the base64-encoded strings, which doesn't work. This patches fixes the code to compare the decoded strings. The reason why the BEGINS_WITH test missed this bug was that we forgot to check the blob case and only tested the string case; So this patch also adds the missing test - which reproduces this bug, and verifies its fix. Fixes #5457 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20191211115526.29862-1-nyh@scylladb.com>	2019-12-12 14:02:56 +01:00
Dejan Mircevski	27b8b6fe9d	cql3: Fix needs_filtering() for clustering columns The LIKE operator requires filtering, so needs_filtering() must check is_LIKE(). This already happens for partition columns, but it was overlooked for clustering columns in the initial implementation of LIKE. Fixes #5400. Tests: unit(dev) Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-12-12 01:19:13 +02:00
Benny Halevy	d1bcb39e7f	hinted handoff: log message after removing hints directory (#5372 ) To be used by dtest as an indicator that endpoint's hints were drained and hints directory is removed. Refs #5354 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2019-12-12 01:16:19 +02:00
Gleb Natapov	64cfb9b1f6	lwt: take raw lock for entire cas duration It will prevent parallel update by the same coordinator and should reduce contention.	2019-12-11 14:41:31 +02:00
Gleb Natapov	898d2330a2	lwt: drop invoke_on in paxos_state prepare and accept Since lwt requests are now running on an owning shard there is no longer a need to invoke cross shard call.	2019-12-11 14:41:31 +02:00
Gleb Natapov	964c532c4f	lwt: Process lwt request on a owning shard LWT is much more efficient if a request is processed on a shard that owns a token for the request. This is because otherwise the processing will bounce to an owning shard multiple times. The patch proposes a way to move request to correct shard before running lwt. It works by returning an error from lwt code if a shard is incorrect one specifying the shard the request should be moved to. The error is processed by transport code that jumps to a correct shard and re-process incoming message there.	2019-12-11 14:41:31 +02:00
Gleb Natapov	54be057af3	storage_service: move start_native_transport into a thread The code runs only once and it is simple if it runs in a seastar thread.	2019-12-11 14:41:31 +02:00
Gleb Natapov	007ba3e38e	transport: change make_result to takes a reference to cql result instead of shared_ptr	2019-12-11 14:41:31 +02:00
Nadav Har'El	9e5c6995a3	alternator-test: add tests for ReturnValues parameter This patch adds comprehensive tests for the ReturnValue parameter of the write operations (PutItem, UpdateItem, DeleteItem), which can return pre-write or post-write values of the modified item. The tests are in a new test file, alternator-test/test_returnvalues.py. This feature is not yet implemented in Alternator, so all the new tests xfail on Alternator (and all pass on AWS). Refs #5053 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20191127163735.19499-1-nyh@scylladb.com>	2019-12-11 13:26:39 +01:00
Nadav Har'El	ab69bfc111	alternator-test: add xfailing tests for ScanIndexForward This patch adds tests for Query's "ScanIndexForward" parameter, which can be used to return items in reversed sort order. We test that a Limit works and returns the given number of last items in the sort order, and also that such reverse queries can be resumed, i.e., paging works in the reverse order. These tests pass against AWS DynamoDB, but fail against Alternator (which doesn't support ScanIndexForward yet), so it is marked xfail. Refs #5153. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20191127114657.14953-1-nyh@scylladb.com>	2019-12-11 13:26:39 +01:00
Pekka Enberg	6bc18ba713	storage_proxy: Remove reference to MBean interface The JMX interface is implemented by the scylla-jmx project, not scylla. Therefore, let's remove this historical reference to MBeans from storage_proxy. Message-Id: <20191211121652.22461-1-penberg@scylladb.com>	2019-12-11 14:24:28 +02:00
Avi Kivity	63474a3380	Merge "Add `experimental_features` option" from Dejan " Add --experimental-features -- a vector of features to unlock. Make corresponding changes in the YAML parser. Fixes #5338 " * 'vecexper' of https://github.com/dekimir/scylla: config: Add `experimental_features` option utils: Add enum_option	2019-12-11 14:23:08 +02:00
Avi Kivity	56b9bdc90f	Update seastar submodule * seastar e440e831c8...00da4c8760 (7): > Merge "reactor: fix iocb pool underflow due to unaccounted aio fsync" from Avi Fixes #5443. > install-dependencies.sh: fix arch dependencies > Merge " rpc: fix use-after-free during rpc teardown vs. rpc server message handling" from Benny > Merge "testing: improve the observability of abandoned failed futures" from Botond > rework the fair_queue tester > directory_test: Update to use run instead of run_deprecated > log: support fmt 6.0 branch with chrono.h for log	2019-12-11 14:17:49 +02:00
Benny Halevy	105c8ef5a9	messaging_service: wait on unregister_handler Prepare for returning future<> from seastar rpc unregister_handler. Refs https://github.com/scylladb/scylla/issues/5228 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20191208153924.1953-1-bhalevy@scylladb.com>	2019-12-11 14:17:41 +02:00
Nadav Har'El	06c3802a1a	storage_proxy: avoid overflow in view-backlog delay calculation In the calculate_delay() code for view-backlog flow control, we calculate a delay and cap it at a "budget" - the remaining timeout. This timeout is measured in milliseconds, but the capping calculation converted it into microseconds, which overflowed if the timeout is very large. This causes some tests which enable the UB sanitizer to fail. We fix this problem by comparing the delay to the budget in millisecond resolution, not in microsecond resolution. Then, if the calculated delay is short enough, we return it using its full microsecond resolution. Fixes #5412 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20191205131130.16793-1-nyh@scylladb.com>	2019-12-11 14:10:54 +02:00
Nadav Har'El	2824d8f6aa	Merge: alternator: Fix EQ operator for sets Merged pull request https://github.com/scylladb/scylla/pull/5453 from Piotr Sarna: Checking the EQ relation for alternator attributes is usually performed simply by comparing underlying JSON objects, but sets (SS, BS, NS types) need a special routine, as we need to make sure that sets stored in a different order underneath are still equal, e.g: [1, 3, 2] == [1, 2, 3] Fixes #5021	2019-12-11 13:20:25 +02:00
Piotr Sarna	421db1dc9d	alternator-test: remove XFAIL from set EQ test With this series merged, test_update_expected_1_eq_set from test_expected.py suite starts passing.	2019-12-11 12:07:39 +01:00
Piotr Sarna	a8e45683cb	alternator: add EQ comparison for sets Checking the EQ relation for alternator attributes is usually performed simply by comparing underlying JSON objects, but sets (SS, BS, NS types) need a special routine, as we need to make sure that sets stored in a different order underneath are still equal, e.g: [1, 3, 2] == [1, 2, 3] Fixes #5021	2019-12-11 12:07:39 +01:00
Piotr Sarna	fb37394995	schema_tables: notify table deletions before creations If a set of mutations contains both an entry that deletes a table and an entry that adds a table with the same name, it's expected to be a replacement operation (delete old + create new), rather than a useless "try to create a table even though it exists already and then immediately delete the original one" operation. As such, notifications about the deletions should be performed before notifications about the creations. The place that originally suffered from this wrong order is view building - which in this case created an incorrect duplicated entry in the view building bookkeeping, and then immediately deleted it, resulting in having old, deprecated entries with stale UUIDS lying in the build queue and never proceeding, because the underlying table is long gone. The issue is fixed by ensuring the order of notifications: - drops are announced first, view drops are announced before table drops; - creations follow, table creations are announced before views; - finally, changes to tables and views are announced; Fixes #4382 Tests: unit(dev), mv_populating_from_existing_data_during_node_stop_test	2019-12-11 12:48:29 +02:00
Benny Halevy	d544df6c3c	dist/ami/build_ami.sh: support incremental build of rpms (#5191 ) Iterate over an array holding all rpm names to see if any of them is missing from `dist/ami/files`. If they are missing, look them up in build/redhat/RPMS/x86_64 so that if reloc/build_rpm.sh was run manually before dist/ami/build_ami.sh we can just collect the built rpms from its output dir. If we're still missing any rpms, then run reloc/build_rpm.sh and copy the required rpms from build/redhat/RPMS/x86_64. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Reviewed-by: Glauber Costa <glauber@scylladb.com>	2019-12-11 12:48:29 +02:00
Amnon Heiman	f43285f39a	api: replace swagger definition to use long instead of int (#5380 ) In swagger 1.2 int is defined as int32. We originally used int following the jmx definition, in practice internally we use uint and int64 in many places. While the API format the type correctly, an external system that uses swagger-based code generator can face a type issue problem. This patch replace all use of int in a return type with long that is defined as int64. Changing the return type, have no impact on the system, but it does help external systems that use code generator from swagger. Fixes #5347 Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2019-12-11 12:48:29 +02:00
Nadav Har'El	2abac32f2e	Merged: alternator: Implement CONTAINS and NOT_CONTAINS in Expected Merged pull request https://github.com/scylladb/scylla/pull/5447 by Dejan Mircevski. Adds the last missing operators in the "Expected" parameter and re-enable their tests. Fixes #5034.	2019-12-11 12:48:29 +02:00
Cem Sancak	86b8036502	Fix DPDK mode in prepare script Fixes #5455.	2019-12-11 12:48:29 +02:00
Calle Wilund	35089da983	conf/config: Add better descriptive text on server/client encryption Provide some explanation on prio strings + direction to gnutls manual. Document client auth option. Remove confusing/misleading statement on "custom options" Message-Id: <20191210123714.12278-1-calle@scylladb.com>	2019-12-11 12:48:28 +02:00
Dejan Mircevski	32af150f1d	alternator: Implement NOT_CONTAINS operator in Expected Enable existing NOT_CONTAINS test, add NOT_CONTAINS to the list of recognized operators, implement check_NOT_CONTAINS, and hook it up to verify_expected_one(). Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-12-10 15:31:47 -05:00
Dejan Mircevski	bd2bd3c7c8	alternator: Implement CONTAINS operator in Expected Enable existing CONTAINS test, implement check_CONTAINS, and hook it up to verify_expected_one(). Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-12-10 15:31:47 -05:00
Dejan Mircevski	5a56fd384c	config: Add `experimental_features` option When the user wants to turn on only some experimental features, they can use this new option. The existing `experimental` option is preserved for backwards compatibility. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-12-10 11:47:03 -05:00
Piotr Sarna	9504bbf5a4	alternator: move unwrap_set to serialization header The utility function for unwrapping a set is going to be useful across source files, so it's moved to serialization.hh/serialization.cc.	2019-12-10 15:08:47 +01:00
Piotr Sarna	4660e58088	alternator: move rjson value comparison to rjson.hh The comparison struct is going to be useful across source files, so it's moved into rjson header, where it conceptually belongs anyway.	2019-12-10 15:08:47 +01:00
Botond Dénes	db0e2d8f90	scylla-gdb.py: document and add safety net to seastar::thread related commands Almost all commands provided by `scylla-gdb.py` are safe to use. The worst that could happen if they fail is that you won't get the desired information. There is one notable exception: `scylla thread`. If anything goes wrong while this command is executed - gdb crashes, a bug in the command, etc. - there is a good change the process under examination will crash. Sometimes this is fine, but other times e.g. when live debugging a production node, this is unacceptable. To avoid any accidents add documentation to all commands working with `seastar::thread`. And since most people don't read documentation, especially when debugging under pressure, add a safety net to the `scylla thread` command. When run, this command will now warn of the dangers and will ask for explicit acknowledgment of the risk of crash, by means of passing an `--iamsure` flag. When this flag is missing, it will refuse to run. I am sure this will be very annoying but I am also sure that the avoided crashes are worth it. As part of making `scylla thread` safe, its argument parsing code is migrated to `argparse`. This changes the usage but this should be fine because it is well documented. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20191129092838.390878-1-bdenes@scylladb.com>	2019-12-10 11:51:57 +02:00
Eliran Sinvani	765db5d14f	build_ami: Trim ami description attribute to the allowed size The ami description attribute is only allowed to be 255 characters long. When build_ami.sh generates an ami, it generates an ami description which is a concatenation of all of the componnents version strings. It can happen that the description string is too long which eventually causes the ami build to fail. This patch trims the description string to 255 characters. It is ok since the individual versions of the components are also saved in tags attached to the image. Tests: 1. Reproduced with a long description and validated that it doesn't fail after the fix. Fixes #5435 Signed-off-by: Eliran Sinvani <eliransin@scylladb.com> Message-Id: <20191209141143.28893-1-eliransin@scylladb.com>	2019-12-10 11:51:57 +02:00
Fabiano Lucchese	4333b37f9e	scylla_setup: Support for enforcing optimal Linux clocksource setting (#5379 ) A Linux machine typically has multiple clocksources with distinct performances. Setting a high-performant clocksource might result in better performance for ScyllaDB, so this should be considered whenever starting it up. This patch introduces the possibility of enforcing optimized Linux clocksource to Scylla's setup/start-up processes. It does so by adding an interactive question about enforcing clocksource setting to scylla_setup, which modifies the parameter "CLOCKSOURCE" in scylla_server configuration file. This parameter is read by perftune.py which, if set to "yes", proceeds to (non persistently) setting the clocksource. On x86, TSC clocksource is used. Fixes #4474	2019-12-10 11:51:57 +02:00
Pavel Emelyanov	3a21419fdb	features: Remove _FEATURE suffix from hinted_handoff feature name All the other features are named w/o one. The internal const-s are all different, but I'm fixing it separately. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20191209154310.21649-1-xemul@scylladb.com>	2019-12-10 11:51:57 +02:00
Dejan Mircevski	a26bd9b847	utils: Add enum_option This allows us to accept command-line options with a predefined set of valid arguments. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-12-09 09:45:59 -05:00
Rafael Ávila de Espíndola	761b19cee5	build: Split the build and host linker flags A general build system knows about 3 machines: * build: where the building is running * host: where the built software will run * target: the machine the software will produce code for The target machine is only relevant for compilers, so we can ignore it. Until now we could ignore the build and host distinction too. This patch adds the first difference: don't use host ld_flags when linking build tools (gen_crc_combine_table). The reason for this change is to make it possible to build with -Wl,--dynamic-linker pointing to a path that will exist on the host machine, but may not exist on the build machine. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20191207030408.987508-1-espindola@scylladb.com>	2019-12-09 15:54:57 +02:00
fastio	8f326b28f4	Redis: Combine all the source files redis/commands/* into redis/commands.{hh,cc} Fixes: #5394 Signed-off-by: Peng Jian <pengjian.uestc@gmail.com>	2019-12-08 13:54:33 +02:00
Avi Kivity	9c63cd8da5	sysctl: reduce kernel tendency to swap anonymous pages relative to page cache (#5417 ) The vm.swappiness sysctl controls the kernel's prefernce for swapping anonymous memory vs page cache. Since Scylla uses very large amounts of anonymous memory, and tiny amounts of page cache, the correct setting is to prefer swapping page cache. If the kernel swaps anonymous memory the reactor will stall until the page fault is satisfied. On the other hand, page cache pages usually belong to other applications, usually backup processes that read Scylla files. This setting has been used in production in Scylla Cloud for a while with good results. Users can opt out by not installing the scylla-kernel-conf package (same as with the other kernel tunables).	2019-12-08 13:04:25 +02:00
Avi Kivity	0e319e0359	Update seastar submodule * seastar 166061da3...e440e831c (8): > Fail tests on ubsan errors > future: make a couple of asserts more strict > future: Move make_ready out of line > config: Do not allow zero rates Fixes #5360 > future: add new state to avoid temporaries in get_available_state(). > future: avoid temporary future_state on get_available_state(). > future: inline future::abandoned > noncopyable_function: Avoid uninitialized warning on empty types	2019-12-06 18:33:23 +02:00
Piotr Sarna	0718ff5133	Merge 'min/max on collections returns human-readable result' from Juliusz Previously, scylla used min/max(blob)->blob overload for collections, tuples and UDTs; effectively making the results being printed as blobs. This PR adds "dynamically"-typed min()/max() functions for compound types. These types can be complicated, like map<int,set<tuple<..., and created in runtime, so functions for them are created on-demand, similarly to tojson(). The comparison remains unchanged - underneath this is still byte-by-byte weak lex ordering. Fixes #5139 * jul-stas/5139-minmax-bad-printing-collections: cql_query_tests: Added tests for min/max/count on collections cql3: min()/max() for collections/tuples/UDTs do not cast to blobs	2019-12-06 16:40:17 +01:00
Juliusz Stasiewicz	75955beb0b	cql_query_tests: Added tests for min/max/count on collections This tests new min/max function for collections and tuples. CFs in test suite were named according to types being tested, e.g. `cf_map<int,text>' what is not a valid CF name. Therefore, these names required "escaping" of invalid characters, here: simply replacing with '_'.	2019-12-06 12:15:49 +01:00
Juliusz Stasiewicz	9efad36fb8	cql3: min()/max() for collections/tuples/UDTs do not cast to blobs Before: cqlsh> insert into ks.list_types (id, val) values (1, [3,4,5]); cqlsh> select max(val) from ks.list_types; system.max(val) ------------------------------------------------------------ 0x00000003000000040000000300000004000000040000000400000005 After: cqlsh> select max(val) from ks.list_types; system.max(val) -------------------- [3, 4, 5] This is accomplished similarly to `tojson()`/`fromjson()`: functions are generated on demand from within `cql3::functions::get()`. Because collections can have a variety of types, including UDTs and tuples, it would be impossible to statically define max(T t)->T for every T. Until now, max(blob)->blob overload was used. Because `impl_max/min_function_for` is templated with the input/output type, which can be defined in runtime, we need type-erased ("dynamic") versions of these functors. They work identically, i.e. they compare byte representations of lhs and rhs with `bytes::operator<`. Resolves #5139	2019-12-06 12:14:51 +01:00
Avi Kivity	a18a921308	docs: maintainer.md: use command line to merge multi-commit pull requests If you merge a pull request that contains multiple patches via the github interface, it will document itself as the committer. Work around this brain damage by using the command line.	2019-12-06 10:59:46 +01:00
Botond Dénes	7b37a700e1	configure.py: make tests explicitely depend on libseastar_testing.a So that changes to libseastar_testing.a make all test target out of date. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20191205142436.560823-1-bdenes@scylladb.com>	2019-12-05 19:30:34 +02:00
Piotr Sarna	3a46b1bb2b	Merge "handle hints on separate connection and scheduling group" from Piotr Introduce a new verb dedicated for receiving and sending hints: HINT_MUTATION. It is handled on the streaming connection, which is separate from the one used for handling mutations sent by coordinator during a write. The intent of using a separate connection is to increase fairness while handling hints and user requests - this way, a situation can be avoided in which one type of requests saturate the connection, negatively impacting the other one. Information about new RPC support is propagated through new gossip feature HINTED_HANDOFF_SEPARATE_CONNECTION. Fixes #4974. Tests: unit(release)	2019-12-05 17:25:26 +01:00
Calle Wilund	c11874d851	gms::inet_address: Use special ostream formatting to match Java To make gms::inet_address::to_string() similar in output to origin. The sole purpose being quick and easy fix of API/JMX ipv6 formatting of endpoints etc, where strings are used as lexical comparisons instead of textual representation. A better, but more work, solution is to fix the scylla-jmx bridge to do explicit parse + re-format of addresses, but there are many such callpoints. An even better solution would be to fix nodetool to not make this mistake of doing lexical comparisons, but then we risk breaking merge compatibility. But could be an option for a separate nodeprobe impl. Message-Id: <20191204135319.1142-1-calle@scylladb.com>	2019-12-05 17:01:26 +02:00
Gleb Natapov	4893bc9139	tracing: split adding prepared query parameters from stopping of a trace Currently query_options objects is passed to a trace stopping function which makes it mandatory to make them alive until the end of the query. The reason for that is to add prepared statement parameters to the trace. All other query options that we want to put in the trace are copied into trace_state::params_values, so lets copy prepared statement parameters there too. Trace enabled case will become a little bit more expensive but on the other hand we can drop a continuation that holds query_options object alive from a fast path. It is safe to drop the call to stop_foreground_prepared() here since The tracing will be stopped in process_request_one(). Message-Id: <20191205102026.GJ9084@scylladb.com>	2019-12-05 17:00:47 +02:00

1 2 3 4 5 ...

20413 Commits