scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-01 12:36:56 +00:00

Author	SHA1	Message	Date
Pavel Emelyanov	5434e412e4	api: Keep and use reference on token_metadata	2020-02-10 20:54:32 +03:00
Pavel Emelyanov	1a3f78a57d	database: Use own token_metadata Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-10 20:54:32 +03:00
Pavel Emelyanov	fecea1de7e	proxy: Use own token_metadata Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-10 20:54:32 +03:00
Pavel Emelyanov	2f3490dc8d	gossiper: Use own token_metadata Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-10 20:54:32 +03:00
Pavel Emelyanov	c5997b573c	tokens: Switch into standalone sharded instance Way too many places in code needs storage_service just for token_metadata. These references increase the amount of get(_local)?_storage_service() calls and create loops in components dependencies. Keep the token_metadata separately from storage_service and pass instances' references where needed (for now -- only into the storage_service itself). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-10 20:54:32 +03:00
Pavel Emelyanov	b4e66ddf1d	batchlog: Use in-config ring-delay This kills the first (out of two) global reference on storage_service Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-10 20:54:32 +03:00
Avi Kivity	bed61b96a2	Merge "Move features from storage- into feature-service" from Pavel " There's a lot of code around that needs storage service purely to get the specific feature value (cluster_supports_<something> calls). This creates several circular dependencies, e.g. storage_service <-> migration_manager one and database <-> storage_servuce. Also features sit on storage_service, but register themselfs on the feature_service and the former subscribes on them back which also looks strange. I propose to keep all the features on feature_service, this keeps the latter intependent from other components, makes it possible to break one of the mentioned circle dependencyand heavily relax the other. Also the set helps us fighting the globals and, after it, the feature_service can be safely stopped at the very last moment. Tests: unit(dev), manual debug build start-stop " * 'br-features-to-service-5' of https://github.com/xemul/scylla: gossiper: Avoid string merge-split for nothing features: Stop on shutdown storage_service: Remove helpers storage_service: Prepare to switch from on-board feature helpers cql3: Check feature in .validate database: Use feature service storage_proxy: Use feature service migration_manager: Use feature service start: Pass needed feature as argument into migrate_truncation_records features: Unfriend storage_service features: Simplify feature registration features: Introduce known_feature_set features: Move disabled features set from storage_service features: Move schema_features helper features: Move all features from storage_service to feature_service storage_service: Use feature_config from _feature_service features: Add feature_config storage_service: Kill set_disabled_features gms: Move features stuff into own .cc file migration_manager: Move some fns into class	2020-02-09 19:22:07 +02:00
Pavel Emelyanov	e2ec5eecf6	view_update: Do not need storage_proxy The view_update_generator acceps (and keeps) database and storage_proxy, the latter is only needed to initialize the view_updating_consumer which, in turn, only needs it to get database from (to find column family). This can be relaxed by providing the database from _generator to _consumer directly, without using the storage_proxy in between. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20200207112427.18419-1-xemul@scylladb.com>	2020-02-07 13:30:01 +02:00
Avi Kivity	e719ea1bba	Merge "Fix assert on initialization error" (in large_data_handler) from Rafael " This series fixes an assertion when initialization fails after creating a database. I don't know of a case where that currently happens, but it is easy to cause that when writing a patch and the produced assert is just confusing. " * 'espindola/dont-assert-on-init-error' of https://github.com/espindola/scylla: db: Replace large_data_handler::_stopped with _running db: Move nop_large_data_handler constructor out-of-line db: Move large_data_handler::stop out-of-line	2020-02-05 18:49:11 +02:00
Juliusz Stasiewicz	5127568cc4	cdc: cdc per-table options put into schema extensions With this patch, client tools (in particular cqlsh) get the access to cdc options and will be able to print them with `DESC TABLE`. Fixes #5589	2020-02-05 13:44:39 +01:00
Rafael Ávila de Espíndola	5d4671526c	db: Replace large_data_handler::_stopped with _running This is not just a direct flip to a variable with the negated Boolean value. When created, a large_data_handler is not considered to be running, the user has to call start() before it can be used. The advantaged of doing this is that if initialization fails and a database is destructed before the large_data_handler is started, the assert database::stop() { assert(!_large_data_handler->running()); is not triggered. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-02-04 21:15:44 -08:00
Nadav Har'El	3de09042bb	CDC topology change support Merged pull request https://github.com/scylladb/scylla/pull/5485 by Kamil Braun: This series introduces the notion of CDC generations: sets of CDC streams used by the cluster to choose partition keys for CDC log writes. Each CDC generation begins operating at a specific time point, called the generation's timestamp (cdc_streams_timestamp in the code). It continues being used by all nodes in the cluster to generate log writes until superseded by a new generation. Generations are chosen so that CDC log writes are colocated with their corresponding base table writes, i.e. their partition keys (which are CDC stream identifiers picked from the generation operating at time of making the write) fall into the same vnode and shard as the corresponding base table write partition keys. Currently this is probabilistic and not 100% of log writes will be colocated - this will change in future commits, after per-table partitioners are implemented. CDC generations are a global property of the cluster -- they don't depend on any particular table's configuration. Therefore the old "CDC stream description tables", which were specific to each CDC-enabled table, were removed and replaced by a new, global description table inside the system_distributed keyspace. A new generation is introduced and supersedes the previous one whenever we insert new tokens into the token ring, which breaks the colocation property of the previous generation. The new generation is chosen to account for the new tokens and restore colocation. This happens when a new node joins the cluster. The joining node is responsible for creating and informing other nodes about the new CDC generation. It does that by serializing it and inserting into an internal distributed table ("CDC topology description table"). If it fails the insert, it fails the joining process. It then announces the generation to other nodes through gossip using the generation's timestamp, which is the partition key of the inserted distributed table entry. Nodes that learn about the new generation through gossip attempt to retrieve it from the distributed table. This might fail - for example, if the node is partitioned away from all replicas that hold this generation's table entry. In that case the node might stop accepting writes, since it knows that it should send log entries to a new generation of streams, but it doesn't know what the generation is. The node will keep trying to retrieve the data in the background until it succeeds or sees that it is no longer necessary (e.g., because yet another generation superseded this one). So we give up some availability to achieve safety. However, this solution is not completely safe (might break consistency properties): if a node learns about a new generation too late (if gossip doesn't reach this node in time), the node might send writes to the wrong (old) generation. In the future we will introduce a transaction-based approach where we will always make sure that all nodes receive the new generation before any of them starts using it (and if it's impossible e.g. due to a network partition, we will fail the bootstrap attempt). In practice, if the admin makes sure that the cluster works correctly before bootstrapping a new node, and a network partition doesn't start in the few seconds window where a new generation is announced, everything will work as it should. After the learning node retrieves the generation, it inserts it into an in-memory data structure called "CDC metadata". This structure is then used when performing writes to the CDC log -- given the timestamp of the written mutation, the data structure will return the CDC generation operating at this time point. CDC metadata might reject the query for two reasons: if the timestamp belongs to an earlier generation, which most probably doesn't have the colocation property anymore, or if it is picked too far away into the future, where we don't know if the current generation won't be superseded by a different one (so we don't yet know the set of streams that this log write should be sent to). If the client uses server-generated timestamps, the query will never be rejected. Clients can also use client-generated timestamps, but they must make sure that their clocks are not too desynchronized with the database -- otherwise some or all of their writes to CDC-enabled tables will be rejected. In the case of rolling upgrade, where we restart nodes that were previously running without CDC, we act a bit differently - there is no naturally selected joining node which must propose a new generation. We have to select such a node using other means. For this we use a bully approach: every node compares its host id with host ids of other nodes and if it finds that it has the greatest host id, it becomes responsible for creating the first generation. This change also fixes the way of choosing values of the "time" column of CDC log writes: the timeuuid is chosen in a way which preserves ordering of corresponding base table mutations (the timestamp of this timeuuid is equal to the base table mutation timestamp). Warning: if you were running a previous CDC version (without topology change support), make sure to disable CDC on all tables before performing the upgrade. This will drop the log data -- backup it if needed. TODO in future patchset: expire CDC generations. Currently, each inserted CDC generation will stay in the distributed tables forever (until manually removed by the administrator). When a generation is superseded, it should become "expired", and 24 hours after expiration, it should be removed. The distributed tables (cdc_topology_description and cdc_description) both have an "expired" column which can be used for this purpose. Unit tests: dev, debug, release dtests (dev): https://jenkins.scylladb.com/job/scylla-master/job/byo/job/byo_build_tests_dtest/907/	2020-02-04 10:20:29 +02:00
Pavel Emelyanov	ca55c6c15f	features: Stop on shutdown The service in question doesn't depend on anything, so it's started first and stopped last. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-03 15:16:23 +03:00
Pavel Emelyanov	abe588888d	database: Use feature service Keep local feature_service reference on database. This relaxes the circular storage_service <-> database reference, but not removes it completely. This needs some args tossing in apply_to_builder, but it's rather straightforward, so comes in the same patch. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-03 15:16:23 +03:00
Pavel Emelyanov	12c1378be0	storage_proxy: Use feature service Keep reference on local feature service from storage_proxy and use it in places that have (local) storage_proxy at hands. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-03 15:16:23 +03:00
Pavel Emelyanov	4f5b70dcb1	migration_manager: Use feature service This unties migration_manager from storage_service thus breaking the circular dependency between these two. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-03 15:16:23 +03:00
Pavel Emelyanov	74fd3466b5	start: Pass needed feature as argument into migrate_truncation_records As a nice side-effect this stops using global storage service instance by this function. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-03 15:16:23 +03:00
Pavel Emelyanov	052259f8ef	features: Add feature_config Some features take db::config to find out whether to be enabled or disabled. This creates unwanted dependency between database and features, so split the features configuration explicitly. Also this will make the "this is for testing env only" logic cleaner and simpler to understand. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-03 15:16:23 +03:00
Pavel Emelyanov	4839ca8491	storage_service: Unregister from gossiper notifications ... at all This unregistration doesn't happen currently, but doesn't seem to cause any problems in general, as on stop gossiper is stopped and nothing from it hits the store_service. However (!) if an exception pops up between the storage_service is subscribed on gossiper and the drain_on_shutdown defer action is set up then we _may_ get into the following situation: - main's stuff gets unrolled back - gossiper is not stopped (drain_on_shutdown defer is not set up) - migration manager is stopped (with deferred action in main) - a nitification comes from gossiper -> storage_service::on_change might want to pull schema with the help of local migration manager -> assert(local_is_initialized) strikes Fix this by registering storage_service to gossiper a bit earlier (both are already initialized y that time) and setting up unregister defer right afterwards. Test: unit(dev), manual start-stop Bug: #5628 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20200130190343.25656-1-xemul@scylladb.com>	2020-01-31 14:02:18 +01:00
Eliran Sinvani	971711a546	storage proxy: migrate to per scheduling group statistics This commit builds on top of the introduced per scheduling group statistics template and employs it for achieving a per scheduling group statistics in storage_proxy. Some of the statistics also had meaning as a global - per shard one. Those are the ones for determining if to throttle the write request. This was handled by creating a global stats struct that will hold those stats and by changing the stat update to also include the global one. One point that complicated it is an already existing aggregation over the per shard stats that now became a per scheduling group per shard stats, converting the aggregation to a two-dimensional aggregation. One thing this commit doesn't handle is validating that an individual statistic didn't "cross a scheduling group boundary", such validation is possible but it can easily be added in the future. There is a subtlety to doing so since if the operation did cross to other scheduling group two connected statistics can lose balance for example written bytes and completed write transactions. Signed-off-by: Eliran Sinvani <eliransin@scylladb.com>	2020-01-30 15:01:44 +01:00
Kamil Braun	bd42b10df1	cdc: rename cdc/cdc.{hh,cc} to cdc/log.{hh,cc} To increase modularity, making it easier to find what is where and maintain. The 'log' module (cdc/log.{hh,cc}) is responsible for updating CDC log tables when base table writes are performed. The 'generation' module (cdc/generation.{hh,cc}) handles stream generation changes in response to topology change events. cdc/metadata.{hh,cc} contains a helper class which holds the currently used generation of streams. It is used by both aforementioned modules: 'log' queries it, while 'generation' updates it.	2020-01-30 11:10:39 +01:00
Nadav Har'El	ce0c9c1044	merge: add tagging to alternator Merged patch series from Piotr Sarna: This series adds the following to alternator: - TagResource request - UntagResource request - ListTagsOfResource request - Honoring "Tags" parameter in CreateTable It also provides more tests for above features and extended docs. Tagging is backed by a schema extension, which is in turn backed by entries in system_schema.tables.extensions map. Tags are considered part of the schema, and in particular they are updated via an equivalent of: ALTER TABLE table WITH scylla_tags = {'key1':'v1', 'key2':'v2'} Each tag change is therefore a schema change, which also means that editing tags for the same table on different nodes may be subject to races, until the schema agreement issues are resolved in Scylla. Fixes #5066 Tests: alternator-test(local, remote) Piotr Sarna (6): alternator,main: add tags schema extension alternator: add creating values from string views alternator: implement tagging alternator: allow tagging on table creation docs: add entries for alternator tags and arn alternator-test: make test tables case sensitive alternator-test/test_tag.py \| 63 ++++++++++- alternator-test/util.py \| 2 +- alternator/executor.cc \| 191 ++++++++++++++++++++++++++++++++-- alternator/executor.hh \| 3 + alternator/rjson.cc \| 4 + alternator/rjson.hh \| 1 + alternator/server.cc \| 3 + alternator/tags_extension.hh \| 52 +++++++++ docs/alternator/alternator.md \| 14 ++- main.cc \| 5 + 10 files changed, 325 insertions(+), 13 deletions(-) create mode 100644 alternator/tags_extension.hh	2020-01-29 18:11:47 +02:00
Piotr Sarna	16688efad7	alternator,main: add tags schema extension A schema extension is introduced for alternator - tags. This schema extension can be used to store arbitrary tags for a table, in the form of a map<text, text>. Updating tags for a table is equivalent to the following CQL query: ALTER TABLE table WITH scylla_tags = {'key1':'v1', 'key2':'v2'} The extension, as all other extensions, is backed by the entry in the system_schema.tables table.	2020-01-29 10:20:05 +01:00
Eliran Sinvani	57f90e34ea	alternator: run alternator processing loop in the statement scheduling group In Scylla all query processing activity should run under the "statement" scheduling group. The scheduling group is important for maintaining the balance between background and foreground tasks in Scylla. Testing: In order to test the correctness of the patch. First, the following assert was inserted before any call to one of the executor functions in the http route: assert(current_scheduling_group().name() == "statement" Then all alternator tests ran and passed. The second stage was to change the name so the assert will fail: assert(current_scheduling_group().name() == "no-statement" And ran the tests again - validating that Scylla coredumps. The asserts were then removed. Fixes #5008 Signed-off-by: Eliran Sinvani <eliransin@scylladb.com> Message-Id: <20200127154341.10020-1-eliransin@scylladb.com>	2020-01-29 00:11:17 +02:00
Avi Kivity	e09ed81c23	Merge "Fix two corner cases in snapshots API" from Pavel " There seem to be two problems with handling snapshot API -- one on start and the other one on stop. Here's the set that addresses both. The fix moved snapshot API registration later in time that required Amnon's ACK. Now we have it :) so -- the rebase and resend. Tests: unit(dev), start-stop " * 'br-snapshot-bugs-2' of https://github.com/xemul/scylla: snapshot: Pass requests through gate api: Register snapshot API later api: Unwrap wrap_ks_cf	2020-01-29 00:11:17 +02:00
Pavel Emelyanov	976463f620	snapshot: Pass requests through gate When the scylla process is stopped no code waits for current snapshot operations to finish. Also, the API server is not stopped either, so new snapshot requests can creep into. In seastar there's a useful abstraction to address both. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-27 17:42:04 +03:00
Pavel Emelyanov	fd6b5efe75	api: Register snapshot API later In storage_service's snapshot code there are checks for _operation_mode being _not_ JOINING to proceed. The intention is apparently to allow for snapshots only after the cluster join. However, here's how the start-up code looks like - _operation_mode = STARTING in storage_service::constructor - snapshot API registered in api::set_server_storage_service - _operation_mode = JOINING in storage_service::join_token_ring So in between steps 2 and 3 snapshots can be taken. Although there's a quick and simple fix for that (check for the _operation_mode to be not STARTING either) I think it's better to register the snapshot API later instead. This will help greatly to de-bload the storage_service, in particular -- to incapsulate the _operation_mode properly. Note, though the check for _operation_mode is made only for taking snapshot, I move all snapshot ops registration to the later phase. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-27 17:42:04 +03:00
Piotr Sarna	9fa88e26a9	Merge 'Alternator - LWT and ConditionExpression' from Nadav This is a fourth iteration of the patch series adding LWT usage (instead of the old naive - and wrong - read before write) to Alternator, as well as full support for the ConditionExpression syntax for conditional updates. Changes in v4: * Rebased to most recent master * Replaced 3 booleans which had 2^3 = 8 theoretical combinations, by just 4 options in enum write_isolation: FORBID_RMW, LWT_ALWAYS, LWT_RMW_ONLY, UNSAFE_RMW The four options are described in details comments. * Fix reversed assertion in FORBID_RMW case. * Two new metrics: write_using_lwt and shard_bounce_for_lwt. * Fail boot if alternator is enabled, but LWT isn't. * Add information about enabling LWT in docs/alternator/alternator.md * nyh/v4-lwt: alternator: add support for ConditionExpression alternator: reimplement read-modify-write operations using LWT alternator: make "executor" a peering_sharded_service	2020-01-26 12:02:32 +02:00
Nadav Har'El	370b963ce5	alternator: reimplement read-modify-write operations using LWT In this patch, we re-implement the three read-modify-write operations - PutItem, UpdateItem, DeleteItem. All three operations may need to read the item before writing it to support conditional updates (the "Expected" parameter) and UpdateItem may also need the previous item's value for its update expression (e.g., a user may ask to "set a=a+1" or "set a=b"). Before this patch, the implementation of RMW operations simply did a read, and then a write - without any attempt to protect concurrent operations. In this patch, Scylla's LWT mechanism (storage_proxy::cas()) is used instead, to ensure that concurrent update operations are correctly isolated even if they are conditional. This means that Alternator now requires the experimental LWT feature to be enabled (and refuses to boot if it isn't). The version presented here is configured to always use LWT for every write, regardless of whether it has a condition or not. So it will will significantly slow down write-only workloads like YCSB. But the code in this patch actually includes three other modes, which can be chosen by setting an enum constant in the code. In the future we will want to let the user configure this mode, globally, per table or per attribute. Note that read requests are NOT modified, and work exactly as they did before: i.e., strongly-consistent reads are done using a normal CL=LOCAL_QUORUM read - not via LWT. I believe this is good enough given Dynamo's guarantees, and critical for our read performance. Also note that patch doesn't yet fix the BatchWriteItem operation. Although BatchWriteItem does not support any RMW operations - just pure writes - we may still need to do those pure writes using LWT. This should be fixed in a follow-up patch. Unfortunately, this patch involves a large amount of code movement and reorganization, because: 1. The cas operation requires each operation to be made into an object, with a separate apply() function, forcing a lot of code to move. 2. Moreover, we need to do this for three different operations (PutItem, UpdateItem, DeleteItem) so to avoid massive code duplication, I had to move some common code. 3. The cas operation also forced us to change some of the utility functions' APIs. The end result is that this patch focuses more on a compact and understandable end result than it does on an easy to understand patch, so reviewers - sorry about that. All alternator-test/ tests pass with this patch (and also with all of the different optional modes enabled). However, other than that, I did not yet do any real isolation tests (are concurrent operations really isolated correctly? or is LWT just faking it? :-) ), performance tests or stress tests - and I'll definitely need to do those as well. Fixes #5054 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-01-23 13:57:28 +02:00
Rafael Ávila de Espíndola	7390485e20	repair: add row_level::stop() Now unregister_ is called from stop(). This reduces the noise in a followup patch. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-01-22 08:16:03 -08:00
Tomasz Grabiec	36d90e637e	Merge "Relax migration manager dependencies" from Pavel Emalyanov The set make dependencies between mm and other services cleaner, in particular, after the set: - the query processor no longer needs migration manager (which doesn't need query processor either) - the database no longer needs migration manager, thus the mutual dependency between these two is dropped, only migration manager -> database is left - the migration manager -> storage_service dependency is relaxed, one more patchset will be needed to remove it, thus dropping one more mutual dependency between them, only the storage_service -> migration manager will be left - the migration manager is stopped on drain, but several more services need it on stop, thus causing use after free problems, in particular there's a caught bug when view builder crashes when unregistering from notifier list on stop. Fixed. Tests: unit(dev) Fixes: #5404	2020-01-16 12:12:25 +01:00
Piotr Dulikowski	c383652061	gossip: allow for aborting on sleep This commit makes most sleeps in gossip.cc abortable. It is now possible to quickly shut down a node during startup, most notably during the phase while it waits for gossip to settle.	2020-01-16 12:05:50 +02:00
Pavel Emelyanov	5cf365d7e7	database: Explicitly pass migration_manager through init_non_system_keyspace This is the last place where database code needs the migration_manager instance to be alive, so now the mutual dependency between these two is gone, only the migration_manager needs the database, but not the vice-versa. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:29:21 +03:00
Pavel Emelyanov	d9edcb3f15	query_processor: Use migration_notifier This patch breaks one (probably harmless but still) dependency loop. The query_processor -> migration_manager -> storage_proxy -> tracing -> query_processor. The first link is not not needed, as the query_processor needs the migration_manager purely to (ub)subscribe on notifications. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:28:21 +03:00
Pavel Emelyanov	28f1250b8b	view_builder: Use migration notifier The migration manager itself is still needed on start to wait for schema agreement, but there's no longer the need for the life-time reference on it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:28:21 +03:00
Pavel Emelyanov	7cfab1de77	database: Switch on mnotifier from migration_manager Do not call for local migration manager instance to send notifications, call for the local migration notifier, it will always be alive. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:28:21 +03:00
Pavel Emelyanov	f45b23f088	storage_service: Keep migration_notifier The storage service will need this guy to initialize sub-services with. Also it registers itself with notifiers. That said, it's convenient to have the migration notifier on board. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:28:21 +03:00
Pavel Emelyanov	f240d5760c	migration_manager: Split notifier from main class The _listeners list on migration_manager class and the corresponding notify_xxx helpers have nothing to do with the its instances, they are just transport for notification delivery. At the same time some services need the migration manager to be alive at their stop time to unregister from it, while the manager itself may need them for its needs. The proposal is to move the migration notifier into a complete separate sharded "service". This service doesn't need anything, so it's started first and stopped last. While it's not effectively a "migration" notifier, we inherited the name from Cassandra and renaming it will "scramble neurons in the old-timers' brains but will make it easier for newcomers" as Avi says. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:28:19 +03:00
Pavel Emelyanov	1992755c72	storage_service: Kill initialization helper from init.cc The helper just makes further patching more complex, so drop it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:27:27 +03:00
Tomasz Grabiec	c8a5a27bd9	Merge "storage_service: Move load_broadcaster away" from Pavel E. The storage_service struct is a collection of diverse things, most of them requiring only on start and on stop and/or runing on shard 0 (but is nonetheless sharded). As a part of clearing this structure and generated by it inter- -componenes dependencies, here's the sanitation of load_broadcaster.	2020-01-14 19:26:06 +01:00
Piotr Sarna	36ec43a262	Merge "add table with connected cql clients" from Juliusz This change introduces system.clients table, which provides information about CQL clients connected. PK is the client's IP address, CK consists of outgoing port number and client_type (which will be extended in future to thrift/alternator/redis). Table supplies also shard_id and username. Other columns, like connection_stage, driver_name, driver_version..., are currently empty but exist for C* compatibility and future use. This is an ordinary table (i.e. non-virtual) and it's updated upon accepting connections. This is also why C*'s column request_count was not introduced. In case of abrupt DB stop, the table should not persist, so it's being truncated on startup. Resolves #4820	2020-01-14 10:01:07 +02:00
Juliusz Stasiewicz	27dfda0b9e	main/transport: using the infrastructure of system.clients Resolves #4820. Execution path in main.cc now cleans up system.clients table if it exists (this is done on startup). Also, server.cc now calls functions that notify about cql clients connecting/disconnecting.	2020-01-13 14:07:04 +01:00
Pavel Emelyanov	148da64a7e	storage_servce: Move load_broadcaster away This simplifies the storage_service API and fixes the complain about shared_ptr usage instead of unique_ptr. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-13 13:55:09 +03:00
Pavel Emelyanov	b6e1e6df64	misc_services: Introduce load_meter There's a lonely get_load_map() call on storage_service that needs only load broadcaster, always runs on shard 0 and that's it. Next patch will move this whole stuff into its own helper no-shard container and this is preparation for this. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-13 13:53:08 +03:00
Rafael Ávila de Espíndola	b80852c447	main: Explicitly allow scylla core dumps I have not looked into the security reason for disabling it when a program has file capabilities. Fixes #5560 [avi: remove extraneous semicolon] Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200106231836.99052-1-espindola@scylladb.com>	2020-01-07 11:15:59 +02:00
Pavel Emelyanov	f2b20e7083	cache_hitrate_calculator: Do not reinvent the peering_sharded_service The class in question wants to run its own instances on different shards, for this sake it keeps reference on sharded self to call invoke_on() on. There's a handy peering_sharded_service<> in seastar for the same, using it makes the code nicer and shorter. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20191226112401.23960-1-xemul@scylladb.com>	2020-01-03 15:48:19 +02:00
Rafael Ávila de Espíndola	91c7f5bf44	Print build-id on startup Fixes #5426 Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20191218031556.120089-1-espindola@scylladb.com>	2019-12-19 15:43:04 +02:00
Nadav Har'El	8157f530f5	merge: CDC: handle schema changes Merged pull request https://github.com/scylladb/scylla/pull/5366 from Calle Wilund: Moves schema creation/alter/drop awareness to use new "before" callbacks from migration manager, and adds/modifies log and streams table as part of the base table modification. Makes schema changes semi-atomic per node. While this does not deal with updates coming in before a schema change has propagated cluster, it now falls into the same pit as when this happens without CDC. Added side effect is also that now schemas are transparent across all subsystems, not just cql. Patches: cdc_test: Add small test for altering base schema (add column) cdc: Handle schema changes via migration manager callbacks migration_manager: Invoke "before" callbacks for table operations migration_listener: Add empty base class and "before" callbacks for tables cql_test_env: Include cdc service in cql tests cdc: Add sharded service that does nothing. cdc: Move "options" to separate header to avoid to much header inclusion cdc: Remove some code from header	2019-12-17 23:04:36 +02:00
Pavel Emelyanov	71a528d404	directories: Move the whole stuff into own .cc file In order not to pollute the root dir place the code in utils/ directory, "utils" namespace. While doing this -- move the touch_and_lock from the class declaration. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-12-12 19:52:01 +03:00
Pavel Emelyanov	f2b3c17e66	directories: Move all the dirs code into .init method The seastar::async usage is tempoarary, added for bisect-safety, soon it will go away. For this reason the indentation in the .init method is not "canonical", but is prepared for one-patch drop of the seastar::async. The hinted_handoff_enabled arg is there, as it's not just a parameter on config, it had been parsed in main.cc. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-12-12 17:33:11 +03:00

... 6 7 8 9 10 ...

776 Commits