scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-25 11:00:35 +00:00

Author	SHA1	Message	Date
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Avi Kivity	bfa4abaf6b	tracing: make sure keyspace and table names are available to static constructors Static constructors (specifically for the `system_keyspaces` global variable) need their dependencies to be already constructed when their own construction begins. Because tracing uses seastar::sstring, which is not constexpr, we must change it to std::string_view (which is). Change the type and perform the required adjustments. The definition is moved to the header file for simplicity.	2022-01-10 15:24:57 +02:00
Avi Kivity	3a0f2091d7	tracing: replace seastar::sprint() with fmt::format() sprint() is obsolete.	2021-10-27 17:02:00 +03:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Avi Kivity	daeddda7cc	treewide: remove inclusions of storage_proxy.hh from headers storage_proxy.hh is huge and includes many headers itself, so remove its inclusions from headers and re-add smaller headers where needed (and storage_proxy.hh itself in source files that need it). Ref #1.	2021-04-20 21:23:00 +03:00
Pavel Emelyanov	26c115f379	cql3: Change execute()'s 1st arg to query_processor Currently the statement's execute() method accepts storage proxy as the first argument. This is enough for all of them but schema altering ones, because the latter need to call migration manager's announce. To provide the migration manager to those who need it it's needed to have some higher-level service that the proxy. The query processor seems to be good candidate for it. Said that -- all the .execute()s now accept the querty processor instead of the proxy and get the proxy itself from the query processor. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-03-15 19:00:33 +03:00
Piotr Sarna	c5214eb096	treewide: remove timeout config from query options Timeout config is now stored in each connection, so there's no point in tracking it inside each query as well. This patch removes timeout_config from query_options and follows by removing now unnecessary parameters of many functions and constructors.	2021-02-25 17:20:27 +01:00
Konstantin Osipov	b4f875f08e	uuid: reduce code dependency on UUID_gen.hh Do not include UUID_gen.hh in trace_state.hh and lists.hh to reduce header level dependency on it. Message-Id: <20210127173114.725761-2-kostja@scylladb.com>	2021-01-27 20:08:29 +02:00
Pavel Emelyanov	e7f74449a6	tracing: Keep qp anchor on backend The query processor is required in table_helper's used by tracing. Now everything is ready to push the query processor reference from main down to the table helpers. Because of the current initialization sequence it's only possible to have the started query processor at the .start_tracing() time. Earlier, when the sharded<tracing> is started the query processor is not yet started, so tracing keeps a pointer on local query processor. When tracing is stopped, the pointer is null-ed. This is safe (but an assert is put when dereferencing it), because on stop trace writes' gate is closed and the query processor is only used in them. Also there's still a chance that tracing remains started in case of start abort, but this is on-par with the current code -- sharded query processor is not stopped, so the memory is not freed. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-10-06 15:45:19 +03:00
Pavel Emelyanov	87f1223965	tracing: Push query processor through init methods The goal is to make tracing keyspace helper reference query processor, so this patch adds the needed arguments through the initialization stack. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-10-06 15:45:12 +03:00
Pavel Emelyanov	b18522a7ab	table_helper: Require local query processor in calls Keeping the query processor reference on the table_helper in raii manner seems waistful, the only user of it -- the trace_keyspace_helper -- has a bunch of helpers on board, each would then keep its own copy for no gain. At the same time the trace_keyspace_helper already gets the query processor for its needs, so it can share one with table_helper-s. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-10-06 15:44:20 +03:00
Wojciech Mitros	e79ad38425	tracing: add username to the session table In order to improve observability, add a username field to the the system_traces.sessions table. The system table should be change while upgrading by running the fix_system_distributed_tables.py script. Until the table is updated, the old behaviour is preserved. Fixes #6737.	2020-10-01 04:46:40 +02:00
Juliusz Stasiewicz	0afa738a8f	tracing: Fix error on slow batches `trace_keyspace_helper::make_slow_query_mutation_data` expected a "query" key in its parameters, which does not appear in case of e.g. batches of prepared statements. This is example of failing `record.parameters`: ``` ...{"query[0]" : "INSERT INTO ks.tbl (pk, i) values (?, ?);"}, {"query[1]" : "INSERT INTO ks.tbl (pk, i) values (?, ?);"}... ``` In such case Scylla recorded no trace and said: ``` ERROR 2020-09-28 10:09:36,696 [shard 3] trace_keyspace_helper - No "query" parameter set for a session requesting a slow_query_log record ``` Fix here is to leave query empty if not found. The users can still retrieve the query contents from existing info. Fixes #5843 Closes #7293	2020-09-29 13:24:39 +02:00
Pavel Emelyanov	8618a02815	migration_manager: Remove db/schema_tables.hh inclustion into header The schema_tables.hh -> migration_manager.hh couple seems to work as one of "single header for everyhing" creating big blot for many seemingly unrelated .hh's. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-07-17 17:54:43 +03:00
Rafael Ávila de Espíndola	c5795e8199	everywhere: Replace engine().cpu_id() with this_shard_id() This is a bit simpler and might allow removing a few includes of reactor.hh. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200326194656.74041-1-espindola@scylladb.com>	2020-03-27 11:40:03 +03:00
Pavel Emelyanov	b11cf6e950	cql3/query_processor.hh: Debloat from other headers This gives ~30% less (251 jobs -> 181 jobs) recompile when touching it Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20200212225828.3374-1-xemul@scylladb.com>	2020-02-16 11:22:30 +02:00
Avi Kivity	dcab666d52	cql3: query_processor: reduce #includes query_processor is a central class, so reducing its includes can reduce dependencies treewite. This patch removes includes for parsed_statement, cf_statement, and untyped_result_set and fixes up the rest of the tree to include what it lacks as a result of these removals.	2020-02-09 12:24:24 +02:00
Gleb Natapov	f78b2c5588	transport: remove remaining craft related to cql's server load balancing Commit `7e3805ed3d` removed the load balancing code from cql server, but it did not remove most of the craft that load balancing introduced. The most of the complexity (and probably the main reason the code never worked properly) is around service::client_state class which is copied before been passed to the request processor (because in the past the processing could have happened on another shard) and then merged back into the "master copy" because a request processing may have changed it. This commit remove all this copying. The client_request is passed as a reference all the way to the lowest layer that needs it and it copy construction is removed to make sure nobody copies it by mistake. tests: dev, default c-s load of 3 node cluster Message-Id: <20190906083050.GA21796@scylladb.com>	2019-09-07 18:17:53 +03:00
Botond Dénes	fddd9a88dd	treewide: silence discarded future warnings for legit discards This patch silences those future discard warnings where it is clear that discarding the future was actually the intent of the original author, and they did the necessary precautions (handling errors). The patch also adds some trivial error handling (logging the error) in some places, which were lacking this, but otherwise look ok. No functional changes.	2019-08-26 18:54:44 +03:00
Avi Kivity	3a44fa9988	cql3, treewide: introduce empty cql3::cql_config class and propagate it We need a way to configure the cql interpreter and runtime. So far we relied on accessing the configuration class via various backdoors, but that causes its own problems around initialization order and testability. To avoid that, this patch adds an empty cql_config class and propagates it from main.cc (and from tests) to the cql interpreter via the query_options class, which is already passed everywhere. Later patches will fill it with contents.	2019-08-21 19:35:59 +02:00
Gleb Natapov	6a4207f202	Pass service permit to storage_proxy Current cql transport code acquire a permit before processing a query and release it when the query gets a reply, but some quires leave work behind. If the work is allowed to accumulate without any limit a server may eventually run out of memory. To prevent that the permit system should account for the background work as well. The patch is a first step in this direction. It passes a permit down to storage proxy where it will be later hold by background work.	2019-08-12 10:20:43 +03:00
Piotr Jastrzebski	134b59a425	table_helper: take insert function arguments by value Previous version wasn't working correctly with r-values. Fixes #4438 Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <5017b04901c47bd826b2e411e603ce01e42a83a5.1555424512.git.piotr@scylladb.com>	2019-04-16 17:34:35 +03:00
Duarte Nunes	fa2b0384d2	Replace std::experimental types with C++17 std version. Replace stdx::optional and stdx::string_view with the C++ std counterparts. Some instances of boost::variant were also replaced with std::variant, namely those that called seastar::visit. Scylla now requires GCC 8 to compile. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20190108111141.5369-1-duarte@scylladb.com>	2019-01-08 13:16:36 +02:00
Avi Kivity	cfedf4ab0f	table_helper: de-template setup_keyspace() This setup function has no reason to be a template and is easily converted. We can then later de-inline it to reduce dependencies.	2019-01-05 16:23:10 +02:00
Avi Kivity	6641353854	tracing: remove static class_registry Static class_registries hinder librarification by requiring linking with all object files (instead of a library from which objects are linked on demand) and reduce readability by hiding dependencies and by their horrible syntax. Hide them behind a non-static, non-template tracing backend registry. Message-Id: <20181229121000.7885-1-avi@scylladb.com>	2018-12-31 13:24:54 +00:00
Vlad Zolotarov	6db90a2e63	tracing: store a query response size Add a new "response_size" column to system_traces.sessions and store a size of an uncompressed response for a traced query. Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2018-08-03 12:29:36 -04:00
Vlad Zolotarov	05020921bb	tracing: store request size Add a new column "request_size" to system_traces.sessions and store the uncompressed request frame data size. Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2018-08-03 12:29:36 -04:00
Vlad Zolotarov	9723988926	cql3::statements::batch_statement: introduce a single_statement class This is a helper class needed to control the handling process of a single statement in the current batch. In particular it has the boolean defining if the authorization is needed for this statement. Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2018-05-22 20:15:03 -04:00
Avi Kivity	7b5db486a0	query_options: augment with timeout_config Add a timeout_config member to query_options. This lets the query processor know what timeouts the user of this query want to apply.	2018-04-30 13:19:53 +03:00
Avi Kivity	f7b102238a	cql3: change cql_statement methods to accept a local storage_proxy The storage_proxy represents the entire cluster, so there's never a need to access it on a remote shard; the local shard instance will contact remote shard or remote nodes as needed. Simplify the API by passing storage_proxy references instead of seastar::sharded<storage_proxy> references. query_processor and other callers are adjusted to call seastar::sharded::local() first. Message-Id: <20180415142656.25370-2-avi@scylladb.com>	2018-04-16 10:18:28 +02:00
Jesse Haber-Kucharsky	1dd181bd7b	tracing/trace_keyspace_helper: Use internal `client_state`	2017-11-15 23:19:18 -05:00
Piotr Jastrzebski	80f08921c4	Make table_helper independent from trace_keyspace_helper table_helper is a generic helper than can easily be used in other places. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <11e46dbc1c90d0273a41c8144e6f6013e21efcdb.1499077818.git.piotr@scylladb.com>	2017-07-03 15:55:00 +03:00
Avi Kivity	5bc13e4454	Revert "Make table_helper independent from trace_keyspace_helper" This reverts commit `db5bf363d0`. Causes errors of the sort Exiting on unhandled exception: exceptions::invalid_request_exception (Keyspace 'system_traces' does not exist)	2017-07-02 11:30:51 +03:00
Piotr Jastrzebski	db5bf363d0	Make table_helper independent from trace_keyspace_helper table_helper is a generic helper than can easily be used in other places. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <3e360a963d4a53de6d758ba8bada78fc572f001a.1498745600.git.piotr@scylladb.com>	2017-06-29 17:20:07 +03:00
Avi Kivity	c4faa1e202	Merge "tracing: tracing spans and time series helper table" from Vlad " - Introduce a parent span IP and span ID paradigm. - Introduce time series tables to simplify traces processing. - Add the "How to get traces?" chapter to the tracing.md. " * 'tracing-span-ids-and-time-series-helpers-v4' of github.com:cloudius-systems/seastar-dev: docs: tracing.md: add a "how to get traces" chapter tracing::trace_keyspace_helper: introduce a time series helper tables tracing: cleanup: use nullptr instead of trace_state_ptr() tracing: introduce a span ID and parent span ID	2017-05-28 12:01:35 +03:00
Avi Kivity	ebaeefa02b	Merge seatar upstream (seastar namespace) - introcduced "seastarx.hh" header, which does a "using namespace seastar"; - 'net' namespace conflicts with seastar::net, renamed to 'netw'. - 'transport' namespace conflicts with seastar::transport, renamed to cql_transport. - "logger" global variables now conflict with logger global type, renamed to xlogger. - other minor changes	2017-05-21 12:26:15 +03:00
Vlad Zolotarov	f993e85b5f	tracing::trace_keyspace_helper: introduce a time series helper tables Introduce two time series helper tables that will simplify the querying of traces. One for querying regular traces: CREATE TABLE system_traces.sessions_time_idx ( minute timestamp, started_at timestamp, session_id uuid, PRIMARY KEY (minute, started_at, session_id)) and one for querying slow query records: CREATE TABLE system_traces.node_slow_log_time_idx ( minute timestamp, started_at timestamp, session_id uuid, start_time timeuuid, node_ip inet, shard int, PRIMARY KEY (minute, started_at, session_id)) With these tables one may get the relevant traces like in an example below: SELECT * from system_traces.sessions_time_idx where minutes in ('2016-09-07 16:56:00-0700') and started_at > '2016-09-07 16:56:30-0700' Fixes #2243 Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2017-04-25 21:52:28 -04:00
Vlad Zolotarov	b0f660331a	tracing: introduce a span ID and parent span ID This patch makes the tracing framework follow the general idea of Google's Dapper paper: traces generated in a context of the same query are forming a single-rooted acyclic tree where in a ScyllaDB case vertexes are spans running on each involved replica Node and edges are RPCs sent from one Node to another. - Each vertex in the tree above has an ID - "span ID". - In order to be able to build the tree from the sessions traces we need to know the parent "span ID" - the ID of a span that sent an RPC that created the current span. - Each span of a tracing session is given a 64-bit random span ID. - The root span has a span_id::illegal_id value. This patch adds: - The described above parent span ID and a span ID to the one_session_records object. - The current span ID is passed in the trace_info struct to the remote replica. - Add parent_id and span_id columns to system_traces.events table for the parent ID and span ID. Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2017-04-25 21:52:23 -04:00
Vlad Zolotarov	f8956ba01a	tracing: use prepared statment for updating tables In addition to actually moving to using the prepared statements the changes also include: - Kill the cache_xxx() methods - the schema is going to be checked during the prepared statement creation and during its execution. - Move the caching of table ID and the prepared statement to the get_schema_ptr_or_create(). - Rename: get_schema_ptr_or_create() -> cache_table_info(). After these changes we are less strict in our demands to system_traces tables schemas, e.g. if some column's type is not exactly as we expect but rather only "compatible" in the CQL sense we will tolerate this and will continue to write into that table. Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2017-04-12 17:12:42 -04:00
Vlad Zolotarov	98864c6c30	tracing::trace_keyspace_helper: introduce a table_helper class This class contains a general table info and implements standard operations on this table: - Creation. - Info caching. Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2017-04-12 17:04:48 -04:00
Tomasz Grabiec	d6425e7646	db: Create default auth and tracing keyspaces using lowest timestamp If the node is bootstrapped with auto_boostrap disabled, it will not wait for schema sync before creating global keyspaces for auth and tracing. When such schema changes are then reconciled with schema on other nodes, they may overwrite changes made by the user before the node was started, because they will have higher timestamp. To prevent that, let's use minimum timestamp so that default schema always looses with manual modifications. This is what Cassandra does. Fixes #2129.	2017-03-07 19:19:15 +01:00
Avi Kivity	c227e3e706	Merge "move a few files in the ScyllaDB project to use the new metrics registration API" from Vlad * 'rearrange-scylla-collectd-stats-registration-v3' of github.com:cloudius-systems/seastar-dev: thrift::server: move collectd counters registration to the metrics registration layer gms::gossiper: move collectd counters registration to the metrics registration layer utils::logalloc: move collectd counters registration to metrics registration layer streaming::stream_manager: move a collectd counters registration to the metrics registration layer db::commitlog::commitlog: move collectd counters registration to the metrics registration layer sstables::compaction_manager: move collectd metrics registration to the metrics registration layer db::batchlog_manager: move collectd registration to the metrics registration layer transport::server: move collectd metrics registration to the metrics registration layer cql3::query_processor: move collectd metrics registration to the metrics registration layer database: move collectd registrations to metrics registration layer tracing::trace_keyspace_helper: move collectd metrics registration to a metric registration layer tracing::trace_keyspace_helper: fix alignment tracing::tracing: move collectd metrics registration to metrics registration layer	2017-01-12 17:13:08 +02:00
Vlad Zolotarov	ca0a0f1458	tracing::trace_keyspace_helper: use generate_legacy_id() for CF IDs generation Explicitly generate tables' IDs of tables from the system_traces KS using generate_legacy_id() in order to ensure all Nodes create these tables with the same IDs. This is going to prevent hitting issue #420. Fixes #1976 Signed-off-by: Vlad Zolotarov <vladz@scylladb.com> Message-Id: <1484153725-31030-1-git-send-email-vladz@scylladb.com>	2017-01-12 11:36:35 +02:00
Vlad Zolotarov	af29c3506b	tracing::trace_keyspace_helper: move collectd metrics registration to a metric registration layer Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2017-01-10 16:24:54 -05:00
Vlad Zolotarov	0df37c04f6	tracing::trace_keyspace_helper: fix alignment Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2017-01-10 16:24:54 -05:00
Piotr Jastrzebski	4bbe05dd47	mutation_partition: take schema in find_row and clustered_row This will allow intrusive set implementation that does not store schema. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-01-05 11:26:03 +01:00
Avi Kivity	1d9ee358f1	Revert "Merge "Reduce the size of mutation_partition" from Piotr" This reverts commit `aa392810ff`, reversing changes made to a24ff47c637e6a5fd158099b8a65f1191fc2d023; it uses boost::intrusive::detail directly, which it must not, and doesn't compile on all boost versions as a consequence.	2016-12-25 16:07:48 +02:00
Piotr Jastrzebski	2af6ff68d9	mutation_partition: take schema in find_row and clustered_row This will allow intrusive set implementation that does not store schema. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2016-12-23 11:29:07 +01:00
Nadav Har'El	fe1ba753ce	Avoid semaphore's default initial value The fact that Seastar's semaphore has a default initializer of 1 if not explicitly initialized is confusing and unexpected and recently lead to two bugs. So ScyllaDB should not rely on this default behavior, and specify the initial value of each semaphore explicitly. In several cases in the ScyllaDB code, the explict initialization was missing, and this patch adds it. In one case (rate_limiter) I even think the default of 1 was a bit strange, and 0 makes more sense. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <1474530745-23951-1-git-send-email-nyh@scylladb.com>	2016-09-24 19:25:02 +03:00
Vlad Zolotarov	a491ac0f18	tracing: introduce a log_slow_query logic The main idea is to log queries that take "too long" to complete. The "too long" is above the given threshold. To achieve the above this patch does the following: - Introduce two new properties to the tracing::trace_state: - "Full tracing": when the tracing of this query was explicitly requested. In this state we will record all possible traces related to this query: both on the coordinator and on any replica involved. - "Log slow query": when slow query logging is enabled. If slow query logging is enabled and a session's "duration" is above the specified threshold we will create a record in the "slow queries log" and write all trace records created on the coordinator and on a replica if a replica's session lasts longer than that threshold. (We will propagate the Coordinator's slow query logging threshold to replicas in the context of a specific tracing/logging session). The properties above are independent, namely they may be enabled and/or disabled independently and any combination of them is legal (naturally, creating a tracing session when both states above are disabled makes no sense). - Instrument the tracing::tracing service to allow the following: - Enable/disable slow query logging. - Set/get the slow query duration threshold (in microseconds). - Set/get the slow query log record TTL value (in seconds). - Instrument the trace_keyspace_helper to write a slow query log entry when requested. - The slow query logging is disabled by default and the threshold is set to half a second. - The TTL of a slow log record is set to 86400 seconds by default. - It makes sense to use the same "slow query logging threshold" and a "slow query record TTL" both on a coordinator and on a replica Nodes in a context of the same tracing session: - Pass both TTL and a threshold to the replica in a trace_info. This patch also implements the new slow query logging specific logic: - Don't write the pending tracing records before the end of a tracing session until "duration" reaches the logging threshold. - Don't build the parameters<sstring, sstring> map unless we know we will write it to I/O. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-08-28 18:28:44 +03:00

1 2

71 Commits