scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-31 03:56:42 +00:00

Author	SHA1	Message	Date
Botond Dénes	c872a963b6	test: move reader_concurrency_semaphore related tests into separate file The mutation_reader_test is already one of our largest test files. Move the reader concurrency semaphore related tests to a new file, making them easier to find and making the mutation reader test a little bit smaller too.	2021-05-06 08:59:47 +03:00
Tomasz Grabiec	121eb32679	Merge 'test: perf: report instructions retired per operations' from Avi Kivity Instructions retired per op is a much more stable metric than time per op (inverse throughput) since it isn't much affected by changes in CPU frequency or other load on the test system (it's still somewhat affected since a slower system will run more reactor polls per op). It's also less indicative of real performance, since it's possible for fewer instructions to execute in more time than more instructions, but that isn't an issue for comparative tests. This allows incremental changes to the code base to be compared with more confidence. Current results are around 55k instructions per read, and 52k for writes. Closes #8563 * github.com:scylladb/scylla: test: perf: tidy up executor_stats snapshot computation test: perf: report instructions retired per operations test: perf: add RAII wrapper around Linux perf_event_open() test: perf: make executor_stats_snapshot() a member function of executor	2021-05-05 00:54:08 +02:00
Avi Kivity	6ffd813b7b	Merge 'hints: delay repair until hints are replayed' from Piotr Dulikowski Both hinted handoff and repair are meant to improve the consistency of the cluster's data. HH does this by storing records of failed replica writes and replaying them later, while repair goes through all data on all participating replicas and makes sure the same data is stored on all nodes. The former is generally cheaper and sometimes (but not always) can bring back full consistency on its own; repair, while being more costly, is a sure way to bring back current data to full consistency. When hinted handoff and repair are running at the same time, some of the work can be unnecessarily duplicated. For example, if a row is repaired first, then hints towards it become unnecessary. However, repair needs to do less work if data already has good consistency, so if hints finish first, then the repair will be shorter. This PR introduces a possibility to wait for hints to be replayed before continuing with user-issued repair. The coordinator of the repair operation asks all nodes participating in the repair operation (including itself) to mark a point at the end of all hint queues pointing towards other nodes participating in repair. Then, it waits until hint replay in all those queues reaches the marked point, or the configured timeout is reached. This operation is currently opt-in and can be turned on by setting the `wait_for_hint_replay_before_repair_in_ms` config option to a positive value. 
Fixes #8102 Tests: - unit(dev) - some manual tests: - shutting down repair coordinator during hints replay, - shutting down node participating in repair during hints replay, Closes #8452 * github.com:scylladb/scylla: repair: introduce abort_source for repair abort repair: introduce abort_source for shutdown storage_proxy: add abort_source to wait_for_hints_to_be_replayed storage_proxy: stop waiting for hints replay when node goes down hints: dismiss segment waiters when hint queue can't send repair: plug in waiting for hints to be sent before repair repair: add get_hosts_participating_in_repair storage_proxy: coordinate waiting for hints to be sent config: add wait_for_hint_replay_before_repair option storage_proxy: implement verbs for hint sync points messaging_service: add verbs for hint sync points storage_proxy: add functions for syncing with hints queue db/hints: make it possible to wait until current hints are sent db/hints: add a metric for counting processed files db/hints: allow to forcefully update segment list on flush	2021-05-03 18:47:27 +03:00
Avi Kivity	0bc98caf3e	test: perf: add RAII wrapper around Linux perf_event_open() Make it easy to embed in other classes. A helper function is provided for the instructions retired counter.	2021-04-28 18:41:02 +03:00
Piotr Dulikowski	82c419870a	messaging_service: add verbs for hint sync points Adds two verbs: HINT_SYNC_POINT_CREATE and HINT_SYNC_POINT_CHECK. Those will make it possible to create a sync point and regularly poll to check its existence.	2021-04-27 15:06:39 +02:00
Botond Dénes	f7f5fca5a8	Add very basic coverage report generation support This patch introduces the most basic bare infrastructure to generate coverage report as well as a guide on how to manually generate them. Although this barely qualifies as "support", it already allows one to generate a coverage report with the help of this guide. One immediate limitation of this patch is that it only supports clang, which is not a terrible problem, given that it's our main compiler currently. Future patches will build on this to incrementally improve and automate this: * Provide script to automatically merge profraw files and generate html report from it. * Integrate into test.py, adding a flag which causes it to generate a coverage report after a run. * Support GCC too, but at least auto-detect whether clang is used or not. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20210423140100.659452-1-bdenes@scylladb.com>	2021-04-25 15:59:20 +03:00
Pavel Emelyanov	e7dc059917	migration_manager: Merge migration_task in The migration_task is a class with a single static method that's called from a single place in migration manager, and this method calls migration manager back right away. There's not much sense in keeping this abstraction, merge it into the migration manager. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-04-23 17:13:24 +03:00
Piotr Sarna	dfd1ea6b92	test: rename alternator_base64_test to alternator_unit_test With the more generic name, I would no longer feel bad adding non-base64 test cases to it.	2021-04-21 14:26:40 +02:00
Nadav Har'El	c29f55e801	Merge 'Unify CQL and Redis server code' from Pekka Enberg The Redis server started as a copy of the CQL server, but did not receive all the fixes of the CQL server over time. For example, commit `1a8630e` ("transport: silence "broken pipe" and "connection reset by peer" errors") was only done on the CQL server. To remedy the situation, this pull request unifies code between the CQL and Redis servers by introducing a "generic_server" component, and switching CQL and Redis to use it. Test: dtest(dev) Closes #8388 * github.com:scylladb/scylla: generic_server: Rename "maybe_idle" to "maybe_stop" generic_server: API documentation for connection and server classes transport, redis: Use generic server::listen() transport/server: Remove "redis_server" prefix from logging transport/server: Remove "cql_server" prefix from logging generic_server: Remove unneeded static_pointer_cast<> transport, redis: Use generic server::do_accepts() transport, redis: Use generic server::process() redis: Move Redis specific code to handle_error() transport: Move CQL specific error handling to handle_error() transport, redis: Move connection tracking to generic_server::server class transport, redis: Move _stopped and _connections_list to generic_server::server class transport, redis: Move total_connections to generic_server::server class transport, redis: Use generic server::maybe_idle() transport, redis: Move list_base_hook<> inheritance to generic_server::connection transport, redis: Use generic connection::shutdown()	2021-04-20 12:20:25 +03:00
Tomasz Grabiec	320f6bf220	Merge 'test: perf: perf_simple_query: collect allocation and task statistics' from Avi Kivity Calculate and display the number of memory allocations and tasks executed per operation. Sample results (--smp 1): 180022.46 tps (90 allocs/op, 20 tasks/op) 178963.44 tps (90 allocs/op, 20 tasks/op) 178702.41 tps (90 allocs/op, 20 tasks/op) 177679.74 tps (90 allocs/op, 20 tasks/op) 179539.36 tps (90 allocs/op, 20 tasks/op) median 178963.44 tps (90 allocs/op, 20 tasks/op) median absolute deviation: 575.92 maximum: 180022.46 minimum: 177679.74 This allows less noisy tracking of how some changes impact performance. Closes #8425 * github.com:scylladb/scylla: test: perf: perf_simple_query: collect allocation and task statistics perf: deinline some functions in perf.hh	2021-04-14 13:16:00 +02:00
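The summary figures in the sample results above (median, median absolute deviation, maximum, minimum) can be reproduced from the five throughput samples. This is a minimal Python sketch for illustration only, not the code used by perf_simple_query; the function name `summarize` is hypothetical.

```python
import statistics

def summarize(samples):
    """Return median, median absolute deviation (MAD), max and min."""
    med = statistics.median(samples)
    # MAD: the median of the absolute distances from the median.
    mad = statistics.median([abs(x - med) for x in samples])
    return {"median": med, "mad": mad, "max": max(samples), "min": min(samples)}

# Throughput samples (tps) copied from the commit message above.
tps = [180022.46, 178963.44, 178702.41, 177679.74, 179539.36]
stats = summarize(tps)

print(f"median {stats['median']:.2f} tps")
print(f"median absolute deviation: {stats['mad']:.2f}")
print(f"maximum: {stats['max']:.2f}")
print(f"minimum: {stats['min']:.2f}")
```

The median absolute deviation is less sensitive to a single outlying run than the standard deviation, which fits the stated goal of less noisy tracking.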
Michael Livshin	4ccb1b3a2f	build: add nix-shell support Support native building & unit testing in the Nix ecosystem under nix-shell. Actual dist packaging for Nixpkgs/NixOS is not there (yet?), because: * Does not exactly seem like a huge priority. * I don't even have a firm idea of how much work it would entail (it certainly does not need the ld.so trickery, so there's that. But at least some work would be needed, seeing how ScyllaDB needs to integrate with its environment and NixOS is a little unorthodox). Signed-off-by: Michael Livshin <michael.livshin@scylladb.com> Message-Id: <20210413110508.5901-4-michael.livshin@scylladb.com>	2021-04-14 13:15:59 +02:00
Michael Livshin	d87e751182	build: add a structural way to distro-extend configure.py For now just for additional cflags, ldflags & cmake arguments. Signed-off-by: Michael Livshin <michael.livshin@scylladb.com> Message-Id: <20210413110508.5901-3-michael.livshin@scylladb.com>	2021-04-14 13:15:59 +02:00
Michael Livshin	5cb4005e84	build: extend configure.py's subprocess environment properly The `env` parameter to `subprocess.Popen()` and friends, when it is not `None`, is not an addition to the subprocess environment but the _whole_ subprocess environment. Signed-off-by: Michael Livshin <michael.livshin@scylladb.com> Message-Id: <20210413110508.5901-2-michael.livshin@scylladb.com>	2021-04-14 13:15:59 +02:00
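The behaviour described in the commit above can be shown with a short Python sketch. It is not the code from configure.py; the variable names `PARENT_MARKER` and `EXTRA_FLAG` and the helper `child_sees` are made up for the illustration. Passing a small dict as `env` replaces the child's environment, while merging with `os.environ` extends it.

```python
import os
import subprocess
import sys

def child_sees(env):
    """Run a child Python process and report the two variables it sees."""
    code = ("import os; "
            "print(os.environ.get('PARENT_MARKER'), os.environ.get('EXTRA_FLAG'))")
    out = subprocess.run([sys.executable, "-c", code], env=env,
                         capture_output=True, text=True, check=True)
    return out.stdout.split()

# A variable that exists only in the parent's environment.
os.environ["PARENT_MARKER"] = "yes"

# env given as a bare dict: the child gets ONLY these variables.
replaced = child_sees({"EXTRA_FLAG": "1"})

# env built on top of os.environ: the child inherits everything plus the extra.
extended = child_sees({**os.environ, "EXTRA_FLAG": "1"})

print("replaced:", replaced)
print("extended:", extended)
```

In the first call the child does not see `PARENT_MARKER` (nor `PATH` or anything else from the parent), which is the pitfall the commit fixes.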
Pekka Enberg	19507bb7ea	transport, redis: Use generic connection::shutdown() This patch moves the duplicated connection::shutdown() method to a new generic_server::connection base class that is now inherited by cql_server and redis_server.	2021-04-13 13:56:44 +03:00
Avi Kivity	e3db889057	Merge 'Introduce service levels' from Piotr Sarna This series introduces service level syntax borrowed from https://docs.scylladb.com/using-scylla/workload-prioritization/ , but without workload prioritization itself - just for the sake of using identical syntax to provide different parameters later. The new parameters may include: * per-service-level timeouts * oltp/olap declaration, which may change the way Scylla treats long requests - e.g. time them out (the oltp way) or keep them sustained with empty pages (the olap way) Refs #7617 Closes #7867 * github.com:scylladb/scylla: transport: initialize query state with service level controller main: add initializing service level data accessor service: make enable_shared_from_this inheritance public cql3: add SERVICE LEVEL syntax (without an underscore) unit test: Add unit test for per user sla syntax cql: Add support for service level cql queries auth: Add service_level resource for supporting in authorization of cql service_level cql: Support accessing service_level_controller from query state instantiate and initialize the service_level_controller qos: Add a standard implementation for service level data accessor qos: add waiting for the updater future service/qos: adding service level controller service_levels: Add documentation for distributed tables service/qos: adding service level table to the distributed keyspace service/qos: add common definitions auth: add support for role attributes	2021-04-12 17:34:43 +03:00
Eliran Sinvani	2701481cbc	cql: Add support for service level cql queries This patch adds support for new service level cql queries. The queries implemented are: CREATE SERVICE_LEVEL [IF NOT EXISTS] <service_level_name> ALTER SERVICE_LEVEL <service_level_name> WITH param = <something> DROP SERVICE_LEVEL [IF EXISTS] <service_level_name> ATTACH SERVICE_LEVEL <service_level_name> TO <role_name> DETACH SERVICE_LEVEL FROM <role_name> LIST SERVICE_LEVEL <service_level_name> LIST ALL SERVICE_LEVELS LIST ATTACHED SERVICE_LEVEL OF <role_name> LIST ALL ATTACHED SERVICE_LEVELS	2021-04-12 16:30:01 +02:00
Eliran Sinvani	8493e19840	qos: Add a standard implementation for service level data accessor service_level_controller defines an interface for accessing the service level distributed data, this patch implements a standard implementation of the interface that delegates to the system distributed keyspace. Message-Id: <25e68302f6f4d4fe5fcb66ea19159ad68506ba64.1609175314.git.sarna@scylladb.com>	2021-04-12 16:01:04 +02:00
Eliran Sinvani	a54ea4667b	service/qos: adding service level controller Adding the service level controller implementation. The implementation follows the design in: https://docs.google.com/document/d/1RrSTZ3ZX86-YDt2POwAVwFeKN9uX8frEvATJda5n1FU/edit?usp=sharing Some interfaces were added for registration with system components. The method of registration is chosen over a constructor parameter, due to the components being initialized prior to the service level controller being created. Message-Id: <e9c4e7d5b411062b6a553f5c6861e7875cd71d2c.1609171761.git.sarna@scylladb.com>	2021-04-12 16:01:04 +02:00
Eliran Sinvani	4fea0762c2	service/qos: add common definitions Adding common definitions that will be used by the performance isolation classes. Mainly defines the common ground for configuring a service level through the service level options structure. Message-Id: <12476f4a8e21af3a4c7a892683940698f3beacce.1609160860.git.sarna@scylladb.com>	2021-04-12 15:58:09 +02:00
Avi Kivity	bad4924868	Merge 'Add a ninja help build target' from Pekka Enberg This pull request adds a "ninja help" build target in hopes of making the different build targets more discoverable to developers. Closes #8454 * github.com:scylladb/scylla: building.md: Document "ninja help" target configure.py: "ninja help" target building.md: Document "ninja <mode>-dist" target configure.py: Add <mode>-dist target as alias for dist-<mode>	2021-04-12 16:30:37 +03:00
Pekka Enberg	698710598a	configure.py: "ninja help" target This adds a "help" build target, which prints out important build targets. The printing is done in a separate shell script, because "ninja" insists on printing out the "command" before executing it, which makes the help text unreadable.	2021-04-12 10:35:02 +03:00
Pekka Enberg	e959c90af8	configure.py: Add <mode>-dist target as alias for dist-<mode> The build and test build targets put "mode" as prefix, so let's unify the dist target too in preparation for "ninja help".	2021-04-12 10:29:54 +03:00
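One way such an alias could be emitted is with ninja's built-in `phony` rule. The sketch below is an assumption about the shape of the generated rules, not the actual configure.py code; the helper `dist_alias_rules` and the list of mode names are hypothetical.

```python
def dist_alias_rules(modes):
    """Return ninja build statements aliasing <mode>-dist to dist-<mode>."""
    # "phony" is ninja's built-in rule for declaring an alias target.
    return "\n".join(f"build {mode}-dist: phony dist-{mode}" for mode in modes)

# Mode names are assumed for illustration.
rules = dist_alias_rules(["debug", "release", "dev"])
print(rules)
```

With these statements, `ninja release-dist` and `ninja dist-release` would build the same thing, matching the `<mode>-` prefix used by the build and test targets.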
Michael Livshin	09f221203f	build: tolerate ./build being a symbolic link Signed-off-by: Michael Livshin <michael.livshin@scylladb.com> Message-Id: <20210411122951.14196-1-michael.livshin@scylladb.com>	2021-04-12 10:08:56 +03:00
Avi Kivity	3a90df39c5	perf: deinline some functions in perf.hh Those functions were defined in a header, but not marked inline. This made including the header from two source files impossible, as the linker would complain about duplicate symbols. Rather than making them inline, put them in a new source file perf.cc as they don't need to be inline.	2021-04-07 17:51:58 +03:00
Avi Kivity	b2f0a9d05c	caching_options.hh: move code to .cc caching_options is by no means performance sensitive, but it is included in many places (via schema.hh), and in turn it pulls in other includes. Reduce include load by deinlining it. Ref #1. Closes #8408	2021-04-05 13:05:43 +03:00
Michał Chojnowski	4715268e30	utils: managed_bytes: add operator<< and to_hex for managed_bytes We will need them to replace bytes with managed_bytes in some places in an upcoming patch. The change to configure.py is necessary because operator<< links to to_hex in bytes.cc.	2021-04-01 10:39:42 +02:00
Michał Chojnowski	b6740a01ac	configure: remove unused link dependencies from UUID_test	2021-04-01 10:39:42 +02:00
Pavel Solodovnikov	7c229998e8	raft: unit-tests for `raft_address_map` Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2021-03-26 20:22:44 +03:00
Piotr Sarna	06131e21a3	configure.py: add customizing clang inline threshold Until clang figures things out with the now infamous `-llvm -inline-threshold X` parameter, let's allow customizing it to make the compilation of release builds less tiresome. For instance, scylla's row_level.o object file currently does not compile for me until I decrease the inline threshold to a low value (e.g. 50). Message-Id: <54113db9438e3c3371410996f49b7fbe9a1b7257.1616422536.git.sarna@scylladb.com>	2021-03-24 12:09:26 +02:00
Takuya ASADA	35a14ab22b	configure.py: drop compat-python3 targets Since we switched scylla-python3 build directory to tools/python3/build on Jenkins, we no longer need compat-python3 targets, drop them. Related scylladb/scylla-pkg#1554 Closes #8328	2021-03-21 18:04:27 +02:00
Avi Kivity	a78f43b071	Merge 'tracing: fast slow query tracing' from Ivan Prisyazhnyy The set of patches introduces a new tracing mode - `fast slow query tracing`. In this mode, Scylla tracks only tracing sessions and omits all tracing events if the tracing context does not have a `full_tracing` state set. Fixes #2572 Motivation --- We want to run production systems with that option always enabled so we could always catch slow queries without an overhead. The next step is we are gonna optimize further the costs of having tracing enabled to minimize session context handling overhead to allow it to be as transparent for the end-user as possible. Fast tracing mode --- To read the status do $ curl -v http://localhost:10000/storage_service/slow_query To enable fast slow-query tracing $ curl -v --request POST http://localhost:10000/storage_service/slow_query\?fast=true\&enable=true Potential optimizations --- - remove tracing::begin(lazy_eval) - replace tracing::begin(string) for enum to remove copying and memory allocations - merge parameters allocations - group parameters check for trace context - delay formatting - reuse prepared statement shared_ptr instead of both copying it and copying its query Performance --- 100% cache hits --- 1 Core: ``` $ SCYLLA_HOME=/home/sitano.public/Projects/scylla build/release/scylla --smp 1 --cpuset 7 --log-to-syslog 0 --log-to-stdout 1 --default-log-level info --network-stack posix --workdir /home/sitano.public/Projects/scylla --developer-mode 1 --listen-address 0.0.0.0 --api-address 0.0.0.0 --rpc-address 0.0.0.0 --broadcast-rpc-address 172.18.0.1 --broadcast-address 127.0.0.1 ./cassandra-stress write n=100000 no-warmup -pop seq=1..100000 -node 127.0.0.1 -log level=verbose -rate threads=1 -mode native cql3 curl --request POST http://localhost:10000/storage_service/slow_query\?fast\=false\&enable\=false for i in $(seq 5); do taskset -c 2,3,4,5 ./cassandra-stress read duration=5m -pop seq=1..100000 -node 127.0.0.1 -log level=verbose 
-rate threads=4 throttle=30000/s -mode native cql3 done curl --request POST http://localhost:10000/storage_service/slow_query\?fast\=true\&enable\=true for i in $(seq 5); do taskset -c 2,3,4,5 ./cassandra-stress read duration=5m -pop seq=1..100000 -node 127.0.0.1 -log level=verbose -rate threads=4 throttle=30000/s -mode native cql3 done curl --request POST http://localhost:10000/storage_service/slow_query\?fast\=false\&enable\=true for i in $(seq 5); do taskset -c 2,3,4,5 ./cassandra-stress read duration=5m -pop seq=1..100000 -node 127.0.0.1 -log level=verbose -rate threads=4 throttle=30000/s -mode native cql3 done ``` \| qps \| \| \| -- \| -- \| -- \| -- \| -- \| baseline \| fast, slow \| nofast, slow \| %[1-fastslow/baseline] \| 29,018 \| 26,468 \| 23,591 \| 8.79% \| 28,909 \| 26,274 \| 23,584 \| 9.11% \| 28,900 \| 26,547 \| 23,598 \| 8.14% \| 28,921 \| 26,669 \| 23,596 \| 7.79% \| 28,821 \| 26,385 \| 23,601 \| 8.45% stdev \| 70.24030182 \| 150.9678774 \| 6.670832032 \| avg \| 28,914 \| 26,469 \| 23,594 \| stderr \| 0.24% \| 0.57% \| 0.03% \| %[avg/baseline] \| \| 8.46% \| 18.40% \| 8.46% performance degradation in `fast slow query mode` for pure in-memory workload with minimum traces. 18.40% performance degradation in `original slow query mode` for pure in-memory workload with minimum traces. 
0% cache hits --- 1GB memory, 1 Core: $ SCYLLA_HOME=/home/sitano.public/Projects/scylla build/release/scylla --memory 1G --smp 1 --cpuset 7 --log-to-syslog 0 --log-to-stdout 1 --default-log-level info --network-stack posix --workdir /home/sitano.public/Projects/scylla --developer-mode 1 --listen-address 0.0.0.0 --api-address 0.0.0.0 --rpc-address 0.0.0.0 --broadcast-rpc-address 172.18.0.1 --broadcast-address 127.0.0.1 2.4GB, 10000000 keys data: $ ./cassandra-stress write n=10000000 no-warmup -pop seq=1..10000000 -node 127.0.0.1 -log level=verbose -rate threads=4 -mode native cql3 $ curl --request POST http://localhost:10000/storage_service/slow_query\?fast\=true\&enable\=true CASSANDRA_STRESS prepared statements with BYPASS CACHE $ taskset -c 2,3,4,5 ./cassandra-stress read duration=5m -pop seq=1..10000000 -node 127.0.0.1 -log level=verbose -rate threads=4 throttle=30000/s -mode native cql3 20000 reads IOPS, 100MB/s from disk \| qps \| \| \| -- \| -- \| -- \| -- \| -- \| baseline reads \| fast, slow reads \| %[1-fastslow/baseline] \| \| 9,575 \| 9,054 \| 5.44% \| \| 9,614 \| 9,065 \| 5.71% \| \| 9,610 \| 9,066 \| 5.66% \| \| 9,611 \| 9,062 \| 5.71% \| \| 9,614 \| 9,073 \| 5.63% \| stdev \| 16.75410397 \| 6.892024376 \| avg \| 9,605 \| 9,064 \| stderr \| 0.17% \| 0.08% \| %[avg/baseline] \| \| 5.63% \| 5.63% performance degradation in `fast slow query mode` for pure on-disk workload with minimum traces. Closes #8314 * github.com:scylladb/scylla: tracing: fast mode unit test tracing: rest api for lightweight slow query tracing tracing: omit tracing session events and subsessions in fast mode	2021-03-21 12:15:17 +02:00
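The degradation percentages quoted in the merge message above follow from the averaged qps figures as `1 - measured/baseline`. A minimal Python sketch of that arithmetic, using only the averages reported above (the function name `degradation_pct` is hypothetical):

```python
def degradation_pct(baseline_qps, measured_qps):
    """Relative throughput loss versus the baseline, in percent."""
    return (1 - measured_qps / baseline_qps) * 100

# Average qps values copied from the two result tables above.
in_memory = {"baseline": 28914, "fast": 26469, "nofast": 23594}
on_disk = {"baseline": 9605, "fast": 9064}

fast_in_memory = degradation_pct(in_memory["baseline"], in_memory["fast"])
nofast_in_memory = degradation_pct(in_memory["baseline"], in_memory["nofast"])
fast_on_disk = degradation_pct(on_disk["baseline"], on_disk["fast"])

print(f"in-memory, fast slow-query tracing:     {fast_in_memory:.2f}%")
print(f"in-memory, original slow-query tracing: {nofast_in_memory:.2f}%")
print(f"on-disk, fast slow-query tracing:       {fast_on_disk:.2f}%")
```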
Ivan Prisyazhnyy	f00391af8b	tracing: fast mode unit test Signed-off-by: Ivan Prisyazhnyy <ivan@scylladb.com>	2021-03-18 15:05:09 +02:00
Avi Kivity	f038d1555c	Merge 'Add more context to configure.py' from Piotr Sarna This series makes configure.py output slightly more helpful in case of incorrect parameters passed to the compiler/linker. Closes #8267 * github.com:scylladb/scylla: configure: print more context if the linking attempt failed configure: provide more context on failed ./configure.py run configure: add verbose option to try_compile_and_link	2021-03-18 11:24:18 +01:00
Benny Halevy	7862cad669	sstable_set: partitioned_sstable_set: clone: do clone all sstables The existing implementation wrongfully shares _all sstables rather than cloning it. This caused a use-after-free in `repair_meta::do_estimate_partitions_on_local_shard` when traversing a shared sstable_set, during which `table::make_reader_excluding_sstables` erased an entry. The erase should have happened on a cloned copy of the sstable_list, not on a shared copy. The regression was introduced in `c3b8757fa1`. Added a unit test that reproduces the share-on-copy issue for partitioned_sstable_set (sstables::sstable_set). Fixes #8274 Test: unit(release, debug) DTest: materialized_views_test.py:TestMaterializedViews.simple_repair_test(debug) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Reviewed-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20210317145552.701559-1-bhalevy@scylladb.com>	2021-03-18 11:15:59 +02:00
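The share-on-copy problem described above can be illustrated with a small Python analogue. This is not ScyllaDB code: the class `SSTableSet` and its methods are hypothetical, and Python cannot reproduce the use-after-free itself, only the underlying aliasing bug where an erase on the "clone" also changes the original.

```python
class SSTableSet:
    """Toy stand-in for a set that owns a collection of sstables."""

    def __init__(self, sstables=()):
        self._all = set(sstables)

    def clone_sharing(self):
        # Buggy variant: the new object aliases the same underlying set.
        other = SSTableSet()
        other._all = self._all
        return other

    def clone(self):
        # Fixed variant: the new object gets its own copy of the set.
        return SSTableSet(self._all)

    def erase(self, sstable):
        self._all.discard(sstable)

    def __len__(self):
        return len(self._all)

original = SSTableSet({"sst-1", "sst-2", "sst-3"})

shared = original.clone_sharing()
shared.erase("sst-1")          # also removes the entry from `original`
len_after_shared_erase = len(original)

cloned = original.clone()
cloned.erase("sst-2")          # `original` is unaffected this time
len_after_cloned_erase = len(original)

print(len_after_shared_erase, len_after_cloned_erase, len(cloned))
```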
Piotr Sarna	2201c9b146	configure: print more context if the linking attempt failed Previously, when a linking attempt failed, configure.py immediately printed that neither lld nor gold was found, which might be misleading if the linkers are installed, but the compilation failed anyway. The printed information is now more specific, and combined with the previous commit, it will also provide more information why the compilation attempt failed.	2021-03-16 07:39:05 +01:00
Piotr Sarna	f86b879933	configure: provide more context on failed ./configure.py run If the configuration step failed, it used to only inform that it must be due to the wrong GCC version, which can be misleading. For instance, trying to compile on clang with incorrect flags also resulted in a "wrong GCC version" message. Now, the message is more generic, but it also prints the stderr output from the failed compilation, which may help pinpoint the problem: $ ./configure.py --mode release --cflags='-fhello -fcolor-diagnostics -mllvm -opt-bisect-limit=10000' --compiler=clang++ --c-compiler=clang Note: neither lld nor gold found; using default system linker Compilation failed: clang++ -x c++ -o build/tmp/tmp1177gojf /home/sarna/repo/scylla/build/tmp/tmp_u3voys6 -fhello -fcolor-diagnostics -mllvm -opt-bisect-limit=10000 [] // clang pretends to be gcc (defined __GNUC__), so we // must check it first #ifdef __clang__ #if __clang_major__ < 10 #error "MAJOR" #endif #elif defined(__GNUC__) #if __GNUC__ < 10 #error "MAJOR" #elif __GNUC__ == 10 #if __GNUC_MINOR__ < 1 #error "MINOR" #elif __GNUC_MINOR__ == 1 #if __GNUC_PATCHLEVEL__ < 1 #error "PATCHLEVEL" #endif #endif #endif #else #error "Unrecognized compiler" #endif int main() { return 0; } clang-11: error: unknown argument: '-fhello' distcc[4085341] ERROR: compile (null) on localhost failed Wrong compiler version or incorrect flags. Scylla needs GCC >= 10.1.1 with coroutines (-fcoroutines) or clang >= 10.0.0 to compile.	2021-03-16 07:39:03 +01:00
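The version requirement stated at the end of the message above (GCC >= 10.1.1 or clang >= 10.0.0) amounts to a tuple comparison. The sketch below restates that rule in Python for illustration; it is not how configure.py performs the check (the commit shows it compiling a preprocessor probe), and the names `MIN_VERSION` and `compiler_supported` are hypothetical.

```python
# Minimum versions taken from the error message quoted above.
MIN_VERSION = {"gcc": (10, 1, 1), "clang": (10, 0, 0)}

def compiler_supported(name, version):
    """Return True if (major, minor, patch) meets the minimum for `name`."""
    minimum = MIN_VERSION.get(name)
    if minimum is None:
        # Corresponds to the probe's "Unrecognized compiler" branch.
        return False
    # Tuples compare lexicographically: major first, then minor, then patch.
    return tuple(version) >= minimum

for name, version in [("gcc", (10, 1, 0)), ("gcc", (10, 1, 1)),
                      ("clang", (11, 0, 0)), ("clang", (9, 0, 1))]:
    status = "ok" if compiler_supported(name, version) else "too old"
    print(name, ".".join(map(str, version)), status)
```

Note that the probe itself must test `__clang__` before `__GNUC__`, because clang also defines `__GNUC__`; this sketch sidesteps that by taking the compiler name as an argument.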
Piotr Sarna	6389246d6e	configure: add verbose option to try_compile_and_link Which will be useful later for providing more context why a ./configure.py run failed.	2021-03-16 07:35:16 +01:00
Alejo Sanchez	6139ad6337	raft: tests: move boost tests to tests/raft Move raft boost tests to test/raft directory. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2021-03-15 06:16:58 -04:00
Avi Kivity	486f6bf29c	Merge "sstables: move format specific reader code to kl/, mx/" from Botond " Currently the sstable reader code is scattered across several source files as follows (paths are relative to sstables/): * partition.cc - generic reader code; * row.hh - format specific code related to building mutation fragments from cells; * mp_row_consumer.hh - format specific code related to parsing the raw byte stream; This is a strange organization scheme given that the generic sstable reader is a template and as such it doesn't itself depend on the other headers where the consumer and context implementations live. Yet these are all included in partition.cc just so the reader factory function can instantiate the sstable reader template with the format specific objects. This patchset reorganizes this code such that the generic sstable reader is exposed in a header. Furthermore, format specific code is moved to the kl/ and mx/ directories respectively. Each directory has a reader.hh with a single factory function which creates the reader, all the format specific code is hidden from sight. The added benefit is that now reader code specific to a format is centralized in the format specific folder, just like the writer code. This patchset only moves code around, no logical changes are made. Tests: unit(dev) " * 'sstable-reader-separation/v1' of https://github.com/denesb/scylla: sstables: get rid of mp_row_consumer.{hh,cc} sstables: get rid of row.hh sstables/mp_row_consumer.hh: remove unused struct new_mutation sstables: move mx specific context and consumer to mx/reader.cc sstables: move kl specific context and consumer to kl/reader.cc sstables: mv partition.cc sstable_mutation_reader.hh	2021-03-11 16:57:54 +02:00
Botond Dénes	361ba473c7	sstables: get rid of mp_row_consumer.{hh,cc} Move stuff contained therein to `sstable_mutation_reader.{hh,cc}` which will serve as the collection point of utility stuff needed by all reader implementations.	2021-03-11 12:17:13 +02:00
Botond Dénes	cecc7f8064	sstables: move mx specific context and consumer to mx/reader.cc Move all the mx format specific context and consumer code to mx/reader.cc and add a factory function `mx::make_reader()` which takes over the job of instantiating the `sstable_mutation_reader` with the mx specific context and consumer.	2021-03-11 12:17:13 +02:00
Botond Dénes	4e3ae9d913	sstables: move kl specific context and consumer to kl/reader.cc Move all the kl format specific context and consumer code to kl/reader* and add a factory function `kl::make_reader()` which takes over the job of instantiating the `sstable_mutation_reader` with the kl specific context and consumer. Code which is used by tests is moved to kl/reader_impl.hh, while code that can be hidden is moved to kl/reader.cc. Users who just want to create a reader only have to include kl/reader.hh.	2021-03-11 12:17:13 +02:00
Botond Dénes	0ec040921d	sstables: mv partition.cc sstable_mutation_reader.hh The sstable reader currently knows the definition of all the different consumers and contexts. But it doesn't really need to, as it is a template. Exploit this and prepare for an organization scheme where the consumers and contexts live hidden in a cc file which includes and instantiates the sstable reader template. As a first step expose `sstable_mutation_reader` in a header.	2021-03-11 12:17:13 +02:00
Dejan Mircevski	2525759027	test: Add unit tests for get_clustering_bounds ... as guardrails for the upcoming rewrite. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2021-03-10 21:17:26 -05:00
Tomasz Grabiec	3cb01f218f	Merge "raft: add unit tests for log, tracker, votes and fix found bugs" from Kostja Test log consistency after apply_snapshot() is called. Ensure log::last_term() log::last_conf_index() and log::size() work as expected. Misc cleanups. * scylla-dev.git/raft-confchange-test-v4: raft: fix spelling raft: add a unit test for voting raft: do not account for the same vote twice raft: remove fsm::set_configuration() raft: consistently use configuration from the log raft: add ostream serialization for enum vote_result raft: advance commit index right after leaving joint configuration raft: add tracker test raft: tidy up follower_progress API raft: update raft::log::apply_snapshot() assert raft: add a unit test for raft::log raft: rename log::non_snapshoted_length() to log::in_memory_size() raft: inline raft::log::truncate_tail() raft: ignore AppendEntries RPC with a very old term raft: remove log::start_idx() raft: return a correct last term on an empty log raft: do not use raft::log::start_idx() outside raft::log() raft: rename progress.hh to tracker.hh raft: extend single_node_is_quiet test	2021-03-03 16:29:40 +01:00
Tomasz Grabiec	0dc57db248	Revert "Merge "raft: add unit tests for log, tracker, votes and fix found bugs" from Kostja" This reverts commit `f94f70cda8`, reversing changes made to `5206a97915`. The version of the series that was merged was not the latest one. Revert prior to merging the latest one.	2021-03-03 16:29:02 +01:00
Avi Kivity	5f4bf18387	Revert "Merge 'sstables: add versioning to the sstable_set ' from Wojciech Mitros" This reverts commit `31909515b3`, reversing changes made to `ef97adc72a`. It shows many serious regressions in dtest. Fixes #8197.	2021-03-02 13:21:22 +02:00
Avi Kivity	31909515b3	Merge 'sstables: add versioning to the sstable_set ' from Wojciech Mitros Currently, the sstable_set in a table is copied before every change to allow accessing the unchanged version by existing sstable readers. This patch changes the sstable_set to a structure that keeps all its versions that are referenced somewhere and provides a way of getting a reference to an immutable version of the set. Each sstable in the set is associated with the versions it is alive in, and is removed when all such versions don't have references anymore. To avoid copying, the object holding all sstables in the set version is changed to a new structure, sstable_list, which was previously an alias for std::unordered_set<shared_sstable>, and which implements most of the methods of an unordered_set, but its iterator uses the actual set with all sstables from all referenced versions and iterates over those sstables that belong to the captured version. The methods that modify the set's contents give a strong exception guarantee by trying to insert new sstables into its containers, and erasing them in the case of a caught exception. To release shared_sstables as soon as possible (i.e. when all references to versions that contain them die), each time a version is removed, all sstables that were referenced exclusively by this version are erased. We are able to find these sstables efficiently by storing, for each version, all sstables that were added and erased in it, and, when a version is removed, merging it with the next one. When a version that adds an sstable gets merged with a version that removes it, this sstable is erased. 
Fixes #2622 Signed-off-by: Wojciech Mitros wojciech.mitros@scylladb.com Closes #8111 * github.com:scylladb/scylla: sstables: add test for checking the latency of updating the sstable_set in a table sstables: move column_family_test class from test/boost to test/lib sstables: use fast copying of the sstable_set instead of rebuilding it sstables: replace the sstable_set with a versioned structure sstables: remove potential ub sstables: make sstable_set constructor less error-prone	2021-03-01 14:16:36 +02:00
Avi Kivity	d980f550d1	Merge 'row_cache: Make fill_buffer() preemptable when cursor leads with dummy rows' from Tomasz Grabiec fill_buffer() will keep scanning until _lower_bound_changed is true, even if preemption is signaled, so that the reader makes forward progress. Before the patch, we did not update _lower_bound on touching a dummy entry. The read will not respect preemption until we hit a non-dummy row. If there are a lot of dummy rows, that can cause reactor stalls. Fix that by updating _lower_bound on dummy entries as well. Refs #8153. Tested with perf_row_cache_reads: ``` $ build/release/test/perf/perf_row_cache_reads -c1 -m200M Rows in cache: 0 Populating with dummy rows Rows in cache: 373929 Scanning read: 183.658966 [ms], preemption: {count: 848, 99%: 0.545791 [ms], max: 0.519343 [ms]}, cache: 99/100 [MB] read: 120.951515 [ms], preemption: {count: 257, 99%: 0.545791 [ms], max: 0.518795 [ms]}, cache: 99/100 [MB] ``` Notice that max preemption latency is low in the second "read:" line. Closes #8167 * github.com:scylladb/scylla: row_cache: Make fill_buffer() preemptable when cursor leads with dummy rows tests: perf: Introduce perf_row_cache_reads row_cache: Add metric for dummy row hits	2021-02-28 21:00:20 +02:00
Tomasz Grabiec	52e411df36	tests: perf: Introduce perf_row_cache_reads Tests performance of various read patterns from the row cache. Example: $ build/release/test/perf/perf_row_cache_reads_g -c1 -m200M Filling memtable Rows in cache: 0 Populating with dummy rows Rows in cache: 373929 Scanning read: 156.288986 [ms], preemption: {count: 702, 99%: 0.545791 [ms], max: 0.537537 [ms]}, cache: 99/100 [MB] read: 106.480766 [ms], preemption: {count: 6, 99%: 0.006866 [ms], max: 106.496168 [ms]}, cache: 99/100 [MB]	2021-02-26 01:20:38 +01:00

1382 Commits