scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-30 11:36:54 +00:00

Author	SHA1	Message	Date
Calle Wilund	d614143f5e	Commitlog/database: Fixup series "Commit log flush request on disk overflow" Also at seastar-dev: calle/commitlog_flush_v3 (And, yes, this time I _did_ update the remote!) Refs #262 Commit of original series was done on stale version (v2) due to author's inability to multitask and update git repos. v3: * Removed future<> return value from callbacks. I.e. flush callback is now only fully synchronous over actual call	2015-09-07 21:29:19 +03:00
Calle Wilund	fdb921afb2	Commitlog: Add flushing of segment CF:s on disk overflow * Do not throw away commitlog segments on disk size overflow. Issue a flush request (i.e. calculate RP we want to free unto, and for all dirty CF:s, do a request). "Abstracted" as registerable callback. I.e. DB:s responsibility to actually do something with it.	2015-09-07 13:21:43 +02:00
Calle Wilund	841dd32a8a	Commitlog: divide max on-disk-size by num cpus To try to keep the resulting limit as configured	2015-09-07 13:13:46 +02:00
Calle Wilund	d95101664d	Commitlog: Don't throw exceptions on unrecognized files in CL dir	2015-09-01 14:23:03 +02:00
Calle Wilund	1814f89730	Commitlog: Add some more metrics + accessors for json API Fixes #99 Adding missing commitlog metrics to the rest API. v2: Mis-send (clumsy fingers) v3: Use map_reduce0 + subroutine for nicer code v4: rebased on current master v5: rebased yet again. Since the _second_ file in this previous patch set was committed, and is dependent on this very change below to even compile, some expediency might be warranted.	2015-09-01 10:15:33 +03:00
Calle Wilund	9ba84e458a	Commitlog: Handle partial writes in segment::cycle * Fixes #247 * Re-introduce test_allocation_failure, but allow for the "failure" to not happen. I.e. if run with low memory settings, the test will check that allocation failure is graceful. With lots of memory it will check partial write.	2015-08-31 20:02:05 +03:00
Calle Wilund	d3a01072af	CommitLogReplayer: Java -> C++ Initial implementation	2015-08-31 14:29:50 +02:00
Calle Wilund	bbf82e80d0	Commitlog: Allow skipping X bytes in commit log reader Also refactor reader into named methods for debugging sanity.	2015-08-31 14:29:49 +02:00
Calle Wilund	da9ea641e5	Commitlog: Handle full paths in descriptor file name parse.	2015-08-31 14:29:48 +02:00
Calle Wilund	02d2bef1f2	Commitlog: Expose convenience method "list_existing_segments"	2015-08-31 14:29:48 +02:00
Calle Wilund	19052b3c09	Commitlog: Expose list_existing_descriptors	2015-08-31 14:29:48 +02:00
Calle Wilund	e068ffb5a5	Commitlog: Make file reader provide replay_position for entries	2015-08-31 14:29:47 +02:00
Calle Wilund	41b1ad8600	Commitlog: Make descriptor type visible/usable from outside	2015-08-31 14:29:47 +02:00
Calle Wilund	ea38b223bd	Commitlog: change the ID generation scheme * Make it more like origin, i.e. based on wall clock time of app start * Encode shard ID in the RP segment ID, to ensure RP:s and segment names are unique per shard	2015-08-31 14:29:46 +02:00
Calle Wilund	0fcf7e3e91	Commitlog: Make "position" type 32-bit to align replay_position with Origin * Note: removed commitlog_test:test_allocation_failure because with segments limited to 4GB -> mutation limited to 2GB, actually forcing a fail is not guaranteed or even likely.	2015-08-31 14:29:44 +02:00
Calle Wilund	3f1a91b89c	Commitlog: do not eagerly create first segment on init Deferring makes it easier to separate old segments from new, which in turn helps replay logic.	2015-08-31 13:11:44 +02:00
Avi Kivity	5f62f7a288	Revert "Merge "Commit log replay" from Calle" Due to test breakage. This reverts commit `43a4491043`, reversing changes made to `5dcf1ab71a`.	2015-08-27 12:39:08 +03:00
Avi Kivity	43a4491043	Merge "Commit log replay" from Calle "Initial implementation/transposition of commit log replay. * Changes replay position to be shard aware * Commit log segment ID:s now follow basically the same scheme as origin; max(previous ID, wall clock time in ms) + shard info (for us) * SStables now use the DB definition of replay_position. * Stores and propagates (compaction) flush replay positions in sstables * If CL segments are left over from a previous run, they, and existing sstables are inspected for high water mark, and then replayed from those marks to amend mutations potentially lost in a crash * Note that CPU count change is "handled" in so much that shard matching is per _previous_ run's shards, not current. Known limitations: * Mutations deserialized from old CL segments are _not_ fully validated against existing schemas. * System::truncated_at (not currently used) does not handle sharding afaik, so watermark ID:s coming from there are dubious. * Mutations that fail to apply (invalid, broken) are not placed in blob files like origin. Partly because I am lazy, but also partly because our serial format differs, and we currently have no tools to do anything useful with it * No replay filtering (Origin allows a system property to designate a filter file, detailing which keyspace/cf:s to replay). Partly because we have no system properties. There is no unit test for the commit log replayer (yet). Because I could not really come up with a good one given the test infrastructure that exists (tricky to kill stuff just "right"). The functionality is verified by manual testing, i.e. running scylla, building up data (cassandra-stress), kill -9 + restart. This of course does not really fully validate whether the resulting DB is 100% valid compared to the one at k-9, but at least it verified that replay took place, and mutations were applied. (Note that origin also lacks validity testing)"	2015-08-27 10:53:36 +03:00
Calle Wilund	2a1c7d2587	CommitLogReplayer: Java -> C++ Initial implementation	2015-08-25 09:41:56 +02:00
Calle Wilund	86a97fea4c	Commitlog: Allow skipping X bytes in commit log reader Also refactor reader into named methods for debugging sanity.	2015-08-25 09:41:55 +02:00
Calle Wilund	37cfc09e91	Commitlog: Handle full paths in descriptor file name parse.	2015-08-25 09:41:55 +02:00
Calle Wilund	4364d72ca3	Commitlog: Expose convenience method "list_existing_segments"	2015-08-25 09:41:54 +02:00
Calle Wilund	a3a02968ab	Commitlog: Expose list_existing_descriptors	2015-08-25 09:41:54 +02:00
Calle Wilund	fcb87471b9	Commitlog: Make file reader provide replay_position for entries	2015-08-25 09:40:53 +02:00
Calle Wilund	db6370ad87	Commitlog: Make descriptor type visible/usable from outside	2015-08-25 09:40:53 +02:00
Calle Wilund	4f24b9795e	Commitlog: change the ID generation scheme * Make it more like origin, i.e. based on wall clock time of app start * Encode shard ID in the RP segment ID, to ensure RP:s and segment names are unique per shard	2015-08-25 09:40:52 +02:00
Pekka Enberg	544c7936d8	db/commitlog: Kill Java code Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-08-24 11:51:49 +03:00
Calle Wilund	ac74dd6159	Commitlog: Make "position" type 32-bit to align replay_position with Origin	2015-08-24 10:05:44 +02:00
Calle Wilund	d50986ef31	Commitlog: do not eagerly create first segment on init Deferring makes it easier to separate old segments from new, which in turn helps replay logic.	2015-08-24 10:05:44 +02:00
Calle Wilund	8f0f4e7945	Commitlog: do more extensive dir entry probes to determine type Since directory_entry "type" might not be set. Ensuring that code does not remain future free or easy to read. Fixes #157.	2015-08-17 16:56:31 +03:00
Calle Wilund	2db7791c6a	Commitlog: Attempt to reduce allocation size for segment if alloc fails	2015-08-12 16:20:12 +02:00
Calle Wilund	4fe98d3acf	Commitlog: Throw bad_alloc on memalign fail (avoid sigsegv later)	2015-08-12 16:20:11 +02:00
Calle Wilund	7191a130bb	Commitlog: recycle buffers to reduce fragmentation.	2015-08-12 16:20:11 +02:00
Calle Wilund	6ac6d644be	Commitlog: add logging Note: pretty lame logging, but modeled after origin. Signed-off-by: Calle Wilund <calle@cloudius-systems.com>	2015-08-10 18:42:41 +03:00
Avi Kivity	a8ff8ea442	commitlog: switch to faster crc32 implementation	2015-08-09 00:05:36 +03:00
Avi Kivity	dce642f472	commitlog: fix use-after-free of file during close Reviewed-by: Nadav Har'El <nyh@cloudius-systems.com>	2015-07-29 20:36:03 +03:00
Tomasz Grabiec	e416e800c8	commitlog: Fix use-after-move "f" was passed to make_file_input_stream() after it was moved-from.	2015-07-22 19:21:57 +03:00
Nadav Har'El	4edf7fe206	clean up uses of lw_shared_ptr<file> recently, "file" started to use a shared_ptr internally, and is already copy-able and reference counted, and there is no reason to use lw_shared_ptr<file>. This patch cleans up a few remaining places where lw_shared_ptr<file> was used. Signed-off-by: Nadav Har'El <nyh@cloudius-systems.com>	2015-07-22 11:51:40 +03:00
Avi Kivity	4a95f1589c	Merge seastar upstream Adjust make_file_*_stream() callers for updated seastar API.	2015-07-20 17:02:46 +03:00
Avi Kivity	e343295667	commitlog: don't pass a temporary string to std::regex_match The match results will point nowhere, and libstdc++ 5 rightly rejects it.	2015-06-08 09:23:18 +03:00
Calle Wilund	0729580f84	Separate replay_position into its own header Less include bloat... Signed-off-by: Calle Wilund <calle@cloudius-systems.com>	2015-06-01 14:17:43 +03:00
Avi Kivity	a83d0b258c	commitlog: fix build error in default_size	2015-05-26 20:47:39 +03:00
Avi Kivity	51b0e2a1e9	commitlog: don't specify the default_size in terms of alignment Use kiB units instead.	2015-05-26 18:50:21 +03:00
Calle Wilund	e77b23c34f	commitlog: Bump up buffer size Still not "properly" profiled what the best size is, but testing shows this helps a bit at least. Signed-off-by: Calle Wilund <calle@cloudius-systems.com> Reviewed-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-05-26 18:49:17 +03:00
Asias He	898233ddcf	Remove redundant const in static constexpr const From http://en.cppreference.com/w/cpp/language/constexpr: A constexpr specifier used in an object declaration implies const.	2015-05-25 13:09:23 +03:00
Amnon Heiman	b95dabba38	Expose the segment names in commit log This adds a method to return a vector with full-path to the active segment names. It will be used by the API. Signed-off-by: Amnon Heiman <amnon@cloudius-systems.com>	2015-05-19 15:27:52 +03:00
Avi Kivity	548646d4ba	Merge branch 'master' of github.com:cloudius-systems/seastar into db Should fix use-after-free when a frozen_mutation is applied to the local shard. Includes two adjustments to urchin collectd usage from Calle: - Updated thrift collectd registration to use proper move semantics - Commitlog: Fix collectd registration to use move semantics + test Signed-off-by: Calle Wilund <calle@cloudius-systems.com>	2015-05-12 14:47:07 +03:00
Calle Wilund	183e0b52e6	commitlog: Add collectd counters - # segments - # allocating segments - # unused segments - # allocations - # cycles (disk writes) - # flush - # total bytes allocated - # total bytes disk slack (due to dma blocks) Counters are per-commitlog (shard). Can be extended to be per-segment also, but would be transient and probably not much more useful. Signed-off-by: Calle Wilund <calle@cloudius-systems.com>	2015-05-11 16:06:51 +03:00
Calle Wilund	46e0676a7d	commitlog: Add reader stream/subscription Generic read-all-stream from a commit log segment file. Provides a byte view for each data entry, doing CRC checks and padding skips. Signed-off-by: Calle Wilund <calle@cloudius-systems.com>	2015-05-06 16:45:13 +03:00
Calle Wilund	7f685abca0	commitlog: added file header space twice Checked wrong var == 0 when creating second mem buffer in segment Signed-off-by: Calle Wilund <calle@cloudius-systems.com>	2015-05-06 16:19:56 +03:00

63 Commits