scylladb

Author	SHA1	Message	Date
Avi Kivity	5f62f7a288	Revert "Merge "Commit log replay" from Calle" Due to test breakage. This reverts commit `43a4491043`, reversing changes made to `5dcf1ab71a`.	2015-08-27 12:39:08 +03:00
Avi Kivity	43a4491043	Merge "Commit log replay" from Calle "Initial implementation/transposition of commit log replay. * Changes replay position to be shard aware * Commit log segment ID:s now follow basically the same scheme as origin; max(previous ID, wall clock time in ms) + shard info (for us) * SStables now use the DB definition of replay_position. * Stores and propagates (compaction) flush replay positions in sstables * If CL segments are left over from a previous run, they, and existing sstables are inspected for high water mark, and then replayed from those marks to amend mutations potentially lost in a crash * Note that CPU count change is "handled" in so much that shard matching is per _previous_ runs shards, not current. Known limitations: * Mutations deserialized from old CL segments are _not_ fully validated against existing schemas. * System::truncated_at (not currently used) does not handle sharding afaik, so watermark ID:s coming from there are dubious. * Mutations that fail to apply (invalid, broken) are not placed in blob files like origin. Partly because I am lazy, but also partly because our serial format differs, and we currently have no tools to do anything useful with it * No replay filtering (Origin allows a system property to designate a filter file, detailing which keyspace/cf:s to replay). Partly because we have no system properties. There is no unit test for the commit log replayer (yet). Because I could not really come up with a good one given the test infrastructure that exists (tricky to kill stuff just "right"). The functionality is verified by manual testing, i.e. running scylla, building up data (cassandra-stress), kill -9 + restart. This of course does not really fully validate whether the resulting DB is 100% valid compared to the one at k-9, but at least it verified that replay took place, and mutations where applied. (Note that origin also lacks validity testing)"	2015-08-27 10:53:36 +03:00
Calle Wilund	2a1c7d2587	CommitLogReplayer: Java -> C++ Initial implementation	2015-08-25 09:41:56 +02:00
Calle Wilund	86a97fea4c	Commitlog: Allow skipping X bytes in commit log reader Also refactor reader into named methods for debugging sanity.	2015-08-25 09:41:55 +02:00
Calle Wilund	37cfc09e91	Commitlog: Handle full paths in descriptor file name parse.	2015-08-25 09:41:55 +02:00
Calle Wilund	4364d72ca3	Commitlog: Expose convinience method "list_existing_segments"	2015-08-25 09:41:54 +02:00
Calle Wilund	a3a02968ab	Commitlog: Expose list_existing_descriptors	2015-08-25 09:41:54 +02:00
Calle Wilund	fcb87471b9	Commitlog: Make file reader provide replay_position for entries	2015-08-25 09:40:53 +02:00
Calle Wilund	db6370ad87	Commitlog: Make descriptor type visible/usable from outside	2015-08-25 09:40:53 +02:00
Calle Wilund	4f24b9795e	Commitlog: change the ID generation scheme * Make it more like origin, i.e. based on wall clock time of app start * Encode shard ID in the, RP segement ID, to ensure RP:s and segement names are unique per shard	2015-08-25 09:40:52 +02:00
Pekka Enberg	544c7936d8	db/commitlog: Kill Java code Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-08-24 11:51:49 +03:00
Calle Wilund	ac74dd6159	Commitlog: Make "position" type 32-bit to align replay_position with Origin	2015-08-24 10:05:44 +02:00
Calle Wilund	d50986ef31	Commitlog: do not eagerly create first segment on init Deferring makes it easier to separate old segments from new, which in turn helps replay logic.	2015-08-24 10:05:44 +02:00
Calle Wilund	8f0f4e7945	Commitlog: do more extensive dir entry probes to determine type Since directory_entry "type" might not be set. Ensuring that code does not remain future free or easy to read. Fixes #157.	2015-08-17 16:56:31 +03:00
Calle Wilund	2db7791c6a	Commitlog: Attempt to reduce allocation size for segment if alloc fails	2015-08-12 16:20:12 +02:00
Calle Wilund	4fe98d3acf	Commitlog: Throw bad_alloc on memalign fail (avoid sigsegv later)	2015-08-12 16:20:11 +02:00
Calle Wilund	7191a130bb	Commitlog: recycle buffers to reduce fragmentation.	2015-08-12 16:20:11 +02:00
Calle Wilund	6ac6d644be	Commitlog: add logging Note: pretty lame logging, but modeled after origin. Signed-off-by: Calle Wilund <calle@cloudius-systems.com>	2015-08-10 18:42:41 +03:00
Avi Kivity	a8ff8ea442	commitlog: switch to faster crc32 implementation	2015-08-09 00:05:36 +03:00
Avi Kivity	dce642f472	commitlog: fix use-after-free of file during close Reviewed-by: Nadav Har'El <nyh@cloudius-systems.com>	2015-07-29 20:36:03 +03:00
Tomasz Grabiec	e416e800c8	commitlog: Fix use-after-move "f" was passed to make_file_input_stream() after it was moved-from.	2015-07-22 19:21:57 +03:00
Nadav Har'El	4edf7fe206	clean up uses of lw_shared_ptr<file> recently, "file" started to use a shared_ptr internally, and is already copy-able and reference counted, and there is no reason to use lw_shared_ptr<file>. This patch cleans up a few remaining places where lw_shared_ptr<file> was used. Signed-off-by: Nadav Har'El <nyh@cloudius-systems.com>	2015-07-22 11:51:40 +03:00
Avi Kivity	4a95f1589c	Merge seastar upstream Adjust make_file_*_stream() callers for updated seastar API.	2015-07-20 17:02:46 +03:00
Avi Kivity	e343295667	commitlog: don't pass a temporary string to std::regex_match The match results will point nowhere, and libstdc++ 5 rightly rejects it.	2015-06-08 09:23:18 +03:00
Calle Wilund	0729580f84	Separate replay_position into its own header Less include bloat... Signed-off-by: Calle Wilund <calle@cloudius-systems.com>	2015-06-01 14:17:43 +03:00
Avi Kivity	a83d0b258c	commitlog: fix build error in default_size	2015-05-26 20:47:39 +03:00
Avi Kivity	51b0e2a1e9	commitlog: don't specify the default_size in terms of alignment Use kiB units instead.	2015-05-26 18:50:21 +03:00
Calle Wilund	e77b23c34f	commitlog: Bump up buffer size Still not "properly" profiled what the best size is, but testing shows this helps a bit at least. Signed-off-by: Calle Wilund <calle@cloudius-systems.com> Reviewed-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-05-26 18:49:17 +03:00
Asias He	898233ddcf	Remove redundant const in static constexpr const From http://en.cppreference.com/w/cpp/language/constexpr: A constexpr specifier used in an object declaration implies const.	2015-05-25 13:09:23 +03:00
Amnon Heiman	b95dabba38	Expose the segment names in commit log This adds a method to return a vector with full-path to the active segment names. It will be used by the API. Signed-off-by: Amnon Heiman <amnon@cloudius-systems.com>	2015-05-19 15:27:52 +03:00
Avi Kivity	548646d4ba	Merge branch 'master' of github.com:cloudius-systems/seastar into db Should fix use-after-free when a frozen_mutation is applied to the local shard. Includes two adjustments to urchin collectd usage from Calle: - Updated thrift collectd registration to use proper move semantics - Commitlog: Fix collectd registration to use move semantics + test Signed-off-by: Calle Wilund <calle@cloudius-systems.com>	2015-05-12 14:47:07 +03:00
Calle Wilund	183e0b52e6	commitlog: Add collectd counters - # segments - # allocting segments - # unused segments - # allocations - # cycles (disk writes) - # flush - # total bytes allocated - # total bytes disk slack (due to dma blocks) Counters are per-commitlog (shard). Can be extended to be per-segment also, but would be transient and probably not much more useful. Signed-off-by: Calle Wilund <calle@cloudius-systems.com>	2015-05-11 16:06:51 +03:00
Calle Wilund	46e0676a7d	commitlog: Add reader stream/subscription Generic read-all-stream from a commit log segmen file. Provides a byte view for each data entry, doing CRC checks and padding skips. Signed-off-by: Calle Wilund <calle@cloudius-systems.com>	2015-05-06 16:45:13 +03:00
Calle Wilund	7f685abca0	commitlog: added file header space twice Checked wrong var == 0 when creating second mem buffer in segment Signed-off-by: Calle Wilund <calle@cloudius-systems.com>	2015-05-06 16:19:56 +03:00
Calle Wilund	897083e213	commitlog: Sync check time delta was inverted Signed-off-by: Calle Wilund <calle@cloudius-systems.com>	2015-05-06 16:19:49 +03:00
Calle Wilund	d2d3547115	commitlog: Initial data chunk offset should be 16 As in "after file header". Signed-off-by: Calle Wilund <calle@cloudius-systems.com>	2015-05-06 16:19:43 +03:00
Pekka Enberg	bf1734c480	db/commitlog: Use 'pragma once' in commitlog.hh Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-05-06 09:47:42 +03:00
Pekka Enberg	a32ae69b2b	db/commitlog: Minor formatting fixes Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-05-06 09:47:21 +03:00
Pekka Enberg	9920d58b70	db/commitlog: Use C++ type aliases Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-05-06 09:43:04 +03:00
Calle Wilund	e1c0ddea93	commitlog: make sure we don't race creating segments Use a semaphore as gatekeeper to make sure we finish creating a segment before starting a new one. Signed-off-by: Calle Wilund <calle@cloudius-systems.com>	2015-05-05 09:59:27 +03:00
Calle Wilund	2f4e7a00f6	Use db/config object in main, database etc * Uses config object to augument/impl options parsing * Database now holds config obj * Commitlog can now be inited with global config obj.	2015-04-29 18:01:17 +02:00
Calle Wilund	aeb83f2874	Add commitlog to db + use it in storage_proxy/handler * A commitlog is created in "work" dirs when initing the db from a datadir. However, since we have neither disk data storage, nor replay capability yet (and no real db config), the settings are basically to just write in-memory serialization, write them to disk and then discard them. So in fact, pointless. But at least using the log... * Moved the actual "apply" of mutation into database. If a commitlog is active, add an entry to it before applying mutation.	2015-04-29 10:10:21 +02:00
Calle Wilund	07f1208926	Make commitlog respect max disk size setting.	2015-04-29 10:10:21 +02:00
Calle Wilund	9979ee8d45	Modify commit log to use dataoutput Both internal usage and external interface.	2015-04-01 10:08:00 +02:00
Calle Wilund	80b244cc4b	commitlog: fix using wrong segment Commit log segment "finish_and_get_new" should not call "new_segment" explicitly, since more than one invocation might get there on the same flush condition (segment full). Signed-off-by: Calle Wilund <calle@cloudius-systems.com>	2015-03-09 17:54:47 +02:00
Calle Wilund	054f9ed082	Initial commit log support. Implements a cassandra-file-compatible segmented log of "mutations". Handles "batch" and "periodic" mode like stock version. Also includes "dirty" management for the interaction log/memtable. Supports: * add * sync/flush * clear dirty bits (thus discarding segments) Many more estoric stock functions not yet implemented. Missing: Storage management. Does not deal with total size on disk of segments yet. Nor does it have any provisions for dealing with active buffer bloat should async writes stall. [avi: adjust for future<>::rescue() removal] Signed-off-by: Calle Wilund <calle@cloudius-systems.com>	2015-03-05 11:06:09 +02:00
Calle Wilund	663cb6c4b1	db: Import db/commmitlog/* Signed-off-by: Calle Wilund <calle@cloudius-systems.com>	2015-03-05 11:02:46 +02:00

47 Commits