zrepl

mirror of https://github.com/zrepl/zrepl.git synced 2024-11-22 08:23:50 +01:00

Author	SHA1	Message	Date
Christian Schwarz	7303d91abf	WIP state-machine based replication	2018-08-11 12:19:10 +02:00
Christian Schwarz	c1f3076eb3	WIP2 logging done somewhat	2018-08-10 17:06:00 +02:00
Christian Schwarz	74445a0017	fixup	2018-08-08 13:12:50 +02:00
Christian Schwarz	1a8d2c5ebe	replication: context support and propert closing of stale readers	2018-07-08 23:31:46 +02:00
Christian Schwarz	8cca0a8547	Initial working version Summary: * Logging is still bad * test output in a lot of placed * FIXMEs every where Test Plan: None, just review Differential Revision: https://phabricator.cschwarz.com/D2	2018-06-24 10:44:00 +02:00
Christian Schwarz	5f2c14adab	zfs: use custom datatype to pass ZFS properties in ZFSSet refs #55	2018-01-05 18:42:10 +01:00
Christian Schwarz	be7176bee7	Puller: fix wrong filesystem log field usage was introduced in `9465b593`	2017-12-29 21:25:42 +01:00
Christian Schwarz	7d89d1fb00	job pull: refactor + use Task API refs #10	2017-12-27 18:34:24 +01:00
Christian Schwarz	b69089a527	Puller: refactor + use Task API * drop rx byte count functionality * will be re-added to Task as necessary refs #10	2017-12-27 14:39:47 +01:00
Christian Schwarz	59e34942d1	Puller: make main interface public refs #10	2017-12-27 14:39:46 +01:00
Christian Schwarz	58ee796394	adopt Task API: infect datastructures refs #10	2017-12-27 14:39:46 +01:00
Christian Schwarz	583a63a68f	refactor: encapsulate pulling in a struct refs #10	2017-12-24 15:23:28 +01:00
Christian Schwarz	0cbee78b40	fix unreachable code & missing stringer-generated code	2017-09-30 16:31:55 +02:00
Christian Schwarz	c31ec8c646	convert more code to structured logging	2017-09-23 17:52:29 +02:00
Christian Schwarz	9465b593f9	cmd: configurable logrus formatters We lost the nice context-stack [jobname][taskname][...] at the beginning of each log line when switching to logrus. Define some field names that define these contexts. Write a human-friendly formatter that presents these field names like the solution we had before logrus. Write some other formatters for logfmt and json output along the way. Limit ourselves to stdout logging for now.	2017-09-23 11:24:36 +02:00
Christian Schwarz	bfcba7b281	cmd: logging using logrus	2017-09-22 17:01:54 +02:00
Christian Schwarz	73c9033583	WIP: Switch to new config format. Don't use jobrun for daemon, just call JobDo() once, the job must organize stuff itself. Sacrifice all the oneshot commands, they will be reintroduced as client-calls to the daemon.	2017-09-10 17:53:54 +02:00
Christian Schwarz	7e442ea0ea	cmd: remove legacy NoMatchError	2017-09-02 12:40:22 +02:00
Christian Schwarz	582ae83da3	cmd: remove RunCmd	2017-09-01 19:29:19 +02:00
Christian Schwarz	3070d156a3	jobrun: rename to jobmetadata	2017-09-01 19:29:19 +02:00
Christian Schwarz	6ab05ee1fa	reimplement io.ReadWriteCloser based RPC mechanism The existing ByteStreamRPC requires writing RPC stub + server code for each RPC endpoint. Does not scale well. Goal: adding a new RPC call should - not require writing an RPC stub / handler - not require modifications to the RPC lib The wire format is inspired by HTTP2, the API by net/rpc. Frames are used for framing messages, i.e. a message is made of multiple frames which are glued together using a frame-bridging reader / writer. This roughly corresponds to HTTP2 streams, although we're happy with just one stream at any time and the resulting non-need for flow control, etc. Frames are typed using a header. The two most important types are 'Header' and 'Data'. The RPC protocol is built on top of this: - Client sends a header => multiple frames of type 'header' - Client sends request body => mulitiple frames of type 'data' - Server reads a header => multiple frames of type 'header' - Server reads request body => mulitiple frames of type 'data' - Server sends response header => ... - Server sends response body => ... An RPC header is serialized JSON and always the same structure. The body is of the type specified in the header. The RPC server and client use some semi-fancy reflection tequniques to automatically infer the data type of the request/response body based on the method signature of the server handler; or the client parameters, respectively. This boils down to a special-case for io.Reader, which are just dumped into a series of data frames as efficiently as possible. All other types are (de)serialized using encoding/json. The RPC layer and Frame Layer log some arbitrary messages that proved useful during debugging. By default, they log to a non-logger, which should not have a big impact on performance. pprof analysis shows the implementation spends its CPU time 60% waiting for syscalls 30% in memmove 10% ... On a Intel(R) Core(TM) i7-6600U CPU @ 2.60GHz CPU, Linux 4.12, the implementation achieved ~3.6GiB/s. Future optimization may include spice(2) / vmspice(2) on Linux, although this doesn't fit so well with the heavy use of io.Reader / io.Writer throughout the codebase. The existing hackaround for local calls was re-implemented to fit the new interface of PRCServer and RPCClient. The 'R'PC method invocation is a bit slower because reflection is involved inbetween, but otherwise performance should be no different. The RPC code currently does not support multipart requests and thus does not support the equivalent of a POST. Thus, the switch to the new rpc code had the following fallout: - Move request objects + constants from rpc package to main app code - Sacrifice the hacky 'push = pull me' way of doing push -> need to further extend RPC to support multipart requests or something to implement this properly with additional interfaces -> should be done after replication is abstracted better than separate algorithms for doPull() and doPush()	2017-09-01 19:24:53 +02:00
Christian Schwarz	676ac41677	fix leaking channel when closing connection	2017-08-09 21:03:05 +02:00
Christian Schwarz	4e45b4090b	pull log output: optimize to be readable by humans	2017-08-06 18:28:05 +02:00
Christian Schwarz	cba083cadf	Make zfs.DatasetPath json.Marshaler and json.Unmarshaler Had to resort to using pointers to zfs.DatasetPath everywhere... Should find a better solution for that.	2017-08-06 16:22:15 +02:00
Christian Schwarz	2ce07c9342	rework filters & mappings config defines a single datastructure that can act both as a Map and as a Filter (DatasetMapFilter) Cleanup wildcard syntax along the way (also changes semantics).	2017-08-06 16:21:54 +02:00
Christian Schwarz	4732fdd4cc	Implement placeholder filesystems. Note the docs on the placeholder user property introduced with this commit. The solution is not really satisfying but couldn't think of a better one OTOMH	2017-08-06 16:21:54 +02:00
Christian Schwarz	8eb4a2ba44	Rudimentary progress reporting on send / recv side.	2017-08-06 16:21:54 +02:00
Christian Schwarz	4b373fbd95	zfs & replication: explicit conflict types for FilesystemDiff + handling in repl	2017-07-08 13:13:16 +02:00
Christian Schwarz	e951beaef5	Simplify CLI by requiring explicit job names. Job names are derived from job type + user-defined name in config file CLI now has subcommands corresponding 1:1 to the config file sections: push,pull,autosnap,prune A subcommand always expects a job name, thus executes exactly one job. Dict-style syntax also used for PullACL and Sink sections. jobrun package is currently only used for autosnap, all others need to be invoked repeatedly via external tool. Plan is to re-integrate jobrun in an explicit daemon-mode (subcommand).	2017-07-08 11:13:50 +02:00
Christian Schwarz	b44a005bbb	Switch to using https://github.com/spf13/cobra for CLI. Use opportunity to structure project by subcommands.	2017-07-06 13:36:55 +02:00
Christian Schwarz	c22190e981	zfs: extract filesystem version code to separate file & add filtering support	2017-07-01 23:19:31 +02:00
Christian Schwarz	3c7f782dac	rpc: remove FilesystemRequest.Direction (unused)	2017-05-20 17:43:49 +02:00
Christian Schwarz	40fe7e643d	cmd: Move replication logic to separate file.	2017-05-20 17:29:37 +02:00

33 Commits