zrepl

mirror of https://github.com/zrepl/zrepl.git synced 2024-11-25 18:04:58 +01:00

Author	SHA1	Message	Date
Christian Schwarz	bfcba7b281	cmd: logging using logrus	2017-09-22 17:01:54 +02:00
Christian Schwarz	3eaba92025	cmd: introduce control socket & subcommand Move pprof debugging there.	2017-09-18 00:16:28 +02:00
Christian Schwarz	4ac7e78e2b	cmd: config: was using wrong reference to config	2017-09-17 17:45:02 +02:00
Christian Schwarz	6a05e101cf	WIP daemon: Implement * pruning on source side * local job * test subcommand for doing a dry-run of a prune policy * use a non-blocking callback from autosnap to trigger the depending jobs -> avoids races, looks saner in the debug log	2017-09-16 21:13:19 +02:00
Christian Schwarz	c6ca1efaae	cmd: fix typo	2017-09-15 19:34:38 +02:00
Christian Schwarz	1deaa459c8	config: unify job debugging options	2017-09-11 15:45:10 +02:00
Christian Schwarz	ce25c01c7e	implement stdinserver command + corresponding server How it works: `zrepl stdinserver CLIENT_IDENTITY` * connects to the socket in $global.serve.stdinserver.sockdir/CLIENT_IDENTITY * sends its stdin / stdout file descriptors to the `zrepl daemon` process (see cmsg(3)) * does nothing more This enables a setup where `zrepl daemon` is not directly exposed to the internet but instead all traffic is tunnelled through SSH. The server with the source job has an authorized_keys file entry for the public key used by the corresponding pull job command="/mnt/zrepl stdinserver CLIENT_IDENTITY" ssh-ed25519 AAAAC3NzaC1E... zrepl@pullingserver	2017-09-11 13:48:07 +02:00
Christian Schwarz	f3689563b5	config: restructure in 'jobs' and 'global' section	2017-09-11 13:43:18 +02:00
Christian Schwarz	73c9033583	WIP: Switch to new config format. Don't use jobrun for daemon, just call JobDo() once, the job must organize stuff itself. Sacrifice all the oneshot commands, they will be reintroduced as client-calls to the daemon.	2017-09-10 17:53:54 +02:00
Christian Schwarz	0df47b0b0a	move config.go to config_old.go	2017-09-09 21:57:20 +02:00
Christian Schwarz	70258fbada	cmd: add 'test' subcommand configbreak	2017-09-02 12:30:03 +02:00
Christian Schwarz	e048386cd5	cmd: add repeat config option to Prune	2017-09-02 11:08:24 +02:00
Christian Schwarz	f8979d6e83	jobrun/cmd: implement jobrun.Job for config objects	2017-09-02 11:08:24 +02:00
Christian Schwarz	6ab05ee1fa	reimplement io.ReadWriteCloser based RPC mechanism The existing ByteStreamRPC requires writing RPC stub + server code for each RPC endpoint. Does not scale well. Goal: adding a new RPC call should - not require writing an RPC stub / handler - not require modifications to the RPC lib The wire format is inspired by HTTP2, the API by net/rpc. Frames are used for framing messages, i.e. a message is made of multiple frames which are glued together using a frame-bridging reader / writer. This roughly corresponds to HTTP2 streams, although we're happy with just one stream at any time and the resulting non-need for flow control, etc. Frames are typed using a header. The two most important types are 'Header' and 'Data'. The RPC protocol is built on top of this: - Client sends a header => multiple frames of type 'header' - Client sends request body => multiple frames of type 'data' - Server reads a header => multiple frames of type 'header' - Server reads request body => multiple frames of type 'data' - Server sends response header => ... - Server sends response body => ... An RPC header is serialized JSON and always the same structure. The body is of the type specified in the header. The RPC server and client use some semi-fancy reflection techniques to automatically infer the data type of the request/response body based on the method signature of the server handler; or the client parameters, respectively. This boils down to a special-case for io.Reader, which are just dumped into a series of data frames as efficiently as possible. All other types are (de)serialized using encoding/json. The RPC layer and Frame Layer log some arbitrary messages that proved useful during debugging. By default, they log to a non-logger, which should not have a big impact on performance. pprof analysis shows the implementation spends its CPU time 60% waiting for syscalls 30% in memmove 10% ... 
On an Intel(R) Core(TM) i7-6600U CPU @ 2.60GHz CPU, Linux 4.12, the implementation achieved ~3.6GiB/s. Future optimization may include splice(2) / vmsplice(2) on Linux, although this doesn't fit so well with the heavy use of io.Reader / io.Writer throughout the codebase. The existing hackaround for local calls was re-implemented to fit the new interface of RPCServer and RPCClient. The 'R'PC method invocation is a bit slower because reflection is involved in between, but otherwise performance should be no different. The RPC code currently does not support multipart requests and thus does not support the equivalent of a POST. Thus, the switch to the new rpc code had the following fallout: - Move request objects + constants from rpc package to main app code - Sacrifice the hacky 'push = pull me' way of doing push -> need to further extend RPC to support multipart requests or something to implement this properly with additional interfaces -> should be done after replication is abstracted better than separate algorithms for doPull() and doPush()	2017-09-01 19:24:53 +02:00
Christian Schwarz	2ce07c9342	rework filters & mappings config defines a single datastructure that can act both as a Map and as a Filter (DatasetMapFilter) Cleanup wildcard syntax along the way (also changes semantics).	2017-08-06 16:21:54 +02:00
Christian Schwarz	d1999fc17c	Remove months as a possible time interval unit as it is too volatile. Thanks to @erdgeist for pointing that out. refs #2	2017-07-09 00:38:16 +02:00
Dirk Engling	5afbedbd87	Shrink the 'monthly' interval from 32 weeks to 32 days	2017-07-09 00:11:02 +02:00
Christian Schwarz	2c13fbe6ec	config: rename 'pools' section to 'remotes'	2017-07-08 12:08:34 +02:00
Christian Schwarz	e951beaef5	Simplify CLI by requiring explicit job names. Job names are derived from job type + user-defined name in config file CLI now has subcommands corresponding 1:1 to the config file sections: push,pull,autosnap,prune A subcommand always expects a job name, thus executes exactly one job. Dict-style syntax also used for PullACL and Sink sections. jobrun package is currently only used for autosnap, all others need to be invoked repeatedly via external tool. Plan is to re-integrate jobrun in an explicit daemon-mode (subcommand).	2017-07-08 11:13:50 +02:00
Christian Schwarz	b44a005bbb	Switch to using https://github.com/spf13/cobra for CLI. Use opportunity to structure project by subcommands.	2017-07-06 13:36:55 +02:00
Christian Schwarz	655b3ab55f	implement automatic snapshotting feature	2017-07-02 00:02:33 +02:00
Christian Schwarz	8c8a6ee905	implement snapshot pruning feature	2017-07-02 00:02:33 +02:00
Christian Schwarz	af2aa9dfe1	cmd/jobrun: repeat strategies as part of jobrun	2017-07-01 23:19:25 +02:00
Christian Schwarz	35dcfc234e	Implement push support. Pushing is achieved by inverting the roles on the established connection, i.e. the client tells the server what data it should pull from the client (PullMeRequest). Role inversion is achieved by moving the server loop to the serverLoop function of ByteStreamRPC, which can be called from both the Listen() function (server-side) and the PullMeRequest() client-side function. A downside of this PullMe approach is that the replication policies become part of the rpc, because the puller must follow the policy.	2017-05-20 18:17:08 +02:00
Christian Schwarz	40fe7e643d	cmd: Move replication logic to separate file.	2017-05-20 17:29:37 +02:00
Christian Schwarz	6f84bf665d	cmd: support logging reads & writes from sshbytestream to a file.	2017-05-13 15:34:28 +02:00
Christian Schwarz	fa97d3d98a	config: parse InitialReplPolicy with default to most_recent	2017-05-07 12:00:34 +02:00
Christian Schwarz	301be177ea	config: fix broken parsing of direct mapping Would only parse wildcard ('|') DirectMapping but no specific direct mappings.	2017-05-01 20:08:20 +02:00
Christian Schwarz	6da2deb96e	config: fix mapping parser	2017-04-30 23:47:11 +02:00
Christian Schwarz	3e0c758d7f	config: PushACLs, sinks are also just ClientMappings & LOCAL_TRANSPORT_IDENTITY	2017-04-30 23:47:11 +02:00
Christian Schwarz	2e6dc26993	sshbytestream: IdentityFile and custom SSHCommand.	2017-04-30 23:46:59 +02:00
Christian Schwarz	2d57e15936	Change transport format in zrepl config & parse it.	2017-04-29 20:10:09 +02:00
Christian Schwarz	9edc2005ea	config: pointer to pools for pull and push jobs	2017-04-29 19:07:47 +02:00
Christian Schwarz	526255a9ef	Implement jobrun package, abstraction for cron-like goroutines. Unlike cron, there is no overtaking though.	2017-04-29 18:29:15 +02:00
Christian Schwarz	d9ecfc8eb4	Gofmt megacommit.	2017-04-26 20:29:54 +02:00
Christian Schwarz	00231ecb73	Implement config parser.	2017-04-26 19:57:40 +02:00

36 Commits