mir/criu - criu - Mike's Git repositories

mir/criu

mirror of https://github.com/checkpoint-restore/criu synced 2025-08-30 05:48:05 +00:00

Author	SHA1	Message	Date
Pavel Emelyanov	b354a09cd7	rst: Brush up shared resources collection Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-09-12 20:11:33 +04:00
Pavel Emelyanov	ccce9fed2a	tty: Brush up ctl tty preparation Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-09-12 20:09:05 +04:00
Cyrill Gorcunov	20d6762d93	tty: Add restoration of controlling terminal v4 The idea behind is pretty simple -- once we find that there is a controlling terminal present we do call ioctl on appropriate /dev/pts/N. This is done in a bit unusuall manner. When we find that there is a controling terminal present we do create an additional FdinfoEntry for it with object id taken from existing master peer. The file engine stack this new FdinfoEntry on fd_info_head head list. Thus we will have at least two entries on this list. One for real Fdinfo associated with master peer and one for our new generated Fdfinfo entry, it depends on pid which one become a file master. Finally we do use post_open_fd hook in our tty code which allows us to open controlling terminal and yield proper ioctl on it. v2: - restore control terminals via service fd, still need to speedup service fd retrieval. v3: - use prepare_ctl_tty() helper to generate control terminal fdinfo entry v4: - use post_open_fd Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-09-12 20:00:58 +04:00
Cyrill Gorcunov	89a7a45d37	tty: Add checkpoint/restore for unix terminals v6 Usually the PTYs represent a pair of links -- master peer and slave peer. Master peer must be opened before slave. Internally, when kernel creates master peer it also generates a slave interface in a form of /dev/pts/N, where N is that named pty "index". Master/slave connection unambiguously identified by this index. Still, one master can carry multiple slaves -- for example a user opens one master via /dev/ptmx and appropriate /dev/pts/N in sequence. The result will be the following master `- slave 1 `- slave 2 both slave will have same master index but different file descriptors. Still inside the kernel pty parameters are same for both slaves. Thus only one slave parameters should be restored, there is no need to carry all parameters for every slave peer we've found. Not yet addressed problems: - At moment of restore the master peer might be already closed for any reason so to resolve such problem we need to open a fake master peer with proper index and hook a slave on it, then we close master peer. - Need to figure out how to deal with ttys which have some data in buffers not yet flushed, at moment this data will be simply lost during c/r - Need to restore control terminals - Need to fetch tty flags such as exclusive/packet-mode, this can't be done without kernel patching [ avagin@: - ideas on contol terminals restore - overall code redesign and simplification ] v4: - drop redundant pid from dump_chrdev - make sure optional fown is passed on regular ptys - add a comments about zeroifying termios - get rid of redundant empty line in files.c v5 (by avagin@): - complete rework of tty image format, now we have two files -- tty.img and tty-info.img. The idea behind to reduce data being stored. v6 (by xemul@): - packet mode should be set to true in image, until properly fetched from the kernel - verify image data on retrieval Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> CC: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-09-12 20:00:54 +04:00
Andrey Vagin	b11eeea381	restore: auto-unlink for ghost files (v2) A ghost file is used for restoring descriptors of an unlinked file. It is created, opened and deleted. Currently ghost files are collected in root task and then removed by crtools when everybody is restored. This scheme doesn't work, ghost_file_list is not shared, plus tasks may live in different mount namespace. It was broken by the following commit: bd4e5d2f restore: prepare shared objects after initializing namespaces We can't just move clear_ghost_files(), because we need to wait, until all processes have not opened a ghost file. We can add one more global barrier or move clear_ghost_files() in a restore code bellow an existent barrier. Here is a better sollution, a gost file is deleted by the last user. v2: Use the type atomic_t and fix a commit message. Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-09-11 17:59:59 +04:00
Andrey Vagin	f6d373cc8c	restore: prevent killing of nonpositive PIDs I don't like surprises. Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-09-07 18:52:59 +04:00
Andrey Vagin	0ae2bad0c6	mm: mark a vma as stack, if a value of sp is in it /proc/PID/maps can contains not up to date information about a stack vma. A kernel marks a VMA as stack, if thread_struct->usersp is in it, but usersp is updated, when a process calls a syscall. This problem is occured, when we try to dump/restore a process in a loop. When a restorer resumes a process, a restorer vma will be marked as stack. A thread stack should not be marked as stack, because its vma is mapped w/o MAP_GROWSDOWN. Signed-off-by: Andrey Vagin <avagin@openvz.org> Acked-by: Cyrill Gorcunov <gorcunov@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-09-07 18:21:04 +04:00
Cyrill Gorcunov	45375d5721	restore: Rename a task item being restored to `current' An analogue to current macro the kernel has. The name 'me' is somehow confusing. No func. changes. Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-09-05 19:52:55 +04:00
Cyrill Gorcunov	05466cc38a	restorer: Pass current log level to the arguments Will need it to honor current log level in restorer. Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-09-03 14:44:09 +04:00
Andrey Vagin	9ec01ff307	log: don't create a log file in a current directory We can set a directory for log and image files. crtools sets it as a current directory and then creates all files in it. It works before we don't decide to change a mount name space. I suggest to open a log dir and create files for help openat. Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-09-02 01:02:30 +04:00
Andrey Vagin	d34b9004a7	restore: use a currect stack for new processes (v3) Why do we need a new stack? We already have one and it can be used. We need to step a bit for executing a glibc clone() v2: Don't lose a page from a child's stack v3: Remove the defined constant STACK_SIZE Signed-off-by: Andrey Vagin <avagin@openvz.org> Acked-by: Cyrill Gorcunov <gorcunov@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-08-28 23:19:28 +04:00
Pavel Emelyanov	3ae36e700f	restore: Don't mess with last_pid when restoring pidns init When we fork a pidns init there's no need in specifying its pid, as it will be autogenerated to 1. Clean the code not to mess with the last_pid sysctl at all in that case, rather than just omitting the write into it. Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-08-14 14:09:20 +04:00
Andrey Vagin	aabb56bd66	crtools: write a pid of a root task in a specified file When we restore a pid namespace the root task will get some unknown pid in the original (i.e. -- the ns crtools a launched from) one. To find this pid out one can use this option -- it will make the pid obtained by the new init to be written into a pid file. Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-08-14 12:54:00 +04:00
Pavel Emelyanov	5c9cc71fea	log: Replace perror-s with pr_perror-s over code Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-08-11 21:57:42 +04:00
Pavel Emelyanov	9efd12f2c7	code: Remove trailing whitespaces over .c and .h files Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-08-11 21:34:35 +04:00
Cyrill Gorcunov	57032aff5e	restorer: Do restore futex robust lists Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-08-10 20:29:01 +04:00
Pavel Emelyanov	1a62282d48	net: Push the host end of a veth to original netns The call will then have to handle this end (put into a bridge). Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-08-10 19:14:36 +04:00
Pavel Emelyanov	7f1c9af0f8	vma: State that vma->fd is -1 constant in the image This field was lost while switching to protobuf -- the vma images were used by parasite as plain array and it was easier to reseve this space in the image. Now it's too late to change this, so make it be -1 always. Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-08-10 10:17:50 +04:00
Pavel Emelyanov	fc7071d05e	net: Packet sockets basic support Support only basic packet socket functionality -- create and bind. This should be enough to start testing dhclient inside container. Other stuff (filter, mmaps, fanouts, etc.) will come later. Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-08-09 16:17:41 +04:00
Pavel Emelyanov	b1b0a39a58	pb: Rewrite object reading to use pb-descs The pb_read thing is no longer a macros. This will allow to factor out objects collecting on restore. Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-08-07 19:22:00 +04:00
Andrey Vagin	8bff4c7fca	restore: consolidate restoring of a root task in one blob Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-08-06 18:37:13 +04:00
Andrey Vagin	703a322cc0	restore: mount_proc return a result instead of exit In addition it fixes error handling. Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-08-06 18:36:59 +04:00
Andrey Vagin	4c88cafe43	restore: fix clean up in PIDNS When processes are restored in PIDNS, the controll process (crtools) don't know a real pid of processes, but it knows a pid of init. crtools can kill init and all other processes will be killed too. Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-08-06 18:31:39 +04:00
Andrey Vagin	bd4e5d2f9d	restore: prepare shared objects after initializing namespaces On this stage crtools unlink old socket files, create ghost files and etc, so we should be in a correct namespace. Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-08-02 16:08:06 +04:00
Andrey Vagin	3cb5969b25	pstee: fix memory corrupation The pstree_item for helpers is allocated without rst_info. Before this patch prepare_fd_pid was executed for such items and touched rst_info. Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-08-02 15:54:54 +04:00
Pavel Emelyanov	da409cc641	signalfd: Dumping and restoring Only the fact of the fd presence, its flags and fown and the sigmask. The sigpending state is tightly coupled with the task's sigpending state which is not yet supported. Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-08-02 12:26:35 +04:00
Andrey Vagin	2e9ddccdb9	restore: rework logic about temporary proc We need proc for restoring processes. The existent /proc may be not suitable. E.g. If processes are in pidns. crtools mounts procfs in a temporary directory, but it should be umounted at the end. Before this patch crtools did that, but it doesn't work if processes in a mount namespace. Actually this logic can be simplified and this patch does that. * create a tmp dir * mount procfs * open this directory and save a file descriptor. * detach procfs * remove the tmp dir * access to proc via openat, statat and so on. Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-08-01 15:01:13 +04:00
Cyrill Gorcunov	58b0ef655f	restore: Add test for optional PB fields in core_entry Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-07-20 14:30:20 +04:00
Cyrill Gorcunov	9d918c5964	protobuf: Convert core_entry to PB format v5 This requires some exlanations - Since we use protobuf data in restorer code we need to carry a copy of appropriate PB entities in resident memory. For this sake task_restore_core_args and thread_restore_args were significantly reworked. In short -- the caller code fills PB structures into task arguments space. v3: - Combine everything arch related to thread_info field, and make it optional - Drop "version" field from message, we check version in another specific message - Don't forget to call core_entry__free_unpacked where needed - We continue dumping FPU state, still it's not yet restored v4: - Don't carry task_core_entry and task_kobs_ids_entry for threads, and yield error if present in image. v5: - Allocate core_entry depending on type of task being dumped Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-07-20 14:06:42 +04:00
Pavel Emelyanov	64967eef21	crtools: Kill the ability to work on individual process We haven't tested it for several monthes and there's no evidence it is required at all. For dumping a single task -t option works just fine. Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-07-19 17:55:34 +04:00
Pavel Emelyanov	9f2168a4f0	images: Introduce the top-level file -- inventory Currently we store the images version in the core file. This is bad, since core file describes a single process (or thread) and says nothing about the images set as a whole (let alone the fact that it's being parsed too late). Thus introduce the inventory image file which describes the image set the way we need (want). For now the only entry in it is the images version. In the future it can be extended. Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-07-19 17:37:25 +04:00
Cyrill Gorcunov	4806e1395f	protobuf: Convert vma_entry to PB format v3 v2: - Use regular uint types in message proto - Use PB engine for "show" v3: - drop usage of temp. variable in prepare_shmem_pid Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-07-19 12:43:36 +04:00
Pavel Emelyanov	ffd40996ea	pb: Switch creds to protobuf format Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-07-19 12:35:25 +04:00
Cyrill Gorcunov	808b8f2f06	protobuf: Convert mm_entry to PB format Because the MmEntry has a "repeated" field, we copy aux vector explicitly and reserve space for it in task args. Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-07-19 07:25:05 +04:00
Cyrill Gorcunov	a7691bcbe2	protobuf: Convert itimer_entry to PB format Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-07-18 16:27:01 +04:00
Cyrill Gorcunov	6b9d3affc9	protobuf: Convert sa_entry to PB format Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-07-18 16:25:06 +04:00
Pavel Emelyanov	786012e891	mnt: Fix mountinfo collecting issues 1. Mountinfo should be collected after we have forked into new namespace (strictly speaking this is so) 2. When restoring a mnt ns we can reuse the collected mntinfos rather than reading them again. Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-07-15 08:43:37 +04:00
Andrey Vagin	6fb3759c5f	restore: restore pgid in two phases As described in the previous patch, process group leaders are restored in the first phase, then all other processes restores pgid. Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-07-02 16:53:55 +04:00
Andrey Vagin	5c45786417	restore: wait while restroring pgid (v2) Pgid leader should become such before any other task tries to enter this group (with setpgid). Thus we introduce a yet another global sync point -- before it all pgid leaders call setpgid after it all the others do it. v2: wait while helpers restored pgid Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-07-02 16:53:54 +04:00
Cyrill Gorcunov	ea1ce8e472	fifo: Add checkpoint restore for fifos v4 Checkpoint and restore of fifo is similar to pipes c/r except the pipe end-points are named file. Because the fifo has a name we use regular files facility for fifo path c/r. Still there is a trick used to "open" fifo: the opening procedure migh sleep if a fifo's peer is not yet opened, so before doing a real open we yield a fake open procedure (with O_RDWR flag) which prevents us from sleeping even if peer is not yet ready. Also we need writable fifo end to restore data queued. v2: - add open/priv members to reg_file_info - make open_fifo_fd to use open_fe_fd - comment on pipe_id - make sure the fifo data is not restored twice v3: - drop useless fixme comment and add sane one v4: - Use restore_data flag to escape data restore duplication - Use S_ISREG for file contents copying Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-07-01 17:15:48 +04:00
Andrey Vagin	49c1d43645	pstree: move all code about pstree in a separate file Signed-off-by: Andrey Vagin <avagin@openvz.org> Looks-cool-to: Cyrill Gorcunov <gorcunov@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-06-27 21:07:30 +04:00
Pavel Emelyanov	a201f59816	rst: Create custom proc every time we restore a namespace We need own proc for restoring mount namespace, since the proc should be umounted and mounted back diring namespace restore and I don't want to introduce a special kludge for this. One more notice -- the temp proc is mounted _after_ namespaces recreation for the same reason (it will be umounted by prepare_mnt_ns). Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-06-27 20:57:40 +04:00
Cyrill Gorcunov	cb74aa3e48	files: Drop redundant declarations from files.h These are declared in files-reg.h, so get rid of them and add files-reg.h inclusion where needed. Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-06-22 17:02:23 +04:00
Andrey Vagin	a4685cd30c	restore: check own sid instead of parent's one Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-06-22 12:15:23 +04:00
Andrey Vagin	1264a7a9c6	restore: fail restore if pgid or sid are not restored Don't fail if a root non-init task has another sid, because it's inherited from parent and can't be restored and it's expected behaviour, when a subtree is dumped. Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-06-22 12:15:03 +04:00
Andrey Vagin	8c4017d933	restore: print message about sid only if it's restored Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-06-22 12:14:14 +04:00
Andrey Vagin	9b8a206729	restore: restore sid of task which isn't leaders and isn't a child of init (v4) It's sign, that a parent has been changed sid after forking a child. We should know a sid with which a process was born, because in a processes chain, more then one process might change SID. v2: fix names of variables v3: prevent rewriting of born_sid v4: Abort the restorer with error message if a born_sid can't be determing. Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-06-22 12:13:58 +04:00
Andrey Vagin	2c412fa6ac	restore: restore sids of tasks, which have been reparented to init (v3) * Create helpers for processes which have been reparented to init. * Insert helpers in a process tree. * Helpers will exit after constructing a process tree. v2: fix variables names and check errors v3: add comments in code Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-06-22 12:13:26 +04:00
Andrey Vagin	eb9a6f2015	restore: add interface for creating helper tasks (v3) They will be used for restoring sid. For example, if a session group leader is absent, a helper process is created with this id and it will die after restoring all other tasks. Before this patch restore failed if anyone exited. Now we should skip helpers, which exited successfully. It's a bit tricky. All children are collected in sigchld_handler, but we have a point, where we want to wait all helpers. For that waitpit is used and ECHLD is ignored, because it signs that a helper exited and has been waited in sigchld_handler. v2: check that me isn't NULL in the sig handler v3: move code about waiting helpers in a separate function Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-06-22 12:12:26 +04:00
Andrey Vagin	acacc6049e	restore: calculate a maximum value of PID-s It will be used for allocating PIDs for helper tasks Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-06-22 12:11:41 +04:00

1 2 3 4 5 ...

306 Commits