mir/criu - criu - Mike's Git repositories

mirror of https://github.com/checkpoint-restore/criu synced 2025-08-30 13:58:34 +00:00

Author	SHA1	Message	Date
Pavel Emelyanov	f212e6fe37	dump: Reformat vma dumping a little Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-03-21 11:33:54 +04:00
Andrey Vagin	e869c16df5	mm: rework of dumping shared memory vma_entry contains shmid and all shared memory is dumped in its own files. The most interesting thing is restore. A mapping is restored by the process with the smallest pid. The mapping is created before executing the restorer. We map a full mapping and restore its content, then we open a file from /proc/pid/map_files and store a descriptor in vma_info. The mapping is unmapped. Now we can map any region of this mapping in the restorer. We use this trick because a target process may have this mapping in some places and the restorer has no function to open proc files. v2: fix error handling xemul: Fixed static-s and args for cr_dump_shmem Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-03-21 11:03:55 +04:00
Andrey Vagin	37a6c1fc88	mm: move shmid to vma_entry (v2) It will be used to restore shared mappings v2: clean up Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-03-21 10:56:31 +04:00
Pavel Emelyanov	2b9fd6ce17	dump: Formatting of regs dump fixup Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-03-17 01:02:55 +04:00
Stanislav Kinsbursky	8b7497a5ef	dump: dump pstree outside tasks loop No need in checking the pid in every task loop cycle, just dump it at the beginning and that's it. Signed-off-by: Stanislav Kinsbursky <skinsbursky@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-03-16 15:40:40 +04:00
Cyrill Gorcunov	45b76a9a9a	dump: Don't shadow the ret code with dump_namespaces cr_dump_tasks does assign ret = -1 by default but dump_namespaces does shadow this variable with own error code so that if any subsequent calls will fail we will not notice the error. Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Acked-by: Andrew Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-03-15 20:19:46 +04:00
Kinsbursky Stanislav	fe1cf26085	dump: add const qualifiers where possible Signed-off-by: Stanislav Kinsbursky <skinsbursky@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-03-06 17:00:42 +04:00
Kinsbursky Stanislav	c1999ec58e	dump: use fd_params->type for cwd and exe magic This is a cleanup patch. Use file entry type variable for special files instead of file entry addr variable. Signed-off-by: Stanislav Kinsbursky <skinsbursky@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-03-06 16:59:28 +04:00
Pavel Emelyanov	a544b6842c	sockinfo call indentation fix Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-03-04 17:07:00 +04:00
Pavel Emelyanov	7b7f09e4f0	sanitize vma types parsing Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2012-03-04 17:06:36 +04:00
Pavel Emelyanov	fc225709b0	proc: Make parse_maps return the amount of them found Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-03-02 21:31:35 +04:00
Kinsbursky Stanislav	a26d524011	dump: rename st_buf to fd_stat Signed-off-by: Stanislav Kinsbursky <skinsbursky@openvz.org> Acked-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-03-02 11:25:46 +04:00
Kinsbursky Stanislav	698fe036d0	dump: hide check for fifo Signed-off-by: Stanislav Kinsbursky <skinsbursky@openvz.org> Acked-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-03-02 11:25:46 +04:00
Kinsbursky Stanislav	ba6ac0104e	dump: remove else Signed-off-by: Stanislav Kinsbursky <skinsbursky@openvz.org> Acked-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-03-02 11:25:46 +04:00
Kinsbursky Stanislav	81060211c5	dump: hide fd type into fd_params structure Signed-off-by: Stanislav Kinsbursky <skinsbursky@openvz.org> Acked-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-03-02 11:25:46 +04:00
Kinsbursky Stanislav	42963b9450	dump: hide fd open in dump routine Signed-off-by: Stanislav Kinsbursky <skinsbursky@openvz.org> Acked-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-03-02 11:25:46 +04:00
Cyrill Gorcunov	7aa8e4b6e2	log: log-engine slight redesign The messages are filtered by their type LOG_MSG - plain messages, they escape any (!) log level filtration and go to stdout LOG_ERROR - error messages LOG_WARN - warning messages LOG_INFO - informative messages LOG_DEBUG - debug messages By default the LOG_WARN log level is used, thus LOG_INFO and LOG_DEBUG messages will not appear in output stream. pr_panic helper was replaced with pr_err, pr_warning shorthanded to pr_warn and old printk is rather pr_msg now. Because we share messages between "show" and "dump" actions, before the "show" action proceeds we need to tune up log level and set it to LOG_INFO. Also note that printing of VMA and siginfo now became LOG_INFO messages, it was not that correct to print them regardless of the log level. Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Acked-by: Pavel Emelyanov <xemul@parallels.com>	2012-03-02 01:05:43 +04:00
Pavel Emelyanov	1e98f53cca	dump: Stop scanning at once if list is empty Empty list on scan error means, that the very top task ran away from us. There's no point in trying to catch one. Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-03-01 19:31:21 +04:00
Pavel Emelyanov	ae9f1bfdc4	dump: Restart seize in case reparent occurred This can happen while dumping a pid-namespace (we can't do it now), thus put this check not to forget one in the future. Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-03-01 19:31:21 +04:00
Pavel Emelyanov	a1fe2c58a9	dump: Try to seize task tree several times on error Tasks can fork or die while we try to seize them. It's much simpler and much more reliable to unseize what was seized and walk the tree again in case some tree check failed. Yes, this makes it impossible to suspend a big tree which constantly forks :( but I'd prefer fixing more urgent issues before. Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-03-01 19:31:21 +04:00
Pavel Emelyanov	9e0b308af0	dump/restore: Rework final-state switch Remove CR_TASK_XXX states, use the TASK_XXX ones (for image). This is required to unseize tasks properly in the next patches. Plus, make sure that pstree_list and the seized set coincide (i.e. handle error in collect_task). Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-03-01 19:31:20 +04:00
Pavel Emelyanov	0afad031d5	dump: Check for process/threads tree not to change after seizeing When we've seized all the tasks and threads found in /proc check for the /proc contents be the same. Do it one-by-one as we descend the tree. This is OK, since tasks cannot create kids for anyone but themselves or their parents (reparent will be handled later). Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-03-01 19:31:20 +04:00
Pavel Emelyanov	30a2b6f314	dump: Don't pass pid to children parser routine It's in item already. Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-03-01 19:31:20 +04:00
Pavel Emelyanov	199e8d8248	dump: Check for pids reuse at suspend While we try to seize task it can die and give its pid to somebody else. This can break pstree consistency. Check for parent being valid after task is seized. Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-03-01 19:31:20 +04:00
Pavel Emelyanov	f8a18edd44	dump: Remove SHOULD_BE_DEAD task state Move proc checks for Z-state into seize_task(). Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-03-01 19:31:20 +04:00
Pavel Emelyanov	3f256945f2	dump: Sanitize threads dumping Merge two calls into one helper. For future. Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-03-01 19:31:20 +04:00
Kinsbursky Stanislav	64bf2b7d4b	dump: fix check for service dentries Without the patch all files starting with "." will be omitted. Use of strcmp() - explicit check for "." and ".." looks clearer. Signed-off-by: Stanislav Kinsbursky <skinsbursky@openvz.org> Acked-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-03-01 13:01:08 +04:00
Cyrill Gorcunov	29a7a81d25	dump: Clenaup recently added file-ids code - drop double file-ids.h inclusion - rename fd_id_entry to plain entry, it's used just in a couple of lines only, no need for complex name Reported-by: Kir Kolyshkin <kir@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-02-29 19:22:35 +04:00
Kinsbursky Stanislav	c19012326d	dump: socket queues support This patch was designed to be generic and thus usable for all kinds of sockets. Not sure that this goal has been reached, but at least I tried. Key ideas: 1) On-stack structure for collecting sockets queues and then passing them to parasite code. 2) Singly linked list is used for collecting structures, representing sockets of any kind (!) with queues. Based on xemul@ patches. Signed-off-by: Stanislav Kinsbursky <skinsbursky@openvz.org> Acked-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-02-29 17:42:30 +04:00
Cyrill Gorcunov	2acc741a3a	files: Use sys_kcmp to find file descriptor duplicates v4 We switch generic-object-id concept with sys_kcmp approach, which implies changes of image format a bit (and since it's early time for project overall, we're allowed to). In short -- previously every file descriptor had an ID generated by a kernel and exported via procfs. If the appropriate file descriptors were the same objects in kernel memory -- the IDs did match up to bit. It allows us to figure out which files were actually the identical ones and should be restored in a special way. Once sys_kcmp system call was merged into the kernel, we've got a new opportunity -- to use this syscall instead. The syscall basically compares kernel objects and returns ordered results suitable for objects sorting in a userspace. For us it means -- we treat every file descriptor as a combination of 'genid' and 'subid'. While 'genid' serves for fast comparison between fds, the 'subid' is kind of a second key, which guarantees uniqueness of genid+subid tuple over all file descriptors found in a process (or group of processes). To be able to find and dump file descriptors in a single pass we collect every fd into a global rbtree, where (!) each node might become a root for a subtree as well. The main tree carries only non-equal genid. If we find genid which is already in tree, we need to make sure that it's either indeed a duplicate or not. For this we use sys_kcmp syscall and if we find that file descriptors are different -- we simply put new fd into a subtree. Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Acked-by: Pavel Emelyanov <xemul@parallels.com>	2012-02-28 19:13:47 +04:00
Andrey Vagin	62ba357e4d	dump: use prctl to dump clear_tid_address Signed-off-by: Andrey Vagin <avagin@openvz.org> Acked-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-02-22 15:06:51 +04:00
Pavel Emelyanov	097bc0b967	dump: Collect mem+regs+sigmask atomically The ptrace seize doesn't prevent signals from delivery. That said, we should block the signals in the target task before dumping anything which is signals-related, i.e. memory and registers. But once we've blocked signals, we should dump registers before unblocking them, since any postponed signal will screw things up. Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-02-20 12:33:43 +04:00
Kir Kolyshkin	447388d79b	open_proc() and friends: hide pid_dir This patch tries to introduce lazy and hidden pid_dir support, meaning one doesn't have to worry about pid_dir but the optimization is still there. The patch relies on the fact that we work with many /proc/pid files for one pid, then for another pid and so on, i.e. not in a random manner. The idea is when we call open_proc() with a new pid for the first time, the appropriate /proc/PID directory is opened and its fd is stored. Next call to open_proc() with the same PID only needs to check that the PID is not changed. In case PID is changed, we close the old one and open/store a new one. Now the code using open_proc() and friends: - does not need to carry proc_pid around, pid is enough - does not need to call open_pid_proc() The only thing that can't be done in that "lazy" mode is closing the last PID fd, thus close_pid_proc(). Signed-off-by: Kir Kolyshkin <kir@openvz.org> Acked-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-02-17 16:46:25 +04:00
Kir Kolyshkin	5661d806cb	Move error reporting to inside open_proc and friends ...and make it correctly print the file name we were unable to open. Also, error from fdopen[dir]() is now reported with file name as well. Note that open_proc() and friends need to be macros in order for pr_perror() to show actual file name and line number where error occurred. Historical note: the original version of this patch was way more radical, changing openat() to open() and thus removing pid_dir (replacing with pid when needed) and open_proc_dir(), changing openat() to open(). The word from Pavel is he wants to keep the openat/pid_dir optimization because it saves two dentry lookups in kernel code for each open(). Because of this optimization (and desire to print correct file name in case of error) we have to carry both pid and pid_dir everywhere. Signed-off-by: Kir Kolyshkin <kir@openvz.org> Acked-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-02-17 16:46:25 +04:00
Kir Kolyshkin	3116e3e930	dump_one_task(): do not leak opened pid_dir fd Make sure we close it Signed-off-by: Kir Kolyshkin <kir@openvz.org> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-02-17 10:46:37 +04:00
Cyrill Gorcunov	cf8b39d4aa	util: Drop jerr macros Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-02-16 22:05:40 +04:00
Cyrill Gorcunov	dcb1cbfb82	Rework parasite code - make control block to keep all information needed to run injected syscall and parasite blobs - add ptrace_swap_area helper - handle both parasite engine calls and injected syscalls by single __parasite_execute function - drop jerr() usage - bring back handling of inflight signals from original program inside parasite code Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-02-16 22:05:28 +04:00
Pavel Emelyanov	354ab03a67	parasite: Enlighten parasite blob injection I don't like that we poke the parasite into remote space with 4k calls to ptrace. Now we have the /proc/pid/map_files/ dir which helps us sharing a mapping with some other process. Use this -- map the remote area for parasite locally and put the parasite blob into it with simple memcpy. Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-02-13 01:06:06 +04:00
Pavel Emelyanov	c56574b411	dump: Obtain task brk via misc dump command Right now we do syscall_seized for this, but we have the misc dumping command and the core is (after patch #3) dump after parasite, so we can get brk from the misc dump, thus avoiding one more switch to parasite. Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-02-12 11:45:29 +04:00
Pavel Emelyanov	583a7fe8c3	dump: Move core dumping behind parasite code run This is required for patch #4 Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-02-12 11:45:29 +04:00
Pavel Emelyanov	227f177194	cr: Split dumped pages locations This actually does two things: 1. The parasite code writes to pages _or_ to pages_shared file himself based on a hint given from the main program. This avoids shared pages copying in finalize_core. 2. The private pages are moved out of the core file into a separate one. This avoids private pages copying in finalize_core. The goal of this patch is a) to avoid pages copying at all (we still have one on restore, but fixing this requires Andrey's work on shared memory dumping) and b) make big blobs with pages be stored in separate files (I have plans on its format rework and unification). Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-02-12 11:45:29 +04:00
Cyrill Gorcunov	36dc98ad2a	dump: Fix test for syscall return Only the low 32 bits of orig_ax are meaningful for obtaining syscall number so we need to test if sign extended bits are greater than 0. Reported-by: Kir Kolyshkin <kir@openvz.org> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Acked-by: Pavel Emelyanov <xemul@parallels.com>	2012-02-10 18:57:13 +04:00
Kinsbursky Stanislav	a0ec1002b2	crtools: cleanup fdset initalization v2: wrappers names become less obfuscating This patch: 1) Updates function cr_fdset_open() to be suitable for handling fdset creation for dump and show stages. 2) Replaces cr_fdset_open() by new wrapper function cr_fdset_dump(). 3) Replaces prep_cr_fdset_for_restore() by new wrapper function cr_fdset_show(). Signed-off-by: Stanislav Kinsbursky <skinsbursky@parallels.com> Acked-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-02-08 21:43:12 +04:00
Cyrill Gorcunov	0998335d94	dump: Make sure error code is returned on pr_panic In case a critical error happens during the checkpoint procedure, the program should exit immediately. Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Acked-by: Pavel Emelyanov <xemul@parallels.com>	2012-02-08 16:31:33 +04:00
Cyrill Gorcunov	60f180aa77	Dump all special files in one place Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Acked-by: Pavel Emelyanov <xemul@parallels.com>	2012-02-07 20:08:06 +04:00
Cyrill Gorcunov	76a249282e	restore: Add checkpoint/restore for /proc/pid/exe symlink This patch adds ability to checkpoint/restore /proc/pid/exe symlink, so if a process we've just checkpointed has been say /path/to/exe, then at restore time we bring this path back. There is some restriction from the kernel side: if existing /proc/pid/exe is already mapped more than once, the kernel will refuse to change the symlink, so we need to restore it late, when mmaps of crtools itself are already unmapped (ie via late call in restorer.c). Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Acked-by: Pavel Emelyanov <xemul@parallels.com>	2012-02-07 20:08:01 +04:00
Andrey Vagin	4d962b27c0	crtools: dump and restore clear_tid_address pthread_join works with this patch Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-02-03 17:28:04 +04:00
Cyrill Gorcunov	e61605169f	ctrools: Rewrite task/threads stopping engine is back This commit brings the former "Rewrite task/threads stopping engine" commit back. Handling it separately is too complex so better try to handle it in-place. Note some tests might fault, it's expected. --- Stopping tasks with STOP and proceeding with SEIZE is actually excessive -- the SEIZE is enough. Moreover, just killing a task with STOP is also racy, since task should be given some time to come to sleep before its proc can be parsed. Rewrite all this code to SEIZE task and all its threads from the very beginning. With this we can distinguish stopped task state and migrate it properly (not supported now, need to implement). This thing however has one BIG problem -- after we SEIZE-d a task we should seize its threads, but we should do it in a loop -- reading /proc/pid/task and seizing them again and again, until the contents of this dir stops changing (not done now). Besides, after we seized a task and all its threads we cannot scan its children list once -- task can get reparented to init and any task's child can call clone with CLONE_PARENT flag thus repopulating the children list of the already seized task (not done also) This patch is ugly, yes, but splitting it doesn't help to review it much, sorry :( Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-02-01 19:49:28 +04:00
Cyrill Gorcunov	63b88720a3	Revert "ctrools: Rewrite task/threads stopping engine" This reverts commit `6da51eee3f`. It breaks transition/file_read test case	2012-02-01 19:27:28 +04:00
Pavel Emelyanov	6da51eee3f	ctrools: Rewrite task/threads stopping engine Stopping tasks with STOP and proceeding with SEIZE is actually excessive -- the SEIZE is enough. Moreover, just killing a task with STOP is also racy, since task should be given some time to come to sleep before its proc can be parsed. Rewrite all this code to SEIZE task and all its threads from the very beginning. With this we can distinguish stopped task state and migrate it properly (not supported now, need to implement). This thing however has one BIG problem -- after we SEIZE-d a task we should seize its threads, but we should do it in a loop -- reading /proc/pid/task and seizing them again and again, until the contents of this dir stops changing (not done now). Besides, after we seized a task and all its threads we cannot scan its children list once -- task can get reparented to init and any task's child can call clone with CLONE_PARENT flag thus repopulating the children list of the already seized task (not done also) This patch is ugly, yes, but splitting it doesn't help to review it much, sorry :( Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org>	2012-02-01 17:29:13 +04:00
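
The log-level rules described in commit 7aa8e4b6e2 in the table above can be sketched roughly as follows. This is an illustrative Python model only, not the project's C code: the numeric values of the levels and the `should_print` helper are assumptions made for the example; only the level names, the LOG_WARN default, and the "LOG_MSG escapes filtering" rule come from the commit message.

```python
from enum import IntEnum


class LogLevel(IntEnum):
    # Numeric ordering is an assumption; the commit message only names the levels.
    MSG = 0    # plain messages, never filtered
    ERROR = 1
    WARN = 2
    INFO = 3
    DEBUG = 4


# Per the commit message, LOG_WARN is the default level.
DEFAULT_LEVEL = LogLevel.WARN


def should_print(msg_level, current_level=DEFAULT_LEVEL):
    """Hypothetical helper: decide whether a message is emitted."""
    if msg_level == LogLevel.MSG:
        # LOG_MSG bypasses any log-level filtering.
        return True
    # Otherwise print only messages at or above the current severity.
    return msg_level <= current_level
```

With the default level, `should_print(LogLevel.INFO)` is false, which matches the note that the "show" action has to raise the level to LOG_INFO before it runs.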

262 Commits
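
The lazy pid_dir scheme from commit 447388d79b in the log above (open the /proc/PID directory on first use, reuse it while the pid stays the same, swap it when the pid changes) might look like the sketch below. It is a Python model of the idea, not the actual C implementation: the `ProcDirCache` class, its method names, and the injectable `opener`/`closer` parameters are hypothetical and exist only to make the example testable.

```python
import os


class ProcDirCache:
    """Hypothetical model of a lazily opened, cached /proc/<pid> directory fd."""

    def __init__(self, opener=None, closer=os.close):
        # Default opener assumes a Linux /proc filesystem.
        self._opener = opener or (
            lambda pid: os.open(f"/proc/{pid}", os.O_RDONLY | os.O_DIRECTORY)
        )
        self._closer = closer
        self._pid = None
        self._fd = None

    def get(self, pid):
        """Return the directory fd for pid, reopening only when pid changes."""
        if self._pid != pid:
            self.close()                  # drop the fd of the previous pid, if any
            self._fd = self._opener(pid)  # open and store the new one
            self._pid = pid
        return self._fd

    def close(self):
        """Explicitly close the last cached fd (the role of close_pid_proc())."""
        if self._fd is not None:
            self._closer(self._fd)
        self._pid = None
        self._fd = None
```

Callers only pass a pid to `get()`; the one step that cannot be made lazy is the final `close()`, which mirrors the commit's remark about close_pid_proc().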