mir/criu - criu - Mike's Git repositories

mir/criu

mirror of https://github.com/checkpoint-restore/criu synced 2025-08-31 06:15:24 +00:00

Author	SHA1	Message	Date
Pavel Emelyanov	b8556e8084	usernsd: The way to restore priviledged stuff in userns We have collected a good set of calls that cannot be done inside user namespaces, but we need to [1]. Some of them has already being addressed, like prctl mm bits restore, but some are not. I'm pretty sceptical about the ability to relax the security checks on quite a lot of them (e.g. open-by-handle is indeed a very dangerous operation if allowed to unpriviledged user), so we need some way to call those things even in user namespaces. The good news about it its that all the calls I've found operate on file descriptors this way or another. So if we had a process, that lived outside of user namespace, we could ask one to do the high priority operation we need and exchange the affected file descriptor via unix socket. So the usernsd is the one doing exactly this. It starts before we create the user namespace and accepts requests via unix socket. Clients (the processes we restore) send him the functions they want to call, the descriptor they want to operate on and the arguments blob. Optionally, they can request some file descriptor back after the call. In non usernamespace case the daemon is not started and the calls are done right in the requestor's process environment. In the next patch there's an example of how to use this daemon to do the priviledged SO_SNDBUFFORCE/_RCVBUFFORCE sockopt on a socket. [1] http://criu.org/UserNamespace Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Acked-by: Andrew Vagin <avagin@openvz.org>	2015-02-13 16:11:38 +04:00
Ruslan Kuprieiev	df301b7eb7	security: create separate security.h header Signed-off-by: Ruslan Kuprieiev <kupruser@gmail.com> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2015-02-10 16:53:54 +03:00
Saied Kazemi	296129295a	Allow the veth-pair option to specify a bridge When restoring a pair of veth devices that had one end inside a namespace or container and the other end outside, CRIU creates a new veth pair, puts one end in the namespace/container, and names the other end from what's specified in the --veth-pair IN=OUT command line option. This patch allows for appending a bridge name to the OUT string in the form of OUT@<BRIDGE-NAME> in order for CRIU to move the outside veth to the named bridge. For example, --veth-pair eth0=veth1@br0 tells CRIU to name the peer of eth0 veth1 and move it to bridge br0. This is a simple and handy extension of the --veth-pair option that obviates the need for an action script although one can still do the same (and possibly more) if they prefer to use action scripts. Signed-off-by: Saied Kazemi <saied@google.com> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2015-01-12 14:54:18 +03:00
Pavel Emelyanov	a1b1959dd1	shmem: Turn shmem-info into shared objects from shremap ones We have a nasty issue with it. Current code allocates these entries in shremap area one by one. We do NOT allocate any OTHER entries in this region, but if we will this array will be spoiled. Fortunately we no longer need shmem-infos as plain array, neither we need one in restorer. So just turn this into plain shared objects and collect them in a list. Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2015-01-12 14:47:24 +03:00
Cyrill Gorcunov	fd07bc7791	cpu: Add 'ins' mode to --cpu-cap option In this mode we test if target cpu has all features present in image file but do not require bit to bit match: target cpu may be a new one with more features present. Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-12-26 18:15:46 +03:00
Pavel Emelyanov	2694a74a00	aio: Restore AIO contexts Restoring AIO is quite simple. Once all VMAs are put in their places we can call io_setup() to let kernel create the context back and then move the ring into proper place. Another thing we should "restore" is the context ID. But the thing is, upon ring creation kernel repots the ring start address as this ID. And there's a patch in the -next tree that changes the ID when we remap the ring. That said after AIO context creation and ring remap we need to check that the new ID is seen by the kernel. Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-12-26 18:13:40 +03:00
Ruslan Kuprieiev	b30940eee2	cr_errno: move cr_err helpers into cr_errno.h Signed-off-by: Ruslan Kuprieiev <kupruser@gmail.com> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-12-22 13:50:45 +03:00
Ruslan Kuprieiev	e76749b790	cr-restore: set cr_error to EEXIST if such pid already exists, v3 This is a very common error when using criu. The problem here is that we need to somehow transfer cr_errno from one process to another. I suggest using pipe to give one end to children and read cr_errno on other after restore is finished. v2, Pavel suggested putting errno into shared task_entries. v3. and he also suggested using cmpxchg Signed-off-by: Ruslan Kuprieiev <kupruser@gmail.com> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-12-19 18:59:17 +03:00
Saied Kazemi	0412152fc5	Add inherit fd support There are cases where a process's file descriptor cannot be restored from the checkpoint images. For example, a pipe file descriptor with one end in the checkpointed process and the other end in a separate process (that was not part of the checkpointed process tree) cannot be restored because after checkpoint the pipe will be broken. There are also cases where the user wants to use a new file during restore instead of the original file at checkpoint time. For example, the user wants to change the log file of a process from /path/to/oldlog to /path/to/newlog. In these cases, criu's caller should set up a new file descriptor to be inherited by the restored process and specify the file descriptor with the --inherit-fd command line option. The argument of --inherit-fd has the format fd[%d]:%s, where %d tells criu which of its own file descriptors to use for restoring the file identified by %s. As a debugging aid, if the argument has the format debug[%d]:%s, it tells criu to write out the string after colon to the file descriptor %d. This can be used, for example, as an easy way to leave a "restore marker" in the output stream of the process. It's important to note that inherit fd support breaks applications that depend on the state of the file descriptor being inherited. So, consider inherit fd only for specific use cases that you know for sure won't break the application. For examples please visit http://criu.org/Category:HOWTO. v2: Added a check in send_fd_to_self() to avoid closing an inherit fd. Also, as an extra measure of caution, added checks in the inherit fd look up functions to make sure that the inherit fd hasn't been reused. The patch also includes minor cosmetic changes. Signed-off-by: Saied Kazemi <saied@google.com> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-12-10 12:48:30 +03:00
Pavel Emelyanov	19a76494a9	kerndat: Collect all global variables on one struct Not to spoil the global namespace and unify the kerndat data names. Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-11-11 20:14:53 +04:00
Andrey Vagin	fa266bb0cb	userns: restore per-namespace mappings of user and group IDs (v4) In this patch we fill /proc/PID/uid_map and /proc/PID/gid_map for the root task. v2: initialize groups in a new namespace. Acked-by: Serge E. Hallyn <serge.hallyn@ubuntu.com> v3: add a helper to initialize creds in a new userns v4: initialize userns creds in prepare_namespaces() Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-11-07 17:16:40 +04:00
Cyrill Gorcunov	7688fbc1cf	restore: Defer new net-namespace creation until forked Some kernel modules such as pktgen runs kthred upon new-net creation taking last_pid we were requested. Lets workaround this problem using clone + unshare bundle. Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Acked-by: Andrew Vagin <avagin@parallels.com> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-11-05 15:35:35 +04:00
Andrey Vagin	434879c043	restore: clear breakpoints only for alive tasks We don't set breakpoints for zombies. Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-10-14 10:44:50 +04:00
Tycho Andersen	66d1a60ed4	restore: don't race when closing cg yard TASK_HELPERs are created with CLONE_FILES, so if we always close the cg yard here, it will close it for the other helpers and cause problems. Instead, we close it much later, in code only called by alive tasks, to ensure that there is no conflict. Signed-off-by: Tycho Andersen <tycho.andersen@canonical.com> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-10-08 19:08:12 +04:00
Cyrill Gorcunov	87273ccdb8	cpuinfo: x86 -- Add dump and validation of cpuinfo image, v2 On Wed, Oct 01, 2014 at 04:57:40PM +0400, Pavel Emelyanov wrote: > On 10/01/2014 01:07 AM, Cyrill Gorcunov wrote: > > On Tue, Sep 30, 2014 at 09:18:53PM +0400, Cyrill Gorcunov wrote: > >> If a user requested criu to dump cpuinfo image then we > >> write one on dump and verify on restore. At the moment > >> we require all cpu feature bits to match the destination > >> cpu in a sake of simplicity, but in future we need deps > >> engine which would filer out bits and test if cpu we're > >> restoring on is more capable than one we were dumping at > >> allowing to proceed restore procedure. > >> > >> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> > > > > Updated to new img format Something like attached? >From 59272a9514311e6736cddee08d5f88aa95d49189 Mon Sep 17 00:00:00 2001 From: Cyrill Gorcunov <gorcunov@openvz.org> Date: Thu, 25 Sep 2014 16:04:10 +0400 Subject: [PATCH] cpuinfo: x86 -- Add dump and validation of cpuinfo image If a user requested criu to dump cpuinfo image then we write one on dump and verify on restore. At the moment we require all cpu feature bits to match the destination cpu in a sake of simplicity, but in future we need deps engine which would filer out bits and test if cpu we're restoring on is more capable than one we were dumping at allowing to proceed restore procedure. Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-10-03 13:26:57 +04:00
Pavel Emelyanov	c443b03e10	rst: Rework the rst_info referencing Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-10-01 13:34:38 +04:00
Pavel Emelyanov	295090c1ea	img: Introduce the struct cr_img We want to have buffered images to speed up dump and, slightly, restore. Right now we use plan file descriptors to write and read images to/from. Making them buffered cannot be gracefully done on plain fds, so introduce a new class. This will also help if (when?) we will want to do more complex changes with images, e.g. store them all in one file or send them directly to the network. For now the cr_img just contains one int _fd variable. This patch chages the prototype of open_image() to return struct cr_img , pb_(read\|write) to accept one and fixes the compilation of the rest of the code :) Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Acked-by: Cyrill Gorcunov <gorcunov@openvz.org>	2014-09-30 21:48:13 +04:00
Pavel Emelyanov	35be2ee262	img: Don't return fd, return -1 instead The same -- int-fd will soon go away, so return the explicit int -1 instead of it. Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Acked-by: Cyrill Gorcunov <gorcunov@openvz.org>	2014-09-30 21:48:11 +04:00
Pavel Emelyanov	42821edccf	img: Use errno when checking optional images open fail There will be no int-fd soon, so one more preparation to this fact. Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Acked-by: Cyrill Gorcunov <gorcunov@openvz.org>	2014-09-30 21:48:11 +04:00
Pavel Emelyanov	9d9ac53cd5	rst: Don't use write_img_buf for setting last_pid sysctl The write_img_buf will be used only for images writing, while in this place we just have a raw file descriptor. Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Acked-by: Cyrill Gorcunov <gorcunov@openvz.org>	2014-09-30 21:48:04 +04:00
Pavel Emelyanov	ab50f6ac18	ptrace: Factor out pie stopping code Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Acked-by: Andrey Vagin <avagin@parallels.com>	2014-09-23 20:36:10 +04:00
Andrey Vagin	48fcc7994d	ptrace: flush breakpoints Unfortunately the kernel doesn't flush hw breakpoints on detaching ptrace. If a breakpoint is triggered without ptrace, it will be killed by SIGTRAP. Reported-by: Mr Jenkins Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-09-22 18:03:03 +04:00
Andrew Vagin	13fc78b907	ptrace: say to parasite_stop_on_syscall where is we now On restore parasite_stop_on_syscall() can be called after PTRACE_SYSCALL and after a breakpoint. parasite_stop_on_syscall() must be called only after PTRACE_SYSCALL, so all tests where is one process stuck. Reported-by: Mr Jenkins Signed-off-by: Andrew Vagin <avagin@openvz.org> Acked-by: Tycho Andersen <tycho.andersen@canonical.com> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-09-22 12:49:45 +04:00
Andrey Vagin	248fc31531	restore: use breakpoints instead of tracing syscalls Currently CRIU traces syscalls to catch a moment, when sigreturn() is called. Now we trace recv(cmd), close(logfd), close(cmdfd), sigreturn(). We can reduce a number of steps by using hw breakpoints. A breakpoint is set before sigreturn, so we will need to trace only it. Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-09-19 17:57:18 +04:00
Tycho Andersen	f020bef776	remap: add a dead pid /proc remap If a file like /proc/20/mountinfo is open, but 20 is a zombie (or doesn't exist any more), we can't read this file at all, so a link remap won't work. Instead, we add a new remap, called the dead process remap, which forks a TASK_HELPER as that dead pid so that the restore task can open the new /proc/20/mountinfo instead. This commit also adds a new stage CR_STATE_RESTORE_SHARED. Since new TASK_HELPERS are added when loading the shared resource images, we need to wait to start forking tasks until after these resources are loaded. v2: fix a mutex bug Signed-off-by: Tycho Andersen <tycho.andersen@canonical.com> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-09-19 17:42:48 +04:00
Tycho Andersen	c09ba04c48	restore: TASK_HELPERs live until RESTORE stage ends In order to use TASK_HELPERS to open files from dead processes, they should persist until criu is done restoring the filesystem, which happens in the RESTORE stage. To do this, we need to pass each helper's PIDs to the restorer blob, so that it can wait() on them when the restore stage is done. This commit is in preparation for the remap_dead_pid commits. v2: wait() on helpers after restore stage is over v3: add CR_STATE_RESTORE_FS stage v4: CR_STATE_RESTORE_FS waits for nr_tasks + nr_helpers, not nr_threads v5: ditch CR_STATE_RESTORE_FS in favor of passing helpers to restorer blob Signed-off-by: Tycho Andersen <tycho.andersen@canonical.com> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-09-19 17:42:46 +04:00
Pavel Emelyanov	cc4492e1c6	rst: Don't allocate page for child stack (v2) When clone-ing kids we can set their stack on current, as it will anyway be COW-ed later. One thing to note -- we do need to reserve some space on the stack for glibc's arguments and retcode allocation. 128 bytes should be enough for 16 pointers while clone has 5 arguments. Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-09-18 20:27:06 +04:00
Ruslan Kuprieiev	2dcafd1419	restore: return -1 if fail In cr_dump_tasks() we expect restore_root_task to return < 0 if error ocures. Signed-off-by: Ruslan Kuprieiev <kupruser@gmail.com> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-09-18 20:20:11 +04:00
Pavel Emelyanov	4eec4c6ea1	rst: Don't allocate PATH_MAX for /proc/self realink Pid is 10 chars maximum. Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-09-16 23:16:18 +04:00
Pavel Emelyanov	53957fadc3	restore: Introduce the --restore-sibling option We have a slight mess with how criu restores root task. Right now we have the following options. 1) CLI a) Usually task calling criu `- criu `- root restored task b) when --restore-detached AND root has pdeath_sig task calling criu `- criu `- root restored task 2) Library/SWRK task using lib/swrk `- criu `- root restored task 3) Standalone service a) Usually service `- service sub task `- root restored task b) when root has pdeath_sig criu service `- criu sub task `- root restored task It would be better is CRIU always restored the root task as sibling, but we have 3 constraints: First, the case 1.a is kept for zdtm to run tests in pid namespaces on 3.11, which in turn doesn't allow CLONE_PARENT \| CLONE_NEWPID. Second, CLI w/o --restore-detach waits for the restored task to die and this behavior can be "expected" already. Third, in case of standalone service tasks shouldn't become service's children. And I have one "plan". The p.haul project while live migrating tasks on destination node starts a service, which uses library/swrk mode. In this case the restored processes become p.haul service's kids which is also not great. That said, here's the option called --restore-child that pairs the --restore-detach like this: * detached AND child: task `- criu restore (exits at the end) `- root task The root task will become task's child. This will be default to library/swrk. This is what LXC needs. * detach AND !child task `- criu restore (exits at the end) `- root task The root task will get re-parented to init. This will be compatible with 1.3. This will be default to standalone service and to my wish with the p.haul case. * !detach AND child task `- criu restore (waits for root task to die) `- root task This should be deprecated, so that criu restore doesn't mess with task <-> root task signalling. * !detach AND !child task `- criu restore (waits for root task to die) `- root task This is how plain criu restore works now. Signed-off-by: Pavel Emelyanov <xemul@parallels.com> Acked-by: Tycho Andersen <tycho.andersen@canonical.com> Acked-by: Andrew Vagin <avagin@openvz.org>	2014-09-10 18:30:30 +04:00
Tycho Andersen	1ff2500b9e	restore: use root_as_sibling only after defining it root_as_sibling was used in criu_signals_setup(), but was only defined later (when forking the root task for the first time). This meant that the SA_NOCLDSTOP was never masked off, which meant SIGCHLD was never delivered after ptracing the root task. Thus, when the a child of the root task died (e.g. from cr_system), the root task sat in PTRACE_STOP, and the restore task never PTRACE_CONT'd, resulting in a deadlock. Instead, we only unmask SA_NOCLDSTOP right before we PTRACE_SEIZE, after the value is defined. v2: re-work the condition for CLONE_PARENT v3: move unmasking of SA_NOCLDSTOP to restore_root_task v4: keep all the comments in the original code Signed-off-by: Tycho Andersen <tycho.andersen@canonical.com> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-09-10 14:53:31 +04:00
Pavel Emelyanov	17d44de9af	scripts: Use numeric script names Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-09-05 13:48:26 +04:00
Pavel Emelyanov	069bdd9674	scripts: Move scripts code into separate sources Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-09-05 13:48:21 +04:00
Cyrill Gorcunov	3146f58317	plugin: Rework plugins API, v2 Here we define new api to be used in plugins. - Plugin should provide a descriptor with help of CR_PLUGIN_REGISTER macro, or in case if plugin require no init/exit functions -- with CR_PLUGIN_REGISTER_DUMMY. - Plugin should define a plugin hook with help of CR_PLUGIN_REGISTER_HOOK macro. - Now init/exit functions of plugins takes @stage argument which tells plugin which stage of criu it's been called on dump/restore. For exit it also takes @ret which allows plugin to know if something went wrong and it needs to cleanup own resources. The idea behind is to not limit plugins authors with names of functions they might need to use for particular hook. Such new API deprecates olds plugins structure but to keep backward compatibility we will provide a tiny layer of additional code to support old plugins for at least a couple of release cycles. For example a trivial plugin might look like \| #include <sys/types.h> \| #include <sys/stat.h> \| #include <fcntl.h> \| #include <libgen.h> \| #include <errno.h> \| \| #include <sys/socket.h> \| #include <linux/un.h> \| \| #include <stdio.h> \| #include <stdlib.h> \| #include <string.h> \| #include <unistd.h> \| \| #include "criu-plugin.h" \| #include "criu-log.h" \| \| static int dump_ext_file(int fd, int id) \| { \| pr_info("dump_ext_file: fd %d id %d\n", fd, id); \| return 0; \| } \| \| CR_PLUGIN_REGISTER_DUMMY("trivial") \| CR_PLUGIN_REGISTER_HOOK(CR_PLUGIN_HOOK__DUMP_EXT_FILE, dump_ext_file) Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Acked-by: Andrew Vagin <avagin@parallels.com> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-09-03 20:48:36 +04:00
Tycho Andersen	4b4ec8ff61	restore: die properly if restore_one_task fails This is really just the last bit of c32046c9; if restore_one_task() fails, we need to do the same futex wakeup we do everywhere else in this function. v2: use err instead of err_fini_mnt after mount has been finalized normally Signed-off-by: Tycho Andersen <tycho.andersen@canonical.com> Acked-by: Acked-by: Andrew Vagin <avagin@parallels.com> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-08-29 19:34:59 +04:00
Tycho Andersen	dd375cebc9	restore: don't restore cg props if task restore fails Once the task restore has failed, we can just abort, no need to restore the cg props. Signed-off-by: Tycho Andersen <tycho.andersen@canonical.com> Acked-by: Andrew Vagin <avagin@parallels.com> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-08-28 18:54:22 +04:00
Tycho Andersen	c32046c9a4	restore: die if init fails in --restore-detached mode When in --restore-detached (i.e. root_as_sibling) mode, we ptrace(PTRACE_SEIZE) the root task to receive its SIGCHLD in case one of its child tasks dies. However, we don't receive a SIGCHLD if the root task itself dies, so we must explicitly abort. Signed-off-by: Tycho Andersen <tycho.andersen@canonical.com> Acked-by: Andrew Vagin <avagin@parallels.com> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-08-28 18:53:35 +04:00
Andrew Vagin	28b0e16d73	cgroup: call fin_cgroup() on error paths fini_cgroup umounts a cgyard directory, which is mounted in prepare_cgroup(). Reported-by: Mr Jenkins Signed-off-by: Andrew Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-08-26 12:51:42 +04:00
Andrey Vagin	8f17b34abb	criu: Drop redundant newline from pr_perror Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-08-22 19:22:39 +04:00
Pavel Emelyanov	ddd837d9e9	rst: Fix core pointer passed into reading thread core image Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-08-19 13:19:06 +04:00
Ruslan Kuprieiev	60ef59c7ff	restore: use signals_s and signals_p to prepare signals In order to save backward compatibility, criu will try to open signal.img, if no signals_ are found. Signed-off-by: Ruslan Kuprieiev <kupruser@gmail.com> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-08-19 13:09:45 +04:00
Ruslan Kuprieiev	235a41fcf9	restore: open cores for each thread early and store them at current->core We need to open cores for each thread early, because we'll need them to prepare signals later. Signed-off-by: Ruslan Kuprieiev <kupruser@gmail.com> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-08-19 13:09:44 +04:00
Pavel Emelyanov	f781ba0466	rst: Rework task_entries to use rst_mem engine The task_entries is a small structure used to coordinate the processes restore stages. Currentl we allocate one page for it and handle one separately. No need in this complexity, actually. The rst_mem engine is already capable to controll this small object. Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-08-19 13:00:10 +04:00
Pavel Emelyanov	5f9acc8dc9	shmem: Explicitly initialize rst_shmems This is a position in the RM_SHREMAP memory. Since shmems are currently the only user of it, this is validly equals zero, but it will change soon. Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-08-19 13:00:07 +04:00
Cyrill Gorcunov	994ae676b4	restore: Set CLONE_PARENT iif pdeath_sig is present, v4 It's been discovered that on 3.11 we might fail on restore if pass @CLONE_PARENT flag into clone() call due to kernel limitations. Because we're treating 3.11 as a base working kernel lets do a trick instead - setup this flag iif pdeath_sig is present - if CLONE_NEWPID is passed warn a user about potential consequences. - because we need to carry the condition in attach_to_tasks call, introduce @root_as_sibling variable for this. CC: Tycho Andersen <tycho.andersen@canonical.com> CC: Pavel Emelyanov <xemul@parallels.com> CC: Andrey Vagin <avagin@openvz.org> Signed-off-by: Cyrill Gorcunov <gorcunov@openvz.org> Acked-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-08-15 13:26:36 +04:00
Andrey Vagin	0b33bac3bc	criu: allow the root task to handle SIGCHLD If criu process attaches to the root task (it happens for opts.swrk_restore and opts.restore_detach) with ptrace, then any signal delivered to the root would be also delivered to criu. The latter woult treat the former to die due to this delivery and would abort the restore. Fix it by checking that criu (current == NULL) gets ptrace notification (si_code == CLD_TRAPPED) about signal delivered (si_status = SIGCHLD, no other signals are allowed by the restoring tasks). This patch fixes the following error of static/zombie00: Execute zdtm/live/static/zombie00 ./zombie00 --pidfile=zombie00.pid --outfile=zombie00.out Dump 2207 Restore Test: zdtm/live/static/zombie00, Result: FAIL ==================================== ERROR ==================================== Restore log: /root/git/orig/criu/test/dump/static/zombie00/2207/1/restore.log (00.026826) Error (cr-restore.c:1085): 2207 killed by signal 17 (00.026985) Error (cr-restore.c:1706): Restoring FAILED. ================================= ERROR OVER ================================= Reported-by: Mr Jenkins Cc: Pavel Emelyanov <xemul@parallels.com> Cc: Tycho Andersen <tycho.andersen@canonical.com> Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-08-14 17:09:53 +04:00
Tycho Andersen	e301b1d56c	restore: --restore-detached implies CLONE_PARENT We need to use CLONE_PARENT to prevent processes from immediately dying due to pdeath_sig when they are restored in detached mode. [ xemul: One more place which requires check for restore-detach is in sigactions preparation ] Signed-off-by: Tycho Andersen <tycho.andersen@canonical.com> Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-08-14 12:25:07 +04:00
Pavel Emelyanov	15b39a1dd5	pstree: Use task_alive() instead of switch()-es Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-08-12 14:41:10 +04:00
Pavel Emelyanov	548625132d	pstree: Introduce task_alive() helper Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-08-12 14:41:00 +04:00
Pavel Emelyanov	7960379f71	flock: Merge all file lock entries into single image file They are now in per-pid images, but every entry contains a pid to which it "belongs". This belonging is fake -- it's just a pid of a task who placed the lock, while locks really belong to files. We even have a bug when task that locked a file exited and "delegated" the lock to its child. This images merge reduces the amount of image files criu generates and may simplify the fix of mentioned above issue. Signed-off-by: Pavel Emelyanov <xemul@parallels.com>	2014-08-12 14:38:49 +04:00

1 2 3 4 5 ...

712 Commits