mir/criu - criu - Mike's Git repositories

mir/criu

mirror of https://github.com/checkpoint-restore/criu synced 2025-08-31 06:15:24 +00:00

Author	SHA1	Message	Date
Pavel Tikhomirov	62088c721f	criu: put statement continuation on the same line as the closing bracket We should follow Linux Kernel Codding Style: ... the closing brace is empty on a line of its own, except in the cases where it is followed by a continuation of the same statement, ie ... an else in an if-statement ... https://www.kernel.org/doc/html/v4.10/process/coding-style.html#placing-braces-and-spaces Automaticly fixing with: :!git grep --files-with-matches "^\selse[^{]{" \| xargs :argadd <files> :argdo :%s/}\s\n\s\(else[^{]*{\)/} \1/g \| update Signed-off-by: Pavel Tikhomirov <ptikhomirov@virtuozzo.com>	2020-04-25 00:43:23 -07:00
Alexander Mikhalitsyn	d1fa1734ee	autofs: fix integer overflow in mount options parsing In real life cases pipe_ino param could be larger that INT_MAX, but in autofs_parse() function we using atoi function, that uses 4 byte integers. It's a bug. Example of mount info from real case: (00.508286) type autofs source /etc/auto.misc mnt_id 2824 s_dev 0x4b9 / @ ./misc flags 0x300000 options fd=5,pipe_ino=3480845226,pgrp=95929,timeout=300, minproto=5,maxproto=5,indirect 3480845226 > 2147483647 (32-bit wide signed int max value) => we have a problem It causes a error: (03.195915) Error (criu/pipes.c:529): The packetized mode for pipes is not supported yet Signed-off-by: Alexander Mikhalitsyn (Virtuozzo) <alexander@mihalicyn.com>	2020-04-25 00:43:23 -07:00
Nicolas Viennot	6b9faabf39	mem: avoid re-opening CR_FD_PAGES when not needed This commit introduces an optimization when rsti(t)->vma_io is empty. This optimization allows streaming a non-seekable image as CR_FD_PAGES is not reopened. Signed-off-by: Nicolas Viennot <Nicolas.Viennot@twosigma.com>	2020-04-25 00:43:23 -07:00
Nicolas Viennot	4d34f84bb6	img: rellocate a PATH_MAX buffer from the bss section to the stack Reducing our memory footprint by 4K. Improved-by: Andrei Vagin <avagin@gmail.com> Signed-off-by: Nicolas Viennot <Nicolas.Viennot@twosigma.com>	2020-04-25 00:43:23 -07:00
Nicolas Viennot	bb0b4219ef	img: fix image_name() when image is empty When an image is opened but errored with a ENOENT error, the image is still valid. Later on, do_pb_read_one() can fail and will invoke image_name(). The image fd is EMPTY_IMG_FD (-404). read_fd_link fails. Signed-off-by: Nicolas Viennot <Nicolas.Viennot@twosigma.com>	2020-04-25 00:43:23 -07:00
Andrei Vagin	067a20c815	zdtm: fail if test with the crfail tag passes Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-04-25 00:43:23 -07:00
Andrei Vagin	698f3a4dbd	zdtm: limit the line length for ps by 160 symbols By default, this limit is 80 symbols and this isn't enough: 4730 pts/0 S+ 0:00 \_ ./zdtm_ct zdtm.py 7535 4731 pts/0 S+ 0:00 \| \_ python zdtm.py 7536 4839 pts/0 S+ 0:00 \| \_ python zdtm.p 7537 4861 pts/0 S+ 0:00 \| \_ make --no 7538 4882 pts/0 S+ 0:00 \| \_ ./mnt 7539 4883 ? Ss 0:00 \| \_ . Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-04-25 00:43:23 -07:00
Andrei Vagin	eab1a30748	timens: restore processes in a new timens to restore clocks After restoring processes, we have to be sure that monotonic and boottime clocks will not go backward. For this, we can restore processes in a new time namespace and set proper offsets for the clocks. In this patch, criu dumps clocks values event when processes are running in this host time namespace and on restore, criu creates a new time namespace, sets dumped clock values and restores processes. Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-04-25 00:43:23 -07:00
Andrei Vagin	73438d34bb	test: check that C/R of nested time namespaces fails Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-04-25 00:43:23 -07:00
Andrei Vagin	0d8c0562f9	zdtm_ct: run each test in a new time namespace Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-04-25 00:43:23 -07:00
Andrei Vagin	f1655fd540	zdtm: add a new test to check c/r of time namespaces This test checks that monotonic and boottime don't jump after C/R. In ns and uns flavors, the test is started in a separate time namespace with big offsets, so if criu will restore a time namespace incorrectly the test will detect the big delta of clocks values before and after C/R. Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-04-25 00:43:23 -07:00
Andrei Vagin	3fd0fa4bdc	zdtm: add support for time namespaces For ns and uns flavors, tests run in separate time namespaces. Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-04-25 00:43:23 -07:00
Andrei Vagin	ddba4af608	namespace: fail if ns/time_for_children isn't equal to ns/time This case isn't supported right now. Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-04-25 00:43:23 -07:00
Andrei Vagin	4127ef4ab7	criu: Add support for time namespaces The time namespace allows for per-namespace offsets to the system monotonic and boot-time clocks. C/R of time namespaces are very straightforward. On dump, criu enters a target time namespace and dumps currents clocks values, then on restore, criu creates a new namespace and restores clocks values. Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-04-25 00:43:23 -07:00
Pavel Tikhomirov	0e9b42acf9	MAINTAINERS: Add Pavel (myself) to maintainers Hope I have enough experience in the project to be nominated. I want to help with review and will try to do my best in it. Signed-off-by: Pavel Tikhomirov <ptikhomirov@virtuozzo.com>	2020-04-06 23:19:57 -07:00
Pavel Tikhomirov	e3fb52e375	remove header include statements duplicates Revert "util: introduce the mount_detached_fs helper" This reverts commit `5dbc24b206`. Revert "criu: Make use strlcpy() to copy into allocated strings" This reverts commit `bc49927bbc`. Fixes for https://github.com/checkpoint-restore/criu/pull/1003 Signed-off-by: Pavel Tikhomirov <ptikhomirov@virtuozzo.com>	2020-03-30 19:43:32 -07:00
Pavel Emelyanov	fcb23dbfcf	Merge pull request #1003 from avagin/v3.14-part2 Prepare v3.14 (part 2)	2020-03-30 13:50:56 +03:00
Andrei Vagin	8c36865c84	memfd: split the struct memfd_inode The struct memfd_inode has a union for dump and restore parts. The only common parts are the list_head node, and the inode id. Suggested-by: Nicolas Viennot <Nicolas.Viennot@twosigma.com> Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-03-27 19:36:20 +03:00
Andrei Vagin	e3a5d09752	memfd: save all memfd inodes in one image Per-object image is acceptable if we expect to have 1-3 objects per-container. If we expect to have more objects, it is better to save them all into one image. There are a number of reasons for this: * We need fewer system calls to read all objects from one image. * It is faster to save or move one image. Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-03-27 19:36:20 +03:00
Byeonggon Lee	967797a867	Add build directory to gitignore After running make install, build directory is generated but not ignored in gitignore. So this commit add build directory to gitignore. Signed-off-by: Byeonggon Lee <gonny952@gmail.com>	2020-03-27 19:36:20 +03:00
Pavel Tikhomirov	cc362b432e	namespaces: fix error handling in dump_user_ns Fix n_xid_map leaks on error path and remove useless exit_code. Fixes: `6e1726f8` ("userns: set uid and gid before entering into userns") Signed-off-by: Pavel Tikhomirov <ptikhomirov@virtuozzo.com>	2020-03-27 19:36:20 +03:00
Andrei Vagin	1ad8657ddb	config/nftables: include string.h for strlen Fixes: `9433b7b9db` ("make: use cflags/ldflags for config.h detection mechanism") Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-03-27 19:36:20 +03:00
Andrei Vagin	5f28b692a0	test/fifo_loop: change sizes of all fifo-s to fit a test buffer This test doesn't expect that the write operation will block. Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-03-27 19:36:20 +03:00
Andrei Vagin	1ad209b9c2	test/pipe03: check that pipe size is restored Create two pipes with and without queued data. Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-03-27 19:36:20 +03:00
Andrei Vagin	2b376168ef	pipe: restore pipe size even if a pipe is empty Without this patch, pipe size is restored only if a pipe has queued data. Reported-by: Mr Jenkins Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-03-27 19:36:20 +03:00
Valeriy Vdovin	fa705e418b	zdtm: Use safe helper function to initialize unix socket sockaddr structure The helper function removes code duplication from tests that want to initialize unix socket address to an absolute file path, derived from current working directory of the test + relative filename of a resulting socket. Because the former code used cwd = get_current_dir_name() as part of absolute filename generation, the resulting filepath could later cause failure of bind systcall due to unchecked permissions and introduce confusing permission errors. Signed-off-by: Valeriy Vdovin <valeriy.vdovin@virtuozzo.com>	2020-03-27 19:36:20 +03:00
Valeriy Vdovin	691b4a4e7e	zdtm: Implemented get_current_dir_name wrapper that checks for 'x' permissions Any filesystem syscall, that needs to navigate to inode by it's absolute path performs successive lookup operations for each part of the path. Lookup operation includes access rights check. Usually but not always zdtm tests processes fall under 'other' access category. Also, usually directories don't have 'x' bit set for other. In case when bit 'x' is not set and user-ID and group-ID of a process relate it to 'other', test's will not succeed in performing these syscalls which are most of filesystem api, that has const char *path as part of it arguments (open, openat, mkdir, bind, etc). The observable behavior of that is that zdtm tests fail at file creation ops on one system and pass on the other. The above is not immediately clear to the developer by just looking at failed test's logs. Investigation of that is also not quick for a developer due to the complex structure of zdtm runtime where nested clones with NAMESPACE flags take place alongside with bind-mounts. As an additional note: 'get_current_dir_name' is documented as returning EACCESS in case when some part of the path lacks read/list permissions. But in fact it's not always so. Practice shows, that test processes can get false success on this operation only to fail on later call to something like mkdir/mknod/bind with a given path in arguments. 'get_cwd_check_perm' is a wrapper around 'get_current_dir_name'. It also checks for permissions on the given filepath and logs the error. This directs the developer towards the right investigation path or even eliminates the need for investigation completely. Signed-off-by: Valeriy Vdovin <valeriy.vdovin@virtuozzo.com>	2020-03-27 19:36:20 +03:00
Andrei Vagin	c40c09cbbf	test/zdtmp: add a test to C/R shared memory file descriptors Any shared memory region can be openned via /proc/self/map_files. Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-03-27 19:36:20 +03:00
Andrei Vagin	10b1d46f67	mem/vma: set VMA_FILE_{PRIVATE,SHARED} if a vma file is borrowed Here is a fast path when two consequent vma-s share the same file. But one of these vma-s can map a file with MAP_SHARED, but another one can map it with MAP_PRIVATE and we need to take this into account.	2020-03-27 19:36:20 +03:00
Andrei Vagin	fb65ab2b1a	mem: dump shared memory file descriptors Any shared memroy mapping can be opened via /proc/self/maps_files/. Such file descriptors look like memfd file descriptors, so they can be dumped by the same way. Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-03-27 19:36:20 +03:00
Nicolas Viennot	f42ae70c75	make: use cflags/ldflags for config.h detection mechanism The config.h detection scripts should use the provided CFLAGS/LDFLAGS as it tries to link libnl, libnet, and others. Signed-off-by: Nicolas Viennot <Nicolas.Viennot@twosigma.com>	2020-03-27 19:36:20 +03:00
Andrei Vagin	d0d6f1ad10	mailmap: update my email Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-03-27 19:36:20 +03:00
Mike Rapoport	c3ad4942d4	travis: add ppc64-cross test on amd64 Signed-off-by: Mike Rapoport <rppt@linux.ibm.com>	2020-03-27 19:36:20 +03:00
Alexander Mikhalitsyn	b9c8e957d8	crit-recode: skip (not try to parse) nftables raw image We should ignore (not parse) images that has non-crtool format, that images has no magic number (RAW_IMAGE_MAGIC equals 0). nftables images has format compatible with `nft -f /proc/self/fd/0` input format. Reported-by: Mr Jenkins Signed-off-by: Alexander Mikhalitsyn (Virtuozzo) <alexander@mihalicyn.com>	2020-03-27 19:36:20 +03:00
Dmitry Safonov	1f74f8d770	travis: Use debian/buster as base for cross build tests Jessie is called 'oldoldstable', migrate to Buster. Suggested-by: Adrian Reber <areber@redhat.com> Signed-off-by: Dmitry Safonov <dima@arista.com>	2020-03-27 19:36:20 +03:00
Dmitry Safonov	18ac1540c4	travis: Add aarch64-cross test on amd64 Fixes: #924 Signed-off-by: Dmitry Safonov <dima@arista.com>	2020-03-27 19:36:20 +03:00
Dmitry Safonov	327554ee64	compel: Remove compel.h The file only includes other headers (which may be not needed). If we aim for one-include-for-compel, we could instead paste all subheaders into "compel.h". Rather, I think it's worth to migrate to more fine-grained compel headers than follow the strategy 'one header to rule them all'. Further, the header creates problems for cross-compilation: it's included in files, those are used by host-compel. Which rightfully confuses compiler/linker as host's definitions for fpu regs/other platform details get drained into host's compel. Signed-off-by: Dmitry Safonov <dima@arista.com>	2020-03-27 19:36:20 +03:00
Dmitry Safonov	62ad2f6095	criu: Remove compel.h includes The plan is to remove "compel.h". That file only includes other headers (which may be not needed). If we aim for one-include-for-compel, we could instead paste all subheaders into "compel.h". Rather, I think it's worth to migrate to more fine-grained compel headers than follow the strategy 'one header to rule them all'. Further, the header creates problems for cross-compilation: it's included in files, those are used by host-compel. Which rightfully confuses compiler/linker as host's definitions for fpu regs/other platform details get drained into host's compel. As a first step - stop including "compel.h" in criu. Signed-off-by: Dmitry Safonov <dima@arista.com>	2020-03-27 19:36:20 +03:00
Andrei Vagin	065ff6f415	zdtm/fifo_loop: don't try to write more than pipe size ... otherwise write() can block. Reported-by: Mr Jenkins Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-03-27 19:36:20 +03:00
Pavel Tikhomirov	73e0ed3b8a	zdtm: add a test on open symlink migration Signed-off-by: Pavel Tikhomirov <ptikhomirov@virtuozzo.com> Co-Developed-by: Vitaly Ostrosablin <vostrosablin@virtuozzo.com> Signed-off-by: Vitaly Ostrosablin <vostrosablin@virtuozzo.com> Signed-off-by: Alexander Mikhalitsyn (Virtuozzo) <alexander@mihalicyn.com>	2020-03-27 19:36:20 +03:00
Alexander Mikhalitsyn	1936608ce4	files: allow dumping opened symlinks To really open symlink file and not the regular file below it, one needs to do open with O_PATH\|O_NOFOLLOW flags. Looks like systemd started to open /etc/localtime symlink this way sometimes, and before that nobody actually used this and thus we never supported this in CRIU. Error (criu/files-ext.c:96): Can't dump file 11 of that type [120777] (unknown /etc/localtime) Looks like it is quiet easy to support, as c/r of symlink file is almost the same as c/r of regular one. We need to only make fstatat not following links in check_path_remap. Also we need to take into account support of ghost symlinks. Signed-off-by: Alexander Mikhalitsyn (Virtuozzo) <alexander@mihalicyn.com> Co-developed-by: Pavel Tikhomirov <ptikhomirov@virtuozzo.com> Signed-off-by: Pavel Tikhomirov <ptikhomirov@virtuozzo.com>	2020-03-27 19:36:20 +03:00
Pavel Tikhomirov	8b9c1f4c5b	zdtm: add a test for files opened with O_PATH On these test without the patch ("fown: Don't fail on dumping files opened wit O_PATH") we trigger these errors: Error (criu/pie/parasite.c:340): fcntl(4, F_GETOWN_EX) -> -9 Error (criu/files.c:403): Can't get owner signum on 18: Bad file descriptor Error (criu/files-reg.c:1887): Can't restore file pos: Bad file descriptor Signed-off-by: Pavel Tikhomirov <ptikhomirov@virtuozzo.com> Signed-off-by: Alexander Mikhalitsyn (Virtuozzo) <alexander@mihalicyn.com>	2020-03-27 19:36:20 +03:00
Cyrill Gorcunov	f167d1f4e9	fown: Don't fail on dumping files opened with O_PATH O_PATH opened files are special: they have empty file operations in kernel space, so there not that much we can do with them, even setting position is not allowed. Same applies to a signal number for owner settings. Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com> Co-developed-by: Alexander Mikhalitsyn <alexander@mihalicyn.com> Signed-off-by: Alexander Mikhalitsyn (Virtuozzo) <alexander@mihalicyn.com>	2020-03-27 19:36:20 +03:00
Andrei Vagin	58fd63042c	zdtm/inhfd: force python to read new data from a file python 2.7 doesn't call the read system call if it's read file to the end once. The next seek allows to workaround this problem. inhfd/memfd.py hangs due to this issue. Reported-by: Mr Jenkins Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-03-27 19:36:20 +03:00
Andrei Vagin	fce196d88d	memfd: don't corrupt a state of the dumped fd Right now, criu uses a dumped fd to dump content of a memfd "file". Here are two reasons why we should not do this: * a state of a dumped fd doesn't have to be changed, but now criu calls lseek on it. This can be workarounded by using pread. * a dumped descriptor can be write-only. Reported-by: Mr Jenkins Cc: Nicolas Viennot <Nicolas.Viennot@twosigma.com> Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-03-27 19:36:20 +03:00
Andrei Vagin	ffe0896ed0	fs: use __open_proc instead of open("/proc/...", ... ) Processes can run in a mount namespace without /proc. Reported-by: Mr Jenkins Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-03-27 19:36:20 +03:00
Adrian Reber	4129d3262a	cgroup2: add minimal cgroup2 support The runc test cases are (sometimes) mounting a cgroup inside of the container. For these tests to succeed, let CRIU know that cgroup2 exists and how to restore such a mount. This does not fix any specific cgroup2 settings, it just enables CRIU to mount cgroup2 in the restored container. Signed-off-by: Adrian Reber <areber@redhat.com>	2020-03-27 19:36:20 +03:00
Adrian Reber	10416bcbcb	seize: support cgroup v2 freezer This adds support to checkpoint processes using the cgroup v2 freezer. Signed-off-by: Adrian Reber <areber@redhat.com>	2020-03-27 19:36:20 +03:00
Adrian Reber	9f902e0c6b	seize: factor out opening and writing the freezer state More preparations for cgroupv2 freezer. Factor our the freezer state opening and writing to have one location where to handle v1 and v2 differences. Signed-off-by: Adrian Reber <areber@redhat.com>	2020-03-27 19:36:20 +03:00
Adrian Reber	563c5e5e76	seize: prepare for cgroupv2 freezer The cgroupv2 freezer does not return the same strings as v1. Instead of THAWED and FROZEN v2 returns 0 and 1 (strings). This prepares the seize code to use 0 and 1 everywhere and THAWED and FROZEN only for v1 specific code paths. Signed-off-by: Adrian Reber <areber@redhat.com>	2020-03-27 19:36:20 +03:00

1 2 3 4 5 ...

10170 Commits