mir/bind - bind - Mike's Git repositories

mir/bind

mirror of https://gitlab.isc.org/isc-projects/bind9 synced 2025-08-27 20:49:04 +00:00

Author	SHA1	Message	Date
Evan Hunt	5ea26ee1f1	modify reference counting within netmgr - isc__nmhandle_get() now attaches to the sock in the nmhandle object. the caller is responsible for dereferencing the original socket pointer when necessary. - tcpdns listener sockets attach sock->outer to the outer tcp listener socket. tcpdns connected sockets attach sock->outerhandle to the handle for the tcp connected socket. - only listener sockets need to be attached/detached directly. connected sockets should only be accessed and reference-counted via their associated handles.	2020-06-19 09:39:50 +02:00
Evan Hunt	9e740cad21	make isc_nmsocket_{attach,detach}{} functions private there is no need for a caller to reference-count socket objects. they need tto be able tto close listener sockets (i.e., those returned by isc_nm_listen{udp,tcp,tcpdns}), and an isc_nmsocket_close() function has been added for that. other sockets are only accessed via handles.	2020-06-19 09:39:50 +02:00
Ondřej Surý	1013c0930e	Add missing acquire memory barrier in isc_nmhandle_unref The ThreadSanitizer uses system synchronization primitives to check for data race. The netmgr handle->references was missing acquire memory barrier before resetting and reusing the memory occupied by isc_nmhandle_t.	2020-06-11 13:01:26 +02:00
Witold Kręcicki	85d8e4bf76	Fix a race in TCP accepting. There's a possibility of a race in TCP accepting code: T1 accepts a connection C1 T2 accepts a connection C2 T1 tries to accept a connection C3, but we hit a quota, isc_quota_cb_init() sets quota_accept_cb for the socket, we return from accept_connection T2 drops C2, but we race in quota_release with accepting C3 so we don't see quota->waiting is > 0, we don't launch the callback T1 accepts a connection C4, we are able to get the quota we clear the quota_accept_cb from sock->quotacb T1 drops C1, tries to call the callback which is zeroed, sigsegv.	2020-06-10 11:37:27 -07:00
Witold Kręcicki	801f7af6e9	isc_uv_import must pass UV__IPC_SOCKET_XFER_TCP_CONNECTION, not SERVER. As a leftover from old TCP accept code isc_uv_import passed TCP_SERVER flag when importing a socket on Windows. Since now we're importing/exporting accepted connections it needs to pass TCP_CONNECTION flag.	2020-06-03 20:08:54 +00:00
Ondřej Surý	4ec357da0a	Don't check the result of setting SO_INCOMING_CPU The SO_INCOMING_CPU is available since Linux 3.19 for getting the value, but only since Linux 4.4 for setting the value (see below for a full description). BIND 9 should not fail when setting the option on the socket fails, as this is only an optimization and not hard requirement to run BIND 9. SO_INCOMING_CPU (gettable since Linux 3.19, settable since Linux 4.4) Sets or gets the CPU affinity of a socket. Expects an integer flag. int cpu = 1; setsockopt(fd, SOL_SOCKET, SO_INCOMING_CPU, &cpu, sizeof(cpu)); Because all of the packets for a single stream (i.e., all packets for the same 4-tuple) arrive on the single RX queue that is associated with a particular CPU, the typical use case is to employ one listening process per RX queue, with the incoming flow being handled by a listener on the same CPU that is handling the RX queue. This provides optimal NUMA behavior and keeps CPU caches hot.	2020-06-03 12:44:44 +02:00
Witold Kręcicki	7ef756f639	Clear sock->magic to 0 when destroying a netmgr socket	2020-05-29 19:18:58 +00:00
Witold Kręcicki	a8807d9a7b	Add missing isc_mutex_destroy and isc_conditional_destroy calls. While harmless on Linux, missing isc_{mutex,conditional}_destroy causes a memory leak on *BSD. Missing calls were added.	2020-05-29 19:18:58 +00:00
Evan Hunt	68a1c9d679	change 'expr == true' to 'expr' in conditionals	2020-05-25 16:09:57 -07:00
Witold Kręcicki	60629e5b0b	Redesigned TCP accepting: one listen/accept loop, passing the connected socket. Instead of using bind() and passing the listening socket to the children threads using uv_export/uv_import use one thread that does the accepting, and then passes the connected socket using uv_export/uv_import to a random worker. The previous solution had thundering herd problems (all workers waking up on one connection and trying to accept()), this one avoids this and is simpler. The tcp clients quota is simplified with isc_quota_attach_cb - a callback is issued when the quota is available.	2020-05-13 08:45:27 +02:00
Witold Kręcicki	fa02f6438b	Don't set UDP recv/send buffer sizes - use system defaults (unless explicitly defined)	2020-05-01 17:04:00 +02:00
Ondřej Surý	09ba47b067	Use SO_REUSEPORT only on Linux, use SO_REUSEPORT_LB on FreeBSD The SO_REUSEPORT socket option on Linux means something else on BSD based systems. On FreeBSD there's 1:1 option SO_REUSEPORT_LB, so we can use that.	2020-05-01 15:20:55 +02:00
Witold Kręcicki	83049ceabf	Don't free udp recv buffer if UV_UDP_MMSG_CHUNK is set	2020-04-30 17:30:37 +02:00
Ondřej Surý	d5356a40ff	Use UV_UDP_RECVMMSG to enable mmsg support in libuv if available	2020-04-30 17:30:37 +02:00
Ondřej Surý	978c7b2e89	Complete rewrite the BIND 9 build system The rewrite of BIND 9 build system is a large work and cannot be reasonable split into separate merge requests. Addition of the automake has a positive effect on the readability and maintainability of the build system as it is more declarative, it allows conditional and we are able to drop all of the custom make code that BIND 9 developed over the years to overcome the deficiencies of autoconf + custom Makefile.in files. This squashed commit contains following changes: - conversion (or rather fresh rewrite) of all Makefile.in files to Makefile.am by using automake - the libtool is now properly integrated with automake (the way we used it was rather hackish as the only official way how to use libtool is via automake - the dynamic module loading was rewritten from a custom patchwork to libtool's libltdl (which includes the patchwork to support module loading on different systems internally) - conversion of the unit test executor from kyua to automake parallel driver - conversion of the system test executor from custom make/shell to automake parallel driver - The GSSAPI has been refactored, the custom SPNEGO on the basis that all major KRB5/GSSAPI (mit-krb5, heimdal and Windows) implementations support SPNEGO mechanism. - The various defunct tests from bin/tests have been removed: bin/tests/optional and bin/tests/pkcs11 - The text files generated from the MD files have been removed, the MarkDown has been designed to be readable by both humans and computers - The xsl header is now generated by a simple sed command instead of perl helper - The <irs/platform.h> header has been removed - cleanups of configure.ac script to make it more simpler, addition of multiple macros (there's still work to be done though) - the tarball can now be prepared with `make dist` - the system tests are partially able to run in oot build Here's a list of unfinished work that needs to be completed in subsequent merge requests: - `make distcheck` doesn't yet work (because of system tests oot run is not yet finished) - documentation is not yet built, there's a different merge request with docbook to sphinx-build rst conversion that needs to be rebased and adapted on top of the automake - msvc build is non functional yet and we need to decide whether we will just cross-compile bind9 using mingw-w64 or fix the msvc build - contributed dlz modules are not included neither in the autoconf nor automake	2020-04-21 14:19:48 +02:00
Ondřej Surý	4df5a5832c	Remove files generated by autotools	2020-04-21 14:19:30 +02:00
Ondřej Surý	26842ac25c	Remove the extra decstats on STATID_ACTIVE for children sockets	2020-04-03 19:41:46 +02:00
Witold Kręcicki	01c4c3301e	Deactivate the handle before sending the async close callback. We could have a race between handle closing and processing async callback. Deactivate the handle before issuing the callback - we have the socket referenced anyway so it's not a problem.	2020-03-30 10:26:05 +02:00
Witold Kręcicki	5fedd21e16	netmgr refactoring: use generic functions when operating on sockets. tcpdns used transport-specific functions to operate on the outer socket. Use generic ones instead, and select the proper call in netmgr.c. Make the missing functions (e.g. isc_nm_read) generic and add type-specific calls (isc__nm_tcp_read). This is the preparation for netmgr TLS layer.	2020-03-24 20:31:43 +00:00
Ondřej Surý	3178974f0c	Use the new sorting rules to regroup #include headers	2020-03-09 16:19:22 +01:00
Witold Kręcicki	4b9962d4a3	Only use tcpdns timer if it's initialized.	2020-03-05 23:13:39 +01:00
Witold Kręcicki	ae1499ca19	Fix TCPDNS socket closing issues	2020-03-05 18:02:27 +00:00
Witold Kręcicki	fc9792eae8	Limit TCP connection quota logging to 1/s	2020-03-05 18:02:27 +00:00
Witold Kręcicki	fc9e2276ca	Proper accounting of active TCP connections	2020-03-05 18:02:27 +00:00
Evan Hunt	0b76d8a490	comments	2020-02-28 08:46:16 +01:00
Witold Kręcicki	4791263def	Increase inactivehandles and inactivereqs size for better reuse.	2020-02-28 08:46:16 +01:00
Witold Kręcicki	517e6eccdf	use SO_INCOMING_CPU for UDP sockets	2020-02-28 08:46:16 +01:00
Witold Kręcicki	a658f7976c	We don't need to fill udp local address every time since we are bound to it.	2020-02-28 08:46:16 +01:00
Witold Kręcicki	eb874608c1	Use the original threadid when sending a UDP packet to decrease probability of context switching	2020-02-28 08:46:16 +01:00
Evan Hunt	ba0313e649	fix spelling errors reported by Fossies.	2020-02-21 15:05:08 +11:00
Witold Kręcicki	093af1a609	Use libuv-provided uv_{export,import} if available. We were using our own versions of isc_uv_{export,import} functions for multithreaded TCP listeners. Upcoming libuv version will contain proper uv_{export,import} functions - use them if they're available.	2020-02-18 12:17:55 +01:00
Witold Kręcicki	a0d36d7601	Make nm->recvbuf larger and heap allocated, to allow uv_recvmmsg usage. Upcoming version of libuv will suport uv_recvmmsg and uv_sendmmsg. To use uv_recvmmsg we need to provide a larger buffer and be able to properly free it.	2020-02-18 12:17:55 +01:00
Ondřej Surý	5777c44ad0	Reformat using the new rules	2020-02-14 09:31:05 +01:00
Evan Hunt	e851ed0bb5	apply the modified style	2020-02-13 15:05:06 -08:00
Ondřej Surý	056e133c4c	Use clang-tidy to add curly braces around one-line statements The command used to reformat the files in this commit was: ./util/run-clang-tidy \ -clang-tidy-binary clang-tidy-11 -clang-apply-replacements-binary clang-apply-replacements-11 \ -checks=-,readability-braces-around-statements \ -j 9 \ -fix \ -format \ -style=file \ -quiet clang-format -i --style=format $(git ls-files '.c' '.h') uncrustify -c .uncrustify.cfg --replace --no-backup $(git ls-files '.c' '.h') clang-format -i --style=format $(git ls-files '.c' '*.h')	2020-02-13 22:07:21 +01:00
Ondřej Surý	f50b1e0685	Use clang-format to reformat the source files	2020-02-12 15:04:17 +01:00
Witold Kręcicki	a133239698	Don't limit the size of uvreq/nmhandle pool artificially. There was a hard limit set on number of uvreq and nmhandles that can be allocated by a pool, but we don't handle a situation where we can't get an uvreq. Don't limit the number at all, let the OS deal with it.	2020-02-11 12:10:57 +00:00
Ondřej Surý	bc1d4c9cb4	Clear the pointer to destroyed object early using the semantic patch Also disable the semantic patch as the code needs tweaks here and there because some destroy functions might not destroy the object and return early if the object is still in use.	2020-02-09 18:00:17 -08:00
Ondřej Surý	41fe9b7a14	Formatting issues found by local coccinelle run	2020-02-08 03:12:09 -08:00
Mark Andrews	0be2dc9f22	break was on wrong line. 959 break; CID 1457872 (#1 of 1): Structurally dead code (UNREACHABLE) unreachable: This code cannot be reached: isc__nm_incstats(sock->mgr,.... 960 isc__nm_incstats(sock->mgr, sock->statsindex[STATID_ACTIVE]); 961 default:	2020-02-05 18:37:17 +11:00
Witold Kręcicki	fd8788eb94	Fix possible race in socket destruction. When two threads unreferenced handles coming from one socket while the socket was being destructed we could get a use-after-free: Having handle H1 coming from socket S1, H2 coming from socket S2, S0 being a parent socket to S1 and S2: Thread A Thread B Unref handle H1 Unref handle H2 Remove H1 from S1 active handles Remove H2 from S2 active handles nmsocket_maybe_destroy(S1) nmsocket_maybe_destroy(S2) nmsocket_maybe_destroy(S0) nmsocket_maybe_destroy(S0) LOCK(S0->lock) Go through all children, figure out that we have no more active handles: sum of S0->children[i]->ah == 0 UNLOCK(S0->lock) destroy(S0) LOCK(S0->lock) - but S0 is already gone	2020-01-20 22:28:36 +01:00
Witold Kręcicki	42f0e25a4c	calling isc__nm_udp_send() on a non-udp socket is not 'unexpected', it's a critical failure	2020-01-20 22:28:36 +01:00
Witold Kręcicki	8d6dc8613a	clean up some handle/client reference counting errors in error cases. We weren't consistent about who should unreference the handle in case of network error. Make it consistent so that it's always the client code responsibility to unreference the handle - either in the callback or right away if send function failed and the callback will never be called.	2020-01-20 22:28:36 +01:00
Witold Kręcicki	f75a9e32be	netmgr: fix a non-thread-safe access to libuv structures In tcp and udp stoplistening code we accessed libuv structures from a different thread, which caused a shutdown crash when named was under load. Also added additional DbC checks making sure we're in a proper thread when accessing uv_ functions.	2020-01-20 22:28:36 +01:00
Witold Kręcicki	16908ec3d9	netmgr: don't send to an inactive (closing) udp socket We had a race in which n UDP socket could have been already closing by libuv but we still sent data to it. Mark socket as not-active when stopping listening and verify that socket is not active when trying to send data to it.	2020-01-20 22:28:36 +01:00
Witold Kręcicki	eda4300bbb	netmgr: have a single source of truth for tcpdns callback We pass interface as an opaque argument to tcpdns listening socket. If we stop listening on an interface but still have in-flight connections the opaque 'interface' is not properly reference counted, and we might hit a dead memory. We put just a single source of truth in a listening socket and make the child sockets use that instead of copying the value from listening socket. We clean the callback when we stop listening.	2020-01-15 17:22:13 +01:00
Witold Kręcicki	0d637b5985	netmgr: we can't uv_close(sock->timer) when in sock->timer close callback	2020-01-15 14:56:40 +01:00
Witold Kręcicki	525c583145	netmgr: - isc__netievent_storage_t was to small to contain isc__netievent__socket_streaminfo_t on Windows - handle isc_uv_export and isc_uv_import errors properly - rewrite isc_uv_export and isc_uv_import on Windows	2020-01-15 14:08:44 +01:00
Ondřej Surý	3000f14eba	Use isc_refcount_increment0() when reusing handle or socket; remove extra DbC checks	2020-01-14 13:12:13 +01:00
Witold Krecicki	6ee1461cc3	netmgr: handle errors properly in accept_connection. If a connection was closed early (right after accept()) an assertion that assumed that the connection was still alive could be triggered in accept_connection. Handle those errors properly and not with assertions, free all the resources afterwards.	2020-01-14 11:03:06 +01:00

... 4 5 6 7 8

392 Commits