mir/bind - bind - Mike's Git repositories

mir/bind

mirror of https://gitlab.isc.org/isc-projects/bind9 synced 2025-08-24 11:08:45 +00:00

Author	SHA1	Message	Date
Ondřej Surý	6cfadf9db0	Fix a race between isc__nm_async_shutdown() and new sends/reads There was a data race where a new event could be scheduled after isc__nm_async_shutdown() had cleaned up all the dangling UDP/TCP sockets from the loop.	2020-10-30 11:11:54 +01:00
Ondřej Surý	5fcd52209a	Refactor udp_recv_cb() - more logical code flow. - propagate errors back to the caller. - add a 'reading' flag and call the callback from failed_read_cb() only when it the socket was actively reading.	2020-10-30 11:11:54 +01:00
Ondřej Surý	cdccac4993	Fix netmgr read/connect timeout issues - don't bother closing sockets that are already closing. - UDP read timeout timer was not stopped after reading. - improve handling of TCP connection failures.	2020-10-30 11:11:54 +01:00
Ondřej Surý	7a6056bc8f	Add isc__nm_udp_shutdown() function This function will be called during isc_nm_closedown() to ensure that all UDP sockets are closed and detached.	2020-10-30 11:11:54 +01:00
Evan Hunt	5dcdc00b93	add netmgr functions to support outgoing DNS queries - isc_nm_tcpdnsconnect() sets up up an outgoing TCP DNS connection. - isc_nm_tcpconnect(), _udpconnect() and _tcpdnsconnect() now take a timeout argument to ensure connections time out and are correctly cleaned up on failure. - isc_nm_read() now supports UDP; it reads a single datagram and then stops until the next time it's called. - isc_nm_cancelread() now runs asynchronously to prevent assertion failure if reading is interrupted by a non-network thread (e.g. a timeout). - isc_nm_cancelread() can now apply to UDP sockets. - added shim code to support UDP connection in versions of libuv prior to 1.27, when uv_udp_connect() was added all these functions will be used to support outgoing queries in dig, xfrin, dispatch, etc.	2020-10-30 11:11:54 +01:00
Ondřej Surý	8797e5efd5	Fix the data race when read-writing sock->active by using cmpxchg	2020-10-22 11:46:58 -07:00
Ondřej Surý	f7c82e406e	Fix the isc_nm_closedown() to actually close the pending connections 1. The isc__nm_tcp_send() and isc__nm_tcp_read() was not checking whether the socket was still alive and scheduling reads/sends on closed socket. 2. The isc_nm_read(), isc_nm_send() and isc_nm_resumeread() have been changed to always return the error conditions via the callbacks, so they always succeed. This applies to all protocols (UDP, TCP and TCPDNS).	2020-10-22 11:37:16 -07:00
Ondřej Surý	afca2e3b21	Fix the way udp_send_direct() is used There were two problems how udp_send_direct() was used: 1. The udp_send_direct() can return ISC_R_CANCELED (or translated error from uv_udp_send()), but the isc__nm_async_udpsend() wasn't checking the error code and not releasing the uvreq in case of an error. 2. In isc__nm_udp_send(), when the UDP send is already in the right netthread, it uses udp_send_direct() to send the UDP packet right away. When that happened the uvreq was not freed, and the error code was returned to the caller. We need to return ISC_R_SUCCESS and rather use the callback to report an error in such case.	2020-10-22 11:37:16 -07:00
Ondřej Surý	e8b56acb49	Clone the csock in accept_connection(), not in callback If we clone the csock (children socket) in TCP accept_connection() instead of passing the ssock (server socket) to the call back and cloning it there we unbreak the assumption that every socket is handled inside it's own worker thread and therefore we can get rid of (at least) callback locking.	2020-10-08 07:24:31 +02:00
Ondřej Surý	b9a42446e8	Enable DF (don't fragment) flag on listening UDP sockets This commits uses the isc__nm_socket_dontfrag() helper function to enable setting DF bit on the outgoing UDP packets.	2020-10-05 16:21:21 +02:00
Ondřej Surý	fd975a551d	Split reusing the addr/port and load-balancing socket options The SO_REUSEADDR, SO_REUSEPORT and SO_REUSEPORT_LB has different meaning on different platform. In this commit, we split the function to set the reuse of address/port and setting the load-balancing into separate functions. The libuv library already have multiplatform support for setting SO_REUSEADDR and SO_REUSEPORT that allows binding to the same address and port, but unfortunately, when used after the load-balancing socket options have been already set, it overrides the previous setting, so we need our own helper function to enable the SO_REUSEADDR/SO_REUSEPORT first and then enable the load-balancing socket option.	2020-10-05 15:18:28 +02:00
Ondřej Surý	9dc01a636b	Refactor isc__nm_socket_freebind() to take fd and sa_family as args The isc__nm_socket_freebind() has been refactored to match other isc__nm_socket_...() helper functions and take uv_os_fd_t and sa_family_t as function arguments.	2020-10-05 15:18:24 +02:00
Ondřej Surý	5daaca7146	Add SO_REUSEPORT and SO_INCOMING_CPU helper functions The setting of SO_REUSE**** and SO_INCOMING_CPU have been moved into a separate helper functions.	2020-10-05 14:54:24 +02:00
Evan Hunt	dcee985b7f	update all copyright headers to eliminate the typo	2020-09-14 16:20:40 -07:00
Ondřej Surý	89c534d3b9	properly lock the setting/unsetting of callbacks in isc_nmsocket_t changes to socket callback functions were not thread safe.	2020-09-11 12:17:57 -07:00
Evan Hunt	57b4dde974	change from isc_nmhandle_ref/unref to isc_nmhandle attach/detach Attaching and detaching handle pointers will make it easier to determine where and why reference counting errors have occurred. A handle needs to be referenced more than once when multiple asynchronous operations are in flight, so callers must now maintain multiple handle pointers for each pending operation. For example, ns_client objects now contain: - reqhandle: held while waiting for a request callback (query, notify, update) - sendhandle: held while waiting for a send callback - fetchhandle: held while waiting for a recursive fetch to complete - updatehandle: held while waiting for an update-forwarding task to complete control channel connection objects now contain: - readhandle: held while waiting for a read callback - sendhandle: held while waiting for a send callback - cmdhandle: held while an rndc command is running httpd connections contain: - readhandle: held while waiting for a read callback - sendhandle: held while waiting for a send callback	2020-09-11 12:17:57 -07:00
Witold Kręcicki	7eb4564895	assorted small netmgr-related changes - rename isc_nmsocket_t->tcphandle to statichandle - cancelread functions now take handles instead of sockets - add a 'client' flag in socket objects, currently unused, to indicate whether it is to be used as a client or server socket	2020-09-11 10:24:36 -07:00
Evan Hunt	38264b6a4d	Use different allocators for UDP and TCP Each worker has a receive buffer with space for 20 DNS messages of up to 2^16 bytes each, and the allocator function passed to uv_read_start() or uv_udp_recv_start() will reserve a portion of it for use by sockets. UDP can use recvmmsg() and so it needs that entire space, but TCP reads one message at a time. This commit introduces separate allocator functions for TCP and UDP setting different buffer size limits, so that libuv will provide the correct buffer sizes to each of them.	2020-08-05 12:57:23 +02:00
Witold Kręcicki	a0f7d28967	netmgr: retry binding with IP_FREEBIND when EADDRNOTAVAIL is returned. When a new IPv6 interface/address appears it's first in a tentative state - in which we cannot bind to it, yet it's already being reported by the route socket. Because of that BIND9 is unable to listen on any newly detected IPv6 addresses. Fix it by setting IP_FREEBIND option (or equivalent option on other OSes) and then retrying bind() call.	2020-07-31 12:44:22 +02:00
Witold Kręcicki	1cf65cd882	Fix a shutdown race in netmgr udp We need to mark the socket as inactive early (and synchronously) in the stoplistening process; otherwise we might destroy the callback argument before we actually stop listening, and call the callback on bad memory.	2020-06-26 00:19:42 -07:00
Evan Hunt	75c985c07f	change the signature of recv callbacks to include a result code this will allow recv event handlers to distinguish between cases in which the region is NULL because of error, shutdown, or cancelation.	2020-06-19 12:33:26 -07:00
Evan Hunt	5ea26ee1f1	modify reference counting within netmgr - isc__nmhandle_get() now attaches to the sock in the nmhandle object. the caller is responsible for dereferencing the original socket pointer when necessary. - tcpdns listener sockets attach sock->outer to the outer tcp listener socket. tcpdns connected sockets attach sock->outerhandle to the handle for the tcp connected socket. - only listener sockets need to be attached/detached directly. connected sockets should only be accessed and reference-counted via their associated handles.	2020-06-19 09:39:50 +02:00
Evan Hunt	9e740cad21	make isc_nmsocket_{attach,detach}{} functions private there is no need for a caller to reference-count socket objects. they need tto be able tto close listener sockets (i.e., those returned by isc_nm_listen{udp,tcp,tcpdns}), and an isc_nmsocket_close() function has been added for that. other sockets are only accessed via handles.	2020-06-19 09:39:50 +02:00
Ondřej Surý	4ec357da0a	Don't check the result of setting SO_INCOMING_CPU The SO_INCOMING_CPU is available since Linux 3.19 for getting the value, but only since Linux 4.4 for setting the value (see below for a full description). BIND 9 should not fail when setting the option on the socket fails, as this is only an optimization and not hard requirement to run BIND 9. SO_INCOMING_CPU (gettable since Linux 3.19, settable since Linux 4.4) Sets or gets the CPU affinity of a socket. Expects an integer flag. int cpu = 1; setsockopt(fd, SOL_SOCKET, SO_INCOMING_CPU, &cpu, sizeof(cpu)); Because all of the packets for a single stream (i.e., all packets for the same 4-tuple) arrive on the single RX queue that is associated with a particular CPU, the typical use case is to employ one listening process per RX queue, with the incoming flow being handled by a listener on the same CPU that is handling the RX queue. This provides optimal NUMA behavior and keeps CPU caches hot.	2020-06-03 12:44:44 +02:00
Witold Kręcicki	fa02f6438b	Don't set UDP recv/send buffer sizes - use system defaults (unless explicitly defined)	2020-05-01 17:04:00 +02:00
Ondřej Surý	09ba47b067	Use SO_REUSEPORT only on Linux, use SO_REUSEPORT_LB on FreeBSD The SO_REUSEPORT socket option on Linux means something else on BSD based systems. On FreeBSD there's 1:1 option SO_REUSEPORT_LB, so we can use that.	2020-05-01 15:20:55 +02:00
Witold Kręcicki	83049ceabf	Don't free udp recv buffer if UV_UDP_MMSG_CHUNK is set	2020-04-30 17:30:37 +02:00
Ondřej Surý	d5356a40ff	Use UV_UDP_RECVMMSG to enable mmsg support in libuv if available	2020-04-30 17:30:37 +02:00
Witold Kręcicki	5fedd21e16	netmgr refactoring: use generic functions when operating on sockets. tcpdns used transport-specific functions to operate on the outer socket. Use generic ones instead, and select the proper call in netmgr.c. Make the missing functions (e.g. isc_nm_read) generic and add type-specific calls (isc__nm_tcp_read). This is the preparation for netmgr TLS layer.	2020-03-24 20:31:43 +00:00
Evan Hunt	0b76d8a490	comments	2020-02-28 08:46:16 +01:00
Witold Kręcicki	517e6eccdf	use SO_INCOMING_CPU for UDP sockets	2020-02-28 08:46:16 +01:00
Witold Kręcicki	a658f7976c	We don't need to fill udp local address every time since we are bound to it.	2020-02-28 08:46:16 +01:00
Witold Kręcicki	eb874608c1	Use the original threadid when sending a UDP packet to decrease probability of context switching	2020-02-28 08:46:16 +01:00
Ondřej Surý	5777c44ad0	Reformat using the new rules	2020-02-14 09:31:05 +01:00
Evan Hunt	e851ed0bb5	apply the modified style	2020-02-13 15:05:06 -08:00
Ondřej Surý	056e133c4c	Use clang-tidy to add curly braces around one-line statements The command used to reformat the files in this commit was: ./util/run-clang-tidy \ -clang-tidy-binary clang-tidy-11 -clang-apply-replacements-binary clang-apply-replacements-11 \ -checks=-,readability-braces-around-statements \ -j 9 \ -fix \ -format \ -style=file \ -quiet clang-format -i --style=format $(git ls-files '.c' '.h') uncrustify -c .uncrustify.cfg --replace --no-backup $(git ls-files '.c' '.h') clang-format -i --style=format $(git ls-files '.c' '*.h')	2020-02-13 22:07:21 +01:00
Ondřej Surý	f50b1e0685	Use clang-format to reformat the source files	2020-02-12 15:04:17 +01:00
Witold Kręcicki	42f0e25a4c	calling isc__nm_udp_send() on a non-udp socket is not 'unexpected', it's a critical failure	2020-01-20 22:28:36 +01:00
Witold Kręcicki	8d6dc8613a	clean up some handle/client reference counting errors in error cases. We weren't consistent about who should unreference the handle in case of network error. Make it consistent so that it's always the client code responsibility to unreference the handle - either in the callback or right away if send function failed and the callback will never be called.	2020-01-20 22:28:36 +01:00
Witold Kręcicki	f75a9e32be	netmgr: fix a non-thread-safe access to libuv structures In tcp and udp stoplistening code we accessed libuv structures from a different thread, which caused a shutdown crash when named was under load. Also added additional DbC checks making sure we're in a proper thread when accessing uv_ functions.	2020-01-20 22:28:36 +01:00
Witold Kręcicki	16908ec3d9	netmgr: don't send to an inactive (closing) udp socket We had a race in which n UDP socket could have been already closing by libuv but we still sent data to it. Mark socket as not-active when stopping listening and verify that socket is not active when trying to send data to it.	2020-01-20 22:28:36 +01:00
Evan Hunt	90a1dabe74	count statistics in netmgr UDP code - also restored a test in the statistics test which was changed when the netmgr was introduced because active sockets were not being counted.	2020-01-13 14:09:37 -08:00
Evan Hunt	80a5c9f5c8	associate socket stats counters with netmgr socket objects - the socket stat counters have been moved from socket.h to stats.h. - isc_nm_t now attaches to the same stats counter group as isc_socketmgr_t, so that both managers can increment the same set of statistics - isc__nmsocket_init() now takes an interface as a paramter so that the address family can be determined when initializing the socket. - based on the address family and socket type, a group of statistics counters will be associated with the socket - for example, UDP4Active with IPv4 UDP sockets and TCP6Active with IPv6 TCP sockets. note that no counters are currently associated with TCPDNS sockets; those stats will be handled by the underlying TCP socket. - the counters are not actually used by netmgr sockets yet; counter increment and decrement calls will be added in a later commit.	2020-01-13 14:05:02 -08:00
Evan Hunt	e38004457c	netmgr fixes: - use UV_{TC,UD}P_IPV6ONLY for IPv6 sockets, keeping the pre-netmgr behaviour. - add a new listening_error bool flag which is set if the child listener fails to start listening. This fixes a bug where named would hang if, e.g., we failed to bind to a TCP socket.	2020-01-13 10:54:17 -08:00
Evan Hunt	31b3980ef0	shorten some names reduce line breaks and general unwieldiness by changing some function, type, and parameter names.	2019-12-09 21:44:04 +01:00
Evan Hunt	b05194160b	style, comments	2019-12-09 11:15:27 -08:00
Witold Kręcicki	5a65ec0aff	Add uv_handle_{get,set}_data functions that's absent in pre-1.19 libuv to make code clearer. This might be removed when we stop supporting older libuv versions.	2019-12-09 11:15:27 -08:00
Witold Kręcicki	bc5aae1579	netmgr: make tcp listening multithreaded. When listening for TCP connections we create a socket, bind it and then pass it over IPC to all threads - which then listen on in and accept connections. This sounds broken, but it's the official way of dealing with multithreaded TCP listeners in libuv, and works on all platforms supported by libuv.	2019-12-09 11:15:27 -08:00
Evan Hunt	73cafd9d57	clean up comments	2019-11-17 18:59:40 -08:00
Witold Kręcicki	70397f9d92	netmgr: libuv-based network manager This is a replacement for the existing isc_socket and isc_socketmgr implementation. It uses libuv for asynchronous network communication; "networker" objects will be distributed across worker threads reading incoming packets and sending them for processing. UDP listener sockets automatically create an array of "child" sockets so each worker can listen separately. TCP sockets are shared amongst worker threads. A TCPDNS socket is a wrapper around a TCP socket, which handles the the two-byte length field at the beginning of DNS messages over TCP. (Other wrapper socket types can be implemented in the future to handle DNS over TLS, DNS over HTTPS, etc.)	2019-11-07 11:55:37 -08:00

1 2

100 Commits