mir/bind - bind - Mike's Git repositories

mir/bind

mirror of https://gitlab.isc.org/isc-projects/bind9 synced 2025-08-29 05:28:00 +00:00

Author	SHA1	Message	Date
Ondřej Surý	e603983ec9	Stop providing branch prediction information The __builtin_expect() can be used to provide the compiler with branch prediction information. The Gcc manual says[1] on the subject: In general, you should prefer to use actual profile feedback for this (-fprofile-arcs), as programmers are notoriously bad at predicting how their programs actually perform. Stop using __builtin_expect() and ISC_LIKELY() and ISC_UNLIKELY() macros to provide the branch prediction information as the performance testing shows that named performs better when the __builtin_expect() is not being used. 1. https://gcc.gnu.org/onlinedocs/gcc/Other-Builtins.html#index-_005f_005fbuiltin_005fexpect	2021-10-14 10:33:24 +02:00
Artem Boldariev	abecfdc298	DoT: do not attempt to call read callback if it is not avaialble This commit fixes a crash in DoT code when it was attempting to call a read callback on the later stages of the connection when it is not available. It also fixes [GL #2884] (back-trace provided in the bug report is exactly the same as was seen when fixing this problem).	2021-10-05 11:26:14 +03:00
Artem Boldariev	25b2c6ad96	Require "dot" ALPN token for zone transfer requests over DoT (XoT) This commit makes BIND verify that zone transfers are allowed to be done over the underlying connection. Currently, it makes sense only for DoT, but the code is deliberately made to be protocol-agnostic.	2021-10-05 11:23:47 +03:00
Artem Boldariev	eba3278e52	Add isc_nm_xfr_allowed() function The intention of having this function is to have a predicate to check if a zone transfer could be performed over the given handle. In most cases we can assume that we can do zone transfers over any stream transport except DoH, but this assumption will not work for zone transfers over DoT (XoT), as the RFC9103 requires ALPN to happen, which might not be the case for all deployments of DoT.	2021-10-05 11:23:47 +03:00
Artem Boldariev	56b3f5d832	Low level code to support ALPN in DoT This commit adds low-level code necessary to support ALPN in DoT as XoT requires "dot" ALPN token to be negotiated on a connection for zone transfers.	2021-10-05 11:23:47 +03:00
Evan Hunt	08ce69a0ea	Rewrite dns_resolver and dns_request to use netmgr timeouts - The `timeout_action` parameter to dns_dispatch_addresponse() been replaced with a netmgr callback that is called when a dispatch read times out. this callback may optionally reset the read timer and resume reading. - Added a function to convert isc_interval to milliseconds; this is used to translate fctx->interval into a value that can be passed to dns_dispatch_addresponse() as the timeout. - Note that netmgr timeouts are accurate to the millisecond, so code to check whether a timeout has been reached cannot rely on microsecond accuracy. - If serve-stale is configured, then a timeout received by the resolver may trigger it to return stale data, and then resume waiting for the read timeout. this is no longer based on a separate stale timer. - The code for canceling requests in request.c has been altered so that it can run asynchronously. - TCP timeout events apply to the dispatch, which may be shared by multiple queries. since in the event of a timeout we have no query ID to use to identify the resp we wanted, we now just send the timeout to the oldest query that was pending. - There was some additional refactoring in the resolver: combining fctx_join() and fctx_try_events() into one function to reduce code duplication, and using fixednames in fetchctx and fetchevent. - Incidental fix: new_adbaddrinfo() can't return NULL anymore, so the code can be simplified.	2021-10-02 11:39:56 -07:00
Ondřej Surý	9ee60e7a17	netmgr fixes needed for dispatch - The read timer must always be stopped when reading stops. - Read callbacks can now call isc_nm_read() again in TCP, TCPDNS and TLSDNS; previously this caused an assertion. - The wrong failure code could be sent after a UDP recv failure because the if statements were in the wrong order. the check for a NULL address needs to be after the check for an error code, otherwise the result will always be set to ISC_R_EOF. - When aborting a read or connect because the netmgr is shutting down, use ISC_R_SHUTTINGDOWN. (ISC_R_CANCELED is now reserved for when the read has been canceled by the caller.) - A new function isc_nmhandle_timer_running() has been added enabling a callback to check whether the timer has been reset after processing a timeout. - Incidental netmgr fix: always use isc__nm_closing() instead of referencing sock->mgr->closing directly - Corrected a few comments that used outdated function names.	2021-10-02 11:39:56 -07:00
Evan Hunt	d9e1ad9e37	Remove reference count REQUIRE in isc_nm_read() Previously isc_nm_read() required references on the handle to be at least 2, under the assumption that it would only ever be called from a connect or accept callback. however, it can also be called from a read callback, in which case the reference count might be only 1.	2021-10-02 11:39:56 -07:00
Mark Andrews	8fc9bb8e8e	Address use before NULL check warning of ievent->sock Reorder REQUIRE checks to ensure ievent->sock is checked earlier	2021-09-28 11:57:47 +10:00
Mark Andrews	7079829b84	Address use before NULL check warning of uvreq move dereference of uvreq until the after NULL check.	2021-09-28 11:57:47 +10:00
Ondřej Surý	8248da3b83	Preserve the contents of socket buffer on realloc On TCPDNS/TLSDNS read callback, the socket buffer could be reallocated if the received contents would be larger than the buffer. The existing code would not preserve the contents of the existing buffer which lead to the loss of the already received data. This commit changes the isc_mem_put()+isc_mem_get() with isc_mem_reget() to preserve the existing contents of the socket buffer.	2021-09-23 22:36:01 +02:00
Ondřej Surý	8edbd0929f	Use isc_mem_reget() to handle the internal active handle cache The netmgr, has an internal cache for freed active handles. This cache was allocated using isc_mem_allocate()/isc_mem_free() API because it was simpler to reallocate the cache when we needed to grow it. The new isc_mem_reget() function could be used here reducing the need to use isc_mem_allocate() API which is tad bit slower than isc_mem_get() API.	2021-09-23 22:17:15 +02:00
Artem Boldariev	530133c10f	Unify DoH URI making throughout the codebase This commit adds new function isc_nm_http_makeuri() which is supposed to unify DoH URI construction throughout the codebase. It handles IPv6 addresses, hostnames, and IPv6 addresses given as hostnames properly, and replaces similar ad-hoc code in the codebase.	2021-08-30 10:21:58 +03:00
Evan Hunt	fc6f751fbe	replace per-protocol keepalive functions with a common one this commit removes isc__nm_tcpdns_keepalive() and isc__nm_tlsdns_keepalive(); keepalive for these protocols and for TCP will now be set directly from isc_nmhandle_keepalive(). protocols that have an underlying TCP socket (i.e., TLS stream and HTTP), now have protocol-specific routines, called by isc_nmhandle_keeaplive(), to set the keepalive value on the underlying socket.	2021-08-27 10:02:10 -07:00
Evan Hunt	7867b8b57d	enable keepalive when the keepalive EDNS option is seen previously, receiving a keepalive option had no effect on how long named would keep the connection open; there was a place to configure the keepalive timeout but it was never used. this commit corrects that. this also fixes an error in isc__nm_{tcp,tls}dns_keepalive() in which the sense of a REQUIRE test was reversed; previously this error had not been noticed because the functions were not being used.	2021-08-27 09:56:51 -07:00
Evan Hunt	19e24e22f5	cleanup netmgr-int.h - fix some duplicated and out-of-order prototypes declared in netmgr-int.h - rename isc_nm_tcpdns_keepalive to isc__nm_tcpdns_keepalive as it's for internal use	2021-08-27 09:56:51 -07:00
Artem Boldariev	8a655320c8	Fix a crash (in dig) when closing HTTP socket with unused session This commit fixes a crash (caused by an assert) when closing an HTTP/2 socket with unused HTTP/2 session.	2021-08-27 12:14:48 +03:00
Artem Boldariev	32cd4367a3	Make no assumptions regarding HTTP headers processing order This commit changes the DoH code in such a way that it makes no assumptions regarding which headers are expected to be processed first. In particular, the code expected the :method: pseudo-header to be processed early, which might not be true.	2021-08-25 10:32:56 +03:00
Ondřej Surý	87d5c8ab7c	Disable the Path MTU Discover on UDP Sockets Instead of disabling the fragmentation on the UDP sockets, we now disable the Path MTU Discovery by setting IP(V6)_MTU_DISCOVER socket option to IP_PMTUDISC_OMIT on Linux and disabling IP(V6)_DONTFRAG socket option on FreeBSD. This option sets DF=0 in the IP header and also ignores the Path MTU Discovery. As additional mitigation on Linux, we recommend setting net.ipv4.ip_no_pmtu_disc to Mode 3: Mode 3 is a hardend pmtu discover mode. The kernel will only accept fragmentation-needed errors if the underlying protocol can verify them besides a plain socket lookup. Current protocols for which pmtu events will be honored are TCP, SCTP and DCCP as they verify e.g. the sequence number or the association. This mode should not be enabled globally but is only intended to secure e.g. name servers in namespaces where TCP path mtu must still work but path MTU information of other protocols should be discarded. If enabled globally this mode could break other protocols.	2021-08-19 07:12:33 +02:00
Artem Boldariev	e639957b58	Optimise TLS stream for small write size (>= 512 bytes) This commit changes TLS stream behaviour in such a way, that it is now optimised for small writes. In the case there is a need to write less or equal to 512 bytes, we could avoid calling the memory allocator at the expense of possibly slight increase in memory usage. In case of larger writes, the behviour remains unchanged.	2021-08-12 14:28:17 +03:00
Artem Boldariev	e301e1e3b8	Avoid memory copying during send in TLS stream At least at this point doing memory copying is not required. Probably it was a workaround for some problem in the earlier days of DoH, at this point it appears to be a waste of CPU cycles.	2021-08-12 14:28:17 +03:00
Artem Boldariev	bd69c7c57c	Simplify buffering code logic in http_send_outgoing() This commit significantly simplifies the code in http_send_outgoing() as it was unnecessary complicated, because it was dealing with multiple statically and dynamically allocated buffers, making it extremely hard to follow, as well as making it to do unnecessary memory copying in some situations. This commit fixes these issues, while retaining the high level buffering logic.	2021-08-12 14:28:17 +03:00
Artem Boldariev	a32faa20b4	DoH: replace a custom buffer code for POST data with isc_buffer_t This commit replaces the custom buffer code in client-side DoH code intended to keep track of POST data, with isc_buffer_t.	2021-08-12 14:28:17 +03:00
Artem Boldariev	5b52a7e37e	When terminating a client session, mark it as closing When an HTTP/2 client terminates a session it means that it is about to close the underlying connection. However, we were not doing that. As a result, with the latest changes to the test suite, which made it to limit amount of requests per a transport connection, the tests using quota would hang for quite a while. This commit fixes that.	2021-08-12 14:28:17 +03:00
Artem Boldariev	a05728beb0	Do not call http_do_bio() in isc__nm_http_request() The function should not be called here because it is, in general, supposed to be called at the end of the transport level callbacks to perform I/O, and thus, calling it here is clearly a mistake because it breaks other code expectations. As a result of the call to http_do_bio() from within isc__nm_http_request() the unit tests were running slower than expected in some situations. In this particular situation http_do_bio() is going to be called at the end of the transport_connect_cb() (initially), or http_readcb(), sending all of the scheduled requests at once. This change affects only the test suite because it is the only place in the codebase where isc__nm_http_request() is used in order to ensure that the server is able to handle multiple HTTP/2 streams at once.	2021-08-12 14:28:16 +03:00
Artem Boldariev	849d38b57b	Fix a crash by attach to the transport socket as early as possible This commit fixes a crash in DoH caused by transport handle to be detached too early when sending outgoing data. We need to attach to the session->handle earlier because as an indirect result of the nghttp2_session_mem_send() the session might get closed and the handle detached. However, there is still might be some outgoing data to handle. Besides, even when the underlying socket was closed via the handle, we still should try to attempt to send outgoing data via isc_nm_send() to let it call write callback, passed to the http_send_outgoing().	2021-08-12 14:28:16 +03:00
Artem Boldariev	e0704f2e5d	Use isc_buffer_t to keep track of outgoing response This commit gets rid of custom code taking care of response buffering by replacing the custom code with isc_buffer_t. Also, it gets rid of an unnecessary memory copying when sending a response.	2021-08-12 14:28:16 +03:00
Artem Boldariev	6fe4ab39b9	Use isc_buffer_t to keep track of incoming POST data This commit replaces the ad-hoc 64K buffer for incoming POST data with isc_buffer_t backed by dynamically allocated buffer sized accordingly to the value in the "Content-Length" header.	2021-08-12 14:28:16 +03:00
Artem Boldariev	0ca790d9bf	DoH: isc__buffer_usedregion->isc_buffer_usedregion in client_send() This commit replaces wrong usage of isc__buffer_usedregion() instead of implied isc_buffer_usedregion().	2021-08-12 14:28:16 +03:00
Artem Boldariev	2733cca3ac	Replace ad-hoc DNS message buffer in client code with isc_buffer_t The commit replaces an ad-hoc incoming DNS-message buffer in the client-side DoH code with isc_buffer_t. The commit also fixes a timing issue in the unit tests revealed by the change.	2021-08-12 14:28:16 +03:00
Artem Boldariev	c819caa3a1	Replace the HTTP/2 session's ad-hoc buffer with isc_buffer_t This commit replaces a static ad-hoc HTTP/2 session's temporary buffer with a realloc-able isc_buffer_t object, which is being allocated on as needed basis, lowering the memory consumption somewhat. The buffer is needed in very rare cases, so allocating it prematurely is not wise. Also, it fixes a bug in http_readcb() where the ad-hoc buffer appeared to be improperly used, leading to a situation when the processed data from the receiving regions can be processed twice, while unprocessed data will never be processed.	2021-08-12 14:28:16 +03:00
Artem Boldariev	170cc41d5c	Get rid of some HTTP/2 related types when NGHTTP2 is not available This commit removes definitions of some DoH-related types when libnghttp2 is not available.	2021-08-04 10:32:27 +03:00
Artem Boldariev	f388b71378	Get rid of RW locks in the DoH code This commit gets rid of RW locks in a hot path of the DoH code. In the original design, it was implied that we add new endpoints after the HTTP listener was created. Such a design implies some locking. We do not need such flexibility, though. Instead, we could build a set of endpoints before the HTTP listener gets created. Such a design does not need RW locks at all.	2021-08-04 10:32:25 +03:00
Artem Boldariev	590e8e0b86	Make max number of HTTP/2 streams configurable This commit makes number of concurrent HTTP/2 streams per connection configurable as a mean to fight DDoS attacks. As soon as the limit is reached, BIND terminates the whole session. The commit adds a global configuration option (http-streams-per-connection) which can be overridden in an http <name> {...} statement like follows: http local-http-server { ... streams-per-connection 100; ... }; For now the default value is 100, which should be enough (e.g. NGINX uses 128, but it is a full-featured WEB-server). When using lower numbers (e.g. ~70), it is possible to hit the limit with e.g. flamethrower.	2021-07-16 11:50:22 +03:00
Artem Boldariev	954240467d	Verify HTTP paths both in incoming requests and in config file This commit adds the code (and some tests) which allows verifying validity of HTTP paths both in incoming HTTP requests and in BIND's configuration file.	2021-07-16 10:28:08 +03:00
Artem Boldariev	64cd7e8a7f	Fix crash in DoH on empty query string in GET requests An unhandled code path left GET query string data uninitialised (equal to NULL) and led to a crash during the requests' base64 data decoding. This commit fixes that.	2021-07-13 16:53:51 +03:00
Ondřej Surý	a9e6a7ae57	Disable setting the thread affinity It was discovered that setting the thread affinity on both the netmgr and netthread threads lead to inconsistent recursive performance because sometimes the netmgr and netthread threads would compete over single resource and sometimes not. Removing setting the affinity causes a slight dip in the authoritative performance around 5% (the measured range was from 3.8% to 7.8%), but the recursive performance is now consistently good.	2021-07-13 14:48:29 +02:00
Ondřej Surý	29a285a67d	Revert the allocate/free -> get/put change from jemalloc change In the jemalloc merge request, we missed the fact that ah_frees and ah_handles are reallocated which is not compatible with using isc_mem_get() for allocation and isc_mem_put() for deallocation. This commit reverts that part and restores use of isc_mem_allocate() and isc_mem_free().	2021-07-09 18:19:57 +02:00
Ondřej Surý	f487c6948b	Replace locked mempools with memory contexts Current mempools are kind of hybrid structures - they serve two purposes: 1. mempool with a lock is basically static sized allocator with pre-allocated free items 2. mempool without a lock is a doubly-linked list of preallocated items The first kind of usage could be easily replaced with jemalloc small sized arena objects and thread-local caches. The second usage not-so-much and we need to keep this (in libdns:message.c) for performance reasons.	2021-07-09 15:58:02 +02:00
Ondřej Surý	5ab05d1696	Replace isc_mem_allocate() usage with isc_mem_get() in netmgr.c The isc_mem_allocate() comes with additional cost because of the memory tracking. In this commit, we replace the usage with isc_mem_get() because we track the allocated sizes anyway, so it's possible to also replace isc_mem_free() with isc_mem_put().	2021-07-09 15:58:02 +02:00
Artem Boldariev	c6d0e3d3a7	Return HTTP status code for small/malformed requests This commit makes BIND return HTTP status codes for malformed or too small requests. DNS request processing code would ignore such requests. Such an approach works well for other DNS transport but does not make much sense for HTTP, not allowing it to complete the request/response sequence. Suppose execution has reached the point where DNS message handling code has been called. In that case, it means that the HTTP request has been successfully processed, and, thus, we are expected to respond to it either with a message containing some DNS payload or at least to return an error status code. This commit ensures that BIND behaves this way.	2021-07-09 16:37:08 +03:00
Artem Boldariev	fedff2cd6c	Return "Bad Request" (400) in a case of Base64 decoding error This error code fits better than the more generic "Internal Server Error" (500) which implies that the problem is on the server. Also, do not end the whole HTTP/2 session on a bad request.	2021-07-09 16:26:46 +03:00
Artem Boldariev	1792740075	Ignore an "Accept" HTTP header value We were too strict regarding the value and presence of "Accept" HTTP header, slightly breaking compatibility with the specification. According to RFC8484 client SHOULD add "Accept" header to the requests but MUST be able to handle "application/dns-message" media type regardless of the value of the header. That basically suggests we ignore its value. Besides, verifying the value of the "Accept" header is a bit tricky because it could contain multiple media types, thus requiring proper parsing. That is doable but does not provide us with any benefits. Among other things, not verifying the value also fixes compatibility with clients, which could advertise multiple media types as supported, which we should accept. For example, it is possible for a perfectly valid request to contain "application/dns-message", "application/", and "/*" in the "Accept" header value. Still, we would treat such a request as invalid.	2021-07-09 16:26:46 +03:00
Artem Boldariev	7b6945fb60	Fix BIND hanging when browsers end HTTP/2 streams prematurely The commit fixes BIND hanging when browsers end HTTP/2 streams prematurely (for example, by sending RST_STREAM). It ensures that isc__nmsocket_prep_destroy() will be called for an HTTP/2 stream, allowing it to be properly disposed. The problem was impossible to reproduce using dig or DoH benchmarking software (e.g. flamethrower) because these do not tend to end HTTP/2 streams prematurely.	2021-07-09 15:42:44 +03:00
Artem Boldariev	094fcc10e7	Move the code which calls server read callback into a separate func This commit moves the code which calls server read callback into a separate function to avoid code repetition.	2021-07-09 15:42:44 +03:00
Ondřej Surý	2bb454182b	Make the DNS over HTTPS support optional This commit adds two new autoconf options `--enable-doh` (enabled by default) and `--with-libnghttp2` (mandatory when DoH is enabled). When DoH support is disabled the library is not linked-in and support for http(s) protocol is disabled in the netmgr, named and dig.	2021-07-07 09:50:53 +02:00
Ondřej Surý	b941411072	Disable IP fragmentation on the UDP sockets In DNS Flag Day 2020, we started setting the DF (Don't Fragment socket option on the UDP sockets. It turned out, that this code was incomplete leading to dropping the outgoing UDP packets. This has been now remedied, so it is possible to disable the fragmentation on the UDP sockets again as the sending error is now handled by sending back an empty response with TC (truncated) bit set. This reverts commit 66eefac78c92b64b6689a1655cc677a2b1d13496.	2021-06-23 17:41:34 +02:00
Evan Hunt	a3ba95116e	Handle UDP send errors when sending DNS message larger than MTU When the fragmentation is disabled on UDP sockets, the uv_udp_send() call can fail with UV_EMSGSIZE for messages larger than path MTU. Previously, this error would end with just discarding the response. In this commit, a proper handling of such case is added and on such error, a new DNS response with truncated bit set is generated and sent to the client. This change allows us to disable the fragmentation on the UDP sockets again.	2021-06-23 17:41:34 +02:00
Ondřej Surý	ec86759401	Replace netmgr per-protocol sequential function with a common one Previously, each protocol (TCPDNS, TLSDNS) has specified own function to disable pipelining on the connection. An oversight would lead to assertion failure when opcode is not query over non-TCPDNS protocol because the isc_nm_tcpdns_sequential() function would be called over non-TCPDNS socket. This commit removes the per-protocol functions and refactors the code to have and use common isc_nm_sequential() function that would either disable the pipelining on the socket or would handle the request in per specific manner. Currently it ignores the call for HTTP sockets and causes assertion failure for protocols where it doesn't make sense to call the function at all.	2021-06-22 17:21:44 +03:00
Artem Boldariev	dc356bb196	Fix ASAN error in DoH (passing NULL to memmove()) The warning was produced by an ASAN build: runtime error: null pointer passed as argument 2, which is declared to never be null This commit fixes it by checking if nghttp2_session_mem_send() has actually returned anything.	2021-06-16 17:46:10 +03:00

1 2 3 4 5 ...

310 Commits