mir/bind - bind - Mike's Git repositories

mir/bind

mirror of https://gitlab.isc.org/isc-projects/bind9 synced 2025-08-28 21:17:54 +00:00

Author	SHA1	Message	Date
Evan Hunt	2edefbad4a	remove the 'name_coff' parameter in dns_name_towire() this parameter was added as a (minor) optimization for cases where dns_name_towire() is run repeatedly with the same compression context, as when rendering all of the rdatas in an rdataset. it is currently only used in one place. we now simplify the interface by removing the extra parameter. the compression offset value is now part of the compression context, and can be activated when needed by calling dns_compress_setmultiuse(). multiuse mode is automatically deactivated by any subsequent call to dns_compress_permitted().	2025-02-25 12:53:25 -08:00
Ondřej Surý	04c2c2cbc8	Simplify dns_name_init() Remove the now-unused offsets parameter from dns_name_init().	2025-02-25 12:17:34 +01:00
Ondřej Surý	2f8e0edf3b	Split and simplify the use of EDE list implementation Instead of mixing the dns_resolver and dns_validator units directly with the EDE code, split-out the dns_ede functionality into own separate compilation unit and hide the implementation details behind abstraction. Additionally, the EDE codes are directly copied into the ns_client buffers by passing the EDE context to dns_resolver_createfetch(). This makes the dns_ede implementation simpler to use, although sligtly more complicated on the inside. Co-authored-by: Colin Vidal <colin@isc.org> Co-authored-by: Ondřej Surý <ondrej@isc.org>	2025-01-30 11:52:53 +01:00
Colin Vidal	39c2fc4670	fix byte order in EDE logging When an EDE code is added to a message, the code is converted early in a big-endian order so it can be memcpy-ed directly in the EDE buffer that will go on the wire. This previous change forget to update debug logs which still assume the EDE code was in host byte order. Add a separate variable to differentiate both and avoid ambiguities	2025-01-27 11:49:44 +01:00
Evan Hunt	10accd6260	clean up uses of ISC_R_NOMEMORY the isc_mem allocation functions can no longer fail; as a result, ISC_R_NOMEMORY is now rarely used: only when an external library such as libjson-c or libfstrm could return NULL. (even in these cases, arguably we should assert rather than returning ISC_R_NOMEMORY.) code and comments that mentioned ISC_R_NOMEMORY have been cleaned up, and the following functions have been changed to type void, since (in most cases) the only value they could return was ISC_R_SUCCESS: - dns_dns64_create() - dns_dyndb_create() - dns_ipkeylist_resize() - dns_kasp_create() - dns_kasp_key_create() - dns_keystore_create() - dns_order_create() - dns_order_add() - dns_peerlist_new() - dns_tkeyctx_create() - dns_view_create() - dns_zone_setorigin() - dns_zone_setfile() - dns_zone_setstream() - dns_zone_getdbtype() - dns_zone_setjournal() - dns_zone_setkeydirectory() - isc_lex_openstream() - isc_portset_create() - isc_symtab_create() (the exception is dns_view_create(), which could have returned other error codes in the event of a crypto library failure when calling isc_file_sanitize(), but that should be a RUNTIME_CHECK anyway.)	2025-01-23 15:54:57 -08:00
Colin Vidal	4096f27130	add support for multiple EDE Extended DNS error mechanism (EDE) enables to have several EDE raised during a DNS resolution (typically, a DNSSEC query will do multiple fetches which each of them can have an error). Add support to up to 3 EDE errors in an DNS response. If duplicates occur (two EDEs with the same code, the extra text is not compared), only the first one will be part of the DNS answer. Because the maximum number of EDE is statically fixed, `ns_client_t` object own a static vector of `DNS_DE_MAX_ERRORS` (instead of a linked list, for instance). The array can be fully filled (all slots point to an allocated `dns_ednsopt_t` object) or partially filled (or empty). In such case, the first NULL slot means there is no more EDE objects.	2025-01-22 21:07:44 +01:00
Ondřej Surý	2cb5a6210f	Improve the badcache cleaning by adding LRU and using RCU Instead of cleaning the dns_badcache opportunistically, add per-loop LRU, so each thread-loop can clean the expired entries. This also allows removal of the atomic operations as the badcache entries are now immutable, instead of updating the badcache entry in place, the old entry is now deleted from the hashtable and the LRU list, and the new entry is inserted in the LRU.	2024-11-27 17:44:53 +01:00
Ondřej Surý	1a19ce39db	Remove redundant semicolons after the closing braces of functions	2024-11-19 12:27:22 +01:00
Ondřej Surý	0258850f20	Remove redundant parentheses from the return statement	2024-11-19 12:27:22 +01:00
Evan Hunt	c6698322c6	suppress report-channel for zones above the agent-domain RFC 9567 section 8.1 specifies that the agent domain cannot be a subdomain of the domain it is reporting on. therefore, in addition to making it illegal to configure that at the zone level, we also need to disable send-report-channel for any zone for which the global send-report-channel value is a subdomain. we also now warn if send-report-channel is configured globally to a zone that we host, but that zone doesn't have log-report-channel set.	2024-10-23 21:29:32 +00:00
Mark Andrews	c676fd2566	Allow send-report-channel to be set at the zone level If send-report-channel is set at the zone level, it will be stored in the zone object and used instead of the view-level agent-domain when constructing the EDNS Report-Channel option.	2024-10-23 21:29:32 +00:00
Mark Andrews	ac1c60d87e	Add send-report-channel option This commit adds support for the EDNS Report-Channel option, which is returned in authoritative responses when EDNS is in use. "send-report-channel" sets the Agent-Domain value that will be included in EDNS Report-Channel options. This is configurable at the options/view level; the value is a DNS name. Setting the Agent-Domain to the root zone (".") disables the option. When this value has been set, incoming queries matchng the form _er.<qtype>.<qname>.<extended-error-code>._er.<agent-domain>/TXT will be logged to the dns-reporting-agent channel at INFO level. (Note: error reporting queries will only be accepted if sent via TCP or with a good server cookie. If neither is present, named returns BADCOOKIE to complete the DNS COOKIE handshake, or TC=1 to switch the client to TCP.)	2024-10-23 21:29:32 +00:00
Evan Hunt	5ea1f6390d	corrected code style errors - add missing brackets around one-line statements - add paretheses around return values	2024-10-18 19:31:27 +00:00
Evan Hunt	8104ffda0e	report client transport in 'rndc recursing' when dumping the list of recursing clients, indicate whether a given query was sent over UDP, TCP, TLS, or HTTP.	2024-10-14 12:59:52 -07:00
Ondřej Surý	091d738c72	Convert all categories and modules into static lists Remove the complicated mechanism that could be (in theory) used by external libraries to register new categories and modules with statically defined lists in <isc/log.h>. This is similar to what we have done for <isc/result.h> result codes. All the libraries are now internal to BIND 9, so we don't need to provide a mechanism to register extra categories and modules.	2024-08-20 12:50:39 +00:00
Ondřej Surý	8506102216	Remove logging context (isc_log_t) from the public namespace Now that the logging uses single global context, remove the isc_log_t from the public namespace.	2024-08-20 12:50:39 +00:00
Ondřej Surý	bf9fd2a6ff	Reset the TCP connection on a failed send When sending fails, the ns__client_request() would not reset the connection and continue as nothing is happening. This comes from the model that we don't care about failed UDP sends because datagrams are unreliable anyway, but it greatly affects TCP connections with keep-alive. The worst case scenario is as follows: 1. the 3-way TCP handshake gets completed 2. the libuv calls the "uv_connection_cb" callback 3. the TCP connection gets queue because of the tcp-clients quota 4. the TCP client sends as many DNS messages as the buffers allow 5. the TCP connection gets dropped by the client due to the timeout 6. the TCP connection gets accepted by the server 7. the data already sent by the client gets read 8. all sending fails immediately because the TCP connection is dead 9. we consume all the data in the buffer in a very tight loop As it doesn't make sense to trying to process more data on the TCP connection when the sending is failing, drop the connection immediately on the first sending error.	2024-07-03 09:07:20 +02:00
Ondřej Surý	1c0564d715	Remove ns_query_init() cannot fail, remove the error paths As ns_query_init() cannot fail now, remove the error paths, especially in ns__client_setup() where we now don't have to care what to do with the connection if setting up the client could fail. It couldn't fail even before, but now it's formal.	2024-07-03 09:05:51 +02:00
Aram Sargsyan	54ddd848fe	Avoid running get_matching_view() asynchronously on an error path Also create a new ns_client_async_reset() static function to decrease code duplication.	2024-06-10 17:35:40 +02:00
Aram Sargsyan	ad489c44df	Remove sig0checks-quota-maxwait-ms support Waiting for a quota to appear complicates things and wastes rosources on timer management. Just answer with REFUSE if there is no quota.	2024-06-10 17:33:11 +02:00
Aram Sargsyan	f0cde05e06	Implement asynchronous view matching for SIG(0)-signed queries View matching on an incoming query checks the query's signature, which can be a CPU-heavy task for a SIG(0)-signed message. Implement an asynchronous mode of the view matching function which uses the offloaded signature checking facilities, and use it for the incoming queries.	2024-06-10 17:33:10 +02:00
Aram Sargsyan	c7f79a0353	Add a quota for SIG(0) signature checks In order to protect from a malicious DNS client that sends many queries with a SIG(0)-signed message, add a quota of simultaneously running SIG(0) checks. This protection can only help when named is using more than one worker threads. For example, if named is running with the '-n 4' option, and 'sig0checks-quota 2;' is used, then named will make sure to not use more than 2 workers for the SIG(0) signature checks in parallel, thus leaving the other workers to serve the remaining clients which do not use SIG(0)-signed messages. That limitation is going to change when SIG(0) signature checks are offloaded to "slow" threads in a future commit. The 'sig0checks-quota-exempt' ACL option can be used to exempt certain clients from the quota requirements using their IP or network addresses. The 'sig0checks-quota-maxwait-ms' option is used to define a maximum amount of time for named to wait for a quota to appear. If during that time no new quota becomes available, named will answer to the client with DNS_R_REFUSED.	2024-06-10 17:33:08 +02:00
Ondřej Surý	e28266bfbc	Remove the extra memory context with own arena for sending The changes in this MR prevent the memory used for sending the outgoing TCP requests to spike so much. That strictly remove the extra need for own memory context, and thus since we generally prefer simplicity, remove the extra memory context with own jemalloc arenas just for the outgoing send buffers.	2024-06-10 16:48:54 +02:00
Ondřej Surý	452a2e6348	Replace the tcp_buffers memory pool with static per-loop buffer As a single thread can process only one TCP send at the time, we don't really need a memory pool for the TCP buffers, but it's enough to have a single per-loop (client manager) static buffer that's being used to assemble the DNS message and then it gets copied into own sending buffer. In the future, this should get optimized by exposing the uv_try API from the network manager, and first try to send the message directly and allocate the sending buffer only if we need to send the data asynchronously.	2024-06-10 16:48:53 +02:00
Aram Sargsyan	982eab7de0	ns_client: reuse TCP send buffers Constantly allocating, reallocating and deallocating 64K TCP send buffers by 'ns_client' instances takes too much CPU time. There is an existing mechanism to reuse the ns_clent_t structure associated with the handle using 'isc_nmhandle_getdata/_setdata' (see ns_client_request()), but it doesn't work with TCP, because every time ns_client_request() is called it gets a new handle even for the same TCP connection, see the comments in streamdns_on_complete_dnsmessage(). To solve the problem, we introduce an array of available (unused) TCP buffers stored in ns_clientmgr_t structure so that a 'client' working via TCP can have a chance to reuse one (if there is one) instead of allocating a new one every time.	2024-06-10 16:48:53 +02:00
Aydın Mercan	f30008a71c	Provide an early escape hatch for ns_client_transport_type Because some tests don't have a legtimate handle, provide a temporary return early that should be fixed and removed before squashing. This short circuiting is still correct until DoQ/DoH3 support is introduced.	2024-04-26 16:12:29 +03:00
Aydın Mercan	b5478654a2	Add fallback to ns_client_get_type despite unreachable GCC might fail to compile because it expects a return after UNREACHABLE. It should ideally just work anyway since UNREACHABLE is either a noreturn or UB (__builtin_unreachable / C23 unreachable). Either way, it should be optimized almost always so the fallback is free or basically free anyway when it isn't optimized out.	2024-04-26 16:12:29 +03:00
Aydın Mercan	4a3f7fe1ef	Emit and read correct DoT and DoH dnstap entries Other protocols still pretend to be TCP/UDP. This only causes a difference when using dnstap-read on a file with DoQ or DNSCrypt entries	2024-04-26 16:12:29 +03:00
Artem Boldariev	5ed3a76f9d	BIND: Add 'allow-proxy' and 'allow-proxy-on' options The main intention of PROXY protocol is to pass endpoints information to a back-end server (in our case - BIND). That means that it is a valid way to spoof endpoints information, as the addresses and ports extracted from PROXYv2 headers, from the point of view of BIND, are used instead of the real connection addresses. Of course, an ability to easily spoof endpoints information can be considered a security issue when used uncontrollably. To resolve that, we introduce 'allow-proxy' and 'allow-proxy-on' ACL options. These are the only ACL options in BIND that work with real PROXY connections addresses, allowing a DNS server operator to specify from what clients and on which interfaces he or she is willing to accept PROXY headers. By default, for security reasons we do not allow to accept them.	2023-12-06 15:15:25 +02:00
Ondřej Surý	17da9fed58	Remove AES algorithm for DNS cookies The AES algorithm for DNS cookies was being kept for legacy reasons, and it can be safely removed in the next major release. Remove both the AES usage for DNS cookies and the AES implementation itself.	2023-11-15 10:31:16 +01:00
Aram Sargsyan	b970556f21	Remove unnecessary NULL-checks in ns__client_setup() All these pointers are guaranteed to be non-NULL. Additionally, update a comment to remove obviously outdated information about the function's requirements.	2023-09-28 13:43:18 +00:00
Ondřej Surý	f5af981831	Change dns_message_create() function to accept memory pools Instead of creating new memory pools for each new dns_message, change dns_message_create() method to optionally accept externally created dns_fixedname_t and dns_rdataset_t memory pools. This allows us to preallocate the memory pools in ns_client and dns_resolver units for the lifetime of dns_resolver_t and ns_clientmgr_t.	2023-09-24 18:07:40 +02:00
Ondřej Surý	3340c82b99	Improve isc_refcount with initializer and implicit destroy Add ISC_REFCOUNT_INITIALIZER(x) macro and implicitly call isc_refcount_destroy() in the ISC_REFCOUNT_IMPL() macros to reduce code duplicities.	2023-09-24 10:08:56 +02:00
Artem Boldariev	01cc7edcca	Allocate DNS send buffers using dedicated per-worker memory arenas This commit ensures that memory allocations related to DNS send buffers are routed through dedicated per-worker memory arenas in order to decrease memory usage on high load caused by TCP-based DNS transports. We do that by following jemalloc developers suggestions: https://github.com/jemalloc/jemalloc/issues/2483#issuecomment-1639019699 https://github.com/jemalloc/jemalloc/issues/2483#issuecomment-1698173849	2023-09-05 09:39:41 +02:00
Ondřej Surý	89fcb6f897	Apply the isc_mem_cget semantic patch	2023-08-31 22:08:35 +02:00
Evan Hunt	62d70966f2	remove dns_name_towire2() we don't need two versions of dns_name_towire(), we can just add NULL to the calls that don't need to specify a compression offset.	2023-08-31 10:29:16 -07:00
Ondřej Surý	4dacdde28f	Refactor dns_badcache to use cds_lfht lock-free hashtable The dns_badcache unit had (yet another) own locked hashtable implementation. Replace the hashtable used by dns_badcache with lock-free cds_lfht implementation from liburcu.	2023-07-31 15:51:15 +02:00
Mark Andrews	3969e2c5f7	Return BADCOOKIE on validly formed bad SERVER COOKIES The server was previously tolerant of out-of-date or otherwise bad DNS SERVER COOKIES that where well formed unless require-cookie was set. BADCOOKIE is now return for these conditions.	2023-07-13 01:58:53 +00:00
Mark Andrews	971f49b3ad	Use RCU for view->adb access view->adb may be referenced while the view is shutting down as the zone uses a weak reference to the view and examines view->adb but dns_view_detach call dns_adb_detach to clear view->adb.	2023-06-14 19:21:28 +10:00
Evan Hunt	6105a7d360	convert TSIG keyring storage from RBT to hash table since it is not necessary to find partial matches when looking up names in a TSIG keyring, we can use a hash table instead of an RBT to store them. the tsigkey object now stores the key name as a dns_fixedname rather than allocating memory for it. the `name` parameter to dns_tsigkeyring_add() has been removed; it was unneeded since the tsigkey object already contains a copy of the name. the opportunistic cleanup_ring() function has been removed; it was only slowing down lookups.	2023-06-14 08:14:38 +00:00
Evan Hunt	ffacf0aec6	use algorithm number instead of name to create TSIG keys the prior practice of passing a dns_name containing the expanded name of an algorithm to dns_tsigkey_create() and dns_tsigkey_createfromkey() is unnecessarily cumbersome; we can now pass the algorithm number instead.	2023-06-14 08:14:38 +00:00
Artem Boldariev	d8a5feb556	Use appropriately sized send buffers for DNS messages over TCP This commit changes send buffers allocation strategy for stream based transports. Before that change we would allocate a dynamic buffers sized at 64Kb even when we do not need that much. That could lead to high memory usage on server. Now we resize the send buffer to match the size of the actual data, freeing the memory at the end of the buffer for being reused later.	2023-06-06 13:40:42 +02:00
Mark Andrews	c48c72343d	Silence Coverity USE_AFTER_FREE warning Use current used pointer - 16 instead of a saved pointer as Coverity thinks the memory may be freed between assignment and use of 'cp'. isc_buffer_put{mem,uint{8,16,32}} can theoretically free the memory if there is a dynamic buffer in use but that is not the case here.	2023-05-23 02:13:28 +00:00
Ondřej Surý	1715cad685	Refactor the isc_quota code and fix the quota in TCP accept code In e18541287231b721c9cdb7e492697a2a80fd83fc, the TCP accept quota code became broken in a subtle way - the quota would get initialized on the first accept for the server socket and then deleted from the server socket, so it would never get applied again. Properly fixing this required a bigger refactoring of the isc_quota API code to make it much simpler. The new code decouples the ownership of the quota and acquiring/releasing the quota limit. After (during) the refactoring it became more clear that we need to use the callback from the child side of the accepted connection, and not the server side.	2023-04-12 14:10:37 +02:00
Tony Finch	0d353704fb	Use isc_histo for the message size statistics This should have no functional effects. The message size stats are specified by RSSAC002 so it's best not to mess around with how they appear in the statschannel. But it's worth changing the implementation to use general-purpose histograms, to reduce code size and benefit from sharded counters.	2023-04-03 12:08:05 +01:00
Ondřej Surý	a5f5f68502	Refactor isc_time_now() to return time, and not result The isc_time_now() and isc_time_now_hires() were used inconsistently through the code - either with status check, or without status check, or via TIME_NOW() macro with RUNTIME_CHECK() on failure. Refactor the isc_time_now() and isc_time_now_hires() to always fail when getting current time has failed, and return the isc_time_t value as return value instead of passing the pointer to result in the argument.	2023-03-31 15:02:06 +02:00
Ondřej Surý	46f06c1d6e	Apply the semantic patch to remove isc_stdtime_get() This is a simple replacement using the semantic patch from the previous commit and as added bonus, one removal of previously undetected unused variable in named/server.c.	2023-03-31 13:32:56 +02:00
Evan Hunt	d91097e0c7	change ns__client_request() to ns_client_request() in the future we'll want to call this function from outside named, so change the name to one suitable for external access.	2023-03-28 12:38:28 -07:00
Evan Hunt	197334464e	remove named_os_gethostname() this function was just a front-end for gethostname(). it was needed when we supported windows, which has a different function for looking up the hostname; it's not needed any longer.	2023-02-18 20:23:41 +00:00
Evan Hunt	a52b17d39b	remove isc_task completely as there is no further use of isc_task in BIND, this commit removes it, along with isc_taskmgr, isc_event, and all other related types. functions that accepted taskmgr as a parameter have been cleaned up. as a result of this change, some functions can no longer fail, so they've been changed to type void, and their callers have been updated accordingly. the tasks table has been removed from the statistics channel and the stats version has been updated. dns_dyndbctx has been changed to reference the loopmgr instead of taskmgr, and DNS_DYNDB_VERSION has been udpated as well.	2023-02-16 18:35:32 +01:00

1 2 3 4 5

221 Commits