mir/bind - bind - Mike's Git repositories

mir/bind

mirror of https://gitlab.isc.org/isc-projects/bind9 synced 2025-08-30 22:15:20 +00:00

Author	SHA1	Message	Date
Aram Sargsyan	ad489c44df	Remove sig0checks-quota-maxwait-ms support Waiting for a quota to appear complicates things and wastes rosources on timer management. Just answer with REFUSE if there is no quota.	2024-06-10 17:33:11 +02:00
Aram Sargsyan	f0cde05e06	Implement asynchronous view matching for SIG(0)-signed queries View matching on an incoming query checks the query's signature, which can be a CPU-heavy task for a SIG(0)-signed message. Implement an asynchronous mode of the view matching function which uses the offloaded signature checking facilities, and use it for the incoming queries.	2024-06-10 17:33:10 +02:00
Aram Sargsyan	710bf9b938	Implement asynchronous message signature verification Add support for using the offload threadpool to perform message signature verifications. This should allow check SIG(0)-signed messages without affecting the worker threads.	2024-06-10 17:33:10 +02:00
Aram Sargsyan	7f013ad05d	Remove dns_message_rechecksig() This is a tiny helper function which is used only once and can be replaced with two function calls instead. Removing this makes supporting asynchronous signature checking less complicated.	2024-06-10 17:33:10 +02:00
Aram Sargsyan	c7f79a0353	Add a quota for SIG(0) signature checks In order to protect from a malicious DNS client that sends many queries with a SIG(0)-signed message, add a quota of simultaneously running SIG(0) checks. This protection can only help when named is using more than one worker threads. For example, if named is running with the '-n 4' option, and 'sig0checks-quota 2;' is used, then named will make sure to not use more than 2 workers for the SIG(0) signature checks in parallel, thus leaving the other workers to serve the remaining clients which do not use SIG(0)-signed messages. That limitation is going to change when SIG(0) signature checks are offloaded to "slow" threads in a future commit. The 'sig0checks-quota-exempt' ACL option can be used to exempt certain clients from the quota requirements using their IP or network addresses. The 'sig0checks-quota-maxwait-ms' option is used to define a maximum amount of time for named to wait for a quota to appear. If during that time no new quota becomes available, named will answer to the client with DNS_R_REFUSED.	2024-06-10 17:33:08 +02:00
Matthijs Mekking	c1ac8b6ad0	Log rekey failure as error if too many records By default we log a rekey failure on debug level. We should probably change the log level to error. We make an exception for when the zone is not loaded yet, it often happens at startup that a rekey is run before the zone is fully loaded.	2024-06-10 16:55:12 +02:00
Matthijs Mekking	82635e56d8	Log error when update fails The new "too many records" error can make an update fail without the error being logged. This commit fixes that.	2024-06-10 16:55:12 +02:00
Evan Hunt	7dd6b47ace	fix a memory leak that could occur when signing when signatures were not added because of too many types already existing at a node, the diff was not being cleaned up; this led to a memory leak being reported at shutdown.	2024-06-10 16:55:12 +02:00
Ondřej Surý	52b3d86ef0	Add a limit to the number of RR types for single name Previously, the number of RR types for a single owner name was limited only by the maximum number of the types (64k). As the data structure that holds the RR types for the database node is just a linked list, and there are places where we just walk through the whole list (again and again), adding a large number of RR types for a single owner named with would slow down processing of such name (database node). Add a configurable limit to cap the number of the RR types for a single owner. This is enforced at the database (rbtdb, qpzone, qpcache) level and configured with new max-types-per-name configuration option that can be configured globally, per-view and per-zone.	2024-06-10 16:55:09 +02:00
Ondřej Surý	32af7299eb	Add a limit to the number of RRs in RRSets Previously, the number of RRs in the RRSets were internally unlimited. As the data structure that holds the RRs is just a linked list, and there are places where we just walk through all of the RRs, adding an RRSet with huge number of RRs inside would slow down processing of said RRSets. Add a configurable limit to cap the number of the RRs in a single RRSet. This is enforced at the database (rbtdb, qpzone, qpcache) level and configured with new max-records-per-type configuration option that can be configured globally, per-view and per-zone.	2024-06-10 16:55:07 +02:00
Ondřej Surý	e28266bfbc	Remove the extra memory context with own arena for sending The changes in this MR prevent the memory used for sending the outgoing TCP requests to spike so much. That strictly remove the extra need for own memory context, and thus since we generally prefer simplicity, remove the extra memory context with own jemalloc arenas just for the outgoing send buffers.	2024-06-10 16:48:54 +02:00
Ondřej Surý	4c2ac25a95	Limit the number of DNS message processed from a single TCP read The single TCP read can create as much as 64k divided by the minimum size of the DNS message. This can clog the processing thread and trash the memory allocator because we need to do as much as ~20k allocations in a single UV loop tick. Limit the number of the DNS messages processed in a single UV loop tick to just single DNS message and limit the number of the outstanding DNS messages back to 23. This effectively limits the number of pipelined DNS messages to that number (this is the limit we already had before).	2024-06-10 16:48:54 +02:00
Ondřej Surý	452a2e6348	Replace the tcp_buffers memory pool with static per-loop buffer As a single thread can process only one TCP send at the time, we don't really need a memory pool for the TCP buffers, but it's enough to have a single per-loop (client manager) static buffer that's being used to assemble the DNS message and then it gets copied into own sending buffer. In the future, this should get optimized by exposing the uv_try API from the network manager, and first try to send the message directly and allocate the sending buffer only if we need to send the data asynchronously.	2024-06-10 16:48:53 +02:00
Aram Sargsyan	982eab7de0	ns_client: reuse TCP send buffers Constantly allocating, reallocating and deallocating 64K TCP send buffers by 'ns_client' instances takes too much CPU time. There is an existing mechanism to reuse the ns_clent_t structure associated with the handle using 'isc_nmhandle_getdata/_setdata' (see ns_client_request()), but it doesn't work with TCP, because every time ns_client_request() is called it gets a new handle even for the same TCP connection, see the comments in streamdns_on_complete_dnsmessage(). To solve the problem, we introduce an array of available (unused) TCP buffers stored in ns_clientmgr_t structure so that a 'client' working via TCP can have a chance to reuse one (if there is one) instead of allocating a new one every time.	2024-06-10 16:48:53 +02:00
Ondřej Surý	4e7c4af17f	Throttle reading from TCP if the sends are not getting through When TCP client would not read the DNS message sent to them, the TCP sends inside named would accumulate and cause degradation of the service. Throttle the reading from the TCP socket when we accumulate enough DNS data to be sent. Currently this is limited in a way that a single largest possible DNS message can fit into the buffer.	2024-06-10 16:48:52 +02:00
Artem Boldariev	d80dfbf745	Keep the endpoints set reference within an HTTP/2 socket This commit ensures that an HTTP endpoints set reference is stored in a socket object associated with an HTTP/2 stream instead of referencing the global set stored inside a listener. This helps to prevent an issue like follows: 1. BIND is configured to serve DoH clients; 2. A client is connected and one or more HTTP/2 stream is created. Internal pointers are now pointing to the data on the associated HTTP endpoints set; 3. BIND is reconfigured - the new endpoints set object is created and promoted to all listeners; 4. The old pointers to the HTTP endpoints set data are now invalid. Instead referencing a global object that is updated on re-configurations we now store a local reference which prevents the endpoints set objects to go out of scope prematurely.	2024-06-10 16:40:12 +02:00
Artem Boldariev	c41fb499b9	DoH: avoid potential use after free for HTTP/2 session objects It was reported that HTTP/2 session might get closed or even deleted before all async. processing has been completed. This commit addresses that: now we are avoiding using the object when we do not need it or specifically check if the pointers used are not 'NULL' and by ensuring that there is at least one reference to the session object while we are doing incoming data processing. This commit makes the code more resilient to such issues in the future.	2024-06-10 16:40:10 +02:00
Ondřej Surý	086b63f56d	Use isc_queue to implement wait-free deadnodes queue Replace the ISC_LIST based deadnodes implementation with isc_queue which is wait-free and we don't have to acquire neither the tree nor node lock to append nodes to the queue and the cleaning process can also copy (splice) the list into a local copy without acquiring the list. Currently, there's little benefit to this as we need to hold those locks anyway, but in the future as we move to RCU based implementation, this will be ready. To align the cleaning with our event loop based model, remove the hardcoded count for the node locks and use the number of the event loops instead. This way, each event loop can have its own cleaning as part of the process. Use uniform random numbers to spread the nodes evenly between the buckets (instead of hashing the domain name).	2024-06-05 09:19:56 +02:00
Ondřej Surý	a9b4d42346	Add isc_queue implementation on top of cds_wfcq Add an isc_queue implementation that hides the gory details of cds_wfcq into more neat API. The same caveats as with cds_wfcq. TODO: Add documentation to the API.	2024-06-05 09:19:56 +02:00
Mark Andrews	56c3dcc5d7	Update resquery_senddone handling of ISC_R_TIMEDOUT Treat timed out as an address specific error.	2024-06-04 00:15:48 +10:00
Mark Andrews	4e3dd85b8d	Update resquery_senddone handling of ISC_R_CONNECTIONRESET Treat connection reset as an address specific error.	2024-06-04 00:15:48 +10:00
Mark Andrews	180b1e7939	Handle ISC_R_HOSTDOWN and ISC_R_NETDOWN in resolver.c These error codes should be treated like other unreachable error codes.	2024-06-04 00:15:48 +10:00
Mark Andrews	05472e63e8	Don't do DS checks over disabled address families	2024-06-03 18:34:31 +10:00
Mark Andrews	d026dbe536	Don't forward UPDATE messages over disabled address families	2024-06-03 18:34:31 +10:00
Mark Andrews	5d99625515	Don't send NOTIFY over disabled address families	2024-06-03 18:34:31 +10:00
Mark Andrews	2cd4303249	Report non-effective primaries When named is started with -4 or -6 and the primaries for a zone do not have an IPv4 or IPv6 address respectively issue a log message.	2024-06-03 18:34:31 +10:00
Mark Andrews	ecdde04e63	Zone transfers should honour -4 and -6 options Check if the address family has been disabled when transferring zones.	2024-06-03 18:34:31 +10:00
Mark Andrews	9be1873ef3	Add helper function isc_sockaddr_disabled	2024-06-03 18:34:31 +10:00
Matthijs Mekking	c40e5c8653	Call reset_shutdown if uv_tcp_close_reset failed If uv_tcp_close_reset() returns an error code, this means the reset_shutdown callback has not been issued, so do it now.	2024-06-03 10:14:47 +02:00
Matthijs Mekking	5b94bb2129	Do not runtime check uv_tcp_close_reset When we reset a TCP connection by sending a RST packet, do not bother requiring the result is a success code.	2024-06-03 10:14:47 +02:00
Mark Andrews	87e3b9dbf3	Pass a memory context in to dns_cache_create	2024-05-31 15:40:32 +10:00
Mark Andrews	5e77edd074	Use a new memory context when flushing the cache When the cache's memory context was in over memory state when the cache was flushed it resulted in LRU cleaning removing newly entered data in the new cache straight away until the old cache had been destroyed enough to take it out of over memory state. When flushing the cache create a new memory context for the new db to prevent this.	2024-05-31 15:40:32 +10:00
Ondřej Surý	3310cac2b0	Create the new database for AXFR from the dns_zone API The `axfr_makedb()` didn't set the loop on the newly created database, effectively killing delayed cleaning on such database. Move the database creation into dns_zone API that knows all the gory details of creating new database suitable for the zone.	2024-05-29 08:30:19 +02:00
Aram Sargsyan	4d3c31b928	fixup! Merge branch 'ondrej/light-cleanup-of-rdataslab' into 'main'	2024-05-25 11:47:33 +02:00
Ondřej Surý	3feabc8a22	Cleanup the dns_cache unit Remove duplicate code and use ISC_REFCOUNT_{DECL,IMPL} macros.	2024-05-25 11:47:33 +02:00
Ondřej Surý	03ed19cf71	Refactor the common buffer manipulation in rdataslab.c in macros The rdataslab.c was full of code like this: length = raw[0] * 256 + raw[1]; and count2 = current2++ 256; count2 += *current2++; Refactor code like this into peek_uint16() and get_uint16 macros to prevent code repetition and possible mistakes when copy and pasting the same code over and over. As a side note for an entertainment of a careful reader of the commit messages: The byte manipulation was changed from multiplication and addition to shift with or. The difference in the assembly looks like this: MUL and ADD: movzx eax, BYTE PTR [rdi] movzx edi, BYTE PTR [rdi+1] sal eax, 8 or edi, eax SHIFT and OR: movzx edi, WORD PTR [rdi] rol di, 8 movzx edi, di If the result and/or buffer is then being used after the macro call, there's more differences in favor of the SHIFT+OR solution.	2024-05-24 09:52:45 +02:00
Aydın Mercan	03a59cbb04	reinsert accidentally removed + in db trace It only affects development when using `DNS_DB_TRACE`.	2024-05-17 18:11:23 -07:00
Aydın Mercan	49e62ee186	fix typing mistakes in trace macros The detach function declaration in `ISC__REFCOUNT_TRACE_DECL` had an returned an accidental implicit int. While not allowed since C99, it became an error by default in GCC 14. `ISC_REFCOUNT_TRACE_IMPL` and `ISC_REFCOUNT_STATIC_TRACE_IMPL` expanded into the wrong macros, trying to declare it again with the wrong number of parameters.	2024-05-17 18:11:23 -07:00
Mark Andrews	b7de2c7cb9	Clang-format header file changes	2024-05-17 16:03:21 -07:00
Mark Andrews	6e9ed4983e	add test cases for several FORMERR code paths: - duplicated question - duplicated answer - qtype as an answer - two question types - question names - nsec3 bad owner name - short record - short question - mismatching question class - bad record owner name - mismatched class in record - mismatched KEY class - OPT wrong owner name - invalid RRSIG "covers" type - UPDATE malformed delete type - TSIG wrong class - TSIG not the last record	2024-05-17 13:39:22 +10:00
Evan Hunt	9c882f1e69	replace qpzone node attriutes with atomics there were TSAN error reports because of conflicting uses of node->dirty and node->nsec, which were in the same qword. this could be resolved by separating them, but we could also make them into atomic values and remove some node locking.	2024-05-17 00:33:35 +00:00
Matthijs Mekking	f882101265	Rewrite qp fix_iterator() The fix_iterator() function had a lot of bugs in it and while fixing them, the number of corner cases and the complexity of the function got out of hand. Rewrite the function with the following modifications: The function now requires that the iterator is pointing to a leaf node. This removes the cases we have to deal when the iterator was left on a dead branch. From the leaf node, pop up the iterator stack until we encounter the branch where the offset point is before the point where the search key differs. This will bring us to the right branch, or at the first unmatched node, in which case we pop up to the parent branch. From there it is easier to retrieve the predecessor. Once we are at the right branch, all we have to do is find the right twig (which is either the twig for the character at the position where the search key differs, or the previous twig) and walk down from there to the greatest leaf or, in case there is no good twig, get the previous twig from the successor and get the greatest leaf from there. If there is no previous twig to select in this branch, because every leaf from this branch node is greater than the one we wanted, we need to pop up the stack again and resume at the parent branch. This is achieved by calling prevleaf().	2024-05-16 09:49:41 +00:00
Matthijs Mekking	8b8c16d7a4	Get anyleaf when qp lookup is on a dead end branch Move the fix_iterator out of the loop and only call it when we found a leaf node. This leaf node may be the wrong leaf node, but fix_iterator should correct that. Also, when we don't need to set the iterator, just get any leaf. We only need to have a leaf for the qpkey_compare and the end result does not matter if compare was against an ancestor leaf or any leaf below that point.	2024-05-16 09:49:41 +00:00
Mark Andrews	ec3c624814	Properly build the NSEC/NSEC3 type bit map DNSKEY was incorrectly being added to the NESC/NSEC3 type bit map when it was obscured by the delegation. This lead to zone verification failures.	2024-05-16 10:27:49 +10:00
Mark Andrews	e84615629f	Properly update 'maxtype' 'maxtype' should be checked to see if it should be updated whenever a type is added to the type map.	2024-05-16 10:20:49 +10:00
Ondřej Surý	eb862ce509	Properly attach/detach isc_httpd in case read ends earlier than send An assertion failure would be triggered when sending the TCP data ends after the TCP reading gets closed. Implement proper reference counting for the isc_httpd object.	2024-05-15 12:22:10 +02:00
Evan Hunt	b6815de316	Fix QP chain on partial match When searching for a requested name in dns_qp_lookup(), we may add a leaf node to the QP chain, then subsequently determine that the branch we were on was a dead end. When that happens, the chain can be left holding a pointer to a node that is not an ancestor of the requested name. We correct for this by unwinding any chain links with an offset value greater or equal to that of the node we found.	2024-05-14 12:58:46 -07:00
Matthijs Mekking	91de4f6490	Refactor fix_iterator The code below the if/else construction could only be run if the 'if' code path was taken. Move the code into the 'if' code block so that it is more easier to read.	2024-05-14 12:58:46 -07:00
Aydın Mercan	e037520b92	Keep track of the recursive clients highwater The high-water allows administrators to better tune the recursive clients limit without having to to poll the statistics channel in high rates to get this number.	2024-05-10 12:08:52 +03:00
Aydın Mercan	09e4fb2ffa	Return the old counter value in `isc_stats_increment` Returning the value allows for better high-water tracking without running into edge cases like the following: 0. The counter is at value X 1. Increment the value (X+1) 2. The value is decreased multiple times in another threads (X+1-Y) 3. Get the value (X+1-Y) 4. Update-if-greater misses the X+1 value which should have been the high-water	2024-05-10 12:08:52 +03:00

1 2 3 4 5 ...

15510 Commits