mir/bind - bind - Mike's Git repositories

mir/bind

mirror of https://gitlab.isc.org/isc-projects/bind9 synced 2025-08-30 14:07:59 +00:00

Author	SHA1	Message	Date
Aram Sargsyan	5367ccb561	Adjust the resolver-query-timeout test Since the read timeout now works, the resolver time outs from the dispatch level instead of from the "hung fetch" timer, and so the EDE value in 'fctx_expired()' is not being set. Remove the expected EDE value from the test.	2025-01-22 13:40:45 +00:00
Aram Sargsyan	87c453850c	Fix rtt calculation bug for TCP in the resolver When TCP is used, 'fctx_query()' adds one second to the rtt (round-trip time) value, but there's a bug when the decision about using TCP is made already after the calculation. Move the block of the code which looks up the peers list to decide whether to use TCP into a place that's before the rtt calculation is performed. This commit doesn't add or remove any code, it just moves the code and adds a comment block.	2025-01-22 13:40:45 +00:00
Aram Sargsyan	e61ba5865f	Use a suitable response in tcp_connected() when initiating a read When 'ISC_R_TIMEDOUT' is received in 'tcp_recv()', it times out the oldest response in the active responses queue, and only after that it checks whether other active responses have also timed out. So when setting a timeout value for a read operation after a successful connection, it makes sense to take the timeout value from the oldest response in the active queue too, because, theoretically, the responses can have different timeout values, e.g. when the TCP dispatch is shared. Currently 'resp' is always NULL. Previously when connect and read timeouts were not separated in dispatch this affected only logging, but now since we are setting a new timeout after a successful connection, we need to choose a suitable response from the active queue.	2025-01-22 13:40:45 +00:00
Ondřej Surý	48471fd50c	fix: usr: Avoid unnecessary locking in the zone/cache database Prevent lock contention among many worker threads referring to the same database node at the same time. This would improve zone and cache database performance for the heavily contended database nodes. Closes #5130 Merge branch '5130-reduce-lock-contention-in-decrement-reference' into 'main' See merge request isc-projects/bind9!9963	2025-01-22 13:27:40 +00:00
JINMEI Tatuya	7f4471594d	Optimize database decref by avoiding locking with refs > 1 Previously, this function always acquires a node write lock if it might need node cleanup in case the reference decrements to 0. In fact, the lock is unnecessary if the reference is larger than 1 and it can be optimized as an "easy" case. This optimization could even be "necessary". In some extreme cases, many worker threads could repeat acquring and releasing the reference on the same node, resulting in severe lock contention for nothing (as the ref wouldn't decrement to 0 in most cases). This change would prevent noticeable performance drop like query timeout for such cases. Co-authored-by: JINMEI Tatuya <jtatuya@infoblox.com> Co-authored-by: Ondřej Surý <ondrej@isc.org>	2025-01-22 14:27:13 +01:00
Ondřej Surý	3fe440f0cf	chg: dev: Shutdown the fetch context after canceling the last fetch Shutdown the fetch context immediately after the last fetch has been canceled from that particular fetch context. Merge branch 'ondrej/shutdown-the-fetch-context-early' into 'main' See merge request isc-projects/bind9!9958	2025-01-22 13:21:23 +00:00
Ondřej Surý	9f945c8b67	Shutdown the fetch context after canceling the last fetch Currently, the fetch context will continue running even when the last fetch (response) has been removed from the context, so named can process and cache the answer. This can lead to a situation where the number of outgoing recursing clients exceeds the the configured number for recursive-clients. Be more stringent about the recursive-clients limit and shutdown the fetch context immediately after the last fetch has been canceled from that particular fetch context.	2025-01-22 14:19:20 +01:00
Ondřej Surý	0673568c17	fix: usr: Apply the memory limit only to ADB database items Resolver under heavy-load could exhaust the memory available for storing the information in the Address Database (ADB) effectively evicting already stored information in the ADB. The memory used to retrieve and provide information from the ADB is now not a subject of the same memory limits that are applied for storing the information in the Address Database. Closes #5127 Merge branch '5127-change-ADB-memory-split' into 'main' See merge request isc-projects/bind9!9954	2025-01-22 13:14:40 +00:00
Ondřej Surý	05faff6d53	Remove memory limit on ADB finds and fetches Address Database (ADB) shares the memory for the short lived ADB objects (finds, fetches, addrinfo) and the long lived ADB objects (names, entries, namehooks). This could lead to a situation where the resolver-heavy load would force evict ADB objects from the database to point where ADB is completely empty, leading to even more resolver-heavy load. Make the short lived ADB objects use the other memory context that we already created for the hashmaps. This makes the ADB overmem condition to not be triggered by the ongoing resolver fetches.	2025-01-22 14:13:35 +01:00
Arаm Sаrgsyаn	3f490fe3fb	chg: dev: Separate the connect and the read TCP timeouts in dispatch The network manager layer has two different timers with their own timeout values for TCP connections: connect timeout and read timeout. Separate the connect and the read TCP timeouts in the dispatch module too. Closes #5009 Merge branch '5009-dispatch-separate-connect-and-read-timeouts' into 'main' See merge request isc-projects/bind9!9698	2025-01-22 12:58:29 +00:00
Aram Sargsyan	612d76b83d	Remove dispatch timeout INT16_MAX limitation In some places there was a limitation of the maximum timeout value of INT16_MAX, which is only about 32 seconds. Refactor the code to remove the limitation.	2025-01-22 11:57:53 +00:00
Aram Sargsyan	64ffbe82c0	Separate the connect and the read timeouts in dispatch The network manager layer has two different timers with their own timeout values for TCP connections: connect timeout and read timeout. Separate the connect and the read TCP timeouts in the dispatch module too.	2025-01-22 11:57:52 +00:00
Aram Sargsyan	114555ea65	dispatch_test: make client timeouts shorter Use shorter timeouts for the client to ensure that the clients time out before the server.	2025-01-22 11:52:24 +00:00
Aram Sargsyan	9ccd1be482	Update the dns_dispatch_add() function's documentation The 'timedout' callback no longer exists. Remove the mentioning of the 'timedout' callback.	2025-01-22 11:52:24 +00:00
Colin Vidal	65c557c536	new: nil: ignore TAGS files Merge branch 'colin/ignoreTAGS' into 'main' See merge request isc-projects/bind9!9956	2025-01-22 11:22:41 +00:00
Colin Vidal	2164ea8abd	ignore TAGS files TAGS file are generated from `make tags` using etags. Other index tags are already ignored (GTAGS, GPATH, etc.). Also ignoring `TAGS`.	2025-01-22 10:42:35 +00:00
Colin Vidal	1732346fcc	rem: dev: remove fields from struct fetchctx struct fetchctx does have several fields which are now unused or confusing, removing those. Merge branch 'colin/remove-fctx-validator' into 'main' See merge request isc-projects/bind9!9945	2025-01-22 10:31:22 +00:00
Colin Vidal	c9529c0acb	remove ISC_LINK(link) property from fetchctx Likely because of historical reasons, struct fetchctx does have a list link property but is never used as a list. Remove this link property.	2025-01-22 09:56:09 +00:00
Colin Vidal	93e6e72eb6	remove validator link form fetchctx struct fetchctx does have a list of pending validators as well as a pointer to the HEAD validator. Remove the validator pointer to avoid confusion, as there is no perticular reasons to have it directly accessible outside of the list.	2025-01-22 09:56:09 +00:00
Andoni Duarte	87b0c1c1a0	chg: doc: Set up version for BIND 9.21.5 Merge branch 'andoni/set-up-version-for-bind-9.21.5' into 'main' See merge request isc-projects/bind9!9968	2025-01-22 08:33:12 +00:00
Andoni Duarte Pintado	bdef1e2176	Update BIND version to 9.21.5-dev	2025-01-21 15:58:51 +01:00
Nicki Křížek	0f626b8cc3	Update BIND version for release v9.21.4	2025-01-20 13:54:00 +01:00
Nicki Křížek	4a0a598cc2	new: doc: Prepare documentation for BIND 9.21.4 Merge branch 'andoni/prepare-documentation-for-bind-9.21.4' into 'v9.21.4-release' See merge request isc-private/bind9!772	2025-01-20 12:52:22 +00:00
Michał Kępień	70187b67ae	Reorder release notes	2025-01-17 22:53:57 +01:00
Michał Kępień	e9003901a7	Add release note for GL #5099	2025-01-17 22:53:57 +01:00
Andoni Duarte Pintado	fa4c45d9e8	Tweak and reword release notes	2025-01-17 22:53:57 +01:00
Andoni Duarte Pintado	84f36eaa83	Fix broken option reference in the ARM	2025-01-17 22:53:57 +01:00
Andoni Duarte Pintado	0937207606	Prepare release notes for BIND 9.21.4	2025-01-16 10:39:11 +01:00
Andoni Duarte Pintado	b6ccbcbcf1	Generate changelog for BIND 9.21.4	2025-01-16 10:20:01 +01:00
Andoni Duarte	bddaff3210	[CVE-2024-12705] sec: usr: DNS-over-HTTP(s) flooding fixes Fix DNS-over-HTTP(S) implementation issues that arise under heavy query load. Optimize resource usage for :iscman:`named` instances that accept queries over DNS-over-HTTP(S). Previously, :iscman:`named` would process all incoming HTTP/2 data at once, which could overwhelm the server, especially when dealing with clients that send requests but don't wait for responses. That has been fixed. Now, :iscman:`named` handles HTTP/2 data in smaller chunks and throttles reading until the remote side reads the response data. It also throttles clients that send too many requests at once. Additionally, :iscman:`named` now carefully processes data sent by some clients, which can be considered "flooding." It logs these clients and drops connections from them. :gl:`#4795` In some cases, :iscman:`named` could leave DNS-over-HTTP(S) connections in the `CLOSE_WAIT` state indefinitely. That also has been fixed. ISC would like to thank JF Billaud for thoroughly investigating the issue and verifying the fix. :gl:`#5083` See https://gitlab.isc.org/isc-projects/bind9/-/issues/4795 Closes https://gitlab.isc.org/isc-projects/bind9/-/issues/5083 Merge branch 'artem-improve-doh-resource-usage' into 'v9.21.4-release' See merge request isc-private/bind9!732	2025-01-15 14:42:44 +00:00
Artem Boldariev	937b5f8349	DoH: reduce excessive bad request logging We started using isc_nm_bad_request() more actively throughout codebase. In the case of HTTP/2 it can lead to a large count of useless "Bad Request" messages in the BIND log, as often we attempt to send such request over effectively finished HTTP/2 sessions. This commit fixes that.	2025-01-15 14:09:17 +00:00
Artem Boldariev	4ae4e255cf	Do not stop timer in isc_nm_read_stop() in manual timer mode A call to isc_nm_read_stop() would always stop reading timer even in manual timer control mode which was added with StreamDNS in mind. That looks like an omission that happened due to how timers are controlled in StreamDNS where we always stop the timer before pausing reading anyway (see streamdns_on_complete_dnsmessage()). That would not work well for HTTP, though, where we might want pause reading without stopping the timer in the case we want to split incoming data into multiple chunks to be processed independently. I suppose that it happened due to NM refactoring in the middle of StreamDNS development (at the time isc_nm_cancelread() and isc_nm_pauseread() were removed), as the StreamDNS code seems to be written as if timers are not stoping during a call to isc_nm_read_stop().	2025-01-15 14:09:17 +00:00
Artem Boldariev	609a41517b	DoH: introduce manual read timer control This commit introduces manual read timer control as used by StreamDNS and its underlying transports. Before that, DoH code would rely on the timer control provided by TCP, which would reset the timer any time some data arrived. Now, the timer is restarted only when a full DNS message is processed in line with other DNS transports. That change is required because we should not stop the timer when reading from the network is paused due to throttling. We need a way to drop timed-out clients, particularly those who refuse to read the data we send.	2025-01-15 14:09:17 +00:00
Artem Boldariev	3425e4b1d0	DoH: floodding clients detection This commit adds logic to make code better protected against clients that send valid HTTP/2 data that is useless from a DNS server perspective. Firstly, it adds logic that protects against clients who send too little useful (=DNS) data. We achieve that by adding a check that eventually detects such clients with a nonfavorable useful to processed data ratio after the initial grace period. The grace period is limited to processing 128 KiB of data, which should be enough for sending the largest possible DNS message in a GET request and then some. This is the main safety belt that would detect even flooding clients that initially behave well in order to fool the checks server. Secondly, in addition to the above, we introduce additional checks to detect outright misbehaving clients earlier: The code will treat clients that open too many streams (50) without sending any data for processing as flooding ones; The clients that managed to send 1.5 KiB of data without opening a single stream or submitting at least some DNS data will be treated as flooding ones. Of course, the behaviour described above is nothing else but heuristical checks, so they can never be perfect. At the same time, they should be reasonable enough not to drop any valid clients, realatively easy to implement, and have negligible computational overhead.	2025-01-15 14:09:17 +00:00
Artem Boldariev	9846f395ad	DoH: process data chunk by chunk instead of all at once Initially, our DNS-over-HTTP(S) implementation would try to process as much incoming data from the network as possible. However, that might be undesirable as we might create too many streams (each effectively backed by a ns_client_t object). That is too forgiving as it might overwhelm the server and trash its memory allocator, causing high CPU and memory usage. Instead of doing that, we resort to processing incoming data using a chunk-by-chunk processing strategy. That is, we split data into small chunks (currently 256 bytes) and process each of them asynchronously. However, we can process more than one chunk at once (up to 4 currently), given that the number of HTTP/2 streams has not increased while processing a chunk. That alone is not enough, though. In addition to the above, we should limit the number of active streams: these streams for which we have received a request and started processing it (the ones for which a read callback was called), as it is perfectly fine to have more opened streams than active ones. In the case we have reached or surpassed the limit of active streams, we stop reading AND processing the data from the remote peer. The number of active streams is effectively decreased only when responses associated with the active streams are sent to the remote peer. Overall, this strategy is very similar to the one used for other stream-based DNS transports like TCP and TLS.	2025-01-15 14:09:17 +00:00
Andoni Duarte	4d054cca7a	[CVE-2024-11187] sec: usr: Limit the additional processing for large RDATA sets When answering queries, don't add data to the additional section if the answer has more than 13 names in the RDATA. This limits the number of lookups into the database(s) during a single client query, reducing query processing load. See isc-projects/bind9#5034 Merge branch '5034-security-limit-additional' into 'v9.21.4-release' See merge request isc-private/bind9!750	2025-01-15 11:56:06 +00:00
Ondřej Surý	a1982cf1bb	Limit the additional processing for large RDATA sets Limit the number of records appended to ADDITIONAL section to the names that have less than 14 records in the RDATA. This limits the number of the lookups into the database(s) during single client query. Also don't append any additional data to ANY queries. The answer to ANY is already big enough.	2025-01-14 09:57:54 +00:00
Ondřej Surý	e51d4d3b88	Isolate using the -T noaa flag only for part of the resolver test Instead of running the whole resolver/ns4 server with -T noaa flag, use it only for the part where it is actually needed. The -T noaa could interfere with other parts of the test because the answers don't have the authoritative-answer bit set, and we could have false positives (or false negatives) in the test because the authoritative server doesn't follow the DNS protocol for all the tests in the resolver system test.	2025-01-14 09:57:54 +00:00
Ondřej Surý	8356179953	Rename the qpzone and qpcache methods that implement DB api All the database implementations share the same names for the methods implementing the database. That has some advantages like knowing what to expect, but it turns out that any time such method shows up in any kind of tracing - be it perf record, backtrace or anything else that uses symbol names, it is very hard to distinguish whether the find() belongs to qpcache, qpzone, builtin or sdlz implementation. Make at least the names for qpzone and qpcache unique.	2025-01-14 09:57:54 +00:00
Nicki Křížek	3a94afa03a	fix: usr: querying an NSEC3-signed zone for an empty record could trigger an assertion A bug in the qpzone database could trigger a crash when querying for a deleted name, or a newly-added empty non-terminal name, in an NSEC3-signed zone. This has been fixed. Closes #5108 Merge branch '5108-nsec3-empty-node' into 'main' See merge request isc-projects/bind9!9928	2025-01-14 08:34:16 +00:00
Evan Hunt	232dac8cd5	detect when closest-encloser name is too long there was a database bug in which dns_db_find() could get a partial match for the query name, but still set foundname to match the full query name. this triggered an assertion when query_addwildcardproof() assumed that foundname would be shorter. the database bug has been fixed, but in case it happens again, we can just copy the name instead of splitting it. we will also log a warning that the closest-encloser name was invalid.	2025-01-09 17:04:08 -08:00
Evan Hunt	71e1c91695	dns_nsec3_addnsec3() can fail when iterating back when adding a new NSEC3 record, dns_nsec3_addnsec3() uses a dbiterator to seek to the newly created node and then find its predecessor. dbiterators in the qpzone use snapshots, so changes to the database are not reflected in an already-existing iterator. consequently, when we add a new node, we have to create a new iterator before we can seek to it.	2025-01-09 17:04:08 -08:00
Evan Hunt	3e367a23f9	add a regression test for a new ENT node this test adds a record with empty non-terminal nodes above it. this has also been observed to trigger the crash in NSEC3 zones. NOTE: the test currently fails, because while there is no crash, the query results are not as expected. when we add a node below an ENT, receive_secure_serial() gets DNS_R_PARTIALMATCH, and the signed zone is never updated. this is not a regression from fixing the crash bug; it's a separate inline-signing bug.	2025-01-09 17:03:51 -08:00
Evan Hunt	7b94c34965	add a regression test for record deletion test that there's no crash when querying for a newly-deleted node. (incidentally also renamed ns3/named.conf.in to ns3/named1.conf.in, because named2.conf.in does exist, and they should match.)	2025-01-09 17:03:51 -08:00
Evan Hunt	ad4bab306c	qpzone find() function could set foundname incorrectly when a requested name is found in the QP trie during a lookup, but its records have been marked as nonexistent by a previous deletion, then it's treated as a partial match, but the foundname could be left pointing to the original qname rather than the parent. this could lead to an assertion failure in query_findclosestnsec3().	2025-01-09 17:03:51 -08:00
Michał Kępień	9636dc1a1e	fix: nil: Fix default IANA root zone mirror configuration Closes #5115 Merge branch '5115-fix-default-iana-root-zone-mirror-configuration' into 'main' See merge request isc-projects/bind9!9934	2025-01-09 11:22:07 +00:00
Michał Kępień	010d2eb436	Fix default IANA root zone mirror configuration Commit `b121f02eac` renamed the top-level "primaries" block in bin/named/config.c to "remote-servers". This configuration block lists the primary servers used for an IANA root zone mirror when no primary servers are explicitly specified for it in the configuration. However, the relevant part of the named_zone_configure() function only looks for a top-level "primaries" block and not for any of its synonyms. As a result, configuring an IANA root zone mirror with just: zone "." { type mirror; }; now results in a cryptic fatal error on startup: loading configuration: not found exiting (due to fatal error) Fix by using the correct top-level block name in named_zone_configure().	2025-01-09 12:16:48 +01:00
Arаm Sаrgsyаn	19a2aab136	fix: usr: Fix response policy zones and catalog zones with an $INCLUDE statement defined Response policy zones (RPZ) and catalog zones were not working correctly if they had an $INCLUDE statement defined. This has been fixed. Closes #5111 Merge branch '5111-includes-disable-rpz-and-catz-fix' into 'main' See merge request isc-projects/bind9!9930	2025-01-08 14:01:36 +00:00
Aram Sargsyan	d75bdabe51	Fix a typo in dns/master.h The ISC_R_SEENINCLUDE definition does not exist, the correct one is DNS_R_SEENINCLUDE.	2025-01-08 14:00:55 +00:00
Aram Sargsyan	3d7a9fba3b	Don't disable RPZ and CATZ for zones with an $INCLUDE statement The code in zone_startload() disables RPZ and CATZ for a zone if dns_master_loadfile() returns anything other than ISC_R_SUCCESS, which makes sense, but it's an error because zone_startload() can also return DNS_R_SEENINCLUDE upon success when the zone had an $INCLUDE statement.	2025-01-08 14:00:55 +00:00

... 10 11 12 13 14 ...

42936 Commits