Since the queries sent towards root and TLD servers are now included in
the count (as a result of the fix for CVE-2020-8616),
"max-recursion-queries" has a higher chance of being exceeded by
non-attack queries. Increase its default value from 75 to 100.
Fall back to TCP when we have already seen a DNS COOKIE response
from the given address but don't have one in this UDP response. This
could be a server that has turned off DNS COOKIE support, a
misconfigured anycast server with partial DNS COOKIE support, or a
spoofed response. Falling back to TCP is the correct behaviour in
all 3 cases.
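A rough sketch of that decision point (the helper names here are
hypothetical; DNS_FETCHOPT_TCP is the real retry-over-TCP option flag):

/* Hypothetical sketch: we have seen a COOKIE from this address
 * before, but this UDP response carries none - retry over TCP. */
if (address_has_sent_cookie(addrinfo) &&
    !response_contains_cookie(rmessage)) {
        query->options |= DNS_FETCHOPT_TCP;
        fctx_retryquery(fctx); /* hypothetical helper */
}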
DNS Flag Day 2020 aims to remove the IP fragmentation problem from
UDP DNS communication. In this commit, we implement the required
changes and simplify the logic for picking the EDNS buffer size.
1. The defaults for `edns-udp-size`, `max-udp-size` and
`nocookie-udp-size` have been changed to `1232` (the value picked by
DNS Flag Day 2020).
2. The probing heuristics that would try 512->4096->1432->1232 buffer
sizes have been removed; the resolver will now always use just the
`edns-udp-size` value.
3. Instead of just disabling the PMTUD mechanism on the UDP sockets, we
now set the IP_DONTFRAG (IPV6_DONTFRAG) flag. This means that UDP
packets will never be fragmented; if the ICMP packets are lost, the UDP
query will simply time out and eventually be retried over TCP.
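A minimal, self-contained sketch of point 3, assuming POSIX sockets;
IP_DONTFRAG is not available on every platform (Linux, for instance,
only offers IP_MTU_DISCOVER for IPv4), hence the guards:

#include <netinet/in.h>
#include <sys/socket.h>

static void
set_dontfragment(int fd, int family) {
        int on = 1;

        if (family == AF_INET6) {
#if defined(IPV6_DONTFRAG)
                /* RFC 3542: never fragment outgoing IPv6 packets */
                (void)setsockopt(fd, IPPROTO_IPV6, IPV6_DONTFRAG, &on,
                                 sizeof(on));
#endif
        } else {
#if defined(IP_DONTFRAG)
                /* BSD/macOS-style don't-fragment flag */
                (void)setsockopt(fd, IPPROTO_IP, IP_DONTFRAG, &on,
                                 sizeof(on));
#elif defined(IP_MTU_DISCOVER) && defined(IP_PMTUDISC_DO)
                /* Linux: IP_PMTUDISC_DO sets the DF bit on datagrams */
                int action = IP_PMTUDISC_DO;
                (void)setsockopt(fd, IPPROTO_IP, IP_MTU_DISCOVER,
                                 &action, sizeof(action));
#endif
        }
}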
The dns_message_create() function cannot soft fail (as all memory
allocations either succeed or cause an abort), so change the function to
return void and clean up the calls.
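The shape of the change, as declared in <dns/message.h> (shown for
illustration):

/* before: callers had to check a result that could never fail */
isc_result_t
dns_message_create(isc_mem_t *mctx, unsigned int intent,
                   dns_message_t **msgp);

/* after: allocation failure aborts internally, nothing to check */
void
dns_message_create(isc_mem_t *mctx, unsigned int intent,
                   dns_message_t **msgp);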
This commit fixes the problems that arose when moving the dns_message_t
object from fetchctx_t to the query structure.
The lifetime of a query object differs from that of a fetchctx, and the
dns_message_t object held by the query may still be in use by some
external module (e.g. the validator) even after the query has been
destroyed; proper handling of references to the message was therefore
added in this commit to avoid accessing an already destroyed object.
Specifically, in rctx_done(), a reference to the message is attached at
the beginning of the function and detached at the end, since a possible
call to fctx_cancelquery() would release the dns_message_t object, while
the subsequent calls to rctx_nextserver() or rctx_chaseds() still
require a valid pointer to the same object.
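A simplified sketch of that pattern (arguments and error handling
omitted; dns_message_attach()/dns_message_detach() are the actual
reference-counting calls):

static void
rctx_done(respctx_t *rctx, isc_result_t result) {
        dns_message_t *message = NULL;

        /* Hold the message: fctx_cancelquery() below may drop the
         * query's own reference and destroy the query. */
        dns_message_attach(rctx->query->rmessage, &message);

        /* ... fctx_cancelquery(), then rctx_nextserver() or
         * rctx_chaseds(), both of which still need the message ... */

        dns_message_detach(&message);
}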
In valcreate(), a new reference is attached to the message object; this
ensures that if the corresponding query object is destroyed before the
validator attempts to access the message, no invalid pointer access
occurs.
In validated(), we have to attach a new reference to the message, since
the validator object is destroyed at the beginning of the function and
the message is still needed further down in the same function.
The rctx_nextserver() and rctx_chaseds() functions were adapted to
receive a new parameter of type dns_message_t *, so that they are handed
a valid reference to the message: accessing it through the response
context (rctx->query->rmessage) could yield an already released
reference if the query was canceled.
The assertion failure REQUIRE(msg->state == DNS_SECTION_ANY),
triggered by calling dns_message_setclass() from resquery_response()
in resolver.c, was caused by incorrect management of the dns_message_t
objects used to process responses to the queries issued by the resolver.
Before the fix, a resolver's fetch context (fetchctx_t) held a pointer
to the message, and this same reference was then reused across all the
attempts to resolve the query (trying the next server, etc.); for this
to work, the message object had its state reset between iterations,
marking it as ready for new processing.
The problem arose in a scenario with many different forwarders
configured: the management of the dns_message_t object's state lacked
proper synchronization, which led to an invalid dns_message_t state in
resquery_response().
Instead of adding unnecessarily complex code to synchronize the object,
the dns_message_t object was moved from the fetchctx_t structure to the
query structure, where it belongs: since each query produces a response,
an associated dns_message_t is now created whenever a new query is
created.
This commit deals mainly with moving the dns_message_t object from fetchctx_t
to the query structure.
Before this commit, BIND was unable to resolve ip6.arpa names like
the one reported in issue #1847 when using query minimization.
As reported in the issue, an attempt to resolve a name like
'rec-test-dom-158937817846788.test123.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.2.0.3.4.3.5.4.0.8.2.6.0.1.0.0.2.ip6.arpa'
using default settings would fail.
The reason was that the query minimization algorithm in
'fctx_minimize_qname' would divide any ip6.arpa name into an increasing
number of labels (7, 11, ... up to 35), thus capping the minimized
destination name at 35 labels.
If the last query minimization attempt (with 35 labels) failed with
NXDOMAIN, BIND would attempt the query minimization again with the exact
same QNAME, still capped at 35 labels, and that in turn would fail again.
This fix avoids this failure loop by taking into account the extra
labels that may appear in the leftmost part of an ip6.arpa name, beyond
the IPv6 part. For example, the name reported in the issue consists of
32 nibble labels plus 'ip6.arpa' plus two extra leftmost labels, i.e. 36
labels in total - one more than the old cap.
QNAME minimization is normally disabled when forwarding. If, in the
course of processing a fetch, we switch back to normal recursion at
some point, we can't safely start minimizing because we may have
been left in an inconsistent state.
Created the isc_refcount_decrement_expect macro to conditionally test
the return value and ensure it is in the expected range. Converted
unchecked isc_refcount_decrement calls to isc_refcount_decrement_expect.
Converted INSIST(isc_refcount_decrement()...) to
isc_refcount_decrement_expect.
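A sketch of what such a macro can look like (the real definition lives
in <isc/refcount.h> and may differ in detail):

#define isc_refcount_decrement_expect(rc, check)                  \
        do {                                                      \
                uint_fast32_t _refs = isc_refcount_decrement(rc); \
                INSIST(_refs check);                              \
        } while (0)

/* usage: abort unless the pre-decrement count was greater than one */
isc_refcount_decrement_expect(&fctx->references, > 1);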
When named acting as a resolver connects to an authoritative server over
TCP, it sets the idle timeout for that connection to 20 seconds. This
fixed timeout was picked back when the default processing timeout for
each client query was hardcoded to 30 seconds. Commit
000a8970f840a0c27c5cc404826853c4674362ac made this processing timeout
configurable through "resolver-query-timeout" and decreased its default
value to 10 seconds, but the idle TCP timeout was not adjusted to
reflect that change. As a result, with the current defaults in effect,
a single hung TCP connection will consistently cause the resolution
process for a given query to time out.
Set the idle timeout for connected TCP sockets to half of the client
query processing timeout configured for a resolver. This allows named
to handle hung TCP connections more robustly and prevents the timeout
mismatch issue from resurfacing in the future if the default is ever
changed again.
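The derivation itself is a one-liner; with the default
"resolver-query-timeout" of 10 seconds it yields a 5-second idle timeout
instead of the old fixed 20 seconds (variable names illustrative):

/* half of the per-client query processing timeout, in milliseconds */
unsigned int tcp_idle_timeout_ms = resolver_query_timeout_ms / 2;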
When we're coming back from recursion, fetch_callback does not accept
DNS_R_NXDOMAIN as an rcode - query_gotanswer calls query_nxdomain, in
which an assertion fails on qctx->is_zone. Yet, under some
circumstances, qname minimization will return a DNS_R_NXDOMAIN - when a
root zone mirror is not yet loaded. The fix changes the DNS_R_NXDOMAIN
answer to DNS_R_SERVFAIL.
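The fix boils down to a remapping along these lines (context
simplified):

if (result == DNS_R_NXDOMAIN) {
        /* qname minimization can surface NXDOMAIN while a root zone
         * mirror is still loading; don't let it reach query_nxdomain() */
        result = DNS_R_SERVFAIL;
}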
We were passing the client address to dns_resolver_createfetch as a
pointer, and it was saved as a pointer. The client (with its address)
could be gone before the fetch finished, and in a very odd scenario
log_formerr would call isc_sockaddr_format(), which first checks whether
the address family is valid (at this point it still is); the sockaddr
is then cleared, and the subsequent call to isc_netaddr_fromsockaddr
fails an assertion because the address family is now invalid.
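The idea of the fix, sketched with illustrative field names: store a
copy of the address in the fetch context instead of the caller's
pointer.

if (client != NULL) {
        /* isc_sockaddr_t struct copy; the fetch no longer depends on
         * the lifetime of the caller's object */
        fctx->client = *client;
        fctx->hasclient = true;
}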
Replace an existing comment with a more verbose explanation of when the
"hint" variable is set in resquery_send() and how its value affects the
advertised UDP buffer size in outgoing queries.
If "edns-udp-size" is set in a "server" block matching the queried
server, it is accounted for in the process of determining the advertised
UDP buffer size, but its value may still be overridden before the query
is sent. This behavior contradicts the ARM, which claims that when set,
the server-specific "edns-udp-size" value is used for all EDNS queries
sent to a given server.
Furthermore, calling dns_peer_getudpsize() with the "udpsize" variable
as an argument makes the code hard to follow as that call may either
update the value of "udpsize" or leave it untouched.
Ensure the code matches the documentation by moving the
dns_peer_getudpsize() call below all other blocks of code potentially
affecting the advertised UDP buffer size, which is where it was located
when server-specific "edns-udp-size" support was first implemented [1].
Improve code readability by calling dns_peer_getudpsize() with a helper
variable instead of "udpsize".
[1] see commit 1c153afce556ff3c687986fb7c4a0b0a7f5e7cd8
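A sketch of the helper-variable pattern described above (surrounding
code omitted):

uint16_t peerudpsize = 0;
if (peer != NULL &&
    dns_peer_getudpsize(peer, &peerudpsize) == ISC_R_SUCCESS) {
        /* the server-specific "edns-udp-size" is applied last, so
         * nothing below this point can override it */
        udpsize = peerudpsize;
}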
When the DNS_FETCHOPT_EDNS512 flag was first introduced [1], it enforced
advertising a 512-byte UDP buffer size in an outgoing query. Ever since
EDNS processing code got updated [2], that flag has still been set upon
detection of certain query timeout patterns, but it has no longer been
affecting the calculations of the advertised UDP buffer size in outgoing
queries. Restore the original semantic meaning of DNS_FETCHOPT_EDNS512
by
ensuring the advertised UDP buffer size is set to 512 bytes when that
flag is set. Update existing comments and add new ones to improve code
readability.
[1] see commit 08c90261660649ca7d92065f6f13a61ec5a9a86d
[2] see commit 8e15d5eb3a000f1341e6bea0ddbc28d6dd2a0591
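The restored semantics amount to a final clamp (variable names
illustrative):

if ((query->options & DNS_FETCHOPT_EDNS512) != 0) {
        /* advertise a 512-octet EDNS buffer for this query */
        udpsize = 512;
}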
The following message:
success resolving '<name>' (in '<domain>'?) after reducing the advertised EDNS UDP packet size to 512 octets
can currently be logged even if the EDNS UDP buffer size advertised in
queries sent to a given server had already been set to 512 octets before
the fetch context was created (e.g. due to the server responding
intermittently). In other words, this log message may be misleading as
lowering the advertised EDNS UDP buffer size may not be the actual cause
of <name> being successfully resolved. Remove the log message in
question to prevent confusion.
As this log message is the only existing user of the "reason" field in
struct fetchctx, remove that field as well, along with all the code
related to it.
If there are more than 5 NS records for a zone, only perform a
maximum of 4 address lookups for all the name servers. This
limits the number of remote lookups performed for server
addresses at each level for a given query.
The rewrite of the BIND 9 build system is a large piece of work and
cannot reasonably be split into separate merge requests. The addition of
automake has a positive effect on the readability and maintainability of
the build system: it is more declarative, it allows conditionals, and we
are able to drop all of the custom make code that BIND 9 developed over
the years to overcome the deficiencies of autoconf + custom Makefile.in
files.
This squashed commit contains the following changes:
- conversion (or rather fresh rewrite) of all Makefile.in files to Makefile.am
by using automake
- libtool is now properly integrated with automake (the way we used it
before was rather hackish, as the only officially supported way to use
libtool is via automake)
- the dynamic module loading was rewritten from a custom patchwork to libtool's
libltdl (which includes the patchwork to support module loading on different
systems internally)
- conversion of the unit test executor from kyua to automake parallel driver
- conversion of the system test executor from custom make/shell to automake
parallel driver
- The GSSAPI support has been refactored and the custom SPNEGO
implementation has been removed, on the basis that all major KRB5/GSSAPI
(mit-krb5, heimdal and Windows) implementations support the SPNEGO
mechanism.
- The various defunct tests from bin/tests have been removed:
bin/tests/optional and bin/tests/pkcs11
- The text files generated from the MD files have been removed;
Markdown is designed to be readable by both humans and computers
- The xsl header is now generated by a simple sed command instead of a
perl helper
- The <irs/platform.h> header has been removed
- cleanups of the configure.ac script to make it simpler, and addition
of multiple macros (there's still work to be done though)
- the tarball can now be prepared with `make dist`
- the system tests are partially able to run in an out-of-tree build
Here's a list of unfinished work that needs to be completed in subsequent merge
requests:
- `make distcheck` doesn't yet work (because the out-of-tree run of the
system tests is not yet finished)
- documentation is not yet built; there's a different merge request with
the docbook to sphinx-build rst conversion that needs to be rebased and
adapted on top of the automake work
- the msvc build is not functional yet and we need to decide whether we
will just cross-compile bind9 using mingw-w64 or fix the msvc build
- contributed dlz modules are not included in either the autoconf or the
automake build
BIND wasn't honoring the "deny-answer-aliases" option when configured
to forward queries.
Before the fix it was possible for nameservers listed in the
"forwarders" option to return CNAME answers pointing to domains
unrelated to the original query, which could be used as a vector for
rebinding attacks.
The fix ensures that BIND applies the filters even when configured as a
forwarder instance.
(cherry picked from commit af6a4de3d5ad6c1967173facf366e6c86b3ffc28)
This commit slightly simplifies the lock management within the
dns_resolver_prime() and prime_done() functions by turning the
resolver's "priming" attribute into an atomic_bool and by leaving only
one object dependent on the "primelock" lock, namely the "primefetch"
attribute.
Having the "priming" attribute as an atomic type saves us from having
to take a lock just to test whether priming is on or off for the given
resolver context object within the dns_resolver_prime() function.
The "primelock" lock is still necessary, since dns_resolver_prime() function
internally calls dns_resolver_createfetch(), and whenever this function
succeeds it registers an event in the task manager which could be called by
another thread, namely the "prime_done" function, and this function is
responsible for disposing the "primefetch" attribute in the resolver object,
also for resetting "priming" attribute to false.
It is important that the invariant "priming == false AND primefetch ==
NULL" holds, so that any thread calling dns_resolver_prime() knows for
sure that if the "priming" attribute is false, the "primefetch"
attribute is also NULL, and thus a new fetch context can be created to
fulfill this purpose and assigned to "primefetch" under the lock's
protection.
To honor the explanation above, dns_resolver_prime() is implemented as
follows:
1. Atomically checks the attribute "priming" for the given resolver context.
2. If "priming" is false, assumes that "primefetch" is NULL (this is
ensured by the "prime_done" implementation), acquire "primelock"
lock and create a new fetch context, update "primefetch" pointer to
point to the newly allocated fetch context.
3. If "priming" is true, assumes that the job is already in progress,
no locks are acquired, nothing else to do.
To keep the invariant above consistent, prime_done() is implemented as
follows:
1. Acquire "primefetch" lock.
2. Keep a reference to the current "primefetch" object;
3. Reset "primefetch" attribute to NULL.
4. Release "primefetch" lock.
5. Atomically update "priming" attribute to false.
6. Destroy the "primefetch" object by using the temporary reference.
This ensures that if "priming" is false, "primefetch" was already reset to NULL.
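A simplified sketch of prime_done() following that ordering (event
plumbing and error handling omitted):

static void
prime_done(isc_task_t *task, isc_event_t *event) {
        dns_resolver_t *res = event->ev_arg;
        dns_fetch_t *fetch = NULL;

        UNUSED(task);

        LOCK(&res->primelock);
        fetch = res->primefetch; /* keep a temporary reference */
        res->primefetch = NULL;
        UNLOCK(&res->primelock);

        /* only after primefetch is NULL may priming become false */
        atomic_store(&res->priming, false);

        dns_resolver_destroyfetch(&fetch);
        isc_event_free(&event);
}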
Having the "priming" attribute unprotected by a lock makes no
difference, since the visible state of this variable depends only on the
calling order of the dns_resolver_prime() and prime_done() functions.
As an example, suppose that instead of using an atomic for the "priming"
attribute we employed a lock to protect it.
Now suppose that prime_done() is called by Thread A, which is then
preempted before acquiring the lock, thus not yet resetting "priming" to
false.
In parallel, suppose that Thread B is scheduled and calls
dns_resolver_prime(); it acquires the lock, checks that "priming" is
true, and thus considers that this resolver object is already priming
and does no further work.
Conversely, if the lock was acquired in the other order, Thread B would
see that "priming" is false (since prime_done() acquired the lock first
and set "priming" to false) and would initiate a priming fetch for this
resolver.
An atomic variable doesn't change this behavior: it behaves exactly the
same, depending on the function call order, except that it avoids having
to use a lock.
There should be no side effects resulting from this change: the previous
implementation employed the resolver's more general "lock" mutex, which
is used in far more contexts, but within dns_resolver_prime() and
prime_done() it only protected the "primefetch" and "priming"
attributes, which are not used in any of the other critical sections
guarded by the same lock and thus have zero dependency on them.
Both clang-tidy and uncrustify choke on statements like this:
for (...)
if (...)
break;
This commit uses a very simple semantic patch (below) to add braces around such
statements.
Semantic patch used:
@@
statement S;
expression E;
@@
while (...)
- if (E) S
+ { if (E) { S } }
@@
statement S;
expression E;
@@
for (...;...;...)
- if (E) S
+ { if (E) { S } }
@@
statement S;
expression E;
@@
if (...)
- if (E) S
+ { if (E) { S } }
Also disable the semantic patch, as the code needs tweaks here and
there: some destroy functions might not destroy the object and may
return early if the object is still in use.
Found by LGTM.com (see below for the description); while this should not
happen, as the EDNS OPT RDLEN is a uint16_t, the fix is easy. A little
bit of cleanup is included too.
> In a loop condition, comparison of a value of a narrow type with a value
> of a wide type may result in unexpected behavior if the wider value is
> sufficiently large (or small). This is because the narrower value may
> overflow. This can lead to an infinite loop.
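An illustrative (not the actual BIND) instance of the bug class:

#include <stdint.h>

void
process_options(uint16_t rdlen) {
        /* "i" is narrower than "rdlen": with rdlen == 300, "i" wraps
         * from 255 back to 0 and the loop never terminates */
        for (uint8_t i = 0; i < rdlen; i++) {
                /* process option i ... */
        }
}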
FCTX_ATTR_SHUTTINGDOWN needs to be set and tested while holding the node
lock, but the rest of the attributes don't, as they are task-locked.
Making fctx->attributes atomic allows both behaviours without races.
This prevents races on fctx->client whenever a new fetch joins an
existing fetch (by calling fctx_join), as fctx->client is now invariant
for the active life of the fctx.
The dns_adb_beginudpfetch() function is called only for UDP queries,
but dns_adb_endudpfetch() is called for all queries, including
TCP. This messes up the quota counting in adb.c.
If a TCP connection fails while attempting to send a query to a server,
the fetch context will be restarted without marking the target server as
a bad one. If this happens for a server which:
- was already marked with the DNS_FETCHOPT_EDNS512 flag,
- responds to EDNS queries with the UDP payload size set to 512 bytes,
- does not send response packets larger than 512 bytes,
and the response for the query being sent is larger than 512 bytes, then
named will pointlessly alternate between sending UDP queries with EDNS
UDP payload size set to 512 bytes (which are responded to with truncated
answers) and TCP connections until the fetch context retry limit is
reached. Prevent such query loops by marking the server as bad for a
given fetch context if the advertised EDNS UDP payload size for that
server gets reduced to 512 bytes and it is impossible to reach it using
TCP.
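Sketched, the loop-prevention check looks roughly like this
(mark_bad_for_fetchctx() is a hypothetical stand-in for the internal
helper):

if ((query->options & DNS_FETCHOPT_EDNS512) != 0 &&
    (query->options & DNS_FETCHOPT_TCP) != 0) {
        /* EDNS buffer already reduced to 512 octets and TCP failed
         * too: stop alternating and mark this server bad for the
         * current fetch context */
        mark_bad_for_fetchctx(fctx, query->addrinfo);
}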