mir/bind - bind - Mike's Git repositories

mir/bind

mirror of https://gitlab.isc.org/isc-projects/bind9 synced 2025-08-28 21:17:54 +00:00

Author	SHA1	Message	Date
Evan Hunt	7c5afebcdc	rename maybe_destroy() to maybe_cancel_validators() the maybe_destroy() function no longer destroys the fctx, so rename it and update the comments.	2022-04-27 10:54:28 -07:00
Evan Hunt	b4592d02a1	refactor fctx_done() to set fctx to NULL previously fctx_done() detached the fctx but did not clear the pointer passed into it from the caller. in some conditions, when rctx_done() was reached while waiting for a validator to complete, fctx_done() could be called twice on the same fetch, causing a double detach. fctx_done() now clears the fctx pointer, to reduce the chances of such mistakes.	2022-04-27 10:54:28 -07:00
Ondřej Surý	741a7096fc	Run resume_dslookup() from the correct task The rctx_chaseds() function calls dns_resolver_createfetch(), passing fctx->task as the target task to run resume_dslookup() from. This breaks task-based serialization of events as fctx->task is the task that the dns_resolver_createfetch() caller wants to receive its fetch completion event in; meanwhile, intermediate fetches started by the resolver itself (e.g. related to QNAME minimization) must use res->buckets[bucketnum].task instead. This discrepancy may cause trouble if the resume_dslookup() callback happens to be run concurrently with e.g. fctx_doshutdown(). Fix by passing the correct task to dns_resolver_createfetch() in rctx_chaseds().	2022-04-22 14:25:32 +02:00
Ondřej Surý	f0feaa3305	Remove isc_task_sendto(anddetach) functions The only place where isc_task_sendto() was used was in dns_resolver unit, where the "sendto" part was actually no-op, because dns_resolver uses bound tasks. Remove the isc_task_sendto() and isc_task_sendtoanddetach() functions in favor of using bound tasks create with isc_task_create_bound(). Additionally, cache the number of running netmgr threads (nworkers) locally to reduce the number of function calls.	2022-04-19 14:24:36 +02:00
Ondřej Surý	ae01ec2823	Don't use reference counting in isc_timer unit The reference counting and isc_timer_attach()/isc_timer_detach() semantic are actually misleading because it cannot be used under normal conditions. The usual conditions under which is timer used uses the object where timer is used as argument to the "timer" itself. This means that when the caller is using `isc_timer_detach()` it needs the timer to stop and the isc_timer_detach() does that only if this would be the last reference. Unfortunately, this also means that if the timer is attached elsewhere and the timer is fired it will most likely be use-after-free, because the object used in the timer no longer exists. Remove the reference counting from the isc_timer unit, remove isc_timer_attach() function and rename isc_timer_detach() to isc_timer_destroy() to better reflect how the API needs to be used. The only caveat is that the already executed event must be destroyed before the isc_timer_destroy() is called because the timer is no longet attached to .ev_destroy_arg.	2022-04-02 01:23:15 +02:00
Evan Hunt	e9ef3defa4	consolidate fibonacci hashing in one place Fibonacci hashing was implemented in four separate places (rbt.c, rbtdb.c, resolver.c, zone.c). This commit combines them into a single implementation. The hash_32() function is now replaced with isc_hash_bits32().	2022-03-28 14:44:21 -07:00
Ondřej Surý	20f0936cf2	Remove use of the inline keyword used as suggestion to compiler Historically, the inline keyword was a strong suggestion to the compiler that it should inline the function marked inline. As compilers became better at optimising, this functionality has receded, and using inline as a suggestion to inline a function is obsolete. The compiler will happily ignore it and inline something else entirely if it finds that's a better optimisation. Therefore, remove all the occurences of the inline keyword with static functions inside single compilation unit and leave the decision whether to inline a function or not entirely on the compiler NOTE: We keep the usage the inline keyword when the purpose is to change the linkage behaviour.	2022-03-25 08:33:43 +01:00
Ondřej Surý	584f0d7a7e	Simplify way we tag unreachable code with only ISC_UNREACHABLE() Previously, the unreachable code paths would have to be tagged with: INSIST(0); ISC_UNREACHABLE(); There was also older parts of the code that used comment annotation: /* NOTREACHED */ Unify the handling of unreachable code paths to just use: UNREACHABLE(); The UNREACHABLE() macro now asserts when reached and also uses __builtin_unreachable(); when such builtin is available in the compiler.	2022-03-25 08:33:43 +01:00
Ondřej Surý	fe7ce629f4	Add FALLTHROUGH macro for __attribute__((fallthrough)) Gcc 7+ and Clang 10+ have implemented __attribute__((fallthrough)) which is explicit version of the /* FALLTHROUGH / comment we are currently using. Add and apply FALLTHROUGH macro that uses the attribute if available, but does nothing on older compilers. In one case (lib/dns/zone.c), using the macro revealed that we were using the / FALLTHROUGH */ comment in wrong place, remove that comment.	2022-03-25 08:33:43 +01:00
Aram Sargsyan	f0f3370e14	Check if the fetch is shutting down in resume_dslookup() The fetch can be in the shutting down state when resume_dslookup() is trying to operate on it. This is also a security issue, because a malicious actor can set up a name server which delays certain queries in such a way that the fetch will time out and shut down, which will cause named to crash. Add a check to see if the fetch has the shutting down attribute set, and cancel any further operations on it in such case. A similar bug had been fixed earlier for the resume_qmin() function, see [GL #966].	2022-03-16 22:11:49 +01:00
Mark Andrews	fe1bbba259	Look for zones deeper than the current domain or forward name When caching glue, we need to ensure that there is no closer source of truth for the name. If the owner name for the glue record would be answered by a locally configured zone, do not cache.	2022-03-16 22:11:49 +01:00
Mark Andrews	c289913e5c	Check cached names for possible "forward only" clause When caching additional and glue data not from a forwarder, we must check that there is no "forward only" clause covering the owner name that would take precedence. Such names would normally be allowed by baliwick rules, but a "forward only" zone introduces a new baliwick scope.	2022-03-16 22:11:49 +01:00
Mark Andrews	7e37b5e379	Check that the forward declaration is unchanged and not overridden If we are using a fowarder, in addition to checking that names to be cached are subdomains of the forwarded namespace, we must also check that there are no subsidiary forwarded namespaces which would take precedence. To be safe, we don't cache any responses if the forwarding configuration has changed since the query was sent.	2022-03-16 22:11:49 +01:00
Mark Andrews	5dc3b25d03	Add additional name checks when using a forwarder When using a forwarder, check that the owner name of response records are within the bailiwick of the forwarded name space.	2022-03-16 22:11:49 +01:00
Ondřej Surý	79b5ccbf34	Implement isc_interval_t on top of isc_time_t Change the isc_interval_t implementation from separate data type and separate implementation to be shim implementation on top of isc_time_t. The distinction between isc_interval_t and isc_time_t has been kept because they are semantically different - isc_interval_t is relative and isc_time_t is absolute, but this allows isc_time_t and isc_interval_t to be freely interchangeable, f.e. this: isc_time_t t1; isc_interval_t interval; isc_time_t t2; isc_interval_set(interval, isc_time_seconds(t2), isc_time_nanoseconds(t2);; isc_time_subtract(t1, interval, t2); isc_interval_set(interval, isc_time_seconds(t2), isc_time_nanoseconds(t2)); to just: isc_time_t t1; isc_interval_t interval; isc_time_t t2; isc_time_subtract(t1, t2, interval); without introducing a whole set of new functions.	2022-03-14 13:00:05 -07:00
Ondřej Surý	e6ca2a651f	Refactor isc_timer_reset() use with semantic patch Add and apply semantic patch to remove expires argument from the isc_timer_reset() calls through the codebase.	2022-03-14 13:00:05 -07:00
Ondřej Surý	27850a5ad2	Change isc_timer_reset() usage to never use expires argument There were two places where expires argument (absolute isc_time_t value) was being used. Both places has been converted to use relative interval argument in preparation of simplification and refactoring of isc_timer API.	2022-03-14 13:00:05 -07:00
Ondřej Surý	c259cecc90	Refactor isc_timer_create() to just create timer The isc_timer_create() function was a bit conflated. It could have been used to create a timer and start it at the same time. As there was a single place where this was done before (see the previous commit for nta.c), this was cleaned up and the isc_timer_create() function was changed to only create new timer.	2022-03-14 13:00:05 -07:00
Michał Kępień	b645e28167	Add "UNUSED(fctx);" to FCTXTRACE() macro stubs Commit b6d40b3c4e4452cee6f7f06d889e4e8d113e9588 removed most uses of the 'fctx' variable from the rctx_dispfail() function: it is now only needed by the FCTXTRACE3() macro. However, when --enable-querytrace is not in effect, that macro evaluates to a list of UNUSED() macros that does not include "UNUSED(fctx);". This triggers the following compilation warning when building without --enable-querytrace: resolver.c: In function 'rctx_dispfail': resolver.c:7888:21: warning: unused variable 'fctx' [-Wunused-variable] 7888 \| fetchctx_t fctx = rctx->fctx; \| ^~~~ Fix by adding "UNUSED(fctx);" lines to all FCTXTRACE*() macros. This is safe to do because all of those macros use the 'fctx' local variable, so there is no danger of introducing new errors caused by use of undeclared identifiers.	2022-02-21 10:50:38 +01:00
Evan Hunt	b6d40b3c4e	correct TCP error handling in dispatch and resolver - certain TCP result codes, including ISC_R_EOF and ISC_R_CONNECTIONRESET, were being mapped to ISC_R_SHUTTINGDOWN before calling the response handler in tcp_recv_cancelall(). the result codes should be passed through to the response handler without being changed. - the response handlers, resquery_response() and req_response(), had code to return immediately if encountering ISC_R_EOF, but this is not the correct behavior; that should only happen in the case of ISC_R_CANCELED when it was the caller that canceled the operation - ISC_R_CONNECTIONRESET was not being caught in rctx_dispfail(). - removed code in rctx_dispfail() to retry queries without EDNS when receiving ISC_R_EOF; this is now treated the same as any other connection failure.	2022-02-17 09:59:12 +01:00
Michał Kępień	f286c845b0	Log the result of each resolver priming attempt When a resolver priming attempt completes, the following message is currently logged: resolver priming query complete This message is identical for both successful and failed priming attempts. Consider the following log excerpts: - successful priming attempt: 10-Feb-2022 11:33:11.272 all zones loaded 10-Feb-2022 11:33:11.272 running 10-Feb-2022 11:33:19.722 resolver priming query complete - failed priming attempt: 10-Feb-2022 11:33:29.978 all zones loaded 10-Feb-2022 11:33:29.978 running 10-Feb-2022 11:33:38.432 timed out resolving '_.org/A/IN': 2001:500:9f::42#53 10-Feb-2022 11:33:38.522 timed out resolving './NS/IN': 2001:500:9f::42#53 10-Feb-2022 11:33:42.132 timed out resolving '_.org/A/IN': 2001:500:12::d0d#53 10-Feb-2022 11:33:42.285 timed out resolving './NS/IN': 2001:500:12::d0d#53 10-Feb-2022 11:33:44.685 resolver priming query complete Include the result of each priming attempt in the relevant log message to give the administrator better insight into named's resolver priming process.	2022-02-16 13:26:12 +01:00
Ondřej Surý	58bd26b6cf	Update the copyright information in all files in the repository This commit converts the license handling to adhere to the REUSE specification. It specifically: 1. Adds used licnses to LICENSES/ directory 2. Add "isc" template for adding the copyright boilerplate 3. Changes all source files to include copyright and SPDX license header, this includes all the C sources, documentation, zone files, configuration files. There are notes in the doc/dev/copyrights file on how to add correct headers to the new files. 4. Handle the rest that can't be modified via .reuse/dep5 file. The binary (or otherwise unmodifiable) files could have license places next to them in <foo>.license file, but this would lead to cluttered repository and most of the files handled in the .reuse/dep5 file are system test files.	2022-01-11 09:05:02 +01:00
Mark Andrews	dc8595936c	remove broken-nsec and reject-000-label options	2021-12-23 15:13:46 +11:00
Evan Hunt	157d7bd0e9	incidental cleanups the 'dipsatchmgr->state' was never set, so the MGR_IS_SHUTTINGDOWN macro was always false. both of these have been removed. renamed the 'dispatch->state' field to 'tcpstate' to make its purpose less ambiguous. changed an FCTXTRACE log message from "response did not match question" to the more correctly descriptive "invalid question section".	2021-12-08 10:22:03 -08:00
Evan Hunt	4d4cea243a	restore the fetch lifetime timer the lifetime expiry timer for the fetch context was removed when we switched to using in-band netmgr timeouts. however, it turns out some dependency loops can occur between a fetch and the ADB the validator; these deadlocks were formerly broken when the timer fired, and now there's no timer. we can fix these errors individually, but in the meantime we don't want the server to get hung at shutdown because of dangling fetches. this commit puts back a single timer, which fires two seconds after the fetch should have completed, and shuts it down. it also logs a message at level INFO so we know about the problems when they occur.	2021-12-03 09:49:24 +01:00
Mark Andrews	0aaaa8768f	Reject NSEC records with next field with \000 label A number of DNS implementation produce NSEC records with bad type maps that don't contain types that exist at the name leading to NODATA responses being synthesize instead of the records in the zone. NSEC records with these bad type maps often have the NSEC NSEC field set to '\000.QNAME'. We look for the first label of this pattern. e.g. example.com NSEC \000.example.com SOA NS NSEC RRSIG example.com RRRSIG NSEC ... example.com SOA ... example.com RRRSIG SOA ... example.com NS ... example.com RRRSIG NS ... example.com A ... example.com RRRSIG A ... A is missing from the type map. This introduces a temporary option 'reject-000-label' to control this behaviour.	2021-12-02 14:27:18 +01:00
Mark Andrews	733f58a7a5	Allow servers that emit broken NSEC records to be identified 'server <prefix> { broken-nsec yes; };' can now be used to stop NSEC records from negative responses from servers in the given prefix being cached and hence available to synth-from-dnssec.	2021-12-02 14:27:14 +01:00
Mark Andrews	454c29046f	Check that SOA and DNSKEY are consistent in NSEC typemaps If there is a SOA record present then there should also be a DNSKEY record present as the DNSKEY is supposed to live at the zone apex like the SOA.	2021-12-02 14:24:37 +01:00
Mark Andrews	8ff2c133b5	Add dns_nsec_requiredtypespresent checks an NSEC rdataset to ensure that both NSEC and RRSIG are present in the type map. These types are required for the NSEC to be valid	2021-12-02 14:18:42 +01:00
Mark Andrews	c8a7f92b9e	Allow "black lies" to be cached "black lies" differ from "white lies" in that the owner name of the NSEC record matches the QNAME and the intent is to return NODATA instead of NXDOMAIN for all types. Caching this NSEC does not lead to unexpected behaviour on synthesis when the QNAME matches the NSEC owner which it does for the the general "white lie" response. "black lie" QNAME NSEC \000.QNAME NSEC RRSIG "white lie" QNAME- NSEC QNAME+ NSEC RRSIG where QNAME- is a name that is close to QNAME but sorts before QNAME and QNAME+ is a that is close to QNAME but sorts after QNAME. Black lies are safe to cache as they don't bring into existence names that are not intended to exist. "Black lies" intentional change NXDOMAIN to NODATA. "White lies" bring QNAME- into existence and named would synthesis NODATA for QNAME+ if it is queried for that name instead of discovering the, presumable, NXDOMAIN response. Note rejection NSEC RRsets with NEXT names starting with the label '\000' renders this change ineffective (see reject-000-label).	2021-12-02 14:18:41 +01:00
Mark Andrews	6fae151c9d	Do not cache minimal NSEC records (NSEC + RRSIG only) these are not useful for dnssec synthesis as they can result in false NODATA responses and just consume cache memory	2021-12-02 14:18:41 +01:00
Evan Hunt	326a4fc13b	fix a use-after-free in resolver when processing a mismatched response, we call dns_dispatch_getnext(). If that fails, for example because of a timeout, fctx_done() is called, which cancels all queries. This triggers a crash afterward when fctx_cancelquery() is called, and is unnecessary since fctx_done() would have been called later anyway.	2021-11-22 11:35:34 +01:00
Ondřej Surý	af0b8d0ba8	Enable lame response detection even with disabled lame cache Previously, when lame cache would be disabled by setting lame-ttl to 0, it would also disable lame answer detection. In this commit, we enable the lame response detection even when the lame cache is disabled. This enables stopping answer processing early rather than going through the whole answer processing flow.	2021-10-28 12:05:58 +02:00
Evan Hunt	128068fde2	check for loops in ADB finds If an ADB find is started on behalf of a resolver fetch, and fails to find any addresses but has a pending resolver fetch associated with it, then we need to check whether the fetch it's waiting on is the one that created it. If so, it can never finish and needs to be terminated.	2021-10-21 02:15:43 -07:00
Ondřej Surý	ef831e9dfe	Remove isc_mem_t from fctxbucket_t The fctxbucket_t properly attaches to the fetchctx_t, so it can safely use its memory context. Save a little bit of memory by removing own memory context from fctxbucket_t.	2021-10-19 09:18:14 +02:00
Ondřej Surý	b01d75be36	Use fibonacci hashing for zone counter buckets Change the 'dbuckets' hash table in resolver.c to use fibonacci hashing like the RBT.	2021-10-18 14:35:44 -07:00
Evan Hunt	09028dd38f	Simplify fctx_unlink() and fctx_destroy() These functions are always called together; this commit combines them.	2021-10-18 14:27:13 -07:00
Ondřej Surý	d609425bf3	Rewrite fetchctx_t reference counting Using proper attach/detach functions for the fetch context instead of fctx_increference() and _decreference() makes it easier to debug reference counting errors in the resolver. Fixed several such errors that were found as a result.	2021-10-18 14:27:13 -07:00
Evan Hunt	a55589f881	remove all references to isc_socket and related types Removed socket.c, socket.h, and all references to isc_socket_t, isc_socketmgr_t, isc_sockevent_t, etc.	2021-10-15 01:01:25 -07:00
Ondřej Surý	e603983ec9	Stop providing branch prediction information The __builtin_expect() can be used to provide the compiler with branch prediction information. The Gcc manual says[1] on the subject: In general, you should prefer to use actual profile feedback for this (-fprofile-arcs), as programmers are notoriously bad at predicting how their programs actually perform. Stop using __builtin_expect() and ISC_LIKELY() and ISC_UNLIKELY() macros to provide the branch prediction information as the performance testing shows that named performs better when the __builtin_expect() is not being used. 1. https://gcc.gnu.org/onlinedocs/gcc/Other-Builtins.html#index-_005f_005fbuiltin_005fexpect	2021-10-14 10:33:24 +02:00
Ondřej Surý	ecb74bae0d	dns/resolver.c: Return void when ISC_R_SUCCESS is only returned value With isc_mem_get() and dns_name_dup() no longer being able to fail, some functions can now only return ISC_R_SUCCESS. Change the return type to void for the following function(s): * dns_resolver_addalternate()	2021-10-13 05:47:48 +02:00
Ondřej Surý	2e3a2eecfe	Make isc_result a static enum Remove the dynamic registration of result codes. Convert isc_result_t from unsigned + #defines into 32-bit enum type in grand unified <isc/result.h> header. Keep the existing values of the result codes even at the expense of the description and identifier tables being unnecessary large. Additionally, add couple of: switch (result) { [...] default: break; } statements where compiler now complains about missing enum values in the switch statement.	2021-10-06 11:22:20 +02:00
Evan Hunt	436424c458	increment fctx references while waiting for validator We need to ensure the fctx isn't freed while the validator is pending.	2021-10-05 10:18:48 -07:00
Evan Hunt	24dbf9849e	refactor dispatch cancellation Renamed some functions for clarity and readability: - dns_dispatch_addresponse() -> dns_dispatch_add() - dns_dispatch_removeresponse() -> dns_dispatch_done() The dns_dispatch_cancel() function now calls dns_dispatch_done() directly, so it is no longer ever necessary to call both functions. dns_dispatch_cancel() is used to terminate dispatch connections that are still pending, while dns_dispatch_done() is used when they are complete.	2021-10-05 10:18:48 -07:00
Evan Hunt	5948aa7766	clarify fctx_cancelquery() behavior Cleaned up dereferencing of query objects, and added a comment explaining it better.	2021-10-05 10:18:48 -07:00
Evan Hunt	dc1203b426	resolver: Fixed shutdown processing - Prevent shutdown races: attach/detach to dns_resolver in dns_fetch_t and fctx_t; delay destruction of fctx when finds are still active; reference the fctx while canceling; reverse the order of fctx_destroy() and empty_bucket(). - Don't resend queries if fetches have been canceled. - It's possible for fctx_doshutdown() to run before a TCP connection has completed. if the query is not on the queries list, then it is not canceled, but the adbaddrinfo is freed. when tcp_connected() runs later, the query is in an inconstent state. to fix this, we add the query to queries before running dns_dispatch_connect(), instead of in the connect callback. - Combined the five fctx_cleanup* functions into a single one. - Added comments and changed some names to make this code easier to understand.	2021-10-02 11:43:59 -07:00
Evan Hunt	6ea7d59ad2	dispatch: Clean up connect and recv callbacks - disp_connected() has been split into two functions, udp_connected() (which takes 'resp' as an argument) and tcp_connected() (which takes 'disp', and calls the connect callbacks for all pending resps). - In dns_dispatch_connect(), if a connection is already open, we need to detach the dispentry immediately because we won't be running tcp_connected(). - dns_disptach_cancel() also now calls the connect callbacks for pending TCP responses, and the response callbacks for open TCP connections waiting on read. - If udp_connected() runs after dns_dispatch_cancel() has been called, ensure that the caller's connect callback is run. - If a UDP connection fails with EADDRINUSE, we try again up to five times with a different local port number before giving up. - If a TCP connection is canceled while still pending connection, the connect timeout may still fire. we attach the dispatch before connecting to ensure that it won't be detached too soon in this case. - The dispentry is no longer removed from the pending list when deactivating, so that the connect callback can still be run if dns_dispatch_removeresponse() was run while the connecting was pending. - Rewrote dns_dispatch_gettcp() to avoid a data race. - startrecv() and dispatch_getnext() can be called with a NULL resp when using TCP. - Refactored udp_recv() and tcp_recv() and added result logging. - EOF is now treated the same as CANCELED in response callbacks. - ISC_R_SHUTTINGDOWN is sent to the reponse callbacks for all resps if tcp_recv() is triggered by a netmgr shutdown. (response callbacks are not sent by udp_recv() in this case.)	2021-10-02 11:39:56 -07:00
Evan Hunt	8551ad026f	dispatch: Fix several connect-related issues - startrecv() and getnext() have been rewritten. - Don't set TCP flag when connecting a UDP dispatch. - Prevent TCP connections from trying to connect twice. - dns_dispatch_gettcp() can now find a matching TCP dispatch that has not yet fully connected, and attach to it. when the connection is completed, the connect callbacks are run for all of the pending entries. - An atomic 'state' variable is now used for connection state instead of attributes. - When dns_dispatch_cancel() is called on a TCP dispatch entry, only that one entry is canceled. the dispatch itself should not be shut down until there are no dispatch entries left associated with it. - Other incidental cleanup, including removing DNS_DISPATCHATTR_IPV4 and _IPV6 (they were being set in the dispatch attributes but never used), cleaning up dns_requestmgr_create(), and renaming dns_dispatch_read() to the more descriptive dns_dispatch_resume().	2021-10-02 11:39:56 -07:00
Ondřej Surý	e317386090	dispatch: Remove 'timeout' callback - It is no longer necessary to pass a 'timeout' callback to dns_dispatch_addresponse(); timeouts are handled directly by the 'response' callback instead. - The netmgr handle is no longer passed to dispatch callbacks, since they don't (and can't) use it. instead, dispatch_cb_t now takes a result code, region, and argument. - Cleaned up timeout-related tests in dispatch_test.c	2021-10-02 11:39:56 -07:00
Evan Hunt	8b532d2e64	dispatch: Refactor to eliminate dns_dispatchevent - Responses received by the dispatch are no longer sent to the caller via a task event, but via a netmgr-style recv callback. the 'action' parameter to dns_dispatch_addresponse() is now called 'response' and is called directly from udp_recv() or tcp_recv() when a valid response has been received. - All references to isc_task and isc_taskmgr have been removed from dispatch functions. - All references to dns_dispatchevent_t have been removed and the type has been deleted. - Added a task to the resolver response context, to be used for fctx events. - When the caller cancels an operation, the response handler will be called with ISC_R_CANCELED; it can abort immediately since the caller will presumably have taken care of cleanup already. - Cleaned up attach/detach in resquery and request.	2021-10-02 11:39:56 -07:00

... 2 3 4 5 6 ...

994 Commits