mir/bind - bind - Mike's Git repositories

mir/bind

mirror of https://gitlab.isc.org/isc-projects/bind9 synced 2025-08-29 05:28:00 +00:00

Author	SHA1	Message	Date
Ondřej Surý	c605d75ea5	Use -release instead of -version-info for internal library SONAMEs The BIND 9 libraries are considered to be internal only and hence the API and ABI changes a lot. Keeping track of the API/ABI changes takes time and it's a complicated matter as the safest way to make everything stable would be to bump any library in the dependency chain as in theory if libns links with libdns, and a binary links with both, and we bump the libdns SOVERSION, but not the libns SOVERSION, the old libns might be loaded by binary pulling old libdns together with new libdns loaded by the binary. The situation gets even more complicated with loading the plugins that have been compiled with few versions old BIND 9 libraries and then dynamically loaded into the named. We are picking the safest option possible and usable for internal libraries - instead of using -version-info that has only a weak link to BIND 9 version number, we are using -release libtool option that will embed the corresponding BIND 9 version number into the library name. That means that instead of libisc.so.1701 (as an example) the library will now be named libisc-9.17.10.so.	2021-01-25 14:19:53 +01:00
Ondřej Surý	e493e04c0f	Refactor TLSDNS module to work with libuv/ssl directly * Following the example set in 634bdfb16d8, the tlsdns netmgr module now uses libuv and SSL primitives directly, rather than opening a TLS socket which opens a TCP socket, as the previous model was difficult to debug. Closes #2335. * Remove the netmgr tls layer (we will have to re-add it for DoH) * Add isc_tls API to wrap the OpenSSL SSL_CTX object into libisc library; move the OpenSSL initialization/deinitialization from dstapi needed for OpenSSL 1.0.x to the isc_tls_{initialize,destroy}() * Add couple of new shims needed for OpenSSL 1.0.x * When LibreSSL is used, require at least version 2.7.0 that has the best OpenSSL 1.1.x compatibility and auto init/deinit * Enforce OpenSSL 1.1.x usage on Windows * Added a TLSDNS unit test and implemented a simple TLSDNS echo server and client.	2021-01-25 09:19:22 +01:00
Michał Kępień	347d666b0f	Update library API versions	2021-01-21 08:57:22 +01:00
Matthijs Mekking	87744f218d	Remove a lot of obsoleted options These options were ancient or made obsolete a long time ago, it is safe to remove them. Also stop printing ancient options, they should be treated the same as unknown options. Removed options: lwres, geoip-use-ecs, sit-secret, use-ixfr, acache-cleaning-interval, acache-enable, additional-from-auth, additional-from-cache, allow-v6-synthesis, dnssec-enable, max-acache-size, nosit-udp-size, queryport-pool-ports, queryport-pool-updateinterval, request-sit, use-queryport-pool, and support-ixfr.	2021-01-19 10:12:40 +01:00
JINMEI Tatuya	2ed12d90c9	more s/recurse/async/ for consistency (no behavior change)	2021-01-06 13:14:13 -08:00
JINMEI Tatuya	d520f01c7b	detach fetchhandle before resume query processing otherwise, another hook async event or DNS recursion would trigger an assertion failure.	2021-01-06 13:14:13 -08:00
Michał Kępień	2c44266a5a	Update library API versions	2020-12-16 22:05:50 +01:00
Ondřej Surý	ef685bab5c	Print warning when falling back to increment soa serial method When using the `unixtime` or `date` method to update the SOA serial, `named` and `dnssec-signzone` would silently fallback to `increment` method to prevent the new serial number to be smaller than the old serial number (using the serial number arithmetics). Add a warning message when such fallback happens.	2020-12-11 10:48:28 +01:00
Ondřej Surý	7ba18870dc	Reformat sources using clang-format-11	2020-12-08 18:36:23 +01:00
Ondřej Surý	634bdfb16d	Refactor netmgr and add more unit tests This is a part of the works that intends to make the netmgr stable, testable, maintainable and tested. It contains a numerous changes to the netmgr code and unfortunately, it was not possible to split this into smaller chunks as the work here needs to be committed as a complete works. NOTE: There's a quite a lot of duplicated code between udp.c, tcp.c and tcpdns.c and it should be a subject to refactoring in the future. The changes that are included in this commit are listed here (extensively, but not exclusively): * The netmgr_test unit test was split into individual tests (udp_test, tcp_test, tcpdns_test and newly added tcp_quota_test) * The udp_test and tcp_test has been extended to allow programatic failures from the libuv API. Unfortunately, we can't use cmocka mock() and will_return(), so we emulate the behaviour with #define and including the netmgr/{udp,tcp}.c source file directly. * The netievents that we put on the nm queue have variable number of members, out of these the isc_nmsocket_t and isc_nmhandle_t always needs to be attached before enqueueing the netievent_<foo> and detached after we have called the isc_nm_async_<foo> to ensure that the socket (handle) doesn't disappear between scheduling the event and actually executing the event. * Cancelling the in-flight TCP connection using libuv requires to call uv_close() on the original uv_tcp_t handle which just breaks too many assumptions we have in the netmgr code. Instead of using uv_timer for TCP connection timeouts, we use platform specific socket option. * Fix the synchronization between {nm,async}_{listentcp,tcpconnect} When isc_nm_listentcp() or isc_nm_tcpconnect() is called it was waiting for socket to either end up with error (that path was fine) or to be listening or connected using condition variable and mutex. Several things could happen: 0. everything is ok 1. the waiting thread would miss the SIGNAL() - because the enqueued event would be processed faster than we could start WAIT()ing. In case the operation would end up with error, it would be ok, as the error variable would be unchanged. 2. the waiting thread miss the sock->{connected,listening} = `true` would be set to `false` in the tcp_{listen,connect}close_cb() as the connection would be so short lived that the socket would be closed before we could even start WAIT()ing * The tcpdns has been converted to using libuv directly. Previously, the tcpdns protocol used tcp protocol from netmgr, this proved to be very complicated to understand, fix and make changes to. The new tcpdns protocol is modeled in a similar way how tcp netmgr protocol. Closes: #2194, #2283, #2318, #2266, #2034, #1920 * The tcp and tcpdns is now not using isc_uv_import/isc_uv_export to pass accepted TCP sockets between netthreads, but instead (similar to UDP) uses per netthread uv_loop listener. This greatly reduces the complexity as the socket is always run in the associated nm and uv loops, and we are also not touching the libuv internals. There's an unfortunate side effect though, the new code requires support for load-balanced sockets from the operating system for both UDP and TCP (see #2137). If the operating system doesn't support the load balanced sockets (either SO_REUSEPORT on Linux or SO_REUSEPORT_LB on FreeBSD 12+), the number of netthreads is limited to 1. * The netmgr has now two debugging #ifdefs: 1. Already existing NETMGR_TRACE prints any dangling nmsockets and nmhandles before triggering assertion failure. This options would reduce performance when enabled, but in theory, it could be enabled on low-performance systems. 2. New NETMGR_TRACE_VERBOSE option has been added that enables extensive netmgr logging that allows the software engineer to precisely track any attach/detach operations on the nmsockets and nmhandles. This is not suitable for any kind of production machine, only for debugging. * The tlsdns netmgr protocol has been split from the tcpdns and it still uses the old method of stacking the netmgr boxes on top of each other. We will have to refactor the tlsdns netmgr protocol to use the same approach - build the stack using only libuv and openssl. * Limit but not assert the tcp buffer size in tcp_alloc_cb Closes: #2061	2020-12-01 16:47:07 +01:00
Diego Fronza	95add01643	Silence coverity warnings in query.c Return value of dns_db_getservestalerefresh() and dns_db_getservestalettl() functions were previously unhandled. This commit purposefully ignore those return values since there is no side effect if those results are != ISC_R_SUCCESS, it also supress Coverity warnings.	2020-11-26 14:55:14 +00:00
Michał Kępień	2bb0a5dcdb	Update library API versions	2020-11-26 12:12:17 +01:00
JINMEI Tatuya	75cdd758ed	implementation of hook-based asynchronous functionality previously query plugins were strictly synchrounous - the query process would be interrupted at some point, data would be looked up or a change would be made, and then the query processing would resume immediately. this commit enables query plugins to initiate asynchronous processes and resume on a completion event, as with recursion.	2020-11-24 15:11:39 -08:00
JINMEI Tatuya	9c8dae041d	ns_query refactoring for hook-based recursion several small changes to query processing to make it easier to use hook-based recursion (and other asynchronous functionlity) later. - recursion quota check is now a separate function, check_recursionquota(), which is called by ns_query_recurse(). - pass isc_result to query_nxdomain() instead of bool. the value of 'empty_wild' will be determined in the function based on the passed result. this is similar to query_nodata(), and makes the signatures of the two functions more consistent. - pass the current 'result' value into plugin hooks.	2020-11-24 15:11:39 -08:00
Mark Andrews	e980affba0	Fix DNAME when QTYPE is CNAME or ANY The synthesised CNAME is not supposed to be followed when the QTYPE is CNAME or ANY as the lookup is satisfied by the CNAME record.	2020-11-19 10:18:01 +11:00
Diego Fronza	4827ad0ec4	Add stale-refresh-time option Before this update, BIND would attempt to do a full recursive resolution process for each query received if the requested rrset had its ttl expired. If the resolution fails for any reason, only then BIND would check for stale rrset in cache (if 'stale-cache-enable' and 'stale-answer-enable' is on). The problem with this approach is that if an authoritative server is unreachable or is failing to respond, it is very unlikely that the problem will be fixed in the next seconds. A better approach to improve performance in those cases, is to mark the moment in which a resolution failed, and if new queries arrive for that same rrset, try to respond directly from the stale cache, and do that for a window of time configured via 'stale-refresh-time'. Only when this interval expires we then try to do a normal refresh of the rrset. The logic behind this commit is as following: - In query.c / query_gotanswer(), if the test of 'result' variable falls to the default case, an error is assumed to have happened, and a call to 'query_usestale()' is made to check if serving of stale rrset is enabled in configuration. - If serving of stale answers is enabled, a flag will be turned on in the query context to look for stale records: query.c:6839 qctx->client->query.dboptions \|= DNS_DBFIND_STALEOK; - A call to query_lookup() will be made again, inside it a call to 'dns_db_findext()' is made, which in turn will invoke rbdb.c / cache_find(). - In rbtdb.c / cache_find() the important bits of this change is the call to 'check_stale_header()', which is a function that yields true if we should skip the stale entry, or false if we should consider it. - In check_stale_header() we now check if the DNS_DBFIND_STALEOK option is set, if that is the case we know that this new search for stale records was made due to a failure in a normal resolution, so we keep track of the time in which the failured occured in rbtdb.c:4559: header->last_refresh_fail_ts = search->now; - In check_stale_header(), if DNS_DBFIND_STALEOK is not set, then we know this is a normal lookup, if the record is stale and the query time is between last failure time + stale-refresh-time window, then we return false so cache_find() knows it can consider this stale rrset entry to return as a response. The last additions are two new methods to the database interface: - setservestale_refresh - getservestale_refresh Those were added so rbtdb can be aware of the value set in configuration option, since in that level we have no access to the view object.	2020-11-11 12:53:23 -03:00
Matthijs Mekking	b7856d2675	Cleanup duplicate definitions in query.h	2020-11-10 14:42:47 +00:00
Witold Kręcicki	2cfc8a45a4	Shutdown interface if we can't listen on it to avoid shutdown hang	2020-11-10 14:17:09 +01:00
Witold Kręcicki	f68fe9ff14	Fix a startup/shutdown crash in ns_clientmgr_create	2020-11-10 14:17:05 +01:00
Witold Kręcicki	38b78f59a0	Add DoT support to bind Parse the configuration of tls objects into SSL_CTX* objects. Listen on DoT if 'tls' option is setup in listen-on directive. Use DoT/DoH ports for DoT/DoH.	2020-11-10 14:16:55 +01:00
Mark Andrews	b09727a765	Implement DNSTAP support in ns_client_sendraw() ns_client_sendraw() is currently only used to relay UPDATE responses back to the client. dns_dt_send() is called with this assumption.	2020-11-10 06:15:46 +00:00
Ondřej Surý	127ba7e930	Add libssl libraries to Windows build This commit extends the perl Configure script to also check for libssl in addition to libcrypto and change the vcxproj source files to link with both libcrypto and libssl.	2020-11-09 16:00:28 +01:00
Evan Hunt	49d53a4aa9	use netmgr for xfrin Use isc_nm_tcpdnsconnect() in xfrin.c for zone transfers.	2020-11-09 13:45:43 +01:00
Ondřej Surý	37b9511ce1	Use libuv's shared library handling capabilities While libltdl is a feature-rich library, BIND 9 code only uses its basic capabilities, which are also provided by libuv and which BIND 9 already uses for other purposes. As libuv's cross-platform shared library handling interface is modeled after the POSIX dlopen() interface, converting code using the latter to the former is simple. Replace libltdl function calls with their libuv counterparts, refactoring the code as necessary. Remove all use of libltdl from the BIND 9 source tree.	2020-10-28 15:48:58 +01:00
Ondřej Surý	e2436159ab	Refactor the cleanup code in lt_dl code The cleanup code that would clean the object after plugin/dlz/dyndb loading has failed was duplicating the destructor for the object, so instead of the extra code, we just use the destructor instead.	2020-10-28 15:48:58 +01:00
Ondřej Surý	0f49b02fc5	Remove redundant lt_dlerror() calls The redundant lt_dlerror() calls were taken from the examples to clean any previous errors from lt_dl...() calls. However upon code inspection, it was discovered there are no such paths that could cause the lt_dlerror() to return spurious error messages.	2020-10-28 15:48:58 +01:00
Ondřej Surý	f7c82e406e	Fix the isc_nm_closedown() to actually close the pending connections 1. The isc__nm_tcp_send() and isc__nm_tcp_read() was not checking whether the socket was still alive and scheduling reads/sends on closed socket. 2. The isc_nm_read(), isc_nm_send() and isc_nm_resumeread() have been changed to always return the error conditions via the callbacks, so they always succeed. This applies to all protocols (UDP, TCP and TCPDNS).	2020-10-22 11:37:16 -07:00
Michał Kępień	9014ff0cc6	Update library API versions	2020-10-22 08:54:32 +02:00
Diego Fronza	d727eaae6c	Always return address records in additional section for NS queries	2020-10-21 12:03:42 -03:00
Ondřej Surý	bb990030d3	Simplify the EDNS buffer size logic for DNS Flag Day 2020 The DNS Flag Day 2020 aims to remove the IP fragmentation problem from the UDP DNS communication. In this commit, we implement the required changes and simplify the logic for picking the EDNS Buffer Size. 1. The defaults for `edns-udp-size`, `max-udp-size` and `nocookie-udp-size` have been changed to `1232` (the value picked by DNS Flag Day 2020). 2. The probing heuristics that would try 512->4096->1432->1232 buffer sizes has been removed and the resolver will always use just the `edns-udp-size` value. 3. Instead of just disabling the PMTUD mechanism on the UDP sockets, we now set IP_DONTFRAG (IPV6_DONTFRAG) flag. That means that the UDP packets won't get ever fragmented. If the ICMP packets are lost the UDP will just timeout and eventually be retried over TCP.	2020-10-05 16:21:21 +02:00
Ondřej Surý	33eefe9f85	The dns_message_create() cannot fail, change the return to void The dns_message_create() function cannot soft fail (as all memory allocations either succeed or cause abort), so we change the function to return void and cleanup the calls.	2020-09-29 08:22:08 +02:00
Diego Fronza	12d6d13100	Refactored dns_message_t for using attach/detach semantics This commit will be used as a base for the next code updates in order to have a better control of dns_message_t objects' lifetime.	2020-09-29 08:22:08 +02:00
Michał Kępień	b60d7345ed	Fix function overrides in unit tests on macOS Since Mac OS X 10.1, Mach-O object files are by default built with a so-called two-level namespace which prevents symbol lookups in BIND unit tests that attempt to override the implementations of certain library functions from working as intended. This feature can be disabled by passing the "-flat_namespace" flag to the linker. Fix unit tests affected by this issue on macOS by adding "-flat_namespace" to LDFLAGS used for building all object files on that operating system (it is not enough to only set that flag for the unit test executables).	2020-09-28 09:09:21 +02:00
Michał Kępień	8bdba2edeb	Drop function wrapping as it is redundant for now As currently used in the BIND source tree, the --wrap linker option is redundant because: - static builds are no longer supported, - there is no need to wrap around existing functions - what is actually required (at least for now) is to replace them altogether in unit tests, - only functions exposed by shared libraries linked into unit test binaries are currently being replaced. Given the above, providing the alternative implementations of functions to be overridden in lib/ns/tests/nstest.c is a much simpler alternative to using the --wrap linker option. Drop the code detecting support for the latter from configure.ac, simplify the relevant Makefile.am, and remove lib/ns/tests/wrap.c, updating lib/ns/tests/nstest.c accordingly (it is harmless for unit tests which are not calling the overridden functions).	2020-09-28 09:09:21 +02:00
Evan Hunt	86eddebc83	Purge memory pool upon plugin destruction The typical sequence of events for AAAA queries which trigger recursion for an A RRset at the same name is as follows: 1. Original query context is created. 2. An AAAA RRset is found in cache. 3. Client-specific data is allocated from the filter-aaaa memory pool. 4. Recursion is triggered for an A RRset. 5. Original query context is torn down. 6. Recursion for an A RRset completes. 7. A second query context is created. 8. Client-specific data is retrieved from the filter-aaaa memory pool. 9. The response to be sent is processed according to configuration. 10. The response is sent. 11. Client-specific data is returned to the filter-aaaa memory pool. 12. The second query context is torn down. However, steps 6-12 are not executed if recursion for an A RRset is canceled. Thus, if named is in the process of recursing for A RRsets when a shutdown is requested, the filter-aaaa memory pool will have outstanding allocations which will never get released. This in turn leads to a crash since every memory pool must not have any outstanding allocations by the time isc_mempool_destroy() is called. Fix by creating a stub query context whenever fetch_callback() is called, including cancellation events. When the qctx is destroyed, it will ensure the client is detached and the plugin memory is freed.	2020-09-25 13:32:34 -07:00
Mark Andrews	f0d9bf7c30	Clone the saved / query message buffers The message buffer passed to ns__client_request is only valid for the life of the the ns__client_request call. Save a copy of it when we recurse or process a update as ns__client_request will return before those operations complete.	2020-09-23 10:37:42 +10:00
Ondřej Surý	d4976e0ebe	Add separate prefetch nmhandle to ns_client_t As the query_prefetch() or query_rpzfetch() could be called during "regular" fetch, we need to introduce separate storage for attaching the nmhandle during prefetching the records. The query_prefetch() and query_rpzfetch() are guarded for re-entrance by .query.prefetch member of ns_client_t, so we can reuse the same .prefetchhandle for both.	2020-09-22 09:56:26 +02:00
Evan Hunt	dcee985b7f	update all copyright headers to eliminate the typo	2020-09-14 16:20:40 -07:00
Mark Andrews	baf165ffd0	clear pointer before subtracting reference	2020-09-14 11:02:33 +10:00
Evan Hunt	57b4dde974	change from isc_nmhandle_ref/unref to isc_nmhandle attach/detach Attaching and detaching handle pointers will make it easier to determine where and why reference counting errors have occurred. A handle needs to be referenced more than once when multiple asynchronous operations are in flight, so callers must now maintain multiple handle pointers for each pending operation. For example, ns_client objects now contain: - reqhandle: held while waiting for a request callback (query, notify, update) - sendhandle: held while waiting for a send callback - fetchhandle: held while waiting for a recursive fetch to complete - updatehandle: held while waiting for an update-forwarding task to complete control channel connection objects now contain: - readhandle: held while waiting for a read callback - sendhandle: held while waiting for a send callback - cmdhandle: held while an rndc command is running httpd connections contain: - readhandle: held while waiting for a read callback - sendhandle: held while waiting for a send callback	2020-09-11 12:17:57 -07:00
Michał Kępień	b096a038e3	Update library API versions	2020-08-06 09:10:06 +02:00
Mark Andrews	20488d6ad3	Map DNS_R_BADTSIG to FORMERR Now that the log message has been printed set the result code to DNS_R_FORMERR. We don't do this via dns_result_torcode() as we don't want upstream errors to produce FORMERR if that processing end with DNS_R_BADTSIG.	2020-08-04 12:20:37 +00:00
Michał Kępień	97a2733ef9	Update library API versions	2020-07-15 22:54:13 +02:00
Diego Fronza	aab691d512	Fix ns_statscounter_recursclients underflow The basic scenario for the problem was that in the process of resolving a query, if any rrset was eligible for prefetching, then it would trigger a call to query_prefetch(), this call would run in parallel to the normal query processing. The problem arises due to the fact that both query_prefetch(), and, in the original thread, a call to ns_query_recurse(), try to attach to the recursionquota, but recursing client stats counter is only incremented if ns_query_recurse() attachs to it first. Conversely, if fetch_callback() is called before prefetch_done(), it would not only detach from recursionquota, but also decrement the stats counter, if query_prefetch() attached to te quota first that would result in a decrement not matched by an increment, as expected. To solve this issue an atomic bool was added, it is set once in ns_query_recurse(), allowing fetch_callback() to check for it and decrement stats accordingly. For a more compreensive explanation check the thread comment below: https://gitlab.isc.org/isc-projects/bind9/-/issues/1719#note_145857	2020-07-13 11:46:18 -03:00
Evan Hunt	23c7373d68	restore "blackhole" functionality the blackhole ACL was accidentally disabled with respect to client queries during the netmgr conversion. in order to make this work for TCP, it was necessary to add a return code to the accept callback functions passed to isc_nm_listentcp() and isc_nm_listentcpdns().	2020-06-30 17:29:09 -07:00
Mark Andrews	abe2c84b1d	Suppress cppcheck warnings: cppcheck-suppress objectIndex cppcheck-suppress nullPointerRedundantCheck	2020-06-25 12:04:36 +10:00
Evan Hunt	75c985c07f	change the signature of recv callbacks to include a result code this will allow recv event handlers to distinguish between cases in which the region is NULL because of error, shutdown, or cancelation.	2020-06-19 12:33:26 -07:00
Evan Hunt	9e740cad21	make isc_nmsocket_{attach,detach}{} functions private there is no need for a caller to reference-count socket objects. they need tto be able tto close listener sockets (i.e., those returned by isc_nm_listen{udp,tcp,tcpdns}), and an isc_nmsocket_close() function has been added for that. other sockets are only accessed via handles.	2020-06-19 09:39:50 +02:00
Michał Kępień	a8bc003d1b	Update library API versions	2020-06-18 10:03:05 +02:00
Mark Andrews	924e141a15	Adjust NS_CLIENT_TCP_BUFFER_SIZE and cleanup client_allocsendbuf NS_CLIENT_TCP_BUFFER_SIZE was 2 byte too large following the move to netmgr add associated changes to lib/ns/client.c and as a result an INSIST could be trigger if the DNS message being constructed had a checkpoint stage that fell in those two extra bytes. Adjusted NS_CLIENT_TCP_BUFFER_SIZE and cleaned up client_allocsendbuf now that the previously reserved 2 bytes are no longer used.	2020-06-18 09:59:19 +02:00

... 3 4 5 6 7 ...

626 Commits