mir/bind - bind - Mike's Git repositories

mirror of https://gitlab.isc.org/isc-projects/bind9 synced 2025-08-28 21:17:54 +00:00

Author	SHA1	Message	Date
Mark Andrews	8ec16c378d	Check NSEC3 iterations with dnssec-signzone	2021-04-29 17:18:26 +10:00
Michał Kępień	4a8d404876	Limit logging for verbose system tests The system test framework starts all named instances with the "-d 99" command line option (unless it is overridden by a named.args file in a given instance's working directory). This causes a lot of log messages to be written to named.run files - currently over 5 million lines for a single test suite run. While debugging information preserved in the log files is essential for troubleshooting intermittent test failures, some system tests involve sending hundreds or even thousands of queries, which causes the relevant log files to explode in size. When multiple tests (or even multiple test suites) are run in parallel, excessive logging contributes considerably to the I/O load on the test host, increasing the odds of intermittent test failures getting triggered. Decrease the debug level for the seven most verbose named instances: - use "-d 3" for ns2 in the "cacheclean" system test (it is the lowest logging level at which the test still passes without the need to apply any changes to tests.sh), - use "-d 1" for the other six named instances. This roughly halves the number of lines logged by each test suite run while still leaving enough information in the logs to allow at least basic troubleshooting in case of test failures. This approach was chosen as it results in a greater decrease in the number of lines logged than running all named instances with "-d 3", without causing any test failures.	2021-04-28 07:56:47 +02:00
Diego Fronza	d6224035d8	Add system test for the deadlock fix The test spawns 4 parallel workers that keep adding, modifying and deleting zones, while the main thread repeatedly checks whether rndc status responds within a reasonable period. While environment and timing issues may affect the test, in most test cases the deadlock that was taking place before the fix used to trigger in less than 7 seconds on a machine with at least 2 cores.	2021-04-22 15:45:55 +00:00
Petr Špaček	1746d2e84a	Add tests for the "tkey-gssapi-credential" option Four named instances in the "nsupdate" system test have GSS-TSIG support enabled. All of them currently use "tkey-gssapi-keytab". Configure two of them with "tkey-gssapi-credential" to test that option. As "tkey-gssapi-keytab" and "tkey-gssapi-credential" both provide the same functionality, no test modifications are required. The difference between the two options is that the value of "tkey-gssapi-keytab" is an explicit path to the keytab file to acquire credentials from, while the value of "tkey-gssapi-credential" is the name of the principal whose credentials should be used; those credentials are looked up in the keytab file expected by the Kerberos library, i.e. /etc/krb5.keytab by default. The path to the default keytab file can be overridden by setting the KRB5_KTNAME environment variable. Utilize that variable to use existing keytab files with the "tkey-gssapi-credential" option. The KRB5_KTNAME environment variable should not interfere with the "tkey-gssapi-keytab" option. Nevertheless, rename one of the keytab files used with "tkey-gssapi-keytab" to something other than the contents of the KRB5_KTNAME environment variable in order to make sure that both "tkey-gssapi-keytab" and "tkey-gssapi-credential" are actually tested.	2021-04-22 16:15:22 +02:00
Ondřej Surý	b540722bc3	Refactor taskmgr to run on top of netmgr This commit changes the taskmgr to run the individual tasks on the netmgr internal workers. While an effort has been put into keeping the taskmgr interface intact, a couple of changes have been made: * The taskmgr has no concept of universal privileged mode - rather the tasks are either privileged or unprivileged (normal). The privileged tasks are run as a first thing when the netmgr is unpaused. There are now four different queues in the netmgr: 1. priority queue - netievents on the priority queue are run even when the taskmgr enters exclusive mode and netmgr is paused. This is needed to properly start listening on the interfaces, free resources and resume. 2. privileged task queue - only privileged tasks are queued here and this is the first queue that gets processed when network manager is unpaused using isc_nm_resume(). All netmgr workers need to clean the privileged task queue before they all proceed to normal operation. Both task queues are processed when the workers are finished. 3. task queue - only (traditional) tasks are scheduled here and this queue along with privileged task queues are processed when the netmgr workers are finishing. This is needed to process the task shutdown events. 4. normal queue - this is the queue with netmgr events, e.g. reading, sending, callbacks and pretty much everything is processed here. * The isc_taskmgr_create() now requires initialized netmgr (isc_nm_t) object. * The isc_nm_destroy() function now waits for indefinite time, but it will print out the active objects when in tracing mode (-DNETMGR_TRACE=1 and -DNETMGR_TRACE_VERBOSE=1), the netmgr has been made a little bit more asynchronous and it might take longer time to shutdown all the active networking connections. * Previously, the isc_nm_stoplistening() was a synchronous operation. This has been changed and the isc_nm_stoplistening() just schedules the child sockets to stop listening and exits. This was needed to prevent a deadlock as the (traditional) tasks are now executed on the netmgr threads. * The socket selection logic in isc__nm_udp_send() was flawed, but fortunately, it was broken, so we never hit the problem where we created uvreq_t on a socket from nmhandle_t, but then a different socket could be picked up and then we were trying to run the send callback on a socket that had different threadid than currently running.	2021-04-20 23:22:28 +02:00
Evan Hunt	d0ec7d1f33	move samples/resolve.c to bin/tests/system "resolve" is used by the resolver system tests, and I'm not certain whether delv exercises the same code, so rather than remove it, I moved it to bin/tests/system.	2021-04-16 14:29:43 +02:00
Evan Hunt	056afe7bdc	remove sample-async sample code for export libraries is no longer needed and this code is not used for any internal tests. also, sample-gai.c had already been removed but there were some dangling references.	2021-04-16 14:29:43 +02:00
Evan Hunt	1beb05f3e2	remove dns_client_request() and related code continues the cleanup of dns_client started in the previous commit.	2021-04-16 14:29:43 +02:00
Evan Hunt	fb2a352e7c	remove dns_client_update() and related code the libdns client API is no longer being maintained for external use, we can remove the code that isn't being used internally, as well as the related tests.	2021-04-16 14:29:43 +02:00
Ondřej Surý	202b1d372d	Merge the tls_test.c into netmgr_test.c and extend the tests suite This commit merges TLS tests into the common Network Manager unit tests suite and extends the unit test framework to include support for additional "ping-pong" style tests where all data could be sent via fewer connections (the behaviour of the old test suite). The tests for TCP and TLS were extended to make use of the new mode, as this mode better translates to how the code is used in DoH. Both TLS and TCP tests now share most of the unit tests' code, as they are expected to function similarly from a user's perspective anyway. In addition to the above, the TLS test suite was extended to include TLS tests using the connections quota facility.	2021-04-15 15:49:36 +03:00
Michal Nowak	cd0a34df1b	Move fromhex.pl script to bin/tests/system/ The fromhex.pl script needs to be copied from the source directory to the build directory before any test is run, otherwise the out-of-tree build fails to find it. Given that the script is used only in system tests, move it to bin/tests/system/.	2021-04-08 11:04:26 +02:00
Artem Boldariev	11ed7aac5d	TLS code refactoring, fixes and unit-tests This commit fixes numerous stability issues with TLS transport code as well as adds unit tests for it.	2021-04-01 17:31:29 +03:00
Matthijs Mekking	923c2a07bf	Update copyrights for keymgr2kasp This MR introduces a new system test 'keymgr2kasp' to test migration to 'dnssec-policy'. It moves some existing tests from the 'kasp' system test to here. Also a common script 'kasp.sh', to be used in kasp specific tests, is introduced.	2021-03-22 09:50:05 +01:00
Michał Kępień	185a1a5643	Install man page for named-compilezone The named-checkzone tool can also be invoked as named-compilezone. Make sure a man page is installed for that alias. Move and rename the "man_named-checkzone" label to prevent a Sphinx duplicate label warning from being raised (see commit 84862e96c1fcff6e7c1ca31884e2fad921afa4f7 for more information).	2021-03-22 09:36:48 +01:00
Ondřej Surý	42e4e3b843	Improve reliability of the netmgr unit tests The netmgr unit tests were designed to push the system limits to maximum by sending as many queries as possible in the busy loop from multiple threads. This mostly works with UDP, but in the stateful protocol where establishing the connection takes more time, it failed quite often in the CI. On FreeBSD, this happened more often, because the socket() call would fail spuriously making the problem even worse. This commit does several things to improve reliability: * return value of isc_nm_<proto>connect() is always checked and retried when scheduling the connection fails * The busy while loop has been slowed down with usleep(1000); so the netmgr threads could schedule the work and get executed. * The isc_thread_yield() was replaced with usleep(1000); also to allow the other threads to do any work. * Instead of waiting on just one variable, we wait for multiple variables to reach the final value * We are wrapping the netmgr operations (connects, reads, writes, accepts) with reference counting and waiting for all the callbacks to be accounted for. This has two effects: a) the isc_nm_t is always clean of active sockets and handles when destroyed, so it will prevent the spurious INSIST(references == 1) from isc_nm_destroy() b) the unit test now ensures that all the callbacks are always called when they should be called, so any stuck test means that there was a missing callback call and it is always a real bug These changes allow us to remove the workaround that would not run certain tests on systems without port load-balancing.	2021-03-19 16:25:28 +01:00
treysis	6b2ea00621	Add filter-a plugin for IPv6-dominant environments (cherry picked from commit 78f6cd57e1cc166823415438fe2d19a324cf7a67)	2021-03-19 08:06:55 +01:00
Ondřej Surý	64cff61c02	Add TCP timeouts system test The system tests were missing a test that would test tcp-initial-timeout and tcp-idle-timeout. This commit adds new "timeouts" system test that adds: * Test that waits longer than tcp-initial-timeout and then checks whether the socket was closed * Test that sends and receives DNS message then waits longer than tcp-initial-timeout but shorter time than tcp-idle-timeout then sends DNS message again then waits longer than tcp-idle-timeout and checks whether the socket was closed * Similar test, but bursting 25 DNS messages then waiting longer than tcp-initial-timeout and shorter than tcp-idle-timeout then doing a second 25 DNS message burst * Check whether transfer longer than tcp-initial-timeout succeeds	2021-03-18 16:37:57 +01:00
Matthijs Mekking	ee0835d977	Fix a XoT crash The transport should also be detached when we skip a master, otherwise named will crash when sending a SOA query to the next master over TLS, because the transport must be NULL when we enter 'dns_view_gettransport'.	2021-03-16 10:11:12 +01:00
Evan Hunt	dbffb212ce	add basic DoH system tests - rename dot to doth, as it now covers both dot and doh. - merge xot into doth as it's closely related. - added long-lived key and cert files (expiring 2121). - add tests with https-get, https-post, http-plain, alternate endpoints, and both static and ephemeral TLS configuration. - incidentally fixed a memory leak in dig that occurred if +https was specified more than once.	2021-03-05 18:09:42 +02:00
Evan Hunt	a0aefa1de6	create 'journal' system test tests that version 1 journal files containing version 1 transaction headers are rolled forward correctly on server startup, then updated into version 2 journals. also checks journal file consistency and 'max-journal-size' behavior.	2021-03-03 17:54:47 -08:00
Ondřej Surý	a50f5d0cf5	Call isc__initialize()/isc__shutdown() from win32 DllMain Call the libisc isc__initialize() constructor and isc__shutdown() destructor from DllMain instead of having duplicate code between those and DllMain() code.	2021-03-01 14:24:57 +01:00
Ondřej Surý	cbbecfcc82	Add isc_trampoline API to have simple accounting around threads The current isc_hp API uses internal tid_v variable that gets incremented for each new thread using hazard pointers. This tid_v variable is then used as an index to global shared table with hazard pointers state. Since the tid_v is only incremented and never decremented the table could overflow very quickly if we create set of threads for short period of time, they finish the work and cease to exist. Then we create identical set of threads and so on and so on. This is not a problem for a normal `named` operation as the set of threads is stable, but the problematic place are the unit tests where we test network manager or other APIs (task, timer) that create threads. This commit adds a thin wrapper around any function called from isc_thread_create() that adds unique-but-reusable small digit thread id that can be used as index to f.e. hazard pointer tables. The trampoline wrapper ensures that the thread ids will be reused, so the highest thread_id number doesn't grow indefinitely when threads are created and destroyed and then created again. This fixes the hazard pointer table overflow on machines with many cores. [GL #2396]	2021-02-25 16:21:10 +01:00
Michal Nowak	079debaa10	Do not remove stderr from pict output Removing stderr from the pict tool serves no purpose and drops valuable information that we might use when debugging a failed pairwise CI job, such as: Input Error: A parameter names must be unique	2021-02-23 15:23:58 +01:00
Ondřej Surý	494d0da522	Use library constructor/destructor to initialize OpenSSL Instead of calling isc_tls_initialize()/isc_tls_destroy() explicitly use gcc/clang attributes on POSIX and DLLMain on Windows to initialize and shutdown OpenSSL library. This resolves the issue when isc_nm_create() / isc_nm_destroy() was called multiple times and it would call OpenSSL library destructors from isc_nm_destroy(). At the same time, since we now have introduced the ctor/dtor for libisc, this commit moves the isc_mem API initialization (the list of the contexts) and changes the isc_mem_checkdestroyed() to schedule the checking of memory context on library unload instead of executing the code immediately.	2021-02-18 19:33:54 +01:00
Ondřej Surý	d1448a4c2a	Move the <isc/readline.h> header to bin/dig/readline.h The <isc/readline.h> header provided a compatibility shim to use when other non-GNU readline libraries are in use. The two places where readline library is being used are nslookup and nsupdate, so the header file has been moved to bin/dig directory and it's directly included from bin/nsupdate. This also conceals any readline headers exposed from the libisc headers.	2021-02-16 01:04:46 +00:00
Michal Nowak	4295c82e45	Add --enable-option-checking=fatal to ./configure in CI The --enable-option-checking=fatal option prevents ./configure from proceeding when an unknown option is used in the ./configure step in CI. This change will avoid adding unsupported ./configure options or options with typo or typo in pairwise testing "# [pairwise: ...]" marker.	2021-02-12 13:56:38 +01:00
Matthijs Mekking	51827ddcd3	Update copyrights for [#1810 ]	2021-02-09 11:59:08 +00:00
Evan Hunt	aa9d51c494	tls and http configuration code was unnecessarily complex removed the isc_cfg_http_t and isc_cfg_tls_t structures and the functions that loaded and accessed them; this can be done using normal config parser functions.	2021-02-03 12:06:17 +01:00
Ondřej Surý	1cc24a2c8b	Unit-test fixes and manual page updates for DoH configuration This commit contains fixes to unit tests to make them work well on various platforms (in particular ones shipping old versions of OpenSSL) and for different configurations. It also updates the generated manpage to include DoH configuration options.	2021-02-03 12:06:17 +01:00
Artem Boldariev	08da09bc76	Initial support for DNS-over-HTTP(S) This commit completes the support for DNS-over-HTTP(S) built on top of nghttp2 and plugs it into the BIND. Support for both GET and POST requests is present, as required by RFC8484. Both encrypted (via TLS) and unencrypted HTTP/2 connections are supported. The latter are mostly there for debugging/troubleshooting purposes and for the means of encryption offloading to third-party software (as might be desirable in some environments to simplify TLS certificates management).	2021-02-03 12:06:17 +01:00
Witold Kręcicki	7a96081360	nghttp2-based HTTP layer in netmgr This commit includes work-in-progress implementation of DNS-over-HTTP(S). Server-side code remains mostly untested, and there is only support for POST requests.	2021-02-03 12:06:17 +01:00
Artem Boldariev	6b9a31989c	Resurrect old TLS code This commit resurrects the old TLS code from 8f73c70d23e26954165fd44ce5617a95f112bcff. It also includes numerous stability fixes and support for isc_nm_cancelread() for the TLS layer. The code was resurrected to be used for DoH.	2021-02-03 12:06:17 +01:00
Ondřej Surý	e488309da7	implement xfrin via XoT Add support for a "tls" key/value pair for zone primaries, referencing either a "tls" configuration statement or "ephemeral". If set to use TLS, zones will send SOA and AXFR/IXFR queries over a TLS channel.	2021-01-29 12:07:38 +01:00
Mark Andrews	b1ecab6383	Detect overly long CHANGES lines	2021-01-28 13:49:02 +11:00
Michal Nowak	a247f24dfa	Add README.md file to rsabigexponent system test This README.md describes why bigkey is needed.	2021-01-26 11:40:42 +01:00
Ondřej Surý	c605d75ea5	Use -release instead of -version-info for internal library SONAMEs The BIND 9 libraries are considered to be internal only and hence the API and ABI changes a lot. Keeping track of the API/ABI changes takes time and it's a complicated matter as the safest way to make everything stable would be to bump any library in the dependency chain as in theory if libns links with libdns, and a binary links with both, and we bump the libdns SOVERSION, but not the libns SOVERSION, the old libns might be loaded by binary pulling old libdns together with new libdns loaded by the binary. The situation gets even more complicated with loading the plugins that have been compiled with few versions old BIND 9 libraries and then dynamically loaded into the named. We are picking the safest option possible and usable for internal libraries - instead of using -version-info that has only a weak link to BIND 9 version number, we are using -release libtool option that will embed the corresponding BIND 9 version number into the library name. That means that instead of libisc.so.1701 (as an example) the library will now be named libisc-9.17.10.so.	2021-01-25 14:19:53 +01:00
Evan Hunt	f472390bc2	Add CHANGES note for #2335	2021-01-25 09:19:22 +01:00
Ondřej Surý	e493e04c0f	Refactor TLSDNS module to work with libuv/ssl directly * Following the example set in 634bdfb16d8, the tlsdns netmgr module now uses libuv and SSL primitives directly, rather than opening a TLS socket which opens a TCP socket, as the previous model was difficult to debug. Closes #2335. * Remove the netmgr tls layer (we will have to re-add it for DoH) * Add isc_tls API to wrap the OpenSSL SSL_CTX object into libisc library; move the OpenSSL initialization/deinitialization from dstapi needed for OpenSSL 1.0.x to the isc_tls_{initialize,destroy}() * Add couple of new shims needed for OpenSSL 1.0.x * When LibreSSL is used, require at least version 2.7.0 that has the best OpenSSL 1.1.x compatibility and auto init/deinit * Enforce OpenSSL 1.1.x usage on Windows * Added a TLSDNS unit test and implemented a simple TLSDNS echo server and client.	2021-01-25 09:19:22 +01:00
Matthijs Mekking	dc6de216af	Update copyrights for [#1086 ]	2021-01-19 10:12:40 +01:00
Michał Kępień	f96e6a1e1d	Add the ISC DNSSEC Guide as a BIND 9 ARM appendix Add the ISC DNSSEC Guide to the BIND 9 ARM in order to include the former in every BIND release.	2021-01-08 13:12:20 +01:00
Mark Andrews	faf9d8beba	update for 2021	2021-01-04 11:52:00 +11:00
Matthijs Mekking	f1a097964c	Add test for cpu affinity Add a test to check BIND 9 honors CPU affinity mask. This requires some changes to the start script, to construct the named command.	2020-12-23 09:16:26 +11:00
Ondřej Surý	cb30d9892d	Remove the requirement for the release notes to have copyright The release notes don't have to have a copyright header, it doesn't add any value there as the release notes are useless outside the project.	2020-12-09 10:38:05 +01:00
Ondřej Surý	151852f428	Fix datarace when UDP/TCP connect fails and we are in nmthread When we were in nmthread, the isc__nm_async_<proto>connect() function executes in the same thread as the isc__nm_<proto>connect() and on a failure, it would block indefinitely because the failure branch was setting sock->active to false before the condition around the wait had a chance to skip the WAIT(). This also fixes the zero system test being stuck on FreeBSD 11, so we re-enable the test in the commit.	2020-12-03 13:56:34 +01:00
Ondřej Surý	94afea9325	Don't use stack allocated buffer for uv_write() On FreeBSD, the stack is destroyed more aggressively than on Linux and that revealed a bug where we were allocating the 16-bit len for the TCPDNS message on the stack and the buffer got garbled before the uv_write() sendback was executed. Now, the len is part of the uvreq, so we can safely pass it to the uv_write() as the req gets destroyed after the sendcb is executed.	2020-12-03 08:58:16 +01:00
Ondřej Surý	0f57732d13	Skip the zero, xfer and ixfr tests on non-Linux platforms Due to the platform differences, on non-Linux platforms, the xfer and ixfr tests fail and the zero test gets stuck. This commit will get reverted when we add support for netmgr multi-threading.	2020-12-01 17:24:06 +01:00
Ondřej Surý	634bdfb16d	Refactor netmgr and add more unit tests This is a part of the work that intends to make the netmgr stable, testable, maintainable and tested. It contains numerous changes to the netmgr code and unfortunately, it was not possible to split this into smaller chunks as the work here needs to be committed as a complete work. NOTE: There's quite a lot of duplicated code between udp.c, tcp.c and tcpdns.c and it should be a subject to refactoring in the future. The changes that are included in this commit are listed here (extensively, but not exclusively): * The netmgr_test unit test was split into individual tests (udp_test, tcp_test, tcpdns_test and newly added tcp_quota_test) * The udp_test and tcp_test have been extended to allow programmatic failures from the libuv API. Unfortunately, we can't use cmocka mock() and will_return(), so we emulate the behaviour with #define and including the netmgr/{udp,tcp}.c source file directly. * The netievents that we put on the nm queue have variable number of members, out of these the isc_nmsocket_t and isc_nmhandle_t always need to be attached before enqueueing the netievent_<foo> and detached after we have called the isc_nm_async_<foo> to ensure that the socket (handle) doesn't disappear between scheduling the event and actually executing the event. * Cancelling the in-flight TCP connection using libuv requires to call uv_close() on the original uv_tcp_t handle which just breaks too many assumptions we have in the netmgr code. Instead of using uv_timer for TCP connection timeouts, we use platform specific socket option. * Fix the synchronization between {nm,async}_{listentcp,tcpconnect} When isc_nm_listentcp() or isc_nm_tcpconnect() is called it was waiting for socket to either end up with error (that path was fine) or to be listening or connected using condition variable and mutex. Several things could happen: 0. everything is ok 1. the waiting thread would miss the SIGNAL() - because the enqueued event would be processed faster than we could start WAIT()ing. In case the operation would end up with error, it would be ok, as the error variable would be unchanged. 2. the waiting thread would miss the sock->{connected,listening} = `true` as it would be set to `false` in the tcp_{listen,connect}close_cb() as the connection would be so short lived that the socket would be closed before we could even start WAIT()ing * The tcpdns has been converted to using libuv directly. Previously, the tcpdns protocol used tcp protocol from netmgr, this proved to be very complicated to understand, fix and make changes to. The new tcpdns protocol is modeled in a similar way to the tcp netmgr protocol. Closes: #2194, #2283, #2318, #2266, #2034, #1920 * The tcp and tcpdns are now not using isc_uv_import/isc_uv_export to pass accepted TCP sockets between netthreads, but instead (similar to UDP) use per netthread uv_loop listener. This greatly reduces the complexity as the socket is always run in the associated nm and uv loops, and we are also not touching the libuv internals. There's an unfortunate side effect though, the new code requires support for load-balanced sockets from the operating system for both UDP and TCP (see #2137). If the operating system doesn't support the load balanced sockets (either SO_REUSEPORT on Linux or SO_REUSEPORT_LB on FreeBSD 12+), the number of netthreads is limited to 1. * The netmgr has now two debugging #ifdefs: 1. Already existing NETMGR_TRACE prints any dangling nmsockets and nmhandles before triggering assertion failure. This option would reduce performance when enabled, but in theory, it could be enabled on low-performance systems. 2. New NETMGR_TRACE_VERBOSE option has been added that enables extensive netmgr logging that allows the software engineer to precisely track any attach/detach operations on the nmsockets and nmhandles. This is not suitable for any kind of production machine, only for debugging. * The tlsdns netmgr protocol has been split from the tcpdns and it still uses the old method of stacking the netmgr boxes on top of each other. We will have to refactor the tlsdns netmgr protocol to use the same approach - build the stack using only libuv and openssl. * Limit but not assert the tcp buffer size in tcp_alloc_cb Closes: #2061	2020-12-01 16:47:07 +01:00
Michal Nowak	9567cefd39	Drop bin/tests/headerdep_test.sh.in The bin/tests/headerdep_test.sh script has not been updated since it was first created and it cannot be used as-is with the current BIND source code. Better tools (e.g. "include-what-you-use") emerged since the script was committed back in 2000, so instead of trying to bring it up to date, remove it from the source repository.	2020-11-27 13:11:41 +01:00
Mark Andrews	bd9155590e	Check that missing cookies are handled	2020-11-26 20:48:46 +00:00
Michał Kępień	2011a86881	Set up release notes for BIND 9.17.8	2020-11-26 12:16:49 +01:00
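
The thread-id reuse described in commit cbbecfcc82 (isc_trampoline) can be illustrated with a small sketch. This is not BIND's C implementation: it is written in Python for brevity, and ThreadIdPool, trampoline and run_generations are hypothetical names invented here. It only shows the general idea that a wrapper which hands out the smallest free id and returns it when the thread exits keeps the highest id bounded by the number of concurrently running threads, however many threads are created over time.

```python
import heapq
import threading


class ThreadIdPool:
    """Hypothetical pool: hands out the smallest free id, takes it back on exit."""

    def __init__(self, max_ids):
        self._lock = threading.Lock()
        self._free = list(range(max_ids))  # a sorted list is a valid min-heap

    def acquire(self):
        with self._lock:
            if not self._free:
                raise RuntimeError("no free thread ids")
            return heapq.heappop(self._free)

    def release(self, tid):
        with self._lock:
            heapq.heappush(self._free, tid)

    def free_ids(self):
        with self._lock:
            return sorted(self._free)


def trampoline(pool, func, *args):
    """Run func with a pooled id as its first argument, then return the id."""
    tid = pool.acquire()
    try:
        return func(tid, *args)
    finally:
        pool.release(tid)


def run_generations(pool, generations, threads_per_generation):
    """Start several short-lived sets of threads; return the highest id seen."""
    seen = []
    seen_lock = threading.Lock()

    def work(tid):
        with seen_lock:
            seen.append(tid)

    for _ in range(generations):
        workers = [
            threading.Thread(target=trampoline, args=(pool, work))
            for _ in range(threads_per_generation)
        ]
        for worker in workers:
            worker.start()
        for worker in workers:
            worker.join()
    return max(seen)
```

With a counter that only increments, 50 generations of 4 threads would consume 200 table slots; with the pool above, the ids stay within the first 4 slots.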

2615 Commits
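
Commit 634bdfb16d mentions a waiting thread that could miss the SIGNAL() because the enqueued event was processed before the caller started WAIT()ing. The following is a generic sketch of the usual remedy for that class of bug, not the code from the commit: it is written in Python, and ConnectResult and demo are hypothetical names. The outcome is stored in a flag that is set and checked under the same lock as the condition variable, and that flag is never reset by a later close, so a completion that happens before the wait begins is still observed.

```python
import threading


class ConnectResult:
    """Hypothetical completion record guarded by a condition variable."""

    def __init__(self):
        self._cond = threading.Condition()
        self._done = False
        self._error = None

    def complete(self, error=None):
        # Called by the worker: record the outcome, then wake any waiter.
        with self._cond:
            self._done = True
            self._error = error
            self._cond.notify_all()

    def wait(self, timeout=None):
        # The predicate is evaluated under the lock before sleeping, so a
        # completion that happened before wait() was entered is not missed.
        with self._cond:
            if not self._cond.wait_for(lambda: self._done, timeout=timeout):
                raise TimeoutError("operation did not complete")
            return self._error


def demo():
    early = ConnectResult()
    early.complete()  # the worker finishes before the caller waits
    first = early.wait(timeout=1.0)

    late = ConnectResult()
    worker = threading.Timer(0.05, late.complete, kwargs={"error": "refused"})
    worker.start()  # the worker finishes while the caller is waiting
    second = late.wait(timeout=5.0)
    worker.join()
    return first, second
```

Waiting on a bare notification without such a predicate would hang in the first case, because the notification was delivered before anyone was listening.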