mir/bind - bind - Mike's Git repositories

mir/bind

mirror of https://gitlab.isc.org/isc-projects/bind9 synced 2025-08-31 14:35:26 +00:00

Author	SHA1	Message	Date
Ondřej Surý	b6b0d81a36	Cleanup the __tsan_acquire/__tsan_release With ThreadSanitizer support added to the Userspace RCU, we no longer need to wrap the call_rcu and caa_container_of with __tsan_{acquire,release} hints. Remove the direct calls to __tsan_{acquire,release} and the isc_urcu_{container,cleanup} macros.	2023-07-28 08:59:08 +02:00
Ondřej Surý	dc3e07572b	Workaround AddressSanitizer overzealous check The cds_lfht_for_each_entry and cds_lfht_for_each_entry_duplicate macros had a code that operated on the NULL pointer, at the end of the list it was calling caa_container_of() on the NULL pointer in the init-clause and iteration-expression, but the result wasn't actually used anywhere because the cond-expression in the for loop has prevented executing loop-statement. This made AddressSanitizer notice the invalid operation and rightfully complain. This was reported to the upstream and fixed there. Pull the upstream fix into our <isc/urcu.h> header, so our CI checks pass.	2023-07-27 15:21:39 +02:00
Ondřej Surý	5321c474ea	Refactor isc_stats_create() and its downstream users to return void The isc_stats_create() can no longer return anything else than ISC_R_SUCCESS. Refactor isc_stats_create() and its variants in libdns, libns and named to just return void.	2023-07-27 11:37:44 +02:00
Evan Hunt	e37d02905c	add isc_loop_now() to get consistent time isc_loop_now() is a front-end to uv_now(), returning the start time of the current event loop tick.	2023-07-19 15:32:21 +02:00
Evan Hunt	4db150437e	clean up unused dns_db methods to reduce the amount of common code that will need to be shared between the separated cache and zone database implementations, clean up unused portions of dns_db. the methods dns_db_dump(), dns_db_isdnssec(), dns_db_printnode(), dns_db_resigned(), dns_db_expirenode() and dns_db_overmem() were either never called or were only implemented as nonoperational stub functions: they have now been removed. dns_db_nodefullname() was only used in one place, which turned out to be unnecessary, so it has also been removed. dns_db_ispersistent() and dns_db_transfernode() are used, but only the default implementation in db.c was ever actually called. since they were never overridden by database methods, there's no need to retain methods for them. in rbtdb.c, beginload() and endload() methods are no longer defined for the cache database, because that was never used (except in a few unit tests which can easily be modified to use the zone implementation instead). issecure() is also no longer defined for the cache database, as the cache is always insecure and the default implementation of dns_db_issecure() returns false. for similar reasons, hashsize() is no longer defined for zone databases. implementation functions that are shared between zone and cache are now prepended with 'dns__rbtdb_' so they can become nonstatic. serve_stale_ttl is now a common member of dns_db.	2023-07-17 14:50:25 +02:00
Tony Finch	e2eaefbf7a	Check for overflow when resizing a heap Ensure that the heap size calculations produce the correct answers, and use `isc_mem_reget()` instead of calling `get` and `put`. Closes #4122	2023-06-27 12:38:09 +02:00
Tony Finch	14f5b79c74	Check for overflow in jemalloc_shim When compiled using a malloc that lacks an equivalent to sallocx(), the jemalloc_shim adds a size prefix to each allocation. We must check that this does not overflow. Closes #4121	2023-06-27 12:38:09 +02:00
Tony Finch	92fcb7457c	Use isc_mem_callocate() in http_calloc() Closes #4120	2023-06-27 12:38:09 +02:00
Tony Finch	81d73600c1	Add isc_mem_callocate() for safer array allocation As well as clearing the fresh memory, `calloc()`-like functions must ensure that the count and size do not overflow when multiplied. Use `isc_mem_callocate()` in `isc__uv_calloc()`.	2023-06-27 12:38:09 +02:00
Tony Finch	7474cad4ad	Add <isc/overflow.h> for checked mul, add, and sub The `ISC_OVERFLOW_XXX()` macros are usually wrappers around `__builtin_xxx_overflow()`, with alternative implementations for compilers that lack the builtins. Replace the overflow checks in `isc/time.c` with the new macros.	2023-06-27 12:38:09 +02:00
Ondřej Surý	5bd9343c4e	Remove the explicit call_rcu thread creating and destruction The free_all_cpu_call_rcu_data() call can consume hundreds of milliseconds on shutdown. Don't try to be smart and let the RCU library handle this internally.	2023-06-27 07:59:00 +02:00
Tony Finch	e18ca83a3b	Improve statschannel HTTP Connection: header protocol conformance In HTTP/1.0 and HTTP/1.1, RFC 9112 section 9.6 says the last response in a connection should include a `Connection: close` header, but the statschannel server omitted it. In an HTTP/1.0 response, the statschannel server can sometimes send a `Connection: keep-alive` header when it is about to close the connection. There are two ways: If the first request on a connection is keep-alive and the second request is not, then _both_ responses have `Connection: keep-alive` but the connection is (correctly) closed after the second response. If a single request contains Connection: close Connection: keep-alive then RFC 9112 section 9.3 says the keep-alive header is ignored, but the statschannel sends a spurious keep-alive in its response, though it correctly closes the connection. To fix these bugs, make it more clear that the `httpd->flags` are part of the per-request-response state. The Connection: flags are now described in terms of the effect they have instead of what causes them to be set.	2023-06-15 17:03:09 +01:00
Ondřej Surý	a8e6c3b8f7	Make isc_result tables smaller The isc_result_t enum was to sparse when each library code would skip to next << 16 as a base. Remove the huge holes in the isc_result_t enum to make the isc_result tables more compact. This change required a rewrite how we map dns_rcode_t to isc_result_t and back, so we don't ever return neither isc_result_t value nor dns_rcode_t out of defined range.	2023-06-15 15:32:04 +02:00
Midnight Veil	dd6acc1cac	Translate POSIX errorcode EROFS to ISC_R_NOPERM Report "permission denied" instead of "unexpected error" when trying to update a zone file on a read-only file system.	2023-06-14 13:12:45 +01:00
Mark Andrews	e6e4ac05b8	Fix typo in synchronize_rcu macro (add h) synchronize_rcu has not been used until now in BIND9 and there was a typo in the define (a 'h' was missing).	2023-06-06 08:10:09 +10:00
Ondřej Surý	f760ee3f8c	Disable URCU inlining if inlined rcu_dereference() fails to compile In some cases, the inlined version rcu_dereference() would not compile when working on pointer to opaque struct (namely Ubuntu Jammy). Detect such condition in the autoconf and disable the inlining of the small functions if it breaks the build.	2023-06-01 16:51:38 +02:00
Mark Andrews	ac2e0bc3ff	Move isc_mem_put to after node is checked for equality isc_mem_put NULL's the pointer to the memory being freed. The equality test 'parent->r == node' was accidentally being turned into a test against NULL.	2023-05-29 01:40:57 +00:00
Evan Hunt	512e5e786b	don't set SHUTTINGDOWN until after calling the request callbacks if we set ISC_HTTPDMGR_SHUTTINGDOWN in the http manager before calling the pending request callbacks, it can trigger an assertion.	2023-05-27 00:41:37 +00:00
Michal Nowak	1fe5c008d6	Ensure "wrap" variable is non-NULL RUNTIME_CHECK on the "wrap" variable avoids possible NULL dereference: thread.c: In function 'thread_wrap': thread.c:60:15: error: dereference of possibly-NULL 'wrap' [CWE-690] [-Werror=analyzer-possible-null-dereference] 60 \| *wrap = (struct thread_wrap){ The RUNTIME_CHECK was there before `7d1ceaf35d`.	2023-05-19 11:02:59 +02:00
Michał Kępień	6029010dd2	Remove <isc/cmocka.h> The last use of the cmocka_add_test_byname() helper macro was removed in commit `63fe9312ff`. Remove the <isc/cmocka.h> header that defines it.	2023-05-18 15:12:23 +02:00
Tony Finch	c319ccd4c9	Fixes for liburcu-qsbr Move registration and deregistration of the main thread from `isc_loopmgr_run()` into `isc__initialize()` / `isc__shutdown()`: liburcu-qsbr fails an assertion if we try to use it from an unregistered thread, and we need to be able to use it when the event loops are not running. Use `rcu_assign_pointer()` and `rcu_dereference()` in qp-trie transactions so that they properly mark threads as online. The RCU-protected pointer is no longer declared atomic because liburcu does not (yet) use standard C atomics. Fix the definition of `isc_qsbr_rcu_dereference()` to return the referenced value, and to call the right function inside liburcu. Change the thread sanitizer suppressions to match any variant of `rcu_*_barrier()`	2023-05-15 20:49:42 +00:00
Tony Finch	afae41aa40	Check the return value from uv_async_send() An omission pointed out by the following report from Coverity: /lib/isc/loop.c: 483 in isc_loopmgr_pause() >>> CID 455002: Error handling issues (CHECKED_RETURN) >>> Calling "uv_async_send" without checking return value (as is done elsewhere 5 out of 6 times). 483 uv_async_send(&loop->pause_trigger);	2023-05-15 18:52:04 +01:00
Evan Hunt	b4ac7faee9	allow streamdns read to resume after timeout when reading on a streamdns socket failed due to timeout, but the dispatch was still waiting for other responses, it would resume reading by calling isc_nm_read() again. this caused an assertion because the socket was already reading. we now check that either the socket is reading, or that it was already reading on the same handle.	2023-05-13 23:31:45 -07:00
Tony Finch	fc770a8bd0	Remove the now-unused ISC_STACK We are using the liburcu concurrent data structures instead.	2023-05-12 20:49:43 +01:00
Tony Finch	f11cc83142	Use per-CPU RCU helper threads Create and free per-CPU helper threads from the main thread and tell thread sanitizer to suppress leaking threads. (We are not leaking threads ourselves and we can safely ignore the Userspace-RCU thread leaks.)	2023-05-12 20:48:31 +01:00
Tony Finch	c377e0a9e3	Help thread sanitizer to cope with liburcu All the places the qp-trie code was using `call_rcu()` needed `__tsan_release()` and `__tsan_acquire()` annotations, so add a couple of wrappers to encapsulate this pattern. With these wrappers, the tests run almost clean under thread sanitizer. The remaining problems are due to `rcu_barrier()` which can be suppressed using `.tsan-suppress`. It does not suppress the whole of `liburcu`, because we would like thread sanitizer to detect problems in `call_rcu()` callbacks, which are called from `liburcu`. The CI jobs have been updated to use `.tsan-suppress` by default, except for a special-case job that needs the additional suppressions in `.tsan-suppress-extra`. We might be able to get rid of some of this after liburcu gains support for thread sanitizer. Note: the `rcu_barrier()` suppression is not entirely effective: tsan sometimes reports races that originate inside `rcu_barrier()` but tsan has discarded the stack so it does not have the information required to suppress the report. These "races" can be made much easier to reproduce by adding `atexit_sleep_ms=1000` to `TSAN_OPTIONS`. The problem with tsan's short memory can be addressed by increasing `history_size`: when it is large enough (6 or 7) the `rcu_barrier()` stack usually survives long enough for suppression to work.	2023-05-12 20:48:31 +01:00
Tony Finch	05ca11e122	Remove isc_qsbr (we are using liburcu instead) This commit breaks the qp-trie code.	2023-05-12 20:48:31 +01:00
Tony Finch	cd0795beea	Slightly more sanitary thread dispatch Tell thread sanitizer that the thread wrapper is released before passing it to a new thread.	2023-05-12 20:48:31 +01:00
Tony Finch	2e0c954806	Wait for RCU to finish before destroying a memory context Memory reclamation by `call_rcu()` is asynchronous, so during shutdown it can lose a race with the destruction of its memory context. When we defer memory reclamation, we need to attach to the memory context to indicate that it is still in use, but that is not enough to delay its destruction. So, call `rcu_barrier()` in `isc_mem_destroy()` to wait for pending RCU work to finish before proceeding to destroy the memory context.	2023-05-12 20:48:31 +01:00
Tony Finch	4f97a679f0	A macro for the size of a struct with a flexible array member It can be fairly long-winded to allocate space for a struct with a flexible array member: in general we need the size of the struct, the size of the member, and the number of elements. Wrap them all up in a STRUCT_FLEX_SIZE() macro, and use the new macro for the flexible arrays in isc_ht and dns_qp.	2023-05-12 20:48:31 +01:00
Ondřej Surý	fd3522c37b	Add Userspace-RCU to global CFLAGS and LIBS The Userspace-RCU headers are now needed for more parts of the libisc and libdns, thus we need to add it globally to prevent compilation failures on systems with non-standard Userspace-RCU installation path.	2023-05-12 14:16:25 +02:00
Ondřej Surý	00f1823366	Change the isc_quota API to use cds_wfcqueue internally The isc_quota API was using locked list of isc_job_t objects to keep the waiting TCP accepts. Change the isc_quota implementation to use cds_wfcqueue internally - the enqueue is wait-free and only dequeue needs to be locked.	2023-05-12 14:16:25 +02:00
Ondřej Surý	7b1d985de2	Change the isc_async API to use cds_wfcqueue internally The isc_async API was using lock-free stack (where enqueue operation was not wait-free). Change the isc_async to use cds_wfcqueue internally - enqueue and splice (move the queue members from one list to another) is nonblocking and wait-free.	2023-05-12 14:16:25 +02:00
Ondřej Surý	7220851f67	Replace glue_cache hashtable with direct link in rdatasetheader Instead of having a global hashtable with a global rwlock for the GLUE cache, move the glue_list directly into rdatasetheader and use Userspace-RCU to update the pointer when the glue_list is empty. Additionally, the cached glue_lists needs to be stored in the RBTDB version for early cleaning, otherwise the circular dependencies between nodes and glue_lists will prevent nodes to be ever cleaned up.	2023-05-12 13:25:39 +02:00
Michal Nowak	31935a3537	Disable ASAN in nsupdate for fatal cases Clang 16 LeakSanitizer reports a memory leak when dns_request_create() returned a TLS error in the nsupdate system test. While technically a memory leak on error handling, it's not a problem because the program is immediately terminated; nsupdate is not expected to run for a prolonged time.	2023-05-11 13:39:51 +02:00
Mark Andrews	9fcd42c672	Re-write remove_old_tsversions and greatest_version Stop deliberately breaking const rules by copying file->name into dirbuf and truncating it there. Handle files located in the root directory properly. Use unlinkat() from POSIX 200809.	2023-05-03 09:12:34 +02:00
Matthijs Mekking	70629d73da	Fix purging old log files with absolute file path Removing old timestamp or increment versions of log backup files did not work when the file is an absolute path: only the entry name was provided to the file remove function. The dirname was also bogus, since the file separater was put back too soon. Fix these issues to make log file rotation work when the file is configured to be an absolute path.	2023-05-03 09:12:11 +02:00
Tony Finch	7d1ceaf35d	Move per-thread RCU setup into isc_thread All the per-loop `libuv` setup remains in `isc_loop`, but the per-thread RCU setup is moved to `isc_thread` alongside the other per-thread setup. This avoids repeating the per-thread setup for `call_rcu()` helpers, and explains a little better why some parts of the per-thread setup is missing for `call_rcu()` helpers. This also removes the per-loop `call_rcu()` helpers as we refactored the isc__random_initialize() in the previous commit.	2023-04-27 12:38:53 +02:00
Ondřej Surý	65021dbf52	Move the isc_random API initialization to the thread_local variable Instead of writing complicated wrappers for every thread, move the initialization back to isc_random unit and check whether the random seed was initialized with a thread_local variable. Ensure that isc_entropy_get() returns a non-zero seed. This avoids problems with thread sanitizer tests getting stuck in an infinite loop.	2023-04-27 12:38:53 +02:00
Tony Finch	e0248bf60f	Simplify isc_thread a little Remove the `isc_threadarg_t` and `isc_threadresult_t` typedefs which were unhelpful disguises for `void *`, and free the dummy jemalloc allocation sooner.	2023-04-27 12:38:53 +02:00
Tony Finch	06f534fa69	Avoid spurious compilation failures in liburcu headers When liburcu is not installed from a system package, its headers are not treated as system headers by the compiler, so BIND's -Werror and other warning options take effect. The liburcu headers have a lot of inline functions, some of which do not use all their arguments, which BIND's build treats as an error.	2023-04-27 12:38:53 +02:00
Ondřej Surý	c2c907d728	Improve the Userspace RCU integration This commit allows BIND 9 to be compiled with different flavours of Userspace RCU, and improves the integration between Userspace RCU and our event loop: - In the RCU QSBR, the thread is put offline when polling and online when rcu_dereference, rcu_assign_pointer (or friends) are called. - In other RCU modes, we check that we are not reading when reaching the quiescent callback in the event loop. - We register the thread before uv_work_run() callback is called and after it has finished. The rcu_(un)register_thread() has a large overhead, but that's fine in this case.	2023-04-27 12:38:53 +02:00
Ondřej Surý	58663574b9	Use server socket to log TCP accept failures The accept_connection() could detach from the child socket on a failure, so we need to keep and use the server socket for logging the accept failures.	2023-04-27 11:07:57 +02:00
Ondřej Surý	27ad3a65f9	Fix potential UAF when shutting down isc_httpd Use the ISC_LIST_FOREACH_SAFE() macro to safely walk the running https and shut them down in a manner safe from deletion.	2023-04-25 08:16:46 +02:00
Ondřej Surý	ae997d9e21	Add ISC_LIST_FOREACH(_SAFE) macros There's a recurring pattern walking the ISC_LISTs that just repeats over and over. Add two macros: * ISC_LIST_FOREACH(list, elt, link) - walk the static list * ISC_LIST_FOREACH_SAFE(list, elt, link, next) - walk the list in a manner that's safe against list member deletions	2023-04-25 08:16:46 +02:00
Evan Hunt	0393b54afb	add a result code for ENOPROTOOPT, EPROTONOSUPPORT there was no isc_result_t value for invalid protocol errors that could be returned from libuv.	2023-04-21 12:42:10 +02:00
Ondřej Surý	b497e90179	Add isc_spinlock unit with shim pthread_spin implementation The spinlock is small (atomic_uint_fast32_t at most), lightweight synchronization primitive and should only be used for short-lived and most of the time a isc_mutex should be used. Add a isc_spinlock unit which is either (most of the time) a think wrapper around pthread_spin API or an efficient shim implementation of the simple spinlock.	2023-04-21 12:10:02 +02:00
Ondřej Surý	3b10814569	Fix the streaming read callback shutdown logic When shutting down TCP sockets, the read callback calling logic was flawed, it would call either one less callback or one extra. Fix the logic in the way: 1. When isc_nm_read() has been called but isc_nm_read_stop() hasn't on the handle, the read callback will be called with ISC_R_CANCELED to cancel active reading from the socket/handle. 2. When isc_nm_read() has been called and isc_nm_read_stop() has been called on the on the handle, the read callback will be called with ISC_R_SHUTTINGDOWN to signal that the dormant (not-reading) socket is being shut down. 3. The .reading and .recv_read flags are little bit tricky. The .reading flag indicates if the outer layer is reading the data (that would be uv_tcp_t for TCP and isc_nmsocket_t (TCP) for TLSStream), the .recv_read flag indicates whether somebody is interested in the data read from the socket. Usually, you would expect that the .reading should be false when .recv_read is false, but it gets even more tricky with TLSStream as the TLS protocol might need to read from the socket even when sending data. Fix the usage of the .recv_read and .reading flags in the TLSStream to their true meaning - which mostly consist of using .recv_read everywhere and then wrapping isc_nm_read() and isc_nm_read_stop() with the .reading flag. 4. The TLS failed read helper has been modified to resemble the TCP code as much as possible, clearing and re-setting the .recv_read flag in the TCP timeout code has been fixed and .recv_read is now cleared when isc_nm_read_stop() has been called on the streaming socket. 5. The use of Network Manager in the named_controlconf, isccc_ccmsg, and isc_httpd units have been greatly simplified due to the improved design. 6. More unit tests for TCP and TLS testing the shutdown conditions have been added. Co-authored-by: Ondřej Surý <ondrej@isc.org> Co-authored-by: Artem Boldariev <artem@isc.org>	2023-04-20 12:58:32 +02:00
Ondřej Surý	f677cf6b73	Remove unused netmgr->worker->sendbuf By inspecting the code, it was discovered that .sendbuf member of the isc__nm_networker_t was unused and just consuming ~64k per worker. Remove the member and the association allocation/deallocation.	2023-04-14 16:20:14 +02:00
Ondřej Surý	1715cad685	Refactor the isc_quota code and fix the quota in TCP accept code In `e185412872`, the TCP accept quota code became broken in a subtle way - the quota would get initialized on the first accept for the server socket and then deleted from the server socket, so it would never get applied again. Properly fixing this required a bigger refactoring of the isc_quota API code to make it much simpler. The new code decouples the ownership of the quota and acquiring/releasing the quota limit. After (during) the refactoring it became more clear that we need to use the callback from the child side of the accepted connection, and not the server side.	2023-04-12 14:10:37 +02:00

1 2 3 4 5 ...

4894 Commits