mir/bind - bind - Mike's Git repositories

mir/bind

mirror of https://gitlab.isc.org/isc-projects/bind9 synced 2025-09-02 07:35:26 +00:00

Author	SHA1	Message	Date
Mark Andrews	ed4e00713f	Check opcode of messages returned by dns_request_getresponse	2021-07-20 15:17:32 +10:00
Artem Boldariev	87f79a67f2	Add a missing break on error when adding a DoH endpoint The break on error when adding a DoH endpoint was implied but somehow gotten missed. This commit fixes that.	2021-07-16 11:50:22 +03:00
Artem Boldariev	590e8e0b86	Make max number of HTTP/2 streams configurable This commit makes number of concurrent HTTP/2 streams per connection configurable as a mean to fight DDoS attacks. As soon as the limit is reached, BIND terminates the whole session. The commit adds a global configuration option (http-streams-per-connection) which can be overridden in an http <name> {...} statement like follows: http local-http-server { ... streams-per-connection 100; ... }; For now the default value is 100, which should be enough (e.g. NGINX uses 128, but it is a full-featured WEB-server). When using lower numbers (e.g. ~70), it is possible to hit the limit with e.g. flamethrower.	2021-07-16 11:50:22 +03:00
Artem Boldariev	03a557a9bb	Add (http-)listener-clients option (DoH quota mechanism) This commit adds support for http-listener-clients global options as well as ability to override the default in an HTTP server description, like: http local-http-server { ... listener-clients 100; ... }; This way we have ability to specify per-listener active connections quota globally and then override it when required. This is exactly what AT&T requested us: they wanted a functionality to specify quota globally and then override it for specific IPs. This change functionality makes such a configuration possible. It makes sense: for example, one could have different quotas for internal and external clients. Or, for example, one could use BIND's internal ability to serve encrypted DoH with some sane quota value for internal clients, while having un-encrypted DoH listener without quota to put BIND behind a load balancer doing TLS offloading for external clients. Moreover, the code no more shares the quota with TCP, which makes little sense anyway (see tcp-clients option), because of the nature of interaction of DoH clients: they tend to keep idle opened connections for longer periods of time, preventing the TCP and TLS client from being served. Thus, the need to have a separate, generally larger, quota for them. Also, the change makes any option within "http <name> { ... };" statement optional, making it easier to override only required default options. By default, the DoH connections are limited to 300 per listener. I hope that it is a good initial guesstimate.	2021-07-16 11:50:20 +03:00
Artem Boldariev	954240467d	Verify HTTP paths both in incoming requests and in config file This commit adds the code (and some tests) which allows verifying validity of HTTP paths both in incoming HTTP requests and in BIND's configuration file.	2021-07-16 10:28:08 +03:00
Michał Kępień	29d8d35869	Tweak query_addds() comments to avoid confusion It has been noticed that commit `7a87bf468b` did not only fix NSEC record handling in signed, insecure delegations prepared using both wildcard expansion and CNAME chaining - it also inadvertently fixed DS record handling in signed, secure delegations of that flavor. This is because the 'rdataset' variable in the relevant location in query_addds() can be either a DS RRset or an NSEC RRset. Update a code comment in query_addds() to avoid confusion. Update the comments describing the purpose of query_addds() so that they also mention NSEC(3) records.	2021-07-16 07:20:15 +02:00
Mark Andrews	616896d735	zone->requeststats_on was not being set at the correct point	2021-07-16 13:49:22 +10:00
Matthijs Mekking	577bf913b9	Relax zone_cdscheck function If we have a CDS or CDNSKEY we at least need to have a DNSKEY with the same algorithm published and signing the CDS RRset. Same for CDNSKEY of course. This relaxes the zone_cdscheck function, because before the CDS or CDNSKEY had to match a DNSKEY, now only the algorithm has to match. This allows a provider in a multisigner model to update the CDS/CDNSKEY RRset in the zone that is served by the other provider.	2021-07-14 12:10:11 -07:00
Evan Hunt	4f6e2317e9	document isc__trampoline Added some header file documentation to the isc__trampoline implementation in trampoline_p.h.	2021-07-14 10:55:12 -07:00
Artem Boldariev	64cd7e8a7f	Fix crash in DoH on empty query string in GET requests An unhandled code path left GET query string data uninitialised (equal to NULL) and led to a crash during the requests' base64 data decoding. This commit fixes that.	2021-07-13 16:53:51 +03:00
Ondřej Surý	a9e6a7ae57	Disable setting the thread affinity It was discovered that setting the thread affinity on both the netmgr and netthread threads lead to inconsistent recursive performance because sometimes the netmgr and netthread threads would compete over single resource and sometimes not. Removing setting the affinity causes a slight dip in the authoritative performance around 5% (the measured range was from 3.8% to 7.8%), but the recursive performance is now consistently good.	2021-07-13 14:48:29 +02:00
Ondrej Sury	6eca4b402e	Use max_align_t for memory sizeinfo alignment on OpenBSD On OpenBSD and more generally on platforms without either jemalloc or malloc_(usable_)size, we need to increase the alignment for the memory to sizeof(max_align_t) as with plain sizeof(void *), the compiled code would be crashing when accessing the returned memory.	2021-07-13 13:48:33 +02:00
Matthijs Mekking	b676163933	Fix leak in checkds code In 'checkds_send_toaddr' there is a goto bug that causes the TSIG key and DNS message to not be detached. Remove the offending goto statement.	2021-07-13 11:12:37 +02:00
Ondrej Sury	23751fe252	Cache the isc_os_ncpu() result It was discovered that on some platforms (f.e. Alpine Linux with MUSL) the result of isc_os_ncpus() call differ when called before and after we drop privileges. This commit changes the isc_os_ncpus() call to cache the result from the first call and thus always return the same value during the runtime of the named. The first call to isc_os_ncpus() is made as soon as possible on the library initalization.	2021-07-13 09:12:04 +02:00
Ondřej Surý	ce03015d48	Remove nonnull attribute from isc_mem_{get,allocate,reallocate} The isc_mem_get(), isc_mem_allocate() and isc_mem_reallocate() can return NULL ptr in case where the allocation size is NULL. Remove the nonnull attribute from the functions' declarations. This stems from the following definition in the C11 standard: > If the size of the space requested is zero, the behavior is > implementation-defined: either a null pointer is returned, or the > behavior is as if the size were some nonzero value, except that the > returned pointer shall not be used to access an object. In this case, we return NULL as it's easier to detect errors when accessing pointer from zero-sized allocation which should obviously never happen.	2021-07-12 10:02:18 +02:00
Ondřej Surý	d1a9e549b1	Fix the real allocation size in OpenBSD rallocx shim In the rallocx() shim for OpenBSD (that's the only platform that doesn't have malloc_size() or malloc_usable_size() equivalent), the newly allocated size was missing the extra size_t member for storing the allocation size leading to size_t sized overflow at the end of the reallocated memory chunk.	2021-07-12 08:43:14 +02:00
Mark Andrews	3945c289bb	Reset errcnt at the start of each subtest	2021-07-12 03:47:11 +00:00
Mark Andrews	9528ba5a22	Fix unchecked returns of dns_name_fromtext (cherry picked from commit `8538c762cb`)	2021-07-12 13:26:29 +10:00
Mark Andrews	ce5207699d	Fix unchecked return of isc_rwlock_lock and isc_rwlock_unlock (cherry picked from commit `bcaf23dd27`)	2021-07-12 13:26:29 +10:00
Mark Andrews	3c942a3e3a	Update out of date comment	2021-07-12 12:33:46 +10:00
Mark Andrews	a3fda086f7	Check that there was no OPT record before falling back to plain DNS on FORMERR.	2021-07-12 12:30:03 +10:00
Mark Andrews	c5e1c35e45	Silence untrusted loop bound on nsec3param.iterations 630 1. tainted_argument: Calling function dns_rdata_tostruct taints argument nsec3param.iterations. [show details] 631 result = dns_rdata_tostruct(nsec3rdata, &nsec3param, NULL); 2. Condition !!(result == 0), taking true branch. 3. Condition !!(result == 0), taking true branch. 632 RUNTIME_CHECK(result == ISC_R_SUCCESS); 633 634 dns_fixedname_init(&fixed); CID 281425 (#1 of 1): Untrusted loop bound (TAINTED_SCALAR) 4. tainted_data: Passing tainted expression nsec3param.iterations to dns_nsec3_hashname, which uses it as a loop boundary. [show details] Ensure that tainted values are properly sanitized, by checking that their values are within a permissible range. 635 result = dns_nsec3_hashname(&fixed, rawhash, &rhsize, vctx->origin, 636 vctx->origin, nsec3param.hash, 637 nsec3param.iterations, nsec3param.salt, 638 nsec3param.salt_length);	2021-07-12 00:48:13 +00:00
Mark Andrews	f0fdca90f2	Silence tainted scalar on rdlen 2042 ttl = isc_buffer_getuint32(&j->it.source); 13. tainted_data_transitive: Call to function isc_buffer_getuint16 with tainted argument j->it.source.base returns tainted data. [show details] 14. var_assign: Assigning: rdlen = isc_buffer_getuint16(&j->it.source), which taints rdlen. 2043 rdlen = isc_buffer_getuint16(&j->it.source); 2044 2045 / 2046 * Parse the rdata. 2047 */ 15. Condition j->it.source.used - j->it.source.current != rdlen, taking false branch. 2048 if (isc_buffer_remaininglength(&j->it.source) != rdlen) { 2049 FAIL(DNS_R_FORMERR); 2050 } 16. var_assign_var: Assigning: j->it.source.active = j->it.source.current + rdlen. Both are now tainted. 2051 isc_buffer_setactive(&j->it.source, rdlen); 2052 dns_rdata_reset(&j->it.rdata); 17. lower_bounds: Checking lower bounds of unsigned scalar j->it.source.active by taking the true branch of j->it.source.active > j->it.source.current. CID 316506 (#1 of 1): Untrusted loop bound (TAINTED_SCALAR) 18. tainted_data: Passing tainted expression j->it.source.active to dns_rdata_fromwire, which uses it as a loop boundary. [show details] Ensure that tainted values are properly sanitized, by checking that their values are within a permissible range. 2053 CHECK(dns_rdata_fromwire(&j->it.rdata, rdclass, rdtype, &j->it.source, 2054 &j->it.dctx, 0, &j->it.target));	2021-07-12 00:22:07 +00:00
Mark Andrews	83fd38dd2c	Silence use of tainted scalar 2607 43. tainted_argument: Calling function journal_read_xhdr taints argument xhdr.size. [show details] 2608 result = journal_read_xhdr(j1, &xhdr); 44. Condition rewrite, taking true branch. 45. Condition result == 29, taking false branch. 2609 if (rewrite && result == ISC_R_NOMORE) { 2610 break; 2611 } 46. Condition result != 0, taking false branch. 2612 CHECK(result); 2613 47. var_assign_var: Assigning: size = xhdr.size. Both are now tainted. 2614 size = xhdr.size; CID 331088 (#3 of 3): Untrusted allocation size (TAINTED_SCALAR) 48. tainted_data: Passing tainted expression size to isc__mem_get, which uses it as an allocation size. [show details] Ensure that tainted values are properly sanitized, by checking that their values are within a permissible range. 2615 buf = isc_mem_get(mctx, size);	2021-07-12 00:22:07 +00:00
Ondřej Surý	29a285a67d	Revert the allocate/free -> get/put change from jemalloc change In the jemalloc merge request, we missed the fact that ah_frees and ah_handles are reallocated which is not compatible with using isc_mem_get() for allocation and isc_mem_put() for deallocation. This commit reverts that part and restores use of isc_mem_allocate() and isc_mem_free().	2021-07-09 18:19:57 +02:00
Ondřej Surý	9c3bebc26f	Properly disable the "water" in isc_mem The proper way how to disable the water limit in the isc_mem context is to call: isc_mem_setwater(ctx, NULL, NULL, 0, 0); this ensures that the old water callback is called with ISC_MEM_LOWATER if the callback was called with ISC_MEM_HIWATER before. Historically, there were some places where the limits were disabled by calling: isc_mem_setwater(ctx, water, water_arg, 0, 0); which would also call the old callback, but it also causes the water_t to be allocated and extra check to be executed because water callback is not NULL. This commits unifies the calls to disable water to the preferred form.	2021-07-09 15:58:02 +02:00
Artem Boldariev	3673abc53c	Use restrict and const in isc_mempool_t This commit makes add restrict and const modifiers to some variables to aid compiler to do its optimizations.	2021-07-09 15:58:02 +02:00
Artem Boldariev	c11a401add	Do not use atomic variables in isc_mempool_t As now mempool objects intended to be used in a thread-local manner, there is no point in using atomic here.	2021-07-09 15:58:02 +02:00
Ondřej Surý	63b06571b9	Use isc_mem_get() and isc_mem_put() in isc_mem_total test Previously, the isc_mem_allocate() and isc_mem_free() would be used for isc_mem_total test, but since we now use the real allocation size (sallocx, malloc_size, malloc_usable_size) to track the allocation size, it's impossible to get the test value right. Changing the test to use isc_mem_get() and isc_mem_put() will use the exact size provided, so the test would work again on all the platforms even when jemalloc is not being used.	2021-07-09 15:58:02 +02:00
Ondřej Surý	6f162e8aa4	Rewrite isc_mem water to use single atomic exchange operation This commit refactors the water mechanism in the isc_mem API to use single pointer to a water_t structure that can be swapped with atomic_exchange operation instead of having four different values (water, water_arg, hi_water, lo_water) in the flat namespace. This reduces the need for locking and prevents a race when water and water_arg could be desynchronized.	2021-07-09 15:58:02 +02:00
Ondřej Surý	798333d456	Allow size == 0 in isc_mem_{get,allocate,reallocate} Calls to jemalloc extended API with size == 0 ends up in undefined behaviour. This commit makes the isc_mem_get() and friends calls more POSIX aligned: If size is 0, either a null pointer or a unique pointer that can be successfully passed to free() shall be returned. We picked the easier route (which have been already supported in the old code) and return NULL on calls to the API where size == 0.	2021-07-09 15:58:02 +02:00
Ondřej Surý	e20cc41e56	Use system allocator when jemalloc is unavailable This commit adds support for systems where the jemalloc library is not available as a package, here's the quick summary: * On Linux - the jemalloc is usually available as a package, if configured --without-jemalloc, the shim would be used around malloc(), free(), realloc() and malloc_usable_size() * On macOS - the jemalloc is available from homebrew or macports, if configured --without-jemalloc, the shim would be used around malloc(), free(), realloc() and malloc_size() * On FreeBSD - the jemalloc is the system allocator, we just need to check for <malloc_np.h> header to get access to non-standard API * On NetBSD - the jemalloc is the system allocator, we just need to check for <jemalloc/jemalloc.h> header to get access to non-standard API * On a system hostile to users and developers (read OpenBSD) - the jemalloc API is emulated by using ((size_t *)ptr)[-1] field to hold the size information. The OpenBSD developers care only for themselves, so why should we care about speed on OpenBSD?	2021-07-09 15:58:02 +02:00
Evan Hunt	2ce0de6995	Remove error checks in dns_message for mem allocations Removed error checks for several functions that can no longer fail due to failed memory allocation.	2021-07-09 15:58:02 +02:00
Ondřej Surý	e754360170	Remove atomic thread synchronization from the memory hot-path This commit refactors the hi/lo-water related code to remove contention on the hot path in the memory allocator.	2021-07-09 15:58:02 +02:00
Ondřej Surý	efb385ecdc	Clean up isc_mempool API - isc_mempool_get() can no longer fail; when there are no more objects in the pool, more are always allocated. checking for NULL return is no longer necessary. - the isc_mempool_setmaxalloc() and isc_mempool_getmaxalloc() functions are no longer used and have been removed.	2021-07-09 15:58:02 +02:00
Ondřej Surý	7cbfbc8faa	Clean up the dns_dispatch_getudp API Cleanup unused parts of dns_dispatch_getudp API, remove dns_dispatch_getudp_dup() function and related code.	2021-07-09 15:58:02 +02:00
Ondřej Surý	f487c6948b	Replace locked mempools with memory contexts Current mempools are kind of hybrid structures - they serve two purposes: 1. mempool with a lock is basically static sized allocator with pre-allocated free items 2. mempool without a lock is a doubly-linked list of preallocated items The first kind of usage could be easily replaced with jemalloc small sized arena objects and thread-local caches. The second usage not-so-much and we need to keep this (in libdns:message.c) for performance reasons.	2021-07-09 15:58:02 +02:00
Ondřej Surý	fd3ceec475	Add debug tracing capability to isc_mempool_create/destroy Previously, we only had capability to trace the mempool gets and puts, but for debugging, it's sometimes also important to keep track how many and where do the memory pools get created and destroyed. This commit adds such tracking capability.	2021-07-09 15:58:02 +02:00
Ondřej Surý	5ab05d1696	Replace isc_mem_allocate() usage with isc_mem_get() in netmgr.c The isc_mem_allocate() comes with additional cost because of the memory tracking. In this commit, we replace the usage with isc_mem_get() because we track the allocated sizes anyway, so it's possible to also replace isc_mem_free() with isc_mem_put().	2021-07-09 15:58:02 +02:00
Ondřej Surý	fcc6814776	Replace internal memory calls with non-standard jemalloc API The jemalloc non-standard API fits nicely with our memory contexts, so just rewrite the memory context internals to use the non-public API. There's just one caveat - since we no longer track the size of the allocation for isc_mem_allocate/isc_mem_free combination, we need to use sallocx() to get real allocation size in both allocator and deallocator because otherwise the sizes would not match.	2021-07-09 15:58:02 +02:00
Ondřej Surý	4b3d0c6600	Remove ISC_MEM_DEBUGSIZE and ISC_MEM_DEBUGRECORD The ISC_MEM_DEBUGSIZE and ISC_MEM_DEBUGCTX did sanity checks on matching size and memory context on the memory returned to the allocator. Those will no longer needed when most of the allocator will be replaced with jemalloc.	2021-07-09 15:58:02 +02:00
Ondřej Surý	692fd2a216	Remove default_memalloc and default_memfree Now that we have xmalloc:true enabled, we can remove our xmalloc-like wrappers around malloc and free.	2021-07-09 15:58:02 +02:00
Ondřej Surý	5184384efd	Add recommended jemalloc configuration for our load There's global variable called `malloc_conf` that can be used to configure jemalloc behaviour at the program startup. We use following configuration: * xmalloc:true - abort-on-out-of-memory enabled. * background_thread:true - Enable internal background worker threads to handle purging asynchronously. * metadata_thp:auto - allow jemalloc to use transparent huge page (THP) for internal metadata initially, but may begin to do so when metadata usage reaches certain level. * dirty_decay_ms:30000 - Approximate time in milliseconds from the creation of a set of unused dirty pages until an equivalent set of unused dirty pages is purged and/or reused. * muzzy_decay_ms:30000 - Approximate time in milliseconds from the creation of a set of unused muzzy pages until an equivalent set of unused muzzy pages is purged and/or reused. More information about the specific meaning can be found in the jemalloc manpage or online at http://jemalloc.net/jemalloc.3.html	2021-07-09 15:58:02 +02:00
Ondřej Surý	7f1c525625	Compile with jemalloc to reduce memory allocator contention The jemalloc allocator is scalable high performance allocator, this is the first in the series of commits that will add jemalloc as a memory allocator for BIND 9. This commit adds configure.ac check and Makefile modifications to use jemalloc as BIND 9 allocator.	2021-07-09 15:58:02 +02:00
Ondřej Surý	63924968d1	Add debug tracing capability to isc_mem_create/isc_mem_destroy Previously, we only had capability to trace the memory gets and puts, but for debugging, it's sometimes also important to keep track how many and where do the memory contexts get created and destroyed. This commit adds such tracking capability.	2021-07-09 15:58:02 +02:00
Artem Boldariev	c6d0e3d3a7	Return HTTP status code for small/malformed requests This commit makes BIND return HTTP status codes for malformed or too small requests. DNS request processing code would ignore such requests. Such an approach works well for other DNS transport but does not make much sense for HTTP, not allowing it to complete the request/response sequence. Suppose execution has reached the point where DNS message handling code has been called. In that case, it means that the HTTP request has been successfully processed, and, thus, we are expected to respond to it either with a message containing some DNS payload or at least to return an error status code. This commit ensures that BIND behaves this way.	2021-07-09 16:37:08 +03:00
Artem Boldariev	fedff2cd6c	Return "Bad Request" (400) in a case of Base64 decoding error This error code fits better than the more generic "Internal Server Error" (500) which implies that the problem is on the server. Also, do not end the whole HTTP/2 session on a bad request.	2021-07-09 16:26:46 +03:00
Artem Boldariev	1792740075	Ignore an "Accept" HTTP header value We were too strict regarding the value and presence of "Accept" HTTP header, slightly breaking compatibility with the specification. According to RFC8484 client SHOULD add "Accept" header to the requests but MUST be able to handle "application/dns-message" media type regardless of the value of the header. That basically suggests we ignore its value. Besides, verifying the value of the "Accept" header is a bit tricky because it could contain multiple media types, thus requiring proper parsing. That is doable but does not provide us with any benefits. Among other things, not verifying the value also fixes compatibility with clients, which could advertise multiple media types as supported, which we should accept. For example, it is possible for a perfectly valid request to contain "application/dns-message", "application/", and "/*" in the "Accept" header value. Still, we would treat such a request as invalid.	2021-07-09 16:26:46 +03:00
Artem Boldariev	7b6945fb60	Fix BIND hanging when browsers end HTTP/2 streams prematurely The commit fixes BIND hanging when browsers end HTTP/2 streams prematurely (for example, by sending RST_STREAM). It ensures that isc__nmsocket_prep_destroy() will be called for an HTTP/2 stream, allowing it to be properly disposed. The problem was impossible to reproduce using dig or DoH benchmarking software (e.g. flamethrower) because these do not tend to end HTTP/2 streams prematurely.	2021-07-09 15:42:44 +03:00
Artem Boldariev	094fcc10e7	Move the code which calls server read callback into a separate func This commit moves the code which calls server read callback into a separate function to avoid code repetition.	2021-07-09 15:42:44 +03:00

... 12 13 14 15 16 ...

14012 Commits