mir/bind - bind - Mike's Git repositories

mir/bind

mirror of https://gitlab.isc.org/isc-projects/bind9 synced 2025-08-29 05:28:00 +00:00

Author	SHA1	Message	Date
Mark Andrews	fb2d0e2897	extend named-journalprint to be able to force the journal version named-journalprint can now upgrade or downgrade a journal file in place; the '-u' option upgrades and the '-d' option downgrades.	2021-03-03 17:54:47 -08:00
Evan Hunt	ee19966326	allow dns_journal_rollforward() to read old journal files when the 'max-ixfr-ratio' option was added, journal transaction headers were revised to include a count of RR's in each transaction. this made it impossible to read old journal files after an upgrade. this branch restores the ability to read version 1 transaction headers. when rolling forward, printing journal contents, if the wrong transaction header format is found, we can switch. when dns_journal_rollforward() detects a version 1 transaction header, it returns DNS_R_RECOVERABLE. this triggers zone_postload() to force a rewrite of the journal file in the new format, and also to schedule a dump of the zone database with minimal delay. journal repair is done by dns_journal_compact(), which rewrites the entire journal, ignoring 'max-journal-size'. journal size is corrected later. newly created journal files now have "BIND LOG V9.2" in their headers instead of "BIND LOG V9". files with the new version string cannot be read using the old transaction header format. note that this means newly created journal files will be rejected by older versions of named. named-journalprint now takes a "-x" option, causing it to print transaction header information before each delta, including its format version.	2021-03-03 17:54:47 -08:00
Matthijs Mekking	8c526cb67f	Purge keys implementation On each keymgr run, we now also check if key files can be removed. The 'purge-keys' interval determines how long keys should be retained after they have become completely hidden. Key files should not be removed if it has a state that is set to something else then HIDDEN, if purge-keys is 0 (disabled), if the key goal is set to OMNIPRESENT, or if the key is unused (a key is unused if no timing metadata set, and no states are set or if set, they are set to HIDDEN). If the last changed timing metadata plus the purge-keys interval is in the past, the key files may be removed. Add a dst_key_t variable 'purge' to signal that the key file should not be written to file again.	2021-02-23 09:16:48 +01:00
Matthijs Mekking	313de3a7e2	Add purge-keys config option Add a new option 'purge-keys' to 'dnssec-policy' that will purge key files for deleted keys. The option determines how long key files should be retained prior to removing the corresponding files from disk. If set to 0, the option is disabled and 'named' will not remove key files from disk.	2021-02-23 09:16:48 +01:00
Ondřej Surý	a5d2ce79c8	Stop including gssapi.h from dst/gssapi.h header The only reason for including the gssapi.h from the dst/gssapi.h header was to get the typedefs of gss_cred_id_t and gss_ctx_id_t. Instead of using those types directly this commit introduces dns_gss_cred_id_t and dns_gss_ctx_id_t types that are being used in the public API and privately retyped to their counterparts when we actually call the gss api. This also conceals the gssapi headers, so users of the libdns library doesn't have to add GSSAPI_CFLAGS to the Makefile when including libdns dst API.	2021-02-16 01:04:46 +00:00
Ondřej Surý	23c3bcc711	Stop including dnstap headers from <dns/dnstap.h> The <fstrm.h> and <protobuf-c/protobuf-c.h> headers are only directly included where used and we stopped exposing those headers from libdns headers.	2021-02-16 01:04:46 +00:00
Ondřej Surý	dce292e7f4	Stop including lmdb.h from <dns/view.h> The lmdb.h doesn't have to be included from the dns/view.h header as it is separately included where used. This stops exposing the inclusion of lmdb.h from the libdns headers.	2021-02-16 01:04:46 +00:00
Ondřej Surý	e488309da7	implement xfrin via XoT Add support for a "tls" key/value pair for zone primaries, referencing either a "tls" configuration statement or "ephemeral". If set to use TLS, zones will send SOA and AXFR/IXFR queries over a TLS channel.	2021-01-29 12:07:38 +01:00
Matthijs Mekking	aabdedeae3	Only start stale refresh window when resuming If we did not attempt a fetch due to fetch-limits, we should not start the stale-refresh-time window. Introduce a new flag DNS_DBFIND_STALESTART to differentiate between a resolver failure and unexpected error. If we are resuming, this indicates a resolver failure, then start the stale-refresh-time window, otherwise don't start the stale-refresh-time window, but still fall back to using stale data. (This commit also wraps some docstrings to 80 characters width)	2021-01-28 16:38:34 +01:00
Mark Andrews	dd3520ae41	Improve the diagnostic 'rndc retransfer' error message	2021-01-28 08:43:03 +11:00
Matthijs Mekking	fa0c9280d2	Update code flow in query.c wrt stale data First of all, there was a flaw in the code related to the 'stale-refresh-time' option. If stale answers are enabled, and we returned stale data, then it was assumed that it was because we were in the 'stale-refresh-time' window. But now we could also have returned stale data because of a 'stale-answer-client-timeout'. To fix this, introduce a rdataset attribute DNS_RDATASETATTR_STALE_WINDOW to indicate whether the stale cache entry was returned because the 'stale-refresh-time' window is active. Second, remove the special case handling when the result is DNS_R_NCACHENXRRSET. This can be done more generic in the code block when dealing with stale data. Putting all stale case handling in the code block when dealing with stale data makes the code more easy to follow. Update documentation to be more verbose and to match then new code flow.	2021-01-25 10:48:16 -03:00
Diego Fronza	171a5b7542	Add stale-answer-client-timeout option The general logic behind the addition of this new feature works as folows: When a client query arrives, the basic path (query.c / ns_query_recurse) was to create a fetch, waiting for completion in fetch_callback. With the introduction of stale-answer-client-timeout, a new event of type DNS_EVENT_TRYSTALE may invoke fetch_callback, whenever stale answers are enabled and the fetch took longer than stale-answer-client-timeout to complete. When an event of type DNS_EVENT_TRYSTALE triggers fetch_callback, we must ensure that the folowing happens: 1. Setup a new query context with the sole purpose of looking up for stale RRset only data, for that matters a new flag was added 'DNS_DBFIND_STALEONLY' used in database lookups. . If a stale RRset is found, mark the original client query as answered (with a new query attribute named NS_QUERYATTR_ANSWERED), so when the fetch completion event is received later, we avoid answering the client twice. . If a stale RRset is not found, cleanup and wait for the normal fetch completion event. 2. In ns_query_done, we must change this part: /* * If we're recursing then just return; the query will * resume when recursion ends. */ if (RECURSING(qctx->client)) { return (qctx->result); } To this: if (RECURSING(qctx->client) && !QUERY_STALEONLY(qctx->client)) { return (qctx->result); } Otherwise we would not proceed to answer the client if it happened that a stale answer was found when looking up for stale only data. When an event of type DNS_EVENT_FETCHDONE triggers fetch_callback, we proceed as before, resuming query, updating stats, etc, but a few exceptions had to be added, most important of which are two: 1. Before answering the client (ns_client_send), check if the query wasn't already answered before. 2. Before detaching a client, e.g. isc_nmhandle_detach(&client->reqhandle), ensure that this is the fetch completion event, and not the one triggered due to stale-answer-client-timeout, so a correct call would be: if (!QUERY_STALEONLY(client)) { isc_nmhandle_detach(&client->reqhandle); } Other than these notes, comments were added in code in attempt to make these updates easier to follow.	2021-01-25 10:47:14 -03:00
Diego Fronza	74840ec50b	Added dns_view_staleanswerenabled() function Since it takes a couple lines of code to check whether stale answers are enabled for a given view, code was extracted out to a proper function.	2021-01-25 10:47:14 -03:00
Matthijs Mekking	68d715a229	Publish CDS/CDNSKEY Delete Records Check if zone is transitioning from secure to insecure. If so, delete the CDS/CDNSKEY records, otherwise make sure they are not part of the RRset.	2020-12-23 09:02:11 +01:00
Matthijs Mekking	cf420b2af0	Treat dnssec-policy "none" as a builtin zone Configure "none" as a builtin policy. Change the 'cfg_kasp_fromconfig' api so that the 'name' will determine what policy needs to be configured. When transitioning a zone from secure to insecure, there will be cases when a zone with no DNSSEC policy (dnssec-policy none) should be using KASP. When there are key state files available, this is an indication that the zone once was DNSSEC signed but is reconfigured to become insecure. If we would not run the keymgr, named would abruptly remove the DNSSEC records from the zone, making the zone bogus. Therefore, change the code such that a zone will use kasp if there is a valid dnssec-policy configured, or if there are state files available.	2020-12-23 09:02:11 +01:00
Matthijs Mekking	8f2c5e45da	Add function to see if dst key uses kasp For purposes of zones transitioning back to insecure mode, it is practical to see if related keys have a state file associated.	2020-12-23 09:02:11 +01:00
Ondřej Surý	ef685bab5c	Print warning when falling back to increment soa serial method When using the `unixtime` or `date` method to update the SOA serial, `named` and `dnssec-signzone` would silently fallback to `increment` method to prevent the new serial number to be smaller than the old serial number (using the serial number arithmetics). Add a warning message when such fallback happens.	2020-12-11 10:48:28 +01:00
Ondřej Surý	7ba18870dc	Reformat sources using clang-format-11	2020-12-08 18:36:23 +01:00
Matthijs Mekking	64db30942d	Add NSEC3PARAM unit test, refactor zone.c Add unit test to ensure the right NSEC3PARAM event is scheduled in 'dns_zone_setnsec3param()'. To avoid scheduling and managing actual tasks, split up the 'dns_zone_setnsec3param()' function in two parts: 1. 'dns__zone_lookup_nsec3param()' that will check if the requested NSEC3 parameters already exist, and if a new salt needs to be generated. 2. The actual scheduling of the new NSEC3PARAM event (if needed).	2020-11-26 10:43:59 +01:00
Matthijs Mekking	6b5d7357df	Detect NSEC3 salt collisions When generating a new salt, compare it with the previous NSEC3 paremeters to ensure the new parameters are different from the previous ones. This moves the salt generation call from 'bin/named/*.s' to 'lib/dns/zone.c'. When setting new NSEC3 parameters, you can set a new function parameter 'resalt' to enforce a new salt to be generated. A new salt will also be generated if 'salt' is set to NULL. Logging salt with zone context can now be done with 'dnssec_log', removing the need for 'dns_nsec3_log_salt'.	2020-11-26 10:43:59 +01:00
Matthijs Mekking	7878f300ff	Move logging of salt in separate function There may be a desire to log the salt without losing the context of log module, level, and category.	2020-11-26 10:43:59 +01:00
Matthijs Mekking	6f97bb6b1f	Change nsec3param salt config to saltlen Upon request from Mark, change the configuration of salt to salt length. Introduce a new function 'dns_zone_checknsec3aram' that can be used upon reconfiguration to check if the existing NSEC3 parameters are in sync with the configuration. If a salt is used that matches the configured salt length, don't change the NSEC3 parameters.	2020-11-26 10:43:59 +01:00
Matthijs Mekking	00c5dabea3	Add check for NSEC3 and key algorithms NSEC3 is not backwards compatible with key algorithms that existed before the RFC 5155 specification was published.	2020-11-26 10:43:59 +01:00
Matthijs Mekking	7039c5f805	Check nsec3param configuration values Check 'nsec3param' configuration for the number of iterations. The maximum number of iterations that are allowed are based on the key size (see https://tools.ietf.org/html/rfc5155#section-10.3). Check 'nsec3param' configuration for correct salt. If the string is not "-" or hex-based, this is a bad salt.	2020-11-26 10:43:27 +01:00
Matthijs Mekking	114af58ee2	Support for NSEC3 in dnssec-policy Implement support for NSEC3 in dnssec-policy. Store the configuration in kasp objects. When configuring a zone, call 'dns_zone_setnsec3param' to queue an nsec3param event. This will ensure that any previous chains will be removed and a chain according to the dnssec-policy is created. Add tests for dnssec-policy zones that uses the new 'nsec3param' option, as well as changing to new values, changing to NSEC, and changing from NSEC.	2020-11-26 10:43:27 +01:00
Matthijs Mekking	84a4273074	Move generate_salt function to lib/dns/nsec3 We will be using this function also on reconfig, so it should have a wider availability than just bin/named/server.	2020-11-26 10:43:27 +01:00
Mark Andrews	38d6f68de4	add dns_dns64_findprefix	2020-11-25 08:25:29 +11:00
Diego Fronza	581e2a8f28	Check 'stale-refresh-time' when sharing cache between views This commit ensures that, along with previous restrictions, a cache is shareable between views only if their 'stale-refresh-time' value are equal.	2020-11-11 12:53:24 -03:00
Diego Fronza	4827ad0ec4	Add stale-refresh-time option Before this update, BIND would attempt to do a full recursive resolution process for each query received if the requested rrset had its ttl expired. If the resolution fails for any reason, only then BIND would check for stale rrset in cache (if 'stale-cache-enable' and 'stale-answer-enable' is on). The problem with this approach is that if an authoritative server is unreachable or is failing to respond, it is very unlikely that the problem will be fixed in the next seconds. A better approach to improve performance in those cases, is to mark the moment in which a resolution failed, and if new queries arrive for that same rrset, try to respond directly from the stale cache, and do that for a window of time configured via 'stale-refresh-time'. Only when this interval expires we then try to do a normal refresh of the rrset. The logic behind this commit is as following: - In query.c / query_gotanswer(), if the test of 'result' variable falls to the default case, an error is assumed to have happened, and a call to 'query_usestale()' is made to check if serving of stale rrset is enabled in configuration. - If serving of stale answers is enabled, a flag will be turned on in the query context to look for stale records: query.c:6839 qctx->client->query.dboptions \|= DNS_DBFIND_STALEOK; - A call to query_lookup() will be made again, inside it a call to 'dns_db_findext()' is made, which in turn will invoke rbdb.c / cache_find(). - In rbtdb.c / cache_find() the important bits of this change is the call to 'check_stale_header()', which is a function that yields true if we should skip the stale entry, or false if we should consider it. - In check_stale_header() we now check if the DNS_DBFIND_STALEOK option is set, if that is the case we know that this new search for stale records was made due to a failure in a normal resolution, so we keep track of the time in which the failured occured in rbtdb.c:4559: header->last_refresh_fail_ts = search->now; - In check_stale_header(), if DNS_DBFIND_STALEOK is not set, then we know this is a normal lookup, if the record is stale and the query time is between last failure time + stale-refresh-time window, then we return false so cache_find() knows it can consider this stale rrset entry to return as a response. The last additions are two new methods to the database interface: - setservestale_refresh - getservestale_refresh Those were added so rbtdb can be aware of the value set in configuration option, since in that level we have no access to the view object.	2020-11-11 12:53:23 -03:00
Evan Hunt	1170a52f48	remove isc_task from xfrin since the network manager is now handling timeouts, xfrin doesn't need an isc_task object. it may be necessary to revert this later if we find that it's important for zone_xfrdone() to be executed in the zone task context. currently things seem to be working well without that, though.	2020-11-09 13:45:43 +01:00
Evan Hunt	a8d28881d1	remove isc_timer from xfrin the network manager can now handle timeouts, so it isn't necessary for xfrin to use isc_timer for the purpose any longer.	2020-11-09 13:45:43 +01:00
Evan Hunt	49d53a4aa9	use netmgr for xfrin Use isc_nm_tcpdnsconnect() in xfrin.c for zone transfers.	2020-11-09 13:45:43 +01:00
Michal Nowak	1f6f7ccad6	Drop unused portlist code	2020-10-22 13:11:16 +02:00
Michal Nowak	e67737aa75	Drop unused dbtable code	2020-10-22 13:11:16 +02:00
Ondřej Surý	bb990030d3	Simplify the EDNS buffer size logic for DNS Flag Day 2020 The DNS Flag Day 2020 aims to remove the IP fragmentation problem from the UDP DNS communication. In this commit, we implement the required changes and simplify the logic for picking the EDNS Buffer Size. 1. The defaults for `edns-udp-size`, `max-udp-size` and `nocookie-udp-size` have been changed to `1232` (the value picked by DNS Flag Day 2020). 2. The probing heuristics that would try 512->4096->1432->1232 buffer sizes has been removed and the resolver will always use just the `edns-udp-size` value. 3. Instead of just disabling the PMTUD mechanism on the UDP sockets, we now set IP_DONTFRAG (IPV6_DONTFRAG) flag. That means that the UDP packets won't get ever fragmented. If the ICMP packets are lost the UDP will just timeout and eventually be retried over TCP.	2020-10-05 16:21:21 +02:00
Matthijs Mekking	70d1ec432f	Use explicit result codes for 'rndc dnssec' cmd It is better to add new result codes than to overload existing codes.	2020-10-05 10:53:46 +02:00
Matthijs Mekking	edc53fc416	Various rndc dnssec -checkds fixes While working on 'rndc dnssec -rollover' I noticed the following (small) issues: - The key files where updated with hints set to "-when" and that should always be "now. - The kasp system test did not properly update the test number when calling 'rndc dnssec -checkds' (and ensuring that works). - There was a missing ']' in the rndc.c help output.	2020-10-05 10:53:46 +02:00
Matthijs Mekking	df8276aef0	Add manual key rollover logic Add to the keymgr a function that will schedule a rollover. This basically means setting the time when the key needs to retire, and updating the key lifetime, then update the state file. The next time that named runs the keymgr the new lifetime will be taken into account.	2020-10-05 10:52:19 +02:00
Ondřej Surý	33eefe9f85	The dns_message_create() cannot fail, change the return to void The dns_message_create() function cannot soft fail (as all memory allocations either succeed or cause abort), so we change the function to return void and cleanup the calls.	2020-09-29 08:22:08 +02:00
Diego Fronza	12d6d13100	Refactored dns_message_t for using attach/detach semantics This commit will be used as a base for the next code updates in order to have a better control of dns_message_t objects' lifetime.	2020-09-29 08:22:08 +02:00
Matthijs Mekking	8beda7d2ea	Add -expired flag to rndc dumpdb command This flag is the same as -cache, but will use a different style format that will also print expired entries (awaiting cleanup) from the cache.	2020-09-23 16:08:29 +02:00
Mark Andrews	f0d9bf7c30	Clone the saved / query message buffers The message buffer passed to ns__client_request is only valid for the life of the the ns__client_request call. Save a copy of it when we recurse or process a update as ns__client_request will return before those operations complete.	2020-09-23 10:37:42 +10:00
Evan Hunt	dcee985b7f	update all copyright headers to eliminate the typo	2020-09-14 16:20:40 -07:00
Mark Andrews	0b2555e8cf	Address use after free between view, resolver and nta. Hold a weak reference to the view so that it can't go away while nta is performing its lookups. Cancel nta timers once all external references to the view have gone to prevent them triggering new work.	2020-08-11 11:00:49 +10:00
Matthijs Mekking	46fcd927e7	rndc dnssec -checkds set algorithm In the rare case that you have multiple keys acting as KSK and that have the same keytag, you can now set the algorithm when calling '-checkds'.	2020-08-07 11:26:09 +02:00
Matthijs Mekking	a25f49f153	Make 'parent-registration-delay' obsolete With the introduction of 'checkds', the 'parent-registration-delay' option becomes obsolete.	2020-08-07 11:26:09 +02:00
Matthijs Mekking	04d8fc0143	Implement 'rndc dnssec -checkds' Add a new 'rndc' command 'dnssec -checkds' that allows the user to signal named that a new DS record has been seen published in the parent, or that an existing DS record has been withdrawn from the parent. Upon the 'checkds' request, 'named' will write out the new state for the key, updating the 'DSPublish' or 'DSRemoved' timing metadata. This replaces the "parent-registration-delay" configuration option, this was unreliable because it was purely time based (if the user did not actually submit the new DS to the parent for example, this could result in an invalid DNSSEC state). Because we cannot rely on the parent registration delay for state transition, we need to replace it with a different guard. Instead, if a key wants its DS state to be moved to RUMOURED, the "DSPublish" time must be set and must not be in the future. If a key wants its DS state to be moved to UNRETENTIVE, the "DSRemoved" time must be set and must not be in the future. By default, with '-checkds' you set the time that the DS has been published or withdrawn to now, but you can set a different time with '-when'. If there is only one KSK for the zone, that key has its DS state moved to RUMOURED. If there are multiple keys for the zone, specify the right key with '-key'.	2020-08-07 11:26:09 +02:00
Ondřej Surý	e24bc324b4	Fix the rbt hashtable and grow it when setting max-cache-size There were several problems with rbt hashtable implementation: 1. Our internal hashing function returns uint64_t value, but it was silently truncated to unsigned int in dns_name_hash() and dns_name_fullhash() functions. As the SipHash 2-4 higher bits are more random, we need to use the upper half of the return value. 2. The hashtable implementation in rbt.c was using modulo to pick the slot number for the hash table. This has several problems because modulo is: a) slow, b) oblivious to patterns in the input data. This could lead to very uneven distribution of the hashed data in the hashtable. Combined with the single-linked lists we use, it could really hog-down the lookup and removal of the nodes from the rbt tree[a]. The Fibonacci Hashing is much better fit for the hashtable function here. For longer description, read "Fibonacci Hashing: The Optimization that the World Forgot"[b] or just look at the Linux kernel. Also this will make Diego very happy :). 3. The hashtable would rehash every time the number of nodes in the rbt tree would exceed 3 * (hashtable size). The overcommit will make the uneven distribution in the hashtable even worse, but the main problem lies in the rehashing - every time the database grows beyond the limit, each subsequent rehashing will be much slower. The mitigation here is letting the rbt know how big the cache can grown and pre-allocate the hashtable to be big enough to actually never need to rehash. This will consume more memory at the start, but since the size of the hashtable is capped to `1 << 32` (e.g. 4 mio entries), it will only consume maximum of 32GB of memory for hashtable in the worst case (and max-cache-size would need to be set to more than 4TB). Calling the dns_db_adjusthashsize() will also cap the maximum size of the hashtable to the pre-computed number of bits, so it won't try to consume more gigabytes of memory than available for the database. FIXME: What is the average size of the rbt node that gets hashed? I chose the pagesize (4k) as initial value to precompute the size of the hashtable, but the value is based on feeling and not any real data. For future work, there are more places where we use result of the hash value modulo some small number and that would benefit from Fibonacci Hashing to get better distribution. Notes: a. A doubly linked list should be used here to speedup the removal of the entries from the hashtable. b. https://probablydance.com/2018/06/16/fibonacci-hashing-the-optimization-that-the-world-forgot-or-a-better-alternative-to-integer-modulo/	2020-07-21 08:44:26 +02:00
Michał Kępień	53120279b5	Fix locking for LMDB 0.9.26 When "rndc reconfig" is run, named first configures a fresh set of views and then tears down the old views. Consider what happens for a single view with LMDB enabled; "envA" is the pointer to the LMDB environment used by the original/old version of the view, "envB" is the pointer to the same LMDB environment used by the new version of that view: 1. mdb_env_open(envA) is called when the view is first created. 2. "rndc reconfig" is called. 3. mdb_env_open(envB) is called for the new instance of the view. 4. mdb_env_close(envA) is called for the old instance of the view. This seems to have worked so far. However, an upstream change [1] in LMDB which will be part of its 0.9.26 release prevents the above sequence of calls from working as intended because the locktable mutexes will now get destroyed by the mdb_env_close() call in step 4 above, causing any subsequent mdb_txn_begin() calls to fail (because all of the above steps are happening within a single named process). Preventing the above scenario from happening would require either redesigning the way we use LMDB in BIND, which is not something we can easily backport, or redesigning the way BIND carries out its reconfiguration process, which would be an even more severe change. To work around the problem, set MDB_NOLOCK when calling mdb_env_open() to stop LMDB from controlling concurrent access to the database and do the necessary locking in named instead. Reuse the view->new_zone_lock mutex for this purpose to prevent the need for modifying struct dns_view (which would necessitate library API version bumps). Drop use of MDB_NOTLS as it is made redundant by MDB_NOLOCK: MDB_NOTLS only affects where LMDB reader locktable slots are stored while MDB_NOLOCK prevents the reader locktable from being used altogether. [1] `2fd44e3251`	2020-07-10 11:29:18 +02:00
Evan Hunt	16e14353b1	add "primaries" as a synonym for "masters" in named.conf as "type primary" is preferred over "type master" now, it makes sense to make "primaries" available as a synonym too. added a correctness check to ensure "primaries" and "masters" cannot both be used in the same zone.	2020-07-01 11:11:34 -07:00

... 2 3 4 5 6 ...

2460 Commits