mirror of https://gitlab.isc.org/isc-projects/bind9 synced 2025-08-29 13:38:26 +00:00

42584 Commits

Author SHA1 Message Date
Ondřej Surý
4b4eb29452 Wait for memory reclamation to finish in named-checkconf
When named-checkzone loads the zone to the QP database, the delayed
memory reclamation could cause an assertion check on exit.  Add RCU
barrier to wait for the memory reclamation to complete.
2025-03-25 11:00:00 +01:00
Ondřej Surý
4297ae4795 [9.20] fix: dev: Fix invalid cache-line padding for qpcache buckets
The isc_queue_t was missing in the calculation of the required
padding size inside the qpcache bucket structure.

Backport of MR !10306

Merge branch 'backport-ondrej/qpcache-fix-invalid-padding-9.20' into 'bind-9.20'

See merge request isc-projects/bind9!10317
2025-03-25 09:59:33 +00:00
Ondřej Surý
817a0a8e8e Fix invalid cache-line padding for qpcache buckets
The isc_queue_t was missing in the calculation of the required
padding size inside the qpcache bucket structure.

(cherry picked from commit 3ef9b09620c3c3360498098fad5a33b765767ab2)
2025-03-25 09:59:02 +00:00
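The padding arithmetic behind this fix can be illustrated with a minimal sketch. The member sizes and the 64-byte cache line below are assumptions chosen for illustration, not values taken from the qpcache source; the point is only that leaving one member out of the sum yields a structure that no longer ends on a cache-line boundary.

```python
CACHELINE_SIZE = 64  # assumed cache-line size in bytes


def padding_size(member_sizes, cacheline=CACHELINE_SIZE):
    """Bytes of padding needed to round the members up to a cache-line multiple."""
    used = sum(member_sizes)
    return (cacheline - used % cacheline) % cacheline


# Hypothetical member sizes: a lock and a queue inside one bucket.
LOCK_SIZE = 8
QUEUE_SIZE = 16

# Padding computed from all members versus padding that forgets the queue.
correct = padding_size([LOCK_SIZE, QUEUE_SIZE])
stale = padding_size([LOCK_SIZE])

# Remainder of the total bucket size modulo the cache line (0 means aligned).
print((LOCK_SIZE + QUEUE_SIZE + correct) % CACHELINE_SIZE)
print((LOCK_SIZE + QUEUE_SIZE + stale) % CACHELINE_SIZE)
```

With these assumed sizes, the first remainder is 0 and the second is nonzero, so adjacent buckets would share a cache line in the second case.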
Evan Hunt
5d126d8081 [9.20] fix: usr: Don't enforce NOAUTH/NOCONF flags in DNSKEYs
All DNSKEY keys are able to authenticate. The `DNS_KEYTYPE_NOAUTH` (and `DNS_KEYTYPE_NOCONF`) flags were defined for the KEY rdata type, and are not applicable to DNSKEY. Previously, however, because the DNSKEY implementation was built on top of KEY, the `_NOAUTH` flag prevented authentication in DNSKEYs as well. This has been corrected.

Closes #5240

Backport of MR !10261

Merge branch 'backport-5240-ignore-noauth-flag-9.20' into 'bind-9.20'

See merge request isc-projects/bind9!10315
2025-03-25 07:23:26 +00:00
Mark Andrews
4a1ebbedad DNS_KEYTYPE_NOKEY is only applicable to KEY
(cherry picked from commit 53c6721abc49746d91e61a5bb2cbbea24d64dd72)
2025-03-24 23:52:02 -07:00
Evan Hunt
080299bf49 Don't check DNS_KEYFLAG_NOAUTH
All DNSKEY keys are able to authenticate. The DNS_KEYTYPE_NOAUTH
(and DNS_KEYTYPE_NOCONF) flags were defined for the KEY rdata type,
and are not applicable to DNSKEY.

Previously, because the DNSKEY implementation was built on top of
KEY, the NOAUTH flag prevented authentication in DNSKEYs as well.
This has been corrected.

(cherry picked from commit 5c21576f82f9f62c2e22aac920a37a4013ac3a80)
2025-03-24 23:52:02 -07:00
Evan Hunt
dc1ddd3e8a Tidy up keyvalue.h definitions
Use enums for DNS_KEYFLAG_, DNS_KEYTYPE_, DNS_KEYOWNER_, DNS_KEYALG_,
and DNS_KEYPROTO_ values.

Remove values that are never used.

Eliminate the obsolete DNS_KEYFLAG_SIGNATORYMASK. Instead, add three
more RESERVED bits for the key flag values that it covered but which
were never used.

(cherry picked from commit fee1ba40df939f25fc9258b2681a1a2bd7965f5d)
2025-03-25 06:40:49 +00:00
Evan Hunt
42ab4fce4a [9.20] rem: dev: Remove dns_qpmulti_lockedread declaration
This function was removed in 6217e434b57bd5d60ed69f792ae9a1a65a008f57 but not from the header file.

Backport of MR !10308

Merge branch 'backport-matthijs-remove-unused-qpmulti-lockedread-9.20' into 'bind-9.20'

See merge request isc-projects/bind9!10314
2025-03-25 06:37:14 +00:00
Matthijs Mekking
c0e92c6df9 Remove dns_qpmulti_lockedread declaration
This function was removed in 6217e434b57bd5d60ed69f792ae9a1a65a008f57
but not from the header file.

(cherry picked from commit 2c52aea3dc4093dfddc704e1b173f8f38543b4c0)
2025-03-25 06:02:17 +00:00
Michał Kępień
de2f0de267 [9.20] chg: test: Use isctest.asyncserver in the "upforwd" test
Replace the custom DNS server used in the "upforwd" system test with new
code based on the isctest.asyncserver module.  The ans4 server currently
used in that test is a copy of bin/tests/system/ans.pl modified to
receive queries over UDP and TCP without ever responding to any of them.

Closes #5012

Backport of MR !10283

Merge branch 'backport-5012-upforwd-asyncserver-9.20' into 'bind-9.20'

See merge request isc-projects/bind9!10312
2025-03-25 04:46:58 +00:00
Michał Kępień
785e6bc9d9 Use isctest.asyncserver in the "upforwd" test
Replace the custom DNS server used in the "upforwd" system test with new
code based on the isctest.asyncserver module.  The ans4 server currently
used in that test is a copy of bin/tests/system/ans.pl modified to
receive queries over UDP and TCP without ever responding to any of them.

(cherry picked from commit a8878cf35d6ea35f5580bf880a628889f885993f)
2025-03-25 04:08:28 +00:00
Michał Kępień
d4a59f9cd3 Add a response handler for ignoring all queries
Dropping all incoming queries is a typical use case for a custom server
used in BIND 9 system tests.  Add a response handler implementing that
behavior so that it can be reused.

(cherry picked from commit f24a534ff1b7be611ab320b041b58103d5607eae)
2025-03-25 04:08:28 +00:00
Michał Kępień
03756c8e05 Make response handlers global by default
Instead of requiring each class inheriting from ResponseHandler to
define its match() method, make the latter non-abstract and default to
returning True for all queries.  This will reduce the amount of
boilerplate code in custom servers.

(cherry picked from commit 75567f86ca66f7aa598ccb6c093af8224e5e8753)
2025-03-25 04:08:28 +00:00
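The effect of a non-abstract match() can be sketched as follows. This is a simplified illustration, not the actual isctest.asyncserver code: the class bodies, the string-based query type, and the respond() method are assumptions made to keep the example self-contained.

```python
class ResponseHandler:
    """Hypothetical base class; the real isctest.asyncserver API differs."""

    def match(self, query: str) -> bool:
        # Non-abstract default: the handler applies to every query.
        return True

    def respond(self, query: str) -> str:
        raise NotImplementedError


class EchoHandler(ResponseHandler):
    # No match() override is needed; the "global" default is inherited.
    def respond(self, query: str) -> str:
        return f"echo:{query}"


class ExampleOnlyHandler(ResponseHandler):
    # A handler that wants to be selective still overrides match().
    def match(self, query: str) -> bool:
        return query.endswith("example.")

    def respond(self, query: str) -> str:
        return f"example:{query}"
```

Handlers that should apply to all queries shrink to a single method, which is the boilerplate reduction the commit message describes.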
Mark Andrews
7ebcc54d3b [9.20] fix: dev: Fix adbname reference
Call `dns_adbname_ref` before calling `dns_resolver_createfetch` to
ensure `adbname->name` remains stable for the life of the fetch.

Closes #5239

Backport of MR !10290

Merge branch 'backport-5239-fix-adb-reference-counting-9.20' into 'bind-9.20'

See merge request isc-projects/bind9!10303
2025-03-21 01:19:41 +00:00
Mark Andrews
db113bc5ad Fix gaining adbname reference
Call dns_adbname_ref before calling dns_resolver_createfetch to
ensure adbname->name remains stable for the life of the fetch.

(cherry picked from commit 8e7229f6411c193dd888fe63dac298cdf37e2099)
2025-03-21 00:29:45 +00:00
Matthijs Mekking
3a78a4c288 [9.20] fix: usr: Fix several small DNSSEC timing issues
The following small issues related to `dnssec-policy` have been fixed:
- In some cases the key manager inside BIND 9 could run every hour, while it could have run less often.
- While `CDS` and `CDNSKEY` records will be removed correctly from the zone when the corresponding `DS` record needs to be updated, the timing metadata indicating when this is expected to happen was never set.
- There were a couple of cases where the safety intervals are added inappropriately, delaying key rollovers longer than necessary.
- If you have identical `keys` in your `dnssec-policy`, they may be retired inappropriately. Note that having keys with identical properties is discouraged in all cases.

Closes #5242

Backport of MR !10251

Merge branch 'backport-5242-several-keymgr-issues-9.20' into 'bind-9.20'

See merge request isc-projects/bind9!10301
2025-03-20 13:57:51 +00:00
Matthijs Mekking
5cb7c19c23 Update Retired and Removed if we update lifetime
If we are updating the lifetime, and it was not set before, also
set/update the Retired and Removed timing metadata.

(cherry picked from commit 3e836a87e6ffd2afb7103c2a7f06f2ef5be748d1)
2025-03-20 13:57:45 +00:00
Matthijs Mekking
4be38b606a Fix a key generation issue in the tests
The dnssec-keygen command for the ZSK generation for the zone
multisigner-model2.kasp was wrong: no ZSK was generated in the setup
script, but when 'named' was started, the missing ZSK was created
anyway by 'dnssec-policy'.

(cherry picked from commit b93cb2e80e222ff610d0403262fec841e0c7a699)
2025-03-20 13:57:45 +00:00
Matthijs Mekking
3de8fa8709 Fix keymgr bug wrt setting the next time
Only set the next time the keymgr should run if the value is nonzero.
Otherwise we default back to one hour. This may happen if there are one
or more keys with an unlimited lifetime.

(cherry picked from commit 6c6b8796d3a7577c5954378a8cbd7449703fb691)
2025-03-20 13:57:45 +00:00
Matthijs Mekking
ac8efcbf14 keymgr: also set DeleteCDS when setting PublishCDS
The keymgr never set the expected timing metadata when CDS/CDNSKEY
records for the corresponding key will be removed from the zone. This
is not troublesome, as key states dictate when this happens, but with
the new pytest we use the timing metadata to determine if the CDS and/or
CDNSKEY for the given key needs to be published.

(cherry picked from commit 8c9d2eb2bf588b2e2dee39986963d03a1edac391)
2025-03-20 13:57:45 +00:00
Matthijs Mekking
04054bcb9a Fix wrong usage of safety intervals in keymgr
There are a couple of cases where the safety intervals are added
inappropriately:

1. When setting the PublishCDS/SyncPublish timing metadata, we don't
   need to add the publish-safety value if we are calculating the time
   when the zone is completely signed for the first time. This value
   is for when the DNSKEY has been published and we add a safety
   interval before considering the DNSKEY omnipresent.

2. The retire-safety value should only be added to ZSK rollovers if
   there is an actual rollover happening, similar to adding the sign
   delay.

3. The retire-safety value should only be added to KSK rollovers if
   there is an actual rollover happening. We consider the new DS
   omnipresent a bit later, so that we are forced to keep the old DS
   a bit longer.

(cherry picked from commit 63edc4435f8ddefbbabbf9731f2b44d59d68c40b)
2025-03-20 13:57:45 +00:00
Matthijs Mekking
147ab68dc1 Fix a small keymgr bug
While converting the kasp system test to pytest, I encountered a small
bug in the keymgr code. We retire keys when there is more than one
key matching a 'keys' line from the dnssec-policy. But if there are
multiple identical 'keys' lines, as is the case for the test zone
'checkds-doubleksk.kasp', we retire one of the two keys that have the
same properties.

Fix this by checking if there are double matches. This is not foolproof
because there may be many keys for a few identical 'keys' lines, but it
is good enough for now. In practice it makes no sense to have a policy
that dictates multiple keys with identical properties.

(cherry picked from commit ef671919d539d3cc41b2fbd276cae0ef017d2891)
2025-03-20 13:57:45 +00:00
Matthijs Mekking
8f78219cc1 [9.20] fix: usr: Ensure max-clients-per-query is at least clients-per-query
If the `max-clients-per-query` option is set to a lower value than `clients-per-query`, the value is adjusted to match `clients-per-query`.

Closes #5224

Backport of MR !10241

Merge branch 'backport-5224-raise-max-clients-per-query-to-be-at-least-9.20' into 'bind-9.20'

See merge request isc-projects/bind9!10244
2025-03-20 13:57:03 +00:00
Matthijs Mekking
c5b8e1f5a1 Raise max-clients-per-query to be at least
In the case where 'clients-per-query' is larger than
'max-clients-per-query', raise 'max-clients-per-query' so that
'clients-per-query' equals 'max-clients-per-query' and log a warning
that this is what happened.

(cherry picked from commit f6f9645ed14660225786bd1eeae2b8345ad38b6d)
2025-03-20 09:08:25 +00:00
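The adjustment described above amounts to a clamp plus a warning. The sketch below is written in Python for brevity and is not the named implementation; the function name, logger name, and warning text are assumptions, and special-case option values such as 0 are deliberately not modeled.

```python
import logging
from typing import Tuple

logger = logging.getLogger("config-sketch")  # hypothetical logger name


def adjust_clients_per_query(clients: int, max_clients: int) -> Tuple[int, int]:
    """Return (clients-per-query, max-clients-per-query) after adjustment."""
    if max_clients < clients:
        # Raise the maximum so that it equals clients-per-query, and say so.
        logger.warning(
            "max-clients-per-query (%d) is lower than clients-per-query (%d); "
            "raising it to %d",
            max_clients, clients, clients,
        )
        max_clients = clients
    return clients, max_clients
```

For example, `adjust_clients_per_query(10, 5)` returns `(10, 10)` and logs a warning, while `adjust_clients_per_query(10, 100)` returns its inputs unchanged.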
Matthijs Mekking
41cc6eeaaf Test new max-clients-per-query log warning
Make sure the new warning is logged.

(cherry picked from commit 1f674ef42eda5d55d113b3e05e5e638a27af703d)
2025-03-20 09:08:25 +00:00
Matthijs Mekking
15922a507d Update max-clients-per-query documentation
The new intended behavior is that 'max-clients-per-query' value is
raised to equal 'clients-per-query' if it is lower.

(cherry picked from commit f50753f303e8969610f28f3a64f81be4b5f5594b)
2025-03-20 09:08:25 +00:00
Mark Andrews
5de1b3ba3c [9.20] fix: usr: Fix write after free in validator code
Raw integer pointers were being used for the validator's nvalidations
and nfails values but the memory holding them could be freed before
they ceased to be used.  Use reference counted counters instead.

Closes #5239

Backport of MR !10248

Merge branch 'backport-5239-use-counter-for-nvalidations-and-nfailss-9.20' into 'bind-9.20'

See merge request isc-projects/bind9!10300
2025-03-20 03:45:07 +00:00
Mark Andrews
1f136f24a4 Use reference counted counters for nfail and nvalidations
The fetch context that held these values could be freed while there
were still active pointers to the memory.  Using a reference counted
pointer avoids this.

(cherry picked from commit bfbaacc9a0466395df6dafd2ddddfd9a53698187)
2025-03-20 01:30:43 +00:00
Andoni Duarte Pintado
b5c58fe6c0 Merge tag 'v9.20.7' into bind-9.20
2025-03-19 17:33:24 +01:00
Aram Sargsyan
1d8334a62a [9.20] fix: usr: Fix resolver statistics counters for timed out responses
When query responses timed out, the resolver could incorrectly increase the regular responses counters, even if no response was received. This has been fixed.

Closes #5193

Backport of MR !10227

Merge branch 'backport-5193-resolver-statistics-counters-fix-9.20' into 'bind-9.20'

See merge request isc-projects/bind9!10287
2025-03-19 11:19:39 +00:00
Aram Sargsyan
9c6fda031d Test resolver statistics when responses time out
Add a test to check that the timed out responses do not skew the
normal responses statistics counters.

(cherry picked from commit 0c7fa8d572bf3e742a627ff660175683e131908b)
2025-03-19 09:51:33 +00:00
Aram Sargsyan
afef69cab0 Fix the resolver's RTT-ranged responses statistics counters
When a response times out, the fctx_cancelquery() function
incorrectly counts it in the 'dns_resstatscounter_queryrtt5'
counter (i.e. >=1600 ms). To avoid this, the rctx_timedout()
function should make sure that 'rctx->finish' is NULL. And in order
to adjust the RTT values for the timed out server, 'rctx->no_response'
should be true. Update the rctx_timedout() function to make those
changes.

(cherry picked from commit 830e54811168bc3e69db93baf6132c18f3452f92)
2025-03-19 09:51:33 +00:00
Aram Sargsyan
2a4bbf1d2e Fix resolver responses statistics counter
The resquery_response() function increases the response counter without
checking if the response was successful. Increase the counter only when
the result indicates success.

(cherry picked from commit 12e7dfa397c92807bdc4e6f55918d46eb15e0600)
2025-03-19 09:51:33 +00:00
Michał Kępień
a492fb9963 [9.20] chg: test: asyncserver.py: TCP improvements
This branch started off as `michal/upforwd-asyncserver`.  It quickly
turned out that the critical `asyncserver.py` change that was needed for
the `upforwd` system test was for the server to be able to read multiple
TCP queries on a single connection.  As currently present in `main`,
`asyncserver.py` closes every client connection after servicing a single
query.  Retaining that behavior would cause the `upforwd` system test to
fail and, in general, capturing all data sent by a client seems more
useful in tests than just closing connections quickly.  `asyncserver.py`
can always be extended in the future (e.g. by adding a new
`ResponseAction` that the networking code would react to) to reinstate
the original behavior, if it turns out to be necessary.

While working on changing that particular `asyncserver.py` behavior, I
noticed a couple of other deficiencies in the TCP connection handling
code, so I started addressing them.  One thing led to another and before
I noticed, enough changes were applied to be worth doing a separate
merge request, particularly given that the actual rewrite of
`upforwd/ans4/ans.pl` using `asyncserver.py` is trivial once the
required changes to `asyncserver.py` itself are applied.

Backport of MR !10276

Merge branch 'backport-michal/asyncserver-tcp-improvements-9.20' into 'bind-9.20'

See merge request isc-projects/bind9!10284
2025-03-19 02:59:14 +00:00
Michał Kępień
54494a1368 Handle queries indefinitely on each TCP connection
Instead of closing every incoming TCP connection after handling a single
query, continue receiving queries on each TCP connection until the
client disconnects itself.  When coupled with response dropping, this
enables silently receiving all incoming data, simulating an unresponsive
server.

(cherry picked from commit 575a8745822ea4da706e8c7a93ad234d04b3cd03)
2025-03-18 15:33:33 +00:00
Michał Kępień
96766f3d29 Enable receiving chunked TCP DNS messages
A TCP DNS client may send its queries in chunks, causing
StreamReader.read() to return less data than previously declared by the
client as the DNS message length; even the two-octet DNS message length
itself may be split up into two single-octet transmissions.  Sending
data in chunks is valid client behavior that should not be treated as an
error.  Add a new helper method for reading TCP data in a loop, properly
distinguishing between chunked queries and client disconnections.  Use
the new method for reading all TCP data from clients.

(cherry picked from commit 68fe9a5df5c5298413449771c062f85e4b1b9ef3)
2025-03-18 15:33:33 +00:00
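The looped read described above can be sketched with asyncio streams. This is a minimal illustration and not the helper added to asyncserver.py: the function names are hypothetical, and returning None on disconnection is an assumed convention. It relies on StreamReader.read() returning an empty bytes object at EOF and on the two-octet length prefix that DNS uses over TCP.

```python
import asyncio
import struct
from typing import Optional


async def read_exactly(reader: asyncio.StreamReader, size: int) -> Optional[bytes]:
    """Read `size` bytes, tolerating chunked delivery; None if the peer disconnects."""
    buf = b""
    while len(buf) < size:
        chunk = await reader.read(size - len(buf))
        if not chunk:  # EOF before all expected data arrived: client disconnected
            return None
        buf += chunk  # short read: keep looping until the message is complete
    return buf


async def read_dns_message(reader: asyncio.StreamReader) -> Optional[bytes]:
    """Read one length-prefixed DNS message from a TCP stream."""
    header = await read_exactly(reader, 2)
    if header is None:
        return None
    (length,) = struct.unpack("!H", header)
    return await read_exactly(reader, length)


async def demo() -> Optional[bytes]:
    reader = asyncio.StreamReader()

    async def feed() -> None:
        # Deliver the length prefix one octet at a time, then a split payload.
        for piece in (b"\x00", b"\x05", b"he", b"llo"):
            reader.feed_data(piece)
            await asyncio.sleep(0)
        reader.feed_eof()

    feeder = asyncio.create_task(feed())
    message = await read_dns_message(reader)
    await feeder
    return message


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

The standard library also provides StreamReader.readexactly(), which raises IncompleteReadError on a short read; an explicit loop like the one above makes it easier to log and treat a disconnection as a normal event rather than an exception.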
Michał Kępień
3d80d9778b Extend TCP logging
Emit more log messages from TCP connection handling code and extend
existing ones to improve debuggability of servers using asyncserver.py.

(cherry picked from commit 8c3f673f3777046e3d0afef8ffef6c86548ba8de)
2025-03-18 15:33:33 +00:00
Michał Kępień
3a1c0dba80 Handle connection resets during reading
A TCP peer may reset the connection at any point, but asyncserver.py
currently only handles connection resets when it is sending data to the
client.  Handle connection resets during reading in the same way.

(cherry picked from commit 748ed4259b66e4b33acf1d2584dc92da00d31aec)
2025-03-18 15:33:33 +00:00
Michał Kępień
7178efbf47 Refactor AsyncDnsServer._handle_tcp()
Split up AsyncDnsServer._handle_tcp() into a set of smaller methods to
improve code readability.

(cherry picked from commit a956947fbab189670db276ee352bc5d77f0a80b0)
2025-03-18 15:33:33 +00:00
Michał Kępień
cb9420b8cf Gracefully handle TCP client disconnections
Prevent premature client disconnections during reading from triggering
unhandled exceptions in TCP connection handling code.

(cherry picked from commit e4c3186a7ccce317a3319406dfe85c3722983a11)
2025-03-18 15:33:33 +00:00
Michał Kępień
5316ccf083 Simplify peer address formatting
Add a helper class, Peer, which holds the <host, port> tuple of a
connection endpoint and gets pretty-printed when formatted as a string.
This enables passing instances of this new class directly to logging
functions, eliminating the need for the AsyncDnsServer._format_peer()
helper method.

(cherry picked from commit 5764a9d66069f9351f9acc811796cd67d65d62c7)
2025-03-18 15:33:33 +00:00
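A helper of this kind might look like the sketch below. It is an assumption-laden illustration rather than the class from asyncserver.py: the field names and the "host:port" output format are guesses made for the example.

```python
import logging
from dataclasses import dataclass


@dataclass(frozen=True)
class Peer:
    """Hypothetical <host, port> holder that pretty-prints itself."""

    host: str
    port: int

    def __str__(self) -> str:
        # Used automatically by "%s" formatting, including in logging calls.
        return f"{self.host}:{self.port}"


if __name__ == "__main__":
    logging.basicConfig(level=logging.INFO)
    peer = Peer("10.53.0.4", 5300)
    # The instance is passed directly; no separate formatting helper is needed.
    logging.info("Accepted connection from %s", peer)
```

Because logging defers string formatting, the instance can be handed to the logger as an argument and is only converted to text if the message is actually emitted.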
Nicki Křížek
429be769dd [9.20] chg: ci: Allow re-run of the shotgun jobs to reduce false positives
The false positive rate is about 10-20 % when evaluating shotgun results
from a single run. Attempt to reduce the false positive rate by allowing
a re-run of failed jobs.

Backport of MR !10271

Merge branch 'backport-nicki/ci-shotgun-reduce-false-positives-9.20' into 'bind-9.20'

See merge request isc-projects/bind9!10279
2025-03-18 12:26:40 +00:00
Nicki Křížek
020f301a5c Allow re-run of the shotgun jobs to reduce false positive
The false positive rate is about 10-20 % when evaluating shotgun results
from a single run. Attempt to reduce the false positive rate by allowing
a re-run of failed jobs.

While there is a slight risk that barely noticeable decreases in
performance might slip by more easily in MRs, they'd still likely pop up
during nightly or pre-release testing.

Also increase the tolerance threshold for DoH latency comparisons, as
those tests often experience increased jitter in the tail end latencies.

(cherry picked from commit 5eab352478623eef57008a274c3a6505d9c76390)
2025-03-18 09:30:05 +00:00
Nicki Křížek
7e6120e511 Adjust the load factor for shotgun:tcp test
With the slightly decreased load for the TCP test, the results appear to
be a little bit more stable.

(cherry picked from commit 7f8226a039b82d587114ee66662c05c673f0d87a)
2025-03-18 09:30:05 +00:00
Michał Kępień
eaea8c751f [9.20] chg: test: Use isctest.asyncserver in the "qmin" test
Replace custom DNS servers used in the "qmin" system test with new code
based on the isctest.asyncserver module.  The revised code employs zone
files and a limited amount of custom logic, which massively improves
test readability and maintainability, extends logging, and fixes
non-compliant replies sent by some of the custom servers in response to
certain queries (e.g. AA=0 in authoritative empty non-terminal
responses, non-glue address records in ADDITIONAL section).

Backport of MR !10195

Merge branch 'backport-michal/qmin-asyncserver-9.20' into 'bind-9.20'

See merge request isc-projects/bind9!10275
2025-03-18 06:39:36 +00:00
Michał Kępień
c5ae1a7f54 Broaden vulture exclude glob for ans.py servers
The vulture tool seems to be unable to follow how the parent classes
defined in bin/tests/system/qmin/qmin_ans.py use mandatory properties
specified by child classes in bin/tests/system/qmin/ans*/ans.py.  Make
the tool ignore not just ans.py servers, but also *_ans.py utility
modules above the ansX/ subdirectories to prevent false positives about
unused code from causing CI pipeline failures.

(cherry picked from commit dfd37918d6913b783ead915d608b5951386f5974)
2025-03-18 07:03:32 +01:00
Michał Kępień
5a26c218ac Ignore .hypothesis files created by system tests
Some versions of the Hypothesis Python library - notably the one
included in stock OS repositories for Ubuntu 20.04 Focal Fossa - cause a
.hypothesis file to be created in a Python script's working directory
when the hypothesis module is present in its import chain.  Ignore such
files by adding them to the list of expected test artifacts to prevent
pytest teardown checks from failing due to these files appearing in the
file system after running system tests.

(cherry picked from commit f413ddbe5f2edfdeedc41603dcd2afe105ed2844)
2025-03-18 07:03:32 +01:00
Michał Kępień
0f53c1c6e5 Fix PYTHONPATH set for ans.py servers by start.pl
Commit 6c010a5644324947c8c13b5600cd8d988ae7684f caused the PYTHONPATH
environment variable to be set for ans.py servers started using
start.pl.  However, no system test has actually used the new
isctest.asyncserver module since that change was applied, so it has not
been noticed until now that including the source directory in PYTHONPATH
is only sufficient for in-tree builds.  Include the build directory
instead of the source directory in the PYTHONPATH environment variable
set for ans.py servers started by start.pl so that they work correctly
for both in-tree and out-of-tree builds.

(cherry picked from commit a799dd04adc08a062ec9961a026573abcc7c9181)
2025-03-18 07:03:32 +01:00
Michał Kępień
7b456deec3 Use isctest.asyncserver in the "qmin" test
Replace custom DNS servers used in the "qmin" system test with new code
based on the isctest.asyncserver module.  The revised code employs zone
files and a limited amount of custom logic, which massively improves
test readability and maintainability, extends logging, and fixes
non-compliant replies sent by some of the custom servers in response to
certain queries (e.g. AA=0 in authoritative empty non-terminal
responses, non-glue address records in ADDITIONAL section).

(cherry picked from commit 7faa34c6ee40653eeec23ef2df8093564cfc1891)
2025-03-18 07:03:32 +01:00
Michal Nowak
6f0d1551e2 [9.20] chg: ci: Disable linkcheck on dl.acm.org
The check has been failing with the following error for some time:

    403 Client Error: Forbidden for url: https://dl.acm.org/doi/10.1145/1315245.1315298

Backport of MR !10272

Merge branch 'backport-mnowak/linkcheck-disable-dl-acm-org-9.20' into 'bind-9.20'

See merge request isc-projects/bind9!10273
2025-03-17 17:26:05 +00:00