mirror of
https://gitlab.isc.org/isc-projects/bind9
synced 2025-08-28 21:17:54 +00:00
new draft
This commit is contained in:
parent
5af560664d
commit
25ae9d2f82
449
doc/draft/draft-ietf-idn-uri-02.txt
Normal file
449
doc/draft/draft-ietf-idn-uri-02.txt
Normal file
@ -0,0 +1,449 @@
|
||||
|
||||
|
||||
Network Working Group M. Duerst
|
||||
Internet-Draft W3C/Keio University
|
||||
Expires: December 30, 2002 July 1, 2002
|
||||
|
||||
|
||||
Internationalized Domain Names in URIs
|
||||
draft-ietf-idn-uri-02
|
||||
|
||||
Status of this Memo
|
||||
|
||||
This document is an Internet-Draft and is in full conformance with
|
||||
all provisions of Section 10 of RFC2026.
|
||||
|
||||
Internet-Drafts are working documents of the Internet Engineering
|
||||
Task Force (IETF), its areas, and its working groups. Note that
|
||||
other groups may also distribute working documents as Internet-
|
||||
Drafts.
|
||||
|
||||
Internet-Drafts are draft documents valid for a maximum of six months
|
||||
and may be updated, replaced, or obsoleted by other documents at any
|
||||
time. It is inappropriate to use Internet-Drafts as reference
|
||||
material or to cite them other than as "work in progress."
|
||||
|
||||
The list of current Internet-Drafts can be accessed at http://
|
||||
www.ietf.org/ietf/1id-abstracts.txt.
|
||||
|
||||
The list of Internet-Draft Shadow Directories can be accessed at
|
||||
http://www.ietf.org/shadow.html.
|
||||
|
||||
This Internet-Draft will expire on December 30, 2002.
|
||||
|
||||
Copyright Notice
|
||||
|
||||
Copyright (C) The Internet Society (2002). All Rights Reserved.
|
||||
|
||||
Abstract
|
||||
|
||||
This document proposes to upgrade the definition of URIs (RFC 2396)
|
||||
[RFC2396] to work consistently with internationalized domain names.
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
Duerst Expires December 30, 2002 [Page 1]
|
||||
|
||||
Internet-Draft IDNs in URIs July 2002
|
||||
|
||||
|
||||
Table of Contents
|
||||
|
||||
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . 3
|
||||
2. URI syntax changes . . . . . . . . . . . . . . . . . . . . . . 3
|
||||
3. Security considerations . . . . . . . . . . . . . . . . . . . 5
|
||||
4. Change Log . . . . . . . . . . . . . . . . . . . . . . . . . . 5
|
||||
4.1 Changes from draft-ietf-idn-uri--01 to draft-ietf-idn-uri-02 . 5
|
||||
4.2 Changes from draft-ietf-idn-uri--00 to draft-ietf-idn-uri-01 . 5
|
||||
References . . . . . . . . . . . . . . . . . . . . . . . . . . 5
|
||||
Author's Address . . . . . . . . . . . . . . . . . . . . . . . 7
|
||||
Full Copyright Statement . . . . . . . . . . . . . . . . . . . 8
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
Duerst Expires December 30, 2002 [Page 2]
|
||||
|
||||
Internet-Draft IDNs in URIs July 2002
|
||||
|
||||
|
||||
1. Introduction
|
||||
|
||||
Internet domain names serve to identify hosts and services on the
|
||||
Internet in a convenient way. The IETF IDN working group [IDNWG] has
|
||||
been working on extending the character repertoire usable in domain
|
||||
names beyond a subset of US-ASCII.
|
||||
|
||||
One of the most important places where domain names appear are
|
||||
Uniform Resource Identifiers (URIs, [RFC2396], as modified by
|
||||
[RFC2732]). However, in the current definition of the generic URI
|
||||
syntax, the restrictions on domain names are 'hard-coded'. In
|
||||
Section 2, this document relaxes these restrictions by updating the
|
||||
syntax, and defines how internationalized domain names are encoded in
|
||||
URIs.
|
||||
|
||||
The syntax in this document has been choosen to further increase the
|
||||
uniformity of URI syntax, which is a very important principle of
|
||||
URIs.
|
||||
|
||||
In practice, escaped domanin names should be used as rarely as
|
||||
possible. Wherever possible, the actual characters in
|
||||
Internationalized Domain Names should be preserved as long as
|
||||
possible by using IRIs [IRI] rather than URIs, and only converting to
|
||||
URIs and then to ACE-encoded [IDNA] domain names (or ideally directly
|
||||
to ACE-encoding without even using URIs) when resolving the IRI.
|
||||
Also, this document does in no way exclude the use of ACE encoding
|
||||
directly in an URI domain name part. ACE encoding may be used
|
||||
directly in an URI domain name part if this is considered necessary
|
||||
for interoperability.
|
||||
|
||||
Please note that even with the definition of URIs in [RFC2396], some
|
||||
URIs can already contain host names with escaped characters. For
|
||||
example, mailto:example@w%33.org is legal per [RFC2396] because the
|
||||
mailto: URI scheme does not follow the generic syntax of [RFC2396].
|
||||
|
||||
2. URI syntax changes
|
||||
|
||||
The syntax of URIs [RFC2396] currently contains the following rules
|
||||
relevant to domain names:
|
||||
|
||||
hostname = *( domainlabel "." ) toplabel [ "." ]
|
||||
domainlabel = alphanum | alphanum *( alphanum | "-" ) alphanum
|
||||
toplabel = alpha | alpha *( alphanum | "-" ) alphanum
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
Duerst Expires December 30, 2002 [Page 3]
|
||||
|
||||
Internet-Draft IDNs in URIs July 2002
|
||||
|
||||
|
||||
The later two rules are changed as follows:
|
||||
|
||||
domainlabel = anchar | anchar *( anchar | "-" ) anchar
|
||||
toplabel = achar | achar *( anchar | "-" ) anchar
|
||||
|
||||
and the following rules are added:
|
||||
|
||||
anchar = alphanum | escaped
|
||||
achar = alpha | escaped
|
||||
|
||||
Characters outside the repertoire (alphanum) are encoded by first
|
||||
encoding the characters in UTF-8 [RFC 2279], resulting in a sequence
|
||||
of octets, and then escaping these octets according to the rules
|
||||
defined in [RFC2396].
|
||||
|
||||
Using UTF-8 assures that this encoding interoperates with IRIs [IRI].
|
||||
It is also aligned with the recommendations in [RFC2277] and
|
||||
[RFC2718], and is consistent with the URN syntax [RFC2141] as well as
|
||||
recent URL scheme definitions that define encodings of non-ASCII
|
||||
characters based on UTF-8 (e.g., IMAP URLs [RFC2192] and POP URLs
|
||||
[RFC2384]).
|
||||
|
||||
The above syntax rules permit for domain names that are neither
|
||||
permitted as US-ASCII only domain names nor as internationalized
|
||||
domain names. However, such syntax should never be used, and will
|
||||
always be rejected by resolvers. For US-ASCII only domain names, the
|
||||
syntax rules in [RFC2396] are relevant. For example, http://
|
||||
www.w%33.org is legal, because the corresponding 'w3' is a legal
|
||||
'domainlabel' according to [RFC2396]. However, http://
|
||||
%2a.example.org is illegal because the corresponding '*' is not a
|
||||
legal 'domainlabel' according to [RFC2396]. For domain names
|
||||
containing non-ASCII characters, the legal domain names are those for
|
||||
which the ToASCII operation ([IDNA], [Nameprep]; using the unescaped
|
||||
UTF-8 values as input) is successful.
|
||||
|
||||
For consistency in comparison operations and for interoperability
|
||||
with older software, the following should be noted: 1) US-ASCII
|
||||
characters in domain names should not be escaped. 2) Because of the
|
||||
principle of syntax uniformity for URIs, it is always more prudent to
|
||||
take into account the possibility that US-ASCII characters are
|
||||
escaped.
|
||||
|
||||
The work of the IDN WG includes some procedures for name preparation
|
||||
[Nameprep]. Before encoding an internationalized domain name in an
|
||||
URI, this preparation step SHOULD be applied. However, the URI
|
||||
resolver MUST also apply any steps required as part of domain name
|
||||
resolution by [IDNA].
|
||||
|
||||
|
||||
|
||||
|
||||
Duerst Expires December 30, 2002 [Page 4]
|
||||
|
||||
Internet-Draft IDNs in URIs July 2002
|
||||
|
||||
|
||||
3. Security considerations
|
||||
|
||||
The security considerations of [RFC2396] and those applying to
|
||||
internationalized domain names apply. There may be an increased
|
||||
potential to smuggle escaped US-ASCII-based domain names across
|
||||
firewalls, although because of the uniform syntax principle for URIs,
|
||||
such a potential is already existing.
|
||||
|
||||
4. Change Log
|
||||
|
||||
4.1 Changes from draft-ietf-idn-uri--01 to draft-ietf-idn-uri-02
|
||||
|
||||
Moved change log to back
|
||||
|
||||
Changed to only change URIs; IRI syntax updated directly in IRI
|
||||
draft.
|
||||
|
||||
Removed syntax restriction on %hh in the US-ASCII part, but made
|
||||
clear that restrictions to domain names apply.
|
||||
|
||||
Made clear that escaped domain names in URIs should only be an
|
||||
intermediate representation.
|
||||
|
||||
Gave example of mailto: as already allowing escaped host names.
|
||||
|
||||
4.2 Changes from draft-ietf-idn-uri--00 to draft-ietf-idn-uri-01
|
||||
|
||||
Changed requirement for URI/IRI resolvers from MUST to SHOULD
|
||||
|
||||
Changed IRI syntax slightly (ichar -> idchar, based on changes in
|
||||
[IRI])
|
||||
|
||||
Various wording changes
|
||||
|
||||
References
|
||||
|
||||
[IDNA] Faltstrom, P., Hoffman, P. and A. Costello,
|
||||
"Internationalizing Domain Names in Applications (IDNA)",
|
||||
draft-ietf-idn-idna-09.txt (work in progress), May 2002,
|
||||
<http://www.ietf.org/internet-drafts/draft-ietf-idn-idna-
|
||||
09.txt>.
|
||||
|
||||
[IDNWG] "IETF Internationalized Domain Name (idn) Working Group".
|
||||
|
||||
[IRI] Duerst, M. and M. Suignard, "Internationalized Resource
|
||||
Identifiers (IRI)", draft-duerst-iri-01 (work in
|
||||
progress), July 2002.
|
||||
|
||||
|
||||
|
||||
|
||||
Duerst Expires December 30, 2002 [Page 5]
|
||||
|
||||
Internet-Draft IDNs in URIs July 2002
|
||||
|
||||
|
||||
[ISO10646] International Organization for Standardization,
|
||||
"Information Technology - Universal Multiple-Octet Coded
|
||||
Character Set (UCS) - Part 1: Architecture and Basic
|
||||
Multilingual Plane", ISO Standard 10646-1, October 2000.
|
||||
|
||||
[Nameprep] Hoffman, P. and M. Blanchet, "Nameprep: A Stringprep
|
||||
Profile for Internationalized Domain Names", draft-ietf-
|
||||
idn-nameprep-10.txt (work in progress), May 2002, <http:/
|
||||
/www.ietf.org/internet-drafts/draft-ietf-idn-nameprep-
|
||||
10.txt>.
|
||||
|
||||
[RFC2119] Bradner, S., "Key words for use in RFCs to Indicate
|
||||
Requirement Levels", BCP 14, RFC 2119, March 1997.
|
||||
|
||||
[RFC2141] Moats, R., "URN Syntax", RFC 2141, May 1997.
|
||||
|
||||
[RFC2192] Newman, C., "IMAP URL Scheme", RFC 2192, September 1997.
|
||||
|
||||
[RFC2277] Alvestrand, H., "IETF Policy on Character Sets and
|
||||
Languages", BCP 18, RFC 2277, January 1998.
|
||||
|
||||
[RFC2279] Yergeau, F., "UTF-8, a transformation format of ISO
|
||||
10646", RFC 2279, January 1998.
|
||||
|
||||
[RFC2384] Gellens, R., "POP URL Scheme", RFC 2384, August 1998.
|
||||
|
||||
[RFC2396] Berners-Lee, T., Fielding, R. and L. Masinter, "Uniform
|
||||
Resource Identifiers (URI): Generic Syntax", RFC 2396,
|
||||
August 1998.
|
||||
|
||||
[RFC2640] Curtin, B., "Internationalization of the File Transfer
|
||||
Protocol", RFC 2640, July 1999.
|
||||
|
||||
[RFC2718] Masinter, L., Alvestrand, H., Zigmond, D. and R. Petke,
|
||||
"Guidelines for new URL Schemes", RFC 2718, November
|
||||
1999.
|
||||
|
||||
[RFC2732] Hinden, R., Carpenter, B. and L. Masinter, "Format for
|
||||
Literal IPv6 Addresses in URL's", RFC 2732, December
|
||||
1999.
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
Duerst Expires December 30, 2002 [Page 6]
|
||||
|
||||
Internet-Draft IDNs in URIs July 2002
|
||||
|
||||
|
||||
Author's Address
|
||||
|
||||
Martin Duerst
|
||||
W3C/Keio University
|
||||
5322 Endo
|
||||
Fujisawa 252-8520
|
||||
Japan
|
||||
|
||||
Phone: +81 466 49 1170
|
||||
Fax: +81 466 49 1171
|
||||
EMail: duerst@w3.org
|
||||
URI: http://www.w3.org/People/D%C3%BCrst/
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
Duerst Expires December 30, 2002 [Page 7]
|
||||
|
||||
Internet-Draft IDNs in URIs July 2002
|
||||
|
||||
|
||||
Full Copyright Statement
|
||||
|
||||
Copyright (C) The Internet Society (2002). All Rights Reserved.
|
||||
|
||||
This document and translations of it may be copied and furnished to
|
||||
others, and derivative works that comment on or otherwise explain it
|
||||
or assist in its implementation may be prepared, copied, published
|
||||
and distributed, in whole or in part, without restriction of any
|
||||
kind, provided that the above copyright notice and this paragraph are
|
||||
included on all such copies and derivative works. However, this
|
||||
document itself may not be modified in any way, such as by removing
|
||||
the copyright notice or references to the Internet Society or other
|
||||
Internet organizations, except as needed for the purpose of
|
||||
developing Internet standards in which case the procedures for
|
||||
copyrights defined in the Internet Standards process must be
|
||||
followed, or as required to translate it into languages other than
|
||||
English.
|
||||
|
||||
The limited permissions granted above are perpetual and will not be
|
||||
revoked by the Internet Society or its successors or assigns.
|
||||
|
||||
This document and the information contained herein is provided on an
|
||||
"AS IS" basis and THE INTERNET SOCIETY AND THE INTERNET ENGINEERING
|
||||
TASK FORCE DISCLAIMS ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING
|
||||
BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION
|
||||
HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED WARRANTIES OF
|
||||
MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.
|
||||
|
||||
Acknowledgement
|
||||
|
||||
Funding for the RFC Editor function is currently provided by the
|
||||
Internet Society.
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
|
||||
Duerst Expires December 30, 2002 [Page 8]
|
||||
|
||||
|
Loading…
x
Reference in New Issue
Block a user