IETF STANDARDS

The Internet Engineering Task Force (IETF) led the effort to create standards for using non-ASCII characters in the Domain Name System (DNS).


The DNS only recognizes ASCII characters A-Z, 0-9 and '-'. This limits the number of characters that can be utilized to build domain names to 37 of the more than 96,000 characters identified within Unicode. To create domain names from the range of Unicode characters, a character-encoding scheme that uniquely maps Unicode code points to an ASCII representation must be used and standardized.

The IETF published these standards related to Internationalized Domain Names (IDN): Encoding Schemes, Framework, Protocol, Unicode and Right-to-Left Scripts.

ENCODING SCHEME

The encoding scheme for IDNs uses Punycode, an ASCII Compatible Encoding (ACE) that encodes local language characters into ASCII characters such that DNS can accurately answer a request for an address record. To select Punycode as the ACE standard, IETF considered the balance between compression and implementation. Punycode allows the greatest number of characters (code points) to be represented and is not difficult to deploy.

FRAMEWORK [RFC 5890]

This RFC is one of a collection that, together, describe the protocol and usage context for a revision of Internationalized Domain Names for Applications (IDNA) that was largely completed in 2008, known within the series and elsewhere as "IDNA2008." The series replaces an earlier version of IDNA [RFC 3490] [RFC 3491]. For convenience, that version of IDNA is referred to as "IDNA2003." The newer version continues to use the Punycode algorithm [RFC3492] and the ACE (ASCII-Compatible Encoding) prefix from the earlier version.

PROTOCOL [RFC 5891]

This RFC describes the core IDNA2008 protocol and its operations. In combination with the "bi-directional" (Bidi) document described below, it explicitly updates and replaces [RFC 3490].

UNICODE [RFC 5892]

This RFC specifies rules for deciding whether a code point, considered in isolation or in context, is a candidate for inclusion in an IDN. It is part of the specification of IDNA2008.

RIGHT-TO-LEFT SCRIPTS [RFC 5893]

The use of right-to-left scripts in Internationalized Domain Names (IDNs) has presented several challenges. This RFC provides new Bidi rules for Internationalized Domain Names for Applications (IDNA) labels, based on the problems encountered with some scripts and some shortcomings in the 2003 IDNA Bidi criterion.

RATIONALE [RFC 5894]

This RFC provides the background, explanation and rationale for the need of new RFCs to tackle issues that have risen out of the previous version(s) of IDNA. The need to update the version of Unicode supported in IDNs is also discussed in this RFC.

PUBLISHED RFCS

These standards have been published and are now available:

  • RFC 3492 — Encoding Scheme (Punycode)
  • RFC 5890 — IDNA Framework
  • RFC 5891 — IDNA Protocol
  • RFC 5892 — IDNA Unicode
  • RFC 5893 — IDNA Right-to-Left Scripts
  • RFC 5894 — IDNA Rationale

FreeYourID is committed to following the IETF standards and supporting rapid deployment of this new technology.

NEED MORE INFO?