Jump to content

ISO 639-1

From Wikipedia, the free encyclopedia

ISO 639-1:2002, Codes for the representation of names of languages—Part 1: Alpha-2 code, is the first part of the ISO 639 series of international standards for language codes. Part 1 covers the registration of "set 1" two-letter codes.[1] There are 183 two-letter codes registered as of June 2021. The registered codes cover the world's major languages.

Infoterm (International Information Centre for Terminology) is the registration authority for ISO 639-1 codes.

These codes are a useful international and formal shorthand for indicating languages.

Examples of ISO 639-1 codes
Code ISO 639-1 language name Endonym
English
en English English
es Spanish español
pt Portuguese português
zh Chinese 中文, Zhōngwén

ISO 639-1 is more restrictive than other ISO 639 parts, such as ISO 639-2 and ISO 639-3, which cover a wider range of languages and variations.[2]

Many multilingual websites, including Wikipedia, use these codes to prefix URLs of localised versions: for example, ja.wikipedia.org is the Japanese version of Wikipedia.

History

[edit]

ISO 639, the original standard for language codes, was approved in 1967.[3] It was designed to represent major and primary national languages with well-established terminologies and lexicography.

In 1998 the standard was extended with an additional part, ISO 639-2, providing three-letter codes for a much wider range of languages.[4] The original standard was redesignated as ISO 639-1 in 2002, in an updated version.[1] Eventually five parts were issued, each being maintained as a separate standard.

In 2023, the different parts were combined into a unified standard, with the different code lists from each part now designated "sets".[5][6]

The last two-letter code added was ht, representing Haitian Creole on February 26, 2003.[7]

Updates

[edit]

New ISO 639-1 codes are not added if an ISO 639-2 "set 2" three-letter code exists, so systems that use ISO 639-1 and 639-2 codes, with 639-1 codes preferred, do not have to change existing codes.[8]

If an ISO 639-2 code that covers a group of languages is used, it might be overridden for some specific languages by a new ISO 639-1 code.

Part 3 (2007) of the standard, ISO 639-3, aiming to cover all known natural languages, largely supersedes the ISO 639-2 three-letter code standard.

There is no specification on treatment of macrolanguages, which are covered by ISO 639-3.

IETF language tags

[edit]

The use of ISO 639 language codes was encouraged by the introduction of IETF language tags via RFC 1766 in March 1995.[9] The current version of the specification is RFC 5646 from September 2009.[10]

See also

[edit]

References

[edit]
  1. ^ a b "ISO 639-1:2002". ISO. July 2002. Retrieved 5 October 2025.
  2. ^ "Frequently Asked Questions (FAQ) - Codes for the representation of names of languages (Library of Congress)". Language Coding Agency. Library of Congress. Retrieved 5 October 2025.
  3. ^ "ISO/R 639:1967". International Organization for Standardization. 1988-03-01. Retrieved 2012-08-05.
  4. ^ ISO 639-2:1998 – Codes for the representation of names of languages – Part 2: Alpha-3 code.
  5. ^ "ISO 639 – Language code". International Organization for Standardization.
  6. ^ ISO 639:2023 – Code for individual languages and language groups.
  7. ^ "ISO 639-1 Languages: Haitian". Library of Congress.
  8. ^ ISO 639 Joint Advisory Committee - Working principles for ISO 639 maintenance
  9. ^ H. Alvestrand (March 1995). Tags for the Identification of Languages. Network Working Group. doi:10.17487/RFC1766. RFC 1766. Obsolete. Obsoleted by RFC 3066 and 3282.
  10. ^ Phillips, A.; Davis, M., eds. (September 2009). Tags for Identifying Languages. IETF Network Working Group. doi:10.17487/RFC5646. BCP 47. RFC 5646. Best Current Practice 47. Obsoletes RFC 4646.
[edit]