The characters in the range U+0400–U+045F are basically the characters from ISO 8859-5 moved upward by 864 positions. The next characters in the Cyrillic block, range U+0460–U+0489, are historical letters, some of which are still used for Church Slavonic. The characters in the range U+048A–U+04FF and the complete Cyrillic Supplement block (U+0500-U+052F) are additional letters for various languages that are written with Cyrillic script. Two characters are in the Phonetic Extensions block: U+1D2BᴫCYRILLIC LETTER SMALL CAPITAL EL from the Uralic Phonetic Alphabet and U+1D78ᵸMODIFIER LETTER CYRILLIC EN for transcribing nasal vowels.
Unicode includes few precomposed accented Cyrillic letters; the others can be combined by adding U+0301 ("combining acute accent") after the accented vowel (e.g., е́ у́ э́); see below.
Several diacritical marks not specific to Cyrillic can be used with Cyrillic text, including:
in Combining Diacritical Marks block U+0300–U+036F.
U+0301◌́COMBINING ACUTE ACCENT (as common Cyrillic stress mark).To input an accented letter with acute accent: for the letter R (for example), digit R0301 (without space between letter and number), then select 0301 only and press Alt + X = Ŕ.
U+0300◌̀COMBINING GRAVE ACCENT (as stress mark in Bulgarian).
U+0303◌̃COMBINING TILDE (in non Slavic languages)
U+0304◌̄COMBINING MACRON (in non Slavic languages)
U+0306◌̆COMBINING BREVE (with й but also other letters in non Slavic languages)
U+0307◌̇COMBINING DOT ABOVE (in transliterations of other writing systems)
U+0308◌̈COMBINING DIAERESIS (in non Slavic languages)
U+030A◌̊COMBINING RING ABOVE (in non Slavic languages)
U+030B◌̋COMBINING DOUBLE ACUTE ACCENT (in non Slavic languages)
U+030C◌̌COMBINING CARON (in non Slavic languages)
U+030F◌̏COMBINING DOUBLE GRAVE ACCENT (with ѷ in old spelling)
U+0311◌̑COMBINING INVERTED BREVE (in 19th century Aleut alphabet)
U+0323◌̣COMBINING DOT BELOW (in transliterations of other writing systems)
U+0328◌̨COMBINING OGONEK (in 19th century Lithuanian or Polish cyrillic alphabets)
U+0331◌̱COMBINING MACRON BELOW (in transliterations of other writing systems)
U+033E◌̾COMBINING VERTICAL TILDE (in 19th century Polish cyrillic alphabet)
in Combining Diacritical Marks for Symbols block U+20D0–U+20F0
U+20DD◌⃝COMBINING ENCLOSING CIRCLE (as Cyrillic ten thousands sign).
In the table below, small letters are ordered according to their Unicode numbers; capital letters are placed immediately before the corresponding small letters. Standard Unicode names and canonical decompositions are included.
Table of characters
Blocks
The Cyrillic block (U+0400 – U+04FF) was added to the Unicode Standard in October, 1991 with the release of version 1.0:
The Cyrillic Supplement block (U+0500 – U+052F) was added to the Unicode Standard in March, 2002 with the release of version 3.2:
The Cyrillic Extended-A (U+2DE0 – U+2DFF) and Cyrillic Extended-B (U+A640 – U+A69F) blocks were added to the Unicode Standard in April, 2008 with the release of version 5.1:
The Cyrillic Extended-C block (U+1C80 – U+1C8F) was added to the Unicode Standard in June, 2016 with the release of version 9.0:
The Cyrillic Extended-D block (U+1E030 – U+1E08F) was added to the Unicode Standard in September, 2022 with the release of version 15.0:
See also
List of Cyrillic letters
Cyrillic script
Cyrillic alphabets
References
Gordana Jovanović, ed. (2009). Стандардизација старословенског ћириличког писма и његова регистрација у Уникоду [Standardization of the Old Church Slavonic Cyrillic Script and its Registration in Unicode] (PDF). Vol. CXXV (Scientific Meetings ed.). Belgrade: Serbian Academy of Sciences and Arts. ISBN 978-86-7025-494-7. Archived from the original (PDF) on 2012-03-18. Retrieved 2011-08-07.