help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid
Category | Datatype | Source | Property | Values |
---|---|---|---|---|
Bidirectional | Binary | UCD | Bidi_Control | No (N), Yes (Y) |
Bidi_Mirrored | No (N), Yes (Y) | |||
Enumerated | Bidi_Class | Show Values | ||
Bidi_Paired_Bracket_Type | Close (c), None (n), Open (o) | |||
String | Bidi_Mirroring_Glyph | Show Values | ||
Bidi_Paired_Bracket | Show Values | |||
Case | Binary | UCD | Case_Ignorable | No (N), Yes (Y) |
Cased | No (N), Yes (Y) | |||
Changes_When_Casefolded | No (N), Yes (Y) | |||
Changes_When_Casemapped | No (N), Yes (Y) | |||
Changes_When_Lowercased | No (N), Yes (Y) | |||
Changes_When_Titlecased | No (N), Yes (Y) | |||
Changes_When_Uppercased | No (N), Yes (Y) | |||
Lowercase | No (N), Yes (Y) | |||
Soft_Dotted | No (N), Yes (Y) | |||
Uppercase | No (N), Yes (Y) | |||
String | Case_Folding | Show Values | ||
Lowercase_Mapping | Show Values | |||
Simple_Case_Folding | Show Values | |||
Simple_Lowercase_Mapping | Show Values | |||
Simple_Titlecase_Mapping | Show Values | |||
Simple_Uppercase_Mapping | Show Values | |||
Titlecase_Mapping | Show Values | |||
Uppercase_Mapping | Show Values | |||
Unicode | toCasefold | Show Values | ||
toLowercase | Show Values | |||
toTitlecase | Show Values | |||
toUppercase | Show Values | |||
CJK | Binary | UCD | IDS_Binary_Operator | No (N), Yes (Y) |
IDS_Trinary_Operator | No (N), Yes (Y) | |||
Ideographic | No (N), Yes (Y) | |||
Radical | No (N), Yes (Y) | |||
Unified_Ideograph | No (N), Yes (Y) | |||
Enumerated | X-Demo | HanType | Han, Hans, Hant, na | |
String | UCD | CJK_Radical | Show Values | |
Equivalent_Unified_Ideograph | Show Values | |||
kSimplifiedVariant | Show Values | |||
kTraditionalVariant | Show Values | |||
Emoji | Binary | UCD | Extended_Pictographic | No (N), Yes (Y) |
UTS | Basic_Emoji | No (N), Yes (Y) | ||
Emoji | No (N), Yes (Y) | |||
Emoji_Component | No (N), Yes (Y) | |||
Emoji_Modifier | No (N), Yes (Y) | |||
Emoji_Modifier_Base | No (N), Yes (Y) | |||
Emoji_Presentation | No (N), Yes (Y) | |||
RGI_Emoji | No, Yes | |||
RGI_Emoji_Flag_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Keycap_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Modifier_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Tag_Sequence | No (N), Yes (Y) | |||
RGI_Emoji_Zwj_Sequence | No (N), Yes (Y) | |||
Enumerated | UCD | Regional_Indicator | No (N), Yes (Y) | |
General | Binary | UCD | Alphabetic | No (N), Yes (Y) |
Default_Ignorable_Code_Point | No (N), Yes (Y) | |||
Deprecated | No (N), Yes (Y) | |||
Logical_Order_Exception | No (N), Yes (Y) | |||
Noncharacter_Code_Point | No (N), Yes (Y) | |||
Variation_Selector | No (N), Yes (Y) | |||
White_Space | No (N), Yes (Y) | |||
Catalog | Age | Show Values | ||
Block | Show Values | |||
Script | Show Values | |||
Enumerated | General_Category | Show Values | ||
Hangul_Syllable_Type | Leading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT), Not_Applicable (NA), Trailing_Jamo (T), Vowel_Jamo (V) | |||
Name_Alias | Show Values | |||
Named_Sequences | Show Values | |||
Named_Sequences_Prov | ||||
String | Nameslist | subhead | Show Values | |
UCD | Name | Show Values | ||
Script_Extensions | Show Values | |||
Identifiers | Binary | UCD | ID_Continue | No (N), Yes (Y) |
ID_Start | No (N), Yes (Y) | |||
Pattern_Syntax | No (N), Yes (Y) | |||
Pattern_White_Space | No (N), Yes (Y) | |||
XID_Continue | No (N), Yes (Y) | |||
XID_Start | No (N), Yes (Y) | |||
IDNA | Enumerated | UTS | Idn_2008 | na (na), NV8 (nv8), XV8 (xv8) |
Idn_Status | deviation (dv), disallowed (da), disallowed_STD3_mapped (ds3m), disallowed_STD3_valid (ds3v), ignored (i), mapped (m), valid (v) | |||
idna2003 | deviation, disallowed, ignored, mapped, valid | |||
idna2008 | CONTEXTJ, CONTEXTO, DISALLOWED, PVALID, UNASSIGNED | |||
idna2008c | deviation, disallowed, ignored, mapped, valid | |||
uts46 | deviation, disallowed, ignored, mapped, valid | |||
String | Idn_Mapping | Show Values | ||
toIdna2003 | Show Values | |||
toUts46n | Show Values | |||
toUts46t | Show Values | |||
Miscellaneous | Binary | UCD | Dash | No (N), Yes (Y) |
Diacritic | No (N), Yes (Y) | |||
Extender | No (N), Yes (Y) | |||
Grapheme_Base | No (N), Yes (Y) | |||
Grapheme_Extend | No (N), Yes (Y) | |||
Grapheme_Link | No (N), Yes (Y) | |||
Hyphen | No (N), Yes (Y) | |||
Math | No (N), Yes (Y) | |||
Quotation_Mark | No (N), Yes (Y) | |||
STerm | No (N), Yes (Y) | |||
Terminal_Punctuation | No (N), Yes (Y) | |||
Enumerated | Indic_Positional_Category | Show Values | ||
Indic_Syllabic_Category | Show Values | |||
Miscellaneous | ISO_Comment | Show Values | ||
Unicode_1_Name | Show Values | |||
Normalization | Binary | ICU | isNFM | No, Yes |
UCD | Changes_When_NFKC_Casefolded | No (N), Yes (Y) | ||
Full_Composition_Exclusion | No (N), Yes (Y) | |||
Unicode | isNFC | No, Yes | ||
isNFD | No, Yes | |||
isNFKC | No, Yes | |||
isNFKD | No, Yes | |||
Enumerated | UCD | Canonical_Combining_Class | Show Values | |
Decomposition_Type | Show Values | |||
NFC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
NFD_Quick_Check | No (N), Yes (Y) | |||
NFKC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
NFKD_Quick_Check | No (N), Yes (Y) | |||
String | ICU | toNFM | Show Values | |
UCD | NFKC_Casefold | Show Values | ||
Unicode | toNFC | Show Values | ||
toNFD | Show Values | |||
toNFKC | Show Values | |||
toNFKD | Show Values | |||
Numeric | Binary | UCD | ASCII_Hex_Digit | No (N), Yes (Y) |
Hex_Digit | No (N), Yes (Y) | |||
Enumerated | Numeric_Type | Decimal (De), Digit (Di), None (None), Numeric (Nu) | ||
kAccountingNumeric | Show Values | |||
kOtherNumeric | Show Values | |||
kPrimaryNumeric | Show Values | |||
Numeric | Numeric_Value | Show Values | ||
Regex | Binary | UTS | ANY | No, Yes |
ASCII | No, Yes | |||
bmp | No, Yes | |||
Security | Enumerated | UTS | Confusable_MA | Show Values |
Identifier_Status | Allowed (a), Restricted (r) | |||
Identifier_Type | Show Values | |||
Shaping and Rendering | Binary | UCD | Join_Control | No (N), Yes (Y) |
Enumerated | East_Asian_Width | Ambiguous (A), Fullwidth (F), Halfwidth (H), Narrow (Na), Neutral (N), Wide (W) | ||
Grapheme_Cluster_Break | Show Values | |||
Joining_Group | Show Values | |||
Joining_Type | Dual_Joining (D), Join_Causing (C), Left_Joining (L), Non_Joining (U), Right_Joining (R), Transparent (T) | |||
Line_Break | Show Values | |||
Prepended_Concatenation_Mark | No (N), Yes (Y) | |||
Sentence_Break | Show Values | |||
Standardized_Variant | Show Values | |||
Vertical_Orientation | Rotated (R), Transformed_Rotated (Tr), Transformed_Upright (Tu), Upright (U) | |||
Word_Break | Show Values | |||
UCA | Binary | UTS | uca | Show Values |
uca2 | Show Values | |||
uca2.5 | Show Values | |||
uca3 | Show Values | |||
Z-Other | Other | Other | Composition_Exclusion | Other |
Confusable_ML | Other | |||
Confusable_SA | Other | |||
Confusable_SL | Other | |||
Decomposition_Mapping | Other | |||
Do_Not_Emit_Preferred | Other | |||
Do_Not_Emit_Type | Other | |||
Emoji_DCM | Other | |||
Emoji_KDDI | Other | |||
Emoji_SB | Other | |||
exemplar | Other | |||
exemplar_aux | Other | |||
exemplar_punct | Other | |||
Expands_On_NFC | Other | |||
Expands_On_NFD | Other | |||
Expands_On_NFKC | Other | |||
Expands_On_NFKD | Other | |||
FC_NFKC_Closure | Other | |||
ID_Compat_Math_Continue | Other | |||
ID_Compat_Math_Start | Other | |||
IDS_Unary_Operator | Other | |||
Indic_Conjunct_Break | Other | |||
Jamo_Short_Name | Other | |||
kAlternateTotalStrokes | Other | |||
kBigFive | Other | |||
kCangjie | Other | |||
kCantonese | Other | |||
kCCCII | Other | |||
kCheungBauer | Other | |||
kCheungBauerIndex | Other | |||
kCihaiT | Other | |||
kCNS1986 | Other | |||
kCNS1992 | Other | |||
kCompatibilityVariant | Other | |||
kCowles | Other | |||
kDaeJaweon | Other | |||
kDefinition | Other | |||
kEACC | Other | |||
kEH_Cat | Other | |||
kEH_Core | Other | |||
kEH_Desc | Other | |||
kEH_Func | Other | |||
kEH_FVal | Other | |||
kEH_HG | Other | |||
kEH_IFAO | Other | |||
kEH_JSesh | Other | |||
kEH_NoMirror | Other | |||
kEH_NoRotate | Other | |||
kEH_UniK | Other | |||
kFanqie | Other | |||
kFenn | Other | |||
kFennIndex | Other | |||
kFourCornerCode | Other | |||
kFrequency | Other | |||
kGB0 | Other | |||
kGB1 | Other | |||
kGB3 | Other | |||
kGB5 | Other | |||
kGB7 | Other | |||
kGB8 | Other | |||
kGradeLevel | Other | |||
kGSR | Other | |||
kHangul | Other | |||
kHanYu | Other | |||
kHanyuPinlu | Other | |||
kHanyuPinyin | Other | |||
kHDZRadBreak | Other | |||
kHKGlyph | Other | |||
kHKSCS | Other | |||
kIBMJapan | Other | |||
kIICore | Other | |||
kIRG_GSource | Other | |||
kIRG_HSource | Other | |||
kIRG_JSource | Other | |||
kIRG_KPSource | Other | |||
kIRG_KSource | Other | |||
kIRG_MSource | Other | |||
kIRG_SSource | Other | |||
kIRG_TSource | Other | |||
kIRG_UKSource | Other | |||
kIRG_USource | Other | |||
kIRG_VSource | Other | |||
kIRGDaeJaweon | Other | |||
kIRGDaiKanwaZiten | Other | |||
kIRGHanyuDaZidian | Other | |||
kIRGKangXi | Other | |||
kJa | Other | |||
kJapanese | Other | |||
kJapaneseKun | Other | |||
kJapaneseOn | Other | |||
kJinmeiyoKanji | Other | |||
kJis0 | Other | |||
kJIS0213 | Other | |||
kJis1 | Other | |||
kJoyoKanji | Other | |||
kKangXi | Other | |||
kKarlgren | Other | |||
kKorean | Other | |||
kKoreanEducationHanja | Other | |||
kKoreanName | Other | |||
kKPS0 | Other | |||
kKPS1 | Other | |||
kKSC0 | Other | |||
kKSC1 | Other | |||
kLau | Other | |||
kMainlandTelegraph | Other | |||
kMandarin | Other | |||
kMatthews | Other | |||
kMeyerWempe | Other | |||
kMojiJoho | Other | |||
kMorohashi | Other | |||
kNelson | Other | |||
kPhonetic | Other | |||
kPseudoGB1 | Other | |||
kReading | Other | |||
kRSAdobe_Japan1_6 | Other | |||
kRSJapanese | Other | |||
kRSKangXi | Other | |||
kRSKanWa | Other | |||
kRSKorean | Other | |||
kRSTUnicode | Other | |||
kRSUnicode | Other | |||
kSBGY | Other | |||
kSemanticVariant | Other | |||
kSMSZD2003Index | Other | |||
kSMSZD2003Readings | Other | |||
kSpecializedSemanticVariant | Other | |||
kSpoofingVariant | Other | |||
kSrc_NushuDuben | Other | |||
kStrange | Other | |||
kTaiwanTelegraph | Other | |||
kTang | Other | |||
kTGH | Other | |||
kTGHZ2013 | Other | |||
kTGT_MergedSrc | Other | |||
kTotalStrokes | Other | |||
kUnihanCore2020 | Other | |||
kVietnamese | Other | |||
kVietnameseNumeric | Other | |||
kXerox | Other | |||
kXHC1983 | Other | |||
kZhuangNumeric | Other | |||
kZVariant | Other | |||
Modifier_Combining_Mark | Other | |||
NFKC_Simple_Casefold | Other | |||
Other_Alphabetic | Other | |||
Other_Default_Ignorable_Code_Point | Other | |||
Other_Grapheme_Extend | Other | |||
Other_ID_Continue | Other | |||
Other_ID_Start | Other | |||
Other_Joining_Type | Other | |||
Other_Lowercase | Other | |||
Other_Math | Other | |||
Other_Uppercase | Other |
The Categories are from UCD Table 8. Property Summary Table, with some extended categories: Emoji, IDNA, Regex, Security, and UCA.
The Datatypes are from UCD Table 5. Property Type Key.
The Sources are:
Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.
Version 3.9; ICU version: 74.1; Unicode/Emoji version: 15.1.0;