Warning, /kdevelop/kdevelop-pg-qt/unidata/ArabicShaping.txt is written in an unsupported language. File is not indexed.

0001 # ArabicShaping-6.0.0.txt
0002 # Date: 2010-04-30, 13:47:00 PDT [KW]
0003 #
0004 # This file is a normative contributory data file in the
0005 # Unicode Character Database.
0006 #
0007 # Copyright (c) 1991-2010 Unicode, Inc.
0008 # For terms of use, see http://www.unicode.org/terms_of_use.html
0009 #
0010 # This file defines the shaping classes for Arabic, Syriac, and N'Ko
0011 # positional shaping, repeating in machine readable form the
0012 # information exemplified in Tables 8-3, 8-7, 8-8, 8-11, 8-12,
0013 # 8-13, and 13-5 of The Unicode Standard, Version 6.0.
0014 #
0015 # See sections 8.2, 8.3, and 13.5 of The Unicode Standard, Version 6.0
0016 # for more information.
0017 #
0018 # Each line contains four fields, separated by a semicolon.
0019 #
0020 # Field 0: the code point, in 4-digit hexadecimal
0021 #   form, of an Arabic, Syriac, or N'Ko character.
0022 #
0023 # Field 1: gives a short schematic name for that character,
0024 #   abbreviated from the normative Unicode character name.
0025 #   Note that this schematic name is considered a comment,
0026 #   and does not constitute a formal property value.
0027 #
0028 # Field 2: defines the joining type (property name: Joining_Type)
0029 #   R Right_Joining
0030 #   L Left_Joining
0031 #   D Dual_Joining
0032 #   C Join_Causing
0033 #   U Non_Joining
0034 #   T Transparent
0035 #     See Section 8.2, Arabic for more information on these types.
0036 #
0037 # Field 3: defines the joining group (property name: Joining_Group)
0038 #
0039 # The values of the joining group are based schematically on character
0040 # names. Where a schematic character name consists of two or more parts separated
0041 # by spaces, the formal Joining_Group property value, as specified in
0042 # PropertyValueAliases.txt, consists of the same name parts joined by
0043 # underscores. Hence, the entry:
0044 #
0045 #   0629; TEH MARBUTA; R; TEH MARBUTA
0046 #
0047 # corresponds to [Joining_Group = Teh_Marbuta].
0048 #
0049 # Note: The property value now designated [Joining_Group = Teh_Marbuta_Goal] 
0050 #   used to apply to both of the following characters
0051 #   in earlier versions of the standard:
0052 #
0053 #   U+06C2 ARABIC LETTER HEH GOAL WITH HAMZA ABOVE
0054 #   U+06C3 ARABIC LETTER TEH MARBUTA GOAL
0055 #
0056 #   However, it currently applies only to U+06C3, and *not* to U+06C2.
0057 #   To avoid destabilizing existing Joining_Group property aliases, the
0058 #   prior Joining_Group value for U+06C3 (Hamza_On_Heh_Goal) has been
0059 #   retained as a property value alias, despite the fact that it
0060 #   no longer applies to its namesake character, U+06C2.
0061 #   See PropertyValueAliases.txt.
0062 #
0063 # When other cursive scripts are added to the Unicode Standard in
0064 # the future, the joining group value of all its letters will default
0065 # to jg=No_Joining_Group in this data file. Other, more specific
0066 # joining group values will be defined only if an explicit proposal
0067 # to define those values exactly has been approved by the UTC. This
0068 # is the convention exemplified by the N'Ko script. Only the Arabic
0069 # and Syriac scripts currently have explicit joining group values defined.
0070 #
0071 # Note: Code points that are not explicitly listed in this file are
0072 # either of joining type T or U:
0073 #
0074 # - Those that not explicitly listed that are of General Category Mn, Me, or Cf
0075 #   have joining type T.
0076 # - All others not explicitly listed have joining type U.
0077 #
0078 # For an explicit listing of characters of joining type T, see
0079 # the derived property file DerivedJoiningType.txt.
0080 #
0081 # There are currently no characters of joining type L defined in Unicode.
0082 #
0083 # #############################################################
0084  
0085 # Unicode; Schematic Name; Joining Type; Joining Group
0086 
0087 # Arabic characters
0088 
0089 0600; ARABIC NUMBER SIGN; U; No_Joining_Group
0090 0601; ARABIC SIGN SANAH; U; No_Joining_Group
0091 0602; ARABIC FOOTNOTE MARKER; U; No_Joining_Group
0092 0603; ARABIC SIGN SAFHA; U; No_Joining_Group
0093 0608; ARABIC RAY; U; No_Joining_Group
0094 060B; AFGHANI SIGN; U; No_Joining_Group
0095 0620; YEH WITH RING; D; YEH
0096 0621; HAMZA; U; No_Joining_Group
0097 0622; MADDA ON ALEF; R; ALEF
0098 0623; HAMZA ON ALEF; R; ALEF
0099 0624; HAMZA ON WAW; R; WAW
0100 0625; HAMZA UNDER ALEF; R; ALEF
0101 0626; HAMZA ON YEH; D; YEH
0102 0627; ALEF; R; ALEF
0103 0628; BEH; D; BEH
0104 0629; TEH MARBUTA; R; TEH MARBUTA
0105 062A; TEH; D; BEH
0106 062B; THEH; D; BEH
0107 062C; JEEM; D; HAH
0108 062D; HAH; D; HAH
0109 062E; KHAH; D; HAH
0110 062F; DAL; R; DAL
0111 0630; THAL; R; DAL
0112 0631; REH; R; REH
0113 0632; ZAIN; R; REH
0114 0633; SEEN; D; SEEN
0115 0634; SHEEN; D; SEEN
0116 0635; SAD; D; SAD
0117 0636; DAD; D; SAD
0118 0637; TAH; D; TAH
0119 0638; ZAH; D; TAH
0120 0639; AIN; D; AIN
0121 063A; GHAIN; D; AIN
0122 063B; KEHEH WITH 2 DOTS ABOVE; D; GAF
0123 063C; KEHEH WITH 3 DOTS BELOW; D; GAF 
0124 063D; FARSI YEH WITH INVERTED V; D; FARSI YEH
0125 063E; FARSI YEH WITH 2 DOTS ABOVE; D; FARSI YEH
0126 063F; FARSI YEH WITH 3 DOTS ABOVE; D; FARSI YEH
0127 0640; TATWEEL; C; No_Joining_Group
0128 0641; FEH; D; FEH
0129 0642; QAF; D; QAF
0130 0643; KAF; D; KAF
0131 0644; LAM; D; LAM
0132 0645; MEEM; D; MEEM
0133 0646; NOON; D; NOON
0134 0647; HEH; D; HEH
0135 0648; WAW; R; WAW
0136 0649; ALEF MAKSURA; D; YEH
0137 064A; YEH; D; YEH
0138 066E; DOTLESS BEH; D; BEH
0139 066F; DOTLESS QAF; D; QAF
0140 0671; HAMZAT WASL ON ALEF; R; ALEF
0141 0672; WAVY HAMZA ON ALEF; R; ALEF
0142 0673; WAVY HAMZA UNDER ALEF; R; ALEF
0143 0674; HIGH HAMZA; U; No_Joining_Group
0144 0675; HIGH HAMZA ALEF; R; ALEF
0145 0676; HIGH HAMZA WAW; R; WAW
0146 0677; HIGH HAMZA WAW WITH DAMMA; R; WAW
0147 0678; HIGH HAMZA YEH; D; YEH
0148 0679; TEH WITH SMALL TAH; D; BEH
0149 067A; TEH WITH 2 DOTS VERTICAL ABOVE; D; BEH
0150 067B; BEH WITH 2 DOTS VERTICAL BELOW; D; BEH
0151 067C; TEH WITH RING; D; BEH
0152 067D; TEH WITH 3 DOTS ABOVE DOWNWARD; D; BEH
0153 067E; TEH WITH 3 DOTS BELOW; D; BEH
0154 067F; TEH WITH 4 DOTS ABOVE; D; BEH
0155 0680; BEH WITH 4 DOTS BELOW; D; BEH
0156 0681; HAMZA ON HAH; D; HAH
0157 0682; HAH WITH 2 DOTS VERTICAL ABOVE; D; HAH
0158 0683; HAH WITH MIDDLE 2 DOTS; D; HAH
0159 0684; HAH WITH MIDDLE 2 DOTS VERTICAL; D; HAH
0160 0685; HAH WITH 3 DOTS ABOVE; D; HAH
0161 0686; HAH WITH MIDDLE 3 DOTS DOWNWARD; D; HAH
0162 0687; HAH WITH MIDDLE 4 DOTS; D; HAH
0163 0688; DAL WITH SMALL TAH; R; DAL
0164 0689; DAL WITH RING; R; DAL
0165 068A; DAL WITH DOT BELOW; R; DAL
0166 068B; DAL WITH DOT BELOW AND SMALL TAH; R; DAL
0167 068C; DAL WITH 2 DOTS ABOVE; R; DAL
0168 068D; DAL WITH 2 DOTS BELOW; R; DAL
0169 068E; DAL WITH 3 DOTS ABOVE; R; DAL
0170 068F; DAL WITH 3 DOTS ABOVE DOWNWARD; R; DAL
0171 0690; DAL WITH 4 DOTS ABOVE; R; DAL
0172 0691; REH WITH SMALL TAH; R; REH
0173 0692; REH WITH SMALL V; R; REH
0174 0693; REH WITH RING; R; REH
0175 0694; REH WITH DOT BELOW; R; REH
0176 0695; REH WITH SMALL V BELOW; R; REH
0177 0696; REH WITH DOT BELOW AND DOT ABOVE; R; REH
0178 0697; REH WITH 2 DOTS ABOVE; R; REH
0179 0698; REH WITH 3 DOTS ABOVE; R; REH
0180 0699; REH WITH 4 DOTS ABOVE; R; REH
0181 069A; SEEN WITH DOT BELOW AND DOT ABOVE; D; SEEN
0182 069B; SEEN WITH 3 DOTS BELOW; D; SEEN
0183 069C; SEEN WITH 3 DOTS BELOW AND 3 DOTS ABOVE; D; SEEN
0184 069D; SAD WITH 2 DOTS BELOW; D; SAD
0185 069E; SAD WITH 3 DOTS ABOVE; D; SAD
0186 069F; TAH WITH 3 DOTS ABOVE; D; TAH
0187 06A0; AIN WITH 3 DOTS ABOVE; D; AIN
0188 06A1; DOTLESS FEH; D; FEH
0189 06A2; FEH WITH DOT MOVED BELOW; D; FEH
0190 06A3; FEH WITH DOT BELOW; D; FEH
0191 06A4; FEH WITH 3 DOTS ABOVE; D; FEH
0192 06A5; FEH WITH 3 DOTS BELOW; D; FEH
0193 06A6; FEH WITH 4 DOTS ABOVE; D; FEH
0194 06A7; QAF WITH DOT ABOVE; D; QAF
0195 06A8; QAF WITH 3 DOTS ABOVE; D; QAF
0196 06A9; KEHEH; D; GAF
0197 06AA; SWASH KAF; D; SWASH KAF
0198 06AB; KAF WITH RING; D; GAF
0199 06AC; KAF WITH DOT ABOVE; D; KAF
0200 06AD; KAF WITH 3 DOTS ABOVE; D; KAF
0201 06AE; KAF WITH 3 DOTS BELOW; D; KAF
0202 06AF; GAF; D; GAF
0203 06B0; GAF WITH RING; D; GAF
0204 06B1; GAF WITH 2 DOTS ABOVE; D; GAF
0205 06B2; GAF WITH 2 DOTS BELOW; D; GAF
0206 06B3; GAF WITH 2 DOTS VERTICAL BELOW; D; GAF
0207 06B4; GAF WITH 3 DOTS ABOVE; D; GAF
0208 06B5; LAM WITH SMALL V; D; LAM
0209 06B6; LAM WITH DOT ABOVE; D; LAM
0210 06B7; LAM WITH 3 DOTS ABOVE; D; LAM
0211 06B8; LAM WITH 3 DOTS BELOW; D; LAM
0212 06B9; NOON WITH DOT BELOW; D; NOON
0213 06BA; DOTLESS NOON; D; NOON
0214 06BB; DOTLESS NOON WITH SMALL TAH; D; NOON
0215 06BC; NOON WITH RING; D; NOON
0216 06BD; NYA; D; NYA
0217 06BE; KNOTTED HEH; D; KNOTTED HEH
0218 06BF; HAH WITH MIDDLE 3 DOTS DOWNWARD AND DOT ABOVE; D; HAH
0219 06C0; HAMZA ON HEH; R; TEH MARBUTA
0220 06C1; HEH GOAL; D; HEH GOAL
0221 06C2; HAMZA ON HEH GOAL; D; HEH GOAL
0222 06C3; TEH MARBUTA GOAL; R; TEH MARBUTA GOAL
0223 06C4; WAW WITH RING; R; WAW
0224 06C5; WAW WITH BAR; R; WAW
0225 06C6; WAW WITH SMALL V; R; WAW
0226 06C7; WAW WITH DAMMA; R; WAW
0227 06C8; WAW WITH ALEF ABOVE; R; WAW
0228 06C9; WAW WITH INVERTED SMALL V; R; WAW
0229 06CA; WAW WITH 2 DOTS ABOVE; R; WAW
0230 06CB; WAW WITH 3 DOTS ABOVE; R; WAW
0231 06CC; FARSI YEH; D; FARSI YEH
0232 06CD; YEH WITH TAIL; R; YEH WITH TAIL
0233 06CE; FARSI YEH WITH SMALL V; D; FARSI YEH
0234 06CF; WAW WITH DOT ABOVE; R; WAW
0235 06D0; YEH WITH 2 DOTS VERTICAL BELOW; D; YEH
0236 06D1; YEH WITH 3 DOTS BELOW; D; YEH
0237 06D2; YEH BARREE; R; YEH BARREE
0238 06D3; HAMZA ON YEH BARREE; R; YEH BARREE
0239 06D5; AE; R; TEH MARBUTA
0240 06DD; ARABIC END OF AYAH; U; No_Joining_Group
0241 06EE; DAL WITH INVERTED V; R; DAL
0242 06EF; REH WITH INVERTED V; R; REH
0243 06FA; SEEN WITH DOT BELOW AND 3 DOTS ABOVE; D; SEEN
0244 06FB; DAD WITH DOT BELOW; D; SAD
0245 06FC; GHAIN WITH DOT BELOW; D; AIN
0246 06FF; HEH WITH INVERTED V; D; KNOTTED HEH
0247 
0248 # Syriac characters
0249 
0250 0710; ALAPH; R; ALAPH
0251 0712; BETH; D; BETH
0252 0713; GAMAL; D; GAMAL
0253 0714; GAMAL GARSHUNI; D; GAMAL
0254 0715; DALATH; R; DALATH RISH
0255 0716; DOTLESS DALATH RISH; R; DALATH RISH
0256 0717; HE; R; HE
0257 0718; WAW; R; SYRIAC WAW
0258 0719; ZAIN; R; ZAIN
0259 071A; HETH; D; HETH
0260 071B; TETH; D; TETH
0261 071C; TETH GARSHUNI; D; TETH
0262 071D; YUDH; D; YUDH
0263 071E; YUDH HE; R; YUDH HE
0264 071F; KAPH; D; KAPH
0265 0720; LAMADH; D; LAMADH
0266 0721; MIM; D; MIM
0267 0722; NUN; D; NUN
0268 0723; SEMKATH; D; SEMKATH
0269 0724; FINAL SEMKATH; D; FINAL SEMKATH
0270 0725; E; D; E
0271 0726; PE; D; PE
0272 0727; REVERSED PE; D; REVERSED PE
0273 0728; SADHE; R; SADHE
0274 0729; QAPH; D; QAPH
0275 072A; RISH; R; DALATH RISH
0276 072B; SHIN; D; SHIN
0277 072C; TAW; R; TAW
0278 072D; PERSIAN BHETH; D; BETH
0279 072E; PERSIAN GHAMAL; D; GAMAL
0280 072F; PERSIAN DHALATH; R; DALATH RISH
0281 074D; SOGDIAN ZHAIN; R; ZHAIN
0282 074E; SOGDIAN KHAPH; D; KHAPH
0283 074F; SOGDIAN FE; D; FE
0284 
0285 # Arabic supplement characters
0286 
0287 0750; BEH WITH 3 DOTS HORIZONTALLY BELOW; D; BEH
0288 0751; BEH WITH DOT BELOW AND 3 DOTS ABOVE; D; BEH
0289 0752; BEH WITH 3 DOTS POINTING UPWARDS BELOW; D; BEH
0290 0753; BEH WITH 3 DOTS POINTING UPWARDS BELOW AND 2 DOTS ABOVE; D; BEH
0291 0754; BEH WITH 2 DOTS BELOW AND DOT ABOVE; D; BEH
0292 0755; BEH WITH INVERTED SMALL V BELOW; D; BEH
0293 0756; BEH WITH SMALL V; D; BEH
0294 0757; HAH WITH 2 DOTS ABOVE; D; HAH
0295 0758; HAH WITH 3 DOTS POINTING UPWARDS BELOW; D; HAH
0296 0759; DAL WITH 2 DOTS VERTICALLY BELOW AND SMALL TAH; R; DAL
0297 075A; DAL WITH INVERTED SMALL V BELOW; R; DAL
0298 075B; REH WITH STROKE; R; REH
0299 075C; SEEN WITH 4 DOTS ABOVE; D; SEEN
0300 075D; AIN WITH 2 DOTS ABOVE; D; AIN
0301 075E; AIN WITH 3 DOTS POINTING DOWNWARDS ABOVE; D; AIN
0302 075F; AIN WITH 2 DOTS VERTICALLY ABOVE; D; AIN
0303 0760; FEH WITH 2 DOTS BELOW; D; FEH
0304 0761; FEH WITH 3 DOTS POINTING UPWARDS BELOW; D; FEH
0305 0762; KEHEH WITH DOT ABOVE; D; GAF
0306 0763; KEHEH WITH 3 DOTS ABOVE; D; GAF
0307 0764; KEHEH WITH 3 DOTS POINTING UPWARDS BELOW; D; GAF
0308 0765; MEEM WITH DOT ABOVE; D; MEEM
0309 0766; MEEM WITH DOT BELOW; D; MEEM
0310 0767; NOON WITH 2 DOTS BELOW; D; NOON
0311 0768; NOON WITH SMALL TAH; D; NOON
0312 0769; NOON WITH SMALL V; D; NOON
0313 076A; LAM WITH BAR; D; LAM
0314 076B; REH WITH 2 DOTS VERTICALLY ABOVE; R; REH
0315 076C; REH WITH HAMZA ABOVE; R; REH
0316 076D; SEEN WITH 2 DOTS VERTICALLY ABOVE; D; SEEN
0317 076E; HAH WITH SMALL TAH BELOW; D; HAH
0318 076F; HAH WITH SMALL TAH AND 2 DOTS; D; HAH
0319 0770; SEEN WITH SMALL TAH AND 2 DOTS; D; SEEN
0320 0771; REH WITH SMALL TAH AND 2 DOTS; R; REH
0321 0772; HAH WITH SMALL TAH ABOVE; D; HAH
0322 0773; ALEF WITH DIGIT TWO ABOVE; R; ALEF
0323 0774; ALEF WITH DIGIT THREE ABOVE; R; ALEF
0324 0775; FARSI YEH WITH DIGIT TWO ABOVE; D; FARSI YEH
0325 0776; FARSI YEH WITH DIGIT THREE ABOVE; D; FARSI YEH
0326 0777; YEH WITH DIGIT FOUR BELOW; D; YEH
0327 0778; WAW WITH DIGIT TWO ABOVE; R; WAW
0328 0779; WAW WITH DIGIT THREE ABOVE; R; WAW
0329 077A; YEH BARREE WITH DIGIT TWO ABOVE; D; BURUSHASKI YEH BARREE
0330 077B; YEH BARREE WITH DIGIT THREE ABOVE; D; BURUSHASKI YEH BARREE
0331 077C; HAH WITH DIGIT FOUR BELOW; D; HAH
0332 077D; SEEN WITH DIGIT FOUR ABOVE; D; SEEN
0333 077E; SEEN WITH INVERTED V; D; SEEN
0334 077F; KAF WITH 2 DOTS ABOVE; D; KAF
0335 
0336 # N'Ko Characters
0337 
0338 07CA; NKO A; D; No_Joining_Group
0339 07CB; NKO EE; D; No_Joining_Group
0340 07CC; NKO I; D; No_Joining_Group
0341 07CD; NKO E; D; No_Joining_Group
0342 07CE; NKO U; D; No_Joining_Group
0343 07CF; NKO OO; D; No_Joining_Group
0344 07D0; NKO O; D; No_Joining_Group
0345 07D1; NKO DAGBASINNA; D; No_Joining_Group
0346 07D2; NKO N; D; No_Joining_Group
0347 07D3; NKO BA; D; No_Joining_Group
0348 07D4; NKO PA; D; No_Joining_Group
0349 07D5; NKO TA; D; No_Joining_Group
0350 07D6; NKO JA; D; No_Joining_Group
0351 07D7; NKO CHA; D; No_Joining_Group
0352 07D8; NKO DA; D; No_Joining_Group
0353 07D9; NKO RA; D; No_Joining_Group
0354 07DA; NKO RRA; D; No_Joining_Group
0355 07DB; NKO SA; D; No_Joining_Group
0356 07DC; NKO GBA; D; No_Joining_Group
0357 07DD; NKO FA; D; No_Joining_Group
0358 07DE; NKO KA; D; No_Joining_Group
0359 07DF; NKO LA; D; No_Joining_Group
0360 07E0; NKO NA WOLOSO; D; No_Joining_Group
0361 07E1; NKO MA; D; No_Joining_Group
0362 07E2; NKO NYA; D; No_Joining_Group
0363 07E3; NKO NA; D; No_Joining_Group
0364 07E4; NKO HA; D; No_Joining_Group
0365 07E5; NKO WA; D; No_Joining_Group
0366 07E6; NKO YA; D; No_Joining_Group
0367 07E7; NKO NYA WOLOSO; D; No_Joining_Group
0368 07E8; NKO JONA JA; D; No_Joining_Group
0369 07E9; NKO JONA CHA; D; No_Joining_Group
0370 07EA; NKO JONA RA; D; No_Joining_Group
0371 07FA; NKO LAJANYALAN; C; No_Joining_Group
0372 
0373 # Other
0374 
0375 200C; ZERO WIDTH NON-JOINER; U; No_Joining_Group
0376 200D; ZERO WIDTH JOINER; C; No_Joining_Group
0377 
0378 # EOF