Skip to content

Commit 5879e56

Browse files
authored
Arabic crown letters (#833)
* UnicodeData.txt lines from the proposal * No U+ * Too many semicolons * lb=AL for the letters, lb=CM for the crown * Arabic * ArabicShaping.txt from the proposal * New Joining_Groups * Regenerate UCD * GenerateEnums * Updated ArabicShaping.txt * More Joining_Groups * Regenerate UCD * GenerateEnums * Move the security invariants to their own CI check * run * cd * Now add the file * Bring back accidentally removed ArabicShaping.txt lines * MCM * Regenerate UCD * EMIT_GITHUB_ERRORS * Do not use ICU property values * emit errors for the right file * hoist * deduplicate * Mark’d ye his words? he would not take yͤ Crowne * Regenerate UCD * The merging will continue until morale improves
1 parent 30dd6f0 commit 5879e56

24 files changed

+262
-77
lines changed

unicodetools/data/ucd/dev/ArabicShaping.txt

Lines changed: 22 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -853,6 +853,28 @@ A873; PHAGS-PA CANDRABINDU; U; No_Joining_Group
853853
10EC4; KAF WITH VERTICAL 2 DOTS BELOW; D; KAF
854854
10EC6; THIN NOON; D; THIN NOON
855855
10EC7; DOTLESS YEH WITH 4 DOTS BELOW; D; YEH
856+
10ED9; CROWN BEH; L; CROWN BEH
857+
10EDA; DOTLESS CROWN BEH WITH 3 DOTS BELOW; L; CROWN BEH
858+
10EDB; DOTLESS CROWN BEH WITH 2 DOTS ABOVE; L; CROWN BEH
859+
10EDC; DOTLESS CROWN BEH WITH 3 DOTS ABOVE; L; CROWN BEH
860+
10EDD; CROWN HAH WITH DOT BELOW; L; CROWN HAH
861+
10EDE; CROWN HAH; L; CROWN HAH
862+
10EDF; CROWN HAH WITH DOT ABOVE; L; CROWN HAH
863+
10EE0; CROWN SEEN; L; CROWN SEEN
864+
10EE1; CROWN SEEN WITH 3 DOTS ABOVE; L; CROWN SEEN
865+
10EE2; CROWN SAD; L; CROWN SAD
866+
10EE3; CROWN SAD WITH DOT ABOVE; L; CROWN SAD
867+
10EE4; CROWN TAH; L; CROWN TAH
868+
10EE5; CROWN TAH WITH DOT ABOVE; L; CROWN TAH
869+
10EE6; CROWN AIN; L; CROWN AIN
870+
10EE7; CROWN AIN WITH DOT ABOVE; L; CROWN AIN
871+
10EE8; CROWN FEH; L; CROWN FEH
872+
10EE9; DOTLESS CROWN FEH WITH TWO DOTS ABOVE; L; CROWN FEH
873+
10EEA; CROWN KAF; L; CROWN KAF
874+
10EEB; CROWN MEEM; L; CROWN MEEM
875+
10EEC; DOTLESS CROWN BEH WITH DOT ABOVE; L; CROWN BEH
876+
10EED; CROWN HEH; L; CROWN HEH
877+
10EEE; DOTLESS CROWN BEH WITH 2 DOTS BELOW; L; CROWN BEH
856878

857879
# Sogdian Characters
858880

unicodetools/data/ucd/dev/DerivedAge.txt

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# DerivedAge-18.0.0.txt
2-
# Date: 2025-11-11, 17:40:05 GMT
2+
# Date: 2025-11-12, 22:34:41 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -2123,11 +2123,13 @@ FDC8..FDCE ; 17.0 # [7] ARABIC LIGATURE RAHIMAHU ALLAAH TAAALAA..ARABIC LIG
21232123
# Newly assigned in Unicode 18.0.0 (September, 2025)
21242124

21252125
20C2..20C3 ; 18.0 # [2] RUFIYAA SIGN..UAE DIRHAM SIGN
2126+
10ED9..10EEE ; 18.0 # [22] ARABIC CROWN LETTER BEH..ARABIC CROWN LETTER YEH
2127+
10EF9 ; 18.0 # ARABIC MARK CROWN
21262128
18CD6..18CDA ; 18.0 # [5] KHITAN SMALL SCRIPT CHARACTER-18CD6..KHITAN SMALL SCRIPT CHARACTER-18CDA
21272129
18D1F..18D20 ; 18.0 # [2] TANGUT IDEOGRAPH-18D1F..TANGUT IDEOGRAPH-18D20
21282130
1F7DB ; 18.0 # BULLET IN DOUBLE CIRCLE
21292131
1F7F1..1F7FF ; 18.0 # [15] CIRCLE WITH DOUBLE VERTICAL AND HORIZONTAL LINE..RHOMBUS
21302132

2131-
# Total code points: 25
2133+
# Total code points: 48
21322134

21332135
# EOF

unicodetools/data/ucd/dev/DerivedCoreProperties.txt

Lines changed: 21 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# DerivedCoreProperties-18.0.0.txt
2-
# Date: 2025-11-11, 17:40:24 GMT
2+
# Date: 2025-11-12, 22:35:08 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -1055,6 +1055,7 @@ FFDA..FFDC ; Alphabetic # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANG
10551055
10EC2..10EC4 ; Alphabetic # Lo [3] ARABIC LETTER DAL WITH TWO DOTS VERTICALLY BELOW..ARABIC LETTER KAF WITH TWO DOTS VERTICALLY BELOW
10561056
10EC5 ; Alphabetic # Lm ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW
10571057
10EC6..10EC7 ; Alphabetic # Lo [2] ARABIC LETTER THIN NOON..ARABIC LETTER YEH WITH FOUR DOTS BELOW
1058+
10ED9..10EEE ; Alphabetic # Lo [22] ARABIC CROWN LETTER BEH..ARABIC CROWN LETTER YEH
10581059
10EFA..10EFC ; Alphabetic # Mn [3] ARABIC DOUBLE VERTICAL BAR BELOW..ARABIC COMBINING ALEF OVERLAY
10591060
10F00..10F1C ; Alphabetic # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
10601061
10F27 ; Alphabetic # Lo OLD SOGDIAN LIGATURE AYIN-DALETH
@@ -1466,7 +1467,7 @@ FFDA..FFDC ; Alphabetic # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANG
14661467
30000..3134A ; Alphabetic # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
14671468
31350..33479 ; Alphabetic # Lo [8490] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-33479
14681469

1469-
# Total code points: 147428
1470+
# Total code points: 147450
14701471

14711472
# ================================================
14721473

@@ -3382,7 +3383,7 @@ FFF9..FFFB ; Case_Ignorable # Cf [3] INTERLINEAR ANNOTATION ANCHOR..INTERLI
33823383
10D6F ; Case_Ignorable # Lm GARAY REDUPLICATION MARK
33833384
10EAB..10EAC ; Case_Ignorable # Mn [2] YEZIDI COMBINING HAMZA MARK..YEZIDI COMBINING MADDA MARK
33843385
10EC5 ; Case_Ignorable # Lm ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW
3385-
10EFA..10EFF ; Case_Ignorable # Mn [6] ARABIC DOUBLE VERTICAL BAR BELOW..ARABIC SMALL LOW WORD MADDA
3386+
10EF9..10EFF ; Case_Ignorable # Mn [7] ARABIC MARK CROWN..ARABIC SMALL LOW WORD MADDA
33863387
10F46..10F50 ; Case_Ignorable # Mn [11] SOGDIAN COMBINING DOT BELOW..SOGDIAN COMBINING STROKE BELOW
33873388
10F82..10F85 ; Case_Ignorable # Mn [4] OLD UYGHUR COMBINING DOT ABOVE..OLD UYGHUR COMBINING TWO DOTS BELOW
33883389
11001 ; Case_Ignorable # Mn BRAHMI SIGN ANUSVARA
@@ -3547,7 +3548,7 @@ E0001 ; Case_Ignorable # Cf LANGUAGE TAG
35473548
E0020..E007F ; Case_Ignorable # Cf [96] TAG SPACE..CANCEL TAG
35483549
E0100..E01EF ; Case_Ignorable # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
35493550

3550-
# Total code points: 2794
3551+
# Total code points: 2795
35513552

35523553
# ================================================
35533554

@@ -6792,6 +6793,7 @@ FFDA..FFDC ; ID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
67926793
10EC2..10EC4 ; ID_Start # Lo [3] ARABIC LETTER DAL WITH TWO DOTS VERTICALLY BELOW..ARABIC LETTER KAF WITH TWO DOTS VERTICALLY BELOW
67936794
10EC5 ; ID_Start # Lm ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW
67946795
10EC6..10EC7 ; ID_Start # Lo [2] ARABIC LETTER THIN NOON..ARABIC LETTER YEH WITH FOUR DOTS BELOW
6796+
10ED9..10EEE ; ID_Start # Lo [22] ARABIC CROWN LETTER BEH..ARABIC CROWN LETTER YEH
67956797
10F00..10F1C ; ID_Start # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
67966798
10F27 ; ID_Start # Lo OLD SOGDIAN LIGATURE AYIN-DALETH
67976799
10F30..10F45 ; ID_Start # Lo [22] SOGDIAN LETTER ALEPH..SOGDIAN INDEPENDENT SHIN
@@ -7038,7 +7040,7 @@ FFDA..FFDC ; ID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
70387040
30000..3134A ; ID_Start # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
70397041
31350..33479 ; ID_Start # Lo [8490] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-33479
70407042

7041-
# Total code points: 145923
7043+
# Total code points: 145945
70427044

70437045
# ================================================
70447046

@@ -7972,7 +7974,8 @@ FFDA..FFDC ; ID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HAN
79727974
10EC2..10EC4 ; ID_Continue # Lo [3] ARABIC LETTER DAL WITH TWO DOTS VERTICALLY BELOW..ARABIC LETTER KAF WITH TWO DOTS VERTICALLY BELOW
79737975
10EC5 ; ID_Continue # Lm ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW
79747976
10EC6..10EC7 ; ID_Continue # Lo [2] ARABIC LETTER THIN NOON..ARABIC LETTER YEH WITH FOUR DOTS BELOW
7975-
10EFA..10EFF ; ID_Continue # Mn [6] ARABIC DOUBLE VERTICAL BAR BELOW..ARABIC SMALL LOW WORD MADDA
7977+
10ED9..10EEE ; ID_Continue # Lo [22] ARABIC CROWN LETTER BEH..ARABIC CROWN LETTER YEH
7978+
10EF9..10EFF ; ID_Continue # Mn [7] ARABIC MARK CROWN..ARABIC SMALL LOW WORD MADDA
79767979
10F00..10F1C ; ID_Continue # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
79777980
10F27 ; ID_Continue # Lo OLD SOGDIAN LIGATURE AYIN-DALETH
79787981
10F30..10F45 ; ID_Continue # Lo [22] SOGDIAN LETTER ALEPH..SOGDIAN INDEPENDENT SHIN
@@ -8471,7 +8474,7 @@ FFDA..FFDC ; ID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HAN
84718474
31350..33479 ; ID_Continue # Lo [8490] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-33479
84728475
E0100..E01EF ; ID_Continue # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
84738476

8474-
# Total code points: 149247
8477+
# Total code points: 149270
84758478

84768479
# ================================================
84778480

@@ -9016,6 +9019,7 @@ FFDA..FFDC ; XID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGU
90169019
10EC2..10EC4 ; XID_Start # Lo [3] ARABIC LETTER DAL WITH TWO DOTS VERTICALLY BELOW..ARABIC LETTER KAF WITH TWO DOTS VERTICALLY BELOW
90179020
10EC5 ; XID_Start # Lm ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW
90189021
10EC6..10EC7 ; XID_Start # Lo [2] ARABIC LETTER THIN NOON..ARABIC LETTER YEH WITH FOUR DOTS BELOW
9022+
10ED9..10EEE ; XID_Start # Lo [22] ARABIC CROWN LETTER BEH..ARABIC CROWN LETTER YEH
90199023
10F00..10F1C ; XID_Start # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
90209024
10F27 ; XID_Start # Lo OLD SOGDIAN LIGATURE AYIN-DALETH
90219025
10F30..10F45 ; XID_Start # Lo [22] SOGDIAN LETTER ALEPH..SOGDIAN INDEPENDENT SHIN
@@ -9262,7 +9266,7 @@ FFDA..FFDC ; XID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGU
92629266
30000..3134A ; XID_Start # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
92639267
31350..33479 ; XID_Start # Lo [8490] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-33479
92649268

9265-
# Total code points: 145900
9269+
# Total code points: 145922
92669270

92679271
# ================================================
92689272

@@ -10197,7 +10201,8 @@ FFDA..FFDC ; XID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HA
1019710201
10EC2..10EC4 ; XID_Continue # Lo [3] ARABIC LETTER DAL WITH TWO DOTS VERTICALLY BELOW..ARABIC LETTER KAF WITH TWO DOTS VERTICALLY BELOW
1019810202
10EC5 ; XID_Continue # Lm ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW
1019910203
10EC6..10EC7 ; XID_Continue # Lo [2] ARABIC LETTER THIN NOON..ARABIC LETTER YEH WITH FOUR DOTS BELOW
10200-
10EFA..10EFF ; XID_Continue # Mn [6] ARABIC DOUBLE VERTICAL BAR BELOW..ARABIC SMALL LOW WORD MADDA
10204+
10ED9..10EEE ; XID_Continue # Lo [22] ARABIC CROWN LETTER BEH..ARABIC CROWN LETTER YEH
10205+
10EF9..10EFF ; XID_Continue # Mn [7] ARABIC MARK CROWN..ARABIC SMALL LOW WORD MADDA
1020110206
10F00..10F1C ; XID_Continue # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
1020210207
10F27 ; XID_Continue # Lo OLD SOGDIAN LIGATURE AYIN-DALETH
1020310208
10F30..10F45 ; XID_Continue # Lo [22] SOGDIAN LETTER ALEPH..SOGDIAN INDEPENDENT SHIN
@@ -10696,7 +10701,7 @@ FFDA..FFDC ; XID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HA
1069610701
31350..33479 ; XID_Continue # Lo [8490] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-33479
1069710702
E0100..E01EF ; XID_Continue # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
1069810703

10699-
# Total code points: 149228
10704+
# Total code points: 149251
1070010705

1070110706
# ================================================
1070210707

@@ -11014,7 +11019,7 @@ FF9E..FF9F ; Grapheme_Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK.
1101411019
10D24..10D27 ; Grapheme_Extend # Mn [4] HANIFI ROHINGYA SIGN HARBAHAY..HANIFI ROHINGYA SIGN TASSI
1101511020
10D69..10D6D ; Grapheme_Extend # Mn [5] GARAY VOWEL SIGN E..GARAY CONSONANT NASALIZATION MARK
1101611021
10EAB..10EAC ; Grapheme_Extend # Mn [2] YEZIDI COMBINING HAMZA MARK..YEZIDI COMBINING MADDA MARK
11017-
10EFA..10EFF ; Grapheme_Extend # Mn [6] ARABIC DOUBLE VERTICAL BAR BELOW..ARABIC SMALL LOW WORD MADDA
11022+
10EF9..10EFF ; Grapheme_Extend # Mn [7] ARABIC MARK CROWN..ARABIC SMALL LOW WORD MADDA
1101811023
10F46..10F50 ; Grapheme_Extend # Mn [11] SOGDIAN COMBINING DOT BELOW..SOGDIAN COMBINING STROKE BELOW
1101911024
10F82..10F85 ; Grapheme_Extend # Mn [4] OLD UYGHUR COMBINING DOT ABOVE..OLD UYGHUR COMBINING TWO DOTS BELOW
1102011025
11001 ; Grapheme_Extend # Mn BRAHMI SIGN ANUSVARA
@@ -11176,7 +11181,7 @@ FF9E..FF9F ; Grapheme_Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK.
1117611181
E0020..E007F ; Grapheme_Extend # Cf [96] TAG SPACE..CANCEL TAG
1117711182
E0100..E01EF ; Grapheme_Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
1117811183

11179-
# Total code points: 2232
11184+
# Total code points: 2233
1118011185

1118111186
# ================================================
1118211187

@@ -12480,6 +12485,7 @@ FFFC..FFFD ; Grapheme_Base # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEME
1248012485
10EC6..10EC7 ; Grapheme_Base # Lo [2] ARABIC LETTER THIN NOON..ARABIC LETTER YEH WITH FOUR DOTS BELOW
1248112486
10ED0 ; Grapheme_Base # Po ARABIC BIBLICAL END OF VERSE
1248212487
10ED1..10ED8 ; Grapheme_Base # So [8] ARABIC LIGATURE ALAYHAA AS-SALAATU WAS-SALAAM..ARABIC LIGATURE NAWWARA ALLAAHU MARQADAH
12488+
10ED9..10EEE ; Grapheme_Base # Lo [22] ARABIC CROWN LETTER BEH..ARABIC CROWN LETTER YEH
1248312489
10F00..10F1C ; Grapheme_Base # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
1248412490
10F1D..10F26 ; Grapheme_Base # No [10] OLD SOGDIAN NUMBER ONE..OLD SOGDIAN FRACTION ONE HALF
1248512491
10F27 ; Grapheme_Base # Lo OLD SOGDIAN LIGATURE AYIN-DALETH
@@ -12985,7 +12991,7 @@ FFFC..FFFD ; Grapheme_Base # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEME
1298512991
30000..3134A ; Grapheme_Base # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
1298612992
31350..33479 ; Grapheme_Base # Lo [8490] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-33479
1298712993

12988-
# Total code points: 157519
12994+
# Total code points: 157541
1298912995

1299012996
# ================================================
1299112997

@@ -13436,7 +13442,7 @@ FF9E..FF9F ; InCB; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HA
1343613442
10D24..10D27 ; InCB; Extend # Mn [4] HANIFI ROHINGYA SIGN HARBAHAY..HANIFI ROHINGYA SIGN TASSI
1343713443
10D69..10D6D ; InCB; Extend # Mn [5] GARAY VOWEL SIGN E..GARAY CONSONANT NASALIZATION MARK
1343813444
10EAB..10EAC ; InCB; Extend # Mn [2] YEZIDI COMBINING HAMZA MARK..YEZIDI COMBINING MADDA MARK
13439-
10EFA..10EFF ; InCB; Extend # Mn [6] ARABIC DOUBLE VERTICAL BAR BELOW..ARABIC SMALL LOW WORD MADDA
13445+
10EF9..10EFF ; InCB; Extend # Mn [7] ARABIC MARK CROWN..ARABIC SMALL LOW WORD MADDA
1344013446
10F46..10F50 ; InCB; Extend # Mn [11] SOGDIAN COMBINING DOT BELOW..SOGDIAN COMBINING STROKE BELOW
1344113447
10F82..10F85 ; InCB; Extend # Mn [4] OLD UYGHUR COMBINING DOT ABOVE..OLD UYGHUR COMBINING TWO DOTS BELOW
1344213448
11001 ; InCB; Extend # Mn BRAHMI SIGN ANUSVARA
@@ -13596,6 +13602,6 @@ FF9E..FF9F ; InCB; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HA
1359613602
E0020..E007F ; InCB; Extend # Cf [96] TAG SPACE..CANCEL TAG
1359713603
E0100..E01EF ; InCB; Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
1359813604

13599-
# Total code points: 2217
13605+
# Total code points: 2218
1360013606

1360113607
# EOF

unicodetools/data/ucd/dev/EastAsianWidth.txt

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# EastAsianWidth-18.0.0.txt
2-
# Date: 2025-11-10, 23:51:44 GMT
2+
# Date: 2025-11-11, 12:32:11 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -1968,7 +1968,8 @@ FFFD ; A # So REPLACEMENT CHARACTER
19681968
10EC6..10EC7 ; N # Lo [2] ARABIC LETTER THIN NOON..ARABIC LETTER YEH WITH FOUR DOTS BELOW
19691969
10ED0 ; N # Po ARABIC BIBLICAL END OF VERSE
19701970
10ED1..10ED8 ; N # So [8] ARABIC LIGATURE ALAYHAA AS-SALAATU WAS-SALAAM..ARABIC LIGATURE NAWWARA ALLAAHU MARQADAH
1971-
10EFA..10EFF ; N # Mn [6] ARABIC DOUBLE VERTICAL BAR BELOW..ARABIC SMALL LOW WORD MADDA
1971+
10ED9..10EEE ; N # Lo [22] ARABIC CROWN LETTER BEH..ARABIC CROWN LETTER YEH
1972+
10EF9..10EFF ; N # Mn [7] ARABIC MARK CROWN..ARABIC SMALL LOW WORD MADDA
19721973
10F00..10F1C ; N # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
19731974
10F1D..10F26 ; N # No [10] OLD SOGDIAN NUMBER ONE..OLD SOGDIAN FRACTION ONE HALF
19741975
10F27 ; N # Lo OLD SOGDIAN LIGATURE AYIN-DALETH

unicodetools/data/ucd/dev/LineBreak.txt

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# LineBreak-18.0.0.txt
2-
# Date: 2025-11-10, 23:51:46 GMT
2+
# Date: 2025-11-11, 12:32:14 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -2825,7 +2825,8 @@ FFFD ; AI # So REPLACEMENT CHARACTER
28252825
10EC6..10EC7 ; AL # Lo [2] ARABIC LETTER THIN NOON..ARABIC LETTER YEH WITH FOUR DOTS BELOW
28262826
10ED0 ; BA # Po ARABIC BIBLICAL END OF VERSE
28272827
10ED1..10ED8 ; AL # So [8] ARABIC LIGATURE ALAYHAA AS-SALAATU WAS-SALAAM..ARABIC LIGATURE NAWWARA ALLAAHU MARQADAH
2828-
10EFA..10EFF ; CM # Mn [6] ARABIC DOUBLE VERTICAL BAR BELOW..ARABIC SMALL LOW WORD MADDA
2828+
10ED9..10EEE ; AL # Lo [22] ARABIC CROWN LETTER BEH..ARABIC CROWN LETTER YEH
2829+
10EF9..10EFF ; CM # Mn [7] ARABIC MARK CROWN..ARABIC SMALL LOW WORD MADDA
28292830
10F00..10F1C ; AL # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
28302831
10F1D..10F26 ; AL # No [10] OLD SOGDIAN NUMBER ONE..OLD SOGDIAN FRACTION ONE HALF
28312832
10F27 ; AL # Lo OLD SOGDIAN LIGATURE AYIN-DALETH

unicodetools/data/ucd/dev/NormalizationTest.txt

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
1-
# NormalizationTest-17.0.0.txt
2-
# Date: 2025-06-30, 06:16:16 GMT
1+
# NormalizationTest-18.0.0.txt
2+
# Date: 2025-11-11, 12:32:23 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -18701,6 +18701,8 @@ FFEE;FFEE;FFEE;25CB;25CB; # (○; ○; ○; ○; ○; ) HALFWIDTH WHITE CIRCLE
1870118701
0061 10EAB 0315 0300 05AE 0062;0061 05AE 10EAB 0300 0315 0062;0061 05AE 10EAB 0300 0315 0062;0061 05AE 10EAB 0300 0315 0062;0061 05AE 10EAB 0300 0315 0062; # (a◌𐺫◌̕◌̀◌֮b; a◌֮◌𐺫◌̀◌̕b; a◌֮◌𐺫◌̀◌̕b; a◌֮◌𐺫◌̀◌̕b; a◌֮◌𐺫◌̀◌̕b; ) LATIN SMALL LETTER A, YEZIDI COMBINING HAMZA MARK, COMBINING COMMA ABOVE RIGHT, COMBINING GRAVE ACCENT, HEBREW ACCENT ZINOR, LATIN SMALL LETTER B
1870218702
0061 0315 0300 05AE 10EAC 0062;00E0 05AE 10EAC 0315 0062;0061 05AE 0300 10EAC 0315 0062;00E0 05AE 10EAC 0315 0062;0061 05AE 0300 10EAC 0315 0062; # (a◌̕◌̀◌֮◌𐺬b; à◌֮◌𐺬◌̕b; a◌֮◌̀◌𐺬◌̕b; à◌֮◌𐺬◌̕b; a◌֮◌̀◌𐺬◌̕b; ) LATIN SMALL LETTER A, COMBINING COMMA ABOVE RIGHT, COMBINING GRAVE ACCENT, HEBREW ACCENT ZINOR, YEZIDI COMBINING MADDA MARK, LATIN SMALL LETTER B
1870318703
0061 10EAC 0315 0300 05AE 0062;0061 05AE 10EAC 0300 0315 0062;0061 05AE 10EAC 0300 0315 0062;0061 05AE 10EAC 0300 0315 0062;0061 05AE 10EAC 0300 0315 0062; # (a◌𐺬◌̕◌̀◌֮b; a◌֮◌𐺬◌̀◌̕b; a◌֮◌𐺬◌̀◌̕b; a◌֮◌𐺬◌̀◌̕b; a◌֮◌𐺬◌̀◌̕b; ) LATIN SMALL LETTER A, YEZIDI COMBINING MADDA MARK, COMBINING COMMA ABOVE RIGHT, COMBINING GRAVE ACCENT, HEBREW ACCENT ZINOR, LATIN SMALL LETTER B
18704+
0061 0315 0300 05AE 10EF9 0062;00E0 05AE 10EF9 0315 0062;0061 05AE 0300 10EF9 0315 0062;00E0 05AE 10EF9 0315 0062;0061 05AE 0300 10EF9 0315 0062; # (a◌̕◌̀◌֮◌𐻹b; à◌֮◌𐻹◌̕b; a◌֮◌̀◌𐻹◌̕b; à◌֮◌𐻹◌̕b; a◌֮◌̀◌𐻹◌̕b; ) LATIN SMALL LETTER A, COMBINING COMMA ABOVE RIGHT, COMBINING GRAVE ACCENT, HEBREW ACCENT ZINOR, ARABIC MARK CROWN, LATIN SMALL LETTER B
18705+
0061 10EF9 0315 0300 05AE 0062;0061 05AE 10EF9 0300 0315 0062;0061 05AE 10EF9 0300 0315 0062;0061 05AE 10EF9 0300 0315 0062;0061 05AE 10EF9 0300 0315 0062; # (a◌𐻹◌̕◌̀◌֮b; a◌֮◌𐻹◌̀◌̕b; a◌֮◌𐻹◌̀◌̕b; a◌֮◌𐻹◌̀◌̕b; a◌֮◌𐻹◌̀◌̕b; ) LATIN SMALL LETTER A, ARABIC MARK CROWN, COMBINING COMMA ABOVE RIGHT, COMBINING GRAVE ACCENT, HEBREW ACCENT ZINOR, LATIN SMALL LETTER B
1870418706
0061 059A 0316 1DFA 10EFA 0062;0061 1DFA 0316 10EFA 059A 0062;0061 1DFA 0316 10EFA 059A 0062;0061 1DFA 0316 10EFA 059A 0062;0061 1DFA 0316 10EFA 059A 0062; # (a◌֚◌̖◌᷺◌𐻺b; a◌᷺◌̖◌𐻺◌֚b; a◌᷺◌̖◌𐻺◌֚b; a◌᷺◌̖◌𐻺◌֚b; a◌᷺◌̖◌𐻺◌֚b; ) LATIN SMALL LETTER A, HEBREW ACCENT YETIV, COMBINING GRAVE ACCENT BELOW, COMBINING DOT BELOW LEFT, ARABIC DOUBLE VERTICAL BAR BELOW, LATIN SMALL LETTER B
1870518707
0061 10EFA 059A 0316 1DFA 0062;0061 1DFA 10EFA 0316 059A 0062;0061 1DFA 10EFA 0316 059A 0062;0061 1DFA 10EFA 0316 059A 0062;0061 1DFA 10EFA 0316 059A 0062; # (a◌𐻺◌֚◌̖◌᷺b; a◌᷺◌𐻺◌̖◌֚b; a◌᷺◌𐻺◌̖◌֚b; a◌᷺◌𐻺◌̖◌֚b; a◌᷺◌𐻺◌̖◌֚b; ) LATIN SMALL LETTER A, ARABIC DOUBLE VERTICAL BAR BELOW, HEBREW ACCENT YETIV, COMBINING GRAVE ACCENT BELOW, COMBINING DOT BELOW LEFT, LATIN SMALL LETTER B
1870618708
0061 059A 0316 1DFA 10EFB 0062;0061 1DFA 0316 10EFB 059A 0062;0061 1DFA 0316 10EFB 059A 0062;0061 1DFA 0316 10EFB 059A 0062;0061 1DFA 0316 10EFB 059A 0062; # (a◌֚◌̖◌᷺◌𐻻b; a◌᷺◌̖◌𐻻◌֚b; a◌᷺◌̖◌𐻻◌֚b; a◌᷺◌̖◌𐻻◌֚b; a◌᷺◌̖◌𐻻◌֚b; ) LATIN SMALL LETTER A, HEBREW ACCENT YETIV, COMBINING GRAVE ACCENT BELOW, COMBINING DOT BELOW LEFT, ARABIC SMALL LOW NOON, LATIN SMALL LETTER B

0 commit comments

Comments
 (0)