Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
13 changes: 11 additions & 2 deletions unicodetools/data/security/dev/confusables.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# confusables.txt
# Date: 2025-10-25, 07:52:31 GMT
# Date: 2025-11-12, 00:37:27 GMT
# © 2025 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
Expand Down Expand Up @@ -7550,6 +7550,8 @@ FA0C ; 5140 ; MA # ( 兀 → 兀 ) CJK COMPATIBILITY IDEOGRAPH-FA0C → CJK UNIF

FA74 ; 5145 ; MA # ( 充 → 充 ) CJK COMPATIBILITY IDEOGRAPH-FA74 → CJK UNIFIED IDEOGRAPH-5145 #

5151 ; 514C ; MA # ( 兑 → 兌 ) CJK UNIFIED IDEOGRAPH-5151 → CJK UNIFIED IDEOGRAPH-514C #

FA32 ; 514D ; MA # ( 免 → 免 ) CJK COMPATIBILITY IDEOGRAPH-FA32 → CJK UNIFIED IDEOGRAPH-514D #
2F80E ; 514D ; MA # ( 免 → 免 ) CJK COMPATIBILITY IDEOGRAPH-2F80E → CJK UNIFIED IDEOGRAPH-514D #

Expand Down Expand Up @@ -7950,6 +7952,8 @@ FA04 ; 5B85 ; MA # ( 宅 → 宅 ) CJK COMPATIBILITY IDEOGRAPH-FA04 → CJK UNIF

2F86D ; 5BC3 ; MA # ( 寃 → 寃 ) CJK COMPATIBILITY IDEOGRAPH-2F86D → CJK UNIFIED IDEOGRAPH-5BC3 #

96BA ; 5BC9 ; MA # ( 隺 → 寉 ) CJK UNIFIED IDEOGRAPH-96BA → CJK UNIFIED IDEOGRAPH-5BC9 #

2F86E ; 5BD8 ; MA # ( 寘 → 寘 ) CJK COMPATIBILITY IDEOGRAPH-2F86E → CJK UNIFIED IDEOGRAPH-5BD8 #

F95F ; 5BE7 ; MA # ( 寧 → 寧 ) CJK COMPATIBILITY IDEOGRAPH-F95F → CJK UNIFIED IDEOGRAPH-5BE7 #
Expand Down Expand Up @@ -9341,6 +9345,8 @@ F9C2 ; 84FC ; MA # ( 蓼 → 蓼 ) CJK COMPATIBILITY IDEOGRAPH-F9C2 → CJK UNIF

2F9AC ; 8564 ; MA # ( 蕤 → 蕤 ) CJK COMPATIBILITY IDEOGRAPH-2F9AC → CJK UNIFIED IDEOGRAPH-8564 #

32A8F ; 2EDB5 ; MA # ( 𲪏 → 𮶵 ) CJK UNIFIED IDEOGRAPH-32A8F → CJK UNIFIED IDEOGRAPH-2EDB5 #

2F9AD ; 26F2C ; MA # ( 𦼬 → 𦼬 ) CJK COMPATIBILITY IDEOGRAPH-2F9AD → CJK UNIFIED IDEOGRAPH-26F2C #

F923 ; 85CD ; MA # ( 藍 → 藍 ) CJK COMPATIBILITY IDEOGRAPH-F923 → CJK UNIFIED IDEOGRAPH-85CD #
Expand Down Expand Up @@ -9581,6 +9587,8 @@ F937 ; 8DEF ; MA # ( 路 → 路 ) CJK COMPATIBILITY IDEOGRAPH-F937 → CJK UNIF

2F9D ; 8EAB ; MA #* ( ⾝ → 身 ) KANGXI RADICAL BODY → CJK UNIFIED IDEOGRAPH-8EAB #

8EB2 ; 8EB1 ; MA # ( 躲 → 躱 ) CJK UNIFIED IDEOGRAPH-8EB2 → CJK UNIFIED IDEOGRAPH-8EB1 #

F902 ; 8ECA ; MA # ( 車 → 車 ) CJK COMPATIBILITY IDEOGRAPH-F902 → CJK UNIFIED IDEOGRAPH-8ECA #
2F9E ; 8ECA ; MA #* ( ⾞ → 車 ) KANGXI RADICAL CART → CJK UNIFIED IDEOGRAPH-8ECA #

Expand Down Expand Up @@ -9810,6 +9818,7 @@ FACA ; 97FF ; MA # ( 響 → 響 ) CJK COMPATIBILITY IDEOGRAPH-FACA → CJK UNIF
FACB ; 980B ; MA # ( 頋 → 頋 ) CJK COMPATIBILITY IDEOGRAPH-FACB → CJK UNIFIED IDEOGRAPH-980B #
2F9FE ; 980B ; MA # ( 頋 → 頋 ) CJK COMPATIBILITY IDEOGRAPH-2F9FE → CJK UNIFIED IDEOGRAPH-980B #
2F9FF ; 980B ; MA # ( 頋 → 頋 ) CJK COMPATIBILITY IDEOGRAPH-2F9FF → CJK UNIFIED IDEOGRAPH-980B #
2EA07 ; 980B ; MA # ( 𮨇 → 頋 ) CJK UNIFIED IDEOGRAPH-2EA07 → CJK UNIFIED IDEOGRAPH-980B #

F9B4 ; 9818 ; MA # ( 領 → 領 ) CJK COMPATIBILITY IDEOGRAPH-F9B4 → CJK UNIFIED IDEOGRAPH-9818 #

Expand Down Expand Up @@ -10014,5 +10023,5 @@ FACE ; 9F9C ; MA # ( 龜 → 龜 ) CJK COMPATIBILITY IDEOGRAPH-FACE → CJK UNIF

2FD5 ; 9FA0 ; MA #* ( ⿕ → 龠 ) KANGXI RADICAL FLUTE → CJK UNIFIED IDEOGRAPH-9FA0 #

# total: 6605
# total: 6610

23 changes: 20 additions & 3 deletions unicodetools/data/security/dev/confusablesSummary.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# confusablesSummary.txt
# Date: 2025-10-25, 07:52:31 GMT
# Date: 2025-11-12, 00:37:27 GMT
# © 2025 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
Expand Down Expand Up @@ -14197,6 +14197,10 @@
(‎ 充 ‎) 5145 CJK UNIFIED IDEOGRAPH-5145
← (‎ 充 ‎) FA74 CJK COMPATIBILITY IDEOGRAPH-FA74

# 兌 兑
(‎ 兌 ‎) 514C CJK UNIFIED IDEOGRAPH-514C
← (‎ 兑 ‎) 5151 CJK UNIFIED IDEOGRAPH-5151

# 免 免 免
(‎ 免 ‎) 514D CJK UNIFIED IDEOGRAPH-514D
← (‎ 免 ‎) FA32 CJK COMPATIBILITY IDEOGRAPH-FA32
Expand Down Expand Up @@ -14734,6 +14738,10 @@
(‎ 寃 ‎) 5BC3 CJK UNIFIED IDEOGRAPH-5BC3
← (‎ 寃 ‎) 2F86D CJK COMPATIBILITY IDEOGRAPH-2F86D

# 寉 隺
(‎ 寉 ‎) 5BC9 CJK UNIFIED IDEOGRAPH-5BC9
← (‎ 隺 ‎) 96BA CJK UNIFIED IDEOGRAPH-96BA

# 寘 寘
(‎ 寘 ‎) 5BD8 CJK UNIFIED IDEOGRAPH-5BD8
← (‎ 寘 ‎) 2F86E CJK COMPATIBILITY IDEOGRAPH-2F86E
Expand Down Expand Up @@ -16715,6 +16723,10 @@
(‎ 躗 ‎) 8E97 CJK UNIFIED IDEOGRAPH-8E97
← (‎ 躛 ‎) 8E9B CJK UNIFIED IDEOGRAPH-8E9B

# 躱 躲
(‎ 躱 ‎) 8EB1 CJK UNIFIED IDEOGRAPH-8EB1
← (‎ 躲 ‎) 8EB2 CJK UNIFIED IDEOGRAPH-8EB2

# 軔 軔
(‎ 軔 ‎) 8ED4 CJK UNIFIED IDEOGRAPH-8ED4
← (‎ 軔 ‎) 2F9DE CJK COMPATIBILITY IDEOGRAPH-2F9DE
Expand Down Expand Up @@ -16956,8 +16968,9 @@
← (‎ 響 ‎) FA69 CJK COMPATIBILITY IDEOGRAPH-FA69
← (‎ 響 ‎) FACA CJK COMPATIBILITY IDEOGRAPH-FACA

# 頋 頋 頋 頋
# 頋 𮨇 頋 頋 頋
(‎ 頋 ‎) 980B CJK UNIFIED IDEOGRAPH-980B
← (‎ 𮨇 ‎) 2EA07 CJK UNIFIED IDEOGRAPH-2EA07
← (‎ 頋 ‎) FACB CJK COMPATIBILITY IDEOGRAPH-FACB
← (‎ 頋 ‎) 2F9FE CJK COMPATIBILITY IDEOGRAPH-2F9FE
← (‎ 頋 ‎) 2F9FF CJK COMPATIBILITY IDEOGRAPH-2F9FF
Expand Down Expand Up @@ -17872,5 +17885,9 @@
(‎ 𪘀 ‎) 2A600 CJK UNIFIED IDEOGRAPH-2A600
← (‎ 𪘀 ‎) 2FA1D CJK COMPATIBILITY IDEOGRAPH-2FA1D

# total : 7659
# 𮶵 𲪏
(‎ 𮶵 ‎) 2EDB5 CJK UNIFIED IDEOGRAPH-2EDB5
← (‎ 𲪏 ‎) 32A8F CJK UNIFIED IDEOGRAPH-32A8F

# total : 7664

Original file line number Diff line number Diff line change
Expand Up @@ -5811,3 +5811,10 @@ A8CF ; 007C 007C # SAURASHTRA DOUBLE DANDA
17C4 ; 17C1 17B6
17C7 ; 0983
11303 ; 0983

# CJK confusables from UTC #185 Action Items
5BC9 ; 96BA
8EB1 ; 8EB2
514C ; 5151
980B ; 2EA07
2EDB5 ; 32A8F
12 changes: 11 additions & 1 deletion unicodetools/data/security/dev/data/source/formatted-source.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# formatted-source.txt
# Date: 2025-10-25, 07:52:30 GMT
# Date: 2025-11-12, 00:37:25 GMT
# © 2025 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
Expand Down Expand Up @@ -4400,6 +4400,8 @@
513F ; 16FF2 # ( 儿 ~ 𖿲 ) CJK UNIFIED IDEOGRAPH-513F ~ CHINESE SMALL SIMPLIFIED ER
513F ; 3126 # ( 儿 ~ ㄦ ) CJK UNIFIED IDEOGRAPH-513F ~ BOPOMOFO LETTER ER

514C ; 5151 # ( 兌 ~ 兑 ) CJK UNIFIED IDEOGRAPH-514C ~ CJK UNIFIED IDEOGRAPH-5151

5553 ; 555F # ( 啓 ~ 啟 ) CJK UNIFIED IDEOGRAPH-5553 ~ CJK UNIFIED IDEOGRAPH-555F

5861 ; 586B # ( 塡 ~ 填 ) CJK UNIFIED IDEOGRAPH-5861 ~ CJK UNIFIED IDEOGRAPH-586B
Expand All @@ -4408,6 +4410,8 @@

5AAF ; 5B00 # ( 媯 ~ 嬀 ) CJK UNIFIED IDEOGRAPH-5AAF ~ CJK UNIFIED IDEOGRAPH-5B00

5BC9 ; 96BA # ( 寉 ~ 隺 ) CJK UNIFIED IDEOGRAPH-5BC9 ~ CJK UNIFIED IDEOGRAPH-96BA

5CC0 ; 2B73A # ( 峀 ~ 𫜺 ) CJK UNIFIED IDEOGRAPH-5CC0 ~ CJK UNIFIED IDEOGRAPH-2B73A

5DFF ; 5E02 # ( 巿 ~ 市 ) CJK UNIFIED IDEOGRAPH-5DFF ~ CJK UNIFIED IDEOGRAPH-5E02
Expand Down Expand Up @@ -4462,12 +4466,16 @@

8E97 ; 8E9B # ( 躗 ~ 躛 ) CJK UNIFIED IDEOGRAPH-8E97 ~ CJK UNIFIED IDEOGRAPH-8E9B

8EB1 ; 8EB2 # ( 躱 ~ 躲 ) CJK UNIFIED IDEOGRAPH-8EB1 ~ CJK UNIFIED IDEOGRAPH-8EB2

8EFF ; 8F27 # ( 軿 ~ 輧 ) CJK UNIFIED IDEOGRAPH-8EFF ~ CJK UNIFIED IDEOGRAPH-8F27

8FB6 ; 2ECC # ( 辶 ~ ⻌ ) CJK UNIFIED IDEOGRAPH-8FB6 ~ CJK RADICAL SIMPLIFIED WALK

93AD ; 93AE # ( 鎭 ~ 鎮 ) CJK UNIFIED IDEOGRAPH-93AD ~ CJK UNIFIED IDEOGRAPH-93AE

980B ; 2EA07 # ( 頋 ~ 𮨇 ) CJK UNIFIED IDEOGRAPH-980B ~ CJK UNIFIED IDEOGRAPH-2EA07

9E42 ; 9E43 # ( 鹂 ~ 鹃 ) CJK UNIFIED IDEOGRAPH-9E42 ~ CJK UNIFIED IDEOGRAPH-9E43

A04A ; A49E # ( ꁊ ~ ꒞ ) YI SYLLABLE PUT ~ YI RADICAL PUT
Expand Down Expand Up @@ -4768,6 +4776,8 @@ A99D ; A9A3 # ( ꦝ ~ ꦣ ) JAVANESE LETTER DDA ~ JAVANESE LETTER DA MAHAPRANA

2D161 ; 2F82D # ( 𭅡 ~ 卑 ) CJK UNIFIED IDEOGRAPH-2D161 ~ CJK COMPATIBILITY IDEOGRAPH-2F82D

2EDB5 ; 32A8F # ( 𮶵 ~ 𲪏 ) CJK UNIFIED IDEOGRAPH-2EDB5 ~ CJK UNIFIED IDEOGRAPH-32A8F

31E7C ; 2F96E # ( 𱹼 ~ 緇 ) CJK UNIFIED IDEOGRAPH-31E7C ~ CJK COMPATIBILITY IDEOGRAPH-2F96E

FB54 ; FBE6 # ( ‎ﭔ‎ ~ ‎ﯦ‎ ) ARABIC LETTER BEEH INITIAL FORM ~ ARABIC LETTER E INITIAL FORM
Expand Down
Loading