Please wait while loading

IRG Working Set 2024v4.0

Source: Lee COLLINS
Date: Generated on 2026-01-15

Show Deleted | Show comments from version: 1.0 2.0 3.0 4.0 | Show comments with status: Show All New Only Unresolved Only

Unification

Showing 5 comments.

SnImage/SourceComment TypeDescription
00619
00619
口 30.13.2
GZ-2352301
TS 16 · IDS
Oppose Unification
[ Unresolved from v3.0 ]
It's difficult to clarify the semantics of GZ-2352301 and U+20F4F. Given that most instances of the pairs are already encoded separately, it seems better not to unify.
00112
00112
人 9.4.3
SAT-10237
TS 6 · IDS 𠆢
Oppose Unification
[ Unresolved from v2.0 ]
Agree with #8122. It's always useful to have unusual components encoded for purposes of illustration.
00920
00920
土 32.11.2
UK-30590
TS 14 · IDS
Unification
[ Unresolved from v3.0 ]
This character is also used in VIetnamese, with reading "mảnh", as shown on P. 786 of ĐTĐCN shown below. Vietnam will also propose horizontal extension to 2D3EB, using the form ⿰土莽, which is better attested.

04643
04643
黑 203.4.5
幺 52.13.2
VN-F0188
TS 16 · IDS
Oppose Unification
[ Unresolved from v3.0 ]
The analysis in the evidence says that the shape is composed of "hắc (黑)" and "ảo (幻)". As shown below, 幻 read as nôm "ảo" has one meaning of dark, obscure, etc. This is the semantic in this character.




xxx Is a different word, "u" with a similar meaning of "dark"

Oppose Unification
[ Unresolved from v3.0 ]
I didn't finish the above, but meant to say that 黝 (U+9EDD) is basically a different word "u", used in the compound "u hắc", whereas VN-F0188 is "ảo", having a wider range of meanings: "not current; obscure, ephemeral, etc." They should be kept separate.

Attributes

Showing 143 comments.

SnImage/SourceComment TypeDescription
03601
03601
贝 154′.2.2
GCA-J0023
TS 6 · IDS
FS
[ Unresolved from v1.0 ]
FS=3
04602
04602
鸟 196′.10.1
GCA-J0159
TS 15 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
11
Total Stroke Count
[ Unresolved from v1.0 ]
16
04621
04621
卤 197′.8.2
GCA-T0002
TS 15 · IDS
FS
[ Unresolved from v1.0 ]
3
03267
03267
艸 140.7.4
GCW-00202
TS 11 · IDS
IDS
[ Unresolved from v2.0 ]
Agree with logic for IDS changed noted above.
03822
03822
辵 162.12.1
GCW-00228
TS 15 · IDS
Total Stroke Count
[ Unresolved from v1.0 ]
16
04396
04396
魚 195.3.5
GCW-00247
TS 14 · IDS
FS
[ Unresolved from v1.0 ]
2
04448
04448
鱼 195′.3.5
GCW-00248
TS 11 · IDS
FS
[ Unresolved from v1.0 ]
2
04609
04609
鸟 196′.11.5
GCW-00261
TS 16 · IDS
FS
[ Unresolved from v1.0 ]
2
04271
04271
马 187′.13.1
GDM-00489
TS 16 · IDS
FS
[ Unresolved from v1.0 ]
2
04485
04485
鱼 195′.7.5
GPGLG-4004
TS 15 · IDS
FS
[ Unresolved from v1.0 ]
2
03993
03993
邑 163.6.1
GXM-00476
TS 9 · IDS
Radical
[ Unresolved from v1.0 ]
163
Total Stroke Count
[ Unresolved from v1.0 ]
9
03999
03999
邑 163.8.3
GXM-00477
TS 11 · IDS
Radical
[ Unresolved from v1.0 ]
163
04170
04170
風 182.2.3
GZ-0341301
TS 11 · IDS
FS
[ Unresolved from v1.0 ]
2
03606
03606
贝 154′.6.4
GZ-1111302
TS 10 · IDS
FS
[ Unresolved from v1.0 ]
FS=2
04144
04144
革 177.8.3
网 122.13.1
GZ-2672204
TS 17 · IDS
FS
[ Unresolved from v1.0 ]
4
03702
03702
車 159.4.1
GZ-2912503
TS 11 · IDS
FS
[ Unresolved from v1.0 ]
FS=4
00744
00744
囗 31.17.4
GZ-2972401
TS 20 · IDS
Radical
[ Unresolved from v2.0 ]
radical is 31 (囗 U+56D7)
04569
04569
夕 36.23.2
鳥 196.15.3
GZ-3842501
TS 26 · IDS 𬷨
IDS
[ Unresolved from v1.0 ]
⿰外𬷨
04148
04148
韋 178.9.4
GZ-4901401
TS 19 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
10
Total Stroke Count
[ Unresolved from v1.0 ]
19
04168
04168
頁 181.15.3
GZHSJ-0005
TS 24 · IDS
IDS
[ Unresolved from v1.0 ]
⿳⿴𦥑同冖頁, 𦥑 produces the expected number of strokes.
04649
04649
黽 205.13.2
GZHSJ-0021
TS 26 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
14
Total Stroke Count
[ Unresolved from v1.0 ]
27
FS
[ Unresolved from v1.0 ]
1
03838
03838
邑 163.8.5
GZHSJ-0038
TS 10 · IDS
Total Stroke Count
[ Unresolved from v1.0 ]
11
04281
04281
骨 188.9.4
GZHSJ-0053
TS 18 · IDS
Total Stroke Count
[ Unresolved from v1.0 ]
19
03816
03816
辵 162.11.2
GZHSJ-0070
TS 14 · IDS
Total Stroke Count
[ Unresolved from v1.0 ]
15
03815
03815
辵 162.11.1
GZHSJ-0125
TS 14 · IDS
Total Stroke Count
[ Unresolved from v1.0 ]
15
04640
04640
黃 201.6.2
GZHSJ-0145
TS 17 · IDS
Total Stroke Count
[ Unresolved from v1.0 ]
18, based on IRGN2221 which suggests 12 strokes for 黃/黄
03832
03832
邑 163.4.3
GZHSJ-0162
TS 6 · IDS
Total Stroke Count
[ Unresolved from v1.0 ]
7
02769
02769
示 113.14.2
KC-10153
TS 19 · IDS
Residual Stroke Count
[ Unresolved from v2.0 ]
13
Total Stroke Count
[ Unresolved from v2.0 ]
18
04236
04236
香 186.18.1
KC-10181
TS 27 · IDS
FS
[ Unresolved from v1.0 ]
2 (based on IRGN 954AR)
03746
03746
車 159.10.1
KC-10197
TS 17 · IDS
FS
[ Unresolved from v1.0 ]
FS=2
Residual Stroke Count
[ Unresolved from v1.0 ]
SC=11, TS=18
03879
03879
金 167.7.4
KC-10260
TS 15 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
8
Total Stroke Count
[ Unresolved from v1.0 ]
16
03545
03545
言 149.25.2
SAT-09174
TS 32 · IDS
FS
[ Unresolved from v1.0 ]
FS=5
03757
03757
車 159.11.5
SAT-09336
TS 18 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
SC=10, TS=17
03903
03903
金 167.11.3
SAT-09597
TS 19 · IDS 𣐂
FS
[ Unresolved from v1.0 ]
2
04645
04645
黑 203.10.2
SAT-09617
TS 21 · IDS
FS
[ Unresolved from v1.0 ]
5
Total Stroke Count
[ Unresolved from v1.0 ]
22
04632
04632
鹿 198.15.4
SAT-09630
TS 26 · IDS 鹿
FS
[ Unresolved from v1.0 ]
2
04215
04215
食 184.14.1
SAT-09712
TS 23 · IDS
Total Stroke Count
[ Unresolved from v1.0 ]
22
03527
03527
言 149.8.3
SAT-09780
TS 15 · IDS
FS
[ Unresolved from v1.0 ]
FS = 3
03859
03859
里 166.7.2
SAT-09832
TS 11 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
4
FS
[ Unresolved from v1.0 ]
1
03636
03636
足 157.5.2
SAT-10051
TS 12 · IDS
FS
[ Unresolved from v1.0 ]
FS=1
03843
03843
邑 163.17.1
SAT-10079
TS 20 · IDS 𦾔
FS
[ Unresolved from v1.0 ]
2
04283
04283
骨 188.14.5
SAT-10195
TS 24 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
15
Total Stroke Count
[ Unresolved from v1.0 ]
25
04063
04063
雨 173.10.2
T13-3F42
TS 18 · IDS
FS
[ Unresolved from v1.0 ]
1
04083
04083
雨 173.13.3
T13-3F4F
TS 21 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
14
Total Stroke Count
[ Unresolved from v1.0 ]
22
04084
04084
雨 173.13.3
T13-3F51
TS 21 · IDS
FS
[ Unresolved from v1.0 ]
4
02888
02888
竹 118.10.3
TB-7D2D
TS 16 · IDS
FS
[ Unresolved from v2.0 ]
4
04059
04059
雨 173.10.1
TB-7E5E
TS 18 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
11
Total Stroke Count
[ Unresolved from v1.0 ]
19
04570
04570
鳥 196.17.2
TB-7E79
TS 28 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
15
Total Stroke Count
[ Unresolved from v1.0 ]
26
04633
04633
鹿 198.16.1
TB-7E7B
TS 27 · IDS 鹿
Residual Stroke Count
[ Unresolved from v1.0 ]
15
Total Stroke Count
[ Unresolved from v1.0 ]
26
04248
04248
馬 187.6.4
TC-6576
TS 16 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
8
Total Stroke Count
[ Unresolved from v1.0 ]
18
03964
03964
門 169.9.5
TC-6625
TS 17 · IDS 𤕰
Residual Stroke Count
[ Unresolved from v1.0 ]
8
Total Stroke Count
[ Unresolved from v1.0 ]
16
04548
04548
鳥 196.8.4
TC-6836
TS 19 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
7
Total Stroke Count
[ Unresolved from v1.0 ]
18
04000
04000
阜 170.8.3
土 32.8.5
TC-743B
TS 11 · IDS 𨸼
FS
[ Unresolved from v1.0 ]
4
04195
04195
食 184.5.3
TD-3463
TS 13 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
4
Total Stroke Count
[ Unresolved from v1.0 ]
12
04395
04395
魚 195.3.3
TD-584A
TS 14 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
4
Total Stroke Count
[ Unresolved from v1.0 ]
15
03630
03630
走 156.8.1
TD-647C
TS 15 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
SC=9, TC=16 (IRGN2221)
02756
02756
示 113.10.3
TD-7B2B
TS 15 · IDS
FS
[ Unresolved from v2.0 ]
4
04152
04152
音 180.9.1
TE-297C
TS 18 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
8
Total Stroke Count
[ Unresolved from v1.0 ]
17
03934
03934
金 167.20.3
TE-7D7B
TS 28 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
21
Total Stroke Count
[ Unresolved from v1.0 ]
29
03632
03632
走 156.8.5
UK-30090
TS 15 · IDS
FS
[ Unresolved from v1.0 ]
FS=2
03824
03824
辵 162.12.4
UK-30195
TS 16 · IDS
FS
[ Unresolved from v1.0 ]
4
04445
04445
鱼 195′.2.5
UK-30227
TS 10 · IDS
FS
[ Unresolved from v1.0 ]
3
04201
04201
食 184.7.1
UK-30268
TS 16 · IDS
Total Stroke Count
[ Unresolved from v1.0 ]
15
FS
[ Unresolved from v1.0 ]
Unihan data, the ORT Attributes predictor, and most other candidates in WS2024 give 8. It would be better to be consistent.
03881
03881
金 167.7.5
UK-30292
TS 15 · IDS
FS
[ Unresolved from v1.0 ]
1
03755
03755
車 159.11.2
UK-30348
T13-3E7A
TS 18 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
SC=12, TS=19
04196
04196
食 184.6.1
UK-30458
TS 15 · IDS
Total Stroke Count
[ Unresolved from v1.0 ]
14
Total Stroke Count
[ Unresolved from v1.0 ]
Unihan data, the ORT Attributes predictor, and most other candidates in WS2024 give 8. It would be better to be consistent.
Total Stroke Count
[ Unresolved from v1.0 ]
Given the variations across geographies and font designs, and the fact that unification precludes most shape-based determination of attributes, CJKJRG / IRG originally chose to use the Kangxi values, the most common denominator in dictionaries used by the CJKV countries. This avoided a lot of fruitless debate. Kangxi is 9 strokes, but as you point out, that later changed. I'm fine with either 8 or 9, but we should be consistent moving forward and change the ORT tools to support our decision. Otherwise, maybe we should just stop using TS.
03561
03561
豆 151.4.5
UK-30559
TS 11 · IDS
FS
[ Unresolved from v1.0 ]
FS=3
04216
04216
食 184.14.1
UK-30582
TS 23 · IDS
Total Stroke Count
[ Unresolved from v1.0 ]
22
03615
03615
贝 154′.15.3
UK-30630
TS 19 · IDS
FS
[ Unresolved from v1.0 ]
FS=2
Residual Stroke Count
[ Unresolved from v1.0 ]
SC=14, TC=20, based on Attributes Predictor, which counts the extra dot in 署
03611
03611
贝 154′.12.3
UK-30644
TS 16 · IDS
FS
[ Unresolved from v1.0 ]
FS=4
04009
04009
阜 170.11.3
UK-30801
TS 14 · IDS
FS
[ Unresolved from v1.0 ]
4
03866
03866
金 167.6.1
UTC-03274
TS 14 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
8
FS
[ Unresolved from v1.0 ]
2
Total Stroke Count
[ Unresolved from v1.0 ]
16
03884
03884
金 167.8.1
UTC-03282
TS 16 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
9
Total Stroke Count
[ Unresolved from v1.0 ]
17
03907
03907
金 167.11.4
UTC-03285
TS 19 · IDS
FS
[ Unresolved from v1.0 ]
3
02762
02762
示 113.11.4
UTC-03292
TS 15 · IDS
Total Stroke Count
[ Unresolved from v2.0 ]
16
02758
02758
示 113.11.1
UTC-03293
TS 15 · IDS
Total Stroke Count
[ Unresolved from v2.0 ]
16
02742
02742
示 113.8.1
UTC-03294
TS 12 · IDS
Total Stroke Count
[ Unresolved from v2.0 ]
13
04508
04508
鱼 195′.10.1
UTC-03463
TS 13 · IDS
Total Stroke Count
[ Unresolved from v1.0 ]
18
04605
04605
鸟 196′.10.1
UTC-03470
TS 15 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
9
Total Stroke Count
[ Unresolved from v1.0 ]
14
04618
04618
鸟 196′.16.1
UTC-03476
TS 21 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
17
Total Stroke Count
[ Unresolved from v1.0 ]
22
04262
04262
馬 187.19.2
UTC-03481
TS 28 · IDS
Total Stroke Count
[ Unresolved from v1.0 ]
29
00198
00198
儿 10.10.2
VN-F0039
TS 12 · IDS
Radical
[ Unresolved from v3.0 ]
Almost all cases of V-source characters with 光 on the left side have 10 as the primary radical. I'm not sure it's a good idea to change. Also, kRSUnicode for 光 is 10, not 42. We should change that.
00278
00278
刀 18.10.1
VN-F005A
TS 12 · IDS
Residual Stroke Count
[ Unresolved from v3.0 ]
The ORT Attributes predictor gives 5 for strokes for 巨. Keep 10 and TS 12
00295
00295
力 19.5.1
VN-F0062
TS 7 · IDS
Residual Stroke Count
[ Unresolved from v3.0 ]
The ORT tool gives a count of 5. for 巨
01155
01155
尸 44.11.1
VN-F0156
TS 14 · IDS
IDS
[ Unresolved from v3.0 ]
Agree with #7734
01138
01138
小 42.14.1
工 48.14.1
VN-F016E
TS 16 · IDS
Residual Stroke Count
[ Unresolved from v3.0 ]
According to the attributes predictor, 巨 is 5 strokes, so SC1 should be 14, but TS = 17
01319
01319
工 48.15.5
VN-F0170
TS 18 · IDS
Total Stroke Count
[ Unresolved from v2.0 ]
The IRG Attributes Predictor counts 巨 as 5 strokes, Unihan has 4. We should discuss and document the stroke count we are going to use and fix the ORT if we decide it's 4. Otherwise keep TC=18.
01736
01736
日 72.4.4
VN-F025A
TS 8 · IDS
IDS
[ Unresolved from v2.0 ]
The IDS proposed above seems confusing. 㓁 is a variant of rad. 122 and always appears above. If we merely want to reduce the # of strokes, U+5197 would be better since it can have the shape ⿱冖儿.
01802
01802
日 72.11.5
VN-F0266
TS 15 · IDS
Residual Stroke Count
[ Unresolved from v2.0 ]
The current values come from the ORT Attributes Predictor. Unihan gives contradictory values for U+52D9: TC= 10, but RSUnicode = 19.9 (total 11)
Residual Stroke Count
[ Unresolved from v2.0 ]
IRGN 954AR counts 攵 and variants (including 夂?) as 4, so we need to agree on the composition of 务 (U+52A1) and 務.
Residual Stroke Count
[ Unresolved from v2.0 ]
IRGN2171 suggests 9 for U+6544 敄.
01813
01813
日 72.13.5
VN-F0269
TS 17 · IDS
Residual Stroke Count
[ Unresolved from v2.0 ]
The ORT Attributes Predictor gives 13 strokes for U+9115 and U+9109. So keep the current RS values until we publish an agreed upon value for IRG work.
Residual Stroke Count
[ Unresolved from v3.0 ]
The ORT tools should be updated to reflect IRGN2221
02519
02519
疒 104.6.4
VN-F037D
TS 10 · IDS
Total Stroke Count
[ Unresolved from v2.0 ]
11
02843
02843
穴 116.14.5
VN-F03FD
TS 19 · IDS
FS
[ Unresolved from v2.0 ]
1
02986
02986
糸 120.9.4
VN-F045E
TS 15 · IDS
FS
[ Unresolved from v1.0 ]
5
03619
03619
赤 155.12.1
VN-F056C
TS 17 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
SC=12, TC=19
03620
03620
赤 155.12.1
VN-F056D
TS 17 · IDS
Total Stroke Count
[ Unresolved from v1.0 ]
TS=19
03682
03682
足 157.15.2
VN-F0597
TS 22 · IDS 𧾷
FS
[ Unresolved from v2.0 ]
IRGN954AR shows 乙 (5) , but the ORT Attributes predictor gives 2. 2 seems better since it is the first stroke in
FS
[ Unresolved from v2.0 ]
Above comment should end with 賞.
04166
04166
頁 181.12.5
阜 170.18.2
VN-F060F
TS 20 · IDS
Total Stroke Count
[ Unresolved from v1.0 ]
21
04416
04416
魚 195.10.3
VN-F0677
TS 21 · IDS
FS
[ Unresolved from v1.0 ]
ORT Attributes predictor gives FS=3, which is correct?

04240
04240
馬 187.4.5
VN-F07F9
TS 14 · IDS
IDS
[ Unresolved from v1.0 ]
⿱⿻又丷馬
02772
02772
禸 114.8.4
八 12.11.4
VN-F085E
TS 13 · IDS
Total Stroke Count
[ Unresolved from v2.0 ]
Agree with #8371
03805
03805
辵 162.9.2
VN-F109A
TS 13 · IDS 𠀐
Residual Stroke Count
[ Unresolved from v1.0 ]
8
Total Stroke Count
[ Unresolved from v1.0 ]
12
00126
00126
人 9.7.3
VN-F1F11
TS 9 · IDS
FS
[ Unresolved from v3.0 ]
The ORT Attributes predictor give 3 as FS, which is the first stroke of 乃. Otherwise, it should be 2, the first stroke of 旦
03203
03203
自 132.16.1
VN-F1FBD
TS 26 · IDS
Residual Stroke Count
[ Unresolved from v2.0 ]
Needs update to 22
02611
02611
目 109.8.4
VN-F200F
TS 13 · IDS
Residual Stroke Count
[ Unresolved from v1.0 ]
The attributes predictor tool gives 8 for the stroke count. https://hc.jsecs.org/irg/ws2021/app/attributes-predictor.php?ids=%E2%BF%B1亡目务&radical=109.0
04549
04549
鳥 196.8.4
VN-F20BC
TS 19 · IDS
Radical
[ Unresolved from v2.0 ]
Agree with comment #10413
03083
03083
羊 123.12.5
VN-F2112
TS 18 · IDS
Total Stroke Count
[ Unresolved from v1.0 ]
18 is correct. According to the Attributes Predictor and Unihan data, 羊 is 6, not 7. Both give 10 strokes for 羞.
Total Stroke Count
[ Unresolved from v3.0 ]
The current Unihan data shows varying counts for the sheep radical in this position, although 7 does seem more common. It seems somewhat more intuitive to use 6, since the base character is 6 and the design of many glyphs use the base ⺶, not the ⿱𦍌丿. Either way, we need to update the unihan data and tools.

Evidence

Showing 17 comments.

SnImage/SourceComment TypeDescription
00619
00619
口 30.13.2
GZ-2352301
TS 16 · IDS
New evidence
[ Unresolved from v3.0 ]
ĐTĐCN, p. 0565, reading "hẹm"



Ngũ Thiên Tự (五千字), p. 34, reading "dèm"

00282
00282
刀 18.10.4
UK-30539
TS 12 · IDS
New evidence
[ Unresolved from v3.0 ]
ĐTĐCN, p. 539, "gươm" (means 'sword', thought to be early borrowing from OC)

00511
00511
口 30.10.3
UK-30834
TS 13 · IDS
New evidence
[ Unresolved from v3.0 ]
VN-F097E

00720
00720
口 30.16.3
UK-30943
TS 19 · IDS
New evidence
[ Unresolved from v3.0 ]
ĐTĐCN, p. 916, reading "ngã"

00601
00601
口 30.12.3
UK-30946
TS 15 · IDS
New evidence
[ Unresolved from v3.0 ]
ĐTĐCN, p 1075, reading "phọc"

00381
00381
口 30.3.5
UTC-03296
TS 6 · IDS
New evidence
[ Unresolved from v3.0 ]
ĐTĐCN:0885, reading "nỉ". It is noteworthy that the structural analysis given is 口 + 己 (kỉ), which differs phonetically and semantically from 口 + 已 (dĩ) in Vietnamese.

00236
00236
冫 15.4.3
VN-F0046
TS 6 · IDS
Evidence
[ Unresolved from v3.0 ]
This also appears in the Khó Chữ Hán Nôm Mã Hoá, the last published document with the status of a national standard.



As noted in the first evidence above, this is composed with a "nháy" reading mark. The problem is that Unicode does not allow variation of marks, so there is no way to encode this separate from another character VN-F0074 using IVS

01010
01010
大 37.10.1
VN-F0122
TS 13 · IDS
New evidence
[ Unresolved from v2.0 ]
Evidence from Kho Chữ Hán Nôm Mã Hoá p. 584

01311
01311
巛 47.12.5
VN-F0166
TS 15 · IDS
Evidence
[ Unresolved from v3.0 ]
The analysis in the evidence says that the left side is half of "huấn" (訓), hence the 川. We could consider revising the glyph so that it looks more like the right side of 訓.
03456
03456
行 144.14.5
VN-F052A
TS 20 · IDS
Unclear evidence response
[ Unresolved from v3.0 ]
This is also found in the Vietnamese standard: Kho chữ Hán Nôm mã hoá, as V+6056F, see image below

One thing to consider. Vietnamese uses both U+8862, in the original sense of road, intersection, with Sino-Viet reading "cù" and VN-F052A. Only VN-F052A is found with the reading "cò", meaning "stork" or "egret". The element feather, 羽, appears to distinguish the meaning.

00340
00340
厂 27.8.4
VN-F2002
TS 10 · IDS
Evidence
[ Unresolved from v2.0 ]
This is currently the only example we have of this character. The component ⿻沈丶 is thought to derive from a simplified form of 㴷 (đắm: shipwrecked, see TĐCNTD p. 341), where 耽 has been reduced to the form V+60779 shown below:



The more common form is V+607C5 in the above, also shown here from the same source as VN-F2002:



An appropriate normalization would be V+607C5.
02978
02978
糸 120.8.1
VN-F2057
TS 14 · IDS
Misidentified glyph
[ Unresolved from v3.0 ]
I agree that 剌 ("lạt") would be a better phonetic than 刺 ("thích"). The original manuscript is not readily available now, so it's hard to conclude.
00310
00310
匕 21.9.1
里 166.6.1
VN-F205C
TS 13 · IDS 𫧇
New evidence
[ Unresolved from v3.0 ]
Here is a transliteration below the original Nôm Tày followed by Vietnamese translation

00144
00144
人 9.9.2
VN-F2079
TS 11 · IDS 𱽗
Evidence
[ Unresolved from v2.0 ]
We recently found that this character is the orignal source given for V4-407A, on p. 517 of "Góp phần nghiên cữu văn hoá Việt Nam"



V4-407A is currently encoded as 𫢠 U+2B8A0. One solution would be to move V4-407A to WS2024:00144 and change kIRG_VSource for U+2B8A0 to VN-2B8A0.
04577
04577
鸟 196′.2.3
几 16.5.3
VN-F2080
TS 7 · IDS
Evidence
[ Unresolved from v2.0 ]
The reading is shown in the transliteration that on the page that follows (in green). The note explains that the original text is in Vietnamese, so no translation is given. "phượng" is the legendary bird typically translated as "phoenix". It is more commonly written: 鳯.

00324
00324
十 24.8.4
VN-F2085
TS 10 · IDS
Evidence
[ Unresolved from v3.0 ]
bưởng means half, or side; The Vietnamese translation of "bưởng đông" shown is "phía đông" or "East side"
04549
04549
鳥 196.8.4
VN-F20BC
TS 19 · IDS
Evidence
[ Unresolved from v3.0 ]
The evidence is quite clear as to shape and meaning. As to the unusual shape, it's true that this is the only example we have found of 𫠓, used as a radical, so we could consider using the full form, 鳥.

Glyph Design & Normalization

Showing 25 comments.

SnImage/SourceComment TypeDescription
02113
02113
水 85.9.5
GZ-4722601
TS 12 · IDS
Glyph design
[ Unresolved from v3.0 ]
Based on the above discussion, the conclusion is that the glyph and IDS should be changed to ⿰氵𱚹
03217
03217
舛 136.0.1
一 1.7.2
SAT-09449
TS 8 · IDS
Glyph design
[ Unresolved from v2.0 ]
We need to resolve the issue of design
04142
04142
革 177.5.1
SAT-09987
TS 14 · IDS
Glyph design
[ Unresolved from v1.0 ]
Please consider normalizing the form 戹 to be more in keeping with the design found in other Japanese fonts.
04541
04541
鳥 196.6.1
UTC-03116
VN-F0696
TS 17 · IDS
Glyph design
[ Unresolved from v2.0 ]
We are considering changing the design so that the left side looks more like the 戎 in UTC-03116
01348
01348
幺 52.8.5
VN-F0186
TS 11 · IDS
Normalization
[ Unresolved from v3.0 ]
This has been normalized using rule # 2-0. See IRGN2673
01667
01667
手 64.13.4
VN-F0227
TS 16 · IDS 𦵘
Glyph design
[ Unresolved from v3.0 ]
We will fix the glyph
01686
01686
手 64.18.4
VN-F0246
TS 21 · IDS
Glyph design
[ Unresolved from v3.0 ]
This is the only example in Vietnamese of 類 as a component. The preferred form is 類 (V1-6C22), so it would be better to normalize the glyph to reflect that.
01886
01886
木 75.7.3
VN-F028C
TS 11 · IDS
Glyph design
[ Unresolved from v2.0 ]
There are more than 40 glyphs using the same design in the NomNaTong font. It would be a significant effort to change them all. We would need to better understand the rationale for this design before making such a change.
02685
02685
石 112.8.5
VN-F03C5
TS 13 · IDS
Glyph design
[ Unresolved from v1.0 ]
There are 21 Vietnamese characters with 叕 as an immediate constituent. The distribution of the stroke shape in question is about half and half. We will investigate the issues with normalization.
03016
03016
糸 120.17.3
VN-F0471
TS 23 · IDS
Normalization
[ Unresolved from v3.0 ]
Normalization based on rule 1-4 in IRGN2673
03193
03193
肉 130.15.4
VN-F04BE
TS 19 · IDS
Glyph design
[ Unresolved from v2.0 ]
The glyph already has that general shape, can you provide more detail?
Normalization
[ Unresolved from v3.0 ]
Note that the horizontal extension will are working on will include U+6447 摇, U+9065 遥, and U+7476 瑶, as well as U+7AB0 窰, the right side of VN-F04BE. U+7AB0 currently has the shape shown below.



VN-F04BE and U+7AB0 both already have the suggested general structure, ⿱爫缶. Is the desire here to move the 爫 one or two pixels up and to the left so it is the same as U+55C2, etc?
00216
00216
八 12.6.3
VN-F066F
TS 8 · IDS
Glyph design
[ Unresolved from v1.0 ]
We will address this.
04560
04560
鳥 196.11.2
VN-F06D3
TS 22 · IDS
Glyph design
[ Unresolved from v2.0 ]
Re. #8603, please provide more detail concerning the revision.
01659
01659
手 64.12.4
VN-F0B52
TS 15 · IDS
Glyph design
[ Unresolved from v2.0 ]
The phonetic, "dan" argues for U+67EC. Here is another analysis (Vũ Văn Kính, "Tự điễn chứ Nôm" p. 225) showing that the traditional and simplified forms both contain U+67EC, read "lan", as phonetic.

01563
01563
手 64.4.4
VN-F0CBC
TS 7 · IDS 𠬠
Glyph design
[ Unresolved from v2.0 ]
The element on the right is a simplification of the characters 沒 / 没, read "một", through these steps 没 > 𠬛 > 𠬠 or 𱥺 > 𠬠. There are 2 basic forms, 𠬠 and 𰰝. This is documented in the character definition shown in the image below from TĐCNTD p. 802



Below is an example of VN-F0CBC from "Lục Vân Tiên" showing a form somewhat between 𠬠 and 𰰝



Historically, there are many examples of 𰰝, but the current trend is to standardize on 𠬠, as shown in this the "BẢNG CHỮ HÁN NÔM CHUẨN THƯỜNG DÙNG" http://www.hannom-rcv.org/NS/bchnctd%20300623.pdf
00157
00157
人 9.11.4
VN-F165C
TS 13 · IDS
Normalization
[ Unresolved from v3.0 ]
Both forms are valid, but there is a preference for 盗 (U+76D7), so it is normalized to that.
01011
01011
大 37.10.1
VN-F1F13
TS 13 · IDS
Glyph design
[ Unresolved from v2.0 ]
About the comment #8629. I assume that the suggestion is to change the 亅 in the top 可 to 丨. That's a reasonable suggestion, but there are at least 6 other V-source characters already encoded (𣘁 U+24819, etc.) that use the current design. Since that's a majority, the lesser impact solution would be to normalize the remaining characters 哥 U+54E5, 歌 U+6B4C, U+2BC04, and VN-F176E (in WS2021) to use 亅.
00955
00955
土 32.14.3
VN-F1FD4
TS 16 · IDS
Normalization
[ Unresolved from v3.0 ]
This is not a significant difference. Most of the glyphs in NomNaTong with the 鬼 component retain the 厶. Over time, we will normalize to that shape. We can update the normalization guidelines
03560
03560
谷 150.7.3
VN-F1FDB
TS 14 · IDS
Glyph design
[ Unresolved from v2.0 ]
Both variants are found in Vietnamese, TĐCNDG entry shown below has U+79C3 秃. In the NomNaTong font there are 5 glyphs composed with U+79C3 秃 and 5 composed with U+79BF 禿. Of the characters with V-Source references, if we normalize to 禿, we would also want to change 𥟉 U+257C9 / V3-3531 and 𥟹 U+257F9 / V2-7F31. If we normalize to 秃, we would only change U+22B33 / VN-22B33

00340
00340
厂 27.8.4
VN-F2002
TS 10 · IDS
Normalization
[ Unresolved from v3.0 ]
The inner component should be changed to ⿰氵⿻⿻冖九丶, V+607C5 as shown in #8650 above
01887
01887
木 75.7.3
VN-F2066
TS 11 · IDS
Glyph design
[ Unresolved from v3.0 ]
will change the glyph
01943
01943
木 75.11.3
VN-F2067
TS 15 · IDS
Glyph design
[ Unresolved from v3.0 ]
will change the glyph
04183
04183
𲋄 182″.4.1
VN-F2093
TS 8 · IDS 𲋄
Glyph design
[ Unresolved from v3.0 ]
A reviewer, Luu Quang Truong, points out that closer inspection of the original font reveals the top element to be 主. See image below. This makes sense as a phonetic, and there are other examples: "gio": 𠰍 (giỏ), 𬚶 (giỏ), etc. We propose changing the glyph and attributes to reflect that.
SC=5, FS=4, TS=9

04647
04647
黑 203.12.3
VN-F2568
TS 24 · IDS
Glyph design
[ Unresolved from v3.0 ]
This was normalized incorrectly. As can be seen here

The original structure was ⿰黑⿱⿰夕丰木. The word "kịt" means dark, dense. Based on the phonetic value, "kịt", this should have been normalized to ⿰黑桀, with 桀 as phonetic.

Other

Showing 16 comments.

SnImage/SourceComment TypeDescription
00052
00052
丶 3.6.3
GZ-4731601
TS 7 · IDS
Other
[ Unresolved from v3.0 ]
芮 (U+82AE) in Vietnam is used for Tày "rườn" (house), similar in meaning to the Zhuang word.
02563
02563
白 106.9.1
KC-10128
TS 14 · IDS
Other
[ Unresolved from v2.0 ]
FS needs update
01622
01622
手 64.9.4
SAT-10222
TS 12 · IDS 𥥛
Other
[ Unresolved from v3.0 ]
We still need to resolve the question regarding glyph shape noted above.
01153
01153
尸 44.10.1
SAT-90030
TS 13 · IDS
Other
[ Unresolved from v3.0 ]
Related to 𡱁 (U+21C41) and possibly 𡲑 (U+21C91)
04296
04296
鬼 194.1.1
T13-3F6A
TS 8 · IDS
Other
[ Unresolved from v1.0 ]
Should consider a new radical number for this ⿱田儿 variant of 194
04599
04599
鸟 196′.9.2
UK-30053
TS 14 · IDS
Other
[ Unresolved from v1.0 ]
The wrong glyph is marked in Evidence 1, but the correct one is next to it.
03250
03250
艸 140.5.3
UTC-03295
TS 8 · IDS
Other
[ Unresolved from v1.0 ]
What is the justification for labeling this as similar to U+31FC3?
Other
[ Unresolved from v2.0 ]
Evidence # 3 for UTC-03292, which has 逃入清化 (he fled into Thanh Hoá), parallels the phrase 奔清⿱花一 above and suggests that this character is a variant of 化 (U+5316, read hoá). Thanh Hoá is more commonly written 清化.
01144
01144
尸 44.2.4
UTC-03390
TS 5 · IDS
Other
[ Unresolved from v3.0 ]
#9327 shows that we are already living with script-hybrid characters without any problem. Since the nature and attributes of these characters are not fundamentally different from CJK Ideographs, I see no need to postpone.
00236
00236
冫 15.4.3
VN-F0046
TS 6 · IDS
Other
[ Unresolved from v3.0 ]
According to the analysis in the dictionary, 冫 is a mark indicating that this character is to be read differently from its standard Sino-Vietnamese reading of "cặp".
02913
02913
竹 118.15.1
VN-F0423
TS 21 · IDS
Other
[ Unresolved from v2.0 ]
Both characters, U+23813 and VN-F0423 mean "a type of bamboo". The major sources, BTCN, ĐTĐCN, GĐNHV, KCHN, and Takeuchi, all show the form VN-F0423, with 竹, appropriately, as the radical. Since VN-F0423 appears to be the correct form, if we were to unify these, Vietnam would request changing the representative glyph for U+23813 to be that of VN-F0423.
03331
03331
艸 140.13.2
VN-F04F6
TS 17 · IDS 𠣜
Other
[ Unresolved from v2.0 ]
It's possible that the shape in #8394 was intended, but the only evidence we have shows the current glyph design. Here is KCHN

03544
03544
言 149.23.2
貝 154.23.
VN-F1E12
TS 30 · IDS
Other
[ Unresolved from v1.0 ]
second radical should have FS=4
00032
00032
丨 2.1.3
VN-F1F1F
TS 2 · IDS &S2-01;
Other
[ Unresolved from v3.0 ]
Note, this is a full character as well as a component. It is used as an abbreviation of 饒 (U+9952), read "nhiêu" or "nhiều".
02531
02531
疒 104.9.3
VN-F2049
TS 14 · IDS 𢚩
Other
[ Unresolved from v2.0 ]
SC and TC need to be updated as noted above.
00144
00144
人 9.9.2
VN-F2079
TS 11 · IDS 𱽗
Other
[ Unresolved from v3.0 ]
Will also have to change kIRG_VSource for U+2B8A0 to VN-2B8A0.