Please wait while loading

IRG Working Set 2021v3.0

Source: Andrew WEST
Date: Generated on 2022-07-04

Show Deleted

Unification

SnImage/SourceComment TypeDescription
02402
02402
犬 94.10
13 · ㇑ (2)
GKJ-00565
Unification
U+25855
Appears to be an error for U+25855 𥡕 in a single edition. Suggest postponing for additional evidence.
01413
01413
心 61.14
17 · ㇔ (4)
KC-05106
Unification
U+2AB3A
The glyph shown in the evidence is suspicious because no other encoded character has the component ⿱宷日. It is possibly an error for U+2AB3A 𪬺 (K5-01E2). Consider unifying to 𪬺 (U+2AB3A) if cognate.
01388
01388
心 61.13
17 · ㇒ (3)
SAT-06249
UCV
Consider adding UCV for 解~觧. See also TE-3773 (⿱觧虫) and VN-F1B70 (⿱觧会).
01263
01263
广 53.11
14 · ㇐ (1)
TD-4B2A
广𤲅
UCV
The supplementary evidence provided by Eiso Chan shows the character to be a variant of 庵. It seems like 𤲅 is a common variant of 奄, and a possible candidate for a UCV.
02054
02054
水 85.9
12 · ㇠ (5)
UK-20083
Oppose Unification
Based on the second evidence provided by Eiso Chan, ⿰氵架 is not a misprint but a known variant of 深. Therefore keep in M-set.
00764
00764
囗 31.3
6 · ㇐ (1)
UK-20835
Unification
U+211AE
As 𡆮 (⿴囗士) is only listed in the "備考" section of KXZD, the KX glyph form cannot be considered to be authoritative, and KX should not be used as a source reference. The definitions "土入口也" and "沙土入口" clearly indicate that ⿴囗土 is the correct glyph form for this character. Given that GHZR (which supercedes the GHZ-10711.06 source reference) has corrected the glyph to ⿴囗土, the preferred solution is to correct the G glyph for U+211AE to ⿴囗土 and amend the source reference to GHZR-10766.06. If this solution is accepted then UK will agree to unification and withdraw UK-20835. Horizontal extension is only required if China is unwilling to change the glyph for U+211AE.
03525
03525
艸 140.8
12 · ㇠ (5)
UK-20848
Unification
U+2C74B
Agree to unify to 𬝋 (U+2C74B).
01535
01535
手 64.12
16 · ㇒ (3)
VN-F18A1
UCV
Given the large number of V source characters which show variation of 手~扌 on the left, could we consider adding a UCV for 手~扌 where cognate?
01510
01510
手 64.11
14 · ㇑ (2)
VN-F18C5
𦰟
UCV
Given the large number of V source characters which show variation of 手~扌 on the left, could we consider adding a UCV for 手~扌 where cognate?
03754
03754
角 148.12
19 · ㇔ (4)
VN-F1B70
UCV
Consider adding UCV for 解~觧. See also TE-3773 (⿱觧虫) and SAT-06249 (⿱觧心).


Attributes

SnImage/SourceComment TypeDescription
02039
02039
水 85.9
13 · ㇐ (1)
GKJ-00957
𠫴
IDS
Keep IDS as ⿰氵𠫴 because the IDS is purely descriptive of the character's shape, and is not intended to provide any semantic information.
01737
01737
月 74.6
10 · ㇒ (3)
TC-4A25
Radical
Could Lee or Eiso confirm that VN-F195D uses R74 'moon' radical?
01752
01752
月 74.11
15 · ㇔ (4)
UK-20238
Radical
Keep R74 'moon'
01760
01760
月 74.14
18 · ㇐ (1)
UK-20488
UTC-03203
Radical
Agree to change to R130 'meat'
01269
01269
广 53.14
17 · ㇒ (3)
VN-F0191
广
IDS
Agree with #1531.
01365
01365
心 61.11
14 · ㇑ (2)
VN-F01CC
𭁈𰀁
IDS
Change IDS to ⿰忄𱑏 once Unicode 15.0 is released.
02042
02042
水 85.9
12 · ㇐ (1)
VN-F0732
IDS
Change IDS to ⿰汫⿱⿵𠆢丶𰆊 if the rhs is not 令. If the rhs is a form of 令 then normalize the glyph to use the standard V form of 令.
01329
01329
心 61.6
10 · ㇠ (5)
VN-F1871
Radical
U+21756 𡝖 (entry above in the evidence) is under R38 女 so it would make sense for this character to also be under R38.
01249
01249
幺 52.16
19 · ㇒ (3)
VN-F187B
IDS
Change IDS to ⿰𱥯幾 once Unicode 15.0 has been published.
04237
04237
門 169.11
19 · ㇑ (2)
VN-F1BD9
𭁈𰀁
IDS
Change IDS to ⿰門𱑏 once Unicode 15.0 is released.
01278
01278
弋 56.8
11 · ㇐ (1)
VN-F1CFE
Radical
I'm not sure what IRG rules you are referring to. However, IRG PnP §2.2.1 d. (5) c) states:

"If the technically correct (aka semantic) radical for an ideograph hampers its discoverability, or is region-dependent, the primary radical shall be assigned as though made by an ideograph expert who is neither a specialist in the history of the Han script nor familiar with ideograph etymology. The technically correct radical can be assigned as a second radical. Both are shown in the code charts, though the primary one serves as the basis for ordering within a CJK Unified Ideographs block."

Therefore, in this case the primary radical should be R126 而.


Evidence

SnImage/SourceComment TypeDescription
01792
01792
木 75.8
12 · ㇐ (1)
GDM-00239
𰿖
Evidence
Please provide the source for the evidence image (title, publisher, year, page)
01811
01811
木 75.9
13 · ㇔ (4)
GDM-00242
Evidence
Isn't there some question about the suitability of 汉字海 as a reliable source? I think it can be used as secondary evidence, but I would prefer to see additional evidence for ⿰木弯 as a place name.
02065
02065
水 85.10
13 · ㇔ (4)
GDM-00248
New evidence
Add image of web page cited in #5128

01586
01586
鳥 196.8
19 · ㇐ (1)
GKJ-00293
Evidence
The evidence is insufficient. Firstly, please show the full page of the evidence. Secondly please show the original source that is being quoted so we can be sure that character shown in the modern source is not a mistake.
01408
01408
鳥 196.9
20 · ㇔ (4)
GKJ-00318
Evidence
It is difficult to trust the modern edition of the text as experience shows that modern editions often introduce glyph errors or create imaginary characters. Therefore, please show an image of the original text, apparently from 《明憲宗純皇帝實錄卷之六十八》.
02423
02423
犬 94.12
15 · ㇑ (2)
GKJ-00552
Evidence
The full reference for the source should be provided (author, title, publisher, year), as well as the name of the author and title for the piece in which this character occurs.
02390
02390
犬 94.9
12 · ㇒ (3)
GKJ-00555
TB-5027
New evidence
This character occurs in Japanese sources for the name of the Chuja Islands 楸子群島 in Korea (source twitter):

02410
02410
犬 94.10
13 · ㇔ (4)
GKJ-00558
Evidence
Complete reference for the source (author, title, publisher, year) would be useful. It would also be interesting to see the complete page, and not just a tiny extract.
02457
02457
犬 94.16
19 · ㇒ (3)
GKJ-00560
Evidence
Current evidence is insufficient. Please provide complete page of the evidence so we can properly evaluate the proposed character.
02402
02402
犬 94.10
13 · ㇑ (2)
GKJ-00565
New evidence
集韻卷五 has U+25855 𥡕

02458
02458
犬 94.17
20 · ㇒ (3)
GKJ-00572
Evidence
The current evidence is insufficient. Please provide an image from an edition of 禮記 which shows this character. Otherwise it should be postponed pending additional evidence.
01844
01844
木 75.12
16 · ㇒ (3)
GKJ-00575
𭸟
Evidence
Is it possible to see what other editions of 集韻 give? My suspicion is that ⿰木𭸟 is a corrupt form of some other character, perhaps U+23682 𣚂.
02412
02412
犬 94.10
13 · ㇔ (4)
GKJ-00583
Evidence
This appears to be a one-off error for 嫁. Therefore suggest to postpone pending additional evidence.
01954
01954
毛 82.13
17 · ㇒ (3)
GKJ-00649
Evidence
Experience suggests that the modern edition may have an incorrect character, so please provide an image of the original text that is quoted here.
01636
01636
斤 69.8
12 · ㇠ (5)
GKJ-00976
Evidence
Based on the additonal evidence produced by Huang Junliang, it would seem that the submitted character is an error for 斸. Therefore suggest to postpone pending additional evidence.
01612
01612
攴 66.10
14 · ㇔ (4)
GKJ-01011
𰗣
Evidence
The glyph in the original evidence is unclear, and the evidence produced by Conifer Tseng indicates that the character should be 敘. Therefore suggest to postpone pending additional evidence that this is not a one-off error for 敘.
01720
01720
日 72.15
19 · ㇒ (3)
KC-05180
Unclear evidence
The left side of the glyph is unclear. Is there any additional evidence?
01248
01248
幺 52.9
12 · ㇒ (3)
TD-2632
Evidence
I agree completely with Ken Lunde's comments on TCA-submitted ideographs with insufficient evidence (there is an implicit "agree" comment from me on all of Ken Lunde's repeats of this comment for other TCA characters).
02088
02088
水 85.12
15 · ㇔ (4)
UK-20013
Evidence
Google search finds mostly "冬積柴水中為霖以取之", and Wikisource has "冬積柴水中為罧以取之", so it seems likely that ⿰氵⿱㓁林 is a corrupt form in this edition. Suggest to postpone pending additional evidence that ⿰氵⿱㓁林 is correct or is a stable error.
01918
01918
歹 78.5
9 · ㇠ (5)
UK-20069
New evidence
《汉语方言大词典》 p. 7321 (evidence image from UK-02874)
02081
02081
水 85.11
14 · ㇔ (4)
UK-20085
𠅤
New evidence
《武強縣新志》 appears to have 漁, although I cannot confirm this.

Evidence
Based on #7499, ⿰氵𠅤 is probably an error for U+6F01 漁, therefore suggest to postpone pending additional evidence that this is not an error form in this one source.
01626
01626
文 67.7
11 · ㇒ (3)
UK-20120
Evidence
⿰文我 here could plausibly be an error for 斌. Is there any other source which lists this person's name so that we can check whether ⿰文我 is correct?
02078
02078
水 85.11
14 · ㇒ (3)
UK-20146
New evidence
Screenshot of web page cited in #7235

00855
00855
土 32.9
12 · ㇒ (3)
UK-20148
New evidence
《欽定四庫全書》:《{{https://www.kanripo.org/text/KR2k0081/003#16a 籌海圖編}}》巻三 folio 12b:
02426
02426
犬 94.12
15 · ㇑ (2)
UK-20178
Evidence
弇州續稿卷一百二十三 has 撮. Quite probable that ⿰犭最 is a one-off error for U+64AE 撮. Therefore suggest to postpone pending additional evidence.
01795
01795
木 75.8
12 · ㇐ (1)
UK-20181
Evidence
This is a list of Song dynasty officials in a Qing dynasty book. There is a high probability that the character ⿱直木 is a mistake. 《景定建康志》卷二十四 gives the name as 趙汝末. Suggest to postpone pending additional evidence for the correct form of this person's name.
01609
01609
攴 66.5
9 · ㇐ (1)
UK-20192
Evidence
It is not clear to me what ⿰末攵 should mean here, and it seems quite possible that it is a one-off error for some other character (maybe 救). It would seem prudent to postpone pending additional evidence to confirm that this is not an error character.
01370
01370
心 61.12
16 · ㇐ (1)
UK-20195
Evidence
The evidence seems insufficient. The character is quite possibly a mistake for some other encoded character, and the single occurence of this name in a note is not sufficient to determine whether this is a genuine character or an error character. Suggest to postpone pending additional evidence.
00900
00900
土 32.13
16 · ㇑ (2)
UK-20265
New evidence
《光緒六年庚辰科會試同年齒錄・一》p. 32:
New evidence
《道光庚子恩科直省同年譜·四川、廣東、廣西、雲南、貴州羽》p. 715:
02064
02064
水 85.10
13 · ㇒ (3)
UK-20337
New evidence
《高僧法顯傳》[T2085] gives "三千僧共揵搥食"



Therefore ⿰氵追 is a variant of or error for U+6425 搥.
01743
01743
月 74.8
12 · ㇑ (2)
UK-20338
Evidence
Other versions of this text give "急蘭亦䚟" (e.g. 《元史》卷二百一十), so ⿰月㝵 is likely an error for 䚟. Therefore suggest to postpone pending additional evidence that this is not a one-off error in this edition.
01300
01300
彳 60.10
13 · ㇠ (5)
UK-20395
𰙩
Evidence
Evidently a variant or mistake for 𢕬 (U+2256C). As《天聞閣琴譜》 does not seem to be a very reliable source, I suggest to postpone pending additional evidence.
02073
02073
水 85.11
16 · ㇑ (2)
UK-20398
Evidence
It appears to be a variant of 慕. It remains postponed, pending additional evidence that ⿱莫氺 is not an idiosyncratic form used only in this one source.
01264
01264
广 53.11
14 · ㇐ (1)
UK-20424
广
Evidence
Other editions have "其妙聲,則清靜厭瘱", so the character here (which is not very clear anyway) should be an error/corruption for 瘱. Therefore suggest to postpone pending additional evidence.
04522
04522
骨 188.4
14 · ㇒ (3)
UK-20427
Evidence
Based on the comment by Eiso Chan UK-20427 seems to be a one-off error for U+4BCB 䯋, and therefore not appropriate for encoding. Therefore we WITHDRAW this character.
03800
03800
豆 151.19
26 · ㇑ (2)
UK-20440
Evidence
I agree that ⿰豊豊 corresponds to 艷體 in the form of the poem quoted in #7485. However, here we have "{⿰豊豊}飡瓊瑤華" which preserves the five-character poetic metre, so ⿰豊豊 cannot be an accidental mistranscription (I think it is impossible to accidentally mistranscribe 艷體 as ⿰豊豊). It is possible that "{⿰豊豊}飡瓊瑤華" is the original form of the poem, and "艷體餐瑤華" is a revised version which expands the unusual character ⿰豊豊. But even if "艷體餐瑤華" is the original version, I believe that "{⿰豊豊}飡瓊瑤華" must have been a deliberate revision. Therefore, I think the evidence for ⿰豊豊 is satisfactory, and the character should be kept in the M-set.
02760
02760
石 112.7
12 · ㇑ (2)
UK-20478
New evidence
《欽定四庫全書》:《{{https://www.kanripo.org/text/KR2k0081/003#16a 籌海圖編}}》巻三 folio 16a:
01297
01297
彳 60.8
11 · ㇠ (5)
UK-20640
Evidence
Other editions have "寬以濟猛,嚴而不殘" so the submitted character should be a mistake for 猛. Suggest to postpone pending additional evidence.
01707
01707
日 72.12
16 · ㇠ (5)
UK-20958
Evidence
The correct name of this Song dynasty person is 林瞪 (1003-1059), therefore ⿰日登 is certainly an error for 瞪. If this is a one-off error in this source then the character should not be encoded. Therefore suggest to postpone pending additional evidence.
00486
00486
口 30.7
10 · ㇠ (5)
UTC-00332
𥁀
New evidence
Confirmation that the correct form of the character is ⿰口⿱刀臼 (unifiable with U+5557 啗).

Source: Bernard F. Meyer and Theodore F. Wempe, "The Student's Cantonese-English Dictionary" (3rd ed., Hong Kong, 1947) p. 626 (taâm)
01721
01721
日 72.16
20 · ㇒ (3)
V0-3962
Misidentified glyph
The evidence shows U+232F1 𣋱.


Glyph Design & Normalization

SnImage/SourceComment TypeDescription
01732
01732
肉 130.5
9 · ㇐ (1)
GZ-0092201
Glyph design
The 2.0 glyph is not an improvement. The top of the vertical stroke of 巾 does not conform to PRC conventions, and the left shoulder of 巾 should not protrude. Cf. the G-source glyph for U+4F48:
01483
01483
手 64.9
12 · ㇔ (4)
GZ-4422409
Glyph design
The 2.0 glyph is not an improvement, as it does not match the evidence or conform to PRC conventions for 礻 (should not show the final stroke protruding on the left). Therefore suggest reverting to the 1.0 glyph.
01374
01374
心 61.12
16 · ㇑ (2)
UTC-00630
Glyph design
The UTC glyph uses the G form of 跋. The H form of 跋 is as shown in the evidence. Therefore suggest changing the glyph to match the evidence (i.e. use expected H glyph form).
01285
01285
彐 58.12
15 · ㇔ (4)
VN-F1A08
Glyph design
The form of 灵 used for almost all V-source characters is the same as the G-form (i.e. no protruding horizontal stroke). Only U+306FB (Ext. G) and U+31ADF (Ext. H) use the same form of 灵 as shown for this character (VN-F1A08). It would be nice if Vietnam could use a consistent form of 灵 for all characters.


Other

SnImage/SourceComment TypeDescription
02051
02051
水 85.9
12 · ㇔ (4)
UK-20175
Comment
The evidence text implies that ⿰氵冠 may be a variant of 滱.
00745
00745
口 30.19
22 · ㇒ (3)
UK-20207
Comment
The following character (⿰氵𥬡) is WS2021-02079 UK-20208. The fact that both characters in his retirement sobriquet are unencoded suggests that he deliberately created two novel characters.
02079
02079
水 85.11
14 · ㇒ (3)
UK-20208
𥬡
Comment
The preceding character (⿱興同) is WS2021-00745 UK-20207. The fact that both characters in his retirement sobriquet are unencoded suggests that he deliberately created two novel characters.
00029
00029
丿 4.4
5 · ㇠ (5)
UK-20538
Comment
See this SCMP article for additional background discussion of the "X也" gender-neutral 3rd-person pronoun. Refusing to encode this character would be doing a great disservice to the user community who want to use this gender-neutral pronoun form.
Comment
Additional discussion on twitter showing a glyph form similar to that proposed by the UK with 㐅 rather than X:
02068
02068
水 85.10
13 · ㇔ (4)
UK-20797
Comment
The new UCV allows for the character to be normalized to ⿰氵害 and use this IDS. The character should be kept in the M-set.


Data for Unihan

SnImage/SourceComment TypeDescription
01498
01498
手 64.10
13 · ㇔ (4)
UK-20107
Semantic variant
U+69BA 榺
03754
03754
角 148.12
19 · ㇔ (4)
VN-F1B70
Semantic variant
VN-F0547 (⿱解会)
02074
02074
水 85.11
14 · ㇑ (2)
UK-20665
Simp variant
WS2021-00245 GDM-00245
01362
01362
心 61.11
15 · ㇐ (1)
UK-20252
Trad variant
U+6197 憗
01435
01435
手 64.4
7 · ㇑ (2)
UK-20266
Trad variant
U+22B72 𢭲
04664
04664
魚' 195'.4
12 · ㇠ (5)
UK-20804
Trad variant
U+311A7 𱆧
Evidence provided for G_Z3862201 indicates that U+311A7 is a Zhuang word meaning 'pond loach' (泥鰍). UK-20804 also means 'pond loach', but in the Minnan language, which is a strange coincidence as Zhuang and Minnan are unrelated languages.