Please wait while loading

IRG Working Set 2021v5.0

Source: Andrew WEST
Date: Generated on 2026-01-19

Show Deleted | Show comments from version: 1.0 2.0 3.0 4.0 5.0 6.0 7.0
The Image/Source column is displayed as it was in WS2021 v5.0. The character may have a different status in the latest working set.

Unification

SnImage/SourceComment TypeDescription
02899
02899
穴 116.7.2
GDM-00313
TS 12 · IDS
Oppose Unification
In principle I think ⿱宀总 could be unified with 𥦗 (U+25997), and represented using IVS. However, I think there is merit in Wang Xieyang's argument that ⿱宀总 and ⿱穴𢗀 are intrinsically different characters because they have different radicals. Ken's argument (#14618) also makes me inclined to take a pragmatic view of the encoding of this character. Therefore I now support encoding, and as sufficient evidence has now been supplied, GDM-00313 should be moved back to the M-set.
01878
01878
木 75.16.3
GKJ-00298
TS 20 · IDS
Oppose Unification
Oppose unification to 𣝼 (U+2377C) as there is no liding rule, and a new UCV for ⿳自㓁𠔽 ~ ⿱鳥𠔿 ~ ⿱鳥囚 is not reasonable. Sufficient evidence has been provided that this is a stable variant which should be encoded.
03191
03191
耳 128.1.5
GKJ-00727
TS 7 · IDS 𡿨
Oppose Unification
Useful to encode this character, but agree to a potential future UCV with 取 if characters with this component are proposed in the future.
02963
02963
竹 118.12.2
SAT-04240
TS 18 · IDS 𠝣
Oppose Unification
Without applying the hypothetical liding rule SAT-04240 cannot be unified to 𥳭 (U+25CED) as there is no UCV for 刀~刂.
03165
03165
羊 123.11.1
SAT-06015
TS 17 · IDS
Oppose Unification
Oppose unification. The inclusion of ⿱殸⬚ in UCV 312d seems unreasonable as it is not an example of "differences in relative length of strokes" (j-2). UCV 312d should only cover ⿱𣪊⬚ and ⿹𣪊⬚, and ⿱殸⬚ should be removed from the rule.
01240
01240
干 51.3.4
SAT-06176
TS 6 · IDS
Oppose Unification
Oppose unification. No rule for this unification, and as SAT-06176 already exists as a component in five characters (𱲵𢬪𥞯𨒳𠱪) it is useful to encode it.
00413
00413
厶 28.8.5
二 7.8.5
SAT-06514
TS 10 · IDS 𠓞
Unification
U+4E9D
Maybe unifiable with 亝 (U+4E9D)?
03698
03698
衣 145.4.1
SAT-06753
TS 10 · IDS 𧘇
Unification
Other editions of 《禪要經》 give "衰酢", therefore SAT-06753 should be a variant of U+8870 衰 or U+2E571 𮕱 (a variant of 衰). Therefore suggest ad hoc unification to either U+8870 or U+2E571.
01799
01799
木 75.8.5
UK-20573
TS 12 · IDS 𠃛𠃛
Oppose Unification
Oppose unification as the right side of UK-20573 bears no resemblence to 曶. Unified glyph forms should be easily recognizable as variant forms of the same character.
02536
02536
玉 96.17.1
UK-20941
TS 22 · IDS
Unification
U+24A8B
U+81E8
There are three evidences showing ⿱臨玉 but only one showing 𤪋, therefore ⿱臨玉 seems likely to be the correct form in this case. We can accept either ad hoc unification of UK-20941 with 𤪋 (U+24A8B), or a new ucv if there are other examples of 臨~𰯲 glyph variation.
00036
00036
丿 4.11.5
VN-F0BE9
TS 12 · IDS 𠂊丿𰀁
Oppose Unification
Oppose unification and suggested over-broad UCV.


Attributes

SnImage/SourceComment TypeDescription
04700
04700
鳥 196.6.1
GKJ-00341
TS 17 · IDS
IDS
Change IDS and glyph to ⿰夹鳥.
04184
04184
鳥 196.12.3
金 167.15.3
GKJ-00363
TS 23 · IDS
Radical
Remove second radical 167.0 as it is not likely that anyone would analyze the character as 金 plus "⿰欠鳥".
04186
04186
金 167.15.4
言 149.16.3
GKJ-00507
TS 23 · IDS
Radical
The purpose of the second radical is to aid discoverability, but it is unlikely that anyone would expect to find this character under 言 radical, so agree that the second radical is not required in this case.
01988
01988
水 85.1.2
UK-20514
TS 5 · IDS
IDS
⿲㇇丨𰛅
01755
01755
月 74.12.4
UK-20554
TS 16 · IDS
Radical
Keep as Radical 74.0 (月) as the association with '腦' (meat radical) is only one modern author's interpretation, and not necessarily reliable. My own interpretation is that the character represents 'the moon through the window'.
04473
04473
饣 184′.10.3
UK-20802
TS 13 · IDS
Radical
Secondary radical seems unnecessary as this character is the simplified form of the common character U+993D 餽, so the primary radical should be obvious.
00005
00005
一 1.6.5
VN-F0BEA
TS 7 · IDS &S8-01;
IDS
㇯龍𦚏
03983
03983
辵 162.11.3
VN-F19B0
TS 15 · IDS
Radical
PnP section 2.2.1 d. (5) c): "If the technically correct (aka semantic) radical for an ideograph hampers its discoverability, or is region-dependent, the primary radical shall be assigned as though made by an ideograph expert who is neither a specialist in the history of the Han script nor familiar with ideograph etymology. The technically correct radical can be assigned as a second radical."

Therefore use Radical 82.0 (毛) as the primary radical for this character, and add 162.0 (辵) as a second radical if considered necessary.
03863
03863
走 156.10.5
VN-F1CBE
TS 17 · IDS
IDS
Personally I think ⿺起豸 is OK as the V glyph form of 起 is ⿺走己, and the IDS only looks wrong because of the particular font used here.


Evidence

SnImage/SourceComment TypeDescription
04813
04813
鸟 196′.9.5
GDM-00231
TS 14 · IDS
New evidence
SJ/T 11239—2001 39-81:

03920
03920
身 158.4.1
GDM-00257
TS 11 · IDS
Evidence
Three evidence names given, but only one evidence image shown.

⿰身犬 is possibly a variant/mistake for ⿰身大 which also has the reading māng according to Kushim Jiang. Additional evidence would be helpful.
00099
00099
二 7.3.2
山 46.2.1
GDM-00330
TS 5 · IDS
New evidence
《古壮字字典》 p. 392 [vut]:
04630
04630
魚 195.11.3
GKJ-00231
TS 22 · IDS
New evidence
The text in the evidence seems to be related to the quotation given in the entry for 《續方言》 in 《四庫全書總目提要》 which has "𧐒𧐎" instead of "䲇⿰魚覓". Therefore ⿰魚覓 can be considered to be a variant of U+2740E 𧐎. I think it is OK to move it back to the M-set.

04588
04588
魚 195.7.1
GKJ-00244
TS 18 · IDS
Evidence
Probably a variant of 蜇, not a modern typo. It would be helpful to see the original source for the text given in the evidence, but I think it is OK to encode on the given evidence.
04597
04597
魚 195.8.1
GKJ-00259
TS 19 · IDS
Evidence
Evidence is good, therefore move back to the M-set.
04686
04686
鱼 195′.17.5
GKJ-00285
TS 25 · IDS 𬶨
Evidence
It seems very probable that ⿰鱼𬶨 is a mistake for ⿰鱼暨 (simplified form of U+29F59 𩽙) in the two modern evidences given (someone just added a fish radical to the left of 𬶨 without noticing that the fish radical form of the character has 暨 on the right). If ⿰鱼𬶨 is correct we would expect ⿰魚鱀 to exist as well, but it does not.

The two evidences provided are both general texts which just mention Baiji dolphin in passing, so they cannot be considered to be authoritative sources. Please try to find additional evidence for either ⿰鱼𬶨 or ⿰鱼暨 from a zoological source that specifically discusses the Baiji dolphin. If there is no additional evidence then the character should be withdrawn. If additional evidence shows ⿰鱼暨 then suggest to change IDS and glyph to ⿰鱼暨.
04683
04683
鱼 195′.15.1
GKJ-00287
TS 23 · IDS
Evidence
Evidence is sufficient, return to M-set.
04723
04723
鳥 196.8.3
GKJ-00295
TS 19 · IDS 𠈞
Evidence
Variant/mistake for U+2A03E 𪀾 which forms the common word "𪀾鷔". As only one modern typeset source given as evidence, suggest to postpone or withdraw.
04791
04791
鳥 196.17.3
GKJ-00307
TS 28 · IDS
Evidence
What does "箕裘方~起" mean? It looks like ⿰爵鳥 should be a corruption of some reasonably common character, but it is not clear to me what character it should be. An internet search does not come up with any other examples of this phrase.
04740
04740
鳥 196.9.5
GKJ-00310
TS 20 · IDS
Evidence
Agree with #2818 that the glyph is suspicious, and may be a mistake for something like 𪆑. Suggest to postpone pending additional evidence.
04729
04729
鳥 196.9.1
GKJ-00332
TS 20 · IDS
Evidence
Evidence 1 is a quote from 禮記: 「五月鴂則鳴。」鴂者,百鷯也。 Therefore ⿰查鳥 should be a mistake for U+9DEF 鷯.
04712
04712
鳥 196.7.2
GKJ-00347
TS 18 · IDS
New evidence
《古壮字字典》 p. 77 [cingz]:
02664
02664
鳥 196.5.5
GKJ-00348
TS 16 · IDS
New evidence
《古壮字字典》 p. 384 [niuj]:
04774
04774
鳥 196.12.5
GKJ-00359
TS 23 · IDS
New evidence
Full page evidence from 《綠雪堂遺集》卷十三:

New evidence
《靜志居詩話》卷十八 gives ⿰画鳥 (?):



The word ⿰畫/画鳥鶘 seems to be a variant of 鵡鶘.
04760
04760
鳥 196.13.2
GKJ-00367
TE-493F
TS 24 · IDS
Evidence
The text in the evidence from China is a quotation from 【木華·海賦】: "鷸如驚鳧之失侶". Therefore the character is a variant or mistake for U+9A5A 驚.
04785
04785
鳥 196.14.4
GKJ-00373
TS 25 · IDS
Evidence
Stable error for U+4D10 䴐, therefore keep.
04706
04706
鳥 196.6.4
GKJ-00374
TS 17 · IDS
Evidence
Stable error for U+2A054 𪁔 in the word "𪀥𪁔", therefore keep.
04771
04771
鳥 196.12.4
GKJ-00380
TS 23 · IDS
New evidence
⿰雇鳥 is clearly a mistake for U+9D73 鵳, e.g. as in this example from 《字彙補》:



Are the two pieces sufficient to consider this a stable error?
04732
04732
鳥 196.9.2
GKJ-00386
TS 20 · IDS
Evidence
Agree with #5319 that ⿰⿱目火鳥 is a variant/mistake for U+9DAA 鶪. I suppose two pieces of evidence counts as a stable error.
04772
04772
鳥 196.12.4
GKJ-00388
TS 23 · IDS
New evidence
《蠕範》卷四:

04161
04161
金 167.12.3
GKJ-00488
TS 20 · IDS
New evidence
Looks like this ought to be a SAT character.
一切經音義 (T2128) 卷34:
04075
04075
金 167.7.3
GKJ-00491
TS 15 · IDS
Evidence
I don't really understand the evidence. What relationship has ⿰金狂 to the word 鍖銋? Is there any additional evidence for this character?
04096
04096
金 167.9.2
GKJ-00493
TS 17 · IDS
New evidence
江西通志(清雍正刊本) 99:5b
04181
04181
金 167.14.4
GKJ-00497
TS 22 · IDS 𮭲
Evidence
New evidence still looks like a variant form of U+495D 䥝 áo (same rime).
01226
01226
巾 50.8.3
GKJ-00503
TS 11 · IDS
New evidence
Kangxi Dictionary 《康熙字典》 p. 927:
04200
04200
金 167.19.3
GKJ-00527
TS 27 · IDS
Evidence
The glyph form is suspicious as ⿱𥫗鼎 does not exist as an independent character or as a component in any other character. I strongly suspect that ⿰金⿱𥫗鼎 is a mistake for U+28BB0 𨮰 zhá is it has the same reading (士戛 is also the fanqie for 鍘 zhá) and same meaning (a type of knife used to cut hay).

Therefore I suggest to postpone pending additional evidence, or withdraw.
04087
04087
钅 167′.8.3
GKJ-00531
TS 13 · IDS
Evidence
《世説新語校箋》(中華書局,1984年) p. 75 also gives U+93A9 鎩 which should not simplify to ⿰钅刹. Without any more evidence for ⿰钅刹, I suggest that GKJ-00531 is withdrawn.

04931
04931
鼠 208.9.4
GKJ-00638
TS 22 · IDS
Evidence
Mistake for U+9F31 鼱, see for example 山堂肆考

04924
04924
鼠 208.7.3
GKJ-00662
TS 20 · IDS
New evidence
The list of eight rodents given in Evidence 2 (鼸鼶鼮鼣鼭鼤䶅䶈) corresponds to the list of eight rodents given in 説略



where the second character (鼶) is written as ⿺鼠虎. Evidence 2 notes that the original form of 鼶 is ⿰鼠秃 which makes little sense as it is not close phonetically or graphically. Based on the new evidence, the original form of 鼶 is written as ⿺鼠虎, and ⿰鼠秃 is a mistake for ⿺鼠虎.

I suggest to change IDS and glyph to ⿰鼠虎 to match the new evidence.
04869
04869
鹿 198.9.4
GKJ-00672
TS 20 · IDS 鹿
Evidence
All 4 evidences show error forms encoded characters:

Evidence 1 and 4 should be mistake for U+2A2A8 𪊨 as 《説文解字》 gives 麂 as a variant of 𪊨.

Evidence 2 is a mistake for U+2A2A8 𪊨 (see same text in 兩漢博聞).

Evidence 3 is a mistake for U+9E90 麐 as 《宋史》卷218 gives the name "希麐".
04857
04857
鹿 198.7.3
GKJ-00673
TS 18 · IDS 鹿
Evidence
The first evidence should be a mistake for U+9E95 麕.
04836
04836
鹿 198.4.2
日 72.11.4
GKJ-00674
TS 15 · IDS 鹿
New evidence
Shuowen Jiezi (Zhonghua Shuju 1963, p. 139) states: "古文暴从日麃聲" and shows the seal script form of 𣋴:



Therefore, ⿸鹿日 is a mistake for U+232F4 𣋴.
New evidence
《宋本玉篇》gives ⿸麃日 which should be unifiable with U+232F4 𣋴:

04843
04843
鹿 198.6.1
GKJ-00679
TS 17 · IDS 鹿
Evidence
The evidence quotes a poem by the Song dynasty poet Su Zhe 蘇轍. There are two versions of the poem, one with U+9E96 麖 jīng and one with U+9E8F 麏 jūn.

蘇轍《𡗝中詩》: 江流日益深,民語漸以變。遙想彼中人,狀類麖/麏鹿竄。

In the evidence ⿸鹿吉 should be an error for 麏. It cannot be considered a variant as 吉 is entering tone, whereas 麖 and 麏 are both level tone, so it would not fit the tonal pattern of the poem.

Therefore suggest to postpone for additional evidence.
Evidence
The standard text of 金光明最勝王經 has "上妙香水灑遊塵".
04837
04837
鹿 198.4.4
GKJ-00683
TS 15 · IDS 鹿
New evidence
{{https://upload.wikimedia.org/wikipedia/commons/9/9d/WUL-ru05_03254_%E6%BB%87%E7%B9%AB_24.pdf 《滇繫》}

New evidence
張愈光詩文選 卷七 has U+9E83 麃:



Based on this evidence, ⿸鹿心 should be a mistake for U+9E83 麃.
04873
04873
鹿 198.10.1
GKJ-00686
TS 21 · IDS 鹿
Evidence
One-off error for U+93CA 鏊. Suggest postpone for additional evidence.
04880
04880
鹿 198.11.4
GKJ-00687
TS 22 · IDS 鹿
Evidence
One-off error for U+9E96 麖, therefore suggest to postpone for additional evidence.
04017
04017
邑 163.11.4
GKJ-00693
TS 18 · IDS 鹿
Evidence
The text given in the evidence seems to be a mistake. Kangxi Dictionary p. 1276 states: "説文本作⿰麃邑"



And Shuowen Jiezi (Zhonghua Shuju 1963, p. 132) does indeed have an entry for U+287BB 𨞻, which is of course written as ⿰麃邑 in seal script.



So ⿰鹿邑 is a mistake for ⿰麃邑 which is the archaic form for U+287BB 𨞻. As all characters with rhs 阝 can be said to be written with 邑 in the Shuowen dictionary, I do not think that it is a good idea to separately encode any more variant forms of characters with 邑 for 阝. Therefore I suggest a new UCV for 阝~邑, and withdraw GKJ-00693.
04854
04854
鹿 198.7.1
GKJ-00707
TS 18 · IDS 鹿
Evidence
Evidence 1 quotes a poem by the Song dynasty poet Su Zhe 蘇轍 which has U+9E96 麖 jīng or U+9E8F 麏 jūn in this position (see GKJ-00679), so ⿸鹿吾 is probably a mistake for U+9E8F 麏. The other three evidences are OK.
04858
04858
鹿 198.7.3
GKJ-00716
TS 18 · IDS 鹿
Evidence
None of the evidences show the glyph form very clearly, so it is not certain that the component is 免. I think that they are all intended to be forms of U+9E91 麑. Therefore suggest to postpone for better evidence.
02880
02880
禾 115.9.3
GKJ-00728
TS 14 · IDS
Evidence
⿰𥝌敂 is very similar to U+2354C 𣕌
04730
04730
鳥 196.9.1
GKJ-00739
TS 20 · IDS
Evidence
In response to Tao Yang, I guess it is a question of whether we should encode known error forms noted in critical editions of texts, or whether cited error forms such as this should be represented as PUA characters. (I personally am happy to encode error forms if they are cited in printed editions.)
03802
03802
豕 152.8.3
GKJ-00744
TS 15 · IDS
Evidence
Please provide correct evidence source. And a full page would be nice.
02467
02467
玄 95.4.3
GKJ-00752
TS 9 · IDS
Evidence
Please provide the full page of evidence to help understand the context for the *two* unencoded characters ⿰糹哥 and ⿰玄欠.
04684
04684
鱼 195′.16.2
GKJ-00762
TS 24 · IDS
Evidence
Evidence is not sufficient. It cites 《隨息居飲食譜》 but I cannot find U+2CD77 𬵷 in this text. Therefore suggest to postpone pending additional evidence.
04675
04675
鱼 195′.9.1
GKJ-00768
TS 17 · IDS
Evidence
The evidence is not sufficient as the source is not very reliable, and has many glyph errors. The evidence quotes a long-lost book (食經 by 崔禹錫), so it is unclear what the actual source for the character is. As there is no G-source for U+2CD5C 𬵜, it is not obvious that the original source referred to U+2CD5C 𬵜. I therefore suggest to postpone pending additional evidence that GKJ-00768 is the simplified form of U+2CD5C 𬵜.
03332
03332
艸 140.6.3
GKJ-00867
TC-4F5F
TS 10 · IDS
New evidence
《古壮字字典》 p. 247 [haz]:
03798
03798
豆 151.5.3
GKJ-00965
TS 12 · IDS
Misidentified glyph
This is obviously a corrupt form of some common character as the text is using it as a size comparison that readers are expected to understand. I strongly suspect it is just a corrupt form of U+8C4C 豌 (variant form ⿰豆⿱宀外 with 宀 lost). Therefore suggest withdaw or pending.
04521
04521
骨 188.3.3
GKJ-00972
TS 13 · IDS
Evidence
Agree with Comment #13582 that ⿰骨彡 is probably a corrupt form of 骸. 《三農紀》 is not a very reliable source, with many glyph errors, so I suggest to postpone pending additional evidence.
03894
03894
足 157.10.4
GKJ-00992
TS 17 · IDS 𧾷𧘇
Evidence
It looks like this character should be a corruption of some common character, but it is not obvious to me what it should be. Therefore I suggest to postpone for additional evidence.
04555
04555
鬼 194.6.1
GKJ-00998
UK-20785
TS 16 · IDS
New evidence
《道法會元》卷92:
04290
04290
隹 172.10.4
GKJ-00999
TS 18 · IDS
Evidence
Other sources for this text give "硫黃、灘石、白南礬、各四兩". The glyph in the evidence provided is unclear, but should be a corrupt form of U+7058 灘.

Therefore I suggest to withdraw GKJ-00999.
04546
04546
鬲 193.5.2
GKJ-01000
TS 15 · IDS
Evidence
Based on Comment #13587, this is likely a one-off error for U+878D 融. Therefore suggest to postpone pending additional evidence.
04567
04567
鬼 194.12.3
GKJ-01004
UK-20788
TS 22 · IDS
New evidence
《道法會元》卷92:
New evidence
《白海瓊眞人語録》19a:


Also provides additional evidence for UK-20785, UK-20786, nd UK-20787.
00564
00564
口 30.10.4
GZ-0201308
TS 13 · IDS
New evidence
英語集全 (Ying ü tsap ts'ün) = The Chinese and English instructor (Canton, 1862) 5:6b:
00334
00334
力 19.6.3
GZ-2202202
TS 8 · IDS
New evidence
00591
00591
口 30.11.4
GZ-2372404
TS 14 · IDS
New evidence
英語集全 (Ying ü tsap ts'ün) = The Chinese and English instructor (Canton, 1862) 2:80a
01751
01751
月 74.10.4
GZ-2402302
TS 14 · IDS
New evidence
David Holm, "Mapping the Old Zhuang Character Script" (Brill, 2013) p. 600:
01314
01314
心 61.4.4
GZ-2552206
TS 7 · IDS
New evidence
01444
01444
手 64.5.3
GZ-4551301
VN-F18EB
TS 8 · IDS
New evidence
02331
02331
犬 94.4.1
GZ-4812502
TS 7 · IDS
New evidence
David Holm, "Mapping the Old Zhuang Character Script" (Brill, 2013) p. 712:
03049
03049
糸 120.3.2
SAT-05998
VN-F06F7
TS 9 · IDS
New evidence
《古壮字字典》 p. 168 [gaen]:
00728
00728
口 30.17.2
TB-5C54
TS 20 · IDS
New evidence
Phonetic glosses within 英語集全 give ⿰口闊 rather than ⿰口濶, e.g. 5:60b:


A new UCV for 闊~濶 would be helpful.
00379
00379
十 24.11.3
TC-655A
TS 13 · IDS
New evidence
00193
00193
人 9.11.2
TD-3535
TS 13 · IDS
New evidence
02524
02524
玉 96.13.3
TE-2753
TS 17 · IDS 𦥯
New evidence
Unifiable by UCV 252 with ⿱𦥯玉 seen in these sources:

《江西通志》(清雍正刊本)卷 55:



《江南通志》(清乾隆元年刊本)卷177:



I suggest China consider a horizontal extension for ⿱𦥯玉.
03832
03832
貝 154.12.1
TE-3848
TS 19 · IDS
Misidentified glyph
I very much think that the character with 虫 bug (!) must be a mistaken form of U+8D12 贒, which is most likely to be the character the living person actually uses. Can you please show the original handwritten evidence provided by the living person who uses this character so that we can see if they really do want to be a 'bug' 虫 or 'loyal' 忠.
04728
04728
鳥 196.8.4
TE-3A41
TS 19 · IDS
Evidence
《全遼文》卷九(蕭裕魯墓志銘) apparently has "鸞〈⿱放鳥〉羽儀".
03442
03442
艸 140.14.3
TE-7729
TS 18 · IDS 𥠅
New evidence
Looks like the same character (unifiable per UCV 401) used in the description of Brazil on this 1602 world map (from {https://twitter.com/egasmb/status/1650216295198072835 twitter}):
02088
02088
水 85.13.2
UK-20013
TS 16 · IDS
New evidence
David Holm, "Mapping the Old Zhuang Character Script" (Brill, 2013) p. 651:
04646
04646
魚 195.13.1
UK-20102
TS 24 · IDS
New evidence
《古壮字字典》 p. 287 [lingz]:
02970
02970
竹 118.13.1
UK-20212
TS 19 · IDS
Evidence
⿱竹鼓 in the evidence shown by Tao Yang is a corruption of U+4D7E 䵾.
01978
01978
气 84.10.3
UK-20224
TS 14 · IDS
New evidence
《古壮字字典》 p. 238 [haeu]:
00720
00720
口 30.16.5
UK-20342
UTC-03218
TS 19 · IDS
New evidence
《英語集全》 = "The Chinese and English instructor" (Canton, 1862) 1:47b:
00536
00536
口 30.9.2
UK-20358
VN-F1751
TS 12 · IDS
New evidence
《太上三洞神呪》卷八(月君呪):
01920
01920
歹 78.8.1
UK-20374
TS 12 · IDS
Evidence
I think ⿰歹表 is not a "mistake", but can be considered to be a variant form of U+6B8D 殍 using a different phonetic component.
00490
00490
口 30.8.1
UK-20376
TS 11 · IDS
New evidence
00231
00231
人 9.17.1
UK-20503
UTC-00536
TS 19 · IDS
New evidence
00048
00048
乙 5.4.3
UK-20570
TS 5 · IDS
Evidence
The Han script sometimes incorporates characters and symbols from other writing systems, for example 𣥬𤔞𧳤𠐂 are corrupt forms of the Tangut characters 𘜶𗵐𘏨𘔭 recorded in a Song dynasty numismatic work; U+303C3 𰏃 is a corrupt form of a Khitan character; U+2CF01 𬼁 is derived from the dram sign ʒ (U+0292); U+2CF04 𬼄 is derived from the ounce sign ℥ (U+2125); and WS2021 UTC-03225 is derived from the pound sign ℔ (U+2114). In the sources for UK-20570 etc. corrupt forms of certain syllables of the Siddham script which are used in mantras have been treated as Han characters, so it is appropriate to encode them as Han ideographs. Note that the Siddham script is already encoded since Unicode 7.0, so encoding these particular Han-ified Siddham syllables does not affect the use of Unicode Siddham.
New evidence
Via Sven Osterkamp (@schrift_sprache) on twitter, these are the nine characters in 《字孳補》 (the source used for the glyph forms in the UK submission), which together form the Mantra of Ratnasikhine Tathagata (寶髻如來護生咒), i.e. 唵縛嚩悉波羅摩尼莎訶 which corresponds to "oṁ va svara maṇi svāhā" in Sanskrit. The 7th character corresponds to Sanskrit ṇi, which matches the evidence given by Huang Junliang in #13137, thus I think both glyph forms represent the same Sanskrit transcription character, so it should be OK to change the glyph form for UK-20570 based on the new evidence.

New evidence
Via Edward W. (@edwardW2) on twitter, this is an example of the Mantra of Ratnasikhine Tathagata (寶髻如來護生咒) from 《通天曉》(1841 ed.), here noted as corresponding to "唵縛嚩悉波羅摩尼莎訶" (i,e, "oṁ va svara maṇi svāhā"). The glyph form for the 7th character matches the form shown in the evidence from Huang Junliang (also the glyph forms for the 1st, 3rd, and 9th characters match the correct forms suggested by evidence from Huang Junliang).

02305
02305
牛 93.1.5
UK-20572
TS 5 · IDS
Evidence
The Han script sometimes incorporates characters and symbols from other writing systems, for example 𣥬𤔞𧳤𠐂 are corrupt forms of the Tangut characters 𘜶𗵐𘏨𘔭 recorded in a Song dynasty numismatic work; U+303C3 𰏃 is a corrupt form of a Khitan character; U+2CF01 𬼁 is derived from the dram sign ʒ (U+0292); U+2CF04 𬼄 is derived from the ounce sign ℥ (U+2125); and WS2021 UTC-03225 is derived from the pound sign ℔ (U+2114). In the sources for UK-20572 etc. corrupt forms of certain syllables of the Siddham script which are used in mantras have been treated as Han characters, so it is appropriate to encode them as Han ideographs. Note that the Siddham script is already encoded since Unicode 7.0, so encoding these particular Han-ified Siddham syllables does not affect the use of Unicode Siddham.
00562
00562
口 30.10.3
UK-20590
TS 13 · IDS
New evidence
New evidence
《太上三洞神呪》卷七(欻火誓呪):
00757
00757
口 30.24.1
UK-20597
TS 27 · IDS
New evidence
New evidence
《太上三洞神呪》卷八(火軸衮鬼呪):
00269
00269
冖 14.2.3
UK-20637
TS 4 · IDS
New evidence
《古壮字字典》 p. 40 [byat]:
New evidence
00620
00620
口 30.12.3
UK-20670
TS 15 · IDS
New evidence
《古壮字字典》 p. 474 [swnx,swnj]:
New evidence
《太上三洞神呪》卷八(火軸衮鬼呪):
00613
00613
口 30.12.2
UK-20671
TS 15 · IDS
New evidence
《道法會元》卷159:
00516
00516
口 30.8.4
UK-20672
TS 11 · IDS
New evidence
英語集全 (Ying ü tsap ts'ün) = The Chinese and English instructor (Canton, 1862) 2:65a
03765
03765
言 149.4.2
UK-20679
TS 11 · IDS
New evidence
《道法會元》卷91:
New evidence
《道法會元》卷94:
04371
04371
雨 173.20.3
UK-20680
TS 28 · IDS
New evidence
《道法會元》卷94:
00662
00662
口 30.14.2
UK-20689
TS 17 · IDS
New evidence
《太上三洞神呪》卷四(天心總咒):
00538
00538
口 30.9.2
UK-20694
TS 12 · IDS
New evidence
《太上三洞神呪》卷十(烈雷呪):
00699
00699
口 30.16.1
UK-20714
TS 19 · IDS
New evidence
《古壮字字典》 p. 102 [daeuj]:
00563
00563
口 30.10.3
UK-20715
TS 13 · IDS
New evidence
《太上三洞神呪》卷八(會雷呪):
00439
00439
口 30.4.3
UK-20716
TS 7 · IDS
New evidence
《早期香港史研究資料選輯》(p. 280)



I think "{⿰口丹}𠺮哪" is the name of a fake British ship in a fake report.
01220
01220
己 49.9.2
UK-20731
TS 12 · IDS
New evidence
《太上三洞神呪》卷三(滅魔咒):
00498
00498
口 30.8.2
UK-20767
VN-F1742
TS 11 · IDS
New evidence
《太上三洞神呪》卷五(玉音召咒):
00568
00568
口 30.10.4
UK-20771
TS 13 · IDS
New evidence
《太上三洞神呪》卷八(日君呪):
00624
00624
口 30.12.4
UK-20772
TS 15 · IDS
New evidence
《太上三洞神呪》卷八(三十六字天罡呪):
04547
04547
鬼 194.4.1
UK-20786
TS 14 · IDS
New evidence
《道法會元》卷92:
04569
04569
鬼 194.12.4
UK-20787
TS 22 · IDS
New evidence
《太上三洞神呪》卷三(破穢咒):
04565
04565
鬼 194.9.5
UK-20794
TS 19 · IDS
New evidence
《道法會元》卷91:
00790
00790
土 32.4.3
UK-20847
TS 7 · IDS
New evidence
《古壮字字典》 p. 72 [ciengz]:
03228
03228
肉 130.6.2
UK-20852
TS 10 · IDS
New evidence
《古壮字字典》 p. 353 [ndang]:
00575
00575
口 30.11.1
UK-20982
TS 14 · IDS
New evidence
籌辦夷務始末(故宮博物院影印清內府抄本)道光朝卷72 folio 5a (see comment #13903 on UTC-00566)



"囉𪡈{⿰口聃}" is the transcription of the name of the Canton-based British translator Robert Thom (1807–1846). The first character is here written as ⿰口𣆀 with a missing stroke, but ⿰口聃 should be the correct form of the character.
00688
00688
口 30.15.2
UK-20989
TS 18 · IDS
New evidence
《古壮字字典》 p. 495 [yaed]:
00755
00755
口 30.22.4
UK-20994
TS 25 · IDS
New evidence
英語集全 (Ying ü tsap ts'ün) = The Chinese and English instructor (Canton, 1862) 2:35b
01583
01583
手 64.15.5
UTC-00394
TS 18 · IDS
New evidence
《古壮字字典》 p. 11 [bag]:
02328
02328
犬 94.2.5
UTC-03232
TS 5 · IDS
New evidence
New evidence
《中华字海》 p. 491
02258
02258
火 86.16.2
V4-4D51
TS 20 · IDS 𬞕
New evidence
《古壮字字典》 p. 278 [lanh]:
01302
01302
彳 60.12.3
VN-F019F
TS 15 · IDS
New evidence
《古壮字字典》 p. 10 [baez]:
00639
00639
口 30.13.2
VN-F16D6
TS 16 · IDS
New evidence
《古壮字字典》 p. 483 [vangq]:
01443
01443
手 64.5.2
VN-F18E8
TS 8 · IDS
New evidence
《古壮字字典》 p. 221 [gyaeng]:
01454
01454
手 64.6.1
VN-F18EC
TS 9 · IDS
New evidence
《古壮字字典》 p. 94 [daeh]:
00101
00101
而 126.2.1
VN-F1AF3
TS 8 · IDS
New evidence
《古壮字字典》 p. 396 [nyih]:


Glyph Design & Normalization

SnImage/SourceComment TypeDescription
04875
04875
鹿 198.10.3
GKJ-00694
TS 21 · IDS 鹿
Normalization
As the G-form of U+9EA2 麢 is ⿸鹿霝 it is OK to normalize to ⿸鹿留.
04862
04862
鹿 198.8.1
GKJ-00701
TE-3A4D
TS 19 · IDS 鹿
Normalization
Suggest to normalize China glyph from ⿰鹿竒 to ⿰鹿奇.
03693
03693
行 144.15.4
GKJ-00914
TS 21 · IDS
Glyph design
The 7th stroke of 養 should be modified to accord with PRC conventions (cf. G glyph for U+3526 㔦).
03416
03416
艸 140.13.1
火 86.13.2
TE-275B
TS 17 · IDS
Glyph design
The positioning of the dot is very peculiar. The expected position of the optional dot of 者 is immediately above the 日 component.
02151
02151
水 85.20.4
UK-20052
TS 23 · IDS
Glyph design
Agree to change IDS and glyph to ⿰氵兢.
02077
02077
水 85.11.3
UK-20122
TS 14 · IDS
Glyph design
Correct the 谷 component to join the middle left and right strokes.
02709
02709
目 109.12.1
UK-20411
TS 17 · IDS 𣈆
Glyph design
Agree to change IDS and glyph to ⿰目晉 per #6140
03796
03796
谷 150.17.2
UK-20419
TS 24 · IDS
Glyph design
Correct the 谷 component to join the middle left and right strokes.
00048
00048
乙 5.4.3
UK-20570
TS 5 · IDS
Glyph design
Agree to update the glyph to match the new evidence.
02305
02305
牛 93.1.5
UK-20572
TS 5 · IDS
Glyph design
Agree to update the glyph to match the new evidence.
00613
00613
口 30.12.2
UK-20671
TS 15 · IDS
Glyph design
Agree to improve the glyph.
02526
02526
玉 96.13.4
UK-20906
TS 17 · IDS
Glyph design
TCA comment is reasonable, therefore agree to change glyph to ⿰王兾 if there are no other objections.
03951
03951
車 159.12.5
UK-20955
TS 19 · IDS
Glyph design
Based on the new evidence, I suggest to normalize the glyph form to ⿰車參.
00776
00776
囗 31.9.4
UK-20973
TS 12 · IDS
Glyph design
Re #10956, the glyph shown in the evidence appears to be an imperfect form of ⿴囗峦, with a damaged final stroke. However, the final horizontal stroke is still visible as a thin line below 山, so ⿴囗峦 should be correct, and no change to the glyph is required.
03814
03814
貝 154.2.3
VN-F1B83
TS 9 · IDS
Glyph design
Font glyph component 乃 is strange, and does not match the evidence. Please confirm whether this is the preferred Vietnam form of 乃, and if not then correct the font glyph.


Other

SnImage/SourceComment TypeDescription
03919
03919
身 158.3.1
GDM-00256
TS 10 · IDS
Other
⿰身大 has the reading māng according to Kushim Jiang.
02899
02899
穴 116.7.2
GDM-00313
TS 12 · IDS
Other
I would be interested in knowing how common U+25997 𥦗 is. The source is is GHZ-42731.08, but 《汉语大字典》(第二版) p. 2922 does not give any sources, and merely states 同“窓(窗)”.



As L F Cheng says (#14310), maybe the dictionary form is a normalization of ⿱宀总. If 𥦗 is not a common form, then China can consider changing the glyph for U+25997 to ⿱宀总.
03825
03825
貝 154.9.1
GDM-00316
TS 13 · IDS
Other
⿱不貴 is not encoded. My strong opinion is that every Chinese simplified character that we encode should have a corresponding traditional form, and if it does not then we should simply add the corresponding traditional form to the same working set.
04328
04328
雨 173.10.4
GDM-00353
TS 18 · IDS
Other
Other variants of ~𩃱:
04316
雨 173.8.4
GDM-00354
TS 16 · IDS
Evidence accepted, IRG 58.

04314
雨 173.8.3
GDM-00355
TS 16 · IDS
Evidence accepted, IRG 59.
04316
04316
雨 173.8.4
GDM-00354
TS 16 · IDS
Other
04314
04314
雨 173.8.3
GDM-00355
TS 16 · IDS
Other
04612
04612
魚 195.9.3
GKJ-00238
TS 20 · IDS
Other
The character ⿰車夷 shown before ⿰魚俞 in 《開元占經》 is not encoded.
04791
04791
鳥 196.17.3
GKJ-00307
TS 28 · IDS
Other
Thank you Eiso for the helpful discussion. It make sense that ⿰爵鳥 is a variant of 鵲, but it is odd that there are no other attestations of this character that I can find.
04719
04719
鳥 196.8.2
GKJ-00325
TS 19 · IDS
Other
Evidence indicates that this is a corrupt form of U+9DBB 鶻.
03670
03670
虫 142.23.2
GKJ-00355
TS 29 · IDS
Comment
Strongly support not encoding as cjkui simple ligatures of two characters (i.e. where two characters are simply squeezed together into a single character space for aesthetic or practical purposes) where there is no semantic difference between the ligatured and unligatured forms.
04199
04199
金 167.18.3
GKJ-00484
TS 26 · IDS
Other
The character ⿰音歸 in the evidence is not encoded or proposed for enciding.
04850
04850
鹿 198.6.3
GKJ-00670
TS 17 · IDS 鹿
Other
I feel like ⿸鹿合 should be a mistake for an encoded character, but I have been unable to find any additional evidence for or against encoding.
04843
04843
鹿 198.6.1
GKJ-00679
TS 17 · IDS 鹿
Other
Perhaps ⿸鹿吉 is a corruption of ⿸鹿吾 shown in Evidence 1 for GKJ-00707.
04842
04842
鹿 198.5.5
GKJ-00715
TS 16 · IDS 鹿𢎤
Other
𢎤 is an ancient form of 射, therefore ⿸鹿𢎤 is a non-unifiable variant of U+9E9D 麝.
02467
02467
玄 95.4.3
GKJ-00752
TS 9 · IDS
Other
⿰糹哥 is not encoded or proposed for encoding. I hope that China will submit ⿰糹哥 for the next working set.
04675
04675
鱼 195′.9.1
GKJ-00768
TS 17 · IDS
Other
Note that ⿰魚要 is unifiable to U+2CD5C 𬵜 under UCV #159 (襾~覀).
03820
03820
貝 154.6.3
KC-05735
TS 13 · IDS
Other
Yes, it is odd. I cannot find any evidence for an old or variant form of 顯 which looks like this. My suspicion is that it is simply a corrupt form of 費.
03368
03368
艸 140.9.2
SAT-08693
TS 13 · IDS
Other
Similar alternations between 丶 and 厶 include U+5154 兔 and U+2A781 𪞁; and U+26351 𦍑 and U+7F97 羗.
03442
03442
艸 140.14.3
TE-7729
TS 18 · IDS 𥠅
Comment
On second thoughts, maybe the character shown in the map is U+27068 𧁨.
00490
00490
口 30.8.1
UK-20376
TS 11 · IDS
Comment
Above evidence shows ⿰口恊 which is potentially unifiable with ⿰口協 with a new UCV.
01204
01204
山 46.19.1
UK-20408
TS 22 · IDS
Comment
"⿰山施" is not encoded or proposed for encoding. It should be a variant of U+21D8A 𡶊 shown in the previous two evidences; also a variant of U+5D3A 崺.
03765
03765
言 149.4.2
UK-20679
TS 11 · IDS
Other
Evidence is sufficient, no need to postpone.
01615
01615
攴 66.12.1
UTC-03224
TS 16 · IDS
Comment
I suggest not to encode UTC-03224 as a cjkui as it is a simple ligature of the two characters 敕 and 令 with no difference in meaning or reading. I think it is best to represent simple ligatures such as this (where the two ligatured characters maintain their original glyph forms) as ZWJ-ligatures (or using some other mechanism as discussed in L2/23-073).

Note that there are several different encoded forms of 敕 (勅勑𠡠敕𱡘𢽟) which could all potentially form ligatures with 令.


Data for Unihan

SnImage/SourceComment TypeDescription
04734
04734
鳥 196.9.3
GKJ-00322
TS 20 · IDS
Semantic variant
U+9DC2 鷂
04731
04731
鳥 196.9.2
GKJ-00327
TS 20 · IDS
Semantic variant
U+9C02 鰂
04725
04725
鳥 196.8.3
GKJ-00356
TS 19 · IDS
Semantic variant
U+294E5 𩓥
04732
04732
鳥 196.9.2
GKJ-00386
TS 20 · IDS
Semantic variant
U+9DAA 鶪
04175
04175
金 167.14.1
GKJ-00523
TS 22 · IDS
Semantic variant
U+791F 礟 (U+792E 礮)
04876
04876
鹿 198.10.3
GKJ-00700
TS 21 · IDS 鹿
Semantic variant
U+9A68 驨
04872
04872
鹿 198.10.5
GKJ-00712
TS 21 · IDS 鹿
Semantic variant
Variant of U+9E8F 麏 which Shuowen Jiezi gives as a variant of U+9E87 麇.
04451
04451
食 184.8.3
GKJ-01017
TS 16 · IDS
Semantic variant
U+25EC0 𥻀 (U+7CCC 糌)
03723
03723
衣 145.11.3
SAT-06100
TS 17 · IDS 𰲝
Semantic variant
U+892B 褫
03982
03982
辵 162.11.3
SATM-90367
TS 15 · IDS
Semantic variant
U+908A 邊
04943
04943
齒 211.5.3
TE-3F55
TS 20 · IDS
Semantic variant
U+9F61 齡 ?
04662
04662
鱼 195′.4.3
GKJ-00760
TS 12 · IDS
Simp variant
U+9B64 魤
04663
04663
鱼 195′.4.4
GKJ-00767
TS 12 · IDS
Simp variant
U+29EDF 𩻟
04661
04661
鱼 195′.2.1
GKJ-00806
TS 10 · IDS
Simp variant
U+4C33 䰳
00590
00590
口 30.11.4
GZ-1742502
TS 14 · IDS
Simp variant
Eiso (#14608) is absolutely correct that 蓋 and 盖 are a T/S pair of characters, but are not a T/S pair for components. 蓋/盖 are defined in Table 1 of the General Tables of Simplified Characters (简化字总表), for T/S pairs which are explicitly not applicable to components "不作简化偏旁用的简化字".

However, the kSimplifiedVariant and kTraditionalVariant fields defined in Unihan_Variants.txt do not correspond to the officially-defined set of simplified/traditional characters, but include all sorts of non-standard and de facto simplifications, as well as treating old style and new style orthographic forms (新旧字形) as if they were traditional/simplified pairs (e.g. 呂/吕 are defined as a T/S pair in Unihan, even though they are not a 繁體字/简体字 pair but a 舊字形/新字形 pair).

It is no surprise that Unihan treats 蓋 and 盖 as T/S components for the character pairs 㯼/𣙥, 䡷/𰺡, 壒/𭏦, 礚/𥕤, 鑉/𫠁:

U+3BFC kSimplifiedVariant U+23665
U+23665 kTraditionalVariant U+3BFC

U+4877 kSimplifiedVariant U+30EA1
U+30EA1 kTraditionalVariant U+4877

U+58D2 kSimplifiedVariant U+2D3E6
U+2D3E6 kTraditionalVariant U+58D2

U+791A kSimplifiedVariant U+25564
U+25564 kTraditionalVariant U+791A

U+9449 kSimplifiedVariant U+2B801
U+2B801 kTraditionalVariant U+9449

Therefore, based on the current loose definitions of kSimplifiedVariant and kTraditionalVariant in Unihan, it is reasonable to add GZ-1742502 as a simplified variant of U+2103D 𡀽 (also U+6FED 濭 and U+31A1A 𱨚).

However, I think that the current Unihan definitions of simplified and traditional variants is very unsatisfactory, and needs to be thoroughly revised. My opinion would be to remove all old/new orthographic forms (such as 呂/吕), and define new keys for them (e.g. kOldForm and kNewForm). I would also remove kSimplifiedVariant from all characters which are not explicitly or implicitly defined as simplified characters in the latest official table of simplifications, and replace them with some other key.


Submitter Request

SnImage/SourceComment TypeDescription
02580
02580
田 102.3.5
UK-20046
TS 8 · IDS
Withdraw
Withdraw as an error form of U+2D60B 𭘋.
Consider adding a level 2 UCV for 田 and 由.
02081
02081
水 85.11.4
UK-20085
TS 14 · IDS 𠅤
Withdraw
Based on #7931 and #14357, withdraw.
02121
02121
水 85.16.4
UK-20114
TS 19 · IDS
Withdraw
Withdraw based on additional evidence.
00801
00801
土 32.5.4
UK-20130
TS 8 · IDS
Withdraw
Withdraw as a one-off error for U+2A8DF 𪣟
01922
01922
歹 78.8.3
UK-20118
TS 12 · IDS
Postpone
Text seems to be an error for "靡不殫心". Postpone for additional evidence.
01643
01643
方 70.9.3
UK-20956
TS 13 · IDS
Postpone
Probably a one-off error, therefore postpone pending additional evidence.