
IRG Working Set 2024v3.0

Source: Xieyang WANG
Date: Generated on 2025-12-10


Unification

Showing 4 comments.

Sn | Image/Source | Comment Type | Description
00678
00678
口 30.15.1
GCCPP-00027
TS 18 · IDS
Oppose Unification
I think 𡄙 (U+21119) is a variant of 賾. I can't find evidence showing that it is also a variant of 啧. Can you please provide related evidence?
Personally, I think it is better to encode it separately because 西游记 is very famous in China and the glyph is stable.
What's more, 𡃄 (U+210C4), an erroneous form of the submitted character, has already been encoded. The submitted character would have been encoded instead had the error not occurred.
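The code points cited in the comment above can be checked mechanically. The following is a minimal sketch; the helper name `code_point_label` is hypothetical and is not part of any IRG tooling.

```python
def code_point_label(ch: str) -> str:
    """Return the U+XXXX label for a single character (hypothetical helper)."""
    if len(ch) != 1:
        raise ValueError("expected exactly one character")
    # At least four hex digits, upper case, as in Unicode notation.
    return f"U+{ord(ch):04X}"


# Print the labels for the two characters discussed in the comment,
# so they can be compared with the values quoted there.
for ch in "𡄙𡃄":
    print(ch, code_point_label(ch))
```

Python strings are sequences of code points, so characters outside the Basic Multilingual Plane need no special handling here.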
00574
00574
口 30.11.5
T13-3D4A
TS 14 · IDS
Oppose Unification
The difference is too great.
00993
00993
大 37.4.2
TC-2A28
TS 7 · IDS
Oppose Unification
This character is also used in personal names in mainland China. We want to keep it.
02453
02453
玉 96.13.3
UK-30024
TS 18 · IDS 𦥯
Oppose Unification
The pronunciation of WS2021-02524 seems to be a dialectal pronunciation of 王 or 璺. Meanwhile, the proposed character is pronounced yu4 or wen4 (璺). So WS2021-02524 and the proposed character are actually non-cognate.

Evidence

Showing 41 comments.

Sn | Image/Source | Comment Type | Description
02771
02771
示 113.16.1
GCCPP-00036
TS 21 · IDS
Evidence
[ Unresolved from v1.0 ]
It doesn't have to be an erroneous form. It is reasonable for the left component of 䬙 to change to 票 in the word 飘䬙.
Evidence
[ Unresolved from v1.0 ]
If this is considered questionable, then every 类化字 is questionable.
Evidence
[ Unresolved from v1.0 ]
The 飘䬙 form of 飘飖 is very common in ancient books. It is reasonable to say that ⿰票䍃 is a Leihua character (类化字).
民國新纂雲南通志

嘉靖寧波府志

嘉靖徽縣志

嘉靖尉氏縣志

光緒增修甘泉縣志

佩文韻府,清康熙武英殿本

太平御覽,四庫全書本

文山集,四庫全書本

春在堂詩編,民國春在堂全書本
02785
02785
禾 115.5.5
GCW-00180
TS 10 · IDS
New evidence
中国测绘科学院:库外字代码对照表

古文字谱系疏证,黄德宽,p832
03982
03982
门 169′.8.3
GCW-00239
TS 11 · IDS
Evidence
[ Unresolved from v2.0 ]
The glyph is entirely reasonable and has not been misidentified.
⿱丶冂 is a variant of 门.

The glyph in the index:

More cases in the book:
闼,U+95FC

𮤬,U+2E92C

𮤸,U+2E938
00009
00009
一 1.5.5
GDM-00377
TS 6 · IDS 𠫔𫶧
New evidence
[ Unresolved from v1.0 ]
宣统《楚雄县志》:


云南省楚雄市地名志(1983), page188
04590
04590
鸟 196′.7.3
GDM-00497
TS 12 · IDS
New evidence
[ Unresolved from v2.0 ]
陆丰县人民政府:广东省陆丰县标准地名录,page272
Evidence
[ Unresolved from v2.0 ]
04673
04673
龙 212′.6.2
GDM-00505
TS 11 · IDS
Unclear evidence response
[ Unresolved from v2.0 ]
Yes. And we have asked local people to confirm.
New evidence
中国测绘科学院:库外字代码对照表,2000年11月, page53
00139
00139
人 9.9.1
GZHSJ-0106
TS 11 · IDS 𦣻
New evidence
讀通鑑論
New evidence
From other books.



Modern use is also important. We think the evidence is sufficient for encoding even if evidence from old books is not provided.
04635
04635
麥 199.10.2
GZHSJ-0116
TS 21 · IDS
Evidence
[ Unresolved from v2.0 ]
We think the current evidence is sufficient for encoding, but we also welcome additional evidence from other experts.
New evidence
[ Unresolved from v2.0 ]
It is not rare in old books. I'd like to provide two pieces of evidence here:
弇州四部稿,卷一百七十,说部,宛委余编十五,日本早稻田大学图书馆藏,page24

北山酒经,文渊阁四库全书本,page3
04561
04561
鳥 196.12.1
GZHSJ-0168
TS 23 · IDS
Evidence
[ Unresolved from v2.0 ]
Agree with Huang's comment #8514.
00333
00333
厂 27.4.1
KC-10051
TS 6 · IDS
New evidence
Seen in 《浙江省地名库外字代码对照表》
New evidence
大泌山房集,明万历刻本,卷七十五,page又11
A variant or an error form of 疣.

As a Chinese place name, it is used in 浙江省德清县~家埭; the place name is now 庞家埭. The character appeared in two versions of the map and was digitized. So it is a kind of stable erroneous form (variant) of 庞.
I think it is OK to encode the character based on all this evidence. I also think that the difference between 尤 and 龙 is a major one in a character with such a simple structure; i.e., even if the character is cognate with 庞, the two should not be unified.
01868
01868
木 75.4.1
SAT-09787
TS 8 · IDS
Evidence
The context is quoted from 《说文》. 说文:攫,扟也。 So it is an erroneous form of 扟.
In 《慧琳一切经音义》, the character is written as 扟, 㭄 or ⿰木⿹⺄干. Personally, I suggest that SAT either withdraw the character or unify it with 㭄 (U+3B44); I think withdrawing it is preferable.
慧琳一切经音义,狮谷莲社刻本



大正藏:


02554
02554
白 106.3.3
SAT-09837
TS 8 · IDS
New evidence
New evidence
The last piece of evidence is from 精刻海若湯先生校訂音釋五侯鯖字海, 卷首, page30, left
The following is from 新刻洪武元韻勘正切字海篇群玉, 卷一, page10
02001
02001
止 77.4.1
一 1.7.3
TC-7854
TS 8 · IDS
New evidence
This character is widely seen in books on ancient Chinese writing. It is a 隶定字. It is useful and can be kept.
新甲骨文编(增订本),2014年,page82

甲骨文字诂林,1996年,page870
00383
00383
口 30.4.3
UK-30142
TS 7 · IDS
New evidence
未圆的梦 英雄少年周贲遗作选,民族出版社,1998年,page52
03487
03487
襾 146.6.5
UK-30620
TS 12 · IDS
Unclear evidence
[ Unresolved from v1.0 ]
I believe IRG experts agreed at IRG meeting #62 that captions cannot be the sole source of evidence for submitted ideographs. So unless other evidence can be provided, the ideograph should be postponed.
The decision about using captions as evidence was clearly stated at the meeting, so this kind of situation should not have arisen.
Unclear evidence
[ Unresolved from v1.0 ]
We think that this ideograph should be postponed if no more qualified evidence can be provided. For more comments, please go to:
https://hc.jsecs.org/irg/ws2024/app/?find=UK-30621
Unclear evidence
[ Unresolved from v2.0 ]
The UK still has not been able to provide evidence from historical documents or other high-quality sources for this character. On the contrary, the UK has kept providing evidence of the character being used by one or two users on unstable media, and such evidence is not sufficient to support encoding it. We have made it very clear that evidence of this type is not sufficient to support the encoding of this character. However, the experts from the UK seem unable to understand what we mean: they still insist on providing evidence of the same type and of insufficient quality, and on saying things that are obviously inconsistent with the facts. This kind of behavior is of no benefit to the review work and cannot get these characters moved from the D-set back to the M-set.
At the last meeting, we already suggested that if the experts from the UK believe that this character is not a newly self-created character and is worth encoding, they can simply publish a paper that includes this character and vouch for these characters with their own reputation. Let me be more straightforward: it should not be difficult for the experts from the UK to publish a paper. If they are not willing to risk their own reputation, they should not insist on using evidence that obviously does not meet the requirements to request the encoding of this character in the IRG.

英国(UK)至今仍未能提供这个字的历史文献证据或其他高质量来源,相反,英国一直在提供由一两个使用者在不稳定的载体上使用该字的证据,这些证据并不足以支持对该字进行编码。我们已经说得很清楚,英国提供的这种类型的证据不足以支持编码该字,但英国的专家似乎无法理解我们表达的意思,仍在坚持提供相同类型、效力不足的证据,坚持说一些明显与事实不符的话。这种行为对审核工作没有任何益处,也无法让这些字从 D-set 返回 M-set。
在上次会议上,我们已经建议:如果英国专家认为这个字并非个人新造,且有编码的价值,完全可以自己发表一篇包含此字的论文,以自己的声誉给这些字作担保。让我把话说的再直接一点:发表一篇论文对英国专家来说应该不是什么难事,如果英国专家不愿意拿自己的声誉冒险,就不应该坚持在 IRG 以明显不符合要求的证据来要求对该字进行编码。
Unclear evidence
[ Unresolved from v2.0 ]
China thinks the evidence is still not sufficient and the character should be kept in the D-set until better evidence is provided.
00061
00061
丿 4.10.5
UK-30621
TS 11 · IDS
Unclear evidence
[ Unresolved from v1.0 ]
I believe IRG experts agreed at IRG meeting #62 that captions cannot be the sole source of evidence for submitted ideographs. So unless other evidence can be provided, the ideograph should be postponed.
The decision about using captions as evidence was clearly stated at the meeting, so this kind of situation should not have arisen.
Unclear evidence
[ Unresolved from v1.0 ]
It is not a matter of the definition of captions or lyrics; it is a matter of the quality of the evidence. It is ridiculous to discuss the definition of a caption here instead of focusing on the quality of the evidence.
The evidence for GDM-00507 and GDM-00508 comes from at least two different buildings that stand in the real world. Buildings do not easily change or vanish. What's more, the two ideographs are used by many local people, which is why they can appear on the plaques of temples, which are sacred.
However, the evidence for this ideograph comes from a video created by someone on the Internet, and the uploader can edit or delete the video at any time. The video, which is too weak as evidence for encoding, is not even from published material.
https://www.bilibili.com/video/BV1Ki4y127Cm/
If this can be accepted as evidence, then we might as well submit all of this to IRG; there are even pronunciations and definitions:
https://www.bilibili.com/video/BV198411s7Ft/


Moreover, I don't think IRG has to list every unstable source of this kind (captions, lyrics, articles, instructions, notes...) in the PnP; doing so would be unnecessary and endless.
Unclear evidence
[ Unresolved from v1.0 ]
I think whether a character is a nonce form should be established by indisputable evidence. I'd like to help the submitter find evidence that meets IRG requirements, but the current evidence cannot prove that ⿰久闹 differs from ⿱因八 or ⿱中分 as to whether it is a nonce form.
It should be noted that our center previously submitted a document, "Application for encoding some ideographs used in Chinese geographical names" (IRGN2649), to IRG, and an expert pointed out that it was not suitable as the only evidence for encoding. Our center is a formal institution established by Sichuan International Studies University, which belongs to the People's Government of Chongqing Municipality (重庆市人民政府). It would be very offensive, and therefore unacceptable, if videos on the internet were considered more trustworthy or more suitable for encoding than an application bearing our seal.
Unclear evidence
[ Unresolved from v1.0 ]
The new evidence still shows no running text but only a screenshot of a computer font.
Unclear evidence
[ Unresolved from v1.0 ]
Screenshots of computer fonts and videos from the internet are absolutely not qualified evidence for IRG. If no other evidence can be provided, then this ideograph should be postponed.
Unclear evidence
[ Unresolved from v1.0 ]
IRG PnP Version 17, page 11-12
Currently, IRG mainly accepts evidence from printed material if they are accepted as IRG sources.
In general, IRG DOES NOT accept multimedia material as IRG sources.
Note: the acceptance of the multimedia material, the popularity of the material, cultural influences, and other factors that warrants its acceptance.
We can't find any sentence in the IRG PnP stating that being posted once on Instagram, Twitter or Bilibili by any uploader warrants acceptance of the evidence.

Furthermore, the screenshots of computer fonts prove nothing except that the font producer has made the font. They cannot prove that the shape is actually used in texts, or even that it exists. As far as we know, the uploader of the video used ⿰久闹 just because he saw the font on a friend's computer, without knowing the pronunciation or meaning.

I'd like to point out that using these as evidence goes against the UK's general requirements for the quality of evidence. I really don't think other experts would accept these two images as qualified evidence even if I were persuaded. So please find qualified evidence for the ideograph or postpone it.
Unclear evidence
[ Unresolved from v1.0 ]
Thanks to John for pointing that out, and I am sorry that I made the mistake. But I still can't find a sentence in the IRG PnP stating that being posted once on Instagram, Twitter or Bilibili by any uploader warrants IRG's acceptance of the evidence.
Comment #2304 says:"Also it is colour code so the lyrics are in red and the pronunciation and meaning in black. Therefore it is clear that the uploader understands the meaning and pronunciation."
I think this is obviously wrong. Logically, I could use 鹿 with the pronunciation mǎ and the meaning 马 in my own video. It would be ridiculous to say that 鹿 is pronounced mǎ and means 马 just on the basis of my video. The paired pronunciation and meaning in the video prove nothing except that the uploader used ⿰久闹 with that pronunciation and meaning in the video. This fact warrants nothing.
I'd like to say that I am fairly sure that the uploader didn't know the pronunciation or meaning before using it. So please find qualified evidence for the ideograph or postpone it, as experts will suggest in IRG meetings.

Comment #2304 also says:"It should of course go almost without mention that the evidence conforms to the requirements of the UK."
Comparing the evidence for this ideograph with the evidence for most other ideographs, we still think that the evidence for this ideograph goes against the UK's general requirements for the quality of evidence. It would be very worrying if their quality were the same.
Unclear evidence
[ Unresolved from v1.0 ]
Comment #2798 says "Furthermore since the up-loader of the video in 2022 was around 20 together". I think the submitter should provide screenshots of all of them to prove that this is true, rather than providing comments in text only. Still, it is clear that internet videos uploaded by random uploaders are too weak as evidence for encoding.

Search results for "172画 huang" (172-stroke huang) on Bilibili
Should we encode huang? The number of uploaders using huang is far greater than 20.


Comment #2798 says:
Evidence 1 has many strengths:
- it is a primary source of evidence
- it shows the character is used in running text
- it shows clearly the shape of the character
- it accurately gives the pronunciation and meaning of the character
The second evidence:
- confirms the shape of the character
- shows the pre existence of the character
- shows that multiple fonts contain the character (the font used for the video is not that shown in the computing article)

However, even evidence 1 itself is suspicious: how can we be sure that the information in it is correct?
The second piece of evidence is also too weak for encoding. Many errors arise in the process of making fonts for ideographs used in books. Since neither piece of evidence qualifies for encoding, these two pieces of evidence cannot be used to prove anything else.
Evidence
[ Unresolved from v1.0 ]
Sorry about the request. I thought that there were 20 people who use ⿰久闹. If 20 is the number of videos that the uploader has uploaded, then it cannot prove to any extent that ⿰久闹 is valid.
Anyway, it would be too ridiculous for me to believe that the vast majority of IRG experts will support encoding ⿰久闹 in the current situation.
Although I am not at all angry about the personal attack in Comment #2880, I still hope that there won't be any more.
Unclear evidence
[ Unresolved from v2.0 ]
This is the same kind of evidence as before, and the uploader is the same as for the existing evidence. More convincing evidence is needed.
Unclear evidence
[ Unresolved from v2.0 ]
The UK still has not been able to provide evidence from historical documents or other high-quality sources for this character. On the contrary, the UK has kept providing evidence of the character being used by one or two users on unstable media, and such evidence is not sufficient to support encoding it. We have made it very clear that evidence of this type is not sufficient to support the encoding of this character. However, the experts from the UK seem unable to understand what we mean: they still insist on providing evidence of the same type and of insufficient quality, and on saying things that are obviously inconsistent with the facts. This kind of behavior is of no benefit to the review work and cannot get these characters moved from the D-set back to the M-set.
At the last meeting, we already suggested that if the experts from the UK believe that this character is not a newly self-created character and is worth encoding, they can simply publish a paper that includes this character and vouch for these characters with their own reputation. Let me be more straightforward: it should not be difficult for the experts from the UK to publish a paper. If they are not willing to risk their own reputation, they should not insist on using evidence that obviously does not meet the requirements to request the encoding of this character in the IRG.

英国(UK)至今仍未能提供这个字的历史文献证据或其他高质量来源,相反,英国一直在提供由一两个使用者在不稳定的载体上使用该字的证据,这些证据并不足以支持对该字进行编码。我们已经说得很清楚,英国提供的这种类型的证据不足以支持编码该字,但英国的专家似乎无法理解我们表达的意思,仍在坚持提供相同类型、效力不足的证据,坚持说一些明显与事实不符的话。这种行为对审核工作没有任何益处,也无法让这些字从 D-set 返回 M-set。
在上次会议上,我们已经建议:如果英国专家认为这个字并非个人新造,且有编码的价值,完全可以自己发表一篇包含此字的论文,以自己的声誉给这些字作担保。让我把话说的再直接一点:发表一篇论文对英国专家来说应该不是什么难事,如果英国专家不愿意拿自己的声誉冒险,就不应该坚持在 IRG 以明显不符合要求的证据来要求对该字进行编码。
Unclear evidence
[ Unresolved from v2.0 ]
China thinks the evidence is still not sufficient and the character should be kept in the D-set until better evidence is provided.
00236
00236
冫 15.4.3
VN-F0046
TS 5 · IDS
Evidence
If it is the case as comment #11739 said, I think it's OK to encode it.
03456
03456
行 144.14.5
VN-F052A
TS 20 · IDS
Unclear evidence
This may be a one-off error; please verify the evidence.
越喃汉英四文对照新辞典
03621
03621
赤 155.12.1
VN-F056E
TS 19 · IDS
New evidence
越喃汉英四文对照新辞典,上海交通大学出版社,2023年,page400

Glyph Design & Normalization

Showing 5 comments.

Sn | Image/Source | Comment Type | Description
00060
00060
丿 4.7.2
GCW-00007
TS 8 · IDS 丿𠦆
Glyph design
[ Unresolved from v1.0 ]
The top of this ideograph should be the same as 甪. So the first stroke should be in contact with the bottom component, as the two pieces of evidence show.
01162
01162
尸 44.16.3
GCW-00101
TS 19 · IDS
Normalization
[ Unresolved from v1.0 ]
Suggest normalizing the glyph to ⿺尾童 instead of changing the IDS.
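For readers less familiar with IDS notation such as ⿺尾童, the sketch below shows how an Ideographic Description Sequence can be read as a small prefix-notation tree. It is only an illustration: the function name `parse_ids` is hypothetical, and it assumes only the twelve classic operators U+2FF0..U+2FFB, so operators added in later Unicode versions are not handled.

```python
# Ideographic Description Characters U+2FF0..U+2FFB and their arity.
# U+2FF2 and U+2FF3 take three operands; the remaining ten take two.
TERNARY = {"\u2ff2", "\u2ff3"}
BINARY = {chr(cp) for cp in range(0x2FF0, 0x2FFC)} - TERNARY


def parse_ids(ids: str):
    """Parse an IDS string into a nested tuple (operator, operand, ...)."""
    pos = 0

    def parse_node():
        nonlocal pos
        if pos >= len(ids):
            raise ValueError("IDS ended before all operands were supplied")
        ch = ids[pos]
        pos += 1
        if ch in BINARY:
            return (ch, parse_node(), parse_node())
        if ch in TERNARY:
            return (ch, parse_node(), parse_node(), parse_node())
        return ch  # a plain component character

    tree = parse_node()
    if pos != len(ids):
        raise ValueError("trailing characters after a complete IDS")
    return tree


# The IDS from the comment above: one operator with two components.
print(parse_ids("⿺尾童"))
```

Nested sequences work the same way, because each operator simply consumes the next two (or three) complete sub-sequences.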
03613
03613
贝 154′.14.2
GXM-00494
UTC-03291
TS 18 · IDS 𥈠
Glyph design
[ Unresolved from v1.0 ]
We'd like to keep the current glyph.
Mr. 朱永⿰贝睿 writes his name with the current glyph.

Source: https://www.mmcs.org.cn/kxjfc/kxjfc/zybr/bd/art/2023/art_310b238dedb6424298d5e31ac79134ae.html
What's more, 《康熙字典》 has 丿 as the third stroke of the 睿 component. Currently, this ideograph is mainly used in personal names, and people are more likely to use the glyph in 《康熙字典》.
Glyph design
[ Unresolved from v1.0 ]
We'd like to keep the current glyph. It agrees with the glyph used on Chinese ID cards. Personally, I recommend that UTC keep its current glyph, too.
01431
01431
心 61.2.3
GZ-0471101
TS 5 · IDS
Glyph design
[ Unresolved from v1.0 ]
Agree with Eiso.

Other

Showing 16 comments.

Sn | Image/Source | Comment Type | Description
03267
03267
艸 140.7.4
GCW-00202
TS 11 · IDS
Comment
[ Unresolved from v2.0 ]
Agree with Eiso and Lee.
03982
03982
门 169′.8.3
GCW-00239
TS 11 · IDS
Comment
[ Unresolved from v2.0 ]
To John's comment #10593:
Yes, very likely.
I have scanned the book again recently and the quality is better. Thank you for bringing the issue out.
04650
04650
黾 205′.5.3
GCW-00265
TS 13 · IDS
Comment
[ Unresolved from v2.0 ]
What's the difference between them?
00967
00967
土 32.18.5
GDM-00393
UK-30238
TS 21 · IDS
Comment
The difference is too great. I'd like to suggest that we handle the issue when related characters are proposed in the future.
03433
03433
虫 142.12.5
GDM-00470
TS 18 · IDS
Comment
[ Unresolved from v2.0 ]
The character is also used in the word 螺~, the same word as in the evidence provided.
00025
00025
一 1.12.3
页 181′.7.1
GDM-00485
TS 13 · IDS
Comment
[ Unresolved from v2.0 ]
Suggest no change.
04673
04673
龙 212′.6.2
GDM-00505
TS 11 · IDS
Comment
[ Unresolved from v2.0 ]
Suggest no change.
00261
00261
刀 18.5.5
SAT-09369
TS 7 · IDS
Comment
[ Unresolved from v1.0 ]
应劭 is a famous scholar of the Han Dynasty. So ⿰召刀 is absolutely a variant of 劭.
03315
03315
艸 140.11.3
TD-7E4D
TS 15 · IDS
Comment
𠎍 U+2038D
03378
03378
虍 141.7.3
UK-30184
TS 13 · IDS
Other
[ Unresolved from v1.0 ]
Evidence No. 3 is from the 道光 (1821-1850) 《潯州府志》; the glyph in it is ⿺虎戌.
Evidence No. 2 is from the 同治 (1862-1875) 《潯州府志》; the glyph in it is ⿺虎戊.
00061
00061
丿 4.10.5
UK-30621
TS 11 · IDS
Comment
[ Unresolved from v1.0 ]
Thanks for that. I will check my books these days, too.
Comment
[ Unresolved from v1.0 ]
As far as we are concerned, actual evidence is more credible than experts' experience. Experts' experience is very helpful when qualified evidence is provided, but it can also be harmful if it is relied on too heavily. Thus, although we have many excellent experts here, qualified evidence is still needed for this character (UK-30621), ⿰大老 (UK-30639) and ⿰丫要 (UK-30620).

If there are other submitted ideographs whose evidence comes only from online videos, we think that they should be postponed too if no more qualified evidence can be provided.
Comment
[ Unresolved from v2.0 ]
China, as a member body of WG2, disagreed with letting the character go back to the M-set because the current evidence is not sufficient. It is astonishing that this was so easily disregarded at the meeting on the last day.
It is also astonishing that the three characters were added back to the M-set even though the evidence shows that the characters (especially the other two, ⿰丫要 and ⿰大老) are used only by one or two ordinary people in unstable internet videos, and no one has seen the characters used in historical documents. Are these videos considered authoritative evidence by IRG? Is this the way to ensure the quality of the standard? I really cannot understand it.
I just want to say that if the three characters (i.e. ⿰久闹, ⿰丫要 and ⿰大老) are added back to the M-set based on the current evidence, our center will draft official documents to Guangxi University (广西大学) to verify the origin of ⿰久闹 in the font and whether its experts' comments were right. What's more, we will also draft official documents to the Ministry of Industry and Information Technology of the People's Republic of China (中华人民共和国工信部) describing the situation here.
01881
01881
艸 140.8.1
UTC-03277
TS 12 · IDS 𣏹
Comment
The 萩 should be an error form because 朱由⿱艹𣏹's brothers all have a 木 radical character in their names.
00103
00103
亠 8.60.3
心 61.58.4
UTC-03344
TS 62 · IDS
Comment
[ Unresolved from v1.0 ]
Suggest keeping the current glyph, which is the same as the one in Mr. 余云华's article.
00101
00101
亠 8.27.5
心 61.25.4
UTC-03347
TS 29 · IDS
Comment
[ Unresolved from v1.0 ]
According to IRGN2622, the proposed glyph can be unified to the glyph in the books. This means it is OK to normalize the glyph to the proposed glyph.

Data for Unihan

Showing 1 comment.

Sn | Image/Source | Comment Type | Description
03258
03258
艸 140.6.4
GCW-00201
TS 10 · IDS
Semantic variant
灌.
国家基础地理信息中心数据:

江西省会昌县地名志: