Abstract. This paper reports on our research to build a large-scale Tsinghua Chinese Treebank (TCT). We propose a two-stage approach to reduce manual proofreading labor …
Free corpora available in China (every link is usable unless it is marked as unavailable)
Treebank Conversion based Self-training Strategy for Parsing
Exploiting a Chinese-English bilingual wordlist for English-Chinese cross language information retrieval. In: Fifth International Workshop on Information Retrieval with Asian Languages, IRAL-2000. Hong Kong, September 30 to October 1, 2000.
On the standard Chinese Treebank (CTB5) dataset, this toolkit reaches an F1 of 97.3% for word segmentation and an F1 of 92.9 for part-of-speech tagging ... THULAC: An Efficient Lexical Analyzer for Chinese. 2016.
A Study on Automatic Recognition of Chinese Sentence Pairs
Language resources are very important for natural language processing research and applications. This paper will introduce our ongoing research work to build a situation-based language knowledge base for the Chinese language, based on two basic language resources: three Chinese semantic lexicons and a large-scale Chinese treebank.
Tsinghua Chinese Treebank.
Penn Chinese Treebank (CTB) (Xue et al. 2005), Tsinghua Chinese Treebank (TCT) (Zhou 1996), and Peking Chinese Treebank. To obtain more training data for building syntactic parsers, it is desirable to use all the data together. However, such treebanks cannot be simply combined because they follow different annotation