
Chinese News Corpus

Inventor
Wei-Yun Ma, Keh-Jiann Chen

Docket Number: 05T-1070910

Inventor: Wei-Yun Ma, Keh-Jiann Chen

IP Status: Trade Secret

 

Abstract:

The Chinese News Corpus is developed and maintained by the CKIP group at Academia Sinica. The corpus contains news texts and magazine documents from the period 1990-1991, totaling 14 million words. Over the years, the project has been funded at various stages by the CCK Foundation for International Scholarly Exchange, the National Science Council of the R.O.C., and Academia Sinica.

 

Fields of Application:

1. Information Retrieval
2. Lexicon Construction
3. Language Analysis
4. Language Understanding
5. Information Extraction
6. Media Comparison

 

Advantages when compared to the existing technologies:

The Chinese News Corpus is large, covering an entire year of news texts and magazine documents drawn from different media sources, and provides a wealth of material for Chinese language processing.

 

Contact Person: Ming-Chieh Chen / 886-2-2787-2509 / mingchieh@gate.sinica.edu.tw